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Preface 


This is the third edition of the textbook. We decided to rethink the 
size and content completely when planning this edition. Our sense 
was that a larger and larger archive of accumulated knowledge is no 
longer feasible or desirable in the digital age. We wanted to produce 
a single volume with a more defined point of view, that better reflects 
the challenge of the future. 

Psychiatry is a medical specialty. Medicine took its origins in simple 
observation and classification and the serendipitous discovery of pal- 
liative treatments. The application of science has transformed much of 
medicine by providing an understanding of mechanisms of pathology. 
The scientific method provides the only way to reliable knowledge, and 
medical science is slowly developing rational treatments that are po- 
tentially curative. However, aside from treatable infections, we have a 
long way to go. The trajectory of medical advance in the practice of 
psychiatry has been slower than for other disease areas in recent years, 
but neuroscience is difficult. What underwrites our confidence in what 
is sometimes disparagingly described as the medical model is the fact 
that psychiatric disorders, especially severe disorders, have a genetic 
basis. Genetic risks are largely unidirectional and they are biological. 
‘They guarantee some kind of future biological explanation for the phe- 
nomena they describe. So if you decide that schizophrenia is a myth, a 
social construct, or a plot by psychiatrists to enhance their social status, 
you have to explain why its inheritance is what it is. 

If, like us, you find the genetic data compelling, then you accept 
the grand challenge of working out the neurobiology of psychiatric 
disorder. We cannot know how quickly it will translate into im- 
proved treatments, but we think there are already promising devel- 
opments from molecular biology and neuroimaging. Imaging has 
been particularly important because it has stimulated the develop- 
ment of a completely brain-based cognitive neuroscience. This is a 
major intellectual shift. Forty years ago, an experimental psycholo- 
gist would have said that the brain was unimportant for the study of 
mental mechanisms and even less important for the development of 
psychological treatments. As this view changes, so the advances of 
neuroscience can be translated into patient benefit as scientifically 
guided psychotherapy. 

Our authors are drawn from all over the world, and they illustrate 
the simple truth that science is universal. We thank them most sin- 
cerely for their efforts in bringing the project to completion. 


The layout of the book 


The content of chapters was not highly pre-specified, and the chap- 
ters themselves have not been edited for conformity with the editors’ 


views. They can be read as free-standing contributions. Accordingly, 
there is both overlap and divergence in how topics are covered, 
which will reflect the writers’ priorities and interests. 

In the previous edition of the textbook, the editors identified con- 
vergence as an important theme of the book. We are not convinced 
further convergence has occurred since 2001. Instead we have seen 
a surprising amount of divergence in the claims made about psych- 
iatry. Our section on approaches to psychiatry reflects some of the 
key issues relating to the patients perspective, stigma, the global 
challenge of mental disorder, practical ethics, and the foundations 
of psychiatry as phenomenology and a medical discipline. It further 
sets the scene for current controversies around diagnosis, psycho- 
pathology, evidence, and drug terminology. 

The chapters in the section on the scientific basis of psychiatric 
aetiology and treatment provide simple introductions to the relevant 
disciplines that underpin our scientific understanding. 

Individual disorders are covered in sections that follow the struc- 
ture of the Diagnostic and Statistical Manual of Mental Disorders, 
fifth edition (DSM-5). DSM-5 was published in 2013. It had been 
envisaged that it would be possible to make major changes to the 
approach of DSM-IV. Thus, major advances in genetics, imaging, 
and neurobiology were widely expected to transform psychiatry, fol- 
lowing the success of the human genome project and the decade of 
the brain. This transformation has not yet happened. Hence, DSM-5 
(and the International Classification of Diseases, eleventh revision) 
follow a much more conventional, clinically led summary of how 
patients present with psychiatric disorder. We see no reason to deny 
the utility of symptom-based diagnoses and the consensus that cre- 
ated the current categories. However, the project of applying neuro- 
science to psychiatry has not failed, as has sometimes been implied 
by criticism of DSM-5. For these reasons, we have included chapters 
on genetics, neurobiological targets, and imaging in the sections of 
the book focused on specific disorders. 

We have also included sections on service provision and forensic 
psychiatry because these are critical to the context in which psychi- 
atric disorder is managed. 

We thank the staff of OUP for their support and encouragement 
and Andy Richford who has been our project manager sans pareil. 


John R. Geddes 
Nancy C. Andreasen 
Guy M. Goodwin 


Professor Michael Gelder (1929-2018) 


Michael Gelder, one of the founding editors of the New Oxford Textbook of Psychiatry, 
sadly died in 2018. We dedicate this new edition of the book to Michaels memory. 


di: Michael was the first WA Handley Professor of Psychiatry at the 

4 University of Oxford and founded the Department of Psychiatry in 
1969. He led the Department for 27 years until he retired in 1996. 
Before arriving in Oxford, at the Institute of Psychiatry, Michael 
developed a treatment for anxiety based on desensitization, in which 
gradual exposure to the feared stimulus was coupled with physical 
relaxation. He described the first controlled trial of this psychological 
therapy in patients with severe agoraphobia in his seminal 1966 
publication with Isaac Marks. 

Michael possessed remarkable organizational abilities and lead- 
ership skills and he built a thriving Department of Psychiatry in 
Oxford with a particular focus on developing both psychological and physical treatments. This 
departmental focus continues into the present. When JRG interviewed him in 2018, very shortly 
before he died, Michael admitted to being particularly proud of the Departmentť’s development 
of cognitive behaviour therapy (CBT) under his leadership. These treatments include highly 
effective forms of CBT for anxiety disorders, post-traumatic stress disorder, chronic fatigue 
syndrome, and eating disorders. All have been widely adopted in clinical practice and have 
benefited enormous numbers of people worldwide. Michael also developed a psychopharma- 
cology research unit based on powerful cross-departmental collaboration within the University. 
The unit has a strong track record of investigating the mechanisms of action of antidepressants 
and anxiolytics and its work has fundamentally shaped our understanding of the biology 
underlying psychiatric disorder. 

Michael was also a committed and inspirational teacher and the driving force behind a suc- 
cessful series of psychiatry textbooks. The first, in 1983, was the Oxford Textbook of Psychiatry 
(now in its seventh edition). Translated into six languages, this became the standard textbook for 
psychiatric trainees. Then came the Concise Textbook (aimed at medical students and now in its 
fifth edition) and the current New Oxford Textbook of Psychiatry (targeted at postgraduates—this 
is the third edition). 

As he passed the age of 80, Michael had finally retired from editing textbooks (although he was 
delighted that his colleagues continue to revise them!), but he closely followed the development 
of the Department. He will be greatly missed. Michael was a truly remarkable clinical academic, 
inspirational in his ability to combine research with clinical practice, teaching, and leadership. 
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avoidant restrictive food intake disorder 
autosomal recessive forms of intellectual disability 
at-risk mental state 

aripiprazole 

anxiety sensitivity 

Adult Social Care Outcome Toolkit 

autism spectrum disorder; acute stress disorder 
Anxiety Sensitivity Index 

acid-sensing ion channel 

arterial spin labelling 

asenapine 

antisocial personality disorder 

antisocial personality disorder 

advanced sleep phase syndrome 

advanced sleep-wake phase disorder 
Anatomical Therapeutic Chemical 

activating transcription factor 6 





Abbreviations 


ATL anterior temporal lobe 

ATP adenosine triphosphate 

ATPD acute and transient psychotic disorder 

ATX atomoxetine 

AUC area under the curve 

AUD alcohol use disorder 

AUDADIS-IV Alcohol Use Disorder and Associated 
Disabilities Interview Schedule-IV 

AVP vasopressin 

AvPD avoidant personality disorder 

AVPD avoidant personality disorder 

BBB blood-brain barrier 

BBV blood-borne virus 

BD bipolar disorder 

BDD body dysmorphic disorder 

BDI Beck Depression Inventory 

BDNF brain-derived neurotrophic factor 

BDSM bondage, dominance and submission, sadism, and 
masochism 

BED binge eating disorder 

BET brief eclectic therapy 

BF basal forebrain 

BI behavioural inhibition 

BIA budget impact analysis 

BIBD basophilic inclusion body disease 

BIPS Brief Intermittent Psychotic Symptoms 

BIT behavioural intervention team 

BLA basolateral amygdala 

BLIPS Brief Limited Intermittent Psychotic Symptoms 

BMI body mass index 

BMP bone morphogenetic protein 

BN bulimia nervosa 

BNM biophysical network model 

B/NRT bupropion/nicotine replacement therapy 

BOLD blood oxygen level-dependent 

BOTMP Bruininks—Oseretsky Test of Motor Proficiency 

BP blood pressure 

BPD borderline personality disorder; bipolar disorder 

BPI bipolar disorder type I 

BPI bipolar disorder type II 

bpm beats per minute 

BPRS Brief Psychiatric Rating Scale 

BPSD behavioural and psychological symptoms of 
dementia 

BS basic symptoms 

BSE bovine spongiform encephalopathy 

BTSAS Behavioural Treatment for Substance Abuse in 
Severe and Persistent Mental Illness 

bvFTD behavioural-variant fronto-temporal dementia 

BWLT behavioural weight loss therapy 

BZD benzodiazepine 

C&A children and adolescents 

Ca” calcium 

CAA cerebral amyloid angiopathy 

CAARMS Comprehensive Assessment of At-Risk Mental State 

CAC Clinical Assessment of Confusion 

CAD coronary artery disease 


CADASIL 


CAM 


CAMCOG(R) 


CAMHS 
cAMP 
CAPA 
CAPP 


CART 
CAS9 

CAT 
CATCH-IT 


CATIE 


CBA 
CBC 
CBCL 
CBCM 
CBD 
CBF 
CBG 
CBG-GSH 
CBI 
CBIT 
CBO 
CBS 
CBT 
CBT-E 
CBTi 
CBT-PD 


CC 
CCA 
CCK 
CCK-4 
CCL 
CCM 
CD 
CDC 
CDDG 


CEA 
CEST 
CET 
CETA 
CFIR 


CFS 
CGAS 
CGE 
CGI-I 
cGMP 
CGMV 
CH 


cerebral autosomal dominant arteriopathy with 
subcortical infarcts and leukoencephalopathy 
Confusion Assessment Method 

Cambridge Cognitive Assessment (Revised) 
child and adolescent mental health services 
cyclic adenosine monophosphate 
Child and Adolescent Psychiatric Assessment 
Comprehensive Assessment of Psychopathic 
Personality Disorder 
cocaine- and amphetamine-related transcript 
CRISPR-associated protein 9 
cognitive analytic therapy 
Competent Adulthood Transition with Cognitive 
Behavioural and Interpersonal Training 
Clinical Antipsychotic Trial of Intervention 
Effectiveness (study) 
cost-benefit analysis 
complete blood count 
Child Behavior Problems Checklist 
cognitive behavioural case management 
cortico-basal degeneration; cannabidiol 
cerebral blood flow 
cortico-basal ganglia 
guided self-help cognitive behavioural therapy 
classroom-based intervention 
Comprehensive Behavioral Intervention for Tics 
community-based organization 
cortico-basal syndrome 
cognitive behavioural therapy 
enhanced cognitive behavioural treatment 
cognitive behavioural therapy for insomnia 
cognitive behavioural therapy for personality 
disorder 
collaborative care 
cost-consequences analysis 
cholecystokinin 
cholecystokinin tetrapeptide 
conventional consultation liaison 
collaborative care management 
coeliac disease 
Center for Disease Control and Prevention 
Clinical Descriptions and Diagnostic Guidelines 


(from ICD-10 Classification of Mental and Behavioral 


Disorders) 

cost-effectiveness analysis 

chemical exchange saturation transfer 
cue-exposure treatment 

Common Elements Treatment Approach 
Consolidated Framework for Implementation 
Research 

chronic fatigue syndrome 

Child Global Assessment Scale 

caudal ganglionic eminence 

Clinical Global Impression of Improvement 
cyclic guanosine monophosphate 

cortical grey matter volume 

congenital hypothyroidism 


CHARGE Cohorts for Heart and Aging Research in Genomic 
Epidemiology 

CHAT Comprehensive Health Assessment Tool 

CHMP Committee for Medicinal Products for Human Use 

CHMP2B charged multivesicular body protein 2b 

CHOICE CHOosing Interventions that are Cost-Effective 
(project) 

CHOP Children’s Hospital of Philadelphia 

CHR clinical high-risk 

CI confidence interval 

CIDI Composite International Diagnostic Instrument 

CIR Clutter Image Rating 

CJD Creutzfeldt—Jakob disease 

CLiPS Collaborative Longitudinal Personality 
Disorders Study 

CLP consultation-liaison psychiatry 

CLPDS Collaborative Longitudinal Personality 
Disorders Study 

CLPS Collaborative Longitudinal Personality Study 

cm centimetre 

CM contingency management; crisis management 

CMA chromosomal microarray analysis; chaperone- 
mediated autophagy; cost-minimization analysis 

CMAT Changes to the Matrix Council 

CMD common mental disorder 

CMHD common mental health disorder 

CMHT community mental health team 

CMP comprehensive metabolic panel 

CMS-R Comorbidity Survey-Replication 

CNGC cyclic nucleotide-gated channel 

CNS central nervous system 

CNV copy number variant 

COG centre of gravity 

COGA Collaborative Studies on Genetics of Alcoholism 

COGEND Collaborative Genetic Study of Nicotine Dependence 

COMT catechol-O-methyltransferase 

CONSORT Consolidated Standards of Reporting Trial 


CONVERGE China, Oxford, and Virginia Commonwealth 
University Experimental Research on Genetic 
Epidemiology 


COPC chronic overlapping pain condition 

C9ORF72 chromosome 9 open reading frame 72 

CoSA Circles of Support and Accountability 

COX-2 cyclo-oxygenase-2 

CP choroid plexus 

CPA Care Programme Approach 

CPES Collaborative Psychiatric Epidemiological Studies 
CPR Civil Procedure Rules 

CPT cognitive processing therapy 

Cr creatine 

CR cognitive rehabilitation; conditioned response 
CRA community reinforcement approach 

CREB cAMP response element binding protein 

CRF corticotropin-releasing factor 

CRF1 corticotropin-releasing factor 1 

CRH corticotropin-releasing hormone 

CR/HT crisis resolution/home treatment 


CRISPR 


CRN 
CRP 
CrPR 
CRSWD 
CS 

CSA 
CSB 
CSF 

CSS 
CSTC 
CT 

CTD 
CTE 
CTO 
CU 
CUA 
CUtLASS 


CVD 
CVO 
CWMV 
CY-BOCS 
DA 
dACC 
DACCP 
DAG 
DAGK 
DALY 
DAMP 
DAPP 
DARI 
DAT 
DAWS 
DBH 
DBS 
DBT 
DCD 
DCM 
DCR 


DCS 
DD 
DDA 
DDP 
DEX 
DFC 
2-DG 
DHA 
DHPG 
DIAN 
DIRT 
DIRUM 


DIS 
DISC 
DISC1 


Abbreviations Si 





clustered regularly interspaced short 
palindromic repeat 

correct related negativity 

C-reactive protein 

Criminal Procedure Rules 

circadian rhythm sleep-wake disorder 
conditioned stimulus; compulsive shopping 
child sexual abuse 

compulsive sexual behaviour 

cerebrospinal fluid 

chromosomal substitution strain 
cortico-striato-thalamo-cortical 

computed tomography 

chronic tic disorder 

chronic traumatic encephalopathy 
community treatment order 
callous-unemotional 

cost-utility analysis 

Cost Utility of the Latest Antipsychotic drugs in 
Schizophrenia Study 

cardiovascular disease 

circumventricular organ 

cerebral white matter volume 

Children’s Yale-Brown Obsessive Compulsive Scale 
dopamine 

dorsal anterior cingulate cortex 

Dundee ADHD Clinical Care Pathway 
diacylglycerol 

diacylglycerol kinase 

disability-adjusted life year 
damage-associated molecular pattern 
Differential Assessment of Personality Pathology 
dopamine reuptake inhibitor 

dopamine; dopamine transporter 
dopamine agonist withdrawal syndrome 
dopamine-beta-hydroxylase 

deep brain stimulation 

dialectical behaviour therapy 
developmental co-ordination disorder 
dynamic causal model 

Diagnostic Criteria for Research (from ICD-10 
Classification of Mental and Behavioral Disorders) 
d-cycloserine 

delay discounting 

direct detection assay 

dynamic deconstructive psychotherapy 
dextroamphetamine 

dorsolateral prefrontal cortex 
2-deoxyglucose 

docosahexaenoic acid 
dihydroxyphenylethylene glycol 
Dominantly Inherited Alzheimer Network 
Danger ideation reduction therapy 
Database of Instruments for Resource Use 
Measurement 

Diagnostic Interview Schedule 

Diagnostic Interview Schedule for Children 
Disrupted in Schizophrenia 1 





Abbreviations 


DLB 
DLMO 
dIPFC 
DLPFC 
DM 
DMH 
DM-ID 
DMN 
DMT 
DNA 
DNIC 
DOMINO 


DOMS 
DOR 
DOSS 
DPD 
DPMS 
DR 
DRD4 
DRG 
DRN 
DRPLA 
DRS-R-98 
DS 
DSED 
DSM 
DSM-III 


DSM-III-R 
DSM-IV-TR 
DSM-5 


DST 


DSWPD 
DTC 
DTI 
DTS 
DUB 
DUD 
DUI 


DUP 
DURG 
DVA 
DWI 
DXA 
DY-BOCS 


DZ 
EAGG 
EAS 
EC 
ECA 
ECG 
ECNP 
ECT 


dementia with Lewy bodies 

dim light melatonin onset 

dorsolateral prefrontal cortex 

dorsolateral prefrontal cortex 

diabetes mellitus 

dorsomedial nucleus of the hypothalamus 
Diagnostic Manual-Intellectual Disabilities 
default mode network 

dimethyltryptamine 

deoxyribonucleic acid 

diffuse noxious inhibitory control 

Donepezil and Memantine in Moderate to Severe 
Alzheimer's Disease (study) 

delayed onset of muscular soreness 

delta opioid receptor 

Delirium Observation Screening Scale 
dependent personality disorder 

descending pain modulatory system 

dorsal raphe 

dopamine receptor type 4 

diagnosis-related group 

dorsal raphe nuclei 
dentatorubropallidoluysian atrophy 

Delirium Rating Scale-Revised-98 

dorsal striatum 

disinhibited social engagement disorder 
Diagnostic and Statistical Manual of Mental Disorders 
Third revision of the Diagnostic and Statistical 
Manual of Mental Disorders 

DSM-III-Revised 

DSM-IV “Text Revision’ 

5th edition of the Diagnostic and Statistical Manual 
of Mental Disorders 

daylight saving times; dexamethasone 
suppression test 

delayed sleep-wake phase disorder 
democratic therapeutic community 

diffusion tensor imaging 

diffusion tensor spectroscopy 
deubiquitinating enzyme 

drug use disorders 

daytime urinary incontinence; duration of untreated 
illness 

duration of untreated psychosis 

Drug Utilisation Research Group 

domestic violence and abuse 
diffusion-weighted imaging 

dual-energy X-ray absorptiometry 
Dimensional Yale-Brown Obsessive 
Compulsive Scale 

dizygotic 

European ADHD Guideline Group 
euthanasia or assisted suicide 

enhanced care 

Epidemiologic Catchment Area (study) 
electrocardiography 

European College of Neuropsychopharmacology 
electroconvulsive therapy 


ED 


EDNOS 
EDSP 


EEG 
EFFEKTE-E 


EGF 
EHS 
EI 
EMA 


EMDR 
EMG 
ENCODE 
ENIGMA 


EOG 
EOS 
EP 
EPA 
EPAD 


EPDS 
EPI 
ePREP 


EPS 
EPSE 
ER 
ERF 
ERK 
ERN 
ERP 


ES 

ESDM 
ESR 

ESS 
ESSENCE 


EU 
EUFEST 
EULAR 
EUnetHTA 


FA 
FACT 
fAD 
FASD 
fcMRI 
FDA 
FDG 
FDOPA 
FEP 
FFI 
FFM 


elimination disorder; emergency department; eating 
disorder; erectile disorder 

eating disorder not otherwise specified 

Early Developmental Stages of Psychopathology 
(study) 

electroencephalogram 

Entwicklungsférderung in Familien: Eltern- und 
Kinder-Training in emotional belasteten Familien 
epidermal growth factor 

essential hypersomnia syndrome 

early intervention 

European Medicines Agency; ecological momentary 
assessment 

eye movement desensitization and reprocessing 
electromyography 

Encyclopedia of DNA Elements 

Enhancing NeuroImaging Genetics through Meta- 
Analysis (Consortium) 

electro-oculography 

endogenous opioid system 

explaining pain 

eicosapentanoic acid 

European Prevention of Alzheimer’s Dementia 
Consortium 

Edinburgh Postnatal Depression scale 

echo planar imaging 

Prevention and Relationship Enhancement 
Programme 

extra-pyramidal side effect 

extra-pyramidal side effect 

endoplasmic reticulum 

event-related field 

extracellular regulated kinase 

error-related negativity 

event-related potential; exposure and response 
prevention 

effect size 

Early Start Denver Model 

erythrocyte sedimentation rate 

Epworth Sleepiness Scale 

Early Symptomatic Syndromes Eliciting 
Neurodevelopmental Clinical Examination 
European Union 

European First Episode Schizophrenia Trial 
European League Against Rheumatism 
European Network for Health Technology 
Assessment 

fractional anisotropy 

functional assertive community treatment 
familial Alzheimer’s disease 

fetal alcohol spectrum disorders 

functional connectivity magnetic resonance imaging 
US Food and Drug Administration 
fluorodeoxyglucose 

18F-fluorodopa 

first-episode psychosis 

fatal familial insomnia 

five-factor model of personality 


FFT 
FGA 
FGCB 
FGF 

FI 
FINGER 


FLAIR 


FL-APM 
FM 


FTI 

FTLD 
FTLD-ni 
FTLD-tau 


FTLD-UPS 


family-focused therapy; functional family therapy 
first-generation antipsychotic 

Family Group Cognitive-Behavioural 
fibroblast growth factor 

faecal incontinence 

Finnish Geriatric Intervention Study to Prevent 
Cognitive Impairment and Disability (study) 
fluid-attenuated inversion recovery 

first-line dopamine antagonist medication 
fibromyalgia 

functional magnetic resonance imaging 
fragile X mental retardation protein 
6-18F-fluoro-l-m-tyrosine 

functional neurological symptom disorder 
Families OverComing Under Stress 
frontal-parietal network 

Family Procedure Rules 

Family Study of Cocaine Dependence 
female sexual interest/arousal disorder 
femtotesla 

fronto-temporal dementia 

International Behavior-variant FTD Criteria 
Consortium 

full-time equivalent 

family therapeutic intervention 
fronto-temporal lobar degeneration 
fronto-temporal lobar degeneration without 
inclusions 

fronto-temporal lobar degeneration with tau- 
positive inclusions 

fronto-temporal lobar degeneration with 
immunohistochemistry against proteins of the 
ubiquitin proteosomal system 

fused in sarcoma (protein) 

fragile X syndrome 

gram; effect size 

Gamblers Anonymous 

gamma aminobutyric acid 

generalized anxiety disorder 

Global Assessment of Functioning (scale) 
General Assessment of Personality Disorder 
Global Attentiveness Rating 
glucocerebrosidase 

Global Burden of Disease (studies) 

gamma butyrolactone 

Genetic Consortium for Anorexia Nervosa 
B-glucocerebrosidase 1 

gas chromatography—mass spectrometry 
Glasgow Coma Scale 

gender-confirming treatment 

genome-wide complex trait analysis 

gender dysphoria; gambling disorder 

glial cell-derived neurotrophic factor 
guanosine diphosphate; gross domestic product 
graded exercise therapy 

germ-free 

gamma hydroxybutyrate 

growth-hormone releasing factor 


GI 

GID 

GIDC 
GIDYQ-AA 


GIP 

GJ 
GLM 
GM 
GMV 
GnIH 
GnRH 
GnRHa 
GO 
GORD 
GPCR 
GPPPD 
GR 
GRADE 


GRDS 
GRE 
GREML 


GRK 
GRML 


GRN 
GRS 
GSK-3ß 
GSS 

GTP 
GWA 
GWAS 
GWES 
HAI 
HAROLD 


HbA1c 
HBV 
HCR-20 
HCV 
HD 
HDAC 
HD-D 
HDE 
HDL 
HDRS 
HF 

HFS 
5-HIAA 
HIC 
HiTOP 
HIV 
HKD 
HLA 
HoNOS 
HOT 


Abbreviations 


gyrification index; gender incongruence 
gender identity disorder 

gender identity disorder of childhood 
Gender Identity/Gender Dysphoria Questionnaire 
for Adolescents and Adults 

G protein-coupled receptor-interacting protein 
gap junction 

general linear model 

grey matter 

grey matter volume 
gonadotrophin-inhibitory hormone 
gonadotrophin-releasing hormone 
gonadotrophin-releasing hormone analogue 
Gene Ontology 

gastro-oesophageal reflux disease 

G protein-coupled receptor 

genito-pelvic pain/penetration disorder 
glucocorticoid receptor 

Grading of Recommendations, Assessment, 
Development, and Evaluations 

genetic risk and deterioration syndrome 
gradient echo 

genomic-relatedness-matrix restricted maximum 
likelihood 

G protein-coupled receptor kinase 

genomic relationship-matrix restricted maximum 
likelihood 

granulin 

genetic risk scoring 

glycogen synthase kinase-3ß 
Gerstmann-Sträussler syndrome 

guanosine triphosphate 

genome-wide association 

genome-wide association studies 
genome-wide exome sequencing 

health care-associated infection 
Hemispheric Asymmetry Reduction in Old Adults 
(model) 

glycated haemoglobin 

hepatitis B virus 

Historical, Clinical Risk Management-20 
hepatitis C virus 

Huntington’s disease; hoarding disorder 
histone deacetylation 

Hoarding Disorder Dimensional Scale 
humanitarian device exemption 
high-density lipoprotein 

Hamilton Depression Rating Scale 

high frequency 

high-frequency stimulation 
5-hydroxyindoleacetic acid 

high-income country 

Hierarchical Taxonomy of Psychopathology 
human immunodeficiency virus 
hyperkinetic disorder 

human leucocyte antigen 

Health of the Nation Outcome Scales 
hyperbaric oxygen therapy 








Abbreviations 


IADL 
IAPT 
IBD 
IBMPFD 


IBS 
ICA 
ICCS 
ICD 
ICD 
ICD-10 


ICD-11 


ICECAP 
ICER 
ICF 


ICOCS 
ICSD-3 


ICU 
ID 
IDD 
IDO 
IED 
IFC 
IFG 
IFN 
IGF 
IGF-1 
IgG 
IHSC 
IL 
IL-2 
IL-6 


hypothalamus-pituitary-adrenal 

histrionic personality disorder 
high-performance liquid chromatography 
hallucinogen persisting perceptual disorder 
human protein reference database 

heart rate; hazard ratio 

high risk index 

health-related quality of life 

Hoarding Rating Scale-Interview 

Hoarding Rating Scale-Self Report 

habit reversal training; hormone replacement 
therapy 

heat shock protein 90 

5-hydroxytryptamine 

health technology appraisal; health technology 
assessment 

Health Technology Assessment international 
huntingtin 

homovanillic acid 

health year equivalent 

hertz 

instrumental activity of daily living 

Improving Access to Psychological Therapies 
inflammatory bowel disease 

inclusion body myopathy with Paget’s disease of 
bone and fronto-temporal dementia 

irritable bowel syndrome 

independent component analysis 
International Children’s Continence Society 
impulse-control disorder 

International Classification of Diseases 
International Classification of Diseases, tenth 
revision 

International Classification of Diseases, eleventh 
revision 

ICEpop CAPability 

incremental cost-effectiveness ratio 
International Classification of Functioning and 
Disability 

International College of Obsessive-Compulsive 
Spectrum Disorders 

International Classification of Sleep Disorder, third 
edition 

intensive care unit 

intellectual disabilities; insomnia disorder 
intellectual developmental disorder 
indoleamine 2,3-dioxygenase 

intermittent explosive disorder 

inferior frontal cortex 

inferior frontal gyrus 

interferon 

insulin-like growth factor 

insulin-like growth factor 1 

immunoglobulin G 

interhemispheric spectral coherence 
interleukin 

interleukin 2 

interleukin 6 


IM 
ImPACT 


IMPase 
IMPC 
INAHTA 


INN 
iNOS 
IOCDF-GC 


IOM 
IP, 
IPDE 
IPL 
iPSC 
IPSRT 
IPT 
IPV 
IQ 

IR 
IRE1 
IRGC 
IRLSS 
IRT 


ISBD 
ISC 

ISoS 
isvz 
ISWRD 
ITP 
IUPHAR 


IVF 
JASPER 


intramuscular 

Immediate Post-Concussion Assessment and 
Cognitive Testing 
inositol-1-monophosphatase 

International Mouse Phenotyping Consortium 
International Network of Agencies for Health 
Technology Assessment 

international non-proprietary name 

inducible nitric oxide synthase 

International OCD Foundation Genetics 
Collaborative 

Institute of Medicine 

inositol 1,4,5-triphosphate 

International Personality Disorders Examination 
inferior parietal lobe 

induced pluripotent stem cell 

interpersonal and social rhythm therapy 
interpersonal psychotherapy 

intimate partner violence 

intelligence quotient 

immediate release; insulin resistance 
inositol-requiring enzyme 1 

intermediate radial glia cell 

International Restless Legs Syndrome Study Group 
item response theory; individual resilience training; 
imagery relief therapy 

International Society for Bipolar Disorders 
International Schizophrenia Consortium 
International Study of Schizophrenia 

inner subventricular zone 

irregular sleep-wake rhythm disorder 

inferior thalamic peduncle 

International Union of Basic and Clinical 
Pharmacology 

in vitro fertilization 

Joint Attention, Symbolic Play, Engagement and 
Regulation 

kelvin 

potassium 

kilobase 

kilodalton 

Kyoto Encyclopaedia of Genes and Genomes 
Keeping Families Strong 

kilogram 

knockout 

kappa opioid receptor 

Schedule for Affective Disorders and Schizophrenia 
for School Age Children 

litre 

long-acting injected 

Lewy body 

Lewy body dementia 

locus caeruleus 

linkage disequilibrium; learning disability 
light/dark 

low-density lipoprotein 

levodopa 

lisdexamfetamine 


LF 

LFP 
LGD 
LGE 

IGI 

LH 
LHA 
LHb 
LHRH 
LMIC 
IncRNA 
LOC 
LOD 
LoF 
LOS 
LPFS 
LPS 
LSD 
LTC 
LTD 
LTG 
LTP 
LUR 
LUTS 
MABC 
MADRS 
MAM 
MANTRA 


MAO 
MAOA 
MAOA-H 
MAOA-L 
MAOI 
MAP 


MAPK 
MAPS 


MAPT 
MARAC 
MAYSI-2 


MBCT 
MBP 
MBSR 
MBT 
MBT-A 
MBU 
MCA 
MCC 
MCDA 
MC4R 
MCH 
MCI 
MCMI-II 
MCTQ 
MD 


low frequency 

local field potential 

likely gene disrupting 

lateral ganglionic eminence 

local gyrification index 

lateral hypothalamic 

lateral hypothalamus 

lateral habenula 

luteinizing hormone-releasing hormone 
low- and middle-income country 

long non-coding ribonucleic acid 

loss of consciousness 

logarithm of the odds 

loss of function 

length of stay 

Level of Personality Functioning Scale 
lipopolysaccharide 

lysergic acid diethylamide 

long-term care 

long-term depression 

lamotrigine 

long-term potentiation 

lurasidone 

lower urinary tract symptoms 

Movement Assessment Battery for Children 
Montgomery-Asberg Depression Rating Scale 
mitochondria-associated membrane 
Maudsley Model of Anorexia Nervosa Treatment 
for Adults 

monoamine oxidase 

monoamine oxidase A 

monoamine oxidase-high (allele) 
monoamine oxidase-low (allele) 
monoamine oxidase inhibitor 
mitogen-activated protein; microtubule-associated 
protein 

mitogen-activated protein kinase 
Multidisciplinary Association for Psychedelic Studies 
(project) 

microtubule-associated protein tau 
multi-agency risk assessment conference 
Massachusetts Youth Screening 
Instrument-Version 2 

mindfulness-based cognitive therapy 
myelin basic protein 

mindfulness-based stress reduction 
mentalization-based treatment 
mentalization-based treatment for adolescents 
mother and baby unit 

middle cerebral artery 

mid cingulate cortex 

multi-criteria decision analysis 
melanocortin-4 receptor 
melatonin-concentrating hormone 

mild cognitive impairment 

Millon Clinical Multiaxial Inventory-[I 
Munich ChronoType Questionnaire 

mean diffusivity 


MDA 
MDAS 
MDD 
MDI 
MDMA 
MDMA-AP 
MDT 
ME 
M/EEG 
MEG 
MET 
MFB 
MFC 
MFG 
MGB 
MGD 
MGE 
mGluR 
MGMH 
MHC 
mhGAP 
MHIN 
MHP 
MHPG 
MHRA 


MHS 
MI 
MIBG 


Abbreviations 





methylenedioxyamphetamine 
Memorial Delirium Assessment Scale 
major depressive disorder 
manic-depressive illness 
3,4-methylenedioxymethamphetamine 
MDM<A~-assisted psychotherapy 

mode deactivation therapy 

myalgic encephalomyelitis 

MEG and EEG 
magnetoencephalography 
motivational enhancement therapy 
medial forebrain bundle 

medial frontal cortical (regions) 
medial frontal gyrus 
microbiota—gut-brain (axis) 

Mouse Genome Database 

medial ganglionic eminence 
metabotropic glutamatergic receptor 
Movement for Global Mental Health 
major histocompatibility complex 
Mental Health Gap Action Programme 
Mental Health Innovation Network 
mental health professional 
3-methoxy-4-hydroxyphenylglycol 
Medicines and Healthcare products 
Regulatory Agency 

mental health services 

motivational interviewing 

13] metaiodobenzylguanidine 
monetary incentive delay 
myo-inositol-3-phosphate synthase 
microribonucleic acid 

millimetre 

mismatch negativity 

Minnesota Multiphasic Personality Inventory 
Mini-Mental State Examination 

motor neuron disease; Malingered Neurocognitive 
Dysfunction 

mechanism of action 

Montreal Cognitive Assessment 
medial orbitofrontal cortex 

mu-opioid receptor 

medial prefrontal cortex 
methylphenidate 
1-methyl-4-phenylpyridinium 
methyl-4-phenyl-1,2,3,6-tetrahydropyridine 
mineralocorticoid receptor; magnetic resonance 
modified Richmond Agitation and Sedation Scale 
modifiable risk factor 

magnetic resonance imaging 

medial raphe nuclei 

messenger ribonucleic acid 

magnetic resonance spectroscopy 
multiple system atrophy 

McLean Study of Adult Development 
mid-sleep on free day 
melanocyte-stimulating hormone 
Multiphasic Sex Inventory-2 





Abbreviations 


NDD 
NDRI 
NE 
NEAT 
NES 
NESARC 


NET 


NF-kB 
nfvPPA 
NGF 
NGO 
NGS 
NHMRC 
NICE 
NIDA 
NIFID 
NIH 
NIMH 


multiple sleep latency test 

magnetically shielded room 

multi-systemic therapy 

mid-sleep on workdays 

magnetization transfer 

mild traumatic brain injury 

mitochondrial DNA 

multi-dimensional treatment foster care 
mammalian target of rapamycin 

magnetization transfer ratio 

multivariate pattern classification 

monozygotic 

sodium 

noradrenaline 

N-acetyl aspartate 

nucleus accumbens; N-acetylcysteine 

nucleus accumbens 

nicotinic acetylcholine receptor 

negative allosteric modulation 

National Advisory Mental Health Council 
noradrenergic and specific serotonergic 
antidepressant 

third National Surveys of Sexual Attitudes and 
Lifestyles 

net benefit 

Neuroscience-based Nomenclature 

nucleus accumbens 

neurocognitive disorder 

neurocognitive disorder with Lewy bodies 
non-coeliac gluten sensitivity 

non-coding RNA 

National Comorbidity Survey 

National Comorbidity Survey Adolescent 
Supplement 

National Comorbidity Survey-Replication 
National Institute of Mental Health Data Archive; 
new drug approval 

neurodegenerative disease 
noradrenaline/dopamine reuptake inhibitor 
nocturnal enuresis 

non-exercise activity thermogenesis 

night eating syndrome 

National Epidemiological Survey on Alcohol and 
Related Conditions 

noradrenaline (norepinephrine) transporter; 
narrative exposure therapy 

nuclear factor kB 

non-fluent-variant primary progressive aphasia 
nerve growth factor 

non-governmental organization 
next-generation sequencing 

National Health and Medical Research Council 
National Institute for Health and Care Excellence 
National Institute of Drug Abuse 

neuronal intermediate filament inclusion disease 
National Institutes of Health 

National Institute of Mental Health 


NIMH-RGR NIMH Repository and Genomics Resource 


NJRE 
NK-1 
NMDA 
NMDAR 
NMR 
NMS 
NND 
NNI 
NNP 
NNT 
NO 
NOS 
NPC 
NPD 
NPI 
NPS 
NPY 
NREM 
NRI 
NRT 
NSAID 
NSS 
NSSI 
NSSID 
N24SWD 
NTD 
Nu-DESC 
NVAWS 
OAB 
OC 
OCD 
OCDUS 
OCGAS 
OCPD 
OCRD 
OCSD 
ODD 
OECD 


OED 
OFC 
OLZ 
ONS 
OPD 
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OPRI 
OR 
OSA 
OSE 
OSFED 
OST 
OSVZ 
OxCAP-MH 
OXTR 
PA 
PACAP 
PACT 
PAF 


Not Just Right Experience 

neurokinin 1 

N-methyl-D-aspartate 
N-methyl-D-aspartate receptor 

nuclear magnetic resonance 

neuroleptic malignant syndrome 

number needed to detain 
NMDAR-neuromodiulator interaction 
number needed to prevent 

number needed to treat 

nitric oxide 

not otherwise specified; nitric oxide synthase 
neural progenitor cell 

narcissistic personality disorder 
neuropsychiatric inventory 

novel psychoactive substance; neuropeptide S 
neuropeptide Y 

non-rapid eye movement 

selective noradrenergic reuptake inhibitor 
nicotine replacement therapy 

non-steroidal anti-inflammatory drug 
neurological soft sign 

non-suicidal self-injury 

non-suicidal self-injury disorder 
non-24-hour sleep-wake disorder 
neurofibrillary tangle dementia 

Nursing Delirium Screening Scale 

National Violence Against Women Survey 
overactive bladder 

obsessive-compulsive 

obsessive-compulsive disorder 

Obsessive Compulsive Drug Use Scale 

OCD Collaborative Genetic Association Study 
obsessive-compulsive personality disorder 
obsessive-compulsive and related disorder 
obsessive-compulsive spectrum disorder 
oppositional defiant disorder 

Organisation for Economic Co-operation and 
Development 

other eating disorder 

orbitofrontal cortex 

olanzapine 

Office of National Statistics 

operational psychodynamic diagnostics 
optically pumped magnetometer 
octapeptide repeat insertion 

odds ratio 

obstructive sleep apnoea 

other stressor event 

other specified feeding and eating disorders 
opiate substitution therapy 

outer subventricular zone 

Oxford CAPabilities questionnaire-Mental Health 
oxytocin receptor 

periaqueductal 

pituitary adenylyl cyclase-activating polypeptide 
Preschool Autism Communication Trial 
population-attributable fraction 


PAG 
PAI 
PAL 
PAM 


PAMP 
PANDAS 


PANESS 


PaPA 
PAR 
PATS 
PBMC 
PBP 
PCBD 
PCC 
PCL 
PCL-R 
PCL-YV 
PCP 
PCPA 
PCS 
PD 


PDAQ 
PD-CFRS 
PDD 


PDE 
PDE-5 
PD-MCI 
PD-TS 
PE 

PEG 
PEPS 
PERK 


PET 
PET-MR 
PFA 
PFG 
PGAD 
PGC 
PGC-ED 


periaqueductal grey 

Personality Assessment Inventory 

paliperidone 

positive allosteric inhibitor; positive allosteric 
modulation 

pathogen-associated molecular pattern 

Paediatric autoimmune neuropsychiatric disorder 
associated with streptococcal infections 

Physical and Neurological Examination for 

Soft Signs 

Perceptions and Practicalities Approach 
population-attributable risk 

Preschoolers with ADHD Treatment Study 
peripheral blood mononuclear cell 

Parent-Based Prevention 

persistent complex bereavement disorder 
posterior cingulate cortex 

paracentral lobule 

Psychopathy Checklist Revised 

Psychopathy Checklist: Youth Version 

primary care physician 

para-chlorophenylalanine 

post-concussion syndrome 

panic disorder; proton density; Parkinson's disease; 
personality disorder 

Penn Daily Activities Questionnaire 
PD-Cognitive Function Rating Scale 

pervasive developmental disorder; Parkinson’s 
disease dementia 

phosphodiesterase 

phosphodiesterase type 5 

Parkinson's disease with mild cognitive impairment 
personality disorder-trait specified 

prolonged exposure; premature ejaculation 
polyethyleneglycol 

psychoeducation with problem-solving 

protein kinase RNA-like endoplasmic 

reticulum kinase 

positron emission tomography 

positron emission tomography—magnetic resonance 
psychological first aid 

prefrontal cortex 

persistent genital arousal disorder 

Psychiatric Genetics Consortium 

Eating Disorders Working Group of the Psychiatric 
Genomics Consortium 

prostaglandin E, 

polygenic risk score 
phosphoinositide/phosphoinositol; polarity index 
Pittsburgh compound B 

postural instability gait disorder 

phosphotidyl inositol 4,5-biphosphate 
piwi-interacting ribonucleic acid 

protein kinase A 

protein kinase C 

phenylketonuria 

phospholipase C 

psychotic-like experience 
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Problem Management Plus 
paramethoxyamphetamine 

premenstrual dysphoric disorder 
paramethoxymethamphetamine 

postnatal depression 

preoptic area 

pro-opiomelanocortin 

post-partum (puerperal) psychosis 
peroxisome proliferator-activated receptor 
paranoid personality disorder 

DSM-5 Personality and Personality Disorders 
Work Group 

penile plethysmography 

protein-protein interaction 

positive predictive value 

photosensitive retinal ganglion cell 
Programme for Improving Mental Health Care 
(study) 

patient-reported outcome measure 

prion protein; Penn Resilience Program 
cellular prion protein 

scrapie form of prion protein 

polygenic risk scoring 

prostate-specific antigen 

post-synaptic density; post-stroke depression 
Present State Examination 

polysomnography 

progressive supranuclear palsy 

Pittsburgh Sleep Quality Index 
problem-solving therapy 

post-traumatic amnesia; Positive Thoughts and 
Action Program 

phosphorylated tau 

potentially traumatic event 

post-traumatic stress disorder 

premonitory urge 

polyunsaturated fatty acid 

partial volume effect 

post-viral fatigue syndrome 

paraventricular hypothalamic nucleus 
quality-adjusted life year 

quality and outcomes framework 

quality of life 

quantitative trait locus 

quetiapine 

rostral anterior cingulate cortex 

reactive attachment disorder; Reynolds Adolescent 
Depression 

Rapid Assessment, Interface, and Discharge (model) 
Royal Australian and New Zealand College of 
Psychiatrists 

Resourceful Adolescent Program 

retinoic acid receptor 

Repeatable Battery for the Assessment of 
Neuropsychological Status 

rapid eye movement sleep behaviour disorder 
regional cerebral blood flow 

regional cerebral metabolic rate for glucose 
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RDoC 


SAD 
SANS 
SAPS 
SAPS-PD 


SARI 
SAVRY 
SCAN 
SCC 
SCD 
SCFA 
SCID-II 


sCJD 
SCL-90 
SCM 
SCN 
SCO 
SCRD 


randomized controlled trial 

rare coding variant 

radial diffusivity 

Research Diagnostic Criteria 

Research Domain Criteria 

Research Domain Criteria Database 

resting energy expenditure 

rapid eye movement 

risk evaluation and mitigation strategies 

Repeated Episodes of Self-Harm (score) 
RE1-silencing transcription factor 
radiofrequency 

restriction fragment length polymorphism 
G-protein signalling protein 

retinohypothalamic tract 

real life experience 

restless legs syndrome 

ribonucleic acid 

ribonucleoprotein 

Real world Outcomes across the Alzheimer’s Disease 
spectrum for better care: Multi-modal data Access 
Platform 

region of interest 

routine outcome measure 

reactive oxygen species 

Reach Out, Stand Strong, Essentials for new mothers 
(programme) 

risk profile scoring 

relative risk 

restricted and repetitive behaviours and interests 
rapid response team 

rumination syndrome 

resting-state functional magnetic resonance imaging 
resting state network 

repetitive transcranial magnetic stimulation 
real-time quaking-induced conversion 

rostral ventromedial medulla 

Roux-En-Y gastric bypass 

sporadic Alzheimer’s disease 

social anxiety disorder; seasonal affective disorder 
Scale for the Assessment of Negative Symptoms 
Scale for the Assessment of Positive Symptoms 
Scale for Assessment of Positive Symptoms in 
Parkinsons Disease 

serotonin antagonist and reuptake inhibitor 
Structured Assessment of Violence Risk in Youth 
Schedule for Clinical Assessment in Neuropsychiatry 
subcallosal cingulate cortex 

social (pragmatic) communication disorder 
short-chain fatty acid 

Structured Clinical Interview for DSM-IV Axis II 
personality disorders 

sporadic Creuztfeldt—Jakob disease 

Symptom Checklist-90 

structured clinical management 

suprachiasmatic nucleus 

subcommissural organ 

sleep and circadian rhythm disruption 


SCZ 
SD 
SDQ 
SDS 
SEID 
SERCA 
SERT 
SES 
SF-36 
SFO 
SFT 
SG 
SGA 
sgACC 
sgp130 
sgRNA 
SHA 


schizophrenia 

sleep deprivation 

Strengths and Difficulties Questionnaire 
standard deviation score 

systemic exertion intolerance disease 
sarco(endo)plasmic reticulum calcium ATPase 
serotonin; serotonin transporter 
socio-economic status 

Short Form Health Survey 36 
subfornical organ 

schema-focused therapy 

somatosensory gating 
second-generation antipsychotic 
subgenual anterior cingulate cortex 
soluble glycoprotein 130 

single-guide ribonucleic acid 

System of Health Accounts 


SHORT IQ-CODEshort form of the Informant Questionnaire on 


SHQ 
SIADH 
SIDP-IV 


SIH 
SIHD 
SIPP 
SIPS 


SI-R 
siRNA 
SIT 
SLC 
sIMFB 
SMA 
SMD 
SMG 
SMI 
SMIT 1 
SMOC 
SMR 
SN 
SNAP 


SNP 
SNR 
SNRI 
SNV 
SOC 
SOC-7 


SOD 
SOFAS 


SORAG 
SOREMP 
SP 

SPD 


Cognitive Decline in the Elderly 

Clarke Sex History Questionnaire 

syndrome of inappropriate antidiuretic hormone 
Structured Interview for DSM-IV Personality 
Disorders 

stress-induced hyperthermia 

Structured Interview for Hoarding Disorder 
Severity Indices of Personality Problems 
Structured Interview for Prodromal Syndromes; 
Structured Interview for Psychosis-Risk Syndromes 
Saving Inventory-Revised 

short interfering ribonucleic acid 

stress inoculation training 

solute carrier 

superolateral branch of the medial forebrain bundle 
supplementary motor area 

standardized mean difference 

supramarginal gyrus 

severe mental illness 

sodium/myo-inositol transporter 1 

second messenger-operated channel 

standard mortality ratio; standardized mortality rate 
substantia nigra 

Swanson, Nolan, and Pelham (scale); Schedule for 
Nonadaptive and Adaptive Personality 
single-nucleotide polymorphism 

signal-to-noise ratio 

serotonin/noradrenaline reuptake inhibitor 

single nucleotide variant 

store-operated channel 

Standards of Care for the Health of Transsexual, 
Transgender, and Gender-Non-conforming People, 
Version 7 

superoxide dismutase 

Social and Occupational Functioning 

Assessment Scale 

Sex Offender Risk Appraisal Guide 

sleep-onset REM period 

specific phobia; subplate (zone) 

schizotypal personality disorder 


SPECT 
SPZ 
SQUID 
SRAI 
SRI 
SRS 
SRT 
SSCM 
SSRI 
STAT3 
STEP-BD 


STEPPS 


STL 
STN 
StPD 
STPD 
STPP 
SUD 
SUVr 
svPPA 
SVT 
SWAN 


TLR 


single-photon emission computed tomography 
subparaventricular zone 

superconducting quantum interference device 
structured risk assessment instrument 

serotonin reuptake inhibitor 

sex reassignment surgery 

sleep restriction therapy 

specialist supportive clinical management 
selective serotonin reuptake inhibitor 

signal transducer and activator of transcription 3 
Systematic Treatment Enhancement Program for 
Bipolar Disorder 

Systems Training for Emotional Predictability and 
Problem Solving 

superior temporal lobe 

subthalamic nucleus 

schizotypal personality disorder 

schizotypal personality disorder 

short-term psychodynamic psychotherapy 
substance use disorder 

regional standard uptake value 

semantic-variant primary progressive aphasia 
symptom validity test 

Strengths and Weaknesses of ADHD-symptoms and 
Normal-behavior (scale) 

susceptibility- weighted imaging 

slow-wave sleep 

tesla; testosterone 

transcranial alternating current stimulation 
Treatment for Adolescents with Depression Study 
tandem affinity purification and mass spectrometry 
treatment as usual 

thiobarbituric acid reactive substances 

traumatic brain injury 

TANK-binding kinase 1 

tract-based spatial statistics 

tricarboxylic acid; tricyclic antidepressant 
Temperament and Character Inventory 

typically developing; tardive dyskinesia 
transcranial direct current stimulation 

type 2 diabetes mellitus 

TAR-DNA binding protein 

TAR-DNA binding protein 43 

three times daily 

Temperament Evaluation scale from Memphis, Pisa, 
and San Diego 

transcutaneous electrical nerve stimulation 
transcription factor binding site 

trauma-focused cognitive behavioural therapy 
transference-focused psychotherapy 

transient global amnesia 

Test for Gross Motor Development, second edition 
T helper 2 

tetrahydrocannabinol 

transient ischaemic attack 

Treatment and Intervention in Psychosis Study 
tight junction 

Toll-like receptor 


TMF 


TMN 
TMS 
TNF 
TOR 
TPD 
TPJ 
TR 
TRD 
TRH 
TRN 
TRP 
TS 
TSC 
TSF 
TSH 
TSO 
TSPO 
TSST 
t-tau 
TTFL 
UA 
UDS 
UGDS 
UHR 
UHSS 
UI 
UK 
UN 
UP 


UPD 
UPR 
UPS 

US 
USD 
USI-model 
uVNTR 
VaD 
VasD 
VBM 
VCFS 
VCI 
vCJD 
VCP 
VC/VS 
VEGF 
VIAAT 
VIP 
VLPO 
VMAT2 
VMHC 
vmPFC 
VNS 
VNTR 
VNUT 
VPA 
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Trzepacz, Meagher, and Franco (research diagnostic 
criteria) 

tuberomammillary nucleus 

transcranial magnetic stimulation 

tumour necrosis factor 

target of rapamycin 

Tobacco Products Directive 
temporo-parietal junction 

repetition time 

treatment-resistant depression 
thyrotropin-releasing hormone 

thalamic reticular nucleus 

transient receptor potential 

Tourette’s syndrome 

tuberous sclerosis complex 

12-step facilitation 

thyroid-stimulating hormone 

total sexual outlet 

translocator protein 

Trier Social Stress Test 

total tau 

transcriptional-translational feedback loop 
uric acid 

urinary drug screen 

Utrecht Gender Dysphoria Scale 
ultra-high-risk 

UCLA Hoarding Severity Scale 
uncertainty interval 

United Kingdom cHECK 1-4!!! 

United Nations 

Unified Protocol for Transdiagnostic Treatment of 
Emotional Disorders 

uniparental disomy 

unfolded protein response 

unspecified prodromal symptoms 
unconditioned stimulus; United States 
United States dollar 

Universal, Selected and Indicated preventive model 
upstream variable number of tandem repeats 
vascular dementia 

vascular dementia 

voxel-based morphometry 
velo-cardio-facial syndrome 

vascular cognitive impairment 

variant Creuztfeldt—Jakob disease 
valosin-containing protein 

ventral capsule/ventral striatum 

vascular endothelial growth factor 
vesicular inhibitory amino acid transporter 
vasoactive intestinal peptide 

ventrolateral preoptic 

vesicular monoamine transporter-2 
voxel-mirrored homotopic connectivity 
ventromedial prefrontal cortex 

vagal nerve stimulation 

variable numbers of tandem repeat 
vesicular nucleotide transporter 

valproate 
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VR 
VRAG 
VRAG-R 
VRET 
VS/NcAcc 
VTA 
WASO 
WCST 
WESBP 


WHO 
WHO-DAS 


ventral periaqueductal grey 

virtual reality 

Violence Risk Appraisal Guide 

Violence Risk Appraisal Guide- Revised 
virtual reality exposure therapy 

ventral striatum/nucleus accumbens 
ventral tegmental area 

wake after sleep onset 

Wisconsin Card Sorting Task 

World Federation of Societies of Biological 
Psychiatry 

World Health Organization 

World Health Organization Disability Assessment 
Schedule 


WM 
WMH 
WPA 
WTCCC3 
XMRV 
Y-BOCS 
YFAS 
Y2H 
YLD 
YLL 
YSR 
ZIP 


white matter 

World Mental Health; white matter hyperintensity 
World Psychiatric Association 

Wellcome Trust Case-Control Consortium 3 
xenotropic murine leukaemia virus-related virus 
Yale-Brown Obsessive Compulsive Scale 

Yale Food Addiction Scale 

Yeast 2 Hybrid 

year of life lived with disability 

year of life lost 

Youth Self-Report 

ziprasidone 
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It is difficult to be a psychiatric patient, but a good doctor can make it 
less so. Confusion and fear can be overcome by knowledge and com- 
passion, and resistance to treatment is often, although by no means 
always, amenable to change by intelligent persuasion that leads to 
better healing. The devil, as the fiery melancholic Byron knew, is in 
the details. 








Patients, when first given a psychiatric diagnosis, are commonly 
both relieved and frightened—relieved because often they have 
been overwhelmed by pain, anxiety, and hopelessness for a con- 
siderable period of time, and frightened because they do not know 
what the diagnosis means, what the treatment will entail, and their 
likelihood of obtaining a meaningful response. They do not know if 
they will return to the way they once were, whether the treatment 
they have been prescribed will or will not work, and, even if it does 
work, at what cost it will be to them in terms of their notions of 
themselves, potentially unpleasant side effects, and the reactions of 
their family members, friends, colleagues, and employers. Perhaps 
most disturbing, they do not know if their depression, psychosis, 
anxieties, or compulsions will return to become a permanent part 
of their lives. Caught in a state often characterized by personal an- 
guish, social isolation, and confusion, newly diagnosed patients find 
themselves on a quest to regain a sense of mastery of themselves and 
their surroundings. One of the main goals of therapies of all types is 
to empower the patient and give them some control back over their 
world and rechart the meaning and purpose of their lives under al- 
tered circumstances. 

The specifics of what the doctor says and the manner in which he 
or she says it are critically important from the start and will colour 
the patient’s ongoing treatment course for years to come. Most pa- 
tients who complain about receiving poor psychiatric care do so 
on several grounds—their doctors, they feel, spend too little time 
explaining the nature of their illnesses and treatment; they are re- 
luctant to consult with, or actively involve, family members; they 
are patronizing and do not adequately listen to what the patient has 
to say; they do not encourage questions or sufficiently address the 

















concerns of the patient; they do not discuss alternative treatments, 
the risks of treatment, and the risks of no treatment; and they do not 
thoroughly forewarn about side effects of medications. 

Most of these complaints are avoidable. Time, although difficult to 
come by, is well spent early on in the course of treatment when the 
manifestations of confusion and hopelessness are greatest, the risk of 
non-adherence is highest, and the possibility of suicide substantially 
increased. Hope can be realistically extended to patients and family 
members, and its explicit extension is vital to those whose illnesses 
have robbed them not only of hope, but also of belief in themselves, 
their future, and the very meaning of their lives. The hope provided 
needs to be tempered, however, by an honest and realistic explica- 
tion of possible difficulties yet to be encountered: unpleasant side ef- 
fects from medications; a rocky time course to meaningful recovery 
which will often consist of many discouraging cycles of feeling the 
progress of marching towards wellness, only to stumble and slide 
temporarily backwards towards illness again; and the probable per- 
sonal, professional, and financial repercussions that come in the 
wake of having a psychiatric illness. 



































It is terrifying to lose one’s sanity or to be seized by a paralysing de- 
pression. No medication alone can substitute for a good doctor's 
clinical expertise and the kindness of a doctor who understands 
both the medical and psychological sides of mental illness. Nor can 
any medication alone substitute for a good doctor’s capacity to listen 
to the fears and despair of patients trying to come to terms with what 
has happened to them. A good doctor is a therapeutic optimist who 
is able to instil hope and confidence to combat bewilderment and 
despair. Great doctors are able to provide the unwavering care to 
their patients that they would want a member of their own family 
to receive, blending empathy and compassion with expertise and 
confidence. 

Doctors need to be direct in answering questions, to acknowledge 
the limits of their understanding, and to encourage specialist con- 
sultations when the clinical situation warrants it. They also need 
to create a therapeutic climate in which patients and their families 
feel free, when necessary, to express their concerns about treatment 
or to request a second opinion. There must also be a willingness by 
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doctors to collaborate across medical disciplines in the care of their 
psychiatric patients because of the influence and, likewise, the im- 
pact of somatic diseases on mental illness—for example, there is 
evidence that depression predisposes people to conditions such as 
myocardial infarction, diabetes, and multiple sclerosis, all of which 
conversely increase the likelihood of depression. Moreover, persons 
with major depression and schizophrenia have a 40-60% greater 
chance of dying prematurely than the general population, due to 
physical health problems that are often left unattended or exacer- 
bated by the side effects of psychotropic medications. Doctors are 
also frequently called upon to advocate for their psychiatric patients 
who are frequently stigmatized and therefore at great risk of being 
discriminated against by being deprived of their professional, eco- 
nomic, social, and cultural rights. Particular care must be taken by 
doctors to prevent their patients from receiving substandard care 
by refusing to share, against their patient’s better judgement, im- 
portant aspects of their mental illness with non-mental health med- 
ical practitioners. 

Treatment non-adherence, one of the major causes of unnecessary 
suffering, relapse, hospitalization, and suicide must be addressed 
head-on. Unfortunately, doctors are variable in their ability to assess, 
predict, and facilitate adherence in their patients [1]. Asking directly 
and often about medication concerns and side effects, scheduling 
frequent follow-up visits after the initial diagnostic evaluation and 
treatment recommendation, and encouraging adjunctive psycho- 
therapy or involvement in patient support groups can make a crucial 
difference in whether or nota patient takes medication in a way that 
is most effective. Aggressive treatment of unpleasant or intolerable 
side effects, minimizing the dosage and number of doses, and pro- 
viding ongoing, frequently repetitive education about the illness and 
its treatment are likewise essential, if common-sense, ways to avert 
or minimize non-adherence. 





Communication in the digital age 


The ever-expanding availability of health information technology, 
ranging from assistive devices (that permit regular tracking of symp- 
toms and reminders to facilitate treatment adherence such as auto- 
mated texting and telemedicine) to therapeutic tools (that provide 
interventions such as online cognitive behavioural therapy), will 
continue to improve the ease with which care can be delivered. But 
in the end, it is the therapeutic alliance between patient and clin- 
ician, honed and proven over two and a half millennia since the time 
of Hippocrates, that will and must remain central to the healing 
process. Technology can assist and enhance, but not replace, the 
doctor-patient relationship. 


Doctor as teacher 


Education is, of course, integral to the good treatment of any 
illness, but this is especially true when the illnesses are chronic 
and shrouded in the secrecy that is caused by both social and per- 
sonal stigma. The term ‘doctor’ derives originally from the Latin 
word for teacher, and it is in their roles as teachers that doctors 
provide patients with the knowledge and understanding to combat 


the confusion and unpredictability that surround mental illness. 
Patients and their family members should be encouraged to write 
down any questions they may have, as many individuals are intimi- 
dated once they find themselves in a doctor’s office. Any informa- 
tion that is given orally to patients should be repeated as often as 
necessary (due to the cognitive difficulties experienced by many 
psychiatric patients, especially when acutely ill or recovering from 
an acute episode) and, whenever feasible, provided in written form 
as well. Additional information is available to patients and family 
members in books and pamphlets obtainable from libraries, book- 
stores, and patient support groups, but, ever more commonly, in- 
formation is accessible through the Internet as videos, websites, and 
online support groups [2, 3]. Visual aids, such as charts portraying 
the natural course of the treated and untreated illness or the causes 
and results of sleep deprivation and medication cessation, are also 
helpful to many [4-6]. Finally, providing the patients with self- 
report scales to monitor their daily progress, such as mood charts 
in affective disorder, not only provides invaluable clinical data, but 
also teaches patients and their physicians to better understand the 
patient's illnesses and their response to therapeutic interventions 
and exacerbating stressors. Family members and significant others 
can, and usually do, play key roles as outside sources of information 
which can be critically important in ensuring that the proper diag- 
nosis is made at the outset. Patients, when they are well, also often 
benefit from a meeting with their family members and their doctor 
that focuses upon drawing up contingency plans in case their 
illness should recur. These meetings also provide an opportunity 
to shore up the support system the patient has by educating their 
caregivers about the nature, cause, manifestations, and treatment 
of their loved one’s mental illness. Such meetings may also include 
what is to be done in the event that a psychiatric emergency arises 
and hospitalization is required, a discussion of early warning signs 
of impending psychotic or depressive episodes, methods for regu- 
larizing sleep and activity patterns, techniques to protect patients 
financially, and ways to manage suicidal behaviour should it occur. 
Suicide, globally the second leading cause of death in 15- to 29-year 
olds, is the major cause of premature death in severe psychiatric 
illnesses [7, 8], and its prevention is of first concern. Those illnesses 
most likely to result in suicide (mood disorders, comorbid alcohol 
and drug abuse, and schizophrenia) need to be treated early, aggres- 
sively, and often for an indefinite period of time [2, 10]. Lithium, 
which has demonstrated significant efficacy in preventing suicide, 
should be considered when appropriate [11]. The increasing evi- 
dence that treatment early in psychiatric illness may improve the 
long-term course needs to be considered in light of the reluctance 
of many patients to stay in treatment [10, 12, 14]. 


Conclusions 


The ancient proverb medice, cura te ipsum (physician, heal thy- 
self) applies most pressingly to mental illness, because the rates of 
burnout, depression, and suicide among doctors are deeply con- 
cerning. A willingness to change the culture of medicine, so that 
more time, attention, and education is given to the critically im- 
portant aspects of mental health, routine screening, and treatment 
of depression to encourage, rather than punish, seeking help. 
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Stigma can be considered as an overarching term that includes chal- 
lenges faced by people with mental illness related to knowledge, atti- 
tudes, and behaviour [1]. The knowledge domain includes low levels 
of mental health literacy, for example among the general popula- 
tion (ignorance); the attitudinal domain relates to almost entirely 
negative affect towards people with experience of mental illness 
(prejudice), while the behavioural aspects reflect predominantly 
forces for the social exclusion and diminished citizenship for people 
with mental illness (discrimination). This chapter considers the evi- 
dence of the implications of these elements and also summarizes 
the literature on what can be done to effectively reduce stigma and 
discrimination. 












































The consequences of stigma and discrimination are wide-reaching 
and severe, and affect people with mental disorders, their family 
members, mental health staff, institutions, and treatments, as well 
as society as a whole. 

Discrimination, the behavioural consequence of stigma, adds to 
the disability of persons with mental illness and leads to disadvan- 
tages in many aspects of life, including personal relationships, educa- 
tion, and work [1, 2]. It limits the life opportunities of those affected, 
through loss of income, prolonged unemployment, reduced access 
to housing or health care, for example, and therefore reduced access 
to important means of recovery [3]. Commonly, people with mental 
disorders experience unequal treatment for physical health condi- 
tions, leading to rates of morbidity and mortality much beyond what 
is attributable to their primary mental disorder [4]. Discrimination 
because of mental illness is pervasive and universal—international 
studies of mental illness discrimination have shown that rates of 
both anticipated and experienced discrimination are consistently 
high across countries among people with mental disorders [5-8]. 

Yet another form of devaluation takes place when individuals 
affected by mental illness stigma accept the negative beliefs held 
against them and lose self-esteem, resulting in self-stigma (or ‘in- 
ternalized stigma) [9-11]. Internal consequences of stigma and 





discrimination have been the subject of a number of studies and 
include feelings of shame, a loss of emotional well-being, poor self- 
efficacy, and negative recovery outcomes [12-19]. 

What self-stigma can mean is vividly described in a quote by Gallo 
[20, pp. 407-8] quoted in Angell et al. (2005) [21]—a statement from 
a person with mental illness on how stigma and discrimination have 
changed the way she feels about herself: 


‘I perceive myself, quite accurately, unfortunately, as having a serious 
mental illness and therefore as having been relegated to what I called 
“the social garbage heap’, I tortured myself with the persistent and 
repetitive thought that I would encounter, even total strangers, did 
not like me and wished that mentally ill people like me did not exist. 
Thus I would do things such as standing away from others at bus 
stops and hiding and cringing in the far corners of subway cars. 
Thinking of myself as garbage, I would even leave the side walk in 
what I thought of as exhibiting the proper difference to those above 
me in social class. The latter group, of course, included all other 
human beings: [20]* 


Internal consequences of stigma and discrimination can fur- 
ther lead to hopelessness and depression, social withdrawal, and 
reduced participation in treatment programmes [3] and act as a 
stressor that perpetuates ill health and makes recovery more diffi- 
cult [22, 23]. Coping responses, such as secrecy about the condition 
and avoidance of others, further feed into the cycle of isolation and 
alienation [3]. 

In addition to experiences of direct discrimination from others, 
persons suffering from mental illness face several forms of struc- 
tural discrimination, for example manifest in the lack of resources 
allocated to the care of mental disorders, the location and quality of 
some treatment facilities, and inadequate attention to the physical 
health needs of people with mental disorders [24, 25]. 

Paradoxically, stigmatizing practices and even human rights vio- 
lations are found within mental health services worldwide [26-28]. 
Undesirable conditions in mental health institutions, as well as the 
shame and fear of disclosure associated with attending them, act asa 
barrier for help-seeking and the effective treatment of mental health 





Reproduced from Schizophr Bull., 20(2), Gallo KM, First person account: Self- 
stigmatization, pp. 407-410, Copyright (1994), with permission from Oxford 
University Press. 


conditions [29]. For example, people with mental disorders may 
delay seeking treatment or terminate treatment prematurely for fear 
of being labelled and discriminated against [3, 30]. 

A statement from Diana on restrained treatment by health-care 
professionals in a psychiatric hospital: 


“There were between six and eight staff members, I am not sure, 
I can’t remember too much. I didn't have a very clear vision. I saw 
people surrounding me, holding me by the hand, holding me by the 
legs. I dont think it was something they had to do. There was no 
talking. They would have helped better if they would have been more 
understanding and more talking... more respect. I felt really bad. 
While I was in hospital I tried to complain but I don’t know if any- 
body was listening. It was a nightmare’ [1, p. 87]? 


Another very commonly cited source of stigma is family mem- 
bers. Even although many people experience great support from 
their families, it is family members too who often hold negative atti- 
tudes towards people with mental illness and even within their fam- 
ilies treat them in a discriminatory way. 


“There I was, the eldest son suffering a sudden deep depression, 
crying and unable to work. Often threatened by my confused Dad 
as being “weak’, “a fuck-up’, and a “nutter”. No-one else in the family 
going back generations had gone “mad like that”. I was told not to 
tell any of the neighbours what was happening - to stop the gossip. 


(Paul)’ [1, p. 2] 


In many societies where services are scarce and support systems 
inadequate, families feel forced to resort to chaining and other prac- 
tices to restrain relatives with mental illness [28, 31]. 

Research has shown that mental health professionals them- 
selves hold negative stereotypes and attitudes similar to the general 
population and even more pessimistic views in the domain of re- 
covery, possibly due to their disproportionate contact with those 
with poorer outcomes [32]. Service users commonly report lack of 
empathy and interest from health professionals, diagnoses being 
given with negative prognosis, and lack of information and involve- 
ment in decision-making [33]. 


‘Some of the worst experiences I have had have been in psychiatric 
hospitals. I recognise the need to be kept safe but often I have felt 
that my rights and dignity have been stripped away. Being intimately 
searched again and again and constantly followed whilst under “close 
observation” just leaves me feeling singled out and perceived as little 
more than a nuisance (“theres to be no trouble on my shift”) [ ... ] 
I have heard many comments along the lines of “Oh, she's cut again. 
Why doesn't she do it properly and kill herself”. (Sandra) [1], p. 94]* 


Stigma and discrimination do not only affect persons suffering 
from mental illness, but also families [34-36]. The effect of nega- 
tive attitudes towards the family members of people with mental 
illness has been described as ‘stigma by association’ and may lead to 
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experiences of direct discrimination, as well as feelings of shame and 
self-blame [1]. In societies where the cohesion of family networks is 
strong, the impact of stigma by association may be severe and can in- 
clude economic consequences, as well as impact on work or marital 
prospects [37]. 


Contextual factors relevant to stigma 


and discrimination 





The manifestations of stigma and discrimination are subject to the 
influence of a range of cultural and contextual factors [38]. Key do- 
mains through which culture shapes the manifestations of stigma 
include: (1) notions of ‘mental illness’ and explanatory models (for 
example, in many settings, psychiatric symptoms may not be seen as 
indicative of an ‘illness’); (2) cultural meanings of the impairments 
and manifestations caused by the disorder and its stigma (for ex- 
ample, the impact of stigma on marital prospects may have more 
severe implications in cultural contexts where marriage is central); 
and (3) notions of self and personhood (for example, higher levels 
of family cohesion may offer more support but also go along with 
a more widespread impact of stigma across family members and 
generations). 

Also socio-economic factors, such as poverty and access to health 
care, determine the context in which stigma is enacted and experi- 
enced [7, 9, 39, 40]. In low- and middle-income countries (LMICs) 
and other settings where most people with mental illness do not 
have access to social welfare benefits, the negative economic con- 
sequences of stigma, for example, through discrimination in work, 
may be so severe as to threaten the economic survival of entire 
families [41]. 





Global patterns of stigma and discrimination 


There are few studies comparing the frequency of experiences of 
stigma and discrimination in different contexts, and recent research 
has sought to address this gap in the literature. International surveys 
of experienced and anticipated discrimination among people with 
schizophrenia (27 countries) and among people with depression (39 
countries), for example, found rates of both outcomes to be con- 
sistently high across cultures [5, 7, 8]. Significant between-country 
variation was found for experienced discrimination, but not for an- 
ticipated discrimination reported by people with schizophrenia [7]. 
A report on the qualitative data collected as part of the same study, 
however, found few transnational differences [6]. Another study 
looking at public attitudes across 16 countries identified a ‘back- 
bone’ of certain prejudices that were held across all settings, even 
where overall stigma was relatively low [42]. 

On the other hand, some smaller studies suggest stark differences 
between high-income country (HIC) and LMIC settings, for example, 
studies from China [43] and India [41], with rates of experienced dis- 
crimination much lower than those commonly reported from HIC 
studies, and qualitative differences in the meaning and appraisal of the 
experiences made. At first sight, this appears to support the findings 
of early cross-cultural research on stigma, suggesting that the stigma 
of mental illness may be less marked in non-industrialized societies 
due to a more supportive environment with more social cohesion, and 
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therefore less risk of prolonged rejection, isolation, segregation, and 
institutionalization [44, 45; 46, 47 cited in 48]. The better prognosis 
of schizophrenia found in international studies by the World Health 
Organization (WHO) [49-52] has therefore commonly been attrib- 
uted to less stigmatization in LMICs [53]. 

Yet, in contradiction to this, there is now a considerable body 
of evidence documenting that in many LMIC settings, experi- 
ences of stigma, discrimination, and human rights abuses due to 
mental illness are common and severe [5, 11, 27, 37, 54-62]. One 
international study using population-wide data from 16 countries 
found even higher rates of reported stigma among people with 
mental disorders in developing (31.2%) than in developed (20%) 
countries [55]. 

In conclusion, our understanding of global patterns of stigma 
and discrimination is still rather limited to date, and further high- 
quality cross-cultural research is needed to throw light on the forces 
that drive intercultural differences in the manifestation of stigma. 
Understanding the factors that shape stigma distinctly in different 
contexts will serve to inform the development of context-specific 
anti-stigma interventions. 


How to measure stigma 


Alongside the development of research into stigma, the creation and 
validation of instruments to measure stigma and discrimination 
took their beginnings in the 1960s. Early scales focused largely on 
the measurement of stigmatizing attitudes among the general popu- 
lation. Since, numerous scales have been developed, incorporating 
a wider range of perspectives on stigma and discrimination, not- 
ably the inclusion of the perspectives and experiences of service 
users and carers [63]. Nevertheless, there continues to be a distinct 
lack of measures developed or validated in LMIC settings and/or 
non-Western cultures [64]. Several methods have been put forward 
which seek to achieve cultural validity of measures of stigma and 
discrimination, including an approach by Yang et al. which proposes 
to focus on ‘what matters most’ in a given culture [65, 66]. A recent 
review concluded that future efforts in the domain of measuring 
stigma and discrimination should focus on: (i) procedures for 
achieving cultural validity of measurement tools, (ii) indicators for 
structural stigma and stigmatizing behaviour (underrepresented in 
current scales), and (iii) targeted or tailored measures for specific 
subgroups, all with a particular focus on LMIC countries where 
literature is sparse [63]. This is important as the appropriate meas- 
urement of stigma and discrimination is critical to understanding 
whether and how anti-stigma interventions are effective [63]. 


How to tackle stigma 


The critical question to tackle stigma in mental health is: what inter- 
ventions work? In the past years, research on anti-stigma interven- 
tions to change knowledge, attitudes, and behaviour towards people 
with mental illness has increased. Most interventions aim at chan- 
ging one or several of these aspects through education, social con- 
tact, or behavioural interventions. 

A recent narrative review concluded with the following main 
findings on the evidence of anti-stigma interventions [64]: 


(1) ‘at the population level there is a fairly consistent pattern of short- 
term benefits for positive attitude change, and some lesser evi- 
dence for knowledge improvement; 

(2) for people with mental illness, some group-level anti-stigma in- 
ventions show promise and merit further assessment; 

(3) for specific target groups, such as students, social-contact-based 
interventions usually achieve short-term (but less clearly long-term) 
attitudinal improvements, and less often produce knowledge gains; 

(4) this is a heterogeneous field of study with few strong study de- 
signs with large sample sizes; 

(5) research from low-income and middle-income countries is con- 
spicuous by its relative absence; 

(6) caution needs to be exercised in not overgeneralising lessons 
from one target group to another; 

(7) there is a clear need for studies with longer-term follow-up to as- 
sess whether initial gains are sustained or attenuated, and whether 
booster doses of the intervention are needed to maintain progress; 

(8) few studies in any part of the world have focused on either the 
service user’s perspective of stigma and discrimination or on the 
behaviour domain of behavioural change, either by people with or 
without mental illness in the complex processes of stigmatisation’® 


It has been found that generally the effectiveness of the interven- 
tions depends much on the target group and the time frame of the 
intervention. However, most studies are short-term effectiveness 
studies looking at attitudes of the general public towards people with 
mental disorders in HICs. The most widely evaluated interventions 
are education/information and social contact [63]. 

Overall there remains a large knowledge gap for medium- to long- 
term anti-stigma interventions, and particularly for interventions in 
low-income countries where evidence is almost absent [63]. There is 
also a need for: (i) more high-quality interventions based on robust 
methods and validated measures, (ii) more systematic reviews on 
long-term effectiveness, (iii) more randomized controlled trials, and 
(iv) more evidence from LMICs [67]. 


Social contact-based interventions 


Interventions using social contact as a key element have been found to 
be the most effective type of interventions [68]. At the same time, social 
contact is also the best evidence-based intervention, particularly in 
short-term outcomes. Evidence from systematic reviews suggests that 
social contact is the most effective intervention in terms of achieving 
short-term improvements in knowledge and attitudes among adults. 

An account by a young man who participated in the German 
school project ‘Crazy? So what!’: 


‘Eight years ago I became ill: I developed schizophrenia [ ... ]. Pve 
been feeling better now for two years. But I do have to take good care 
of myself. But hiding because of that? These times are over. I finally 
want to live now! Talking to the students is exhausting but also really 
great [ ... ] they discover that there are a lot more commonalities 
than differences between us, that their images of the ‘crazy ones’ are 





* Reproduced from The Lancet, 387(10023), Thornicroft G, Mehta N, Clement 
S, et al., Evidence for effective interventions to reduce mental-health-related 
stigma and discrimination, pp. 1123-1132, Copyright (2015), with permission 
from Elsevier. 


not true. It feels really good to contribute to achieving that we finally 
can talk openly about mental illness, and that nobody has to hide 
because of a mental health problem? [69]® 


Social contact is the most effective type of intervention in the 
short term, but it is not clear whether effectiveness is sustained in 
the medium to longer term [67]. While social contact has been re- 
ported to be the most effective intervention in adults, these evalu- 
ations are mostly based on intervention studies from HICs. There is 
a great need for more evidence from LMICs to assess whether social 
contact is as effective there and how to implement it to suit local re- 
quirements. In addition, more research is needed to investigate the 
long-term effectiveness of social contact interventions. 


Educational interventions 





‘The [ ... ] practical way to stop stigma and discrimination is by 
better education of schoolchildren at an early age and to reinforce 
this message through lifelong learning. Each course or class should 
not only start with “household” messages about fire escapes, etc., but 
that bullying or discrimination will not be tolerated whilst on the 
course’ (Paul) [1] 

(Thornicroft, 2006) 


While direct social contact interventions have been found to be 
the most effective intervention in adults, systematic reviews have 
found that in students, educational interventions are more effective 
in reducing stigma in students’ knowledge and attitudes in the short 
term. However, the evidence base for effectiveness in the medium 
to longer term is weak [64]. A meta-analysis found both social con- 
tact as well as educational interventions reduce stigma significantly 
and, importantly, irrespectively if these interventions are delivered 
face-to-face or via Internet programmes [70]. Moreover, Thornicroft 
et al. have found evidence that education and information seem to 
be the most effective interventions in the medium and long terms 
[64]. Evaluations in HICs have found that stigma and discrimin- 
ation against people with mental illness can be reduced through fo- 
cused, long-term information campaigns like Time to Change in the 
United Kingdom (UK) [71]. High-quality effectiveness evaluations 
for educational interventions are scarce for LMICs. Several national 
and regional campaigns from LMICs report qualitative changes in 
attitudes and behaviour; however, these effects lack high-quality 
evaluation for quantitative efficiency [72]. 


Behavioural domain 


Overall the effect of behavioural therapy and psychotherapy has 
not been sufficiently researched. In persons with mental illness, 
psychoeducational therapy, including elements of cognitive behav- 
ioural therapy (CBT), seem to be effective in reducing self-stigma 
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Association Irrsinnig Menschlich e.V., Copyright (2002), with permission from 
Irrsinnig Menschlich e.V. 


CHAPTER 2 Public attitudes and the challenge of stigma 


[73]. Yet, CBT has been found not to be effective in reducing stigma 
in other groups. 

For medium- or long-term outcomes, systematic reviews have 
found there was not sufficient research to believe psychotherapy or 
entertainment/arts interventions can help to reduce stigma [64]. 


Conclusions 


From this discussion, the authors draw the following conclusions. 
Stigma and discrimination appear to be universal in their presence 
and impact, although there are clear local and regional varations in 
their content and manifestations. Lay stigma by the general public 
constitutes a powerful force for social exclusion, and in addition there 
is also strong evidence that stigma among health-care professionals 
is a powerful barrier to the mental and physical health care needed 
by people with mental illness. There is now increasingly strong evi- 
dence that personal and social contact methods, including filmed/ 
virtual contact, is the most strongly evidence-based method to re- 
duce stigma and discrimination. This evidence is now accumulating 
at inter-personal, organizational, and national levels. But as yet, there 
are few longer-term studies to know if such gains are sustainable in 
the long term. Nearly all the research evidence is from HICs, with 
a distinct evidence gap from LMICs. For the future, it is clear that 
service users are the central pioneers/key active ingredients in anti- 
stigma programmes and that interventions specifically locally and 
culturally adapted for use in LMICs are a pressing priority. 
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Global health has been defined as ‘an area for study, research and 
practice that places a priority on improving health and achieving 
equity in health for all people worldwide’ [1]. In a similar vein, global 
mental health has emerged in recent years as a field of research, ad- 
vocacy, and practice that is focused on improving the mental health 
of populations and reducing inequity in the global burden of mental 
illness. Because these challenges are multi-faceted, global mental 
health has been marked from its inception as a multi-disciplinary 
field, incorporating disciplines of psychiatry, psychology, epidemi- 
ology, health economics, anthropology, and implementation sci- 
ence. Although much of the work of global mental health has been 
focused on the needs of women and men living in low- and middle- 
income countries (LMICs), there is increasing awareness that many 
of the mental health challenges faced by populations in LMICs are 
also encountered in high-income countries (HICs). 

Why then is it important to think about mental health in global 
terms? Firstly, despite the massive and growing global burden of 
mental illness, mental health has been neglected in global health 
and international development policy [2]. For example, mental 
health was excluded from the 2000-2015 Millenium Development 
Goals, and much of the focus and funding of agencies like the Global 
Fund and the Bill and Melinda Gates Foundation has been on com- 
municable diseases. Secondly, emerging evidence from the WHO 
Atlas surveys informs us that the world’s mental health resources 
are largely concentrated in HICs and in wealthier, largely urban set- 
tings of LMICs—in short, there is a global maldistribution of mental 
health resources [3]. Thirdly, human rights abuses and stigma, espe- 
cially against people with severe mental illness, are global problems 
that pervade all cultures of the world [4]. Fourthly, there is strong 
emerging evidence regarding the links between the global challenge 
of poverty and mental health—poverty is a key social determinant of 
population mental health, and mental illness leads to individual and 
household impoverishment [5]. Finally, there are important links 
between mental health and many other global development chal- 
lenges, including gender inequity, violence, forced migration, and 
climate change. As several have argued, there is no sustainable de- 
velopment without mental health [6]. 

This chapter sets out to provide an overview of the emerging and 
dynamic field of global mental health. It starts with a brief history 





of the field, noting landmark publications and events. The chapter 
then introduces key areas of enquiry and action in global mental 
health, including epidemiology, mental health policy and services, 
intervention research, implementation science, humanitarian set- 
tings, qualitative research, instrument development, and economic 
evaluation. Finally, it concludes with reflections on critiques of the 
field and future directions. 











The emergence of global mental health marks a departure from, 
but also continuity with, earlier efforts to conceptualize and address 
mental health problems around the world. While historical accounts 
have traced the evolution and proliferation of psychiatric knowledge 
back to antiquity, the rise of institutional care in Europe, and the spread 
of the psychiatric asylum to many countries during the colonial era 
[7], global mental health as a specific institutional and programmatic 
assemblage has emerged out of a more recent post-Second World War 
history. During this period, mental health became the subject of inter- 
national public health and psychiatric epidemiology, which rendered 
mental disorders comparable across regions and other diseases, and 
allowed for mental health to become visible as a ‘global’ problem. 
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Initially, after the founding of WHO in 1948, mental health promo- 
tion followed a pacifist agenda and conceived of mental health as 
a vehicle to further ‘harmonious relationships between mer and 
ultimately ‘world peace’ [8]. WHO's initial focus was not on dis- 
tinct mental illnesses, but on mental health as a positive state of 
well-being in the tradition of the ‘mental hygiene’ movement, which 
had advocated for the prevention of mental illness through edu- 
cation since the 1920s [9]. The dual goal of building harmonious 
‘world citizenship and mental health education also informed the 
founding of the World Mental Health Federation in 1948, the first 
non-governmental organization (NGO) with which WHO collab- 
orated on matters of mental health policy [10]. 

A stronger focus on mental disorders came about with the 
rise of psychiatric epidemiology in international mental health 
in the 1970s, which sought to assess the prevalence, distribu- 
tion, and determinants of mental illness. One key challenge to the 


epidemiological approach was the lack of standardization in the 
classification and interpretation of mental disorders. While the early 
versions of the WHO International Classification of Diseases (ICD- 
6) (1948) and ICD-7 (1955) contained sections on mental illness, 
they were not widely accepted internationally. Comparative studies 
on diagnostic practice had also shown large differences in the clin- 
ical interpretation of disease categories [11]. Comparable data on 
mental disorders thus depended on the development of standard- 
ized instruments and classifications. In 1966, WHO initiated what 
would become a series of cross-cultural proof-of-concept studies on 
schizophrenia that demonstrated that the disorder existed around 
the world, that it could be measured with standardized instru- 
ments, and that it had greatly varying outcomes across developed 
and developing settings [12]. Greater standardization of psychi- 
atric classification and practice was achieved through the publica- 
tion of the Diagnostic and Statistical Manual of Mental Disorders, 
version 3 (DSM III) in 1980, which abandoned the psychodynamic 
aetiological model in favour of a symptom-based approach with no 
aetiological underpinning. The DSM classification subsequently in- 
creased its harmonization with WHO's ICD system, and a number 
of instruments were developed to facilitate the collection of com- 
parable data, including the Present State Examination (PSE), the 
Schedule for Clinical Assessment in Neuropsychiatry (SCAN), and 
the Composite International Diagnostic Instrument (CIDI). 


Disability and disability-adjusted life years 


Another important factor lending mental health increasing import- 
ance within international health was the development of disability 
frameworks beginning in the 1980s and leading up to the creation 
of a new statistical measure that combined morbidity and mor- 
tality into the assessment of the global burden of disease through 
the disability-adjusted life years (DALYs) measure, as part of the 
World Development Report in 1993. This marked the World Bank's 
increasing investment in health, based on a ‘human capital’ logic, 
and its entrance into ‘Global Health’ [13]. DALYs laid the founda- 
tion for the influential ‘Global Burden of Disease (GBD) studies 
[14], which fundamentally changed how health was measured on a 
global scale. For the mental health agenda, this shift towards mor- 
bidity had a significant impact as mental and neurological disorders 
became visible as one of the largest contributors to the GBD [15]. 
Subsequently, medical anthropologists, including Robert Desjarlais, 
Leon Eisenberg, Byron Good, and Arthur Kleinman, drew further 
attention to this enormous burden and formulated a call for inter- 
national action on mental health in the landmark World Mental 
Health Report [16], which combined global data with local case 
studies. The report was launched at the United Nations in 1995, 
and in response, WHO initiated the Nations for Mental Health pro- 
gramme a year later, which promoted mental health service delivery 
for underserved populations [17]. These efforts marked a shift to- 
wards service delivery rather than pure epidemiology in inter- 
national mental health. 


World Health Report 2001 


The year 2001 was a significant year for global mental health when, 
with the support of the new WHO Director General Gro Harlem 
Brundtland, the World Health Day and World Health Report were 
dedicated in their entirety to mental health. The theme of the day 
‘Stop exclusion—Dare to care’ speaks of this shift towards care 
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delivery and the human rights agenda. In conjunction, the first 
WHO Mental Health Atlas provided a global overview over the ex- 
isting mental health care resources [18], marking another important 
step towards the quantification of the global ‘treatment gap’ Parallel 
to these developments, the wider field of global health consolidated 
in the early 2000s [13] and mental health advocates sought to emu- 
late its achievements and strategies, especially the enormous success 
of the HIV/AIDS advocacy movement. 


Global mental health 


Global mental health as a distinct global health community and pro- 
gramme was officially inaugurated in 2007 when The Lancet pub- 
lished a special series on global mental health, edited by Patel and 
colleagues. This series documented the links between mental health 
and other health conditions [19], presented a systematic review of the 
evidence for cost-effective prevention and treatment interventions in 
LMICs [20], and called for the scaling up of evidence-based mental 
health care in LMICs [21]. The field of global mental health has since 
grown significantly, with the creation of a distinct assemblage of in- 
stitutions and training programmes, including Masters and doctoral 
programmes and short courses; online networks [the Movement for 
Global Mental Health (MGMH) and the Mental Health Innovation 
Network (MHIN)]; specialty journals such as Journal of Global 
Mental Health and International Journal of Mental Health Systems; 
funding streams, for example United States National Institute of 
Mental Health’s Office on Disparity and Global Mental Health, Grand 
Challenges Canada, United Kingdoms Department for International 
Development, European Commission, and Wellcome Trust; inter- 
national agencies (in particular, the WHO Department of Mental 
Health and Substance Abuse); and international NGOs such as 
Transcultural Psychosocial Organization and BasicNeeds. 

What distinguishes global mental health from earlier inter- 
national mental health efforts are five particular features—firstly, 
the strengthened emphasis on the development of research-driven 
evidence-based interventions. Commitment to the production of 
evidence-based interventions favours randomized controlled trial 
(RCT) research designs to test innovative interventions, with the 
subsequent goal of implementation and scaling-up. Evidence-based 
practice guidelines, such as the WHO's mhGAP Intervention Guide 
that outlines the treatment of eight priority disorders [22] and the 
Inter-Agency Standing Committee's Guideline for Mental Health and 
Psychosocial Support in Emergency Settings [23], have thus become 
central tools in organizing this new field of practice. 

A second distinguishing feature is the consideration of care de- 
livery in low-resource settings, for example through the training and 
supervision of non-specialist health providers—a strategy referred 
to as task shifting or task sharing [24]. Global mental health pays 
particular attention to the modalities and limitations of service de- 
livery in low-resource settings by using task-sharing models and 
simplified and adapted treatment protocols, with an increasing 
focus on implementation science [25]. 

Thirdly, global mental health is characterized by multi- 
disciplinarity and multi-stakeholder involvement. For example, 
the landmark research agenda-setting article Grand challenges in 
global mental health was based on an extensive international Delphi 
stakeholder consultation process [26]. This commitment to multi- 
disciplinary engagement has built a novel knowledge base, adjacent 
to academic psychiatry, combining the expertise of public health, 
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psychology, health economics, development studies, anthropology, 
and the experience of NGOs. 

Fourthly, global mental health is characterized by a strong em- 
phasis on human rights and the need to strengthen legislation and 
regulatory environments to protect and promote the rights of people 
living with psychosocial disabilities [27]. This has been exemplified 
in the WHO QualityRights initiative, which has developed a tool 
to assess human rights protection in the provision of mental health 
care and campaigned for the reduction of human rights violations, 
particularly against people living with severe mental illness, whether 
in communities or in psychiatric institutions [28]. 

Fifthly, despite efforts to raise the profile of mental health as a spe- 
cific global health priority, global mental health is working towards 
the integration of mental health into other agendas, including sus- 
tainable development, non-communicable diseases, and maternal 
and child health. This includes efforts towards the integration of 
mental health care into primary care [29]. In particular, recent global 
mental health advocacy and research have highlighted its links 
with the development agenda by demonstrating the link between 
mental health and poverty [30] and the economically promising ‘re- 
turn on investment’ of mental health care [31], and by successfully 
advocating for the inclusion of mental health into the sustainable 
development goals. 

In 2016, the World Bank Group and WHO hosted a high-level 
meeting entitled Out of the shadows: making mental health a global 
development priority in Washington DC, which was attended by over 
500 delegates from the research, business, clinical, policy, and donor 
communities. At this meeting, the president of the World Bank Dr 
Jim Kim and the Director General of WHO Dr Margaret Chan com- 
mitted themselves and their organizations to giving greater priority 
to global mental health [32]. 


Epidemiology 


Global burden of mental illness 


A major breakthrough in the measurement of the global burden of 
mental disorders came with the development of the DALY measure, 
first published in the 1993 World Development Report [33]. The 
DALY combines a measure of mortality [years of life lost (YLLs)] 
with a measure of morbidity [years of life lived with disability 
(YLDs)], to assess the burden of all diseases and injuries using the 
metric of time. This allows for comparison of a wide variety of con- 
ditions, from human immunodeficiency virus (HIV)/acquired im- 
mune deficiency syndrome (AIDS) to cancers and to depression, 
with the latest 2015 GBD study covering over 300 diseases and in- 
juries. Further details of the assumptions and methods in the latest 
2015 GBD study can be found elsewhere [34]. From a health policy 
perspective, this has been transformative for the field of mental 
health, because it has moved to the foreground the substantial con- 
tribution of mental disorders to the global burden of disease. 

Data from the latest available 2015 GBD study show that the 
total proportion of the global burden of disease attributable to 
mental, neurological, and substance use disorders has increased 
substantially from approximately 6.5% in 1990 to over 10% in 2015. 
For mental and substance use disorders alone, there was a 14.9% 
increase between 2005 and 2015 [34]. These increases have been 


largely driven by demographic factors, chiefly population growth 
and ageing, with age-standardized DALYs per 100,000 population 
for mental and substance use disorders showing a 0.3% decrease be- 
tween 2005 and 2015. 

There are important gender differences in the burden of mental 
and substance use disorders, with women accounting for more 
DALYs in all disorders, except for childhood mental disorders, sub- 
stance use disorders, epilepsy, and Parkinson's disease, which are 
more prevalent among boys and men. There are also important age 
variations, with the burden of mental, neurological, and substance 
use disorders peaking in young adult years (Fig. 3.1). Fig. 3.1 also 
visually depicts the relative contribution of specific mental and sub- 
stance use disorders to the total burden of all mental and substance 
use disorders. 


Social determinants of mental health 


Along with the growing awareness of the burden of mental illness, 
social epidemiology in the last 30 years has generated new knowledge 
on the social determinants of mental health [35]. Social determinants 
are a wide range of social and economic risk and protective factors that 
influence population mental health across the life course. These social 
determinants include five broad domains: demographic, social, eco- 
nomic, neighbourhood, and environmental events [36]. The demo- 
graphic domain includes the effects of age, gender, and ethnicity on 
mental health. In addition to age and gender differences in the burden 
mental illness, members of minority ethnic groups have been found 
to have an increased risk for psychosis and depression, particularly 
in the context of migration or as a result of experiences of discrimin- 
ation or exclusion [37]. In relation to the social domain, social capital, 
particularly cognitive social capital, has been shown to offer protec- 
tion against common mental disorders such as depression and anxiety 
disorders [38]. Education is important for adolescent well-being, is 
highly protective against later-life depression, and improves cognitive 
reserve that may be protective against dementia [39]. In relation to 
the economic domain, there is robust evidence showing a strong asso- 
ciation between poverty, unemployment, and adverse mental health 
outcomes, including common mental disorders and schizophrenia 
[5, 40]. The relationship between poverty and mental illness is bidir- 
ectional, characterized by social causation and social drift/selection 
[30, 40]. In relation to the neighbourhood domain, the characteristics 
ofa local area or neighbourhood have an important bearing on popu- 
lation mental health, over and above the effects of individual level 
socio-economic deprivation [41]. Finally, environmental events, such 
as natural disasters, industrial disasters, war, civil conflict, forced mi- 
gration, and the effects of climate change, have been shown to have 
a range of negative mental health consequences. ‘These include post- 
traumatic stress disorder (PTSD), depression, and anxiety among 
adults [42], and emotional and behavioural disturbances, sleep diffi- 
culties, and disturbed play in children [43]. 


Mental health policy and services 


Given the major global burden of mental disorders and the powerful 
social determinants that influence the mental health of populations, 
what can governments and civil society do? A key area of global 
mental health research and advocacy since the World Health Report 
2001 has been mental health policy and service development. 
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Fig. 3.1 (see Colour Plate section) Disability-adjusted life years (DALYs) for each mental and substance use disorder in 2010, by age. 


Reproduced from The Lancet, 382(9904), Whiteford HA, Degenhardt L, Rehm J, et al., Global burden of disease attributable to mental and substance use disorders: findings 
from the Global Burden of Disease Study 2010, pp. 1575-86, Copyright (2013), with permission from Elsevier Ltd. 


One of the major WHO recommendations from that report was 
that countries should develop national mental health policies and 
plans, and make the most efficient use of their available resources 
by implementing these policies and plans using cost-effective inter- 
ventions. From 2001 to 2005, the WHO developed guidelines to 
assist countries (particularly LMICs) to develop national mental 
health policies, plans, and programmes. These practical step-by-step 
guidelines are set out in the WHO Mental Health Policy and Service 
Guidance Package [44]. Policies and plans are vital for several 
reasons—a policy provides a roadmap to guide a ministry of health 
and other relevant ministries; without such a policy, governments do 
not have a clear mandate or a mechanism to allocate much needed 
resources; the process of developing a national policy and plan is 
also an important means of developing consensus among a range 
of stakeholders; and a clearly articulated policy and plan with the 
appropriate targets and indicators can facilitate evaluation of the ex- 
tent to which the mental health needs of populations are being met. 

At a global level, these aspirations were subsequently developed 
and adopted in the WHO Global Mental Health Action Plan (2013- 
2020), endorsed by all United Nations member states. The objectives 
of this plan are to: (1) strengthen effective leadership and govern- 
ance for mental health; (2) provide comprehensive, integrated, and 
responsive mental health and social care services in community- 
based settings; (3) implement strategies for promotion and pre- 
vention in mental health; and (4) strengthen information systems, 
evidence, and research for mental health [45]. 

In relation to the delivery of mental health services, the WHO has 
recommended an optimal mix of services, which can be depicted 


in a pyramid (Fig. 3.2) [46]. This emphasizes the promotion of 
self-care, the development of primary care and community-based 
services, the integration of mental health into general health care 
services, and the provision of a small number of specialist mental 
health services. Within this overall framework, the Disease Control 
Priorities initiative has provided a detailed review of the most 
cost-effective interventions, to be delivered through service plat- 
forms, targeting mental health promotion and prevention, treat- 
ment, and rehabilitation of mental, neurological, and substance use 
disorders [47]. 





Intervention research, trials, and innovations 


Much of the intervention research in LMIC settings at the beginning 
of the twenty-first century has focused on treatments for common 
mental disorders, specifically depression, anxiety, and trauma-based 
syndromes. This focus was, in part, due to the high prevalence of 
these disorders in LMIC populations, as well as a growing recog- 
nition of the large treatment gap for accessing any type of mental 
health services outside of urban psychiatric centres [3]. The pre- 
dominant model of service provision has used task-sharing models. 
Evidence is growing that task sharing can be an effective strategy for 
providing mental health services where few mental health profes- 
sionals exist [48], though Padmanathan and De Silva [49] identified 
important barriers to the acceptability and feasibility of task-sharing 
models if sufficient financial resources are not provided to support 
these programmes. 
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Fig. 3.2 Optimal mix of mental health services. 


Reproduced from Mental Health Policy and Service Guidance Package, Copyright (2003) with permission from the World Health Organisation. Available from http:// 


www.who.int/mental_health/policy/services/essentialpackage1v2/en/ 


Numerous randomized trials in LMIC settings across multiple 
continents have demonstrated the effectiveness of evidence-based 
psychotherapies, including cognitive behavioural, interpersonal, 
and cognitive processing therapies, in reducing the burden of 
common mental disorders [48]. Research has also identified the 
utility of using these approaches to improve non-mental health out- 
comes among high-risk populations, with a growing evidence base 
of the feasibility and impact of treating mental health problems 
among HIV-infected populations on HIV treatment adherence [50]. 

Building on task-sharing approaches, intervention research in 
LMIC settings has also investigated collaborative care and stepped 
care models, which integrate task-sharing treatment into primary 
care systems [51]. 

An area of current innovation in intervention research is the 
development of cross-diagnostic and trans-diagnostic treatment 
models. These models are designed to treat multiple common dis- 
orders, rather than being disorder-specific (for example, treatments 
for depression alone). This is an important consideration, as disorder 
comorbidity is common and being able to train providers in a single 
treatment, rather than different treatments for each disorder, can 
improve treatment fidelity, coverage, and quality, particularly for lay 
providers. Some of these approaches, such as Problem Management 
Plus (PM+), have a unified treatment protocol that can be applied to 
individuals with a range of common mental health problems [52]. 


Current evidence for PM+ indicates that it is acceptable and feasible 
to implement [53]; impact results are forthcoming. An alternative to 
a unified treatment approach is the model developed by Murray and 
colleagues—Common Elements Treatment Approach (CETA)— 
that includes a set of treatment components that can be delivered in 
varying combinations to address a range of common mental health 
problems and comorbidities that present within a population [54]. 
Two randomized trials of CETA have shown that this model is ac- 
ceptable, feasible, and effective for reducing the burden of common 
mental health problems, such as depression, anxiety, and post- 
traumatic stress, in LMIC populations [55, 56]. 


Implementation science 


A critical finding from existing global implementation research is 
that, regardless of positive reports of acceptability and feasibility 
by providers and consumers, evidence-based mental health pro- 
gramming is still not being scaled up or sustained [57]. The field 
of implementation science is needed to provide evidence for how 
these interventions can be effectively integrated and sustained into 
existing care systems through identifying and addressing barriers at 
the organizational and policy levels that are impeding implementa- 
tion efforts. 


In LMICs, like in many HICs, a range of organizations provide 
mental health services, including hospitals and primary care clinics, 
NGOs, educational settings, and social service agencies. Primary 
care settings provide an important opportunity for integrating 
mental health care with care for other health conditions under a 
single system. One large research project that has investigated the 
integrating and scaling up of mental health care packages in pri- 
mary and maternal health care contexts in five LMIC settings is the 
Programme for Improving Mental Health Care (PRIME) study [58]. 
One of the first steps taken in PRIME was to develop and implement 
district-level mental health care integration plans in all five country 
sites at four service levels: health service organization, specialist 
mental health services, primary care facilities, and the community 
[59]. While the integration goals, to improve quality and access to 
mental health services, were common across sites, the system ap- 
proaches required to achieve these goals were variable, for example 
in the types of providers used to deliver mental health care (lay and 
community providers vs more highly qualified and specialist mental 
health workers). The identified variations mean that different sys- 
tems approaches will be needed to ensure quality mental health 
services are provided and sustained over time. 

While primary care settings are important for integrating mental 
health care with other health conditions, for many LMIC popula- 
tions, accessing mental health services in primary health care set- 
tings is neither feasible (because of stigma or logistical barriers) nor 
preferred (because mental health problems may not be considered 
to be in the domain of what doctors or health providers treat). NGOs 
and community-based organizations (CBOs) frequently are direct 
providers of mental health services, with some at the forefront of 
investigating how to scale up services through non-health care sys- 
tems. One example is the international NGO Strong Minds which 
has been working to scale up interpersonal psychotherapy (IPT), an 
evidence-based treatment for depression in Uganda [60], through 
local organizations and community groups [61]. 

In addition, it is recognized that the health sector often needs to 
interact with other sectors to improve mental health outcomes—what 
the WHO defines as ‘intersectoral action for health’ [62]. A qualita- 
tive study in South Africa that investigated the challenges and possi- 
bilities for intersectoral care for mental health services noted several 
challenges in this regard, including the lack of systematic strategies 
for delineating care roles and the lack of communication and referral 
systems. These limited the success of the multi-system strategies of 
care, despite the recognition among stakeholders that providing care 
across multiple systems is necessary [63]. 


Qualitative research 


Global mental health’s emphasis on multi-stakeholder engagement 
and the recognition that mental health interventions and instru- 
ments need to be culturally and linguistically adapted have made 
qualitative research an important aspect of piloting, implementing, 
and evaluating mental health interventions. Ethnographic ap- 
proaches have had a strong influence on global mental health, largely 
due to the anthropologist Arthur Kleinman’s seminal work on the 
expression of mental distress and somatization in China and Asia 
[64]. While such long-term, in-depth ethnographic engagement is 
rare nowadays, global mental health draws extensively on qualitative 
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health research to tailor interventions and research tools to local 
contexts and to ensure linguistic and cultural appropriateness [65]. 
A large body of literature has emerged on ‘local idioms of distress’ [66] 
describing local concepts for mental suffering such as ‘heart-mind’ 
problems in Nepal [67] and ‘thinking too much in Haiti (‘Reflechi 
twop’) [68] and Zimbabwe (‘Kufungisisa’) [69], which are similar, 
but not identical, to the category of ‘depression. 

Another area in which qualitative research has been integrated 
into the design of global mental health interventions are studies 
investigating the acceptability and feasibility of mental health inter- 
ventions or barriers to their implementation within specific popu- 
lations. For example, task-sharing approaches have been assessed 
through qualitative research in a great number of contexts [70], 
while other research has explored the barriers to collaboration with 
traditional healers [71]. 

Qualitative research has also become increasingly important for 
implementation science [25]. Multi-country consortia like PRIME 
routinely collect qualitative interview data on the implementation 
process of their interventions to assess health system bottlenecks, 
as well as service user and stakeholder perceptions of the service. 
Qualitative case studies have also been identified as an important 
tool for evaluation, as they allow for the documentation and dis- 
semination of the specific lessons that are learnt on the ground, 
including processes of delivering interventions [72]. 


Instrument development 


In the measurement of mental health problems globally, it is im- 
portant to consider the validity, reliability, and utility of measure- 
ment tools. In general, a valid instrument is one that measures 
the construct it is intended to measure and a reliable instrument 
is one that can consistently measure the construct of interest over 
time (test-retest reliability) and ensure that this consistency does 
not vary, based on who administers the instrument (inter-rater re- 
liability). Utility refers to how practical and useful the instrument is 
for a given problem ina given context. 

The development of valid and reliable tools to assess mental health 
problems across different cultures and contexts has been an area of 
research for some time. One reason for the importance of focusing 
on instrument development is that the nature and meaning of 
mental health symptoms can differ by culture and context [73]. For 
example, in a review of the qualitative literature on the expression of 
depression globally, Haroz and colleagues found that many symp- 
toms of depression in non-Western populations are not captured in 
standard measurement tools based on Western diagnostic criteria 
and that some of the symptoms that frequently appear in standard 
depression measures, such as psychomotor agitation or slowing, 
were infrequently mentioned in non-Western populations [74]. 

To reconcile local characterization of mental health problems 
with the desire to use standard instruments that can allow for cross- 
population comparisons, researchers are now using mixed methods 
approaches that include qualitative methods to identify symptoms 
relevant to local populations. For example, Wilk and Bolton's quali- 
tative study of depressive disorders among HIV-affected Ugandan 
adults found many of the standard symptoms related to depres- 
sive disorders, while also identifying symptoms such as ‘hating the 
world; ‘bad, criminal or reckless behavior, and ‘unappreciative of 
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assistance’ as relevant manifestations of locally defined depression 
[75]. These locally relevant symptoms were added to an existing de- 
pression measure to create a more locally valid measure that could 
still be used to compare the burden of illness with other populations 
[76]. Similar methods have been used, for example, to develop valid 
and reliable measures of post-partum depression among women in 
the Democratic Republic of Congo [77], post-traumatic stress prob- 
lems among youth in Zambia [78], and disruptive behaviour prob- 
lems among youth in Nepal [79]. 

While global mental health researchers have increasingly focused 
on improving reliability and validity of mental health instruments 
across contexts, developing tools that are practical and useful for 
screening and ongoing treatment monitoring and evaluation has 
been more difficult. And as mental health services are integrated in 
other care systems and scaled up for delivery, having useful meas- 
ures becomes increasingly important. In considering utility, two im- 
portant factors are the length of the measure and the usefulness of 
individual items to assess different mental disorders, as well as the se- 
verity of the disorders. One analytic approach that is currently being 
used to generate useful measures is item response theory (IRT). IRT 
is a type of latent variable modelling approach that models the prob- 
ability of a given response as a function of a respondent's underlying 
level of a latent trait. IRT can help to refine scales by identifying well- 
performing items at varying levels of a latent trait, assisting in short- 
ening of a scale for screening or monitoring purposes, preventing 
floor and ceiling effects [80], and identifying where, along a latent 
trait, a scale is underperforming and additional items are needed 
in order to better assess individuals with those levels [81]. Tang 
and colleagues have used this approach to develop a brief depres- 
sion measure for older adults in Hong Kong [82], and Haroz and 
colleagues have used IRT analyses to develop a depression measure 
that more accurately captures the presentation of depression across 
cultures [83]. 


Economic evaluation 


Given the severely resource-constrained environment facing mental 
health systems in most countries, particularly LMICs, it is vital 
that the available resources are used efficiently. Policymakers face 
several difficult economic questions in relation to mental health 
care, namely: (1) what goods and services should a health system 
produce—for example, should these provide care for infectious 
diseases, non-communicable diseases, or mental health, and how 
should these priorities be set?; (2) what is the most efficient and ef- 
fective way of delivering mental health interventions (for example, 
through community-based care, primary care services, or hospitals), 
and who should the key delivery agents be?; and (3) which popula- 
tion groups should be given special attention, for example women, 
children, older adults, or the economically vulnerable? 

A key area of enquiry in global mental health is therefore the eco- 
nomic evaluation of mental health interventions. Economic evalu- 
ation has been defined as ‘the comparative analysis of alternative 
courses of action in terms of both their cost and consequences’ [84]. 
Economic evaluations are essential to assist policymakers to set pri- 
orities among competing health needs [85]. One of the major con- 
cerns of economic evaluation is the relationship between allocated 
inputs (resources) and outcomes (health benefits). An intervention 


is considered technically inefficient if the same, or a greater, health 
benefit could be obtained with less resources. 

The three major methods of conducting economic evaluations 
are cost-effectiveness analysis, cost-utility analysis, and cost- 
benefit analysis. All three methods are interested in measuring costs 
but differ in their measurement of outcomes. In the case of cost- 
effectiveness analysis, the outcome is measured in the appropriate 
(usually clinical) measure to assess the effect of the intervention, for 
example reduction in clinical symptoms or improvement in func- 
tioning. In the case of cost-utility analysis, the outcome is meas- 
ured using a standardized comparable measure of utility such as a 
combination of life years with the quality of life [quality-adjusted 
life years (QALYs)] or DALYs, allowing for comparison of the health 
impacts across a range of interventions and health conditions. In 
the case of a cost-benefit analysis, the outcome is measured in mon- 
etary terms. This latter approach allows policymakers to assess the 
economic gains from a particular set of service costs or the return 
on investment. 

Some examples of economic evaluations in global mental health 
research have been Patel and colleagues’ cost-effectiveness analysis 
of primary care for common mental disorders in Goa in India [86] 
and a task-shifting cognitive behavioural therapy delivered by para- 
professionals for alcohol use in Kenya [87]. 

In a recent publication, to coincide with the historic World Bank/ 
WHO ‘Out of the Shadows’ meeting in April 2016, Dan Chisholm 
and colleagues conducted a global return-on-investment analysis 
for scaling up care for depression and anxiety disorders between 
2015 and 2030. They estimated that for every dollar invested, there 
would be a $2.3 to $3 return on investment when economic benefits 
alone were considered, and a $3.3 to $5.7 return on investment when 
the value of health returns were also included [31]. 





Gaps in the field and critiques 


In the years following The Lancet series on global mental health in 
2007, the field attracted some critical attention from transcultural 
psychiatrists and social scientists who questioned the agenda of this 
emerging field. These criticisms have focused on a number of key 
issues. 

Weak evidence. Critics have drawn attention to the long-standing 
controversies surrounding the DSM classification system, especially 
the cross-cultural validity of psychiatric disease models and the effi- 
cacy of psychotropic drugs that Western psychiatry promotes [88]. 
These critics have argued that the evidence base of psychiatry is too 
weak to be exported to low-resource settings. Existing evidence, they 
argued, has been predominantly collected in HICs and may not be 
applicable to other communities. Instead, local idioms of distress and 
cultural systems of healing should be included in the evidence base 
and outcome measures of global mental health [89]. Furthermore, 
scholars have argued that the production of evidence itself is an un- 
even playing field because RCTs are costly and complex, and require 
a high degree of technical expertise, which excludes poor countries 
from the production of formalized evidence [90]. 

Too medical. Global mental health’s focus on evidence-based 
interventions led some scholars to caution that the distribution of 
medication may, in practice, be prioritized over the development of 
complex and context-sensitive psychosocial interventions [89]. 


Too individualized. Other scholars critiqued that the Western psy- 
chiatric disease model only focuses on the individual, based on the 
assumption of an underlying, universal brain disorder [91]. Instead, 
they argued human emotions and behaviour cannot be understood 
without social context and a meaningful engagement of the commu- 
nity [92] and more attention should be paid to the social determin- 
ants of mental health [93]. 

Social determinants of health. Some have argued that the root 
causes of mental suffering, such as poverty, structural violence, and 
globalization [93], need to be addressed by tackling the upstream 
structural determinants of mental health, such as poverty and vio- 
lence, rather than dedicating most resources to narrower down- 
stream interventions such as mental health treatments [94]. 

Limitations of economic arguments for investing in mental health 
care. The economic argument for mental health care has also been 
challenged for its potential to undermine calls for care, based on 
equity, justice, and the right to social participation [95]. A sole focus 
on ‘return on investment’ may prioritize the treatment of common 
mental disorders at the expense of severe spectrum disorders, which 
require more care and resources [94]. 

Neo-colonial. Some scholars have gone so far as to describe global 
mental health as a neo-colonial imposition of Western psychiatric 
knowledge onto populations in LMICs, driven by the pharma- 
ceutical industry, powerful international institutions, and Western 
psychiatry [96]. Such an imposition of the biomedical model, they 
argue, risks the replacement of religious, spiritual, and communal 
systems of meaning, which may themselves be protective for mental 
health and offer effective strategies of care [91]. 

Insufficient attention to substance abuse. Much of the focus of 
global mental health innovations has been on mental illness, and 
substance abuse has been relatively neglected, despite the evidence 
for the major burden associated with substance use disorders and 
cost-effectiveness interventions. 

Critique of the critique. In the aftermath of some of these vocif- 
erous critiques, a set of counter-arguments have emerged, including 
an analysis of the assumptions underlying the polarized debates be- 
tween ‘global’ vs ‘local’ conceptions of mental health and highlighting 
the need and potential avenues of integrating both perspectives [97]. 

Although the above critiques present an important challenge to 
the field of global mental health, many who work in the field are 
acutely aware of these issues. For example, the issue of cross-cultural 
validity is an ongoing challenge and many of the leaders in global 
mental health have focused their research attention on trying to 
understand local idioms of distress and develop culturally valid and 
reliable instruments [69]. Similarly, global mental health as a field 
has focused a good deal of its attention on understanding the social 
determinants of mental health and developing interventions that 
can address these determinants [30]. 


Future directions 


As a young, rapidly expanding, and dynamic field, global mental 
health can be said to have ‘come of age [98]. Nevertheless, many 
challenges remain. In addition to the above critiques, there are 
ongoing challenges of inadequate policy priority and resources for 
mental health, particularly in LMICs. There is also a challenging 
geopolitical environment, with rising income inequality, ongoing 
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violence, and forced migration in the Middle East and sub-Saharan 
Africa, and the growing reality of climate change—all of which carry 
major implications for global mental health. 

These challenges require the field of global mental health to ex- 
pand its focus from the earlier calls to scale up mental health care 
[21] towards interventions that promote population mental heath 
and prevent mental illness. Given the powerful effects of the social 
determinants of mental health, it is unlikely that the global burden of 
mental disorders can be addressed through treatment alone. This re- 
quires a more refined understanding of the mechanisms of the social 
determinants of mental health, for example through larger longitu- 
dinal studies, particularly in LMICs. It also requires a greater inte- 
gration of mental health and well-being into the broader sustainable 
development agenda of 2015-2030 [6]. 

The field also requires a greater convergence of approaches, 
bringing together the fields of genetics and epigenetics with those 
of social epidemiology and anthropology, to gain a greater under- 
standing of gene-environment interactions and opportunities for 
interventions across the life course. This implies the need for a more 
nuanced understanding of the local and the global, and which inter- 
ventions work for whom, under what circumstances. In this process, 
greater partnerships are required between people with lived experi- 
ence of mental illness, their families, clinicians, and researchers. 

Finally, the field needs to move away from categorical classifica- 
tions of mental disorders towards a more dimensional understanding 
of mental health, on a continuum from disability to well-being. 
This requires a staged approach to detection and treatment using 
trans-diagnostic intervention models, with health systems having 
the capacity to detect problems early and provide care and support 
in an empowering and collaborative manner. These challenges re- 
main compelling and have managed to galvanize a significant level 
of international collaboration, funding, and advocacy, to constitute 
global mental health as a vibrant, dynamic, and growing field. 
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In 1918, Emil Kraepelin wrote [2]: 


‘A hundred years ago, there were practically no alienists. The care 
of the mental patients was nearly everywhere in the hands of head 
supervisors, attendants and administrators of the houses for the 
mentally ill and the role of the physicians was limited to the treat- 
ment of the physical illnesses of the patients.” 


He pointed out that, in the first decades of the nineteenth century, 
many of the books dealing with psychiatric themes were still written 
by medical doctors, such as Reil (who coined the word psychiatry), 
who had few contacts with mental patients or even by philosophers 
and theologians, and that only in the great scientific centres had spe- 
cialists appeared ‘who had decided to spend their life in the study 
and treatment of mental diseases. 

The history of psychiatry as a medical specialty has to be distin- 
guished from the history of psychiatric medical knowledge which 
began in ancient Greece with the birth of medicine as a science. For 
more than 2000 years, only physicians observed and treated mental 
illnesses, and institutions were created in which the ‘lunatics’ and the 
‘insane’ were received. But, as rightly pointed out by Kraepelin, the 
truth is that psychiatry was not really a medical specialty. One can 
argue about the precise date of the appearance of psychiatry as a spe- 
cific field of medicine and of the psychiatrist as a specialist, devoting 
his professional competence exclusively to the care of the mentally 
ill. Denis Leigh recognizes that ‘some degree of specialization oc- 
curred [in England] among respectable physicians’ in the middle of 
the eighteenth century when the monopoly of Bethlem was broken 
and new ‘lunatic hospitals, such as St Luke’s, were opened [2]. On 
the other hand, the American historian Jan Goldstein stresses that in 
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France the language, as an exact reflection of the underlying reality, 
began to use expressions such as homme spécial to describe a phys- 
ician specializing in a branch of medicine such as psychiatry only 
around 1830 [3]. 





Despite those divergences, it is generally accepted that the work 
of Philippe Pinel constitutes a turning point. His role has several 
aspects. He is known worldwide as the physician who ‘liberated 
the insane from their chains’ in a dramatic initiative he started in 
1793, at the height of the French revolution, at the Bicétre asylum 
and completed 3 years later at the Salpêtrière asylum. However, the 
reality is more complex. 

Pinel, who was born in 1745, had studied medicine, translated 
Cullen’s books into French, and published scientific papers on various 
subjects. He acted as a physician in a small Parisian ‘madhouse —the 
Pension Belhomme, in which wealthy lunatics were confined at the 
request of their families. At that time, most of the Parisian insanes 
were confined for a few weeks in the general hospital—the Hotel 
Dieu. If their state did not rapidly improve, they were considered as 
incurable and sent to Bicétre or the Salpétriére, built a century be- 
fore, which also received other social deviants like beggars and pros- 
titutes. Pinel, who was known by his politically influential friends for 
his progressive scientific ideas, was appointed physician to Bicétre. 
The division for the insane was under the direction of an overseer 
(surveillant)—Pussin, who had already introduced humanitarian 
reforms in the care of the patients. Pinel’s merit was to approve and 
systematically develop Pussin’s empirical measures and to propose 
an explicit scientific theory for their mode of action. Inspired by 
Crichton’s views about the nature of the ‘passions’ by Condillac’s 
psychology and by the ideas of Jean-Jacques Rousseau, he created 
the traitement moral, which he claimed to be effective with patients 
previously considered as incurably ill. 

The improvement of the conditions in which the insane were 
cared for, supported and expanded by Pinel, was not an isolated 
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French phenomenon. In Tuscany, Chiarugi in 1789 had already as- 
serted that the basis of the extensive reforms he had introduced in 
the local asylum for the insane was that ‘it is a supreme moral duty 
and a medical obligation to respect the mental patient as a person. 
In England, where the public had been shocked by the inhumane 
treatment to which King George III had been submitted during his 
mental illness, and where a pious Quaker—William Tuke—deeply 
affected by the conditions in which the wife of a member of the 
Society of Friends had died in York lunatic asylum, decided to set 
up a special institution under the government of the Friends ‘for 
the care and accommodation of their own members. At the Retreat, 
opened in 1796 near York, physical restraints were largely abolished, 
and religious and moral values were emphasized in relations with 
the patients. 

Chiarurgi’s reforms did not survive the upheavals caused by sub- 
sequent wars and the political divisions of Italy, and Tuke’s creation 
of the Retreat had not been prompted by medical considerations 
but was the expression of religious humanitarian purposes. The role 
played by Pinel was decisive, not so much because of the changes 
he promoted in the conditions of the patients, although they had a 
profound influence, but because he made the study and treatment of 
mental disorders a branch of medicine. 

In 1801, Pinel published the Medico-philosophical Treatise on 
Mental Alienation. In it, he presented the various clinical mani- 
festations he had observed, proposed a simple nosological system 
largely borrowed from older authors, examined possible aetiological 
factors, and described his ‘moral treatment’ in detail. The book has 
remained a landmark in the history of psychiatry, even being con- 
sidered by the philosopher Hegel as a ‘moment of capital importance 
in the history of humanity. For Pinel, insanity was a disease, and 
the patient affected by it remained, despite the loss of his reason, a 
human being. Its study, like the rest of medicine, had to be ‘a science 
which consists of carefully observed facts: Goldstein [3] has shown 
that Pinel’s main preoccupation was to prove this scientific nature 
of the new medical specialty by repudiating the previous practices 
of the ‘empirics’ and ‘charlatans’--the two terms being practically 
synonymous. He had accepted the method Pussin had developed 
empirically and transformed it in his moral treatment by providing 
a scientific theory of its mode of action. A curiously premonitory 
aspect of his emphasis on the necessity of a scientific methodology is 
to be found in his Tables to Determine How Probable is the Curability 
of Alienation, published in 1808. He provided statistical data on the 
efficacy of his therapeutic method according to the types of mental 
disorders and in comparison with spontaneous evolution, and con- 
cluded that medicine can only be a true science through the use of 
the calculus of probability! 


Psychiatry as a profession: Esquirol and 


the clinical approach 





If, because of the international influence of the ideas expressed in 
his book, Pinel is the founder of psychiatry as a medical discipline, 
he was not a psychiatric specialist in the strict meaning of the term. 
Although he retained his position at the Salpétriére until his death in 
1826 and is known today for his contributions to mental medicine, 
he had many other medical interests which gave him, in his time, 


a leading position among the Paris physicians; his Philosophical 
Nosology, published in 1796 and a classical reference for several 
decades, deals with general pathology. The case of his pupil and 
successor Esquirol, who became the prototype of the psychiatric 
specialist, was very different. At the Salpêtrière, he was only in 
charge of the ‘section of the insane. He was later appointed medical 
director of the Charenton psychiatric asylum near Paris and owned, 
in addition, a small clinic, in which he treated his private patients. 
All his activities were exclusively dedicated to the study and treat- 
ment of mental disorders and the teaching of psychiatry. His book 
On Mental Diseases, published in 1838, in which he collected his 
previous publications, acquired fame as great as Pinel’s Treatise. In 
1913, Karl Jaspers recognized that the later great representatives of 
German psychiatry, such as Griesinger and Kraepelin, were strongly 
indebted to Esquirol. He, and the school he founded, effectively de- 
veloped one of the basic tenets of the new medical specialty. For 
Esquirol, careful objective observation and analysis of the symp- 
toms and behaviour of the patients were fundamental. He origin- 
ated the descriptive clinical approach expanded by his pupils. Even 
more than Pinel, he was suspicious of unproved theories, and when 
he eventually suggested relations between pathogenic factors and 
syndromes, he remained extremely cautious in his interpretations. 
Zilboorg, the psychoanalytically oriented historian of psychiatry, 
has accused this predominantly descriptive approach of creating 
‘psychiatry without psychology’ because, lacking psychodynamic 
concepts, its attempted objectivity remained at an allegedly super- 
ficial level [4]. The truth is that it laid the foundations of the present 
description of the mental disorders. The ‘atheoretical’ descriptive 
approach adopted in the present nosological systems—both the 
American Diagnostic and Statistical Manual and the International 
Classification of Diseases—of which the proclaimed purpose is to 
emphasize the medical character of psychiatry is, in this respect, a 
return to Esquirol’s principles. 


The social aspects of psychiatry and 


the asylum system 





By the end of the eighteenth century it was recognized that the study 
of mental alienation was part of medicine. However, mental diseases 
were of such a nature that it was not possible to treat the insane in 
the same conditions as patients affected by other diseases. Their most 
obvious manifestations had social consequences. According to the 
prevailing philosophical view, the mentally ill were deprived of free 
will by their illness. In practice, they were unable to participate in 
the normal life of society and were often considered as potentially 
dangerous. Because of this, they had generally been confined in mad- 
houses of various kinds. One of the aspects of the reforms initiated by 
Pinel had been to make more explicit the difference in nature between 
the socially deviant behaviour of the insane, which, being the conse- 
quence of an illness, belonged exclusively to medicine, and the other 
deviations which society had to control and eventually to repress. The 
implementation of this fundamental distinction during the first half 
of the nineteenth century helped to give psychiatry its specific shape 
as a profession by being at the origin of forensic psychiatry and by 
leading to the formulation of precise rules concerning the commit- 
ment of the insane to institutions of a strictly medical character. 


The legal code promulgated by Napoleon in 1810 stipulated that 
‘no crime or delict exists if committed in a state of dementia, with 
the old term dementia being used as a synonym of Pinel’s mental 
alienation. This legal provision, introduced in similar forms in 
other countries, opened an important domain of activity to the 
medical profession of the psychiatrist. Because of their now recog- 
nized specialized knowledge, the alienists were to help the judges in 
determining whether the mental state of an individual convicted of 
a ‘crime or delict’ was normal or pathological, with decisive conse- 
quences on the subsequent decision. The title of Esquirol’s Treatise 
mentions explicitly that it describes mental diseases ‘in their med- 
ical, hygienic and medico-legal aspects. The conflict (which still 
exists) between the judges, usually supported by public opinion, 
who took a restrictive view of the concept of mental disease, and the 
psychiatrists, who tended to expand it to include new types of de- 
viant behaviour, is illustrated by the violent controversies provoked 
by Esquirol’s description of ‘homicidal monomania. They had an 
even more famous counterpart in England. J.C. Pritchard, an ad- 
mirer of Esquirol, had isolated ‘moral insanity’ as a specific mental 
disorder in two books published in 1837 and 1842; in the second, 
he examined its ‘relations to jurisprudence. Half a century later, in 
1897, Henry Maudsley, who was in favour of the use of this diag- 
nosis, recognized that this category, although internationally ac- 
cepted by the psychiatrists, corresponded to: 


“.,.a form of mental alienation which has so much the look of vice 
and crime that may persons regard it as an unfounded medical in- 
vention. Judges have repeatedly denounced it from the bench as a 
“most dangerous medical doctrine’, “a dangerous innovation” which, 
in the interest of society, should be reprobated’ 


Henry Maudsley, 1897. 


The general acceptance of the new medical concept of mental 
alienation implied the existence of adequate facilities for the treat- 
ment of the patients. The creation of new asylums—the term was 
retained—and the reorganization of the old ones were the answers. 
The French law of 1838 that fixed the detailed rules for the expan- 
sion of the new system to the whole country and for its functioning 
and financial support had a model character. Similar results were 
obtained in, for example, England with the Asylum Act 1828 and 
the Lunacy Act 1845. Outwardly, the new system was an extension, 
under more humane conditions, of the previous institutional prac- 
tices. However, it had radically original features. While recognizing 
the necessity of protecting society, it stressed the fact that the insane 
had a fundamental right to be protected and medically treated in a 
competent way. The deprivation of liberty for the patients, which it 
still implied, was strictly controlled to prevent possible misuse and 
was anyway justified, according to Esquirol and most contemporary 
psychiatrists, not only by the loss of free will, which was a conse- 
quence of the illness, but also by the therapeutic value of separation 
from a pathogenic milieu. 

The asylum system became the central element of psychiatric 
care and was both the consequence and the determining factor of 
the emergence of psychiatry as a medical specialty to which it gave, 
until the end of the nineteenth century and even beyond, an original 
character. The asylums acquired quasi-monopoly in the care of the 
mentally ill. The few private institutions reserved for the wealthier 
members of the population, which often belonged to alienists in 
charge of the asylum, were generally submitted to the same legal 


CHAPTER 4 The history of psychiatry as a medical specialty 


rules. Private practice with ambulatory patients, as existing today, 
was exceptional or dealt with cases which were not then considered 
to belong to mental alienation. As a result, the study of mental illness 
was predominantly restricted to the more severe forms of disorder. 
Another consequence was that the alienists in charge of patients 
committed to the asylums had a dual function, a fact that differ- 
entiated them from other hospital physicians. In addition to their 
medical duties, they were involved in legal procedures which deter- 
mined the conditions of admission, stay, and eventually release of 
the mentally ill. As superintendents, they also often had economic 
and financial responsibilities, being in charge of the material, as well 
as the medical aspects of the functioning of their institutions. 
Despite the fact that the laws now strictly differentiated the na- 
ture of the limitations of liberty in asylums and in prisons, the par- 
ticipation of the alienist in a form of social control was eventually 
perceived negatively by the public, and often by other physicians, 
and contributed to accentuating the specificity of psychiatry inside 
medicine. During the third and fourth decades of the nineteenth cen- 
tury, which saw the birth of the asylum system, psychiatrists became 
really conscious of their identity as a professional group. In England, 
France, Germany, and the United States, they founded societies and 
began to publish journals with specialized scientific goals. Such a de- 
scription oversimplifies an evolution which was progressive and, in 
some cases, took different directions. The creation and extension of 
the asylum system took many years; it did not reach its classical form 
until the last part of the century, as testified by the famous campaign 
conducted in the United States during the 1840s by Dorothea Dix 
who complained that many of the mentally ill were still incarcerated 
in almshouses and prisons. The moral treatment practised in the 
institutions was eventually used to justify brutal measures, alleged 
to be therapeutic, and the behaviour of the attendants, who were 
not usually medically trained (significantly, they were known as 
surveillants in France), was too often of a purely repressive character. 
It was a long time before the proposals made in 1856 by the British 
psychiatrist John Conolly in his book The Treatment of the Insane 
Without Mechanical Restraints were put into practice everywhere. 


The biological and the psychological model 


The clinical orientation of Pinel, Esquirol, and their followers was 
basically empirical. By concentrating on describing observable 
symptoms and abnormal behaviours, it avoided theoretical contro- 
versies. However, many believed that if psychiatry was to become 
a branch of the medical sciences and to progress, it had to adopt 
models similar to those accepted by the rest of medicine. According 
to the anatomoclinical perspective, which was now dominant, dis- 
eases were distinct entities. Each disease was defined by a charac- 
teristic pattern of symptoms provoked by a lesion or eventually a 
dysfunction of an organ to be discovered at autopsy. In 1821, Bayle, 
following this scheme, described the typical clinical symptoms and 
lesions of the brain in the general paralysis of the insane. Despite the 
disappointing results of further anatomopathological studies (brain 
lesions were observed in only a small proportion of cases), there 
was increasing conviction that, with better investigation methods, 
mental disorders, like other diseases, could be explained by somatic 
causes. The degeneration theory, proposed in 1857 by Morel, which 
attributed many forms of insanity to the hereditary transmission of 
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dysfunction of the nervous system produced by the noxious effects of 
environmental factors, and whose influence lasted until Kraepelin, 
is another expression of this biological orientation, the aim of which 
was to give psychiatry an undisputed medical status. 

The biological and the purely clinical approaches were con- 
cerned with different conceptual levels—the discovery of the causes 
of insanity and the description of its manifestations, respectively. 
Therefore, they could easily coexist. Even when the followers of 
Pinel and Esquirol expressed reservations about the applicability 
of the biological model to every type of mental disorder, they still 
believed in the medical nature of psychiatry. The situation created 
in the German-speaking countries by the school of the ‘mentalists’ 
(the term Psychiker by which they were known means ‘psychologic- 
ally oriented’), who were predominant during the first half of the 
nineteenth century, was very different. Influenced by philosoph- 
ical, religious, and romantic trends, these psychiatrists took a rad- 
ical dualistic position, postulating the absolute difference between 
the physical body and the spiritual soul. The soul was the source of 
the whole psychic life and hence eventually of its abnormal aspect— 
insanity. A term such as disease, appropriate for the somatic illness, 
could only be used metaphorically in psychiatry. The sins of the pa- 
tients were the origin of the mental disorders, and psychiatry be- 
longed more to moral philosophy than to medicine. These ideas were 
developed in various related forms by the majority of the German 
psychiatrists of the period (Heinroth, Ideler, Langerman, and 
many others). Their ideological position had two consequences— 
scientific relations with other schools, such as the French and the 
English who saw in the publications of the mentalists obscure philo- 
sophical theories devoid of medical character, were largely cut off; 
and they provoked a violent reaction in Germany itself. The most 
extreme representatives of the contending group of ‘somatists’ 
(Somatiker), such as Jakobi and Friedreich, saw the mental disorders 
as symptoms of somatic diseases, not necessarily of the brain. In fact, 
for them, mental diseases as such did not exist. They defended ag- 
gressively their biological and sometimes bizarre hypotheses, such 
as the aetiological role of intestinal worms, against the mentalists. 
Finally, around 1850, they gained the upper hand. The publication 
in 1845 of Pathology and Therapy of the Nervous Diseases by Wilhelm 
Griesinger, an heir to their school who was also influenced by the 
French alienists, is a landmark in the history of German psychiatry. 
With his appointment in 1865 as professor of psychiatry in Berlin, 
where he succeeded the mentalist Ideler, medical psychiatry was def- 
initely established in Germany as a branch of the natural sciences. 


The rise of neuropsychiatry 


Romberg’s Lehrbuch der Nervenkrankheiten symbolizes the birth 
of neurology as an autonomous medical specialty studying and 
treating the diseases of the nervous system. It was published 5 years 
after Griesinger’s Textbook in which, adopting and expanding Bayle’s 
anatomoclinical model, he had affirmed: ‘Mental diseases are dis- 
eases of the brain. If both psychiatric and neurological symptoms 
originated in the nervous system, some form of association between 
the two specialties was a logical step, at least at the conceptual level. 
One aspect of their complex relationship was the creation of neuro- 
psychiatry which developed its most characteristic aspects in the 
German-speaking countries. 


Universities acquired considerable power and influence in the 
second half of the nineteenth century. From the 1850s on, chairs 
were created for the teaching of the new common discipline, and 
special institutions—university clinics—were built with hospital 
beds for psychiatric patients (if their disorders became chronic, they 
were sent to the nearest asylum), and laboratories for research on 
neurophysiology and neuroanatomy and special wards for neuro- 
logical cases were developed. Griesinger’s first move when he took 
over the chair of psychiatry at Berlin was the creation of neurological 
wards at the Charité. The leading neuropsychiatrists in charge of 
these institutions often performed research in both fields with equal 
competence, as shown by the work of Wernicke and Westphal, and 
later of Kleist and Bonh6ffer in Germany and of Meynert in Austria. 

The concept of neuropsychiatry, appearing at a period during 
which the German school was progressively gaining influence, had 
a deep impact on psychiatric thought and the psychiatric profes- 
sion, even if its institutional driving force—the university clinic 
system—was not developed everywhere to the same extent as in 
Germany. For example, it was conspicuously absent in England, 
despite the fact that the theoretical position taken by the most im- 
portant psychiatrist of the time Henry Maudsley was very close 
to that of Griesinger. The National Hospital in Queens Square, 
London, founded in 1860, retained a virtual monopoly on the 
teaching of neurology for many decades, and psychiatry, taught es- 
sentially in hospitals, was not represented at university level until 
the 1930s. However, in most countries, neuropsychiatric institu- 
tions coexisted with asylums where the alienists had the unenviable 
task of caring for chronic mental patients, often with inadequate 
means. The concept of neuropsychiatry reflected a basically bio- 
logical perspective on the aetiology of the mental illnesses, ex- 
pressed in the creation of a new specialty associating competence 
in the two previously separated domains of medicine. However, it 
provoked ideological and professional tension between the ‘pure’ 
psychiatrists, mainly those in charge of asylums, and the neuro- 
psychiatrists, predominantly involved in teaching and research. 
In the long term, this conflict was one of the factors which finally 
led, in the 1960s, to the almost complete administrative and institu- 
tional separation of the two specialties in countries such as France 
where they had been, at least formally, associated. But many traces 
of the old situation remain. The most influential scientific journal 
published in German Nervenarzt still deals equally with neurology 
and psychiatry, and the term ‘neuropsychiatric’ survives in the titles 
of many teaching and research institutions. 


The neuroses and birth of the psychotherapies 





The study of the neuroses, in which the relation between psych- 
iatry and neurology was also involved, resulted in completely dif- 
ferent, but equally important, changes to psychiatry as a medical 
specialty. The term neurosis had been coined in 1769 by Cullen to 
describe a class of diseases he attributed to a dysfunction of the ner- 
vous system. In this very heterogenous group, two entities of very 
ancient origin—hysteria and hypochondriasis—had predomin- 
antly psychological manifestations. Since the affected patients were 
not usually commited to asylums, they were not normally studied 
by alienists, but by specialists in internal medicine such as Briquet, 
who, in 1859, wrote the classical Treatise on Hysteria. Because of the 


assumed nature of the neuroses, the new discipline of neurology 
rapidly took an interest in them. 

Charcot, the founder of the French neurological school, was re- 
sponsible for the internal medicine wards at the Salpétriére—they 
were not associated with the ‘divisions of the insane’ at the same hos- 
pital, the domain of the alienists. In about 1880, he became inter- 
ested in hysterical patients who, because of their seizures, were 
admitted to the same ward as the epileptics. He developed a purely 
neurological theory of the disease, which he described and studied 
using hypnosis. This was the former ‘animal magnetism, long fallen 
into disrepute, but to which he gave a new scientific status. Charcot'’s 
descriptions of the grande hystérie, which he demonstrated on 
selected patients in his famous public lectures, were justly criticized 
later, but his international fame attracted students from all over the 
world. One of them was a young lecturer in neuropathology at the 
University of Vienna—Sigmund Freud, who, impressed by Charcot'’s 
lectures, decided to devote all his energies to the study and treat- 
ment of the neuroses. Another was a French professor of philosophy 
(psychology was then a branch of philosophy)—Pierre Janet, who 
had become interested in the psychological aspects of the neur- 
oses. He was later to develop, in parallel with Freud, a psychopatho- 
logical theory which, despite the traces it has left (the concepts of 
psychasthenia and the dissociative processes in hysteria), was not to 
be as internationally successful as Freud’s psychoanalysis. Charcot’s 
ideas were opposed by Bernheim, the professor of internal medi- 
cine at the Nancy Medical School and also an adept of hypnosis. 
He attacked the neurological interpretations of the Salpétriére and 
claimed that suggestion played a central role in the phenomena de- 
scribed by Charcot. 

The general interest in the neuroses, which extended beyond 
medicine to fin de siécle literature, was an international phenom- 
enon. In 1880, Beard, an American neurologist, described a new 
neurosis—neurasthenia, which soon aroused even more interest 
than Charcot’s hysteria. Psychiatry had played almost no part in this 
evolution, but this was to change under the influence of three related 
developments: the changes which took place within the concept of 
neurosis, the birth of the psychotherapies, and the incorporation in 
the field of psychiatry of psychopathological manifestations, even if 
they were of minor intensity. 

The transformation of the concept of neurosis is apparent in the 
position taken by Kraepelin in the 1904 edition of his Textbook. 
He introduced a chapter called ’The psychogenic neuroses; on the 
grounds that ‘among the neuroses, to which belong epilepsy and 
chorea, one must isolate a sub-group characterized by the purely 
psychological cause of the apparition of the symptoms’ The disin- 
tegration of the old concept left to neurology, which, from now on, 
abandoned the generic term, diseases (such as epilepsy and chorea) 
whose somatic manifestations could be shown to express a dysfunc- 
tion of a precise part of the nervous system. Psychiatry took charge 
of hysteria, hypochondriasis, neurasthenia, and the related phobic, 
obsessional, and anxious disorders, which constituted the new neur- 
oses. This concept was justified by the psychological nature of the 
symptoms and the causes recognized even by a biologically oriented 
psychiatrist such as Kraepelin. This redrawing of the frontier be- 
tween the neurological and psychiatric specialties also testified to 
the extension of the limits of psychiatry. Pinel’s insanity, until then 
defined by the necessity of commitment to special institutions, was 
replaced by a broader concept. A new class corresponding to our 
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present personality disorders had already appeared in the 1894 edi- 
tion of Kraepelins Textbook. It had been isolated for the first time 
in 1872-1874 by the psychiatrist Koch. Like the neuroses, the cases 
were rarely observed in asylums, but nevertheless, they were now 
considered as belonging to the psychiatric field of study. 

This field was further modified by the birth of the psychotherapies. 
In fact, they had a long history. In 1803, one of the first German 
mentalists Reil had described, under the name of ‘psychic therapy’ 
(psychische Curmethode), a number of procedures, including very 
violent somatic ones, which could influence the ‘perturbed pas- 
sions of the soul; and Pinel’s moral therapy contained psychothera- 
peutic elements. However, psychotherapies, as techniques of which 
the formal rules were based on an explicit theory about their psy- 
chological mechanisms of action, derived mainly from Mesmer’s 
animal magnetism, as rehabilitated by Charcot. The emergence of 
the psychotherapies, characteristic of the last decades of the nine- 
teenth century, was intimately related to the renewed study of the 
neuroses. After he had abandoned hypnosis, Freud developed psy- 
choanalysis, but many other techniques evolved during the same 
period, which were as well, or even better, known at the time, al- 
though they were to have a less lasting success. One of these was 
the method of Janet, who still occasionally used hypnosis. In 1904, 
Dubois, a Swiss neuropathologist from Bern, introduced a technique 
influenced by Bernheim’s theory of suggestion in The Psychoneuroses 
and their Moral Treatment, and claimed to produce ‘psychological 
re-education’ by a combination of rational and persuasive elements. 
His international reputation brought him patients from all over the 
world. The ‘rest cure, proposed in 1877 by the American neurologist 
S. Weir Mitchell for the treatment of hysteria and later of neuras- 
thenia, was combined with Dubois’ method by Dejerine, Charcot’s 
successor as professor of neurology in Paris. 

This very incomplete summary illustrates the striking fact 
that, because of their intimate connections with the neuroses, 
psychotherapies originated inside neurology. When the study and 
treatment of the neuroses were incorporated into psychiatry, psychi- 
atrists considered that they were an integral part of their activity and 
tried to retain the monopoly of their practice. They never completely 
succeeded. Already Freud had, according to his biographer Jones, 
‘warmly welcomed the incursion in the therapeutic field of suitable 
people from another walk of life than medicine. The problem of ‘lay 
analysts, a source of conflict within the psychoanalytic movement, 
is only an aspect of a broader question which was later to involve the 
relations of the medical specialty of psychiatry with the new profes- 
sional group of clinical psychologists. 


From the beginning of the twentieth century to 


the Second World War 





During the first half of the twentieth century, psychiatry developed 
in many directions. Kraepelins monumental synthesis [5] estab- 
lished in around 1900 a nosological system which, in its broad out- 
lines, has remained valid until today. Without being radically altered, 
it was completed to mention only a few contributions, in 1911, by 
Bleuler’s description of schizophrenia and in 1913, by Jaspers’ psy- 
chopathological perspective, developed by the Heidelberg school 
and Kurt Schneider and by other psychiatrists working in academic 
institutions. However, the old conflict between the ‘mentalists’ and 
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the ‘somatists’ reappeared in a modified form. The mainstream 
of psychiatry had abandoned the extreme positions of the ‘brain 
pathologists’ of the Meynert-Wernicke type but, while recognizing 
a limited influence of psychological factors, admitted in a general 
way the biological origin of the more severe mental disorders—the 
psychoses. The empirical discoveries of biological treatments—of 
general paralysis by malaria therapy (Wagner von Jauregg in 1917), 
of schizophrenia by insulin coma (Sakel in 1933) or by chemically 
induced seizures (von Meduna in 1935), and of depression by elec- 
troconvulsive therapy (Cerletti in 1938)—not only helped to dispel 
the prevailing therapeutic pessimism, but also provided supporting 
arguments. However, an opposing ideological current represented 
by psychoanalysis had arisen from the study of the neuroses. Its 
attention was concentrated on the study of complex psychopatho- 
logical mechanisms postulated to be at the origin of the neurotic, 
and later also of the psychotic, symptoms, favoured psychogenetic 
aetiological theories, and advocated psychotherapy as the funda- 
mental form of treatment. Psychoanalysis expanded steadily during 
this period and gained enthusiastic adherents in many countries. 
However, partly because of the suspicion, and even hostility, of 
many members of the psychiatric establishment, they remained 
isolated in close-knit groups, with their own teaching system in- 
dependent of the official medical curriculum, and the use of their 
therapeutic technique was restricted to a small number of mostly 
neurotic patients seen in outpatient clinics or, more often, in pri- 
vate practice. 

The great majority of patients suffering from mental disorders 
were still confined in asylums, and the enormous increase in their 
number, mainly related to the social changes accompanying indus- 
trialization and urbanization, although other factors have been in- 
voked, was striking. In Great Britain, it grew from 16,000 in 1860 
to 98,000 in 1910, three times more rapidly than the population. 
A similar phenomenon was observed in all countries and persisted 
until the end of the 1940s, despite the introduction of the first bio- 
logical therapies. In the United States, there were already 188,000 
patients in mental hospitals in 1910, and by the end of the Second 
World War, 850,000 were lodged in huge institutions which were 
overpopulated and understaffed, and could only provide custodial 
care. This obvious degeneracy of the asylum system, contrasting 
with the progresses in the scientific field, stimulated efforts to im- 
prove the practice of psychiatry and its institutional framework. 
Most of these improvements took place after 1920 and, although 
their results remained relatively limited, they were the forerunners 
of later more drastic changes. 

The education of psychiatric specialists, which had varied widely 
from country to country, was improved and systematized. A conver- 
gence of evolution is apparent during this period which can be said, 
to some extent, to have seen the formal administrative recognition 
of psychiatry as a medical specialty. Educational programmes and 
controls of the level of competence were introduced, which extended 
beyond psychiatrists in academic positions. Limited teaching of 
psychiatry became compulsory, even in the general medical curric- 
ulum. In France, psychiatrists for public asylums and, in some cases, 
residents in psychiatry were selected by a competitive examination 
system. In England, the Board of Control recommended in 1918 that 
a leading position in a psychiatric institution could only be occu- 
pied by a physician who had obtained a Diploma in Psychological 
Medicine awarded by the Royal College of Physicians and by five 


universities. In the United States, the moving force behind the re- 
forms was Adolf Meyer, the Director of Henry Phipps Clinic at 
Johns Hopkins University from 1913 to 1939, who organized a sys- 
tematic residency system and promoted the creation of the Board of 
Neurology and Psychiatry. This Board was established in 1936 and 
awarded a diploma ,which it became necessary to hold, to be recog- 
nized as a specialist. 

The changes were reflected in the vocabulary. The term psych- 
iatry, originating in the German-speaking countries and mostly 
used there, was adopted everywhere at the beginning of the cen- 
tury. In France, the health authorities officially substituted ‘hôpital 
psychiatrique for ‘asile daliénés’ and ‘psychiatre for “aliéniste’ in the 
1930s. In England, a Royal Commission used the words ‘hospital; 
‘nurse, and ‘patient, instead of ‘asylum; ‘attendant; and ‘lunatic; for 
the first time between 1924 and 1926. However, efforts were also 
made to dissociate, when possible, the social protection function 
of the institutions from their medical role by allowing them to 
admit patients under the same conditions as the general hospitals. 
In 1923, a special section was created in the Paris Sainte-Anne 
asylum which provided treatment to voluntary patients and had 
both hospital beds and a large outpatient department. In England, 
the Mental Health Act of 1930 made voluntary admissions to psy- 
chiatric hospitals possible; by 1938, they already constituted 35% of 
all admissions. 

Social considerations had always been evident in psychiatry, but 
their traditional expressions had mainly been of a negative nature, 
that is the confinement of patients in asylums. The new possibility 
of free admissions reflected an increase in tolerance towards the 
disturbing character of mental illness. At the same time, a differ- 
ently oriented and broader social perspective appeared. The concept 
of mental hygiene originated in the United States in 1919 with the 
creation by a former patient Clifford Beers of an organization whose 
internationally growing influence was manifested by well-attended 
congresses held in Washington in 1930 and in Paris in 1937. From 
its beginning, the movement was not purely medical and was influ- 
enced by various humanitarian philosophical trends. It emphasized 
the role of social factors, such as living conditions or educational 
practices, in the origin of mental disturbances and promoted their 
prevention and treatment by the close co-operation of psychiatrists 
and nurses with non-medical groups in the community. One of the 
institutional consequences of these ideas was the creation of the pro- 
fession of social worker. They began their activity in Adolf Meyer's 
clinic (Adolf Meyer had been an early supporter of the mental hy- 
giene movement whose principles converged with his own ideas) 
at the Sainte-Anne Hospital in Paris, in England where the London 
School of Economics opened a special training course in 1929, and 
elsewhere. 

Contemporary with the emergence of psychiatric social work 
was the expansion of clinical psychology. The Binet-Simon scale 
for the measurement of intelligence, developed in 1905, was the 
first application to psychiatry of the new discipline of experimental 
psychology which had originated at the end of the previous century. 
This initial contribution led to the creation of a professional class of 
clinical psychologists who were initially concerned with the devel- 
opment and use of psychological assessment instruments and with 
theoretical research in a few psychiatric centres. Their number ini- 
tially remained low; in 1945, the United States, where they were the 
most numerous, had only 200 clinical psychologists. 





The expansion of psychiatry after 1945 


The Second World War coincided with a major transformation of 
the psychiatric specialty. The war had vividly demonstrated the fre- 
quency of mental disorders in the United States; they had proved to 
be the leading cause of medical discharges from the military service 
and the primary cause of almost 40% of selective service rejections. 
The previously prevailing view that psychiatry was a minor and 
often somewhat despised medical discipline, concerned primarily 
with the custodial care of psychologically deviant and potentially 
troublesome individuals, was progressively dispelled. The preserva- 
tion and restoration of mental health—an expression from now on 
often used by national and international institutions—began to be 
considered by governments as an important task. The fundamental 
changes which took place after 1945 and shaped psychiatry as we 
know it today were the result of this new atmosphere and of the 
emergence of new perspectives in the three traditional domains— 
the psychological, the social, and the biological. Some appeared in 
slightly different forms at different times; their relative influence was 
submitted to variations, and eventually they came into conflict. The 
result has been an impressive expansion and increase of the efficacy 
of psychiatry, profound institutional transformations, and succes- 
sive ideological waves which have had a major impact on the profes- 
sional position of the psychiatrist. 

The demographic data reflect the new importance of psychiatry 
in medicine. In the United States, the proportion of psychiatrists in 
the medical profession was 0.7% in 1920, 1.4% in 1940, and 5.5% in 
1970, the rate of growth having doubled after the Second World War. 
In France, at present, there are 18 psychiatrists for 100,000 inhabit- 
ants; they constitute 6% of all physicians. Similar levels were reached 
during the post-war decades in developed countries and remain 
relatively stable today. Even before this spectacular increase in num- 
bers, psychiatrists had been becoming conscious of the necessity 
to affirm the identity of their discipline. The First World Congress 
of Psychiatry, held in Paris in 1950, has been followed by periodic 
meetings and by the creation of the World Psychiatric Association 
to which almost every national society of psychiatry belongs. The 
health authorities of various countries have become conscious of the 
necessity to provide adequate financial means to support research 
and training in the discipline. In 1946, the United States government 
created the National Institutes of Mental Health for such a pur- 
pose, and similar efforts were made in many countries, although the 
structures of the organizations formed were different. To promote 
the same goals at an international level, the WHO, created immedi- 
ately after the Second World War, had a Section (later Division) of 
Mental Health which, among other co-ordinating activities, tried to 
overcome the difficulties of communication between the national 
schools by establishing a common nosological language. 

While the changes affected almost all countries, they were the 
most spectacular in the United States. From the end of the nine- 
teenth century until the 1930s, the concepts developed in German- 
speaking countries had been the most influential. This disappeared 
with the advent of the National Socialist regime which, under the 
cover of racist theories, expelled many of the leading psychiatrists 
from Germany and Austria, introduced compulsory sterilization 
for several varieties of mental illnesses, and promoted the volun- 
tary killing in psychiatric hospitals of mentally retarded children 
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and chronic patients. The United States, which had emerged from 
the Second World War as the most powerful country in the world, 
began to exert a widespread influence in psychiatry as in the rest of 
medicine. Because of the prestige of its research and teaching in- 
stitutions and the worldwide influence of its scientific publications, 
reinforced by the progressive adoption of English as the language 
of international scientific communication, American psychiatry be- 
came a model in many countries, even though many of the theor- 
etical trends and technical advances it adopted and developed had 
originated in Europe. However, in the United States, with a local col- 
ouring, they took on a special intensity. 


The psychodynamic wave 


An important factor in the spread of the doctrine of psychoanalysis 
was the emigration of a relatively large number of German and 
Austrian psychoanalysts to the United States from 1933 onwards. 
They had been compelled to leave their home countries for racial 
reasons—psychoanalysis had been condemned by the National 
Socialist regime, as Jewish and Freud’s books had been publicly 
burnt. Many of the young psychiatrists trained in large numbers to 
answer the demands of the armed forces adopted psychoanalysis 
under the influence of some of those in charge of the programmes. 
For a generation, until the end of the 1960s, psychoanalysis became 
the dominant ideology in American psychiatry. 

The American form of psychodynamism often deviated from 
Freudian orthodoxy, but it emphasized the role of psychogenetic 
factors, the value of the study of intrapsychic mechanisms, and the 
basic importance of psychotherapy, while giving little consideration 
to the traditional clinical approach and to nosology. The domin- 
ation of this essentially psychological orientation, sometimes com- 
pared with the success of the German mentalist school during the 
first half of the nineteenth century, had important consequences. 
Although the disorders of hospitalized psychotics were eventually 
interpreted according to the psychoanalytic theory, psychotherapy 
was mostly used, as it has been since its beginning, for ambulatory 
neurotic patients. As early as 1951-1952, three out of every seven 
American psychiatrists identified private practice as their main ac- 
tivity, and in 1954, the number of private psychiatrists exceeded that 
of their salaried colleagues for the first time, with a quarter of the 
former devoted exclusively to psychotherapy. However, with the 
initial encouragement of official institutions such as the Veterans 
Administration, clinical psychologists began to engage in psy- 
chotherapeutic activities. The number of members of the Clinical 
Psychology Section of the American Psychological Association 
reached 20,000 in 1980, at a time when they were 26,000 psychiat- 
rists in the United States. In public opinion, and to a certain extent 
in general medical opinion also, psychiatry was assumed to consist 
only of psychotherapy and psychology. 

In most other countries, the developments that occurred in the 
United States were not as intense, generally appeared later, and were 
modified by local traditions and influences. In German-speaking 
countries, they were delayed by the still powerful neuropsychiatric 
perspective and the temporary vogue for existential phenomen- 
ology. In the United Kingdom, the eclectic current fostered by the 
influential Institute of Psychiatry in London during the decades fol- 
lowing the war restricted the advance of psychodynamism; in 1956, 
Time Magazine could affirm, as a conclusion of a survey, that ‘all 
of Great Britain [had] half as many analysts as New York City. In 
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France, the psychoanalyst Jacques Lacan gave the doctrine a spe- 
cial colouring. On the whole, however, the rise of psychodynamism 
was a general phenomenon, except in communist countries where 
Freud’s doctrine had been condemned on ideological grounds. 

A reaction began in the 1960s with the successes of the new 
pharmacotherapies. Clinical psychologists had developed alterna- 
tive radically different psychotherapeutic methods based on learning 
theories, especially the behaviour therapy introduced in 1958 by 
Wolpe, supported in the United Kingdom by Eysenck, and the cog- 
nitive therapy often associated with it. These methods competed suc- 
cessfully with the psychodynamic techniques and conquered a large 
part of the field. Psychodynamism did not disappear; many of its 
concepts retained their place in psychiatry, and psychotherapeutic 
methods continued to be practised, but it lost its predominant ideo- 
logical position. In addition to its theoretical contributions, when 
its influence on the professional aspect of psychiatry is considered 
from a historical perspective, it has been an important factor in the 
further expansion of the activity of psychiatrists in the treatment of 
relatively minor disorders and has also encouraged clinical psych- 
ologists to play an active and independent role in this field. 


The social wave 


At the end of the Second World War, there was a great desire for 
social change; one of its aspects was the belief that everyone had 
a ‘right to health’ or at least the right to receive adequate medical 
care, regardless of the ability to pay. This resulted in the creation 
of the National Health Service in the United Kingdom in 1948 and 
the Social Security system in France, together with similar develop- 
ments in other countries. The social perspective, which was one of 
the basic principles underlying these developments, initiated major 
institutional changes in psychiatry. They were the result of anumber 
of factors—the necessity to give to the whole population easy access 
to psychiatric care, and also the belief that social elements played 
an important role in the aetiology of mental disorders and that they 
could greatly contribute to the healing process, with the aim of pro- 
gressively reintegrating the patient in the community. 

The most spectacular aspect of the new policy was the decline 
of the asylum system, still in a dominant position in psychiatry; in 
fact, the number of patients in psychiatric hospitals in developed 
countries reached its peak in 1955. The criticisms of the ‘degener- 
ation’ of the functioning of psychiatric hospitals and the segrega- 
tion of patients in institutions, often located far from their homes 
and families, were not new. However, the previous partial improve- 
ments, such as the decrease in the number of compulsory commit- 
ments or the creation of outpatient departments, were replaced by 
the creation of completely new structures. Ideally, the country would 
be divided into geographical zones or sectors with a population of 
about 100,000, and each zone would have a multi-disciplinary team 
of psychiatrists, nurses, clinical psychologists, social workers, and 
occupational therapists responsible for mental health. Visits and 
therapeutic interventions in the patient’s home and easily access- 
ible outpatient departments were to play an increasingly important 
role. If hospitalization was necessary, it should be, as far as possible, 
in small units located in a general hospital where the time of stay 
was to be reduced to the absolute minimum. Special institutions, 
such as day hospitals, night hospitals, and specially adapted work- 
shops, would contribute to the progressive readaptation of the pa- 
tient to life in the community. The introduction of this ‘community 


care, which was expected to work in close co-operation with gen- 
eral practitioners and various public and private institutions, would 
result in the disappearance of the traditional psychiatric hospital 
and ‘deinstitutionalizing’ psychiatry. The new system was intro- 
duced in various forms in most countries after 1969. In the United 
States, the Community Mental Health Center Act was promulgated 
in 1967. In the United Kingdom, which had strong traditions of 
social psychiatry, plans for the implementation of community care 
were discussed in the 1960s, and in 1975, the Government White 
Paper Better Services for the Mentally Ill encouraged the formation 
of multi-disciplinary ‘primary care teams, which also included gen- 
eral practitioners. In France, an official directive in 1960 created the 
psychiatrie de secteur, which was expected to result in the progres- 
sive elimination of hospitalocentrisme. The WHO encouraged all its 
member countries to adopt similar practices. 

Although, in the last 40 years, community care has become the 
official doctrine everywhere, except in Japan where the rate of hos- 
pitalization in mostly private hospitals has grown continuously, its 
implementation has not been easy despite the major therapeutic 
improvements brought about by pharmacotherapy. In some parts 
of the United States, the sudden closure of public psychiatric hos- 
pitals, combined with the inadequacies of the community mental 
health centers, was for a time at the origin of an appalling lack of 
care for a number of mentally ill people. The expected ‘fading out’ 
of hospitalization has been slow. According to the WHO, in 1976, 
the number of mental health beds (including beds for the mentally 
retarded) per 100 population was 6.5 in Sweden, 5.5 in the United 
Kingdom, 3 in France, and 2 in Germany. These figures have since 
decreased, and the types of hospitalization have changed. In 1955, 
77% of ‘psychiatric care episodes’ in the United States occurred in 
public psychiatric hospitals, compared with 20% in 1990. In 1994, 
1.4 million mental patients were hospitalized, but only 35% in 
public psychiatric hospitals, compared with 43% in general hos- 
pitals and 11% in private psychiatric hospitals, which increased 
in number from 150 in 1970 to 444 in 1988. In France, where the 
total number of psychiatric patients treated in public institutions 
(including children) is now about a million, 60% are seen exclu- 
sively on an ambulatory basis, but the number of hospital beds has 
only been reduced by half. 

Reflecting the increasing influence of social perspectives, the 
organizational changes modified psychiatry as a profession. The 
increase in the number of psychiatrists in private practice was paral- 
leled, in general to a lesser extent, by an increase in the public sector 
where their role was modified. In the traditional asylum, the au- 
thority of the psychiatrist was unchallenged and limited only by the 
legal provisions related to the procedures of commitment. Nurses, 
and later clinical psychologists, social workers, and occupational 
therapists, were ‘paramedical auxiliaries’ in a subordinate position. 
The creation of multi-disciplinary teams, working in various set- 
tings, gave the psychiatrist a function of co-ordination, made in- 
creasingly complex by the claims of professional autonomy made by 
the former auxiliaries. In some cases, such as in American mental 
health centres, psychiatrists, who were a small minority in the team 
and had less and less control over its functioning, resented what they 
considered to be the loss of their medical status. 

The importance given to social factors was not limited to the 
system by which care was delivered. Sometimes, combined with rad- 
ical ideological and political attitudes, it took more extreme forms. 


The criticisms, which first centred on the inadequacies of the ex- 
isting institutions, extended to the concept of mental disease itself. 
The antipsychiatry movement claimed that mental diseases were 
artificial constructs which were not related to diseases in the medical 
meaning of the term. The allegedly pathological behaviours, such 
as those conceptualized as schizophrenia, were in fact normal reac- 
tions to an inadequate social system. The so-called treatments were 
techniques used by the ruling classes to preserve the social order of 
which they were the beneficiaries. The only solution was a drastic 
reform of society. Such theses varied in their content and in the ar- 
guments used. They were developed by authors such as Szasz, Laing, 
and Cooper in the English-speaking world, the philosopher Foucault 
in France, and the psychiatrist Basaglia in Italy. They reached their 
greatest influence in the 1960s, and a few attempts were made to put 
their ideological principles into practice. Although they attracted 
much attention at the time, they were very limited and short-lived. 
One of the few countries where this movement had a practical im- 
pact was Italy. Basaglia’s strongly politically oriented theories were 
influential in the later legal reform of the antiquated asylum system, 
but, despite the apparently revolutionary character of some of the 
new administrative provisions, the changes made (notably the 
closure of large institutions) were very similar to those taking place 
in other countries. 


The biological wave 


Psychotropic drugs, such as opium, had been used since the origin 
of medical treatment of psychiatric patients. During the nineteenth 
century and the first half of the twentieth century, synthetic drugs 
such as the bromides, the barbiturates, and the amphetamines were 
developed. Some of them, especially the sedatives and hypnotics, 
had a real, but in practice marginal, value in alleviating some symp- 
toms. They had never constituted an effective treatment of mental 
disorders. Modern psychopharmacology not only initiated what has 
been rightly called a therapeutic revolution in psychiatry, but also 
gave a powerful new impulse to the biological perspective. Its date of 
birth is usually considered to be 1952, when the remarkable activity 
of chlorpromazine on the symptoms of schizophrenia and mania 
was discovered. This had been preceded in 1949 by the demonstra- 
tion of the value of lithium salts in manic states. A few years later, it 
was shown that the continuous administration of lithium salts pre- 
vented the recurrence of manic and depressive phases in the mood 
disorders. This was followed by the introduction of drugs acting on 
depressive manifestations (imipramine and monoamine oxidase in- 
hibitors in 1957) and on anxiety (including chlordiazepoxide, the 
prototype of benzodiazepines, in 1960). In one decade, clinicians 
had empirically discovered the fields of application of the main 
classes of psychoactive drugs—the neuroleptics, the antidepres- 
sants, the anxiolytics, and the mood stabilizers—which had been 
synthesized by biochemists and previously tested by pharmacolo- 
gists on animal models. The scale and rapidity of the spread of their 
use had major repercussions. 

The first was a modification of the image of psychiatry. The layman 
generally expected a physician to prescribe drugs to treat the dis- 
ease from which he suffered. In part, because it did not conform to 
the expected therapeutic behaviour, psychiatry had been seen as an 
atypical and almost non-medical specialty. In addition to the speci- 
ficity of the institutions in which it was generally practised, psycho- 
logical techniques were unknown in the rest of medicine, and even 
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the recently introduced biological techniques (the shock therapies 
and the lobotomy) had a somewhat strange and frightening char- 
acter. The establishment of pharmacotherapy contributed strongly 
to modifying this perception, even if it did not completely remove 
the traditional prejudices. 

The second consequence was even more important. There 
were, at least initially, controversies about the roles of pharmaco- 
therapy and of the new social perspectives in the restructuring of 
the mental health care system. In fact, the number of inpatients 
in psychiatric hospitals began to decrease from 1955 on, and it 
seems obvious that the main cause was the therapeutic efficacy of 
the drugs. They reduced the mean length of hospitalization and 
eventually even made it unnecessary. Although some types of pa- 
tients did not benefit from them and the mental state of others 
was only improved, many who had previously been condemned 
to long stays in the hospital were able to return to the community, 
with their treatment eventually being continued in rehabilita- 
tion settings and often on an ambulatory basis. Pharmacotherapy 
had made possible the practical implementation of social trends. 
In addition to this basic contribution to the ‘deinstitutionaliza- 
tion’ movement, pharmacotherapy was an essential factor in the 
growth of private practice. The success of psychotherapy had been 
one contribution to this, but the complexity of its techniques, the 
length of the treatment, its applicability to only a few types of dis- 
orders, and the uncertainty of the results limited its use to a rela- 
tively small number of selected patients, even in the United States 
during the period of the greatest popularity of psychodynamism. 
Pharmacotherapy could be used much more easily, on a much 
larger number of patients, and did not require long and complex 
training. Some of the drugs, such as the anxiolytics, had an imme- 
diate symptomatic effect, and others (the antidepressants and the 
neuroleptics) could attenuate or suppress the pathological mani- 
festations in a few weeks and, outside the acute phase requiring 
hospitalization, could be used on an ambulatory basis. It was not 
only private psychiatrists who were able to treat many of their pa- 
tients successfully; general practitioners also began to prescribe 
psychotropic drugs on a large scale. 

The third consequence was the explosive development of bio- 
logical research in psychiatry. The first therapeutic discoveries 
were largely empirical, but new biochemical techniques allowed 
some of the modes of action of the drugs to be elucidated. From 
1960 on, studies of the influence of these drugs on various aspects 
of neurotransmission in the brain stimulated hypotheses about 
the abnormal biochemical mechanisms considered to be the phys- 
ical substrate of the mental disorders. Meanwhile new methods 
had been introduced for the examination of morphological modi- 
fications of the living brain and even of the nature and localization 
of the biochemical processes taking place in its different parts. The 
discovery by Watson and Crick in 1953 of the chemical basis of 
heredity and the subsequent spectacular advances in molecular 
biology gave a fresh impulse to psychiatric genetics, which had 
been partly discredited by their misuse by the National Socialist 
regime. Under the name of neurosciences, these new fields of en- 
quiry progressively acquired a dominant role in psychiatric re- 
search at the same time as the introduction of an ever increasing 
number of drugs, eventually more potent, usually with less in- 
convenient side effects, and sometimes with new therapeutic 
indications. 
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medicalization’ of psychiatry 





In 1983, Melvin Sabshin, the director of the American Psychiatric 
Association, summarized the overlapping chronologies of the psy- 
chodynamic, biological, and social waves as follows [6]: 


‘Psychoanalysis surged through the United States during the 
1940s and the 1950s. During the 1950s, a new psychopharmaco- 
logical approach emerged which had great impact on psychiatric 
practice generally ... The 1960s saw the dawning of a community 
psychiatric approach which attempted to accomplish a massive 
desinstitutionalization of patients from public psychiatric hospitals’ 


Although less radical and not strictly identical, the general pic- 
ture was similar in other countries. The 1960s saw an often uneasy 
coexistence of three schools. ‘During that decade, wrote Sabshin, 
‘American psychiatry enlarged its boundaries and its practices so 
broadly that many critics grew increasingly concerned with the 
‘bottomless pit’ of the field. The extension of the practice of psy- 
chotherapy, frequently to cases with no clear pathological character, 
tended to blur the limits of the mental disease concept and to neglect 
the traditional diagnostic approach. Social work was also tempted to 
concern itself with problems with no obvious medical nature, such 
as those still described in 1978 in the United States by the President's 
Commission of Mental Health, which asserted that ‘American 
mental health cannot be defined only in terms of disabling mental 
illness and identified mental disorders’ and identified as a domain of 
concern for workers in the field ‘unrelenting poverty and unemploy- 
ment and the institutionalized discrimination that occurs on the 
basis of race, sex, class, age ...’ In sharp contrast, the new biological 
psychiatry recognized only a strictly medical model, stressing the 
necessity of an accurate diagnosis for the prescription of drugs and 
for the testing of their efficacy, and advocated restrictive limits in the 
definition of mental diseases. 

In around 1970, a profound change took place. Although the in- 
stitutional modifications of the care system favoured by the gener- 
alization of drug therapy continued and expanded under its various 
forms everywhere, the influence of psychodynamism began to de- 
cline within the psychiatric profession. According to the director 
of the National Institutes for Mental Health, ‘it was nearly impos- 
sible in 1945 for a non-psychoanalyst to become Chairman of a 
Department of Psychiatry (in the United States), but by the mid 
1970s, the situation was reversed. The publication by the American 
Psychiatric Association of the Third Revision of the Diagnostic and 
Statistical Manual of Mental Disorders (DSM-III) is often considered 
as the symbolic expression of the change. This took place in 1980, 
but its origins were more than a decade previously, and it was sig- 
nificantly presented by its apologists, such as Klerman, as ‘a decisive 
turning point in the history of American psychiatry ... an affirm- 
ation of its medical identity. The new nosology, which was categor- 
ical in nature and introduced diagnostic criteria borrowed from 
experimental psychology in the delimitation of the categories, did 
not allow any reference to ‘unprover aetiological factors or patho- 
genic mechanisms, unless ‘scientifically demonstrated’ It claimed to 
be purely descriptive and therefore acceptable as a means of commu- 
nication by all psychiatrists, whatever their individual orientation 
may be. It was, in fact, perceived, not only in its country of origin 
as a reaction against the extreme socio-psychological positions— 
the deletion of the term neurosis because of its usual association 


with the psychoanalytic theory of intrapsychic conflicts raised vio- 
lent controversies—but, despite its proclaimed ‘a-theorism, also as 
favouring the biological medical model. Although initially exclu- 
sively devised for the use of American psychiatrists, to the surprise 
of its authors, it was rapidly accepted in all countries, and the WHO 
adopted finally its principles in its own nosological system—the 
International Classification of Diseases. Originally, the result of a 
brutal reversal of trends in American psychiatry, it expressed a gen- 
eral change of direction in the psychiatric way of thinking towards 
affirmation, against the forces believed to threaten it of the medical 
character of psychiatry. 





Crisis in psychiatry? 


At first glance, the new status of psychiatry seems to have taken firm 
root in the last four decades. It rests on the general acceptance of the 
medical definition of the concept of mental disease and of the pro- 
gressive realization of a diversified, but co-ordinated, institutional 
system of mental health care. The biological perspective, even ifit has 
taken a prominent place in research and therapy, is now combined 
with psychological and social approaches in the bio-psychosocial 
model. The psychiatrist, in accordance with his medical professional 
responsibilities, occupies a central position in a multi-disciplinary 
team whose members contribute their special competences to the 
common goal. 

This idyllic picture is far from a reflection of reality, even in devel- 
oped countries, and the existence of a crisis in psychiatry is evoked 
with increasing frequency. Under the pressure of economic con- 
straints, efforts are made everywhere to control the rising burden 
of medical care. They have taken different forms according to the 
country—from the managed care system in the United States to 
the numerus clausus system in France, in which the number of in- 
ternships available is determined by the government—but their 
common aim is to limit the number of psychiatrists and the cost 
of their activities. In the United Kingdom, the most socialized top- 
down health care system in the world, managers are now respon- 
sible for clinical governance, rather than doctors themselves. This 
has already resulted in a complete system failure at one general hos- 
pital. Psychiatry has proved particularly vulnerable to management 
edict, and recruitment of competent consultant psychiatrists has 
become increasingly difficult. Paradoxically, the recognition of the 
frequency of mental disorders and the growing demand for psychi- 
atric treatments has been associated with a reduction in the domain 
of action of psychiatrists, who are now often vastly outnumbered by 
clinical psychologists and social workers. In the United States, by 
1990, 80,000 ‘clinical’ social workers were active in the psychiatric 
socio-psychological domain, a quarter of them in part- or full-time 
private practice. The claims of these powerful professional groups 
are not limited to a completely autonomous status but, in the case of 
clinical psychologists, extend to the demand for a legal recognition 
of such typical ‘medical privileges’ as the right to hospitalize patients 
and to prescribe drugs. The most impressive change has been in the 
proportion of mental disorders being now treated by general practi- 
tioners as a result of the availability of psychotropic drugs with fewer 
side effects; in France, 60% of antidepressants are now prescribed by 
general practitioners. Professional jealousies also wax and wane but 
undoubtedly affect the attractiveness of psychiatry as a profession. 


These examples may not be a fair representation of the global 
picture, but there is undoubtedly a movement towards a limitation 
of the psychiatric specialty to the care of the most severe cases—in 
practice, the psychotic cases. This reached its most extreme expres- 
sion in the United Kingdom with the development of the National 
Service Framework in 1997. This, in effect, refocused secondary 
care on ‘psychosis: Some neuroscientists raise doubts about the use- 
fulness of maintaining psychiatry as a specialty, even in this field. 
Influential, biologically oriented psychiatrists have recently pro- 
posed, on theoretical and practical grounds, that psychiatry should 
be absorbed into a new medical discipline, akin to former neuro- 
psychiatry, and all or most of its socio-psychological aspects should 
be left to non-medical professions. 

In the last decade, medicine in general, but psychiatry in particular, 
has been accused of selling out to the pharmaceutical industry. This 
has been based on the high profitability of patented drugs prescribed 
for psychiatric problems and the fact that large marketing budgets 
were devoted to the education and entertainment of prescribers. 
Individual doctors undoubtedly profited from their involvement 
with companies, and companies obviously influenced how informa- 
tion about new treatments was delivered. However, companies are 
highly regulated, and indeed off-label promotion, to take one ex- 
ample, has been heavily penalized. The idea that anyone with a so- 
called conflict of interest (that is, has been paid for professional work 
by acompany) forfeits the right to make a scientific judgement about 
a treatment is too extreme to be useful. For most doctors, it is their 
time that is for sale, not their integrity. Suspicion about relationships 
with big pharma is, however, set to decline with the virtual with- 
drawal of companies from neuroscience research in the last 10 years. 


Conclusions 


Since psychiatry has emerged as a specialty, it has been submitted to 
conflicting forces. The demands of society, changes in the concept 
of mental disorder and of its limits, variations in the role played by 
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different theoretical perspectives, and successive scientific discov- 
eries have been responsible for an evolution reflected in the profes- 
sional status and role of the psychiatrist. Displacements of the centre 
of gravity of a complex structure in which biological, psychological, 
and social factors interact have modified the image of psychiatry. 
The threat of being incorporated in other medical specialties or of 
being deprived of its medical character is but another transitory epi- 
sode in its history. 
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We need new ethics for twenty-first-century psychiatry. Twenty- 
first-century psychiatry will not be grounded in categorical diag- 
nosis, office-based therapy, medicalization, and use of drugs. It will 
be oriented around a view of mental health as a continuum; it will 
bring novel therapeutics, alternative sites of clinical practice, and 
new targets of intervention (for example, not patients, but the pro- 
drome; not the individual, but the family). In some ways, it may be a 
return to the bio-psychosocial formulation of the patient as an agent 
in an embedded social context, with interventions aimed at pro- 
moting well-being, rather than the treatment of psychiatric disorder. 

In this chapter, we consider a set of innovations in psychiatry that 
either raise new ethical questions for psychiatrists and for the psychi- 
atric profession or require a revision of prior understanding of what 
constitutes ethical behaviour in a psychiatric context. We examine 
two novel neurointerventions at opposite poles—as an example 
of neuroengineering efforts for increasingly specific interventions 
at the level of neural circuits, we highlight deep brain stimulation 
(DBS), and perhaps as an example of an intervention at the other 
end of the ‘specificity’ spectrum, we highlight the therapeutic use of 
placebos. We spend the remainder of the chapter discussing ethical 
and philosophical issues spurred by endeavours to enable a future 
in which psychiatry is increasingly preventative. The discussion is 
intended to be exemplary, rather than comprehensive; we hope to 
prompt sustained reflection and debate on these topics in the profes- 
sion and, in particular, as part of psychiatric training. 
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Perhaps the greatest challenges for psychiatric ethics have arrived 
in the form of technologies that allow for strategic interventions 
directly into neural circuits and mental processes. Examples in- 
clude gamma knife surgery and invasive and non-invasive brain 
stimulation, including optogenetics. As a case study of this trend 





in neurointerventions, we consider the use of DBS to treat anorexia 
nervosa. 

DBS is a non-ablative neurosurgical procedure that has 
been used with great success to ameliorate motor symptoms in 
treatment-refractory patients suffering from Parkinson’s disease. 
While the use of the procedure in this context was supported by 
the UK’s National Institute for Health and Care Excellence (NICE) 
in 2003, DBS is being increasingly considered as an experimental 
therapy for a wide range of neurological and psychiatric condi- 
tions, including (among others) chronic pain, depression, epilepsy, 
and anorexia nervosa. Preliminary evidence from the experi- 
mental use of DBS in these contexts suggests that the procedure 
may be used to achieve highly beneficial treatment outcomes for 
treatment-refractory patients suffering from these conditions 
[1, 2]. Furthermore, DBS has advantages over existing treatment 
methods since levels of stimulation can be tailored to the needs 
of the individual patient and it is reversible—treatment can be 
stopped and electrodes even removed at the request of the patient 
[3, 4]. 

The ethically challenging properties of DBS emerge from the 
technology’s ability to specifically interfere with neural activity in 
the brain and modify brain states. Of course, it is true that any 
intervention affecting feeling, experience, motivation, and behav- 
iour ultimately modifies neural activity. This is true of indirect psy- 
chological interventions such as psychotherapy, as well as direct 
interventions such as drugs, surgery, or DBS; in some sense, they 
all involve the modification of brain states, so ethical issues raised 
by one potentially apply to others. That said, the ‘holy grail’ of 
neurointervention is precise control of the activation of neurons in 
specific circuits to produce targeted effects. Perhaps the most con- 
troversial application of this kind of precision in neurointervention 
would be to bring desire under close cognitive control. Choosing 
what to desire to do, and being able to do that, is an enormous 
power with implications for freedom, autonomy, and well-being. 
Although we are not yet at the stage where agents are enabled to 
simply will a desire into existence, such that it motivates them suf- 
ficiently to act, DBS moves us much closer to this than we have ever 
been before. 


In the context of anorexia nervosa (AN), DBS could be used to 
impose a motivating desire to eat or to give patients the cognitive 
control required to resist compulsive motivation to engage in dan- 
gerous weight loss behaviour. However, other mechanisms, such as 
the use of this technology to alter first-order motivating desires or 
the patient’s emotional traits, may confer significant harms as well 
as potential benefits. For example, amplification of first-order de- 
sires to eat without higher-level endorsement may undermine, 
rather than promote, autonomy [5]. The various ways in which dif- 
ferent mechanisms are likely to affect AN patients’ experiences of 
themselves and their ability to be self-governing must be borne in 
mind in research and clinical development protocols for DBS treat- 
ment. Furthermore, practitioners should be wary of the new ways in 
which the use (and indeed, mere prospect) of DBS treatment could 
introduce new avenues of perceived coercion and new problems re- 
garding the authenticity of AN patients’ desires regarding their con- 
tinued treatment and eating behaviours [5]. 


The case of placebo for the treatment of depression 


In 2008, Irving Kirsch published the results of a meta-analysis 
that, for the first time, included unpublished trials of modern 
antidepressant drugs that were logged with the Food and Drug 
Administration. Across 47 clinical trials, these antidepressants 
failed to perform significantly better than placebo in mildly de- 
pressed patients and performed only somewhat better than placebo 
in severely depressed patients. Furthermore, while antidepressants 
did perform better than placebo for severely depressed patients, the 
data suggested that as patients became more severely depressed, the 
group who were given the drug did not fare any better; rather, the 
control group who were taking placebo fared worse [6, 7]. If these 
findings are correct, it would seem that severely depressed patients 
do not respond as strongly to the placebo effect. Consequently, al- 
though there is a larger difference between the drug group and the 
placebo group among severely depressed patients, the drug is no 
more effective. 

Such studies raise the important issue of the extent of the pla- 
cebo effect in psychiatric treatment and the moral status of such an 
effect. Since depression, like pain, is a subjective experience, what 
matters is subjective improvement, and not how it is brought about. 
That is, two challenges facing psychiatry are to properly ascertain 
the treatment effect, excluding publication bias, and to better utilize 
the placebo effect in modern psychiatric practice. Many doctors are 
forbidden by law or by professional codes of ethics from prescribing 
placebos without revealing them to their patients [7, 8] , but the 
moral status quo is under pressure as evidence mounts showing that 
placebos have substantial subjective benefits and that they may even 
form a larger component of existing popular therapies than was pre- 
viously thought [9, 10, 11]. 





Early intervention and prevention in psychiatry 


Efforts in predictive biomarker development 


‘A young man presents to a clinic concerned that people are talking 
about him on the public bus on his way to school. He knows that they 
are probably not, but he is troubled by the disconcerting feeling and 
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it is starting to make him dread the idea of taking public transport. 
He takes a clinical assessment designed in part to detect other sorts of 
“attenuated” psychotic symptoms; approximately one third of the indi- 
viduals who scored as highly as he does go on to have a full psychotic 
break within two years, one third maintain the current level of troub- 
ling attenuated symptoms, one third get better. What should the clin- 
ician do with this information? Does the young man have a disorder 
or is he “merely” at risk of a disorder?’ 
[Case adapted from Baum 2016] 


In the revision of DSM-IV-TR to DSM-5, a vigorous debate 
surfaced about whether to include a disorder characterized 
chiefly by the risk of first psychosis [12]. Clinical tools designed 
to assay attenuated psychotic symptoms had been shown in mul- 
tiple clinical sites to be able to delineate a group of help-seeking 
young people, 20-40% of whom would go on to have a psychotic 
break in the next 2 years. The main opposition to creating such 
a category in the DSM-5 was that it was inappropriate to have 
disorders ‘based on risk and that telling someone like the young 
man in this case that he is at high risk of psychosis could be need- 
lessly traumatic, as a large portion of individuals did not, in fact, 
go on to develop the psychosis. Proponents responded by side- 
stepping the concern about risk—that even those who did not de- 
velop psychosis still had troubling symptoms that were deserving 
of care. In the end, this category was not included in DSM-V [13], 
but the debate about the proper role of risk in psychiatry should 
be far from over. 

At the laboratory bench and super-computing clusters, neurosci- 
entists and bioinformaticians are hard at work to uncover biomarkers 
in blood, saliva, brain images, and cerebrospinal fluid for diverse 
phenotypes in psychiatry from psychosis, suicide, and abuse to PTSD 
and dementia. Efforts do not stop at individual biomarkers, however, 
as current thinking suggests that an assemblage of multiple bio- 
markers, demographics, and patient history into actuarial risk pre- 
dictor tools could yield more predictive power, as does the fracture 
risk score (FRAX) for all-cause risk of hip fracture, or the Reynolds 
risk score for 10-year cardiovascular event. The quest for a better clas- 
sification is increasingly aided by advances in machine learning and 
has even spilled beyond the medical world into social media where 
Facebook is currently developing and deploying machine-learning 
classifiers to identify posts that may indicate the author is at risk of 
suicide [14]. What makes matters more complex is that this work is 
beginning to reveal a risk landscape where biomarkers do not map 
cleanly onto existing disorders but sometimes indicate cross-disorder 
risk or risk of events like suicide, rather than disorders themselves. As 
this research progresses, the current lines between disordered and 
well will be blurred and replaced by differential magnitudes of risk 
of different harmful symptoms and events. Psychiatry as a field will 
have an increasing number of debates similar to that about young 
individuals with a clinical high risk of psychosis, and it will have to 
grapple with the question of how to incorporate risk into its nosology 
and, more broadly, how risk should change physicians, patients, and 
society's obligations to each other (for an extended discussion of the 
neuroethics of biomarkers, see [15]). 


The role of risk in the concept of disorder 


So what role should quantitative risk play in psychiatric nosology? 
Should we have disorders based on risk or on categories of risk of 
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disorders? Interestingly, if one delves into the (seldom read) first few 
pages of DSM where it discussed what, broadly, should be thought 
of as a disorder, one can see a telling evolution of a response to these 
questions. In DSM-IV-TR, risk is explicitly highlighted in a ‘risk 
clause [16]: 


‘[A disorder is a dysfunction in the individual] “Associated with pre- 
sent distress (e.g. a painful symptom) or disability (i.e. impairment 
in one or more important areas of functioning) or with a significantly 
increased risk of suffering death, pain, disability, or an important loss 
of freedom: 

(American Psychiatric Association [APA] 2000) [Our emphasis]. 


But faced with the prospect of throngs of predictive biomarkers, 
a proposal was put forward to drop the risk clause entirely, with an 
explanation that the purpose of doing so was to ‘differentiate more 
clearly between disorders and risk factors’ [17]. One of us has ar- 
gued that elevated risk of harmful symptoms or of harmful events 
is precisely why we care about a disorder [15] and that it is really 
the combination of an unhelpful idea that human existence is neatly 
cleaved into two conditions, healthy or sick, with disagreements 
over the magnitude of risk and whether the symptoms/events are 
indeed harmful that undergirds disagreements about whether a risk 
state should be a disorder or not. To see this intuitively, note that 
we can have highly painful, but quickly resolving, conditions like 
cramp, which we are unlikely to think of as a disorder (or at least 
a serious one) unless we think it is highly likely to happen again or 
repeatedly in the future; similarly, people who have a single, unpro- 
voked seizure, though often harmed during the seizure itself (if only 
by loss of bodily autonomy), are monitored for biomarkers of future 
seizure risk [for example, epileptiform activity on an encephalogram 
(EEG)] that could be suggestive of a disorder of sustained predispos- 
ition to seizures, e.g. epilepsy. On the other hand, a predictive bio- 
marker like profoundly thickened heart walls indicates a high risk of 
sudden cardiac death and is relatively uncontroversially thought of 
as the disorder of hypertrophic cardiomyopathy. 

What should be controversial and debated openly is the proper 
course of action indicated by a particular risk and the types of risks 
we should care about. If the ‘other risk factors’ are indeed of very very 
low predictive value, this is what matters, not that they are ‘risk fac- 
tors’; indeed, machine-learning classifiers based on the profile of risk 
factors may soon have higher predictive value of harms we care about 
than currently accepted clinical standards; for example, in a recent 
meta-analysis, all of the currently accepted ways of estimating who 
is at risk of suicidal ideation and attempt, including the oft held idea 
that those with disorders such as depression and bipolar disorder 
are at elevated risk, were found to perform only marginally better 
than chance [18]; several recent machine-learning classifiers, how- 
ever, performed astronomically better, with one metric of prediction 
accuracy—the area under the curve (AUC)—of 0.7-0.9 where AUCs 
of 1 represent perfect prediction and 0.5 represent chance level [for 
comparison, an elevated score on a prostate-specific antigen (PSA) 
test has an AUC of 0.68-0.83 for detecting cancer, depending on the 
PSA cut-off value and the grade of cancer] [19]. Though these al- 
gorithms require validation on independent datasets, presumably if 
part of the reason we care about some psychiatric disorders is the 
risk of suicide, should we not be even more concerned for the wel- 
fare of those identified by an algorithm to be high risk, even if the 
individual does not map to a currently named disorder? 


One of the benefits of being challenged by the development of pre- 
dictive biomarkers is that it forces us to recognize, more than is cur- 
rently the case, that the hard conceptual work in psychiatric nosology 
should be deciding whether a risk is clinically (or morally) significant, 
whether something is harmful, and whether harms are due primarily 
to an unjust organization of society [15]. Because the field of psych- 
iatry is unlikely to abandon current nosology entirely, perhaps an 
interim solution is to retain traditional psychiatric categories but to 
think of them as useful heuristics for estimating the likelihood of im- 
portant point outcomes like suicide, future decreases in well-being, or 
conversely, resolution of symptoms, which can be supplemented with 
algorithmic and machine learning—similarly to the way a diagnosis of 
lung cancer is now supplemented by a range of biomarkers from clin- 
ical and anatomical pathology to create a better estimate of prognosis. 


The role of welfare in the concept of disorder 


A complementary approach has also been put forward—treatment 
should be primarily about improving well-being, and therefore, dis- 
order categories are useful only in so far as they help us to that end 
[20, 21]. The debate in psychiatry should be less about whether an 
individual has a disorder, but how best to improve that individual’s 
quality of life. Thus, a more fruitful question than ‘does this child 
have attention-deficit/hyperactivity disorder’ is ‘would stimulants 
improve this child’s quality of life (with a whole-life view)?. Coming 
back to risk, if awareness, medical, environmental, or other inter- 
vention to alter that risk can improve the individual's well-being, 
then that course of action should be considered. Of course, this sen- 
tence should provoke the question ‘under what conditions would a 
reduction of risk improve well-being?’ 

To answer this question, it is helpful to frame it within a welfarist 
account of disability [20, 21]. According to this account, a person has 
a disability if they have some stable psychological or biological state 
that makes it likely that their life will get worse, in terms of their own 
well-being, in the social and environmental context they inhabit. 
This conception of disability has several morally relevant properties. 
Firstly, it makes no reference to normality; as such, it does not in- 
vite a distinction between treatments that count as therapy and those 
that count as enhancement. Secondly, it makes no recommendation 
about the best way to treat disability. The biological, psychological, 
social, cultural, and other factors that contribute to a person having 
a disability are relevant only in so far as they negatively impact that 
person’s well-being. The disability may be treated by addressing any 
one or more of these factors. While it may be appropriate to treat 
disability by addressing the biological factors that contribute to 
it—by prescribing medication or surgery, for example—it may also 
be appropriate, instead, to address non-biological factors. Imagine 
someone whose leg has been amputated and who qualifies by the wel- 
farist account as disabled because the amputation makes it difficult 
for her to negotiate the stairs to the second-floor flat where she lives. 
This combination of factors reduces her well-being. (For simplicity, 
imagine that no other factors combine with her amputation to reduce 
her well-being.) Her well-being will improve if she is fitted with a 
prosthetic limb; alternatively, it will also improve if a lift is installed in 
the building where she lives. The medical route to treating disability, 
then, is not the only route, nor even the most obvious or appropriate. 

We believe that the current approach to mental health and illness 
would benefit from considering a welfarist approach [22]. Such an 
approach to the goals of psychiatry requires an account of welfare 


or well-being. Measures of well-being have not been widely used 
in psychiatry until very recently, largely because the criterion of 
‘impairment has not been framed in alliance with such epidemio- 
logical measures. There is some suggestion that this is changing and 
that, as quality of life measures enter psychiatry, they may need to 
be re-normed to take account of the various cognitive, behavioural, 
and other differences entailed in living life with mental health 
challenges [22]. 

In philosophy, three broad theories of well-being have been de- 
scribed: hedonistic, desire fulfilment, and objective list. According 
to hedonistic theory, what makes someone's life go well are positive 
experiences such as pleasure. According to desire fulfilment theory, 
what is good is satisfaction of one’s desires, even if one does not gain 
pleasure from that satisfaction. According to objective theory, cer- 
tain things are good for people, irrespective of whether they desire 
them or gain pleasure from them [23]. 

Psychiatrists routinely encounter cases where these three accounts 
of well-being come into conflict. A patient may enjoy something he 
desires not to enjoy or should not enjoy, even if he does, e.g. sadistic 
pleasure. Hedonistic and desire fulfilment theories are both sub- 
jective theories of the good or value. In this sense, “beauty is in the 
eye of the beholder’ The quality of a life is relative to that individual’s 
own valuation. 

This is the kind of view that is often put forward by some dis- 
ability activists and patient advocacy groups; they claim that be- 
cause the disabled are as satisfied with the quality of their lives as the 
non-disabled, the quality of their lives is just as good. In a well-cited 
study, becoming paraplegic has very negative effects on a person in 
the short term—it causes a significant decrement in subjective life 
satisfaction. However, over time, the quality of life returns to nearly 
normal. Many people with paraplegia adapt to their state [24]. Yet, it 
could be argued that the loss of independence and mobility are ser- 
ious disadvantages (though clearly ones whose badness depends on 
the built environment), even if people are equally satisfied. 

There are good reasons to include at least an objective element 
within an account of well-being, especially within psychiatry. In 
psychiatry, the subjective perspective is often fundamentally af- 
fected by the disorder, and it is important to tether treatment to 
some objective perspective. It is important to recognize that even 
within a subjectivist framework, however, a person can be mistaken 
about what is of value. People can be wrong about the facts, for ex- 
ample believing that they are receiving medical care that is beneficial 
to them when it is not. Or people could be wrong about whether a 
particular state of affairs instantiates what they value [25]. For ex- 
ample, suppose someone values empathy but mistakenly applies the 
value of empathy to support a ‘tough love’ approach. Anorexia might 
be one psychiatric condition that involves both factual and evalu- 
ative errors about body image and shape. However, the problem with 
subjectivist accounts of well-being—and more generally of value or 
reasons—is that such accounts are, at base, unfettered—one can lit- 
erally desire anything. 

Each of the three accounts of well-being—hedonistic, desire fulfil- 
ment, and objective list theories—has some plausibility. Parfit con- 
cludes that an adequate account of well-being must accord weight 
to all of valuable mental states, desire satisfaction, and objectively 
valuable activity [26]. It may be best not only to engage in activities 
that possess objective value, but to also want to engage in such ac- 
tivities and to derive pleasure from them. Thus, a welfarist account 
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would posit that any changes to features of our genetics or biology 
that make it more conducive to possessing, desiring, and enjoying 
the things that are objectively good for us should be furthered as a 
goal of medicine. And in our utilization of biological or algorithmic 
technology, it is perhaps equally important to seek better estima- 
tions of the likelihood that things will go better for an individual as 
it is that things will go worse. 


Personalized medicine, predictive psychiatry, 





enhanced clinician responsibilities 


Consider the following hypothetical case. 

A young woman with florid first-episode psychosis and cogni- 
tive impairment was put on antipsychotic medication and conse- 
quently developed neuroleptic malignant syndrome (NMS). A case 
of negligence was brought against the prescribing psychiatrist; one 
piece of evidence brought forward by the prosecution was that 
the physician considered, but did not test the woman for, anti-N- 
methyl-D-aspartate (NMDA) receptor autoantibodies, which would 
have provided some probabilistic information on NMS risk (for ex- 
ample, see [27]). If the physician had ordered this serological test, 
he would have found that the woman had high titres of anti- NMDA 
antibodies. The prosecution claimed that since this test would have 
yielded relevant information, the physician should have ordered it; 
because he did not, his prescription was negligent. 

Are liability cases like this one poised to proliferate in psychiatry? 
A psychiatrist might be expected to ensure that a patient does not 
carry significant risks of a bad reaction to psychopharmaceuticals. 
How will the responsibilities of the clinician adapt when the ability 
to detect risks of bad reactions and to perform other risk assess- 
ments greatly expand with the development of other predictive bio- 
markers and algorithms? 

Aristotle posited that for someone to be properly held respon- 
sible for a state of affairs, she had to have acted voluntarily and with 
knowledge of what she was doing [28]. If I were not in control of my 
actions, or if I were not know what I did (or failed to do) was wrong, 
I would be excused from blame. If a patient deliberately overdosed 
on a tricyclic antidepressant, but the psychiatrist did not know that 
the patient was suicidal, this ignorance might excuse the psychiatrist 
from blame. Yet the technical ability to foresee, to some extent, clin- 
ically significant events in psychiatry is poised to drastically shrink 
the number of cases where one could validly appeal to the excuse of 
ignorance. 

All the way back to Aristotle, theories of moral responsibility have 
incorporated something similar to a risk clause that permits an indi- 
vidual to be blamed for harm, even if they were in no control or did 
not know what they were doing at the time the harm came about. 
Aristotle considers the case of someone who chooses to drink and 
because of his drunkenness does wrong; that person is still blame- 
worthy for the wrong, even though he had no control over his ac- 
tions during the highly drunken state, Aristotle argues, because the 
person could foresee entering such a state when he decided to drink 
(those who have had training in the treatment of alcohol and sub- 
stance abuse disorders may have a more nuanced version of these 
arguments). 

In the case of the young woman, even though the clinician had 
no meaningful control over the harmful reaction at the time that 
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the woman developed NMS, it might be argued that he did have 
some sort of control at an earlier time point in so far as he made a 
choice about whether to prescribe antipsychotics or another course 
of therapy and could have foreseen the possibility that she would 
develop NMS as a result of antipsychotic treatment. The prosecu- 
tion is arguing that the psychiatrist is therefore blameworthy for 
the harm to the patient. It is certainly true, however, that there is 
always a possibility that a patient given antipsychotics will develop 
NMS, and it is hardly true that the prescribing psychiatrist would 
properly be blameworthy for every case of NMS. The validity of 
the prosecution's claim therefore lies in arguing that the psych- 
iatrist becomes blameworthy because the likelihood of NMS would 
be elevated if the person had anti-NMDA antibodies, that is, the 
magnitude of the probability makes all the difference between NMS 
being unfortunate or culpable. 

One of us has pointed out, however, that philosophical theories of 
responsibility and legal structures offer little guidance as to the magni- 
tudes of probabilities necessary for a psychiatrist to be culpable; those 
theories of moral responsibility that mention risk often do so in a 
binary fashion (for example, it was foreseen or not foreseen, significant 
risk or insignificant risk) or even posit that the necessary magnitude of 
risk is a ‘non-zero probability’ [15]; indeed by ordering whole-exome 
sequencing, which is quickly becoming cheaper than ordering sev- 
eral individual genetic tests (and thus being considered for genomic 
screening of newborns in some countries), a clinician instantly gains 
knowledge of thousands of foreseeable non-zero probabilities. 

Discussion of which risks create liability and which risks can be 
justifiably ignored is all the more pressing because risk assessments 
in psychiatry are far from being restricted to estimating bad reac- 
tions to pharmaceuticals. Here we consider two further cases: risk of 
relapse and risk of harm to third parties. 

The miniaturization of biosensors has enabled them to be inte- 
grated into medications where they can wirelessly report ingestion 
events [29]. This is designed to aid monitoring of patient compliance 
with a prescribed drug therapy. In psychiatry, however, one could 
easily see how such a technology could create new obligations for the 
psychiatrist. It is common for discontinuation of antipsychotic medi- 
cation (aka non-compliance) to precede an acute phase of psychosis 
that sometimes results in traumatic, forced hospitalization. What 
would be the responsibility of the prescribing psychiatrist who would 
be notified by the biosensor that a patient might be discontinuing 
medication? One practical solution might be to involve the patient 
through an extension of a so-called “Ulysses Contract’ (for example, 
reference [30]). In Homers Odyssey, Ulysses ordered his men to 
lash him to the mast so that he could not throw himself into the sea 
upon hearing the sirens’ song, the watery plunge being a foreseeable 
side effect of carrying out his desire to hear their voices. So too, pa- 
tients might, when well and in full possession of mental faculties of 
judgement, pre-emptively give their treatment team permission to 
intervene and hospitalize early when medication discontinuation 
is detected, even though the patient at the time of discontinuation 
would not meet the criteria for involuntary hospitalization. 

Overall, the role of patient participation in decision-making may 
increase as prognostic ability grows through the use of biomarkers 
and big data algorithms. For example, there have been some cases 
where people with anorexia in the UK have been allowed to die after 
a long and protracted course with multiple relapses [31]), while 
it is more normal to force-feed people with anorexia who have a 


dangerously low body weight [32]. Greater prognostication may 
allow high-risk people with anorexia to be identified, and targeted 
in-depth discussions undertaken during periods of competence to 
elicit preferences for what should be done when life-threatening 
anorexia is reached. In this way, better definition of potential pa- 
tient pathways makes it possible for patients to better and more em- 
pathetically engage with their possible futures, and for clinicians to 
elicit their values in relation to those futures. In this way, patient 
autonomy can be enhanced as part of a framing of mental health and 
illness as continuous and dimensional. 

Risk assessment in psychiatry also can include risk of harm to 
third parties. Already, over 200 different actuarial tools are being 
used in forensic psychiatry to estimate the risk of violent behaviour 
[33]. There are also individual behavioural biomarkers that may be 
informative of the risk of violent behaviour, with one of the most 
discussed in the bioethics literature being the monoamine oxidase 
A (MAOA) gene x environment interaction (reviewed in [34, 35]). 
Briefly, MAOA, involved in the metabolism of monoamine neuro- 
transmitters, is present in two common genetic alleles, MAOA-low 
(MAOA-L) and MAOA-high (MAOA-H), named according to their 
metabolic activity in vitro. When raised in an environment where 
they experienced maltreatment, boys with MAOA-L were found 
to be at elevated risk of future violent behaviour, while boys with 
MAOA-H were found to be resilient. There are nuances and re- 
maining controversies about this gene x environment interaction, 
but for the purposes of this chapter, we wish to highlight the cross- 
cutting moral implications of such a biological risk of harm to 
others. Specifically, we might ask if knowledge of differential suscep- 
tibility to the violence-predisposing effects of maltreatment should 
change how public health programmes and social workers prioritize 
the use of limited social supports (if there is not enough support 
for all children, should programmes prioritize aiding boys at highest 
risk of expressing violent behaviour, even if the differential risk is 
small), or change how courts should evaluate the blameworthiness 
of individuals who do commit violence. In regard to the latter, one 
of us previously argued that such a predisposition would affect culp- 
ability only if the risk of violence was increased through a legally 
relevant mechanism; while most jurisdictions would offer no excuse 
or mitigation for a predisposition that resulted from decreased im- 
pulse control, some would allow a partial defence of provocation if 
an individual were more likely to react with violence due to an in- 
creased perceived gravity of a threat to the individual, a possibility 
supported by some evidence in the case of the MAOA-L gene x en- 
vironment interaction. 

One further challenge may arise for the child/adolescent psych- 
iatrist who suspects that a boy in their clinical care may be at high risk 
of maltreatment [34]. Should the threshold for intervention in the 
family be lowered if the boy has the MAOA-L variant? Conversely, 
since disruption of family units is itself not without risk to a young 
person, should the threshold for intervention be raised if the boy’s 
biomarker profile suggests he might be more resistant to one of the 
negative effects of possible maltreatment? 

We will conclude this section by highlighting that the increased 
ability to gain probabilistic information about the risks patients pose 
to themselves or pose to others, and that others pose to the patients 
begs the question of when an individual should have known some- 
thing; for example, they could have learnt more about the risk of sui- 
cide, the risk of bleeding, and the risk of violent behaviour but did 


not and, because of that ignorance, did wrong. For example, if I said 
I did not know that the person was drunk (and therefore at high risk 
of driving dangerously), someone may rightly counter that perhaps 
I should have known. Being blameworthy for not knowing something 
is referred to, in the legal-philosophy literature, as ‘culpable ignorance. 
A useful concept from the literature of culpable ignorance is the 
‘benighting act’: 
‘An initial act, in which the agent fails to improve (or positively im- 
pairs) his cognitive position, [followed by] a subsequent act in which 
he does wrong because of his resulting ignorance: 
(Smith 1983, p. 547). 


Smith uses the example of a clinician who fails to do relevant con- 
tinuing education to keep up with standards in the field and, be- 
cause of that benighting act, wrongs a patient. We can update the 
case to consider a physician who fails to use predictive technology 
and, because they lack that knowledge, wrongs a patient; though we 
discussed the case where the absence of action by the psychiatrist 
might harm the patient (by failing to intervene to minimize the risk 
of suicide), it may be the case that the clinician would actually over- 
estimate the risk of suicide (by relying on traditional risk factors; see 
[18]) and thus wrong the patient by acting, for example, to deprive 
them of their liberty, if only for a night, under the supervision of a 
psychiatric nurse. 

Smith (1983) [36] highlights that theorists differ in how they re- 
spond to culpable ignorance. Some (for example, see [37]), including 
Aristotle, hold the person equally blameworthy for the wrong, as if 
they did it without ignorance. Liberal theorists evaluate the blame- 
worthiness of the individual on the benighting act itself, modulating 
blame according to the likelihood that the act would lead to a wrong. 
Regardless of which stance one takes, the proliferation of predictive 
technologies renders the necessity of taking a stance increasingly 
likely for psychiatrists in training and practice. 





Prediction and distributive justice 


Distributive justice permeates every facet of psychiatry. There 
is a global shortage of psychiatrists (and indeed all mental health 
workers), for example, with only one psychiatrist per 1,000,000 
people in low-income countries (vs. one psychiatrist per 472 people 
in the UK), and even in countries with larger psychiatric work- 
forces, there is a paucity of psychiatrists per capita in rural, com- 
pared to urban, areas [38]. So in this sense, psychiatrists themselves 
are a limited resource that the academies and governing bodies have 
some concern about how to fairly distribute. Individual psychiatrists 
must decide: do I accept only private clients? Do I take public insur- 
ance? Do I spend time only with the patients I think will benefit the 
most from my clinical skills, or do I also take on ‘tough cases’ where 
the chances of my benefiting the person are lower but the person is 
more deeply affected by the psychiatric disorder? 

If we have a limited supply of goods—time, personnel, funding, 
etc.—with which to benefit people, how should we go about 
distributing those goods? What counts as fair, or just, distribution 
can take many shapes. We might think we should distribute all 
goods equally among the eligible beneficiaries. But sometimes this 
strategy can make no one better off and some worse off; if one man 
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had an overcoat on a cold winter day, it would do no one good to 
distribute the cloth equally among a throng of cold individuals—no 
one would receive a large enough piece of cloth to warm themselves, 
and now the man who used to have the coat would also freeze. To 
counter such an objection to increasing equality by ‘levelling down, 
many theories incorporate some element of distributing in order to 
do good, to benefit people. However, if one posits that it is better to 
benefit those who are currently worse off over those who are better 
off, one endorses some version of the priority principle—that one 
should give priority to benefiting the worse off, even if this means 
there is less benefit overall. 

If it is important to benefit those who are worse off, however, one 
must have ways of knowing who is worse off. It would be at best 
callous and at worst a miscarriage of justice to benefit those who 
are better off over those who are worse off. Historically in psych- 
iatry, diagnostic categories, clinical interviews, and demographics 
have been used to identify those who are worse off. This informa- 
tion determines who is given priority on waiting lists for mental 
health services or publicly supported social supports, who is given 
the ‘direct line’ to a psychiatrist’s pager or office and who will need 
to speak with a receptionist for triage, who is directly monitored for 
suicide risk and who is not, or, in forensic psychiatry, who is let out 
on parole and who stays incarcerated. 

If those at higher risk of harm are actually worse off, not ‘merely 
at risk of being worse off’ [15], then as they develop, predictive bio- 
markers and algorithms will increasingly be able to inform us about 
who is at elevated risk of harm, and thus worse off and deserving 
of distributive priority. Since many of the goods discussed would 
benefit someone, regardless of whether their risk is detectable 
through traditional or novel means, then it could be argued that the 
utilization of predictive bioinformatics could support fair distribu- 
tion, regardless of whether the test would change clinical manage- 
ment in the narrower sense. This sort of ‘moral utility’ of predictive 
testing should be considered, in addition to a test’s ‘clinical utility’ 
When we consider this in conjunction with the concept of ‘culpable 
ignorance’ raised earlier in this chapter, it is possible that psychi- 
atric practices could encounter scenarios where it is no longer per- 
missible to allocate certain resources considering traditional clinical 
information only, without utilizing novel bioinformatics methods. 

Just as the development of bioprediction may change what is 
considered as ethical behaviour of psychiatrists, so too may devel- 
opments in e-health. Many academies and public health systems 
have incentives for psychiatrists to practise in underserved areas. 
Often, such programmes come at considerable monetary cost; the 
Headspace programme for youth mental health in Australia came 
under the critical spotlight for the variance in the cost of services 
in remote communities vs. urban communities, for example [39]. 
However, it is generally viewed as permissible for psychiatrists to 
choose to practise in big cities where the population’s access to psy- 
chiatric services is much better; the justice claims in these cases 
might have been judged to have been outweighed by the magnitude 
of the costs to the psychiatrist of satisfying those claims (perhaps 
their family or research is in the city). 

But does the advent of better telepsychiatry and online-mediated 
treatments change this calculus? One might argue that access to 
modern outpatient psychiatric services can largely be accomplished 
online; indeed, emerging evidence suggests that there is little, if any, 
enhanced therapeutic value of having bodies in the same room, 
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especially in the treatment of anxiety and depression [40]. If this is 
true, psychiatrists living in cities may no longer have the same costs 
in serving rural or foreign populations. Conversely, academies and 
public health systems may have less justification in continuing to 
offer incentives for psychiatrists to physically locate their practices 
in underserved areas. Extending this line of reasoning, should na- 
tional health insurance move to save costs (and thus be able to re- 
allocate funds) by eliminating office space for outpatient visits and 
replacing with a telepsychiatry studio altogether? 

Against this possibility, one might raise two objections: (1) even 
if it does not substantively change outpatient experience, having 
psychiatric services with a presence in hospitals is important in 
establishing and maintaining parity between mental health care and 
the rest of health care; (2) as neurotechnologies and biomarkers im- 
prove, psychiatrists will increasingly avail themselves of a hospital’s 
other services, such as molecular pathology, imaging suites, etc., 
that are currently under-utilized in psychiatric practice. If we ac- 
cept these objections, psychiatrists will regrettably have to continue 
to grapple with concerns of geography when evaluating justice re- 
garding access to mental health care. 

Another justice-based issue arises in e-health with the rapid ex- 
pansion of peer-support apps and apps designed to aid in unme- 
diated psychiatric treatment. While there is justified concern that 
peer-support apps are under-regulated, leading to suboptimal sup- 
port for those who use it (for example, one peer-support app advises 
listeners to immediately end the chat if a user mentions suicide or 
suicidal ideation), it is reasonable to think that this family of apps 
may soon be able to provide some mental health services currently 
provided by mental health professionals. Because the time of mental 
health professionals is a limited resource, should mental health pro- 
fessionals strive to maximally use these unmediated tools, thereby 
freeing up the time they themselves currently spend providing those 
services and enabling them to provide to more people the services 
that are not outsourceable? Such a scenario would be supported by 
desires to do the most good and increase access to care for more 
people. 





Generalizing the new ethics 


The pace of change in therapeutics, diagnostics, and prediction in 
psychiatry is poised to enter an inflection point. This change raises 
new ethical challenges for the practice of psychiatry, whether from 
the precision of neuroengineering-based therapies or the embraced 
imprecision of placebos, but also challenges the field’s ethical frame- 
works as our ability to gain probabilistic insight into the mental vul- 
nerabilities of ourselves and others. There is much of moral relevance 
under way that we have not discussed—the Do-It-Yourself move- 
ment in brain stimulation and the rapid progress of brain-computer 
interfaces, to name two examples. We hope, however, to have chosen 
cases that raise transferrable ethical challenges and, hopefully, places 
to start in thinking about, and discussing, how the practice of psych- 
iatry should change in this rising tide of technology. 
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Phenomenology or descriptive psychopathology, that is the know- 
ledge of psychopathological phenomena or symptoms, has a long 
tradition in psychiatry and is much older than the diagnostic sys- 
tems commonly used today. Correctly applied, it forms the basis 
for clinical psychiatry as an empirical science [1]. Because the term 
phenomenology also has a broader, philosophical meaning, the 
term ‘descriptive psychopathology’ is much more commonly used 
nowadays. 

In psychiatry the term phenomenology is sometimes used in a 
wider sense to refer to a method of describing, holistically and her- 
meneutically, all that is characteristic for a person and his situation, 
a notion which is used in anthropological or ‘daseins’-analytical 
psychiatry [2]. This methodological approach, which is not very 
common in modern psychiatry, is based, among other things, on 
the philosophical tradition of the philosophers Husserl and, in par- 
ticular, Heidegger (with components such as ‘essences, ‘interpret- 
ation of senses, ‘existential interpretation of an individual rational 
world, ‘study of structures; etc.). The approach is characterized by 
intuitive holistic perception and understanding of the world and 
the person [3-5]. However, this chapter will not cover this exten- 
sion of phenomenology but rather review descriptive psychopath- 
ology in the stricter sense. 












































In this context, ‘descriptive’ means that symptoms are recorded as 
far as possible without any theoretical assumptions, for example 
with respect to certain disease diagnoses or hypotheses of causes, 
and as far as possible objectively. In clinical diagnostics, the descrip- 
tive psychopathological assessment and the creation of a disease/ 
disorder diagnosis or formulation are ideally performed in two in- 
dependent stages. Nevertheless, it may be useful later in the further 
course of diagnosing a particular patient, once a presumed diagnosis 





has been established, to explore more closely particular symptoms 
that may or may not be relevant for a specific diagnosis. 

Descriptive psychopathology is based on an assessment of symp- 
toms that is as objective as possible, that is, the abnormalities found 
in the experience and behaviour of the patient should preferably not 
depend on the individual examiner, the adherence to a particular 
psychiatric school, the methodology of exploration, the general the- 
oretical attitude (for example, more neurobiological or more psy- 
chological), positive and negative expectations, the psychological 
interaction with the examinee, etc. Moreover, the assessment should 
be determined in generally the same way by other professionally 
competent examiners. For various reasons, for example different 
definitions of symptoms, the completely independent psychopatho- 
logical diagnostic assessment is in danger of failing to achieve this 
ideal. To increase the validity and reliability of the assessment, exam- 
iners use aids such as symptom lists with exact symptom definitions 
or even fully structured interviews, especially in research contexts. 
The clinical psychopathological diagnostic assessment then moves 
increasingly towards being a fully standardized diagnostic approach, 
usually involving the use of scales that no longer only make qualita- 
tive, but also quantitative, statements. When applying these stand- 
ardized assessment methods, the diagnostic concordance of the 
examiners is often additionally improved by ‘rater training’ This use 
of scales is a modern variant of descriptive psychopathology [6] that 
does not have the width and subtle differentiation of classical de- 
scriptive psychopathology. 

Regardless of efforts to improve inter-observer reliability, every 
psychopathological diagnostic assessment—the classical clinical 
one, as well as the standardized one—is threatened by various biases, 
as described in the field of empirical social psychology. These cannot 
be completely excluded, but rather limited by careful conscious con- 
trol of the interview process [7, 8]. 

Systematic distortion of the assessor’s observations can result 
from the following factors in particular: 


e Rosenthal effect: the assessors expectations influence the result of 
the assessment; tendency on the side of the assessor to systematic- 
ally over- or under-rate the degree of disturbance. 
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e Halo effect: the results of the assessment of one characteristic are 
influenced by the assessors knowledge of the patient’s other char- 
acteristics or by the overall impression made by the patient. 

Logical errors: the result of the assessment is influenced by asses- 
sors reporting only those detailed observations that make sense to 
them in the context of their theoretical and logical preconceptions. 


Another limiting factor of a diagnostic assessment is the patient 
him- or herself. The patient may show only limited openness to 
describing psychopathological changes or may have a distorted per- 
ception of them. Besides these main types of self-report distortions, 
one must consider the following: 


e Conscious or unconscious tendencies to exaggerate or conceal 
symptoms. 
e Positive response bias and social desirability effects. 


There are several others causes of discrepancy between observer 
views and patient self-reports. In the field of standardized rating 
scales, examples of the quality and quantity of such discrepancies can 
be found by comparing the results of observer rating scales and self- 
rating scales [7, 8]. The risk of discrepant findings can be reduced 
by having as optimal an interaction with the patient as possible and 
using a competent and sensitive interview technique. 

Descriptive psychopathology is understood as the description of 
observable phenomena or those experienced and reported by the 
patient. It does not mean the phenomena developed under psycho- 
dynamic/psychoanalytic reconstruction. 

It is a general rule of assessment in the context of descriptive psy- 
chopathology that an empathic, understanding attitude based on 
respect is the essential basis for a careful and patient-oriented assess- 
ment of psychopathological findings. To guarantee as objective an 
assessment as possible, it is also important to find the right balance 
between empathy-related closeness and objectivity-related distance. 


Objective ‘description’ of psychopathological 





phenomenaas an ideal standard 


If we designate a psychopathological procedure as ‘descriptive; we 
require that the symptoms be recorded and named as simply as 
possible in a way that is verifiable and clearly organized, without 
letting aetiological assumptions, pathogenetic hypotheses, or inter- 
pretative elements about the individual or social ‘meaning’ of certain 
symptoms (or their meaning as would be typical for the reference 
group) be included in this phase of the diagnostic process. 

Some experts claim that less attention should be paid to symp- 
toms whose presence cannot be ‘determined’ but only inferred. 
This is an echo of the distinction between ‘signs’ and ‘symptoms, 
which is well established in the English-speaking world, but less 
known elsewhere. Thus, DSM-IV defines a sign as ‘an objective 
manifestation of a pathological condition’ that is ‘observed by the 
examiner rather than reported by the affected individual, and the 
symptom as ‘a subjective manifestation of a pathological condition’ 
that is ‘reported by the affected individual rather than observed by 
the examiner’. 

In practice, descriptive psychopathological diagnostic assessment 
no longer so clearly preserves this distinction, and it is no longer 
found in this strict version in DSM-5. This is demonstrated in 


particular by psychomotor symptoms, which refer to the inseparable 
binding of subjective experience that relates to mood with objective, 
directly observable motor behaviour [9]. 

Without question, the impartial assessment of mental phe- 
nomena, that is of what a patient describes and experiences, what 
they remember, what they plan, and the way they act, is a crucial pre- 
requisite for all careful psychiatric practice and research. However, 
it would be premature to consider this objective fully achievable by 
choosing a descriptive approach. 

Incidentally, this approach can be applied in the same way at the 
nosological level. DSM-III, published in 1980, was the first standard- 
ized diagnostic manual that placed the focus on the independence 
of the descriptive from the aetiological level and referred to this ap- 
proach with the rather prematurely chosen term ‘atheoretical’ Since 
then, a process of differentiation has taken place, which highlights 
more clearly that freedom from any theoretical presupposition— 
which is basically impossible—was and is not meant. Rather, 
freedom is sought from implicit assumptions about the cause of 
each symptom or of the respective disorder. In order to prevent the 
emergence or strengthening of scientific prejudices during the diag- 
nostic process, recognition of implicit assumptions is of crucial im- 
portance, be it on the symptom, syndrome, or nosological level [9]. 
‘Description’ can be easily understood as a sober, factual portrayal of 
something ‘objectively’ present, similar to the function of a photo- 
graphic apparatus used by the examiner. The description of psycho- 
pathological symptoms, however, is a communication between the 
patient and the examiner, that is an interpersonal process [4]. This 
has a different meaning for different symptoms—a symptom such 
as ‘disoriented to place’ can be more clearly and ‘objectively’ deter- 
mined than the presence of ‘thought broadcasting, for example. 
However, this does not change the fundamental nature of psycho- 
pathological description as a process that is characterized through a 
relationship, among other things. 

In general terms, this means that the description is highly de- 
pendent on the nature of the symptom, because observable behav- 
iour can be more easily described than internal experiences. Karl 
Jaspers attached great importance to the finding that ‘mental phe- 
nomena’ never show themselves directly (and consequently also 
cannot be directly observed), but only indirectly through language, 
writing, gestures, facial expressions, artistic expression, or behav- 
iour. Therefore, it is all the more important always to be clear about 
what is actually being ‘described’ in the descriptive approach [9], 
that is about: 


e The externally recognizable behaviour of the patient. 

e His own statements about the current experience. 

e Assumptions about the current subjective experience of the pa- 
tient that the examiner makes on the basis of certain perceptions 
and ratings (What? Why this one?), although the patient himself 
perhaps explains them quite differently or even not at all. 

e Information from a third party about the patients behaviour and 
experiences. 


These examples illustrate that in psychopathology, ‘description 
refers to a complex field of activity. Besides the recording of psy- 
chopathological information, it also includes the critical consider- 
ation of the respective sources of the things being described and 
the respective relational context. The procedure is complicated, 








SECTION 1 The subject matter and approach to psychiatry 


among other things, by the fact that the effort to achieve a descrip- 
tive recording of the ‘phenomenon cannot, in principle, be entirely 
separated from a basic understanding of psychopathology and the 
associated view of human beings. The examiner should be aware that 
every psychopathological ‘description’ moves necessarily between 
the poles of the subjective experience of the patient and examiner, 
their relationship, and the objectifying determination of facts. 


Pioneers of descriptive psychopathology 





Karl Jaspers 


Jaspers’ work General Psychopathology [10-12] is still regarded as 
the standard work of descriptive psychopathology. In it, he not only 
gave an excellent description of psychopathological phenomena, but 
also additionally discussed fundamental questions about assessing 
mental phenomena and, going far beyond a mere description of 
symptoms, discussed aspects of the relationship and backgrounds 
of psychopathological phenomena. His distinction between ‘ex- 
plaining’ and ‘understanding’ has meaning still today. 

Thus, Jaspers made a strong distinction between scientific ex- 
planation, which is based on causality, and understanding, which 
aims to recreate the mental thoughts of another person. He also 
spoke in this context of the ‘phenomenological direction’ in psy- 
chopathology, with which he mainly meant the respectful and 
careful engaging of the examiner with the patient's self-portrayal; 
he warned about prematurely following the supposed obligation to 
objectify and standardize the findings. He would not have favoured 
check-box approaches to the identification of symptoms in struc- 
tured interviews. 

Jaspers was fully aware of the importance of a clear terminology for 
psychopathology. His response to the inherent tension, present in each 
diagnostic assessment, between subjective experience and standard- 
ization, was: “The envisioning of mental experiences and states, their 
differentiation and determination, so that one can always mean the 
same with the terms, is the task of phenomenology [12, p. 22 ff]. As the 
first, indispensable step, however, such a phenomenological approach 
provides the examiner only with ‘a number of fragments of the real 
mental experience, and he envisions ‘individual qualities, individual 
states viewed as dormant’ and thus practises ‘static understanding. 

‘Genetic understanding’ appeared to be more important to him in 
the psychopathological assessment. This ‘understanding’ deals with 
the relationship between individual psychological states—in Jaspers’ 
words, with the understanding of ‘how mental processes emerge 
with evidence from mental processes’ [12]. 

An important aspect for Jaspers was the ‘genetic understanding 
of biographical contexts’ with psychopathological phenomena. ‘This 
may seem reminiscent of psychodynamic approaches, but the theor- 
etical context is completely different and makes no psychoanalytic 
assumptions. It is instead based on a ‘common sense psychology’ of 
the relevant own life experiences of the examiner. Jaspers described 
not only clinical-psychopathological phenomena (which he did 
with impressive conciseness and often supplemented with case re- 
ports), but also the basics of a healthy psyche. One could only ap- 
proach assessing the entirety of the mentally healthy or disturbed 
person—this is one of the core ideas—via the uniqueness of each 
patient, which is based on the patient’s biography. However, this en- 
tirety is never fully accessible by scientific means. 


This brief description will have made clear that he was concerned 
with far more than describing the specific mental/psychopatho- 
logical phenomenon, with the holistic assessment of the inner con- 
nection of mental phenomena and their relationship to personal 
experiences. 


Kurt Schneider 


Like Jaspers, Kurt Schneider is another important representative of 
the descriptive approach in psychiatry. Unlike Jaspers, he represents 
a descriptive approach in the narrower sense, and accordingly his 
remarks are less extensive. In continuation of Jaspers’ perspective, 
however, he did not split the mental state into unconnected adjoining 
single elements but preserved an understanding of the overall con- 
text. He called this process ‘descriptive-analytic. Even more so than 
Jaspers, Schneider was interested in carefully clinically justified and, 
if possible, highly selective psychopathological terminology that 
should be the guiding principle for the diagnostic process. He was 
also interested, among other things, in the question of whether char- 
acteristic, or possibly even pathognomonic, symptoms exist for cer- 
tain diseases. Representative of this approach are his remarks about 
‘first-rank symptoms’ and ‘second-rank symptoms, with which he 
paved the way for modern operationalized diagnostics. His most 
famous work Clinical Psychopathology was continually developed in 
individual contributions from the 1920s; the book appeared under 
this title for the first time in 1950; the fifteenth edition appeared in 
2007 [13, 14]. 

Recently, efforts have been made to assess descriptively and 
operationally even more theoretically fraught and not directly 
observable psychopathological conditions, such as aspects of psy- 
chodynamic relationships and defensive strategies [15]. They rep- 
resent attempts supplementary to operational psychodynamic 
diagnostics (OPD) [16] but go beyond traditional classical descrip- 
tive phenomenology. 


Descriptive assessment 


of psychopathological symptoms 





The descriptive assessment of psychopathological symptoms which 
follows is based on the most recent version of the manual by the 
Association for Methodology and Documentation in Psychiatry 
(AMDP) [17, 18]. Earlier editions of the AMDP manual, which was 
originally published in German, have been translated into several 
languages, including English [19]. 

The AMDP system was chosen because it reflects most compre- 
hensively the continental European psychopathological tradition. 
Also, despite all its differentiation, it covers the whole spectrum 
of psychopathology, with a special focus on productive-psychotic 
and affective symptoms, and is easy to use both clinically and sci- 
entifically. In the recent past, enhancements were made in certain 
areas that were previously assessed only somewhat or not at all in 
the AMDP system, such as psychomotor function, basic schizo- 
phrenic disorders, personality traits, and some symptoms common 
in ‘neurotic’ disorders. In order to address these shortcomings, in 
particular with a view to their use in psychiatric research, additional 
‘modules’ for the AMDP system were also developed [20]. 
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Disturbances of consciousness 


‘Disturbance of consciousness’ is the generic term for any changes 
in the level of consciousness. A distinction is made between quanti- 
tative changes in consciousness (reduced awareness in the sense of 
the sleep-wake scale) and qualitative changes (reduced, limited, and 
shifted consciousness). 

A quantitative disturbance of consciousness (reduction of vigi- 
lance) is assumed if a patient appears dazed or drowsy and a reduced 
perception of external stimuli can be detected. The degrees of im- 
paired consciousness can be described as follows: 


e Decreased clarity: the patient is very contemplative, slowed down, 
and restricted in acquiring and processing information. 

e Somnolence: the patient is abnormally drowsy but is easily 
woken up. 

e Stupor: the patient is asleep, and only strong stimuli can wake him. 

e Coma: the patient is unconscious and cannot be woken up. In a 
deep coma, the pupillary, corneal, and tendon reflexes are absent. 


Qualitative disturbances of consciousness 


e Clouded consciousness: lack of clarity of awareness of oneself or 
the environment. The associations of experience are lost, and con- 
sciousness is as if fragmented. Thinking and acting are confused. 
Clouding of consciousness is easily recognizable for anyone who 
has seen this state. 

e Narrowed consciousness: narrowing of field of consciousness, for 
example by focusing on a specific experience (intra-personal or 
in the environment), mostly combined with reduced response to 
stimuli (for example, the epileptic semi-conscious state). The ex- 
perience is changed in a dream-like way. Complicated and out- 
wardly organized actions, such as travel, are nevertheless still 
possible. The assessment of narrowed consciousness can be prob- 
lematic precisely because of the ability to perform outwardly or- 
ganized actions. 


Changed consciousness: change in consciousness, compared 
with the usual daily consciousness. There is a feeling of increased 
intensity and brightness and of increased awareness regarding 
alertness and perception of happenings inside or in the outside 
world and/or a feeling of magnification of the space or depth 
that can be consciously recognized (expansion of conscious- 
ness). This condition is difficult to detect and only possible on 
the basis of subjective information provided by the person being 
examined. 


Disturbances of orientation 


Lack of knowledge about temporal, spatial, situational, and/or per- 
sonal circumstances. Depending on the intensity of the disturb- 
ance, one can differentiate between restricted orientation or loss of 
orientation. 

The following forms are distinguished: 


e Time: patients do not know the date, day, year, or season. 

e Place: patients do not know where they are. 

e Situation: patients are not aware of the situation in which they cur- 
rently find themselves (for example, examination in the hospital). 

eè Self: patients lack knowledge about their own name, date of birth, 
and other important personal biographical circumstances. 


Disorders of attention and concentration 


Disorders of attention and concentration are defined as an impair- 
ment in the ability to fully direct the perception mediated by the 
sensations or perceptions to focus on a particular subject. 


e Attention disorders: scope and intensity of the assimilation of per- 
ceptions, ideas, or thoughts are impaired. 

e Concentration disorders: ability to maintain attention on a spe- 
cific activity or a particular object or situation is impaired. 


The course of the interview will provide indications of whether 
patients are restricted in their ability to fully focus their perception 
on the information mediated by their sensations or in their ability 
to concentrate on a particular object or situation. Abnormalities 
in writing, such as omissions or duplication of letters, may also 
provide clues. 


Perception disorders 


Perception disorders are defined as an impairment of the ability to 
understand the relevance of perceptual experiences and to inter- 
connect them. Perception may be wrong, slowed or completely 
missing. 


Retention and memory disorders 


These disorders are defined as a reduced ability to recall new and 
old experiences. Traditional psychopathology differentiates between 
retention and memory disorders. Modern psychological theories of 
memory differentiate between ultra-short (seconds), short-term 
(minutes), and long-term memory. 

Disorders of memory functions can be generally assessed during 
the evaluation interview. Can the patient remember the examiner's 
questions? Do they know what was discussed in an earlier part of 
the conversation? They may report spontaneously about subjectively 
perceived forgetfulness. Perhaps they now have to use written re- 
minders to help with shopping or in other life situations, whereas 
they previously did not make any such notes. The description of the 
life history and current life situation often provides clear indications 
of memory gaps, which are then sometimes filled by confabulations. 
Time lattice defects, that is the inability to report biographical facts 
in the correct temporal order, also provide information on memory 
disorders. 

Besides the question of subjectively perceived disorders of re- 
tention and memory, such disorders are assessed by a preliminary 
clinical evaluation. Objectively observable behavioural traits are of 
greater importance than the patient's self-assessment, which can be 
marked by affect-related feelings of insufficiency. 


e Retention disorders refer to a recall period of up to about 10 
minutes. 

e Memory disorders refer to a recall period of longer than about 
10 minutes. A distinction is made between recent and remote 
memory disorders: 

a Short-term memory disorder: the reduction or loss of the ability 
to retain impressions or experiences for up to 60 minutes. 

= Long-term memory disorder: the reduction or loss of the ability 
to retain impressions or experiences that took place longer than 
60 minutes ago or even further in the past, including biograph- 
ical events, for example. 
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Memory disorders also include amnesia. 

Amnesia: memory gaps limited to a particular event or time. With 
respect to a traumatic event (for example, brain damage), a distinc- 
tion is made between retrograde amnesia, in which a certain period 
of time before the event is affected, and anterograde amnesia, in 
which a certain period of time after the event is affected. In antero- 
grade amnesia, the duration of the memory gap is usually longer 
than the duration of unconsciousness. With regard to the period af- 
fected by the memory gap, one can distinguish between total and 
lacunar (episodic) amnesia. 


e Confabulations: memory gaps are filled with ideas that patients 
themselves view as memories. 

e Paramnesias (delusional memories): memory disorders with 
fantasy memories. These also include the so-called false recog- 
nition, for example the feeling of having experienced certain situ- 
ations previously (‘déja vu’) or never having experienced them 
(jamais vu’). 

e Transient global amnesia (TGA): acute, transient episode of reten- 
tion and memory disorders of unclear aetiology. Routine actions 
are possible. Amnesia exists for the duration of the episode. 


Disorders of intelligence 


Intelligence is a complex ability of people to find their way in un- 
familiar situations, to capture meaning and relationship con- 
texts, and to meet new requirements through logical functions. 
Intellectual impairment disorders can be congenital or acquired 
later in life. 

The main indications of the patient’s intellectual level are already 
apparent from their life story, for example the type of school educa- 
tion, being held back a year in school, school-leaving qualification, 
professional status achieved, recreational interests. Also the style of 
speaking and thought processes (abstract level) during the interview 
allow orientating conclusions to be drawn. A decrease in profes- 
sional standing and a reduction in the intellectual level of leisure ac- 
tivities, compared to the past, may also indicate an acquired mental 
retardation, after other factors have been excluded. 


Formal thought disorders 


Formal thought disorders are disorders of thought sequence. They 
are subjectively perceived by the patient or express themselves in 
verbal utterances. 

The following forms are distinguished: 


e Retarded thinking: the thought process is slow and delayed, ap- 
pears to be tedious for the patient and is often experienced sub- 
jectively by the patient as inhibited thinking. 

e Circumstantial thinking: thinking is circuitous; irrelevant thoughts 
are not separated from relevant ones. The main point is lost in the 
portrayal of insignificant details. 

e Restricted thinking: restriction of the substantive scope of thinking, 
attached to an issue or a few topics. 

e Perseveration: repetition of the same thought contents and adher- 
ence to previous words or information that were used but now no 
longer make sense. 

e Rumination: constantly busy with certain, usually unpleasant, 
thoughts that are not experienced by the patient as foreign and 
usually are related to the current life situation. 


e Pressured thinking: the patient feels under excessive pressure from 
many ideas and also constantly recurring thoughts. 


Flight of ideas: excessive imaginative thought. Thinking no longer 
follows a strict direction but changes or loses the goal because of 
interruptive associations. 


Tangential thinking: the patient does not respond to the question 
and talks around or past the point, although it is apparent from the 
response and/or situation that they have understood the question. 


Thought blocking: sudden interruption of an otherwise fluent train 

of thought for no apparent reason. 

e Paralogia: the sentence structure is grammatically still intact, but 
the consequence of the intellectual context and/or the level of de- 
tail of what is said is reduced. 

e Incoherence: erratic, dissociated thought process, in which the lo- 
gical and associative dimensions are missing. In severe forms, the 
grammatical sentence structure is lacking (paragrammatism), all 
the way to an incomprehensible mix of words and syllables (‘word 
salad, schizophasia). 

e Neologisms: building of new phrases or words that do not meet the 

usual language conventions and often are not easily understood. 


Delusions 


A delusion is an uncorrectable false assessment of reality that occurs 
independently of experience and which the patient holds onto with 
subjective certainty. The conviction is therefore contrary to reality 
and to the conviction of others. Delusional phenomena can occur in 
different forms and with different content. 

Delusions belong to the group of content thought disorders. They 
are often hidden and, if suspected, need to be specifically explored 
and differentiated from overvalued ideas. In the latter, strongly 
emotional thoughts about experiences dominate thinking in a 
non-objective and one-sided way, but such ideas are not absolutely 
incorrigible. 

Depending on the type of delusion formation, the following forms 
are distinguished: 


e Sudden delusional thoughts: sudden occurrence of delusional 
beliefs. 

e Delusional perceptions: a normal perception gives rise to an inter- 
pretation of delusional and abnormal importance. 

e Explanatory delusions: delusional conviction for the explanation 
of psychotic symptoms (for example, hallucinations). 


The following terms are useful for the further characterization of 
the delusional experience: 


è Delusional mood: sense of the uncanny, the ambiguous, from 
which delusional ideas arise. General, unclear feeling that some- 
thing is wrong and is in the air, and that everything concerns the 
person. The events in the surroundings seem strange and unusual 
to the person. Because they do not know what is happening to 
them and what is going on, they become anxious, perplexed, and 
bewildered. A delusional mood frequently precedes delusional 
perception. 

e Delusional dynamics: affective participation in the delusion; the 
driving force and strength of the emotions in the delusion. 

e Systematic delusions: delusional ideas are embellished with logical 
or paralogical associations to a delusional building. 
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Depending on the content of the delusion, the following can be 
distinguished: 


e Delusions of reference: random events are ascribed particular im- 
portance for the self. 

e Delusions of persecution: patients have delusions of being the 
target of persecution. 

e Delusions of jealousy: delusional conviction of being cheated on or 
deceived by one’s partner. 

e Delusions of love: delusional conviction of being loved by another. 

e Delusions of guilt: delusional conviction to have defied, for ex- 
ample, God’s commandments or a higher moral instance. 

e Delusions of impoverishment: delusional conviction that the finan- 
cial basis of life is threatened or lost. 

e Hypochondriacal delusions: delusional conviction that health is 
threatened or lost or that one has a specific physical illness. 

e Nihilistic delusions: delusional conviction that all is lost, every- 
thing is forlorn, everything is hopeless. 

e Delusions of grandiosity: delusional overestimation of self all the way 
to identification with famous personalities from the past or present. 

e Delusions of memory: delusional falsification of memories. 

e Doppelgänger delusions: delusional notion that there is a 
doppelganger. 


Hallucinations 


Hallucinations are perceptual experiences without an appropriate 
external stimulus, but which are still considered to be real sensa- 
tions. They are also referred to as false perceptions and can occur in 
all sensory domains. The degree to which they are convincingly real 
may differ. 

If the unreality of the hallucination is recognized, one refers to 
pseudohallucinations. These are to be differentiated from illusions, in 
which something that really exists is believed to be something else 
than it actually is (misreading of sensations). 

Depending on the sensory area, one differentiates between the 
following: 


e Auditory hallucinations: they can range from unformed, elem- 
entary acoustic perceptions (acoasma) to hallucinatory experi- 
ences of complicated acoustic phenomena (for example, hearing 
voices). 

e Optical hallucinations: they can range from unformed, elementary 
optical illusions (photomes) to the hallucinatory experience of 
crafted scenes. 

e Olfactory and gustatory hallucinations: patients with a delusional 
fear of poisoning claim, for example, to smell gas. 

e Bodily hallucinations: bodily perceptions are not perceived as 
being due to external stimulation. 


Hypnagogic hallucinations 


These are optical and acoustic hallucinations that occur when people 
are half asleep, that is when waking up or falling asleep. They can 
also occur in healthy people without mental illness, as can any hal- 
lucinations in various borderline situations (for example, in case of 
sensory deprivation and meditation). 


Other disorders of perception 


Unlike hallucinations, these changes in perception are usually much 
easier to assess because they do not appear to patients as being so far 
from normal mental experience. 


e Change in the intensity of perceptions: sensory impressions are 
more colourful, lively, colourless, or hazy. 

e Micro-/macropsia: objects are perceived to be smaller, or further 
away, or closer. 

e Metamorphopsia (dysmorphopsia): the colour or shape of objects 
is perceived to be changed or distorted. 


Disorders of ego 


Disorders in which the egocentricity of experience is altered (de- 
realization, depersonalization) or the boundary between self and the 
environment appears to be porous. 

One differentiates between the following: 


e Depersonalization: the own ego or bodily parts are perceived as 
being foreign, unreal, or changed. 

e Derealization: the environment appears to patients as unreal, 
strange, or spatially altered. 

e Thought broadcasting: the patient complains that their thoughts 
no longer belong to them alone and that others are sharing them 
and know what they think. 

e Thought withdrawal: patients have the feeling that thoughts are 
being taken away from them. 

e Thought insertion: patients are of the opinion that their thoughts 
and ideas are externally inserted, influenced, guided, and 
controlled. 

e Other feelings of alien influence: patients are of the opinion that 
their feelings, aspirations, will, and actions are externally made, 
directed, and controlled. 


Disorders of affect 


The realm of affect includes the usually only brief emotions (‘waves 
of emotion, for example, anger, rage, hate, joy) and the longer-lasting 
moods (for example, depression). 


e Affective lability/mood lability: rapid change in affect or mood. 

e Affective incontinence: lack of control of expressed emotions. 

e Blunted affect: state of low affect and emotional responsivity. 
Patients appear indifferent, emotionally restrained, listless, and 
disinterested. 

e Feelings of loss of feeling: painfully experienced lack or loss of af- 
fective emotion. 

e Affective rigidity: reduction in the ability to modulate affect. 
Patients remain in certain moods or affects, regardless of the 
external situation. 

e Inner restlessness: patients complain that they are emotionally 
moved and are agitated or tense. 

e Dysphoria: sullen mood. 

e Irritability: tendency for aggressive emotional outbursts. 
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e Ambivalence: contradictory feelings towards a particular person, 
idea, or action exist side-by-side and lead to a tense state. 

e Euphoria: state of excessive well-being and of pleasure, joy, confi- 
dence, and increased vitality. 

e Foolish affect: silly, empty cheerfulness with a touch of simple- 
mindedness, foolishness, and immaturity. 

e Depressed mood: depressed, rather negative state in the sense of 
dejection, sadness, listlessness, and hopelessness. 

e Loss of vitality: reduced general feelings of energy and vitality, of 
physical and mental freshness, and of being unimpaired. 

e Feelings of inadequacy: feeling of having no value and of being 
inept and incapable. 

e Exaggerated self-esteem: the feeling of being particularly valuable 
and particularly competent. 

e Parathymia: inappropriate mood; expressions of emotion and 
experiences do not match. 


The prevailing mood and emotions can be evaluated in the course 
of the interview, as long as the interview is sufficiently long and gives 
patients the opportunity to talk about their emotions. Through tar- 
geted exploration, one can try to obtain a differentiated description 
from patients about their affective state. 


Compulsions, phobias, anxiety, 


and hypochondriasis 





e Fear: feeling of threat and danger, usually accompanied by vegeta- 
tive symptoms such as palpitations, sweating, shortness of breath, 
tremor, dry mouth, or gastric pressure. 

e Phobias: fear of an object or situation. 

e Suspiciousness: fear that one is the object of someone's scheming. 

e Hypochondriasis: concern about one’s own health that cannot be 
objectively explained but that is tenaciously held on to. 

e Obsessive thoughts: thoughts that cannot be suppressed and that 
are either senseless or whose persistence and penetrance are per- 
ceived to be meaningless and mostly distressing. 

e Compulsive actions: actions perceived as meaningless and mostly 
distressing that usually cannot be suppressed; mostly due to com- 
pulsive impulses or compulsive fears. 


Disturbances of drive and psychomotility 





This term summarizes all disorders that affect the energy, initiative, 
and activity of a person (drive) and the overall movements affected 
by mental processes (psychomotility). These disorders are usually 
diagnosed by observing the patient. 

The following types are distinguished: 


e Lack of drive: lack of energy and initiative, can be recognized by 
the scarcity of spontaneous motor activity and lack of activity. 

e Inhibition of drive: in contrast to lack of drive, patients with in- 
hibited drive do not experience their initiative and energy as 
reduced, but rather as inhibited. “Everything is a little harder 


than usual; it’s as if Pm inhibited, but so far hardly anyone has 
noticed’ 

e Stuporous: motor immobility and lack of reactions despite normal 
neurological findings (in contrast to coma). 

e Mutism: ranges from taciturnity to not speaking, even though or- 
gans of speech and the ability to speak are intact. 

e Logorrhoea: excessive talkativeness. Because of an insatiable urge 
to speak, no meaningful communication is possible with the pa- 
tient. Patients ignore or reject attempts to interrupt them. 

e Increased drive: increased activity and initiative during organized 
(targeted) activity. Patients express numerous wishes and plans, 
which are only partially put into action. They are constantly active, 
are not impressed by counterarguments and ignore or do not care 
about personal consequences. 

e Motor restlessness: aimless and undirected motor activity that can 
increase up to a frenzy. Patients are constantly moving and can 
therefore have hardly any or no normal social contacts. During the 
evaluation they cannot stay sitting on the chair and have to get up 
and walk up and down. 

e Automatisms: patients perform automatic actions which they de- 
scribe as not being intentional. These include negativisms (in re- 
sponse to a request, they automatically perform the opposite or 
nothing at all), automatic obedience (automatic following of in- 
structions), and echolalia/echopraxia (everything a patient hears 
or sees is repeated or imitated). 

e Indecisiveness: simultaneous, contradictory impulses make de- 
cisive actions impossible. 


Stereotypy: language and motor expressions that are repeated in 
the same way and seem to be meaningless. 


Tics: uniformly recurring, rapid, and involuntary muscle twitches, 

possibly with expressive content. 

e Paramimia: Mimicking behaviour and affective experience do not 
concur. 

e Mannerisms: strange, unnatural, contrived, posing behavioural 
traits. 

e Histrionics: patients give the impression that they are presenting 
themselves in a way that dramatizes their situation or symptoms. 

e Aggressiveness: tendency to act violently. 

e Social withdrawal: reduction in social contacts. 

e Social activity: expansion of social contacts. Patients approach 

many people, frequently cling indiscriminately to people, lack dis- 

tance and are always on the go and querulous. They constantly 

talk to strangers and do not notice when they annoy others. The 

environment reacts negatively. 


Some of these symptoms are traditionally regarded as catatonic 
symptoms. These symptoms are particularly common in the con- 
text of the catatonic subtype of schizophrenia and are divided into: 


e Psychomotor hyperphenomena: psychomotor agitation, move- 
ment and speech stereotypes, automatic obedience (echopraxia, 
echolalia). 

e Psychomotor hypophenomena: blocking, stupor and mutism, 
negativism, catalepsy (remaining in a position with passive bodily 
posture), stereotypical positions, and flexibilitas cerea (wax-like 
flexibility when moved passively). 


CHAPTER 6 Foundations of phenomenology/descriptive psychopathology 


Summarizing psychopathological findings: the 
psychopathological state (mental state examination) 


At the end of the exploration, the symptomatology is summarized as 
a psychopathological evaluation. One tries to create a picture of the 
patient’s current psychopathological condition in an abstract, and 
yet still concrete enough, manner. 

One usually starts with the appearance (habitus, external pres- 
entation, physiognomy, and also psychomotility and drive), be- 
cause this is the easiest to assess. Then the specific behaviour and 
speech (way of speaking, sound of the voice, modulation, spontan- 
eity) during the interview are described. Thereafter (if applicable!), 
details are given on changes in consciousness, attention, attitude, 
orientation, memory, affect, and drive (affective contact, intensity 
and modulation of affective reactions, basic mood, mood swings, 
compulsive needs, will control, etc.). This is followed by a discus- 
sion of disorders of perception (including hallucinations), disturb- 
ances of formal thought and thought content (delusions, obsessive 
thoughts), and disorders of ego. 

One should note that the summary is not just a list of psycho- 
pathological terms with a note on whether they are not at all present 
or mild, moderate, or severe. Rather, one must create a true picture 
of the current mental state of the patient. 

One should also go beyond these areas and provide information 
on possible demonstrative traits or simulation/dissimulation ten- 
dencies, malaise and illness insight, and special risks. 


Conclusions 


Descriptive psychopathology, or the knowledge of psychopatho- 
logical symptoms and abnormal experiences and behaviours, is 
the basis for clinical diagnosis in psychiatry. The symptoms or syn- 
dromes recorded within the descriptive psychopathological assess- 
ment can be used in the clinical or scientific context, for example in 
the sense of a syndrome diagnosis [21]. They can also serve to make a 
diagnosis at a higher level of diagnostics by using certain traditional 
nosological entities or modern operationalized disease/disorder con- 
cepts such as those defined in ICD-10 or DSM-5, for example. While 
the criteria for the different diseases/disorders can change with each 
revision of the classification systems, as happened, for example, in the 
transition from DSM-IV to DSM-5 [22, 23], the description of the 
symptoms almost always remains stable. Thus, descriptive psycho- 
pathology represents the basis of our practice as clinical psychiatrists, 
and we should use it in everyday clinical practice, as well as in clinical 
research. Although knowledge of, and competency in, traditional de- 
scriptive psychopathology have tended continuously to decrease as 
part of the development of the operational diagnosis systems such as 
ICD-10 and especially DSM-IV and DSM-5 [24], descriptive psycho- 
pathology is still an important tool for clinical psychiatry [25]. 

The Research Domain Criteria (RDoC), developed by the 
National Institute of Mental Health (NIMH), tend to greatly limit 
the importance of classical descriptive psychopathology and clinical 
findings for research approaches [26] (see Chapter 8). The reason 
is that the RDoC focus more on reductionist dimensions detect- 
able at a neuropsychological level as the target of neurobiological 
research, rather than on complex psychopathological syndromes. 
This approach may make it easier to achieve research results, but 


the transformation of these results into the more complex constructs 
(syndromes) of clinical psychopathology is difficult; for example, a 
‘depressive syndrome’ is more than just a score on a ‘negative valence 
system. 

Psychopathological findings represent the core of psychiatric 
diagnosis. This statement, which, at first glance, appears obvious, 
is nowadays no longer a general consensus. In particular, other 
diagnostic procedures, such as neuropsychological, neurophysio- 
logical, biochemical, genetic, and imaging procedures, have greatly 
increased in relevance and authority through the scientific devel- 
opments of the last decades. We stand at an important historical 
juncture where these diagnostic methods have increased in import- 
ance but have not come close to replacing the psychopathological 
assessment. Whether they ever will is an intriguing question for the 
future. 
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The classification of medical conditions has historically included 
the naming (nomenclature) and description (nosology) of recog- 
nized illnesses, and the placement of all such conditions into an or- 
ganizational structure (taxonomy—classifications) that recognizes 
similarities and boundaries among these conditions. Similar efforts 
have been useful for other scientific areas such as chemistry, with 
the periodic table of elements, and biology, with the recognition and 
organization of genera and species in the classification of Linnaeus. 

With the establishment of asylums for the mentally ill in the eight- 
eenth and nineteenth centuries, there was an opportunity to system- 
atically observe the course of psychiatric illnesses, conduct medical 
autopsies, and advance new classification structures. Alzheimer’s 
recognition of pathological changes in brains of patients with de- 
mentia at autopsy became a biological standard, but Emil Kraepelin 
was unable to identify parallel changes for dementia praecox. 
However, Kraepelin’s descriptive classification, based on symptom 
expression and the course of illness, enabled an important separ- 
ation of schizophrenia and affective psychoses, and his textbooks 
were widely influential. For asylum directors in the nineteenth cen- 
tury, there was a need to collect statistical information on the types 
of disorders affecting their patients, and these rudimentary classifi- 
cations were then used to collect prevalence data by census takers. 
However, the earliest nineteenth-century international medical 
statistical classifications focused primarily on listing causes of death 
(and later diseases) that could aid in standardizing vital statistics re- 
porting for public health monitoring. 

At the end of World War II, the United Nations (UN) was es- 
tablished, including the World Health Organization (WHO) as its 
specialized agency in health. All member states of the UN, which 
then became signatories to the WHO, agreed to collect common 
mortality statistics on the causes of death, and morbidity statistics 
of known medical and mental disorders to develop comparable na- 
tional and international health statistics. The sixth revision of the 
Manual of International Statistical Classification of Diseases, Injuries, 
and Causes of Death (ICD-6) was approved by the World Health 
Assembly, the WHO's governing body, in 1948. Relatively minor re- 
visions were made in the Seventh Revision in 1955, followed by a 
more extensive Eighth Revision completed in 1965. The increased 
use of the classification for medical records resulted in individual 





countries, such as the United States, making ‘clinical modifications; 
which involved additional codes for both outpatient and inpatient 
hospital use. This continued for the ninth edition in 1977 and the 
tenth edition in 1992 [1]. 

















Although the American Psychiatric Association (APA) contributed 
to the ICD-6 mental disorders chapter, it also published a separate 
Diagnostic and Statistical Manual of Mental Disorders (DSM-1) 
in 1952 that more closely reflected Adolf Meyer’s view of mental 
disorders as psychobiological reactions to critical life and devel- 
opmental events. In preparation for ICD-8 in 1965, the WHO rec- 
ognized the wide disparity in different national nomenclatures and 
lists of mental disorders used throughout the world. The WHO 
commissioned a British psychiatrist—Erwin Stengel—to survey 
all the major national and academic classifications available at the 
time as the basis for proposing a feasible international mental dis- 
order classification in future editions of the ICD. Because of the lack 
of knowledge and differing opinions regarding the pathology and 
aetiology, he suggested using ‘operationalized definitions’ of mental 
disorders, based on observable syndrome criteria that could be re- 
liably reported. In bypassing disagreements about the aetiology of 
these syndromes, he suggested that such an approach could lead 
to a greater measure of agreement about the value of specific treat- 
ments and facilitate a broad epidemiological approach to psychiatric 
research [2]. 

Unfortunately, the recommendations of Stengel were not followed 
for ICD-8 in 1965 or the parallel second edition of the DSM pub- 
lished by the APA in 1968. Nonetheless, there were several factors 
which led to the eventual adoption of Stengel’s recommendations. 
The statistical classification of mental hospital admission diagnoses 
in the United States and the UK demonstrated marked differences in 
the prevalence rates of schizophrenia and manic depressive disorder. 
A study to evaluate the basis for this discrepancy showed that the use 
of common operationalized definitions’ of these two disorders and a 
structured psychiatric interview greatly increased the comparability 
of prevalence rates for these mental disorders [3]. After this study, 
the WHO realized that additional guidance was needed to improve 
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comparable clinical applications of ICD-8 diagnoses and published a 
glossary of terms in 1974—a glossary that was fully incorporated into 
ICD-9 in 1977. At about the same time, psychiatrists at Washington 
University, St Louis embarked on an effort to ‘operationalize’ diag- 
nostic criteria for 16 mental disorders (the Feighner criteria) in what 
was referred to as a neo-Kraepelinian or descriptive approach to 
psychiatric diagnosis. The approach was recognized as useful for a 
proposed NIMH longitudinal study of depressive disorders and was 
modified to become the Research Diagnostic Criteria (RDC) [4]. 
When Spitzer was also selected to be Chair of the DSM-III Task Force 
to prepare an edition comparable to the ninth edition of the ICD, the 
Task Force decided to go beyond the ICD-9 glossary of terms and 
introduce explicit symptom criteria for each psychiatric disorder— 
using the RDC as the model for the entire DSM-III [5]. 

Following the DSM-III publication in 1980, the WHO Division of 
Mental Health was contacted by the US Alcohol, Drug Abuse, and 
Mental Health Administration (ADAMHA) to develop a joint effort to 
bring greater international agreement on definitions and ‘operational- 
ized criteria’ for mental disorders. A series of international conferences 
was supported, as was a major international conference of psychiatric 
leaders from 37 countries held in Copenhagen in 1982 [6]. An agree- 
ment was reached at this meeting for these countries to work jointly 
with the WHO on the development of the Mental and Behavioural 
Disorders chapter of ICD-10 that would also follow the recommenda- 
tions of Stengel and the model most fully realized at that time by 
DSM-III. Subsequently, a co-operative agreement was reached with 
the WHO and NIMH to support collaboration with the DSM-IV Task 
Force and the development of three research instruments that would 
facilitate assessments of both DSM-IV and the newly ‘operationalized’ 
ICD-10 diagnoses [7]. These included the Composite International 
Diagnostic Interview (CIDI) for epidemiological studies that was 
based on the Diagnostic Interview Schedule (DIS) used in the NIMH 
Epidemiological Catchment Area Study—the first study to realize the 
epidemiological research potential of operationalized diagnostic cri- 
teria (in DSM-III) anticipated by Stengel [8]. The other two instruments 
included the Schedule for Clinical Assessment in Neuropsychiatry 
(SCAN), based on the Present State Examination (PSE) used in the 
United States/UK study, and the International Personality Disorders 
Examination (IPDE), based on Loranger’s Personality Disorder 
Examination. The most widely used of these instruments has been the 
CIDI, which, with some modifications, became the assessment instru- 
ment for the extensive World Mental Health Surveys [9]. This work 
substantially influenced guidance published by the WHO for ICD-10 
Mental and Behavioural Disorders, including the Clinical Descriptions 
and Diagnostic Guidelines [10], intended for use in clinical settings, and 
the Diagnostic Criteria for Research [11]. 

When the co-operative agreement between the NIMH and the WHO 
for ICD-10 diagnostic instruments was completed in 1992—it was ex- 
tended to 2001 to develop the WHO Disability Assessment Schedule 
(WHO-DAS) [12]—an assessment instrument that would support the 
International Classification of Functioning and Disability (ICF) [13]. 





DSM-5 and ICD-11 classification development 


In 1999, the APA initiated a review of DSM-IV [14] and ICD-10 
[1, 10, 11] approaches to mental disorder diagnosis and invited the 


WHO, the World Psychiatric Association, and the National Institutes 
of Health (NIH) to join in this effort. Although the exercise resulted 
in a monograph entitled A Research Agenda for DSM-V [15], it also 
served as the basis for a co-operative agreement application from the 
APA and the WHO to three NIH institutes entitled Developing the 
Research Base for DSM-V and ICD-11 [16]. From 2004 to 2008, there 
were 13 conferences, involving about 400 clinicians and scientists 
from around the world to review the evidence base for major groups 
of mental disorders and a review of the public health implications of 
changes in psychiatric classification [17]. The conferences, a list of 
conference series publications, on ‘developing a research agenda for 
DSM-5, are available on the web at http://www.dsm5.org. 

The DSM-5 Task Force Chair and Vice-Chair were appointed in 
2006, with the Task Force that included work group chairs and con- 
sultants in 2007, and the 13 diagnostic work groups involving a total 
of about 160 multi-disciplinary and international members were all 
vetted for conflict of interest issues by the APA Board of Trustees 
and fully functioning by 2008. Three drafts of proposed diagnostic 
criteria were posted on the http://www.dsm5.org website in 2010, 
2011, and 2012. There were over 13,000 recommendations from 
clinicians, research investigators, and the general public received 
on the http://www.dsm5.org website in response to these postings. 
In addition, there was a remarkable level of international media 
interest in the process, and thousands of petitions, e-mails, and let- 
ters were received about proposed changes for selected disorders. 
Field trials to assess the reliability of diagnoses and dimensional 
measures were conducted in 11 academic settings [18-20] and 
in over 600 routine clinical practice settings to assess the clinical 
utility of the revisions [21]. A multi-level review process was estab- 
lished by the APA Board of Trustees, and DSM-5 was approved by 
the Board of Trustees in December 2012 and published for release 
in May 2013 [22]. 

As with previous editions of the DSM and ICD, it was expected 
that there would be extensive consultation between the two classi- 
fications and that they would be published at about the same time. 
The ICD-11 revision process was launched in 2007, and a DSM-ICD 
harmonization co-ordinating group was organized to use the jointly 
developed research base from the conferences to enhance the con- 
sistency of DSM and ICD revisions for clinical guidance. A DSM-5 
Task Force initiative to develop a more useful and evidence-based 
organizational structure of the mental disorder classification was 
converted into a joint DSM-ICD effort to examine the degree to 
which individual disorders and groups of disorders had been ‘valid- 
ated’ in research studies. An expanded set of ‘validity criteria’ from 
the one proposed in 1970 by Robins and Guze was applied in a series 
of analyses and papers published in 2009 [23]. 

It was readily apparent that the alignment of multiple validators 
was much more meaningful for larger groups or disorder spectra 
than for individual categorical diagnoses. Asa result, there was agree- 
ment that both DSM-5 and ICD-11 would—to the extent possible 
given the different conventions and constituencies of the two sys- 
tems—share a common organizational structure or ‘metastructure’ 
that would be reflective of the evidence base accumulated in this ex- 
ercise. However, since the ICD-11 process continued for more than 
6 years after publication of DSM-5, several significant additional 
structural changes in the mental disorders chapter have been made 
in the context of the larger ICD-11 revisions. 


In the three decades between the publication of DSM-III and 
DSM-5, there was a remarkable increase in the amount of epi- 
demiologic, clinical, and basic neuroscience research on mental 
disorders. This research has demonstrated the high levels of mental 
disorder comorbidity, similarities in pharmacologic and psycho- 
social treatment effectiveness across diagnostic boundaries, and an 
increasingly complex set of genetic and pathophysiological correl- 
ates with a spectrum of clinical syndromes defined in DSM-III to 
DSM-IV as separate and distinct disorders. The cumulative effect of 
this research was to support a reconceptualization of the approach 
to classifying mental disorders that emphasized developmental 
similarities, common neurobiological correlates, and common 
phenomenological features that emerged from factor analyses of 
clinical symptoms and ‘personality’ traits in both general and clin- 
ical populations. 

This revised approach appears to be supportive of conceptual- 
izing clinical disorders as central tendencies in more continuous 
syndromes with multi-causal aetiologies. It acknowledges the 
heterogenous nature of current categorical mental disorder diag- 
noses, such as major depressive disorder, which requires any five 
out of nine diagnostic criteria for diagnosis and allows up to 256 
different combinations with 3-4 different levels of severity. In add- 
ition, there are now specifiers for anxiety symptoms and contextual 
factors such as peri-partum status in females that add to the het- 
erogeneity of this disorder. The inclusion of a ‘clinically significant 
distress or disability’ threshold criterion for almost all DSM-IV 
and DSM-5 disorders results in adding mood and anxiety symp- 
toms for virtually all disorders. The overlap of mood and anxiety 
disorder symptoms with stress-induced disorder criteria for a con- 
dition like PTSD further reduces the ability of strict categorical 
diagnostic criteria boundaries to identify homogenous clinical 
populations. 


Conceptual approach to linking categorical and 
dimensional diagnostic assessments 


The emergence of a new understanding of mental disorders concep- 
tualizes them as the likely result of multiple genetic and environ- 
mental exposure factors, rather than discrete causes such as single 
genes (for example, Down's syndrome—due to trisomy 21), nutri- 
tional or toxic exposures (for example, pellagra—due to niacin de- 
ficiency), or single infectious agents [for example, central nervous 
system (CNS) lues—due to syphilis spirochaete]. This provides an 
entirely different approach to assessing the validity and reliability of 
psychiatric diagnoses. It affects how we should conduct field trials 
of diagnostic criteria and interpret statistical measures of diagnostic 
reliability such as Kappa—a measure that is greatly affected by the 
comorbidity of the clinical population from which subjects are re- 
cruited and the use of exhaustive research vs usual clinical inter- 
views [19, 24]. If there are perhaps as many as a thousand different 
genes that contribute some degree of vulnerability to the diagnosis 
of schizophrenia, and some of these are shared with vulnerability 
genes for autism spectrum disorders (ASD), attention-deficit/ 
hyperactivity disorder, bipolar disorder, and major depressive dis- 
order [25], the potential for overlapping symptoms and comorbidity 
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becomes apparent. The emerging appreciation of the influence of 
psychosocial or physical environmental exposures to the epigenetic 
switching on or off of any of the vulnerable genes for mental dis- 
orders only adds to the aetiological complexity and interpretation of 
biological ‘validators. 

The tension between the psychoanalytic concept of all mental dis- 
orders being on a single continuum from normal to psychosis and 
that of the neo-Kraepelinians that all disorders are discontinuous, 
discrete disease entities can now be reinterpreted with a better 
understanding of aetiological factors and statistical analyses of psy- 
chopathological symptoms, traits, and clinical course of illness. 
From a clinician’s perspective, there is clearly something different 
about the core clinical expression and clinical course of patients 
with autism spectrum, schizophrenic spectrum, and bipolar and 
major depressive disorder diagnoses—even if some of the same vul- 
nerability genes are shared. Although a strict dimensional scoring 
of all biological and symptomatic domains as continuous variables 
would undoubtedly provide a more precise description of psycho- 
pathology, a hybrid model that contains both categorical diagnoses 
and dimensional variations is much more useful for clinical practice. 
As a result, DSM-5 has recommended the following modifications 
in clinical diagnosis: 


1. The assessment of cross-cutting symptom domains for adults that 
include depression, anger, hypomania, anxiety, somatic, suicide 
risk, hallucinations, sleep, cognition, obsessive-compulsive, dis- 
sociation, personality, substance use, attention, and irritability— 
the last two for children and adolescents. A Level 1 screening 
and a Level 2 confirmation of higher symptom levels are recom- 
mended, regardless of diagnosis. 

2. An assessment of disability using the WHO-DAS that replaces 
the previous DSM-IV Axis V Global Assessment of Function 
(GAB) scale. 

3. Revised syndrome-based categorical diagnoses with porous 
boundaries that are organized into larger spectrum groups to fa- 
cilitate assessments of common comorbidity. 

4. Explicit threshold criteria are retained for research and clin- 
ical communication about the characteristics of the diagnostic 
syndrome. 

5. Diagnostic severity measures that are freely available for 
downloading online in English and with select translation 
editions. 

6. Use of ‘Other specified’ and ‘Unspecified’ categorical diagnoses, 
instead of the ‘Not otherwise specified (NOSY convention of 
previous DSM editions. 

7. Associated text on diagnostic features, prevalence, development 
and course, risk and prognostic factors, culture-related diag- 
nostic issues, suicide risk, functional consequences, gender- 
related issues, differential diagnosis, and comorbidity. 

8. Recommended comprehensive case formulations for clinical 
assessments. 

9. Use of a Cultural Formulation Interview to facilitate under- 
standing of cultural expressions that may be misunderstood 
as evidence of psychopathology—particularly where there are 
cultural differences between the clinician and the individual 
patient. 
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Table 7.1 Proposed overall ‘metastructure’ or groupings of DSM-5 and ICD-11 mental, behavioural, and neurodevelopmental disorders 





Neurodevelopmental disorders 
Schizophrenia spectrum and other psychotic disorders 
(No ‘General mood disorders’ grouping) 
Bipolar and related disorders 

Depressive disorders 

Anxiety disorders 

Obsessive-compulsive and related disorders 
Trauma- and stressor-related disorders 
Dissociative disorders 

Somatic symptom and related disorders 
Feeding and eating disorders 

Elimination disorders 

Sleep-wake disorders 

Sexual dysfunctions 

Gender dysphoria 


Disruptive, impulse-control, and conduct disorders 


Substance-related and addictive disorders 
Paraphilic disorders 

(Included under ‘Somatic symptom disorders’) 
Neurocognitive disorders 

Personality disorders 

Other mental disordersz 


(Not a grouping in DSM-5) 


(DSM-5 divides these up into the section that corresponds to the expressed 
symptoms. For example, ‘Depressive disorder associated with another medical 
condition is listed under ‘Depressive disorders’) 


Medication-induced movement disorders and other adverse effects of 
medication 


Other conditions that may be a focus of clinical attention 


ES 


Neurodevelopmental disorders 

Schizophrenia spectrum and other primary psychotic disorders 
Mood disorders 

Bipolar and related disorders 

Depressive disorders 

Anxiety and fear-related disorders 

Obsessive-compulsive and related disorders 

Disorders specifically associated with stress 

Dissociative disorders 

Bodily distress disorders 

Feeding and eating disorders 

Elimination disorders 

(Separate ICD-11 chapter on ‘Sleep-wake disorders’) 

(In separate ICD-11 chapter on ‘Conditions related to sexual health’) 
(In separate ICD-11 chapter on ‘Conditions related to sexual health’) 


Impulse-control disorders (separate grouping from ‘Disruptive behaviour and 
dissocial disorders’, different order: appears after ‘Disorders due to substance use 
and addictive behaviours’) 


Disruptive behaviour and dissocial disorders (separate grouping from 
‘Impulse-control disorders’, different order: appears after ‘Disorders due to 
substance use and addictive behaviours’) 





Disorders due to substance use and addictive behaviours 

Paraphilic disorders 

Factitious disorders 

Neurocognitive disorders (different order; appears after ‘Personality disorders’) 
Personality disorders 

(Residual categories for each grouping are included by ICD-11 convention) 


Mental and behavioural disorders associated with pregnancy, childbirth, 
and the puerperium, not elsewhere classified 


Secondary mental and behavioural disorders syndromes 


(In ICD-11 chapter on ‘Diseases of the nervous system’ or chapter on ‘External 
causes of morbidity or mortality) 


(Separate ICD-11 chapter on ‘Factors influencing health status and encounters 
with health services’) 


Source: data from American Psychiatric Association, Diagnostic and statistical manual of mental disorders, Sth edition (DSM-5), Copyright (2013) American Psychiatric Association; 
World Health Organization, The ICD-11 Classification of Mental and Behavioural Disorders, Copyright (2018), World Health Organization. 


The metastructure of DSM-5 classification 
of diagnostic categories 


The anticipated reorganization of the entire ICD statistical coding 
system from the ICD-9 numeric (000.00-999.99) and ICD-10 
alphanumeric (A00.00-Z99.99) system to an ICD-11 numeric- 
alphanumeric (01A00-99Z99) system opened the possibility for 
reconceptualizing the grouping and specifiers for individual diag- 
noses. The previous hierarchy of ‘organic dementia, followed by 
substance-induced disorders and psychoses, could be replaced by 
a more developmental concept of psychopathology that begins with 
neurodevelopmental disorders, and place acquired neurocognitive 
disorders far later in the organizational structure. This structure has 


been largely adopted in both DSM-5 and ICD-11 and is shown in 
Table 7.1. 

DSM- 5 includes an introductory Section 1 that provides oper- 
ational definitions and guidance regarding the use of the manual. 
Section 2 contains the diagnostic criteria and relevant ICD statis- 
tical codes within the organizational structure shown in Table 7.1. 
There is also a very substantive Section 3 that includes the previously 
mentioned dimensional assessment measures, guidance related to 
cultural formulation, an alternative ‘hybrid model’ for personality 
disorders, and conditions for further study. 

Embedded within this new organizational arrangement is a 
much greater appreciation of more continuous diagnostic spectra 
containing previously described separate and discrete disorders. 


Examples of the compression of previous separate disorders into 
single disorder spectra include ASD, specific learning disorder, sev- 
eral sleep disorders, somatic symptom disorder, and substance use 
disorders. Altogether, 50 separate disorders in DSM-IV were reduced 
to 22 spectrum disorders—with the greatest number contributed by 
merging abuse and dependence disorders for ten substances into 
mild, moderate, and severe forms of substance use disorders. Several 
new disorders were added, including social (pragmatic) communi- 
cation disorder (SCD) and hoarding disorder, that contained more 
limited symptom domains than were present in other disorders in 
the related disorder spectra. For example, ASD required deficits in 
both social communication and restricted, repetitive behaviours and 
interest domains, whereas some patients who demonstrated deficits 
only in social communication required a separate SCD diagnosis. 

Major changes in the grouping of disorders include the move- 
ment of attention-deficit/hyperactivity disorder into the Neuro- 
developmental disorder’ chapter, and gambling disorder into the 
‘Substance-related and addictive disorders’ chapter. There are 
also new placements of disorders that were previously in other 
sections of the mental disorder classifications; for example, body 
dysmorphic disorder from the DSM-IV ‘Somatoform disorders’ 
group and trichotillomania (hair pulling disorder) from the DSM- 
IV ‘Impulse-control disorders’ group were moved into the DSM- 
5 ‘Obsessive-compulsive and related disorders’ group, which also 
added new hoarding disorder and excoriation (skin picking) dis- 
order conditions. 

For a more complete description of individual diagnoses in 
DSM 5, the reader is referred to Section II of DSM-5 (pp. 31-727). 
Highlights of changes in individual diagnoses from DSM-IV to 
DSM-5 are contained in the full DSM-5 (pp. 809-16). 


Mental and behavioural disorders in ICD-11 





ICD-10: the current global standard 


The WHO member states agree to use the ICD as the basis for re- 
porting health statistics on mortality and morbidity. The ICD-10 
(Tenth Revision) was approved in 1990 and included a listing of all 
of the health conditions in alphanumeric order, ranging from A00 to 
Z99, with each chapter corresponding to a major area (for example, 
neoplasms, diseases of the circulatory system). The ‘Mental and be- 
havioural disorders’ chapter of ICD-10 was the only chapter to in- 
clude glossary definitions for each condition. However, the WHO 
has specifically indicated that these definitions did not provide suffi- 
cient information for clinical implementation by mental health pro- 
fessionals. Rather, the statistical version of ICD-10 was intended for 
use by ‘coders or clerical workers and also serves as a reference point 
for compatibility with other classifications’ [1]. 

For implementation of ICD-10 in mental health service set- 
tings, the WHO developed the ICD-10 Classification of Mental and 
Behavioral Disorders: Clinical Descriptions and Diagnostic Guidelines 
(CDDG) [10], which provided a description of the main clinical and 
associated features of the disorder, together with diagnostic guide- 
lines designed to assist mental health clinicians in making a con- 
fident diagnosis. The diagnostic guidelines in the CDDG differed 
from DSM diagnostic criteria in offering more flexible, prototypic 
guidance intended to allow for clinical judgement and global vari- 
ation, rather than strict criteria. 
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The WHO also published the ICD-10 Classification of Mental and 
Behavioral Disorders: Diagnostic Criteria for Research (DCR) [11], 
containing fully operationalized diagnostic criteria. The differences 
between the CDDG and the DCR reflected their different purposes. 
The diagnostic guidelines contained in the CDDG were intended 
to help the clinician identify the category that is most likely to offer 
relevant information for treatment and management. In this con- 
text, false negatives based on arbitrary or overly specified criteria, 
such as precise duration requirements and symptom counts, were 
problematic because they offered no guidance to the clinician [26]. 
In contrast, the DCR were intended primarily for identifying more 
homogenous research populations such as for clinical trials or epi- 
demiological studies. 

Therefore, while the ICD-10 CDDG were substantively quite dif- 
ferent from DSM-IV, the ICD-10 DCR were intentionally highly 
similar to DSM-IV’s diagnostic criteria—a similarity that was ex- 
plicitly permitted by a joint agreement between the WHO and the 
APA. Still, there were important differences. Of the 176 diagnostic 
categories the two systems had in common, there were some dif- 
ferences in all but one category (transient tic disorder), with dif- 
ferences judged to be conceptual or substantive in the case of 39 
disorders [27]. 

The largest single source of differences between the ICD-10 DCR 
and DSM-IV was DSM-IV’s ‘clinical significance’ criterion, which 
effectively required the presence of distress and/or functional im- 
pairment as a part of DSM-IV criteria sets. This difference reflected 
the WHO's view of distress (other than the specific forms of distress 
that constitute the symptoms of particular disorders) and changes in 
functional status—the main components of DSM-IV’s clinical sig- 
nificance criterion—as consequences or outcomes of health condi- 
tions, rather than their inherent features [28]. The inclusion of these 
features as a part of specific ICD-10 disorder guidelines or RDC 
criteria sets was therefore restricted to those conditions in which 
they were considered necessary for distinguishing disorder from 
normality. These issues are further discussed in the section on the 
WHO's ICF. 

Finally, the WHO published a version of ICD-10 mental and 
behavioural disorders intended for use in global primary care set- 
tings [29]. The primary care version (ICD-10 PHC) contains only 
26 disorder categories and was the predecessor for a similar ICD-11 
version. 


Development of ICD-11 mental and 
behavioural disorders 


Because the ICD plays such a crucial role in the international health 
community, it is critical that it is based on the best available scien- 
tific knowledge and that it keeps pace with significant advances in 
health care that have the potential to improve its reliability, validity, 
and utility. Increasingly, in most countries, the ICD must also be 
compatible with electronic information infrastructure. ICD-10 was 
approved by the World Health Assembly in 1990 and published in 
1992, prior to the current digital age, making the current period the 
longest in the history of the ICD without a major revision. 

The development of ICD-11 was a huge undertaking that involved 
a review of necessary changes in the classification of all health con- 
ditions, of which mental and behavioural disorders represent only a 
small part. It was made even more complex by the requirement that 
ICD-11 must be suitable for a range of different uses by all WHO 








SECTION 1 The subject matter and approach to psychiatry 


member states. Responsibility for co-ordinating the development of 
the ICD-11 chapter on mental and behavioural disorders was as- 
signed to the WHO Department of Mental Health and Substance 
Abuse. That department appointed an International Advisory Group 
in 2007 to provide advice and consultation throughout the process. 

The public health focus of the WHO Department of Mental Health 
and Substance Abuse substantially influenced the priorities and 
methods for developing diagnostic guidelines for ICD-11. Mental 
and substance use disorders account for a greater proportion of 
global disease burden than any other category of non-communicable 
diseases and are the leading cause of disability worldwide [30]. Yet, 
they are dramatically undertreated, with up to half of individuals 
with severe mental disorders in HICs and more than three-quarters 
in LMICs receiving no treatment at all [31]. Moreover, people with 
serious mental disorders experience a much higher prevalence of 
other health conditions, such as cardiovascular, metabolic, and re- 
spiratory diseases [32, 33], and disproportionately higher rates of 
mortality, resulting in a substantial reduction in life expectancy [34]. 

The WHO therefore saw the development of the ICD-11 as an 
opportunity to provide WHO member states with a better tool to 
help them reduce the disease burden of mental, behavioural, and 
neurodevelopmental disorders and to provide health professionals at 
various levels of care with better tools for identifying people in need 
of mental health services and which treatments are most likely to be 
effective. With very limited exceptions, substantial gains in clinical, 
neuroscience, and genetic research has not produced validity infor- 
mation on which to base major changes in diagnostic classification 
[35, 36]. However, current knowledge is amply sufficient to provide 
vastly more effective and more accessible mental health services than 
are currently provided at a global level, and the WHO provided a set 
of specific strategies for doing so in its Comprehensive Mental Health 
Action Plan 2013-2020 [37]. These considerations have led the WHO 
to focus particularly on improving the classification’s clinical utility 
and global applicability in developing diagnostic guidelines for ICD- 
11 mental, behavioural, and neurodevelopmental disorders [38]. 

The International Advisory Group and all Working Groups for ICD- 
11 Mental, Behavioural, and Neurodevelopmental disorders included 
representation from all WHO global regions. Professional surveys 
[39,40] and formative studies [41, 42] intended to inform early decisions 
about the structure of the classification were conducted in multiple lan- 
guages and with international groups of participants. Diagnostic guide- 
lines for the ICD-11 CDDG include a specific section on cultural issues 
related to the clinical presentation and diagnosis of each condition. 

Clinical utility and global applicability are critical, in part, because 
the ICD-11 CDDG and primary care versions are intended to func- 
tion as important interfaces between health encounters and health 
information [41]. A classification that does not provide clinically 
and locally useful information to health professionals has little hope 
of being implemented consistently at the encounter level. In that 
case, diagnostic data that are aggregated from health encounters at 
the facility, system, national, and global levels will be unable to pro- 
vide an optimal basis for global health data or for decision-making 
such as resource allocation. 

To achieve its goal for ICD-11, the WHO took a highly system- 
atic approach to the development of ICD-11 diagnostic guidelines 
by multi-disciplinary and broadly international expert Working 
Groups, particularly including substantial representation of experts 
from LMICs where 80% of the world’s population lives. Substantial 


improvements, as compared to ICD-10, included the adoption of a 
lifespan approach and the incorporation of dimensional approaches, 
particularly for personality disorders and primary psychotic dis- 
orders, that are more consistent with current evidence, more compat- 
ible with recovery-based approaches, eliminate artificial comorbidity, 
and more effectively capture changes over time. New categories were 
added, including bipolar type II disorder, complex post-traumatic 
stress disorder, prolonged grief disorder, body dysmorphic disorder, 
olfactory reference disorder, hoarding disorder, binge eating dis- 
order, and compulsive sexual behaviour disorder [43]. In addition, the 
WHO implemented a systematic global programme of field studies 
using innovative methodologies specifically intended to examine the 
clinical utility and global applicability of the ICD-11 diagnostic guide- 
lines [44]. 

In June 2018 the WHO released a pre-final version of the 1CD-11 
for mortality and morbidity statistics to its 194 member states, for 
review and preparation for implementation, including considering 
how the ICD-11 will be incorporated in policies, laws, health sys- 
tems, translations, and training of health professionals. The World 
Health Assembly, comprising the ministers of health of all member 
states, formally approved the ICD-11 in May 2019. Member states are 
now beginning a process of transition from the ICD-10 to the ICD- 
11, with reporting of health statistics to the WHO using the ICD- 
11 to begin on 1 January 2022. The WHO Department of Mental 
Health and Substance Abuse will publish Clinical Descriptions and 
Diagnostic Guidelines (CDDG) and separate guidance for the iden- 
tification of mental health conditions in primary care settings. 


Structure of ICD-11 mental and behavioural disorders 
and compatibility with DSM-5 


The ICD-11 was also developed in the context of full knowledge 
of DSM-5, both in its formative stages and in its 2013 published 
version. Working Groups were specifically instructed to consider 
DSM-5 formulations and their suitability for global application but 
were not prohibited from departing from them when indicated. 
Therefore, differences between ICD-11 and DSM-5 are intentional. 

The proposed groupings for the ICD-11 chapter on mental and be- 
havioural disorders are listed in Table 7.1, which also includes a com- 
parison with the structure of DSM-5. Generally, the comparability of 
the structure of the two classifications can be counted as a success of 
harmonization efforts between the WHO and the APA. Some struc- 
tural differences reflect ICD-wide conventions related to residual 
categories and mental disorders associated with other underlying 
disease. Others are the result of deliberations involving the WHO, 
the Advisory Group, and the various Working Groups, such as in the 
diagnostic treatment of chronic irritability and anger in children [45] 
and somatoform disorders [46]. Another difference is related to the 
integration of the classifications of ‘organic and ‘non-organic’ aspects 
of sleep-wake disorders and conditions related to sexual health and 
gender identity [47] in new ICD-11 chapters in ways that are more 
consistent with current evidence and clinical practice. 


Structure and content of ICD-11 clinical descriptions and 
diagnostic guidelines 

The structure and information contained for each category of the ICD- 
11 CDDG are expected to enhance the clinical utility of the manual by 
providing clearly organized, consistent information across disorders 
that is flexible enough to allow for cultural variation and the exercise 


of clinical judgement. The categories of information to be provided 
for each category in the CDDG are: Category Name, Brief Description 
(100-125 words), Essential (Required) Features, Boundary with 
Normality (Threshold), Boundary with Other Disorders (Differential 
Diagnosis), Coded Qualifiers/Subtypes, Course Features, Associated 
Clinical Presentations, Culture-Related Features, Developmental 
Presentations, and Gender-Related Features. 

As noted previously, the utility and effectiveness of this format in pro- 
ducing more consistent clinical judgements in ICD-11, as compared to 
ICD-10, has been tested in a series of international field studies [48, 49]. 





ICD-11 mental disorders for primary care 


The first version of the ICD classification of mental disorders specific- 
ally designed for use in primary care settings (ICD-10 PHC) [29] was 
produced by the WHO in 1996 and included 26 mental disorders that 
were either very common in these settings or quite important for pri- 
mary care professionals to recognize. It had several features that dis- 
tinguished it from the parent version of ICD-10—for each disorder, 
common presentations in primary care, distinguishing features, and 
relevant differential diagnoses were described. Also included was infor- 
mation for the patient and family, response to both psychological and 
pharmacological treatment, and indications for specialist referral [50]. 

The most compelling rationale for a version of the ICD designed 
for primary care is that the spectrum of psychological disorders seen 
in general medical settings is very different from that seen in spe- 
cialist mental health settings. These patients in primary care settings 
commonly present with untidy combinations of somatic, anxious, 
and depressive symptoms, rather than the more elaborated diag- 
nostic entities that are seen as more prototypic by clinicians working 
in specialist settings. Many of these patients do not see themselves 
as psychologically disordered, despite having all the symptoms re- 
quired for various mental disorders—they are worried about their 
somatic symptoms and hope that the primary care clinician will ex- 
clude a physical cause for their symptoms and offer relief from the 
pain and distress associated with them. 

In specialist mental health settings, the diagnostic task of the 
mental health professional can be characterized as asking systemat- 
ically about the full range of known disorders. In contrast, the task of 
a clinician in primary care is to recognize a range of common mental 
disorders (CMDs), as well as some other severe mental disorders, 
and a range of problems associated with sleep, eating, alcohol, and 
drugs. The boundaries of the various overlapping syndromes of 
CMDs are of less importance than the existence of psychological 
distress accompanying any presenting physical disorder. The subtle 
distinctions between many of the individual disorders separately de- 
scribed by the main classifications are of limited value in primary 
care where clinicians work with the full range of general medical, 
behavioural, and social problems. A smaller set of disorders, broadly 
comparable to the main ICD, reflecting the most common condi- 
tions seen in this setting, is required for primary care. 


Conceptual developments for the new version of the 
primary care version of ICD-11 


Since ICD-10 PHC was released, there have been a number of devel- 
opments in knowledge and conceptualizations regarding common 
psychological disorders occurring in primary care settings. These are 
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related to the relationship between anxious and depressive symptoms, 
the importance of recognizing depressive symptoms that commonly 
accompany chronic physical disorders, and the management of mul- 
tiple somatic symptoms without any accompanying physical disease. 

Both major international classifications recognize that depression 
and generalized anxiety disorder (GAD) commonly accompany 
each other, but the diagnostic requirements for depression include a 
duration of only 2 weeks, while the requirement for GAD is several 
months. Yet many patients develop anxious symptoms around the 
time they develop depressive symptoms, so that states of both anx- 
iety and depressive symptoms are very much more common than 
‘comorbidity’ between depression and GAD. 

If the same duration of illness is used to define both depression and 
anxiety, those with both sets of symptoms have a much worse course, 
are more resistant to treatment, and are more likely to commit suicide 
[51]. The depressive symptoms in episodes of depression accompanied 
by significant anxiety are more severe than in depressive episodes not 
accompanied by anxiety [52, 53]. Individuals with episodes of such 
‘anxious depression’ are more likely to have harm-avoidant symp- 
toms and to also present with a wide range of family members with 
other anxiety disorders [54]. Focus groups of primary care physicians 
(PCPs) and nurses were held in eight countries in order to assess the 
views of PCPs towards the introduction of a category for anxious de- 
pression into ICD-11 PHC—they were uniformly enthusiastic [55]. 

It is true that both anxious and depressive symptoms can and do 
occur on their own, and therefore, they must both retain their dif- 
ferent names, but anxious depression deserves recognition in its 
own right [56]. Finally, states of mixed anxiety and depressive symp- 
toms that fall just short of the threshold requirements for either are 
very common in community settings and indeed account for more 
sickness absence in the UK than either depression or anxiety [57]. 
These mild disorders tend either to become formally diagnosable or 
to resolve within the next few months [58]. 

These various studies encouraged the Primary Care Consultation 
Group of the WHO to recommend that when the diagnostic require- 
ments for both anxiety and depression are satisfied, although using a 
shorter duration requirement for anxiety that matches that of depres- 
sion, patients are diagnosed as ‘anxious depression, but when one is at 
‘case’ level and the other well short of it, these conditions are diagnosed 
as ‘depression (non-anxious)’ or ‘current anxiety. When symptoms of 
both are present, but both just fail to reach the required severity, the 
individual may be diagnosed with ‘subclinical anxious depression. 


The use of screening instruments 
for psychological problems 


Even in HICs, primary care professionals have difficulty remem- 
bering and implementing the complex diagnostic algorithms used 
by mental health professionals [59], there is considerable pressure 
on time and resources, and options for specialist referral for CMDs 
may be limited. Available written screening questionnaires may be 
of limited usefulness for reasons of language or literacy, not only 
in low-resource settings, but also in primary care settings in HICs 
which serve increasingly diverse populations. 

The ICD-11 Primary Care Consultation Group therefore evaluated 
two brief screening scales for anxiety and depression to assist PCPs 
in deciding whether a diagnosable psychological problem was likely 
to be present. These screening questions were derived from an earlier 
WHO international study, in which it had been possible to compute 
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sensitivity and specificity values for both sets of questions [60]. Given 
time and resource pressures and linguistic and literacy issues, these 
screening scales may be of considerable value, especially in LMICs, 
but also in HICs, and will be published along with the ICD-11 PHC. 


International classification of functioning 


disability and health 





Mental disorders and disability: assessment 
and classification 


Mental disorders are strongly associated with disability—detriments 
in functioning of the brain, the body, and the person in general. 
People with mental disorders experience impairments in their body 
and brain functions, limitations in their personal daily activities, 
and restrictions in their social lives. These disabilities are part and 
parcel of the clinical picture of persons with mental disorders. The 
association between a mental disorder and disability may be causal, 
consequential, or difficult to disentangle. Disability is usually a key 
determinant for individuals seeking care and a significant factor for 
providers in making a clinical case formulation. Yet the form, fre- 
quency, and outcome of disabilities in mental disorders are not well 
defined or studied scientifically. Moreover, their use in formulating 
diagnoses of mental disorders has been unclear and inconsistent. 

The WHO and the APA have used the construct of disability 
differently in their classification systems until recently. The WHO 
ICD definitions have, as far as possible, kept disability outside the 
diagnostic classification, as per the model articulated in the WHO’s 
International Classification of Functioning, Disability and Health 
(ICF) [61]. DSM has kept disability as part of the functional impair- 
ment component of the clinical significance criteria. However, in 
the creation of DSM-5, the clinical significance criteria have been 
aligned to match the operational disability definition in the ICF. This 
has been an important step to scientifically disentangle disability 
from the disease process as much as possible in the formulation of 
mental disorder diagnoses in both ICD and DSM systems. 

Ideally, the scientific definitions of a disease or disorder should 
be based on the aetiology and the pathological process that takes 
place in bodily systems. Such definitions do not require any sort 
of disability such as detriments in work capacity. While the con- 
sequences of disease are important clinical factors that trigger the 
recognition of a case by providers and often determine the types of 
health services and level of care required, they are not always a part 
of the diagnostic definition of the disease process itself. For example, 
people with different levels of general intellectual functioning and 
education may have similar levels of neuronal loss as a result of a 
brain injury but display different patterns of limitations in their 
activities. However, poor self-care and social performance are in- 
cluded as negative symptoms in schizophrenia diagnoses. Hence, 
although most mental disorders do not have known aetiology and 
specific pathophysiology, the potential benefits of more explicitly 
defining disability separately from the other signs and symptoms of 
mental disorders are similar to those of other diseases and disorders. 
A common disability metric for GBD estimates associated with all 
diseases and disorders will be helpful [30]. 

DSM-IV and DSM-5, contrary to the ICD system, make clinical 
significance an explicit part of the criteria for establishing a diagnosis. 


The addition of this criterion to almost all disorders occurred after 
the National Institute of Mental Health Epidemiologic Catchment 
Area Program (ECA) study demonstrated that use of the DSM-III 
diagnostic criteria alone in community population studies led to 
relatively high prevalence rates of disorders—with some subjects 
having no evidence of distress or impairment [62]. Clinical signifi- 
cance has two main components: distress and functional impair- 
ment. Distress is expressed by the individual in the form of worry 
and concern about the condition. It has been operationally defined 
for epidemiological studies (for example, with the K-6 instrument) 
as including a combination of anxiety and depressive symptoms 
[63]. Sometimes it may not be expressed or may be explicitly de- 
nied. Functional impairment refers to limitations due to the illness, 
particularly in the social and occupational spheres of life, given that 
people with a disease or a disorder may not carry out certain func- 
tions in their daily lives. As part of the clinical significance criterion, 
distress and functional impairment help to establish the threshold 
for the diagnosis of a disorder. No guidance is given for determining 
the level of disability that would constitute the threshold for a diag- 
nosis; this is left open to the clinical judgement of the clinician. 

Operationally, functional impairment as used in the construct of 
clinical significance is roughly equivalent to the concept of ‘disability’ 
in WHO’s ICF. The ICF does not use the term functional impair- 
ment. In the ICF the term functioning is a neutral one, encompassing 
all body functions, activities, and involvement in life situations. The 
term disability means the decrements to these functions, which are 
known at the body level as impairments, at the person level as ac- 
tivity limitations, and at the societal level as participation restriction. 

During the development of DSM-5, an attempt was made to align 
DSM’°s use of functional impairment more explicitly with ICF’s con- 
cept of disability. The DSM’s concept of social functioning would 
include the ICF’s interpersonal interactions and relationships but 
might also include some of the items concerning participation in 
community, social, and civic life. The DSM’s occupational func- 
tioning would include the activities listed under the ICF’s categories 
of work and employment. 

DSM-5 and ICD-11 now use ICF as an anchor to define and 
quantify disability. The WHO's Disability Assessment Schedule 
(WHO-DAS 2.0) has been officially recognized to assess the level of 
disability in DSM-5 Section 3. In this way, it will be easier to measure 
and identify levels of disability in terms of activity limitations and 
participation restrictions. 


Disability and severity concepts in classifying 
mental disorders 


Traditionally, both in DSM and ICD systems, the disability and 
functioning concepts have often been used to determine the level 
of severity of the diagnosed disorder. Three levels of severity are fre- 
quently specified (mild, moderate, and severe), which include either 
the number or the intensity of symptoms and impairments in social 
and occupational functioning. Determining the level of severity is a 
clinical judgement. For example, DSM-5’s guidance for ‘mild’ and 
‘severe’ includes either ‘few or ‘many’ symptoms over the required 
number and either ‘minor’ or ‘marked’ impairments in social or oc- 
cupational functioning. ‘Moderate’ is in between. 

The criteria for neurodevelopmental disorders such as ASD are 
somewhat more explicit. Anchors are provided in DSM-5 for mild, 
moderate, and severe impairment in specific symptomatic domains 


for intellectual disability (disorders of intellectual development in 
ICD-11) and ASD. These domains include conceptual, social, and 
practical (occupational) areas of intellectual disability, and social 
communication and restricted, repetitive behaviours in ASD. In 
both of these disorders, the ability to engage in social activities and 
the need for supervision are assessed where the amount of supervi- 
sion required provides an anchor for severity. 

In ICD-11, functioning properties are being integrated into the in- 
formation that is being developed to describe health conditions [64]. 
These are listings of specific functional domains that are specific- 
ally relevant to a given disease or disorder, including mental, behav- 
ioural, and neurodevelopmental disorders, and may be important 
foci of assessment. The explicit identification of functioning prop- 
erties related to specific health conditions is intended to provide 
a broader and more meaningful picture of an individual's overall 
health in order to guide clinical decision-making. 

Domains of functioning and disability in ICF and WHODAS 2.0 
include the following: (1) understanding and communicating with 
the world (cognition); (2) moving and getting around (mobility); 
(3) self-care; (4) getting along with people (interpersonal relation- 
ships); (5) domestic life, occupation, school, and leisure; and (6) par- 
ticipation in society [65]. The more explicit descriptions of these 
six domains are intended to facilitate a clinical focus on reducing 
specific limitations separately, but in conjunction with reducing the 
symptoms of mental disorders. 


Future of mental disorder classification 





At the beginning of the DSM-5 and ICD-11 developmental process, 
there was a nicely articulated summary of the challenges and obstacles 
to developing a fully explicated aetiological and pathophysiologically 
based classification system for mental disorders [66]. An overreliance 
on DSM-IV and ICD-10 symptom clusters as phenotypes for genetic 
markers and targets for medication development was noted. Findings 
such as common genetic markers for multiple disorders and pharma- 
cological and psychosocial treatment effectiveness that did not corres- 
pond with DSM or ICD diagnostic boundaries called for new research 
strategies. Out of this background, the NIMH launched the RDoC 
strategy to facilitate greater attention to the relationships among mul- 
tiple biological and phenomenological domains—including negative 
valence systems, positive valence systems, cognitive systems, systems 
for social processes, and arousal/modulatory systems—as a basis for 
future breakthroughs in understanding and treating mental disorders 
[67]. Although it was fully understood in the field that it would be 
years, if not decades, before such a fully articulated system could be 
completed, there was an unfortunate initial juxtaposition of the DSM- 
5 and RDoC systems as competitive, rather than collaborative, efforts 
to advance our understanding of these disorders. It is now clear that 
when genes, molecules, cells, circuits, and physiology emerge that 
correlate with behavioural and symptomatic syndromes and can be 
assessed in routine clinical settings, they can be incorporated into fu- 
ture versions of clinical classification systems [68, 69]. Such biological 
measures must be able to demonstrate the level of sensitivity and spe- 
cificity needed for clinical decision-making. 

From a clinical practice perspective, the increasing concern about 
the quality of care will require more ‘measurement-based out- 
come’ approaches for clinicians, as well as for reassuring health care 
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administrators of the value received for their organizations and 
funding. The increasing standardization of medical terminology and 
the use of dimensional measures of symptomatic and disability do- 
mains will be fully operational only in the context of electronic health 
records. By providing more detailed information about clinical status, 
these measures will have a major impact on risk adjustment for severely 
ill patients and on outcome assessment for value-based reimbursement 
to clinicians. In primary care and low-resource settings, use of simple 
screening methods to identify mental disorders and to monitor treat- 
ment response will be important next steps. With the current DSM-5 
and ICD-11 classification structures in place, it is expected that more 
iterative revisions that can be integrated more seamlessly into health 
statistics and clinical practice will emerge over the next few decades, 
rather than large-scale, more disruptive edition changes. 
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The National Institute of Mental Health (NIMH) launched the 
Research Domain Criteria (RDoC) in 2009, in response to the 
2008 NIMH Strategic Plan, call for new ways of classifying mental 
illnesses that are based on dimensions of observable behavioural and 
neurobiological measures. The RDoC initiative was motivated by 
the need to address the scientific concern that the field had equated 
psychiatric illness with syndromes based on clinically observed 
diagnostic criteria, which were not well connected with neural and 
psychological mechanisms. The RDoC was also motivated because 
progress in treatment development had been slowed by a dispro- 
portionately narrow focus on disorders as defined by mainstream 
diagnostic criteria. Specifically, psychiatric research strategies have 
been limited by explanations of disorders that have been organized 
around clinical presentation. 

At the outset, we wish to be clear that DSM-5 and ICD-11 (and 
their predecessors) have been vital for informing the treatment of 
human distress and dysfunction, and continue to serve an important 
clinical role. Despite limitations (detailed in the forthcoming), these 
diagnostic manuals have been critical for the development of in- 
novative theories of mental disorders (for example, anxiety, de- 
pression, bipolar disorder, schizophrenia), which, in turn. have 
facilitated empirical advances to develop a number of effective 
treatments. That said, a counterpoint consensus that progress could 
be accelerated has existed for some time, such that gains in under- 
standing and treating mental illness are modest, relative to the more 
recent advances in integrative neuroscience. This view holds that re- 
search aimed at clarifying mechanisms of psychiatric illness (for ex- 
ample, disrupted neural circuits and mental processes) of clinically 
described syndromes has reached the asymptote of progress. As a re- 
sult, research into new and potentially more effective treatments has 
waned. We argue that a change in the conceptualizations of mental 























disorders is needed to improve diagnostic validity, to open new av- 
enues of research, and to expedite progress in the development of 
new and more effective treatments. 




















How did psychiatry research reach the paradoxical state of being 
constrained by a diagnostic system of its own creation? The answer is 
astory about reliability and validity. In the late 1960s and early 1970s, 
Washington University became the centre of psychiatric research in 
the United States, so much so that influential academic leaders of the 
time became known in some circles as “The St Louis Group. For that 
era, this was a radical collection of psychiatrists. Researchers leading 
this charge eschewed aetiological theories (mainly psychoanalytic) 
on the promissory note (mistaken, as it turns out) that if reliable 
diagnoses were developed, then laboratory studies to elucidate the 
biological and genetic underpinnings would follow. The motivation 
was to bring respect to the field by establishing psychiatry as science, 
in part because one consequence of the psychoanalytic zeitgeist 
that had prevailed up until that time was that clinicians frequently 
disagreed about a patient’s diagnosis (that is, reliability was poor). 
Some psychiatrists were concerned that physicians in other special- 
ties espoused the view that psychiatry was not ‘real medicine. The St 
Louis Group wanted to change this impression and led the charge 
for psychiatry researchers to focus squarely on improving reliability, 
with the expectation that the discovery of biological substrates 
would eventually follow. 

Drawing from the heritage of Emil Kraepelins work (as well as 
incorporating efforts by Kurt Schneider), clinically observable diag- 
nostic criteria for psychiatric research were developed—earning the 
researchers the label ‘neo-Kraepelinianians’ [1]. The new diagnostic 
criteria were first colloquially referred to as the Feighner Criteria [2], 


and later became the Research Diagnostic Criteria, or ‘RDC’ (not to be 
confused with RDoC). This was a major advance for psychiatry. The 
new criteria were largely descriptive, freeing researchers from the con- 
straint of psychoanalytic theory. Importantly, psychiatric researchers 
could now be confident that they were studying the same symptom sets 
in patients. Working from these clinical syndromes, it was assumed 
that valid diagnoses could be established. In this vein, Robins and Guze 
(1970) [3] proclaimed five criteria for the validity of psychiatric diag- 
noses: (1) clinical description; (2) laboratory tests; (3) delimitation 
from other disorders; (4) follow-up studies and course; and (5) family 
studies. These criteria became the bedrock principles guiding the de- 
velopment of psychiatric nosology for the next five decades [4]. 

The field was at a crossroads on the clinical side too. The develop- 
ment of a radically changed DSM was on the horizon, and the psych- 
iatrist Robert Spitzer was appointed to revise the manual. Among 
clinicians, there was resistance for overly rapid change, but the RDC 
were ready-made to be transposed to clinical diagnoses for the ‘new 
DSM diagnostic manual. As the research criteria were translated to the 
clinical realm, much of the resulting formulations were based on ‘ex- 
pert consensus, and this led to some notable changes. In the process, 
however, some duration criteria were relaxed for clinical translation in 
ways that, while stringent for research purposes, would be less inclu- 
sive for clinical purposes; for instance, in the case of major depressive 
disorder, depressed mood was tightened from 1 month to 2 weeks [5]. 

DSM-III [6] was both embraced and disparaged by the field. 
Clinicians could finally agree on a diagnosis with confidence. This 
was an improvement for developing coherent treatment plans. 
When used in research, the uniformity of patient groups was all but 
guaranteed. However, it gave false assurance that psychiatrists were 
treating ‘real’ diseases, giving the impression that these new diag- 
noses were ‘natural kinds, and this led to a problem of reification and 
overconfidence that corresponding mental and neural mechanisms 
could be unearthed for each disorder [7]. It also unleashed political 
fury, consequent to the incorporation of symptom criteria that were 
criticized for being chosen on the arbitrary basis of clinical con- 
sensus and for being value-laden with stereotypical cultural beliefs. 

Just over a decade after DSM-III was introduced, Jerome Wakefield 
(1992) [8] addressed these issues by articulating a definition of mental 
disorder that he termed ‘harmful dysfunction: His definition held two 
central tenets for a mental illness—firstly, that an ‘internal mechanism 
was not functioning, and secondly, that the dysfunction caused harm 
either to the individual, in the form of subjective distress, or to so- 
ciety by the inability for the afflicted person to fulfil his or her roles ac- 
cording to accepted cultural expectations. In psychology, Wakefield’s 
model became the textbook definition of mental disorder; in psych- 
iatry, the influence of the ‘harmful part of his definition was embraced 
and its influence persists throughout DSM-5, which relies, in many 
instances, on functional deficits to determine the clinical threshold. It 
is the second component of Wakefield's definition—the dysfunction 
of an ‘internal mechanism —that has yet to be elaborated for DSM- 
based diagnoses (post-1980 through the present), and it is now widely 
acknowledged that DSM sacrificed validity for reliability. 


Manifest problems of the descriptive approach 


The standardized criteria provided by the prevailing DSM diag- 
nostic systems (DSM-III [6] going forward through the current fifth 
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edition) and the more recent iterations of the WHO's ICD (through 
the current eleventh version) have made possible the reliable diag- 
nosis of patients for both clinical and research purposes. This has 
been essential to an important developmental phase of psychiatric 
science. The diagnostic manuals also improved the conception of 
mental disorders, as aspects of many disorders were refined, mainly 
on the basis of Robins and Guze (1970) [3] criteria of course and 
outcome, and some by delineation from other disorders. However, 
validation has come up short, particularly for biologically related 
systems. Moreover, with accumulated knowledge emerging from 
research using the tools of modern behavioural neuroscience, it be- 
came increasingly clear that the nature of psychopathology mech- 
anisms did not align with the descriptively derived categorical 
diagnoses [9]. 

The problems evident from poor validity include polythetic cri- 
teria sets, high rates of diagnostic co-occurrence (‘comorbidity’), 
heterogeneity within diagnoses, overspecification, arbitrary or sub- 
jective cut points, and a binary classification that excludes a dimen- 
sional gradience of symptoms. Phenotypes codified in DSM-III (and 
subsequently aligned in the ICDs) have yet to be validated with la- 
boratory studies. Mental disorder syndromes based on descriptive 
diagnosis, it seems, have been lost in translational research. The early 
commentaries that the fundamental tenets of clinical syndromes for 
DSM-III, carried forward through DSM-5, were flawed for research 
purposes had been realized (for example, [10]). 

By definition, DSM symptoms are imperfect indicators of the 
diagnosis to which they refer. No single symptom is required for a 
diagnosis; rather, collectively, a subgroup of symptoms represents 
the essence of a construct. This is termed ‘polythetic’ and means 
there are sundry ways to ‘get’ a DSM diagnosis. For instance, there 
are 256 combinations of symptoms for borderline personality dis- 
order (BPD) [11]; two people can meet criteria for major depres- 
sive disorder (MDD) and not share a single symptom in common 
(if weight loss and weight gain are considered as separate symp- 
toms) (APA, DSM-5). DSM also implies that many symptoms are 
equally important when actually they may be differentially related 
to functioning and the threshold number of symptoms required for 
a diagnosis is not empirically determined [12]. At the same time, 
clinicians demonstrate a proclivity to overweigh certain symptoms 
that steer them to a diagnosis, for instance, self-harm behaviour for 
BPD [13, 14, 15]. 

The exclusion of ‘comorbid’ diagnoses can artificially restrict 
the range of pathological mechanisms in psychiatric research [16]. 
Moreover, it has been argued that the term ‘co-occurrence’ is pre- 
ferred over ‘comorbidity’ by many, precisely because associated 
neural and mental mechanisms have not been elaborated and it is 
unclear if two ‘comorbid’ psychiatric diagnoses have interrelated 
psychopathology or are merely different sets of descriptors for the 
same psychopathological processes (see [17]). Individuals who are 
diagnosed with more than one disorder are often excluded from re- 
search, so that confounding by co-occurring disorders is avoided. 
This approach may increase the specificity of claims resulting from 
significant between-groups differences, but the result is a predom- 
inant focus on reified patient groups and ‘super-normal’ comparison 
groups, neither of which are representative of the general patient and 
non-patient populations. 

Another problem with research focusing on clinically described 
syndromes is the imposition of categories. By focusing research 
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efforts primarily on comparing individuals who meet diagnostic 
criteria with those who do not experience psychiatric symptoms, 
potentially meaningful variance in symptoms and neurobehavioural 
processes may be excluded. Arguments in favour of diagnostic 
categories include the idea that these extended phenotypes do not 
reflect a true continuum in symptoms due to varying degrees of dis- 
ruption of a common symptom-related mechanism, but rather may 
be the result of distinct processes that distinguish latent clusters of 
individuals and result in variable symptom presentations which may 
relate to the likelihood that an individual will seek treatment [18, 19]. 
However, dimensionality in neurobehavioural processes is con- 
sistent with other disease-relevant biological processes. More gener- 
ally, in internal medicine, hypertension provides an example. Blood 
pressure levels fall along a gradient, but a threshold—informed by 
research and subject to change in response to new information— 
defines the presence of disease and dictates the type of treatment. 

With the DSM approach, many individuals experience psychiatric 
symptoms such as anxiety, depression, or disruptions in thinking 
that are not sufficient in number, duration, or severity to meet diag- 
nostic criteria. Some of these cases are handled with the ‘Unspecified’ 
(formerly ‘Not otherwise specified’) categories. When the DSM is 
used for research purposes, however, these incomplete diagnoses are 
typically excluded from study design, eliminating a potentially sig- 
nificant range of psychopathology. Epidemiological studies find that 
sub-syndromal psychiatric symptoms are not unusual in the gen- 
eral population. Thus, using the DSM for research potentially ex- 
cludes a large segment of the population from a study. When DSM 
disorders are considered dimensionally, it has been shown for some 
disorders that those who are sub-diagnostic-threshold suffer similar 
degrees of functional impairment and distress as those who meet 
the full criteria for the disorder (for example, [20]). Indeed, there 
is a solid base of evidence and growing consensus for dimensional 
approaches to diagnosis [21]. When dimensional approaches are ap- 
plied to DSM symptom sets, considerable improvement in reliability 
and validity has been demonstrated [22]. However, to connect clin- 
ical symptoms to neurobiological mechanisms, more than a simple 
dimensionalization DSM category is needed (for example, [23]; see 
also [24]). 


A new approach 


These problems illustrate how dividing the lines between psy- 
chological health and mental illness can be clinically subjective 
and suggest that there may be useful discontinuities or thresholds 
with implications for treatment decisions or the illness course. 
However, narrowing the definition of a positive diagnosis for clin- 
ical purposes can create problems when applied to research design 
by focusing only on the extremes of a hypothetical health—illness 
spectrum, excluding the full variation of symptom expression and 
associated mechanisms. The reasoning implicit in the current diag- 
nostic frameworks is consistent with an infectious disease model in 
which an individual is either infected with the disease-causing agent 
or is not, prompting a dichotomous diagnostic decision for each 
disorder—are diagnostic criteria met or not? This approach pro- 
vides an acceptably reliable method for sorting research participants 
into patient and non-patient groups for the purposes of looking 
for group differences but may have the unintended consequence 


of distorting our understanding of the true nature of the landscape 
of mental disorders. In an ironic twist, other areas of medicine are 
confronting similar issues and trying to align their diagnostic sys- 
tems with a new understanding of systems biology. For instance, the 
National Academies of Sciences issued in 2011 a call for all disease 
taxonomies to be revised to align more closely with the new under- 
standing of molecular biological processes [25]. 

In sum, psychiatric diagnosis achieved good reliability, essential 
for clinical practice and research, but this came at the expense of val- 
idity. Definitions of disorders have evolved over centuries and rely 
heavily on patient self-report of internal experiences. The diagnoses 
identify common clinical presentations of syndromes, but the foun- 
dations for these frameworks precede modern behavioural neuro- 
science and do not incorporate an understanding of the ways in 
which the brain develops and functions. Although neuroscience has 
yielded important insights about mental disorders, these methods 
are failing to bring about the revolution in understanding and treat- 
ment that was anticipated when they were developed [26]. 

Finally, a persistent myth that has stymied novel solutions to 
these problems should not go unmentioned: Over the past several 
decades, the hope was held that mental disorders would be better 
understood once better tools were developed. That hope is gradually 
fading. The brain’s complexity is daunting, consisting of millions of 
neurons and other types of cells, with connections and patterns of 
activity continually changing according to developmental trajec- 
tories and in response to experiences and exposures. Technology for 
visualizing and quantifying the structure and function of the brain 
has yielded an ever expanding corpus of published studies, and the 
tools of genetics, molecular biology, and neuroimaging continue to 
improve; yet no method has emerged that has detected a specific 
neuropathological process that accounts for a psychiatric disorder 
and would determine the classification of individuals into diagnostic 
groups. 

The futility of mapping syndromal phenotypes (that is, DSM- 
based diagnoses) onto dysfunctional internal mechanisms has also 
been recognized by those developing pharmaceutical agents [27]. 
Over the past decade, research driven by pharmaceutical companies 
waned, in large part due to problems that symptom-based pheno- 
types were not the right targets for drug development (see [28]). 
Thus, both scientific concerns about psychiatry’s approach to diag- 
nosis, along with practical matters obviating treatment development, 
were motivations for the development of the RDoC. Indeed, others 
articulated this problem and argued for approaches that would em- 
brace intermediate phenotypes [29, 30]. 

Despite widespread recognition of these problems, funding for the 
bulk of psychopathology research in the United States has curiously 
remained harnessed to the DSM categories. The DSM classifications 
had become the de facto standards for psychiatric research, such 
that the entire scientific cycle, from grant applications, review and 
funding decisions, to publication, was dominated by DSM-based 
disorders, and little space was left for psychopathology research con- 
ducted outside this framework. Some investigators have reported to 
their programme officers a reluctance to submit non-DSM-based 
research for funding applications because of concerns such research 
would not fare well in review for scientific merit, even for proposals 
in response to specific requests for funding. Other investigators have 
complained of arbitrary constraints for how to handle co-occurring 
diagnoses in the culture of grant review committees, as well as 


favoritisms for certain DSM disorders [31]. The RDoC was intro- 
duced to provide an alternative path for researchers to study psy- 
chopathology mechanisms, in part so that these impositions could 
be circumvented. 

Van Praag and colleagues (1990) [32] alerted the field to the risks 
of ‘nosological tunnel vision, noting that ‘exclusive adherence to 
nosology has not served biological psychiatry well [but] has been 
a factor stunting its growth’ ([32], p. 507); a decade later, van Praag 
diagnosed psychiatry with ‘nosologomania [33]. Psychiatry de- 
partments have generally been organized according to DSM dis- 
orders, and this might also be seen as an impediment to progress, 
encouraging both research and training in ‘silos’ and cordoning 
off knowledge that ideally would be integrated. As recognition of 
these impediments has become clearer, the field is responding. 
The journal Schizophrenia Bulletin recently added the subtitle “The 
Journal of Psychoses and Related Disorders’ to reflect the changing 
nature of the field. The NIMH is presently addressing the inherent 
limitations of extramural research largely programmatically organ- 
ized around disorders by encouraging a focus on trans-diagnostic 
diagnoses, cross-cutting features, and the RDoC. 





Development of the RDoC framework 


In March 2009, the then NIMH director Thomas Insel formed 
an Internal Working Group to address the 2008 NIMH Strategic 
Objective to develop new ways to classify mental illnesses based 
on dimensions of observable behavioural and neurobiological 
measures (see [34] for a brief summary of the process, including 
the original working group members). The initial development of 
the RDoC was influenced by non-DSM-based models of psycho- 
pathology research, including the earlier mentioned intermediate 
phenotype approach [29, 30], as well as pre-DSM-III [6] experi- 
mental psychopathology approaches, including work from experi- 
mental psychopathology (for example, [35, 36]) and research in 
developmental psychology (for example, [37]). Facing a tabula rasa, 
and drawing from these approaches, five domains were initially pro- 
posed as a starting point: ‘Negative valence systems, ‘Positive valence 
systems, ‘Cognitive systems, ‘Social processes; and ‘Regulatory and 
arousal systems. Within each of these five domains were grouped a 
number of potential constructs (and subconstructs), each with evi- 
dence for a valid psychological construct, including evidence for a 
neural circuit associated with a psychological function relevant to 
psychopathology. 

To integrate basic neural and psychological mechanisms with 
observable behaviour relevant to palpable psychopathology, the 
Internal Working Group proposed ‘units of analyses. These included 
genes, molecules, neural circuits, physiology, behaviour, and self-report 
(the latter including patient verbal reports). The framework took 
shape as a matrix, with the original five domains grouping relevant 
constructs in the rows of the matrix, and ‘units of analysis’ forming 
the columns. An additional column—paradigms—was added to the 
matrix to include tasks and assessments to measure the constructs. 

The next step in the development of the RDoC matrix was the 
collaborative effort of leading scientists across multiple workshops 
to review the evidence and articulate and define each of the do- 
mains. Each of the workshops was focused on evaluating evidence 
for potential constructs and elaborating specific definitions for each 
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construct, including differentiating them from related constructs. 
One workshop was convened for each domain, except for cognitive 
systems, for which there were two because the first RDoC workshop 
piloted the Cognitive Construct “Working Memory’. Proceedings 
from the workshops can be found on the RDoC website (https:// 
www.nimh.nih.gov/research-priorities/rdoc/development-of-the- 
rdoc-framework.shtml). Prior to each workshop, surveys were sent 
to NIMH awardees and psychiatry and psychology department 
chairs across the United States. Survey feedback was presented at 
the start of each workshop. Workshop participants (invited by the 
NIMH, based on expertise in the relevant research domain, as evi- 
denced by published work and funding history) were organized 
into breakout groups to consider and refine the constructs or to 
discard them and propose new constructs. The groups examined 
what was known about the units of analysis for each construct and 
considered what questions remained unanswered, as well as poten- 
tial avenues of research that might answer these questions. These 
considerations were then reported out to each full domain group 
where construct definitions were minted. The end result included 
an annotated listing of the elements that would populate the RDoC 
matrix with respect to the genes, molecules, cells, circuits, physi- 
ology, and self-reports comprising each defined construct, as well 
as the nomination of promising and reliable behavioural tasks 
(paradigms) that could be used to assess function within a con- 
struct. The listings from all six independent meetings, when com- 
bined, comprised the first version of the RDoC matrix (see https:// 
www.nimh.nih.gov/research-priorities/rdoc/rdoc-snapshot- 
version-1-saved-3-7-2016.shtml). 


Structure of the RDoC matrix 


The RDoC matrix provides a research framework to specify fun- 
damental processes of mental illness in biological and behavioural 
units of analyses. The rows of the matrix represent specific dimen- 
sions of function (domains and constructs), and columns represent 
different kinds of observations for study (units of analysis). The con- 
structs embody specified functional dimensions of behaviour and 
are characterized in the aggregate by the genes, molecules, circuits, 
etc. that correspond to certain behaviours. Constructs are, in turn, 
grouped into higher-level domains of functioning, reflecting con- 
temporary knowledge about major systems of cognition, motivation, 
and social behaviour. In its present form, there are six domains in the 
RDoC matrix: ‘Negative valence systems, ‘Positive valence systems, 
‘Cognitive systems, ‘Systems for social processes, and ‘Arousal/regu- 
latory systems, and ‘Sensorimotor systems, the latter domain in- 
corporated in 2018 (‘Table 8.1). The matrix columns specify units of 
analysis used to study the constructs and include genes, molecules, 
cells, circuits, physiology (for example, heart rate or event-related 
potentials), behaviour, and self-reports. In the cells of the matrix are 
specific elements that serve as exemplars which are empirically as- 
sociated with the construct and the corresponding unit of analysis. 
The RDoC domains were selected to be purposefully broad, 
to link directly to psychopathological mechanisms, and for their 
potential for theoretical integration. The framework is directed 
towards constructs most relevant to mental disorders and is not 
intended to span the entire scope of functional behaviour [38- 
41]. The domains are ‘superordinate, and each one includes 
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Table 8.1 RDoC domains, domain definitions, and corresponding constructs and subconstructs 





Negative valence systems 


Positive valence systems 


Cognitive systems 


Social processes 


Arousal and regulatory systems 


Sensorimotor systems 


Activation of the brain's defensive motivational system 
to promote behaviours that protect the organism 
from perceived danger. Normal fear involves a 
pattern of adaptive responses to conditioned 

or unconditioned threat stimuli (exteroceptive 

or interoceptive). Fear can involve internal 
representations and cognitive processing and can be 
modulated by a variety of factors 


Systems primarily responsible for responses to 
positive motivational situations or contexts such 
as reward seeking, consummatory behaviour, and 
reward/habit learning 





Systems are responsible for various cognitive 
processes 


Systems that mediate responses to interpersonal 
settings of various types, including perception and 
interpretation of others’ actions 


Systems responsible for generating activation of 
neural systems as appropriate for various contexts, 
and providing appropriate homeostatic regulation of 
such systems as energy balance and sleep 


Sensorimotor systems are primarily responsible for 
the control and execution of motor behaviours and 
their refinement during learning and development 


Reproduced from https://www.nimh.nih.gov/research-priorities/rdoc/constructs/rdoc-matrix.shtm| 


Acute threat (‘fear’) 
Potential threat (‘anxiety’) 
Sustained threat 

Loss 

Frustrative non-reward 


Reward responsiveness 
Reward anticipation; 
nitial response to reward; 
Reward satiation 

Reward learning 
Probabilistic and reinforcement learning; 
Reward prediction error; 
Habit - PVS 

Reward valuation 

Reward (probability); 
Delay; 

Effort 


Attention 
Perception 
Visual perception; 
Auditory perception; 
Olfactory/somatosensory/multimodal perception 
Declarative memory 
Language 
Cognitive control 
Goal selection; updating, representation, maintenance 
Response selection; inhibition/suppression 
Performance monitoring 











Affiliation and attachment 

Social communication 

Reception of facial communication; 
Production of facial communication; 
Reception of non-facial communication; 
Production of non-facial communication 
Working memory 
Active maintenance; 
Flexible updating; 
Limited capacity; 
nterference control 








Arousal 
Circadian rhythms 
Sleep-wakefulness 


Motor actions 
Action, planning and selection; 
Sensorimotor dynamics; 
Initiation; 
Execution; 
Inhibition and termination 
Agency and ownership 
Habit - sensorimotor 
Innate motor patterns 


multiple, more specific constructs and subconstructs (indented 
in the third column of Table 8.1). In devising and defining the 
constructs that would comprise each domain, workshop mem- 
bers were asked to consider three criteria when evaluating the 
merits of a construct: (1) sufficient evidence for the validity of 
the construct as a functional unit of behaviour or cognitive pro- 
cess; (2) sufficient evidence for a neural circuit or system that 


played a primary role in implementing the construct’s function; 
and (3) sufficient relevance for understanding some aspects of 
psychopathology [38]. 

The current set of constructs is focused on, and constrained by, 
circuit definitions to avoid overspecification and proliferation 
of constructs. The intent is not to arbitrarily exclude constructs, 
but rather to foster thinking about how constructs are related 


across observations of different systems (that is, units of analysis). 
Acknowledged are the complexities of the brain and behaviour, such 
that circuits and constructs will necessarily overlap and interrelate, 
such that, to some degree, arbitrary separations are unavoidable. The 
spelling out of interrelations among constructs and across units of 
analysis will help to refine or otherwise clarify the constructs in a 
bootstrapping fashion. In fact, the framework is meant to foster, not 
discourage, research that explicates mechanisms within and across 
the constructs as listed. As such, RDoC research design is likely to 
foster interdisciplinary research collaborations. 

The RDoC matrix is best viewed as a work in progress, and not 
as fait accompli. The matrix is not a finite determination of the field 
of psychopathology, and by no means is—or was it ever—meant to 
be a comprehensive framework of psychopathology to be cast in 
stone. In other words, constructs (and subconstructs), as well as 
the tasks that are listed in the matrix, are best viewed as exemplars. 
From its inception, we have indicated that the matrix is expected 
to undergo revision and updates with accumulating data, and the 
understanding of psychopathological systems and symptoms is rec- 
ognized. Because the RDoC was initiated by the NIMH, the National 
Advisory Mental Health Council (NAMHC) oversees revisions. The 
NAMHC has covened a committee—the ‘Changes to the Matrix 
Council’ (CMAT) Workgroup—to evaluate alterations, updates, 
and additions to the matrix. Recent revisions include the addition 
of the earlier mentioned ‘Sensorimotor systems’ domain. There have 
also been revisions motivated by the accumulation of new scientific 
evidence, as well as for purposes of clarity, for instance an update 
to the ‘Positive valence systems’ domain. The CMAT Workgroup 
is provided with resources to consult with experts in the field and 
to convene meetings as appropriate. All revisions are detailed and 
available on the RDoC website (see https://www.nimh.nih.gov/ 
about/advisory-boards-and-groups/namhc/reports/rdoc-changes- 
to-the- matrix-cmat- workgroup-update-addition- of-the- 
sensorimotor-domain.shtml and https://www.nimh.nih.gov/about/ 
advisory-boards-and-groups/namhc/reports/cmat-pvs-report- 
508_157003.pdf, respectively, for these changes). 

Some have questioned whether the number of constructs is too 
sparse. In framing the RDoC, the goal was to include relatively high- 
level constructs to avoid overspecification of functions that could 
become unwieldy and thereby necessitate unnecessarily frequent 
revisions to the list or otherwise constrain investigators as research 
progresses. As one obvious example, a single ‘Perception construct 
is listed that includes visual, auditory, and other sensory modalities. 
Defined in this way, the constructs are more easily subject to refine- 
ment as new research emerges. Potential constructs not specified in 
the matrix can be studied with the RDoC framework. As new con- 
structs are considered, the three criteria outlined should be used 
to evaluate their merits in order to achieve the goals of the RDoC 
initiative [38]. 


Units of analysis 


The columns of the matrix represent the units of analysis typically 
employed in psychopathology research. The term ‘units’ was delib- 
erately chosen over the term ‘levels’ of analyses, so as not to priv- 
ilege any one type of observation or to imply a reductionist ideology 
(see [42] for a detailed discussion of these issues). As the RDoC 
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intends to promote integration of knowledge across multiple dis- 
ciplines, the framework emphasizes the integration of knowledge 
about genes, cells, and circuits, with knowledge about cognition, 
emotion, and behaviour. The ‘Circuit’ unit of analysis refers to meas- 
urements of particular neural circuits, as studied by neuroimaging 
techniques or other measures, such as event-related potentials with 
established source localization, that have been validated by animal 
models or functional neuroimaging. The ‘Physiology’ unit of ana- 
lysis refers to measures, such as heart rate and cortisol levels, that are 
well-established indices of certain constructs but that are not direct 
measures of neural circuits. The ‘Behaviour’ unit of analysis can refer 
variously to behavioural tasks (for example, a working memory task) 
or to systematic behavioural observations (for example, a toddler 
behavioural assessment). The ‘Self-report’ unit of analysis refers to 
interview-based scales, self-report questionnaires, or other instru- 
ments that may encompass normal-range and/or abnormal aspects 
of the dimension of interest. 

When initially constructed, the units of analyses for the ‘genes’ 
column included candidate genes relevant to psychopathology. The 
idea was that connecting certain gene products to physiology would 
help to clarify specific aberrations in psychopathology. However, 
some researchers have raised questions about statistical power 
issues and the replicability of candidate gene studies (see [43]; see 
also [44]), and“... the current state of the field emphasizes the need 
for robust evidence of association, generally resulting from ad- 
equately powered genome wide association studies, as opposed to 
candidate gene approaches’ [45]. Interestingly, during the original 
RDoC workshops, questions about how to distinguish genes from 
molecules (that is, gene products) were raised on multiple occasions. 
As evidence and various approaches to genetic studies are further 
evaluated, this column of the matrix will be updated accordingly. 


Paradigms (tasks) 


The matrix includes a separate column to specify well-validated, re- 
liable tasks and measures, or ‘paradigms, used in studying each con- 
struct. These paradigms may be relevant for more than one unit of 
analysis, and rather than list them in separate columns, they are in- 
cluded under the ‘Paradigms’ heading. A National Advisory Mental 
Health Council (NAMHC) workgroup was assembled and charged 
with developing recommendations for paradigms for each con- 
struct, according to several critical considerations. Their work con- 
sidered the development of exemplar tasks and led to revisions in the 
matrix (the full report can be found at: https://www.nimh.nih.gov/ 
about/advisory-boards-and-groups/namhc/reports/rdoc_council_ 
workgroup_report_153440.pdf). 





Development and environment 


Since its inception, the RDoC has held that developmental processes 
and environmental influences are inseparable from the consid- 
eration of psychopathology [46]. However, a logistical problem is 
created if one assumes the two-dimensional RDoC matrix to be a 
complete model, rather than an organizational framework, in which 
case the absence of explicitly representing development and envir- 
onment in the matrix would indeed be problematic. For research 
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purposes, empirical aspects of the matrix may be incorporated into 
the developmental theory. In this way, the RDoC additionally pro- 
vides a platform to organize the connections of research findings, or 
the development of a nomological network, conducted from various 
theoretical paradigms. Investigators are demonstrating the utility 
of the RDoC for developmental psychopathology research, as evi- 
denced by the substantial proportion (approximately half, to date) of 
NIMH-funded RDoC grants that focus on the role of developmental 
processes and/or environmental factors in psychopathology. 

Understanding developmental trajectories across various phases 
of the lifespan represents a critical consideration that is implicit 
for the RDoC and might be thought of as a third dimension in the 
overall framework. To fully explore the contributions of develop- 
ment to RDoC constructs, investigators must bear in mind the 
complexities of development. Not all capacities develop in a linear 
manner, and understanding the interaction between constructs as 
they appear, mature, and differentiate is important. Characterizing 
these developmental trajectories will help inform how development 
occurs, which systems come online before or after others, and how 
they interact differentially across development. It will likewise be 
important for investigators to explore the continuities (and discon- 
tinuities) that exist across development itself, with some behaviours 
being normal at certain developmental stages, but a sign of mental 
illness at others. Relatedly, the concept of sensitive periods, when 
the effects of particular experiences have a strong influence on brain 
and behaviour, is key to understanding how the timing of events can 
impact the risk for atypical development [46, 47]. 

The types of constructs typically found in the literature on human 
neurodevelopment are similar to several RDoC domains, and many 
areas of the child psychopathology literature (for example, reward 
sensitivity, cognitive and emotional dysregulation, behavioural in- 
hibition) serve as a more compatible model for a dimensionally 
based approach, compared to the highly specified categories of adult 
psychopathology. The goal is to place these constructs in the context 
of theories of development (for example, [48]). A more comprehen- 
sive understanding of the trajectories that lead to disorders, particu- 
larly in the context of quantified measures that can validly identify 
pre-symptomatic dysfunction, will aid early prevention efforts. In 
addition, the RDoC’s focus on integrative methodologies may lead 
to earlier detection of psychopathology. By not relying on self-report 
as the ‘gold standard’ deviations in developmental processes may be 
detectable before they are observable as overt clinical symptoms. 

The importance of environmental influences on psychiatric 
illnesses is incontrovertible, and RDoC-based research promotes a 
systematic focus on development and the environment—as well as 
with their mutual interactions—and their individual and interacting 
relationships to specific circuits and functions [38]. One way to ap- 
proach the environment is to equate it with the variable of time, 
typically represented in science as ‘the fourth dimension’ Just as 
empirical and theoretical interrelations are expected to be speci- 
fied across units of analyses and constructs, environmental factors, 
including those reciprocating with developmental changes, can be 
theoretically spelt out—and empirically tested—with changes in the 
elements across time. 

Older ideas about environmental influences have given way to ad- 
vances in neuroscience that articulate how stress and deprivation can 
have a negative impact, including epigenetic factors. Abnormalities 
in stress responses can result from maladaptive maturation of the 


nervous system, including its interaction with the wide variety of 
external influences beginning at conception. The social and phys- 
ical environment comprises sources of both risk and protection for 
many different disorders occurring at all points along the lifespan, 
and methods for studying phenomena such as gene expression, 
neural plasticity, and various types of learning are rapidly advancing. 
As with developmental aspects, environmental influences may thus 
be considered as another critical element of the RDoC matrix. 
Particular environmental stressors, such as early child abuse, may 
increase risk for a wide variety of disorders. Environmental effects 
are most ideally studied bidirectionally; for example, an individual’s 
behaviour affects his/her social environment (for example, family 
or friends), which, in turn, affects the nature of others’ behaviour 
towards the individual. Research organized around the relevant 
circuit-based dimensions that are affected, independent of a par- 
ticular disorder, may thereby accelerate knowledge regarding envir- 
onmental influences across multiple units of analysis. 


RDoC research design 


What does RDoC research look like? Early in our formulations of 
how one might conduct RDoC research, we emphasized the idea of 
choosing a ‘sampling frame’ or a study group of participants that 
varied widely enough across a spectrum of symptoms or clinical 
problems that no relevant mechanisms would be excluded. For in- 
stance, a researcher interested in studying fear mechanisms might 
take ‘all comers’ at an anxiety disorders treatment clinic (regardless 
of DSM-5 diagnosis) and then use results from a task or paradigm 
to determine different study groups or dimensions, for instance, the 
degree to which a fear response could be extinguished through an 
extinction paradigm. Even though the dimensions being studied 
would not involve DSM-5 diagnoses per se, those diagnoses could 
be adjunctively tracked and used to draw trans-diagnostic connec- 
tions. The latter would assist in future versions of clinical diagnostic 
manuals. 

A sometimes mistaken notion drawn by investigators is that the 
RDoC necessitates the complete exclusion of DSM diagnoses. This is 
not the case. For example, investigators might select participants on 
the basis ofa single DSM diagnosis to clarify component dimensions 
of that diagnosis, for instance, affective dysregulation among patients 
diagnosed with BPD. For the RDoC, the caveats of this approach 
would be that inclusion criteria should not be overly restrictive. 
For example, in this case, high rates of diagnostic co-occurrence 
would be expected and co-occurring diagnoses should not be used 
as a basis of exclusion, lest important variation in systems relevant 
to the regulation of affect be eliminated. Also, understanding of af- 
fective dysregulation would be expected to generalize—and might 
be studied in follow-up—to those suffering similar problems, but 
not necessarily diagnosable with BPD. 

Research that examines features that cross-cut traditional diag- 
nostic categories or trans-diagnostic research are consistent with the 
RDoC approach. Several groups of researchers have reported find- 
ings using trans-diagnostic approaches that have identified trans- 
diagnostic features for psychotic disorders (for example, Sabharwal 
etal.,2017 [49]), obsessive-compulsive disorders (for example, [50]), 
and attention-deficit/hyperactivity disorder (for example, [52]), and 
depression (for example, [52]). Similarly, for depression, Liston and 


colleagues have identified four biotypes of depression using compu- 
tational approaches [53]. Researchers have been making advances 
in precision medicine by clarifying such cross-cutting dimensions 
to serve as treatment targets (for example, [54]). With an RDoC ap- 
proach, these dimensions provide a new beginning and can serve 
as a starting point for defining participant groups or dimensions 
to serve as independent variables for study, in lieu of traditional 
diagnostic categories or dimensions (see [55]). Tamminga and col- 
leagues identified biotypes from a collective sample across bipolar 
disorder, schizoaffective disorder, and schizophrenia and found that 
the biotypes were more predictive of functional impairment [56]. 
Their approach exemplifies the RDoC approach. 


Change is afoot 


Collectively, the field of psychiatry, particularly the area of diag- 
nosis, is in a state of unrest that historically characterizes the con- 
ditions under which a paradigm change is afoot [57]. Motivations 
for NIMH efforts with the RDoC are evident in other quarters. The 
authors of DSM-5 had strived for the ambitious goal of a paradigm 
change [58]. For a number of reasons, some logistical and some 
based on the inherent structure, as discussed in earlier parts of 
this chapter, this goal was not achieved. However, the cross-cutting 
symptom checklists for DSM-5 are currently being developed, with 
evidence of good reliability [59], including dimensional character- 
istics of severity, number, and duration of symptoms (for example, 
[60, 61]). The alternative model of personality disorders provides 
another example where the DSM-approach is progressing [62]. 

Decades of work on structural approaches to psychopath- 
ology have now been codified in the Hierarchical Taxonomy of 
Psychopathology (HiTOP) [63]. As is the case with the RDoC, the 
HiTOP approach reorganizes clinical symptoms found throughout 
the DSM agnostic to the original diagnostic categories; however, 
links to cross-validate these constructs with internal mechanisms 
(biological or psychological) are limited [64]. Early efforts on this 
front are promising, especially the psychoneurometric approach 
developed by Patrick and colleagues [65, 66]. Observations from 
computational neuroscience working from the “bottom up’ may be 
particularly informative to help spell out a theoretical understanding 
of these connections [67, 68]. Finally, an international effort has been 
undertaken with the Neuroscience-based Nomenclature (NbN) 
[69] (see also Chapter 9). The NbN offers a template for clinicians to 
prescribe, based on common symptom sets of familiar mental dis- 
orders, but by aggregating symptoms on the basis of how they con- 
nect to neurobiological systems. 


Going forward with the RDoC 


The RDoC was developed out of the recognition by NIMH leader- 
ship that something radically different was needed to effect change 
in research, so that advances in integrative neuroscience could 
inform the understanding of biological and psychological processes 
involved with human suffering and dysfunction. From its inception, 
the NIMH has emphasized that the RDoC depends on input from 
the field and on strong empirical support for both its structure and 
research executed with the RDoC framework. The degree to which 
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the RDoC will be embraced by the field—or perhaps, more import- 
antly, the ways that the field will take hold of the RDoC and use it 
to move research forward—will ultimately be determined by scien- 
tific peer review of funding applications and journal publications. To 
this end, the NIMH/RDoC have relied on the NAMHC to evaluate 
the future development of the RDoC. New generations of clinical 
scientists will also determine the future of the RDoC, and early in- 
dications suggest that the RDoC offers a viable means to teach trans- 
lational neuroscience [70]. 

A second way in which the RDoC approach relates to the field 
concerns data sharing. The NIMH Data Archive (NDA) (see https:// 
data-archive.nimh.nih.gov/) houses and shares with qualified re- 
searchers harmonized, item-level data of all types and concerning all 
levels of biological and behavioural organization. The NDA is made 
up of several repositories, including the RDoC Database (RDoCdb) 
(see https://data-archive.nimh.nih.gov/rdocdb). The RDoCdb pro- 
vides an infrastructure for sharing subject-level data and serves as a 
resource for hypothesis testing and exploration. The NIMH encour- 
ages the use of these resources to achieve rapid scientific progress. 
Sharing data, associated tools, and methodologies, rather than just 
their summaries or interpretations, accelerates research progress 
by allowing re-analysis of data, as well as re-aggregation, integra- 
tion, and rigorous comparison with other data, tools, and methods. 
Data from more than 16,000 research participants have already been 
received. 

To facilitate data sharing and the combination of large data sets, 
in order to fully understand the mechanisms underlying the psy- 
chopathology, it is important to develop a set of paradigms and 
measures that are generally accepted by the field and can ensure that 
constructs are being measured using systematic and reliable means. 
As mentioned earlier, the NAMHC workgroup that was formed to 
look critically at the Paradigms’ column of the RDoC matrix offered 
recommendations for currently existing tasks that adequately 
measure each construct. An incentive was to have a set of common 
data elements to be used widely to allow for future aggregation of 
RDoC data. The resulting report includes an abundance of informa- 
tion from domain area experts, including the items they considered 
when discussing each task, the overall organization of each domain, 
and recommendations of tasks that are useable, that need more 
work, and that should not be used for each construct. 

In contrast to clinical diagnostic manuals, as a research frame- 
work, the RDoC has the luxury to be more nimble, building on re- 
cent findings to test new hypotheses. In contrast, changes in clinical 
diagnostic manuals have immediate implications for patients in 
terms of treatment prescriptions or reimbursement, as well as dis- 
ability services or accommodations. Even with the aspiration to up- 
date changes in the present DSM-5 more expeditiously, processes 
described so far (for example, [71]) still rely on consensus vetting 
procedures and will need to address the implications of change for 
patient groups. The RDoC, in contrast, can advance new research 
ideas that, once fully formed, may be utilized to inform clinical diag- 
nostic manuals. One limitation of RDoC research that will need to 
be addressed for treatment research will be the way that RDoC con- 
structs will be handled by regulatory bodies. 

As the RDoC matrix became established and understood by the 
field, changes to the matrix have become possible. Recently, some 
‘cosmetic’ changes to the matrix have been made, taking advantage 
of web-based tools to link constructs and elements of the matrix for 
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amore fluid and intuitive design, allowing for ease of future growth 
and expansion, along with providing a space to elaborate on cer- 
tain elements and constructs as data emerged. The CMAT and other 
appointed workgroups overseen by the NAMHC will continue to 
be tasked with evaluating matrix updates and revisions, including 
the evaluation of new domains and constructs. The hope is that the 
RDoC matrix will reflect the needs and growth of the field, with 
transparent input from the scientists who use it most. With the 10- 
year anniversary of the RDoC, the RDoC Unit and working groups 
have their ears open to the field and their eyes on new and innova- 
tive ways to help the RDoC grow and adapt, as psychopathology re- 
searchers work to build systematic ties to advances in neuroscience. 
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More than 20 years ago, evidence-based medicine was defined as 
‘the conscientious, explicit, and judicious use of current best evi- 
dence in making decisions about the care of individual patients, and 
the integration of clinical expertise and patient values’ [1]. Three 
components are essential: (1) the current best research evidence, 
which is found in methodologically sound and clinically relevant re- 
search reports; (2) clinical expertise, which relates to a clinicians cu- 
mulative experience, education, and clinical skills; (3) patients, who 
bring their own personal preferences, unique concerns, and values 
to the clinical encounter. The full integration of these three compo- 
nents improves clinical outcomes and quality of life. Evidence-based 
medicine aimed at closing the gap between research and practice 
by incorporating the advances in clinical epidemiology and medical 
knowledge into clinical activities [2]. The reason why evidence-based 
medicine was developed and adopted worldwide is for clinicians to 
keep abreast of the latest therapeutic advances, cope with rapidly 
changing policies, and face increasing public expectations and de- 
mands [3]. The best course of action is to use an evidence-based 
approach, because it optimizes clinical decisions (making practice 
more standardized and more efficient) and also justifies them [4]. 
This applies to all fields in medicine, including psychiatry. 

The idea that clinical practice should be based on good evidence 
is not new [5, 6]. Over the years, evidence-based medicine has 
been criticized because its ‘quality mark’ has been misappropriated 
by vested interests and the volume of evidence, especially clinical 
guidelines, has become unmanageable [7]. However, by providing 
clinicians with a set of skills which allow them to base clinical deci- 
sions on the best available and most up-to-date evidence, evidence- 
based medicine is not only a useful tool for clinical practice, but also 
an efficient method of self-directed, career-long learning [8]. 

Evidence-based medicine has been listed among the most im- 
portant 15 medical advances since 1840 [9] (Kamath 2016), and 
nowadays mental health professionals are encouraged to use an 
evidence-based approach in their daily activities [10]. However, it 
is known that one of the major challenges for clinicians is to move 
from the theory of evidence-based medicine to the practice of it. 





A study, now over 30 years old, estimated that, on average, five clin- 
ical questions are raised at each bedside encounter with a patient 
[11]. In 1998, it was estimated that only 4% of health care decisions 
were based on sound evidence, 45% on strong consensus among 
physicians, and 51% in neither [4]. Evidence-based practice requires 
new skills of the clinician, including efficient literature searching 
and the application of formal rules of evidence in evaluating the 
clinical literature. The implementation of evidence-based practice 
involves four steps [12]: 


e To frame a clear question, based on a clinical problem (which 
arises from the care of one patient). 

e To search for the relevant best available evidence in the literature 
(selecting the appropriate resource). 

e To critically appraise the validity (closeness to the truth) and 
applicability (usefulness in clinical practice) of the retrieved 
evidence. 

e To apply the findings to clinical decision-making and routine 
practice (integrating the evidence with clinical expertise and 
patient’s preferences and values). 


‘These steps are referred in some textbooks as the 5 As: Assess (the 
patient), Ask (the question), Acquire and Appraise (the evidence), 
and Apply (talk with the patient), with the fifth step being the all- 
important evaluation of the clinician’s own performance in consult- 
ation with the patient (the so-called self-evaluation). 

In this chapter, we will explore how research evidence should be 
used to inform real-world practice, taking into account the perspec- 
tive of both the practising clinician and the clinical researcher. 











Decision-making processes are enhanced through the use of valid 
and reliable evidence. Evidence is the information we gather on 
which to base, or support, our decisions. It is therefore important 
to be able to determine which evidence is ‘the best evidence, that 
is, the most valid and reliable. This term is used quite frequently in 
medical journals and international meetings, and it basically refers 
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to the methodological quality of the study design that produces the 
information we are looking for, in relation to the clinical question. 
Studies from the scientific literature can be broadly categorized ei- 
ther as experimental (when the investigators assign the exposures) 
or observational (when investigators observe usual clinical practice) 
[13]. Experimental studies become randomized trials only when the 
exposures are assigned by a random technique and the allocation of 
the upcoming assignment is concealed. Experimental and observa- 
tional studies are placed into a hierarchy which may vary, depending 
on the specific clinical question to address, but only the study de- 
signs at the top are considered ‘the best evidence. For example, 
in the case of assessing the efficacy of interventions, randomized 
controlled trials are at the top of the ranking [14]. Randomization 
approximates the controlled experiment of basic science, and it is 
the only known way to avoid selection and confounding biases in 
clinical research. Bias (that is, systematic error) can confound the 
outcome of a study, such that the study may overestimate or under- 
estimate what the true treatment effect is. Within a sample popula- 
tion, a properly conducted randomization is able to reduce the risk 
of selection bias by not only controlling for known, but also (and 
possibly more importantly) controlling for unknown prognostic 
variables. This bias-controlling measure helps attain a more accurate 
estimation of the truth. 

Randomized controlled trials have drawbacks too. For instance, 
a methodologically sound randomized controlled trial can have 
good internal validity (that is, the study measures what it sets out 
to measure), but it might not have ‘good enough’ external validity 
(that is, the extent to which results from the individual study can 
be generalized to the broader community of patients outside the 
trial). Internal validity relates to whether the study answers its re- 
search question correctly, that is, in a manner which is free from 
bias (and researchers and clinicians should learn how to appraise 
the validity of such studies). By contrast, external validity is closely 
connected with the applicability of a study’s findings, and its as- 
sessment depends on the purpose for which the study is to be used. 
Trials are usually conducted by industry on single patented com- 
pounds and are designed to meet the requirements of the regulatory 
agencies. In order not to delay the introduction of drugs onto the 
market, the majority of such trials are short term, highly controlled, 
highly monitored, and typically designed to separate between a 
new drug and placebo as efficiently as possible. This approach to 
clinical trial design has been described as explanatory, in which 
the aim is to determine if the treatment can work [15]. Patients in 
these trials are characteristically highly selected, and outcomes are 
essentially sensitive proxies of clinically relevant outcomes. Even 
for pharmaceutical companies, the risk of mounting a very large 
expensive clinical trial on a promising compound has to be offset 
against the risk that such a trial will demonstrate that the investi- 
gational compound is either of limited or no efficacy or causes ser- 
ious adverse effects. Pharmaceutical companies are therefore under 
pressure to do everything they can to design the trial to show their 
product in the optimal light and to ensure that any positive results 
have maximum impact, leading to as rapid uptake of the new drug 
as possible following market authorization. Because the design of 
these trials is based on a negotiation between regulators and the in- 
dustry, these trials often have substantial limitations for answering 
clinical questions in the real world, and hence limited clinical ex- 
ternal validity. The results of the available evidence often need to 


be extrapolated—to an unknown degree—when they are applied to 
the target clinical question. 

Finally, randomized controlled trials are usually very expensive, 
take many years to be conducted (from conception to publication), 
and cannot be used in some instances (for example, in special popu- 
lations such as pregnant women who should not be exposed to po- 
tentially harmful substances, as this would be unethical). 

It has been recognized that much larger trials are required, and 
trial procedures should be very simple and efficient, allowing wide- 
spread recruitment by practising clinicians [16]. If representative 
samples of patients are enrolled in the trial, the results will be of wide- 
spread applicability to future patients. Inclusion criteria are broad 
and as unrestrictive as possible. The key entry criterion is that both 
the patient and the investigator are substantially uncertain which 
of the trial treatments would be most appropriate. This ‘uncertainty 
principle’ is ethical because it effectively excludes patients for whom 
a specific treatment is known to be the most appropriate. All trial 
procedures are radically simplified. Keeping the trial procedures and 
data collection to an absolute minimum is essential to achieve wide- 
spread participation, and hence the required sample size for real- 
izing the key objectives of the trial. Such trials are termed pragmatic 
[17] and, in contrast to the question of “Can the treatment work?’ 
answered by the explanatory trial?, they ask: “Does the intervention 
work?’ [18] Pragmatic trials should not be seen as a research de- 
sign that should replace explanatory trials. There is a continuum be- 
tween explanatory and pragmatic trials and, ideally, pragmatic trials 
should be conducted after the results of explanatory trials allowed a 
new medicine to enter the market. Important examples of pragmatic 
trials in psychopharmacology include the Bipolar Affective dis- 
order: Lithium/ANticonvulsant Evaluation [19], the European First 
Episode Schizophrenia Trial [20], the Clinical Antipsychotic Trials 
of Intervention Effectiveness [21], the Cost Utility of the Latest 
Antipsychotic Drugs in Schizophrenia Study [22], and Clozapine 
plus Haloperidol or Aripiprazole Trial [23]. 


How to collect the best available evidence 


The number of randomized studies is growing, so too does the sci- 
ence of reviewing trials [24]. The history of synthesizing research is 
inextricably bound up in the history of evidence-based medicine, 
but it actually goes back centuries. The pioneer in the field was James 
Lind, a Scottish naval surgeon, who is credited not only with having 
produced one of the early records of a scientific trial, but also with 
having written one of the first systematic reviews of evidence. In 
1747, Lind took 12 patients with scurvy, whose cases ‘were as similar 
as I could have then? and divided them into six groups of two [25]. 
According to his planned intervention, Lind administered six dif- 
ferent treatments to each pair of sufferers: (1) cider; (2) elixir vitriol; 
(3) vinegar; (4) seawater; (5) a combination of oranges and lemons; 
and (6) a mixture of garlic, mustard seed, and balsam of Peru. 
A week later, Lind’s findings were clear: “The result of all my experi- 
ments was that oranges and lemons were the most effectual remedies 
for this distemper at sea’ (http://www.jameslindlibrary.org). The re- 
sults of this study were published a few years later, and interestingly, 
Lind wrote: ‘As it is no easy matter to root out prejudices ... it be- 
came requisite to exhibit a full and impartial view of what had hith- 
erto been published on the scurvy ... by which the sources of these 


mistakes may be detected. Indeed, before the subject could be set in 
a clear and proper light, it was necessary to remove a great deal of 
rubbish’ To gather the available research, get rid of the ‘rubbish, and 
summarize the best of what remains are essentially the science of a 
properly conducted systematic review. It took a while to understand 
the value and the methodological implications of Lind’s attitude; 
however, in the past century, there were eminent figures, like Archie 
Cochrane, a British epidemiologist, who persuasively advocated the 
scientific evaluation of commonly used medical therapies through 
objective sources of information [26]. Even though the importance 
of evidence synthesis in medicine was recognized since the 1970s, 
the widespread use of these systematic reviews and meta-analyses 
did not occur until 1990s, when it became clear that the judgements 
and opinions of experts were often biased [27]. 

There are two main approaches to reviewing literature: narrative 
reviews and systematic reviews [28]. 

Narrative reviews are the old, traditional approach. They are 
broad in scope and qualitative and do not include a method section 
(where the authors describe how they retrieved the relevant infor- 
mation); therefore, they are mainly based on the experience and 
subjectivity of the authors, who are often experts in the area. The ab- 
sence of an objective method section, together with the lack of clear 
and replicable criteria, leads to a number of methodological flaws 
(for instance, the bias in selecting the scientific literature), which 
can materially affect the author’s conclusions. The results from nar- 
rative reviews must be viewed with suspicion, as they are likely to 
be misleading (and one of the major problems is that the extent to 
which they are unreliable is almost impossible to judge). 

By contrast, systematic reviews (or sometimes called ‘overviews’) 
are syntheses of primary research studies and are qualitative and 
focused around clinical questions. They describe and use specific, 
explicit, and therefore reproducible, methodological strategies to 
identify, assemble, and critically appraise all relevant issues on a 
specific topic [29]. Systematic reviews should be designed to answer 
only a narrow and specific question, because, if the question is too 
broad or too generic, it will be impossible for the researchers to col- 
late and assess all the relevant information. The nature of the ques- 
tion determines the optimal primary study design, and hence the a 
priori inclusion and exclusion criteria. 

If the question concerns efficacy and safety of an intervention 
(Which treatment is better? Which dose is better tolerated?), the most 
reliable study design would be a randomized controlled trial— 
randomization avoids any systematic tendency to produce an un- 
equal distribution of prognostic factors between the experimental 
and control treatments, influencing the outcome [30]. However, 
randomized controlled trials are certainly not the most appropriate 
research design for all questions [31]. For example, for aetiological 
questions, it would be neither possible nor ethical to randomize 
subjects to many harmful exposures—systematic reviews would 
therefore need to include cohort and case-control studies. Likewise, 
a diagnostic question such as ‘How well can a screening tool identify 
patients with a psychiatric disorder? would be best answered by a 
cross-sectional study of patients at risk of being ill [32]. Systematic 
reviews of these other study designs have their own methodo- 
logical problems—guidelines exist for undertaking reviews (and 
meta-analyses) of diagnostic tests [33] and the observational epi- 
demiological designs used in aetiological research [34]. The review 
protocol is where these criteria are reported and defined before the 


CHAPTER 9 Application of research evidence in clinical practice 


review starts, and they should be followed by the researchers to 
select the relevant studies. 

There is a further difference between narrative reviews and sys- 
tematic reviews. New research data emerge continuously, and re- 
views are useful if they are up-to-date. This tends not to be the case 
with conventionally published reviews, but the systematic reviews 
of the Cochrane Library, for instance, are published electronically 
and periodically updated to take into account the emergence of new 
evidence (http://www.cochranelibrary.com/). There is evidence that 
systematic reviews improve the reliability and accuracy of conclu- 
sions; however, the results are rarely unequivocal and require careful 
assessment and interpretation [35]. Evidence-based practice is not 
cookbook medicine [36], also because clinicians need to integrate 
the results from systematic reviews with the clinical situation, their 
own expertise, and patients’ preferences. 

Despite their potential to avoid bias, a number of factors can ad- 
versely affect the conclusions of a systematic review. When con- 
ducting a primary study, it is important to ensure that the sample 
recruited is representative of the target population; otherwise, the 
results may be misleading (selection bias). The most significant form 
of bias in systematic reviews is analogous to selection bias in primary 
studies but applies to the selection of primary studies, rather than 
participants. There are various forms of selection bias, including 
publication bias, language of publication bias, and biases introduced 
by an overreliance on electronic databases. 

Publication bias is the tendency of investigators, peer reviewers, 
and also journal editors to differentially submit or accept manu- 
scripts for publication, based on the direction or strength of the 
study findings. The conclusions of systematic reviews can be signifi- 
cantly affected by publication bias. The potential pitfalls of publica- 
tion bias are obvious—if only studies which demonstrate a treatment 
benefit are published, the conclusions may be misleading if the true 
effect is neutral or even harmful. As early as 1959 it was noted that 
97.3% of articles published in four major journals had statistically 
significant results, although it is likely that many studies were con- 
ducted which produced non-significant results—but these were 
less likely to be published [37]. Various strategies have been pro- 
posed to counter publication bias. These include methods aimed at 
detecting its presence and preventing its occurrence. It is generally 
accepted that prevention is likely to be the most effective strategy, 
and therefore, prospective registries of all clinical studies were estab- 
lished. This means that nowadays a record of a trial exists, regardless 
of whether or not the study is published, and it reduces the risk of 
negative studies disappearing. Registries of ongoing research have 
been slow to establish—perhaps because it is not clear who should 
take a lead or fund them—although some of them are working ef- 
ficiently (for example, http://www.controlled-trials.com or http:// 
www.clinicaltrials.gov). While such registries are useful for newer 
or relatively new trials, they do not solve the problem of retrospect- 
ively identifying unpublished primary studies. A number of methods 
for estimating the likelihood of the presence of publication bias in 
a sample of studies have been developed. One commonly used way 
of investigating publication bias is the funnel plot [38]. In a funnel 
plot, the study-specific odds ratios are plotted against a measure 
of the study’s precision—such as the inverse of the standard error or 
the number of cases in each study. There will be more variation in 
the results of small studies because of their greater susceptibility to 
random error, and hence, the results of the larger studies, with less 








SECTION 1 The subject matter and approach to psychiatry 


random error, should cluster more closely around the true value. If 
publication bias is not present, the graphical distribution of odds 
ratios should resemble an inverted funnel. If there is a gap in the re- 
gion of the funnel where the results of small negative studies would 
be expected, then this would imply that the results of these studies are 
missing. This could be due to publication bias or mean that the search 
failed to find small negative studies. 

Restricting a search to one language—for example, searching only 
for English language papers—can be hazardous. It has been shown 
that studies which find a treatment effect are more likely to be pub- 
lished in English language journals, while opposing studies may be 
published in non-English language journals. Language of publication 
bias has also been called the Tower of Babel bias [39]. 

With the increasing availability of convenient electronic biblio- 
graphic databases, there is a danger that reviewers may rely on them 
unduly. This can cause bias because electronic databases do not 
offer comprehensive or unbiased coverage of the relevant primary 
literature. For instance, on investigating the adequacy of Medline 
searches for randomized controlled trials in mental health care, it 
has been shown that the optimal Medline search had a sensitivity 
of only 52% [40]. Sensitivity can be improved by searching other 
databases, in addition to Medline, for example Embase, PsycLIT, 
PSYNDEX, CINAHL, and Lilacs. To avoid the limitations of relying 
on electronic databases—or any other resource—for the identifi- 
cation of primary studies, reviews seek to use optimally sensitive, 
over-inclusive searches to identify as many studies as possible with 
a combination of electronic searching, hand searching, reference 
checking, and personal communications. Unpublished data refer to 
studies that are not published at all, but they may also refer to infor- 
mation that are not included in study reports published in scientific 
journals. Each study has a unique ID number. Trial registration is 
requested for study approval by local ethics committees and as a pre- 
condition for publication in many journals, so a full report of the 
study protocol, and sometimes of study results, can be found using 
the Internet. Clinical trial registries and websites of regulatory agen- 
cies are one option, but the most informative sources of unpublished 
data are pharmaceutical industries’ websites. Not all companies have 
study registries, and not all the companies that have study registries 
report results in an easy-to-use and comprehensive format. Having 
access to this information may be useful also when the published 
paper is available, because it can help retrieve some missing infor- 
mation or, in case of discrepancies or inaccuracies, clarify the exact 
figures (for instance, when changes in rating scales are reported only 
in figures or graphs). 

As a research tool, a systematic review of the literature can be ap- 
plied to any form of research question. Results of systematic reviews 
can reliably and efficiently provide the information needed for ra- 
tional clinical decision-making. However, the main conclusion of a 
systematic review may often be that there is little or even no good- 
quality information in the literature. Even this conclusion may be 
useful in that it highlights areas in need of further primary research. 





How to synthesize the best available evidence 


The terms ‘systematic review and ‘meta-analysis’ are often used 
interchangeably, but actually they refer to different things. Systematic 
reviews are reviews of primary research studies, in which specific 


methodological strategies that limit bias are used and clearly de- 
scribed in the protocol. By contrast, a meta-analysis is a statistical 
method to combine and summarize the results of several studies re- 
trieved by a systematic review into one single estimate. This method 
of pooling research data from more than one study provides an es- 
timate of effect size, which has greater power than any of the con- 
stituent studies. This has obvious advantages for clinical research 
and practice, because it minimizes random error and produces 
more precise, and potentially more generalizable, results. However, 
meta-analyses are not an essential part of a systematic review be- 
cause sometimes it may be inappropriate to proceed to a statistical 
summary of the individual studies (for instance, when studies are 
not similar enough to be lumped together). Meta-analyses make the 
assumptions that clinically important differences between treated 
and untreated groups are not so large as to be intuitively obvious 
and that, although the treatment’s effect varies in size among dif- 
ferent patient populations, the effect’s direction is consistently either 
beneficial or harmful [41]. A meta-analysis can estimate the degree 
of random error by presenting the quantitative results of individual 
studies and the ‘pooled’ weighted average result, with a confidence 
interval that provides a good indication of the precision of the esti- 
mate (it shows the extent to which the results are likely to differ from 
the ‘true result’ because of chance alone). 

The technique of meta-analysis has been controversial, and this is 
perhaps because it is potentially so powerful, but it is also particu- 
larly susceptible to abuse [42]. A number of key issues should be 
borne in mind when assessing or carrying out a meta-analysis: 


e A meta-analysis is a tool that can increase the sample size, and con- 
sequently the statistical power by pooling the results of individual 
trials (in psychiatry, many trials are small because of the difficul- 
ties in recruiting patients). Increasing the sample size also allows 
for more precision, that is, a pooled estimate with narrower confi- 
dence intervals [43]. Combining studies, however attractive, may 
not always be appropriate. The inappropriate pooling of disparate 
studies can make the final results meaningless. To avoid this, it is 
necessary to ensure that the individual studies are really looking 
at the same clinical or research question. Individual studies might 
vary with respect to study participants, intervention, duration of 
follow-up, and outcome measures. Such a decision will usually 
require a measure of judgement, and for this reason, a reviewer 
should always pre-specify the main criteria for including primary 
studies in the review protocol. Having decided that the primary 
studies are investigating a close enough question, an important 
role of the meta-analysis is to investigate variations between the 
results of individual studies (the so-called heterogeneity). When 
such variation exists, it is useful to estimate if more heterogen- 
eity exists than can be reasonably explained by the play of chance 
alone [44]. If so, attempts should be made to identify the reasons 
for such heterogeneity, and it may then be decided that it is not 
reasonable to combine the studies, or that it is, but that the overall 
pooled estimate needs to take the variation into account. 


Individual studies vary in their methodological quality. 
Randomization, allocation concealment, blinding, and the type of 
analysis (for instance, whether or not the study participants are 
considered in the groups to which they were randomly allocated, 
that is intention-to-treat analysis) can all affect the direction of 
the results. In general, studies with poor methodology tend to 


overestimate the effect of the intervention [45]. Rating scales and 
standardized tools have been developed to assess the quality (or 
risk of bias) of randomized trials. The quality scores can be incorp- 
orated into the meta-analysis, with a quality weight applied to the 
study-specific effect estimate. Although there are a large number 
of scales available, the current consensus is that their use is prob- 
lematic because of uncertain validity, as it has been shown that 
using different scales leads to substantial differences in the pooled 
estimate [46]. The optimal approach at present is to assess the risk 
of bias of individual studies, focusing on the aspects of trial de- 
sign that affect its internal validity. Nowadays, one of the most fre- 
quently used tools for assessing study quality is the Cochrane risk 
of bias tool. It is a domain-based evaluation, in which different 
domains are assessed individually, assigning a judgement of ‘low, 
‘high, or ‘unclear’ risk of bias: sequence generation, allocation 
concealment, blinding of participants and personnel, blinding of 
outcome assessment, incomplete outcome data, selective outcome 
reporting, and other issues. More recently, a new systematic and 
explicit approach to making judgements about the quality of evi- 
dence and the strength of recommendations has been developed 
by a group of researchers—the Grading of Recommendations, 
Assessment, Development, and Evaluations (GRADE) Working 
Group (http://www.gradeworkinggroup.org/). When using 
GRADE, researchers rate evidence not study by study, but across 
studies for specific clinical outcomes. So the GRADE approach 
can specifically assess methodological flaws within the compo- 
nent studies, the consistency of results across different studies, the 
generalizability of the results to the wider patient base, and ultim- 
ately how effective the treatments have been shown to be. 

e It is also possible to investigate and quantify the effect of poor- 
quality trials on the overall treatment outcome, by conducting 
pre-planned sensitivity analyses which exclude them from the 
pooled estimate. The main treatment effect of a trial gives an indi- 
cation of the average response for an average patient meeting the 
inclusion criteria. Individual patients in real-life clinical practice 
deviate from the average to greater or lesser degrees. To tailor the 
results of a trial to an individual patient, it is tempting to perform 
a sub-analysis of the trial participants with a specific characteristic 
or a set of characteristics. Estimates of the treatment in subgroups 
of patients are more susceptible to random error—and therefore 
imprecision—than the estimate of the average effect for all pa- 
tients’ overall effect, because subgroups reduce the sample size and 
statistical power of the results. Furthermore, unless randomiza- 
tion was initially stratified according to the important subgroups, 
protection from confounding afforded by randomization may not 
be valid anymore and any observed subgroup difference in treat- 
ment effect may be due to type I error (that is, the incorrect rejec- 
tion of a true null hypothesis, or a false positive). As a take-home 
message for clinicians, subgroup analyses should always be viewed 
cautiously. 


By synthesizing evidence from studies with a similar design which 
address the same research question within the frame of a system- 
atic review, standard meta-analyses can compare only two alterna- 
tive treatments at a time [47]. For most clinical conditions where 
many treatment regimens already exist, standard meta-analysis ap- 
proaches result into a plethora of pair-wise comparisons and do not 
inform on the comparative efficacy of all treatments simultaneously. 
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Moreover, if no trials exist which directly compare two interven- 
tions, it is not possible to estimate their relative efficacy, and thus, 
this specific information is missing from the overall picture. All this 
has led to the development of meta-analytical techniques that allow 
the incorporation of evidence from both direct and indirect com- 
parisons in a network of trials and different interventions to esti- 
mate summary treatment effects as comprehensively and precisely 
as possible [48]. This meta-analytical technique is called network 
meta-analysis, also known as multiple treatments meta-analysis or 
mixed-treatment comparison. The combination of direct and in- 
direct estimates into a single-effect size not only can provide infor- 
mation on missing comparisons, but also can increase the precision 
of treatment estimates of already existing direct comparisons, redu- 
cing confidence intervals and strengthening inferences concerning 
the relative efficacy of two treatments [49]. 

Another fruitful role of the network meta-analysis technique is to 
facilitate simultaneous inference regarding all treatments, in order 
to rank them according to any outcome of interest, for instance ef- 
ficacy and acceptability [50]. Using network meta-analyses within 
the frame of a more complex statistical procedure, it is possible to 
calculate the probability of each treatment to be the most effective 
(first-best) regimen, the second-best, the third-best, and so on, and 
thus to rank treatments according to this hierarchical order. This is 
a very easy-to-understand and straightforward way to present net- 
work meta-analysis results, most of all for clinicians who want to 
know which the best treatment to be prescribed to patients is, on 
average [51]. 

Recently, network meta-analyses have become more widely em- 
ployed and demanded, with the increased complexity of analyses 
that underpin clinical guidelines and health technology appraisals 
[52]. Expert statistical support, as well as subject expertise, is re- 
quired for carrying out and interpreting network meta-analysis 
results. Several applications of the methodology have depicted the 
benefits of a joint analysis, but network meta-analysis approaches 
are far from being an established practice in the medical literature. 
Concerns have been expressed about the validity of network meta- 
analysis methods, as they rely on assumptions that are difficult to 
test [51]. Although network meta-analysis techniques preserve ran- 
domization, indirect evidence is not randomized evidence, as treat- 
ments have originally been compared within, but not across, studies. 
Therefore, indirect evidence may suffer the biases of observational 
studies (that is, confounding or selection bias). In this respect, direct 
evidence remains more robust, and in situations where both direct 
and indirect comparisons are available in a review, any use of net- 
work meta-analyses should be to supplement, rather than to replace, 
the direct comparisons. Several techniques exist, which can account 
for, but not eliminate, the impact of effect modifiers across studies 
involving different interventions [53]. However, recent empirical 
evidence suggests that direct and indirect evidence are in agreement 
in the majority of cases and that network meta-analyses can address 
biases that cannot be addressed in a standard meta-analysis such as 
sponsorship bias and optimism bias [54]. 

Meta-analyses can give an indication of the average response for 
an average patient meeting the inclusion criteria, but individual pa- 
tients in real-life clinical practice deviate from the average to greater 
or lesser degrees. A potentially useful approach is to carry out meta- 
analyses of raw data from the included trials, the so-called ‘indi- 
vidual patient data meta-analysis’ [55]. In an individual patient data 
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analysis, it is possible to investigate important predictors (that is, 
baseline clinical and demographic characteristics) that might affect 
treatment response or prognosis, for example gender, age, ethnicity, 
family history, dose, severity of illness, age of onset, and number 
of previous episodes. The individual patient data methodology has 
been used in large-scale collaborative overviews, in which data from 
many randomized trials in a particular disease area were brought 
together [56]. Individual patient data meta-analyses usually require 
more time, resources, and expertise than other forms of review; 
however, the process brings with it a number of advantages [57]. The 
main difference between a standard meta-analysis and an individual 
patient data meta-analysis is that while the former is based on infor- 
mation that refers to each included study, the latter is based on infor- 
mation that refers to each subject included in the study. Individual 
patient data meta-analyses allow to describe the effects of competi- 
tive treatments over time [58]. In fact, information that refers to the 
outcome of each included subject is usually collected not only at 
endpoint, but also at various time intervals after random allocation. 


As new evidence emerges, health care workers are constantly re- 
quired to update their knowledge. In the scientific literature, a lot 
of information (sometimes too much and too often misleading) is 
continuously available and many websites or other sources keep up- 
dating, almost in real time, on the newest articles. In 1999, clinicians 
spoke of surviving an ‘information flood, as trends demonstrated 
that the number of new journals doubled every 10-15 years [59]. 
In 2014, there were about 28,100 active scholarly peer-reviewed 
English language journals, collectively publishing 2.5 million art- 
icles a year, of which 30% were biomedical [60]. Even restricting 
reading to high-impact journals in a single field of interest, the 
number of articles is in the thousands. For this reason, health care 
workers chose to read so-called ‘secondary journals, those which 
aim to highlight and summarize the recent evidence, methodo- 
logical advances, and possible clinical implications of research [61]. 
Things may change in the near future, but at the moment, there are 
three secondary journals in the field of mental health with large 
readerships: Current Opinion in Psychiatry, Harvard Review of 
Psychiatry, and Evidence-Based Mental Health, established in 1988, 
1993, and 1998, respectively. Even though there are differences be- 
tween them, each adopts a systematic, comprehensive strategy to 
identify the best and most relevant new evidence for mental health 
workers and incorporates a rigorous peer review process for their 
articles [61]. 
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A nomenclature can be defined as the devising or choosing of names 
for things. It sounds innocent enough, but in science, it is closely 
linked to the idea of classification or the creation of categories of 
like things. Such categories can also be the basis for a nomencla- 
ture. Since the words we use carry meanings, they influence how we 
think and communicate. If we need to make distinctions between 
objects, we need appropriate categories—not too many, not too few. 
Moreover, we need classifications which assist creative thought and 
do not retard it. 

For many years, there has been dissatisfaction with the nomen- 
clature widely employed to classify the medicines used in the treat- 
ment of psychiatric disorder. It has lurched between the obscure, 
the inconsistent, and the incomprehensible. To start with the ob- 
scure, a WHO symposium in Oslo in 1969 agreed a consensus that 
an international system of drug classification was needed, and the 
Drug Utilisation Research Group (DURG) was established. It cre- 
ated the WHO ATC (Anatomical Therapeutic Chemical) classifica- 
tion system, first published in 1976 and in use to this day, to present 
drug utilization data and so make national and international com- 
parisons of drug utilization, evaluate long-term trends in drug use, 
assess the impact of external events on drug use, and provide de- 
nominator data in investigations of drug safety. 

The ATC system sorts drugs initially into a target organ system 
(the nervous system, for psychotropics obviously) and then per- 
mits an ad hoc further subdivision broadly by therapeutic indi- 
cation. Fig. 10.1 illustrates the actual first-stage terminology for 
drugs targeting the nervous system: anaesthetics, analgesics, anti- 
epileptics, anti-parkinsonians, psycholeptics, psychoanaleptics, and 
‘other’ (which includes drugs used to treat ‘addictive disorders’). The 
very words ‘psycholeptic’ and ‘psychoanaleptic’ are strikingly un- 
familiar to most prescribers, even though they fit logically with the 
therapeutic principle of the classification’s structure. 

Thus, as further illustrated in Fig. 10.1, ‘psychoanaleptics’ include 
anti-dementia, antidepressant, psychostimulant, and psycholeptics 
and psychoanaleptics in combination. The term psychoanaleptic 
means ‘to exert a stimulating effect on the mind? It reflects an antique 























way of thinking about drug action and now conjures up a property 
that is too vague to be useful. The further grouping is, in part, a clas- 
sification by indication (anti-dementia, antidepressant) and, in part, 
by imprecise drug action (psychostimulation). 
































Naming drugs by primary indication has an obvious superficial at- 
traction. It means we can remember howto use them easily, the classes 
automatically give us alternatives, and we can communicate our in- 
tention to treat with patients and their families. So, while the terms 
psycholeptic and psychoanaleptic are unfamiliar, we use the next- 
level terms routinely. The psycholeptics carve into antipsychotics, 
anxiolytics, and hypnotics, with corresponding indications of psych- 
osis, anxiety, and insomnia. Moving to the psychoanaleptics, anti- 
depressants are effective in treating depression. This terminology 
currently covers most of what many non-specialist prescribers prob- 
ably know about psychotropic drugs. 

The immediate difficulty is that these primary indications are not 
unique to the drugs assigned to the relevant classes. In the case of 
antipsychotics, a number are effective in resistant depression, quite 
independently of any antipsychotic action. Moreover, lithium is clas- 
sified as an antipsychotic but has no primary antipsychotic action. In 
the case of anxiolytics and hypnotics, many are benzodiazepine de- 
rivatives with both anxiolytic and hypnotic actions. Indeed, the most 
important differences are pharmacokinetic, with shorter-acting 
drugs being the most suitable to hypnotic actions, but such differ- 
ences cannot be represented explicitly in the classification. 

In the case of antidepressants, many of them are commonly pre- 
scribed as monotherapy for anxiety disorders. While a drug like 
paroxetine has multiple licensed indications for anxiety, other drugs 
with essentially identical mechanisms of action (selective serotonin 
reuptake inhibitors—the so-called SSRIs) do not necessarily have 
evidence to support actions on anxiety. However, they are used by 
extrapolation to treat anxiety disorders. In addition, SSRIs may be 
useful to treat a variety of pain states [1]. Nevertheless, SSRIs are all 
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Fig. 10.1 First-stage terminology for drugs targeting the nervous system. 





Source: data from ATC/DDD Index 2017, N NERVOUS SYSTEM, © WHO Collaborating Centre for Drug Statistics Methodology, Oslo, Norway. Available at https:// 


www.whocc.no/atc_ddd_index/?code=N 


classified as antidepressants. This inexactitude has a negative effect 
on public and patient perception of our clinical practice. 

Thus, the strident argument has been made that increasing num- 
bers of prescription of SSRIs imply that doctors are overdiagnosing 
‘depression’ and overprescribing ‘antidepressants’ [2]. It has made 
for unnecessary controversy and has devalued the evidence that 
prescribing an SSRI can be both evidence-based and useful to pa- 
tients [3]. Together with this negative public discourse, the cur- 
rent nomenclature clouds direct personal communication with 
patients. It must be confusing to find that your anxiety disorder is 
being treated with an antidepressant. The use of an antipsychotic 
to augment the treatment for your depression (added to an SSRI or 
a similar medication) means having to cope with the thought that 
you have developed psychosis. Psychosis implies both severity of 
illness and stigma. Finally, anti-epileptic drugs (valproate, carba- 
mazepine, lamotrigine) are used to treat bipolar disorder, but the 
package inserts for common formulations of these drugs may only 
reflect their use in epilepsy. The present classification certainly has 
the potential to confuse patients. Sophisticated prescribers may be 
well able to tolerate these inconsistencies, but it fails their needs in 
even more important ways, as will be illustrated later. 

Finally, the emphasis on indication has served the purposes of 
marketing by the makers of drugs perhaps too well. It lends itself to 
the addition of soothing adjectives. Therefore, we have words like 
‘atypical’, ‘second generation, and ‘novel’ entered in the marketing 
pitch for new business. It has been almost entirely designed to 
persuade, Madison Avenue style. Who would want an old smart 
phone if the next-generation phone were on offer? But the analogy 
suggested by ‘second-generation’ antipsychotic drugs is entirely 
specious. Even specialists appear to have been taken in by this ter- 
minology. To take an important example, the CUtLASS trial com- 
pared ‘first-’ with ‘second-’generation drugs for psychosis. However 
many patients in the study received sulpiride as a ‘first-generation 
drug’ and amisulpride as a ‘second-generation drug’. These drugs are 
very similar molecules both chemically and pharmacologically, so 
to compare them is very unlikely to reveal an important difference. 


Nevertheless, despite also being underpowered, the CUtLASS study 
is widely cited as evidence that new drugs for psychosis have no ad- 
vantage over what went before [4, 5]. 

In summary, the ATC principle may be adequate for its given 
limited purposes in pharmacoepidemiology. It probably makes little 
difference how licensed drugs are actually grouped, since groups 
may be recombined to answer specific epidemiological questions. 
Indeed, the home website of the WHO Collaborating Centre for 
drug statistics methodology carries an important disclaimer: ‘the 
classification of a substance in the ATC/DDD system is not a rec- 
ommendation for use, nor does it imply any judgements about ef- 
ficacy or relative efficacy of drugs and groups of drugs’ (https:// 
www.whocc.no/atc_ddd_methodology/purpose_of_the_atc_ddd_ 
system/). However, as a classification that is used in everyday prac- 
tice, it is both confused and confusing. Moreover, it poses a signifi- 
cant barrier to understanding drug action, and hence to rational 
prescribing. This is a problem psychiatrists need to solve, and the 
incentive to do so resides in our current practice. We need to change, 
and this chapter will explain why. 





Classification and the therapeutic challenge 


When chemical classification does not help 


The guiding principle of the ATC recognizes ‘chemistry’ as a key 
classificatory level. This may sometimes mean pharmacology, 
but just as often, it is simply a classification on the basis of the or- 
ganic chemistry of drugs. In the case of anti-epileptics, for example, 
pharmacology is not considered at all—it is just chemical structures 
of drugs. This means that the mechanisms of action of anti-epileptics 
may not be commonly known to prescribers at all. As an example, 
carbamazepine is an antagonist at the calcium channel [6]. It may 
be highly relevant that genome-wide association studies implicate 
variation in the encoding and expression of calcium channels in 
the aetiology of bipolar disorder and schizophrenia [7]. To describe 
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carbamazepine as an anti-epileptic makes it seem surprising that it 
is an effective anti-manic drug; to describe it as a calcium channel 
antagonist at least provokes the thought that its actions might extend 
beyond the epilepsy indication. 

There are many other examples within the existing terminology 
of psycholeptics and psychoanaleptics that also refer to chemistry. 
This has the virtue of neutrality, but the vice for most of us of con- 
veying no useful information whatsoever. The case of antipsychotics 
is particularly instructive. See https://www.whocc.no/atc_ddd_ 
index/?code=N05A for an example of how antipsychotics may be 
sub-classified. Fig. 10.2 shows how antipsychotics are sub-classified 
entirely on the basis of chemistry. As trainees, some of us may have 
attempted to remember the names, if not the chemical structures 
(for example, phenothiazines, butyrophenones, thioxanthines). All 
were antagonists at the dopamine D2 receptor, and this monocular 
perspective on their mode of action was probably encouraged by the 
pointless nomenclature. This only changed with the commercial ef- 
fort to produce drugs with a low propensity for producing extra- 
pyramidal side effects (EPS) in the 1980s and 1990s. The so-called 
atypical drugs fall into three new categories (Fig. 10.2): NOSAE (in- 
dole derivatives), NO5AH (diazepines, oxazepines, thiazepines, and 





NOSA 
Antipsychotics 








oxepines—the ‘-pines’), and NOSAL (benzamides). However, what 
is important about indole derivatives and the “-pines’ is the add- 
ition of affinity for neurotransmitter receptors other than dopamine 
into their pharmacology. This pharmacology is complex [8]. In the 
case of the ‘-pines’ from NO5AH, the affinity for 5-HT2A recep- 
tors is much higher than for D2 receptors. It is widely believed that 
this is key to the reduced risk of EPS; moreover, reduced function 
at 5-HT2A receptors may be intrinsically ‘antipsychotic. Indeed, 
pimavanserin is a completely novel antipsychotic (developed for the 
treatment of psychosis in Parkinson's disease), with no dopamine re- 
ceptor antagonism at all; its mechanism appears to be via an inverse 
agonist action at the 5-HT2A receptor. So in treating and combining 
drugs for psychosis, one needs to know that a drug like clozapine 
will have a small impact on D2 blockade and a large impact on 5- 
HT2A receptor function, while a drug like amisulpride will block 
D2/D3 receptors quite selectively (and effectively). If a patient fails 
to respond fully to clozapine, the addition of amisulpride as aug- 
mentation would be more logical than to add olanzapine (which 
simply mimics a range of the actions of clozapine) [9]. 

Finally, side effects are an important limitation of most anti- 
psychotic medications. There is a high affinity for histamine 





NOSAA Phenothiazines with aliphatic side-chain 
Chlorpromazine, promazine, trifluopromazine 








NOSAB Phenothiazines with piperazine structure 
Fluphenazine, perphenazine, prochlorperazine, trifluoperazine 








NOSAC Phenothiazines with piperidine structure 
Periciazine, thioridazine 





NOSAD Butyrophenone derivatives 
Haloperidol, trifluoperidol, bromperidol, droperidol 





NOSAE Indole derivatives 
Oxypertine, sertindole, ziprasidone, lurasidone 








NOSAF Thioxanthene derivatives 
Flupentixol, clopentixol, zuclopentixol 











NOSAG Diphenylbutylpiperidine derivatives 


Fluspirilene, pimozide 








NOSAH Diazepines, oxazepines, thiazepines, and oxepines 
Loxapine, clozapine, olanzapine, quetiapine, asenapine 











NOSAL Benzamides 
Sulpiride, sultopride, tiapride, remoxipride, amisulpride 








NOSAN Lithium 














Fig. 10.2 Current antipsychotic nomenclature under the WHO system. 


Source: data from ATC/DDD Index 2017, NOSA ANTIPSYCHOTICS, © WHO Collaborating Centre for Drug Statistics Methodology, Oslo, Norway. Available at https:// 
www.whocc.no/atc_ddd_index/?code=NO5A 
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receptors, which may result in weight gain and metabolic disturb- 
ance with clozapine, olanzapine, and quetiapine (but not asenapine). 
For other drugs, there may be a higher affinity for muscarinic recep- 
tors, which might be expected to produce a dry mouth, accommo- 
dation difficulties, etc. In general, to classify by indication fails to 
draw the prescriber’s attention to the properties of drugs which will 
be important for treating individual patients. 


Why is pharmacology so important to psychiatry? 





Obviously, most doctors take some pride in knowing how drugs 
work and how to use them. But pharmacological investigation 
of the mode of action has had a special place in psychiatry, be- 
cause effective medicines were discovered before their pharma- 
cology was understood. Investigation of the mode of action was, 
and arguably remains, a cutting-edge theme in neuroscience. 
Axelrod, Carlsson, von Euler, and Greengard won Nobel prizes 
for working out how the monoamine systems and the drugs acting 
on them function in the brain. Completely new fields of cogni- 
tive and computational neuroscience have opened up, because we 
know that dopamine neurons are involved in motor function and 
decision-making. This scientific development was recognized by 
the award of the Brain Prize in 2017 for multi-disciplinary ana- 
lysis of brain mechanisms that link learning to reward (http:// 
www.thebrainprize.org). 





NO6A 
Antidepressants 








Psychopharmacology also provided, and still provides, one of the 
few starting points for mechanistic understanding of psychiatric dis- 
order. By contrast, diagnosis (the indication) remains quite arbitrary 
and essentially provisional, based as it is on symptoms, rather than 
mechanisms of disease (see Chapters 7 and 8). So pharmacology 
provides a very appropriate space in which to name, describe, sum- 
marize, and classify psychotropic medicines. Obviously, some sub- 
classes of drugs are already described in this way. We have selective 
monoamine oxidase inhibitors (MAOIs) and SSRIs. The WHO clas- 
sification of antidepressants (that is, those drugs with evidence for 
efficacy as monotherapy in unipolar depression) is another case in 
point (see https://www.whocc.no/atc_ddd_index/?code=N06A). 
The classification of antidepressants (that is, those drugs with evi- 
dence for efficacy as monotherapy in unipolar depression) is shown 
in Fig. 10.3. Monoamine reuptake inhibitors (non-selective and 
selective for serotonin) and MAOIs (non-selective and type A- 
specific) comprise the first four reasonable categories. However, 
the fifth group NO6AX is ‘other’ It includes currently 26 chemical 
entities and seven mechanisms of action (Fig. 10.3). It automatically 
includes all new drugs with novel mechanisms of action. 

Unfortunately, we have even lacked a systematic approach to 
describing the pharmacological properties of those medicines we do 
classify by mode of action. The use of the term SSRI was promoted by 
companies marketing novel antidepressants in the ‘Prozac era. It is a 
fair summary of the key property of such drugs. However, the term 
SNRI does not stand for selective noradrenaline reuptake inhibitors, 





NO6GAA Non-selective monoamine reuptake inhibitors 

TCAs: imipramine, amitriptyline, clomipramine, dosulepin, doxepin, 
lofepramine, trimipramine, amoxapine, protriptyline 

NRIs: desipramine, nortriptyline, maprotiline 








escitalopram 


NOGAB Selective serotonin reuptake inhibitors 
Zimeldine, fluvoxamine, fluoxetine, paroxetine, sertraline, citalopram, 








NOGAF Monoamine oxidase inhibitors, non-selective 
Phenelzine, isocarboxazid, tranylcypromine 








NO6AG Monoamine oxidase A inhibitors 
Moclobemide, toloxatone 








NRIs: reboxetine 


NaSSAs: mirtazapine 





NO6AX Other antidepressants 


SNRIs: venlafaxine, milnacipran, duloxetine 
NDRIs: nomifensine, bupropion 


SARIs: trazodone, nefazodone 
MT receptor antagonist/5-HT2c antagonist: agomelatine 
5-HT1A partial agonist: gepirone 








Fig. 10.3 Current antidepressant nomenclature under the WHO system. 


Source: data from ATC/DDD Index 2017, NO6A ANTIDEPRESSANTS, © WHO Collaborating Centre for Drug Statistics Methodology, Oslo, Norway. Available at https:// 


www.whocc.no/atc_ddd_index/?code=NO6A 
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as you might expect. Instead, ‘SNRI’ stands for serotonin/noradren- 
aline reuptake inhibitor (Fig. 10.2). Selective noradrenergic re- 
uptake inhibitors are instead abbreviated NRIs. More consistently, 
noradrenaline/dopamine reuptake inhibitors are NDRIs. In the case 
of serotonin antagonist and reuptake inhibitors, the preferred ab- 
breviation is SARIs. Individual drugs like mirtazapine, agomelatine, 
and gepirone are single examples of particularly quite complex 
mechanisms of action. 


Conclusions 


So in summary, we have antipsychotics and anti-epileptics used 
to treat patients with bipolar disorder who are neither psychotic 
nor epileptic. We have SSRIs and SNRIs, although SNRIs are not 
selective noradrenaline reuptake inhibitors, but instead serotonin/ 
noradrenaline reuptake inhibitors, and finally we have a growing 
majority of antidepressants classed simply as ‘other’. It is undeni- 
ably a mess. Does it matter? It has been tolerated for a long time, 
without serious efforts to change. Our desire to make changes now 
should be fuelled by embarrassment. However, there are more ser- 
ious reasons to change too. With Lewis Carroll, we often do not 
know what we think until we speak. Defining our treatments by 
diagnoses of limited validity limits our capacity to speak. Our 
failure to use pharmacological criteria and define drug actions 
limts our capacity to think. An emphasis on mechanism should 
stimulate clinicians to think what they are doing when they pre- 
scribe for patients. It will be a continuing nudge to stay on top of 
the science that underpins our understanding. Moreover, a more 
nuanced system for describing medicines can grow flexibly—it 
can accommodate a mechanistic understanding at any number of 
levels, from molecules, to systems, to cognition. There is also the 
potential to harness the shift in emphasis from operational diag- 
nosis to dimensional neurobiology, as currently enshrined in the 
RDoC project (see Chapter 8). 


Neuroscience-based nomenclature 





The NbN initiative represents the efforts of colleagues from the five 
major international neuropsychopharmacological scientific organ- 
izations to reform the nomenclature of drugs used by psychiatrists. 
The taskforce was formed in 2008 (Box 10.1). 

All expenses related to the project were covered by the European 
College of Neuropsychopharmacology (ECNP). Throughout the 
entire process, there was no direct or indirect support from any 
pharmacological company or other organization. 

It was agreed that the guiding criteria were to: 


Be based on contemporary knowledge. 

Assist clinicians to make informed choices while prescribing. 
Provide a system that does not conflict with the use of medications. 
Be future-proof and accommodate new types of compounds. 


fe Ns 


Decrease stigma attached to a classification by indication (and 
enhance adherence). 


The methodology was expert consensus informed by prescriber 
feedback [10, 11]. 


Box 10.1 Organizations contributing to the NbN initiative 





ECNP—European College of Neuropsychopharmacology 
ACNP-—American College of Neuropsychopharmacology 
AsCNP-—Asian College of Neuropsychopharmacology 
CINP—International College of Neuropsychopharmacology 
IUPHAR—International Union of Basic and Clinical Pharmacology 








The future of naming by indication 


If our existing terminology is misleading and constrains creative 
thought, we should try to give up this approach completely. This rec- 
ommendation of the taskforce has been broadly accepted by the edi- 
tors of many major cognate journals (Box 10.2) 

This means trying to use less often the words antipsychotics, anti- 
depressants, anxiolytics, and hypnotics when what we could say is 
simply ‘drugs for psychosis, ‘drugs for depression, ‘drugs for anxiety, 
and ‘drugs for insomnia. Simply changing the stress to the indication, 
rather than the class of drug, is often an easy adjustment to make. 

Where an author wants to use the words antipsychotics, anti- 
depressants, anxiolytics, and hypnotics, then s/he should define 
exactly what is meant. If one is actually wanting to say dopamine ant- 
agonists/partial agonists, then that is preferable to ‘antipsychotics’ In 
the case of ‘antidepressants’ and ‘anxiolytics, it is much more difficult 
to substitute a single word based on the mode of action. However, 
defining in advance exactly what drugs are subsumed within 
one’s meaning of ‘antidepressant’ is a good discipline. The British 
Association for Psychopharmacology guidelines for the treatment 
of bipolar disorder were written from this perspective and proved 
acceptable to the consensus group involved [12]. 


Box 10.2 Journals that have accepted the recommendations 


of the NbN initiative 





American Journal of Psychiatry 

The Lancet Group 

Biological Psychiatry 

Neuropsychopharmacology 

Psychological Medicine 

International Journal of Neuropsychopharmacology 

European Neuropsychopharmacology 

World Journal of Biological Psychiatry 

European Psychiatry 

Journal of Psychopharmacology 

CNS Spectrums 

European Archives of Psychiatry and Clinical Neuroscience 

Current Psychiatry 

Japanese Journal of Neuropsychopharmacology (official journal of 
Japanese Society of Neuropsychopharmacology) 

Clinical Psychopharmacology and Neuroscience—Korean College of 
Neuropsychopharmacology (official journal of ASCNP) 

Chinese Journal of Psychiatry 

British Journal of Clinical Pharmacology 

Journal of Clinical Psychopharmacology 

Pharmacology International (IUPHAR journal) 

Pharmacopsychiatry 

Pharmakopsychiatrie (in German) 
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The NbN nomenclature 


The NbN nomenclature classifies drug action by domain (the neuro- 
transmitter target) and mechanism of drug action. As shown in 
Table 10.1, domains range from dopamine or orexin (with quite a re- 
stricted anatomy based on small groups of neurons and their projec- 
tions) to ‘ion channels’ or GABA (gamma aminobutyric acid) which 
implies a much more diffuse whole brain anatomy. We currently 
identify ten domains, but they are very likely to increase over time 
with the discovery of new drug targets. Some drugs (the SSRIs are 
an example) have been developed to be highly specific, so we already 
think of them as working in one domain exclusively. Many other 
drugs may act on more than one domain. Thus, drugs for psychosis 
will often be dopamine/serotonin drugs, as described previously. 
However, the reality is even more complicated, and we believe a di- 
mensional approach is a better way to capture this adequately. 

Along with a domain, drugs have different modes of action or 
functions. Table 10.1 shows the current proposals. Characterization 
as agonist, antagonist, partial agonist, or partial allosteric modu- 
lator defines drug action at the level of the neurotransmitter re- 
ceptor. Reuptake inhibitor or releaser addresses the functional 
consequences of drug action. Enzyme inhibitor or modulator, or ion 
channel blocker, identifies functional targets potentially outside the 
conceptual synapse. These categories are based on familiarity, rather 
than a strict understanding of molecular mechanism. It is an open 
question of whether they will stand the test of time; however, the 
NbN is equipped to accommodate new insights. 

Most of the current pharmacopoeia can be mapped into this no- 
menclature. For drugs working in a single domain with a single ac- 
tion, like the SRIs, there is little change. Drugs for psychosis pose 
a different challenge. The ‘-pines’ illustrated previously can be 
described as dopamine/serotonin receptor inhibitors. Other ex- 
amples (like aripiprazole and brexpiprazole) produce functional 
inhibition via partial agonist properties at dopamine receptors and 
have weaker antagonist properties at 5-HT2A receptors. The NbN 
approach pushes the prescriber to employ more words to describe 
their treatments and hopefully think more accurately about what 
they do. However, this increased complexity is potentially prob- 
lematic. It exemplifies that medicine is entering a new era, in which 


Table 10.1 The NbN nomenclature classifying drug action 
by domain (the neurotransmitter target) and mechanism 
of drug action 


Pharmacological domains Modes/mechanisms of actions 





1 Acetylcholine 1 Receptor agonist 
2 Dopamine 2 Receptor partial agonist 
3. GABA 3 Receptor antagonist 
4 Glutamate 4 Reuptake inhibitor 
5 Histamine 5 Releaser 
6  Orexin 6 Enzyme inhibitor 
7 Melatonin 7 on channel blocker 
8 Noradrenaline 8 Positive allosteric modulator (PAM) 
9 Opioid 9 — Enzyme modulator 
10 Serotonin 


it is being transformed by the explosion of knowledge and access 
to knowledge. We have deliberately invested in developing an app 
which will provide the prescribing information we think doctors 
and patients need. The founding version of the NbN was launched 
at the ECNP congress in Berlin in 2014 as a booklet and app. The 
app format allows medicines to be searched under the old categories 
and new ones. It will allow translation into many languages. It will 
soon be linked to another specialist database [International Union 
of Basic and Clinical Pharmacology ([UPHAR)]. It can expand or 
extend through linking, almost without limit. This would be part of 
a broader movement to make more informed choices and to practise 
a more precise pharmacological strategy. 


Use of the NbN app 


The NbN app can be searched and downloaded for use on Android 
(https://play.google.com/store/apps/details?id=il.co.inmanage. 
nbnomenclature&hl=en_GB)andiOSplatforms(https://itunes.apple. 
com/gb/app/nbn-neuroscience-based-nomenclature/id927272449? 
mt=8). Use of the app is explained by a brief tutorial, but it is best ap- 
preciated by downloading it and using it. However, Fig. 10.4 shows 
the opening screen/search page (left) and the result from typing 
‘olanzapine’ (right). Further information (practical notes and neuro- 
biology) can be obtained by scrolling down. The domains are dopa- 
mine and serotonin, and the mode of action is receptor antagonism. 

The app also shows a brief summary of the approved indications, 
the spectrum of efficacy, and the adverse reactions commonly as- 
sociated with olanzapine. The user can scroll down and see prac- 
tical notes (which describe the availability of depot and the use in 
combination with fluoxetine for bipolar depression), former termin- 
ology (antpsychotic), and a link to a summary of the neurobiology. 
There are currently 130 drugs in the second edition of NbN (NbN-2) 
and 40 drugs in the child and adolescent version of NbN (NbN-ca). 


The NbN as a multi-dimensional 





classificatory system 


The searchable fields of the NbN app can also be thought of as a 
pragmatic multi-axial classificatory system. It recognizes DSM-5 as 
a system for providing (mostly) familiar diagnoses, which we will 
continue to use to communicate with ourselves and our patients 
and estimate societal morbidity. However, the multi-dimensional 
system of NbN has some parallels with the approach of the RDoC 
initiative (see Chapter 8). Both highlight heterogeneity, challenge 
disease-based existing systems, detect subtypes, and provide a better 
match between research findings and clinical decision-making 
(Table 10.2). 

If the RDoC can create a reliable and valid alternative system for 
classifying disorders, the NbN may be able to align how it repre- 
sents drug action. For example, the RDoC proposes the constructs, 
negative valence, and positive valence, within which different 
levels of analysis are possible. These RDoC domains have some- 
thing in common with NbN domains (serotonin and dopamine, 
for example), and the different levels of analysis in the RDoC (for 
example, fear, anxiety, loss, frustrated non-reward) provide tar- 
gets for defining different drug actions. The key property of the 
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= NbN-2 < NbN-2 © 
Q Type Your Search Dopamine, serotonin 
NbN 
Receptor antagonist (d2, 
OR CHOOSE A SEARCH AXIS 5-ht2) 

Name (Generic & Brand) Schizophrenia; acute treatment of 
manic or mixed episodes associated 
with bipolar i disorder; maintenance 
treatment of bipolar i disorder; 

Pharmacology olanzapine and fluoxetine in 
combination in depressive episodes 
associated with bipolar i disorders (usa 
only) 

Mode of Action 
© Improvement of psychotic symptoms; 
© Approved indications mania; depression 
> . Weight gain, sedation, eps, 
G2) Efficacy & Side Effects galactorrhea (low), dizziness, risk of 
tardive dyskinesia, nms. risk of 
© diabetes; monitoring recommended as 
a class warning. class warning for 

Former terminology increased mortality in elderly dementia 

patients 











Fig. 10.4 The NbN app. 
Reproduced from the NbN Working Group. 


Table 10.2 Similarities between the RDoC and NbN classifications 


D 


Neuroscience-driven v v 
Highlighting heterogeneity y v 
Using updated medicine tools y v 
Challenging artificial grouping of heterogenous (a) syndromes with different a b 
pathophysiological mechanisms and (b) drugs with different pharmacology and/or 

mode of action 

Challenging disease-based existing systems (‘horizontal’) v v 
Detecting subtypes v y 
Aiming to address individual differences v v 
Paving the way to precise medicine y v 
Building a long-term framework that can accommodate future discoveries in y v 
neuroscience 

Providing a better match between research findings and clinical decision-making v v 
Focusing on diagnosis v z 

Focusig on drugs = v 





CHAPTER 10 Aneuroscience-based nomenclature for psychotropic drugs | §:/ 


NDN is that it is not bound by the limits of a two-dimensional 
table. Its creation as a digital device builds in a flexibility we have 
never enjoyed before. Moreover, being on a mobile platform 
makes it a widely accessible tool for use across the developed and 
developing world. 

As it has developed, the point of the NbN has grown more ob- 
vious to more and more people. The major journals have accepted 
our critique of existing practice and have published editorials re- 
commending change [13-24]. However, it is a work in progress. The 
hope is that the NbN encourages us to use different words, but cru- 
cially more words and concepts when describing our treatments. We 
may thereby be better informed when we take our initial ‘pharmaco- 
logical step’ or later when we combine treatments after a first mono- 
therapy fails. We can also improve how prescribers communicate 
among themselves and with their patients. 
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The central nervous system originates from the midline region of the 
embryo as a specialized area of the ectoderm—the neuroectoderm or 
neural plate (Fig. 11.1a). Fibroblast growth factor (FGF) signalling 
and bone morphogenetic protein (BMP), as well as Wnt (wingless 
gene) inhibition, are required as steps for neural induction [1]. As 
the neuroectodermal cells proliferate, the neural plate is transformed 
into an indentation—the neural groove (Fig. 11.1b). The lateral parts 
of this groove approach each other and join in the midline, forming 
the neural tube (Fig. 11.1c-f). This fusion of the groove to a tube by 
a folding process starts in the middle portion of the groove and pro- 
ceeds in rostral and caudal directions like a zipper. The most rostral 
and caudal parts close only later, initially leaving rostral and caudal 
neuropores. A small transitional zone between the neural plate 
and the surrounding ectoderm provides the cells of the neural crest 
(Fig. 11.1a), which develop into post-ganglionic cells of the sym- 
pathetic and parasympathetic nervous system, sensory neurons of 
the spinal ganglia and ganglia of cranial nerves, Schwann cells, and 
chromaffin cells of the suprarenal glands (Fig. 11.1f). 

Neural tube formation requires a controlled expression of cell 
adhesion molecules in the lateral folds of the neural groove. If the 
rostral neuropore fails to close, the development of the forebrain is 
impaired, leading to anencephaly. If the caudal neuropore fails to 
close, the most severe result is rachischisis, a malformation with a 
dorsally exposed neural groove. The mildest outcome is spina bifida 
occulta, which is a cleft of a vertebral arch covered by the epidermis. 

As development continues, the neural tube and crest are found 
between the ectoderm and the notochord. The rostral part of the 
neural tube differentiates into the brain; the caudal part (behind the 
fifth somite) differentiates into the spinal cord. 























The embryonic brain has three vesicular enlargements: the fore- 
brain (telencephalon plus diencephalon), the midbrain (mesen- 
cephalon), and the hindbrain (rhombencephalon, including the 
cerebellum). At this early stage of brain development, a principal 
organization of the wall of the neural tube and its derivatives is still 
recognizable. There is a concentric organization around the central 





canal, with the ventricular zone most interior, the intermediate zone 
surrounding the ventricular zone, and an outer cell-sparse mar- 
ginal zone (Fig. 11.2e). The ventricular zone contains neuroblasts. 
Sections orthogonal to the longitudinal axis of the neural tube 
allow the stratification of its wall into four zones in a dorsal to ven- 
tral direction: roof plate, alar lamina, basal lamina, and floor plate 
(Fig. 11.2e). The alar and basal laminae are separated by a shallow 
sulcus in the ventricular surface, that is the sulcus limitans (double 
arrow in Fig. 11.2e). This sulcus demarcates a principal functional 
subdivision, since the alar lamina is the source for sensory neurons 
in the spinal cord and brainstem, whereas motor neurons originate 
in the basal lamina. Local expression of homeobox genes leads to 
the formation of the pallium and the ganglionic hill in the fore- 
brain. Because the brain grows much faster than the rest of the 
embryo, it becomes deflected ventrally. A dorsally convex cephalic 
flexure marks the border between the hindbrain and the midbrain, 
and a cervical flexure marks the border between the spinal cord 
and the hindbrain (Fig. 11.2a—c). The ventrally convex pontine 
flexure is found at the central level of the hindbrain (Fig. 11.2b). 
During week 5, the forebrain differentiates further into the ros- 
tral telencephalon and the more caudal diencephalon. ‘The telen- 
cephalon consists of two hemispheric vesicles connected by a thin 
lamina terminalis. A part of the lamina terminalis develops into 
the commissural plate—the anlage of the later corpus callosum 
(Fig. 11.2d). The ventral part of the lamina terminalis differenti- 
ates into the anterior commissure. The hemispheric vesicles show 
a spatially directed growth around the future insular lobe, which 
leads to a backward-directed elongation towards the occipital lobe 
and a backward- and ventrally directed growth, forming the tem- 
poral lobe (Fig. 11.2f). Further enlargement of the frontal, parietal, 
and temporal lobes, together with a slower growth rate of the in- 
sular lobe, leads to a gradual covering of the latter brain region, a 
process called opercularization (Fig. 11.2g, h). This process is asso- 
ciated with the appearance of the lateral fissure (Fig. 11.2h). 
Extensive growth of the parieto-occipito-temporal association 
cortex leads to a bend in the temporal lobe around the lateral fissure, 
and the temporal pole is pushed rostrally. This direction of growth 
(Fig. 11.2) also affects the structures situated dorsomedially, that is, 
the archicortex with the hippocampus, corpus striatum, and lateral 
ventricles. The corpus striatum is split by the ingrowing fibres of the 
internal capsule into the caudate nucleus and putamen. The head of 
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Fig. 11.1 Early developmental stages of the nervous system in schematic drawings of cross-sections through embryos. 


the caudate is situated ventrolateral to the corpus callosum in the 
frontal lobe, and the tail of the caudate is located dorsal to the inferior 
horn of the lateral ventricle. The hippocampus forms its largest exten- 
sion (the retrocommissural part) in the temporal lobe, bends around 
the posterior end (splenium) of the corpus callosum, and reaches a 
position on top of the corpus callosum (the supracommissural part). 
The precommissural part of the hippocampus ends in front of the 
genu of the corpus callosum. Finally, the first primary sulci appear, 
for example, the central sulcus (Fig. 11.2h). 

The central cavity of the diencephalon (the third ventricle) is 
connected with the cavities of the hemispheric vesicles (the lateral 
ventricles) by the interventricular foramen (Fig. 11.2d). The di- 
encephalon forms bilateral evaginations—the eye vesicles—which 
differentiate into the retina and the optic nerve. 

The tectum with the superior and inferior colliculi evolves from 
midbrain enlargement. Gradients in the expression of Gbx2 and 
Otx2 determine the midbrain-hindbrain boundary [2]. This region 
is enriched in Engrailed-1, Engrailed-2, Pax-2, Pax-5, and mbx genes, 
which are important for the development of the tectum [3, 4]. 


Meanwhile, the hindbrain becomes subdivided into a rostral 
part (that is, the metencephalon) and a caudal part (that is, the 
myelencephalon). The cerebellum starts to develop from the ros- 
tral part of the metencephalon during week 6. At first, the enlarged 
central cavity of the neural tube (the future fourth ventricle) has a 
thin roof plate, bordered by two thickenings of the neural tube— 
the rhombic lips—which merge in the midline (Fig. 11.2d, f). These 
thickenings are the source of the later cerebellar hemispheres, while 
its midline develops into the cerebellar vermis. 

The hindbrain is temporarily divided into eight rhombomeres [5], 
the borders of which are specified by combinations of transcription 
factors (for example, Hox, Krox, Wnt genes). It develops in close as- 
sociation with the visceral arches, which appear during week 4. It 
innervates these arches and the organs derived from them by a 
group of branchial nerves, which later become the trigeminal (V), 
facial (VII), glossopharyngeal (IX), vagal (X), and accessory (XI) 
cranial nerves. Other cranial nerves develop connections between 
the hindbrain and peripheral organs not derived from the visceral 
arches. They are the oculomotor (III), trochlear (IV), abducens (VJ), 
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Fig. 11.2 Development of the human brain. Brains of (a) 4-mm, (b) 10.4-mm (c), 13.8-mm, and (d) 53-mm human embryos, and (e) 21-week and 

(f) 24-week fetuses. A, alar plate; b, basal plate; c, cerebellum or cerebellar anlage; cc, corpus callosum; CEF, cervical flexure; CF, cephalic flexure; ComP, 
commissural plate; cs, central sulcus; D, diencephalon; ES, eye stalk; EV, eye vesicle; F, floor plate; FL, frontal lobe; H, hindbrain; HV, hemispheric vesicle; 
IL, insular lobe; iz, intermediate zone; If, lateral fissure; LT, lamina terminalis; M, mesencephalon; MO, medulla oblongata; mz, mantle zone; N.II, optic 
nerve; OL, occipital lobe; OLB, olfactory bulb; P, pons; PF, pontine flexure; PL, parietal lobe; R, roof plate; RL, rhombic lip; sli, sulcus limitans; shy, sulcus 
hypothalamicus; T, telencephalon; TE, tectum; TL, temporal lobe; v, ventricle; vz, ventricular zone. 


vestibulocochlear (VHI), and hypoglossal (XII) nerves. The olfac- 
tory (I) and optic (II) nerves arise separately as evaginations of the 
forebrain. 


Histogenesis of the spinal cord 


The neural tube initially consists of a single layer of neuroepithe- 
lial cells surrounding a central canal filled with cerebrospinal fluid 


(Fig. 11.1c-f). The outer surface of the future spinal cord has an ex- 
ternal limiting membrane, and the inner surface bordering the cen- 
tral canal has an inner limiting membrane. The entire wall of the 
early neural tube is called the ventricular zone [6]. 

The cells of the ventricular zone proliferate, and the surface of 
the spinal cord enlarges. The cord then thickens, as cells divide 
further to produce a multi-layered epithelium. The daughter cells 
have different potentialities; one type of cell (the neuroblast) retains 
the capability for mitosis, whereas another type (the proneuron) is 
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post-mitotic and represents an immature neuron. Proliferation of 
neurons is almost complete around birth. 

Some neuroepithelial cells develop into precursors of glial cells, 
that is, glioblasts, which differentiate into astroglial, oligodendro- 
glial, and microglial cells. The first glioblasts differentiate into radi- 
ally extended cells, spanning the entire width of the wall of the spinal 
cord (the same occurs in the cerebral hemispheres and the cerebellar 
cortex, as described here). During later development, these radially 
extended cells are transformed into ependymal cells and astroglia. 

Histogenesis of the spinal cord starts at the cervical level and pro- 
gresses in a caudal direction. After week 3, the longitudinal sulcus 
limitans is recognizable on the inner surface of the neural tube 
(Fig. 11.2e). The alar lamina differentiates into a sensory zone—the 
dorsal horn of the adult spinal cord—and the basal lamina differ- 
entiates into a motor zone—the ventral horn. The sympathetic pre- 
ganglionic neurons form in the lateral horn, which is present only 
at thoracic levels. The subdivision into alar and basal laminae is 
functionally important not only in the spinal cord, but also in the 
brainstem. 

Proneurons leave the ventricular zone and migrate along radial 
glial cells into the intermediate zone where they become organ- 
ized into cell groups. The motor neurons develop the axons of the 
ventral root, and the processes from spinal ganglionic cells grow 
into the spinal cord to form the dorsal roots. Synapses develop 
first in the motor zone, and later in the sensory zone, during weeks 
10 to 13. 

During the third month, the ventricular zone is reduced to a 
small rim surrounding the central canal and is finally transformed 
into the ependymal cell layer. The intermediate zone becomes or- 
ganized into dorsal, ventral, and (at thoracic levels) lateral horns. 
The ascending and descending fibre tracts of the spinal cord are 
increased in size in the marginal zone. During weeks 14 and 
15, oligodendrocytes begin to myelinate these fibre tracts. The 
corticospinal, or pyramidal, tract becomes visible for the first 
time during week 14 and reaches its target neurons, mainly motor 
neurons of the ventral horn, between weeks 17 and 29. Myelination 
of the pyramidal tract is completed between the the first and second 
post-natal years. This late myelination explains the presence of the 
Babinski reflex in newborns and its disappearance during the first 
2 years of life. 


Histogenesis of the brainstem and cerebellum 





At the level of the fourth ventricle, the various zones of the hind- 
brain are arranged in a lateral-to-medial sequence (somatosensory— 
viscerosensory—visceromotor—somatomotor). In the hindbrain, 
proneurons not only migrate radially, as in the spinal cord, but also 
tangentially and longitudinally. This complex migration course and 
the growth of fibre tracts lead to further changes of the lateral-to- 
medial and dorsal-to-ventral ‘displacements’ of cranial nerve nuclei 
(for example, the facial nucleus) in the adult. 

At the level of the cerebellar anlage, the undifferentiated matrix 
cells form two major zones—a larger ventricular matrix zone and 
a smaller, most dorsally positioned upper rhombic lip zone. From 
the upper rhombic lip zone, a first wave of progenitor cells migrate 
between weeks 10 and 11, which sequentially express Pax6, TBR2, 
and TBR1, and form a transitory peripheral zone before migrating 


down to a position where the adult cerebellar nuclei with their 
glutamatergic neurons are found (Fig. 11.3a, b). A second wave from 
the upper rhombic lip zone leads to the formation of an external 
granular layer (dotted arrow in Fig. 11.3a). Cells in this layer remain 
mitotically active long after birth. Thus, the external granular layer 
is a major source of cells for the adult cerebellum. From this layer, 
cells migrate to form the internal granular layer of the adult cere- 
bellum, with its small and abundant granular cells and the somewhat 
larger and less abundant Golgi cells. Proneurons from the external 
granular layer also give rise to the basket and star cells of the adult 
molecular layer (Fig. 11.3a, b). 

A second source of progenitor cells is located around the ventricle, 
that is, the ventricular matrix zone. The progenitor cells begin to 
migrate during weeks 12 and 13 along radially extended glial cells— 
Bergmann glia—into a region below the external granular layer 
(Fig. 11.3a) where they mature into GABA (gamma aminobutyric 
acid)-ergic Purkinje cells of the ganglionic layer of the adult cere- 
bellar cortex (Fig. 11.3b). Therefore, the cerebellum originates out of 
two major migration streams, one starting in the ventricular matrix 
zone and the upper rhombic lip zone and the other one originating 
in the external granular layer. Migration of cerebellar proneurons 
is not completed until the first postnatal year. During weeks 16 and 
26, synapses develop and afferent fibre systems begin to form. The 
external granular layer finally disappears during the first 2 years of 
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Fig. 11.3 (see Colour Plate section) Development of the cerebellar 
cortex and nuclei. (a) Fetal development. (b) Adult stage. B, Bergmann 
glia; BA, basket cell; G, Golgi cell; GR, granular cell; P, Purkinje cell; SGE, 
stratum granulosum externum; ST, stellate cell; URL, upper rhombic lip. 





life, leaving the three-layered structure (molecular, ganglionic, and 
internal granular layers) of the adult cerebellar cortex. 


Histogenesis of the cerebral cortex, basal ganglia, 





amygdala, and basal forebrain 


Initially, the entire wall of the hemispheric vesicle consists of very 
densely packed mitotic cells. These cells undergo more than 28 mi- 
totic rounds in the human brain [7]. In week 5, an inner cell-dense 
periventricular zone and an outer cell-poor marginal zone can be rec- 
ognized. During week 6, the first post-mitotic proneurons leave the 
inner periventricular zone and form an intermediate (mantle) zone 
between the marginal and periventricular zones. By the end of week 
6, the periventricular zone is further subdivided into a cell-dense 
ventricular zone and a less cell-dense subventricular zone. 

During week 8, the cortical plate between the marginal and inter- 
mediate zones is formed by proneurons which have migrated along 
radial glial cells from the ventricular and subventricular zones through 
the intermediate zone (Fig. 11.4a) [6-10]. A single radial glial cell can 
span the entire distance between the ventricular and pial surfaces. 
As the proneurons ‘climb’ to the cortical plate along the processes of 
the radial glial cell, they produce a vertically oriented cortical cell 
column. This radial migration of immature neurons ultimately leads 
to the formation of cortical layers II to VI, and these proneurons dif- 
ferentiate into the various types of glutamatergic pyramidal neurons 
in the adult cerebral cortex. A prototype of the structural segregation 
of the cerebral cortex into numerous areas in the adult brain is already 
preformed during this migration [11]. A further feature of migration 
is the inside-to-outside layering of the immature, radially migrating 
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neurons. The earliest-born neurons are found in the deepest layers, 
and the latest in the most superficial layers of the cortical plate. Thus, 
adult layers V and VI of the cerebral cortex are generated before layer 
IV, and layer IV is generated before layers III and II (Fig. 11.4b). The 
inside-out layering process is controlled by the extracellular matrix 
glycoprotein reelin, expressed in the Cajal-Retzius cells of the mar- 
ginal zone. When cortical proneurons are migrating radially, reelin 
stops the approaching cells as they approach the marginal zone. The 
next wave of migrating cells moves through the first already ‘immo- 
bilized’ wave and is brought to halt again near the marginal zone. This 
process continues until the production of proneurons is exhausted, 
and thus migration is terminated. Therefore, reelin is important for 
the control of the inside-to-outside layering of the cortex [7]. Loss 
of reelin leads to severe disturbances of the normal layering of the 
cerebral cortex. In addition to reelin, the microtubule regulator pro- 
tein LIS1, the neuronal migration protein doublecortin DCX, and the 
tumour suppressor protein p73 are also crucial factors for the normal 
migration of proneurons. 

At the same time, tangential migration of immature cells born 
in the ganglionic eminence and the preoptic area (PoA) in the 
subpallium takes place. The ganglionic eminence can be subdivided 
into a medial (MGE) and a lateral ganglionic eminence (LGE) at 
more rostral levels, which merge to the caudal ganglionic eminence 
(CGE) caudally. These immature cells migrate from the MGE, CGE, 
and PoA through the marginal and intermediate zones into the cor- 
tical plate (Fig. 11.5) and develop into the inhibitory interneurons of 
the adult cerebral cortex [7, 12]. The cells stemming from the dorsal 
part of the MGE represent 60% of the later cortical interneurons, that 
is, the parvalbumin-positive basket and chandelier cells, whereas 
the ventral part of the MGE generates the somatostatin-positive 
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Fig. 11.4 (see Colour Plate section) Histogenesis and radial migration in the cerebral cortex. (a) Development in the fetal hemispheric wall based on 
recent data [48, 54-56, 62, 63]. (b) Adult cortical layering in the human neopallium. I-VI, cortical layers; EP, ependymal layer; cp, cortical plate; isvz, 
inner subventricular zone; iz, intermediate zone with the first arriving nerve fibres establishing connection with immature neurons; mz, marginal zone; 
sp, subplate; osvz, outer subventricular zone; vz, ventricular zone; WM, white matter. 
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Fig. 11.5 (see Colour Plate section) Schematic drawing of the precursor 
regions of inhibitory neurons later found in the cerebral cortex, basal 
ganglia, amygdala, and basal forebrain [64]. Precursor regions: CGE, 
caudal ganglionic eminence; DG, diagonal area; LGE, lateral ganglionic 
eminence; MGE, medial ganglionic eminence; PoA, preoptic area. 
Transcription factors: Dbx1, developing brain homeobox protein 1; Dlx2, 
Distal-less protein 2; Nkx2.1, thyroid transcription factor 1; Nr2f2, ligand- 
activated transcription factor Nr2f2. Cp, cortical plate; iz, intermediate 
zone; mz, mantle zone; vz + svz, ventricular + subventricular zone. 


Source: data from Nieuwenhuys R, Puelles L, Towards a new neural morphology, 
Copyright (2016), Springer International Publishing Switzerland. 








interneurons, mainly Martinotti cells [13-16]. The CGE gives 
rise to about 30% of the later cortical interneurons, which include 
calretinin and vasoactive inhibitory polypeptide-expressing bipolar 
and double bouquet cells, as well as reelin-expressing neurogliaform 
and multipolar cells [17-19]. The cells originating in the PoA con- 
tribute to 10% of the cortical interneurons and are represented in 
all groups of GABAergic interneurons. They all are derived from 
progenitor cells expressing the developing brain homeobox pro- 
tein 1 (Dbx1) [16, 20]. Additionally, oligodendrocyte precursor cells 
migrate from the PoA to the pallium and subpallial target regions 
[21]. Therefore, the cells of the cerebral cortex have, like those of the 
cerebellar cortex, two different sources. In the case of the cerebral 
cortex, the sources are the ventricular and subventricular zones for 
the glutamatergic pyramidal neurons, and the MGE and CGE, as 
well as the PoA, for the inhibitory interneurons. 

Although interneurons are mainly born in the ganglionic emi- 
nence and have completed their tangential migration by the end of 
fetal development [7, 12], a small number of perinatally born neurons 
which express doublecortin DCX and markers of interneurons are 
found during the first 2-5 months of life in the subventricular zone 
of the lateral ventricles [22]. Some of these young neurons migrate 
tangentially along the ventricular wall, while others move radially 
through the developing white matter into the overlying cortex where 
they develop into GABAergic interneurons, which are thought to 
contribute to developmental plasticity [22]. 


The LGE is the precursor region of GABAergic projection neurons 
of the striatum and pallidum [23, 24], and the MGE of those of the 
striatum and pallidum [25, 26]. The CGE develops into the larger 
subpallial part of the amygdala, although the MGE, LGE, CGE, and 
PoA also contribute precursor cells to the amygdala [23, 27-29]. The 
progenitor cells from the diagonal area (Fig. 11.5) migrate into the 
subcortical regions and develop into the adult cholinergic neurons 
of the basal forebrain and striatum [25, 27]. 

Regional differences in the development of the cortical plate sub- 
divide the hemisphere into segments. The lateral segment, with a 
well-developed cortical plate and presubplate, develops into the neo- 
cortex. The mediodorsal segment, with a wide marginal zone and a 
thin, folded cortical plate, develops into the archicortex, including 
the hippocampus. The mediobasal segment, with its inconspicu- 
ously developed cortical plate, is the precursor of the palaeocortex. 
The basolateral segment, that is, the ganglionic eminence, generates 
cortical interneurons and develops into the corpus striatum, amyg- 
dala, and septum. 

During weeks 10 and 12, the axons of serotoninergic and 
noradrenergic neurons contribute to the first synapses in the mar- 
ginal and presubplate zones where neurotrophin receptors are ex- 
pressed. During the following 3 weeks, the subplate zone develops, 
as axons grow in from the basal forebrain and thalamus, dendrites 
enlarge, and synapses form. From weeks 16-24, the cortical anlage 
has a small marginal zone, a wide cell-dense cortical plate, and a very 
wide and less cell-dense subplate. 

Transformation into the adult neocortical pattern starts between 
weeks 25 and 34, as migration and proliferation of proneurons di- 
minish. Dendrites begin to differentiate, and synapses begin to 
develop in the deepest cortical layers, progressing to the most super- 
ficial layer. Before birth, six cortical layers can be recognized in all 
regions of the neocortex. In the region of the motor cortex, layer IV 
(inner granular layer) is invaded by pyramidal cells during the post- 
natal period, to such a degree that this layer becomes inconspicuous. 
Therefore, the motor cortex has been classified as five-layered, 
agranular neocortex [30], although the typical granular cells of layer 
IV could be demonstrated between the pyramidal cells [31, 32]. 
Shortly before birth, the subplate, the subventricular zone, and most 
of the ventricular zone disappear; neuronal proliferation ceases, and 
the intermediate zone is transformed into the white matter of the 
pallium. The rest of the ventricular zone contributes to the epen- 
dymal layer of the ventricular surface. 

Dendritic and axonal differentiation continue after birth and 
into adult life, and their time courses differ across brain regions. 
Myelination of the cortical-vestibular system is finished shortly 
before birth; that of the somatosensory, visual, auditory, pyram- 
idal, and extra-pyramidal fibre tracts is nearly complete by the 
end of the third postnatal year, and that of the associative fibre 
tracts in the cerebral hemispheres is continued until the second 
decade [33, 34]. Although synaptic density in neonatal brains is 
comparable to that found in adults, synaptogenesis continues after 
birth, reaches a maximum during the first postnatal year, and pro- 
ceeds at a lower rate during childhood. The peak period of syn- 
apse formation in the primary visual cortex is reached between 3 
and 4 months, whereas in the prefrontal cortex, it occurs at around 
8 months of age [35, 36]. The key change in synapses after birth, 
however, is pruning, that is, the process of selective elimination 
of excess synapses, which is essential for the balance between 


excitatory and inhibitory synapses and for the normal develop- 
ment of functional neural circuits. Developmental pruning is initi- 
ated during late fetal stages; it increases considerably shortly after 
birth and continues into puberty, when synaptic density stabilizes 
at adult levels. Pruning occurs earlier in sensorimotor areas than 
in association cortices. 

Microglia play a role in the pruning process, and their activity 
is modulated by the initiating protein of the classical complement 
cascade (Clq) and the complement cascade components 3 and 4 
(C3 and C4, respectively), as well as the microglial complement re- 
ceptor 3 (CR3) [37-40]. Early loss of hippocampal synapses, as a 
result of an abnormally high Clq expression during development, 
was reported in a mouse model of Alzheimer’s disease [40], and the 
low synapse density found in the brain of schizophrenics has been 
associated with excessive C4 activity during development [41-43], 
highlighting the importance of normal pruning process for correct 
neurodevelopment. 


Cortical folding 


After the appearance of the lateral fissure, the neocortical surface 
develops many additional sulci and gyri during the fetal period 
(Fig. 11.6). The central, collateral, cingulate, parieto-occipital, su- 
perior temporal, and calcarine sulci appear between weeks 16 and 
21, followed by the pre- and post-central, frontal, temporal, and 
intraparietal sulci. Highly variable secondary and tertiary sulci de- 
velop between week 29 and birth, when all sulci have been formed 
[44, 45]. Although all gyri and sulci are present at birth, the depth of 
the sulci increases until two-thirds of the cortical surface is hidden 
in them in the adult stage [44]. Measurements of the intensity of 
cortical folding show that gyrification is greatest in the regions 
where association cortices are found, particularly in the posterior 
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prefrontal, parietal, lateral temporal, and occipito-temporal cortices 
and the cuneus and precuneus (Fig. 11.7b) [44, 46]. 

The reasons for the formation of gyri in many mammalian brains, 
including the human brain, are not completely understood. Since 
the vertical cell columns positioned side by side represent a basic 
organizational principle of the cerebral cortex, growth of the cortex 
inevitably leads to a considerable enlargement of the cortical sur- 
face. A large unfolded cortical surface would have two major dis- 
advantages: (1) the volume of the skull would increase to such a 
degree during fetal development that a normal delivery would be 
impossible; and (2) the distance between cortical regions intercon- 
nected by intrahemispheric projection fibres would increase and, 
with it, the information transmission time. Cortical folding there- 
fore allows a maximal cortical surface in a minimal volume, and an 
optimization of the speed of neural transmission between cortical 
areas. This has led to speculations about the relationship between 
the growing brain surface and the skull size at birth. A large brain 
surface must be folded in order to minimize the brain diameter 
and thus enable a minimal skull diameter at the time of birth. In 
fact, a comparison between the growth of brain weight and cortical 
folding during human lifespan shows that the increase in cortical 
folding clearly precedes that of brain weight before and after birth 
(Fig. 11.7a). This, however, does not explain the mechanisms of 
cortical folding. Presently, the two most widely discussed hypoth- 
eses are the mechanical tension hypothesis [47] and the grey matter 
hypothesis [48]. 

The biophysical basis of the mechanical tension hypothesis are the 
three-dimensional courses and the viscoelastic properties of fibre 
tracts [47, 49, 50]. The first fibre tracts between cortical and other 
brain regions are already visible during subplate development and 
maintained during migration. More tangentially organized cortico- 
cortical than radially organized cortico-subcortical connections are 
found in the human brain. Therefore, the tension hypothesis predicts 
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Fig. 11.6 Drawings of coronal sections through a fetal brain during the 30th week of gestation. (a) A newborn brain and (b) an adult brain, (c) showing 


the increase in cortical folding. 
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Fig. 11.7 (see Colour Plate section) (a) Development of brain weight and gyrification index (GI) as a measure of the intensity of cortical folding during 
human lifespan. Data from Zilles et al. [44] and Armstrong et al. [45]. (b) Distribution of the local degree of cortical folding throughout the adult human 
brain, after Jockwitz et al. [46]. Red (dark grey in the printed version) indicates highest, yellow (pale grey in the printed version) lowest degrees of cortical 


folding. 


Reproduced from Brain Structure and Function, 222(1), Jockwitz C, Caspers S, Heine H, et al., Age- and function-related regional changes in cortical folding of the default mode 
network in older adults, pp. 83-99, Copyright (2017), with permission from Springer Nature. 


an outward folding caused by strong cortico-cortical and weak 
cortico-subcortical connections [47]. However, tension is not found 
across developing gyri [51]. Despite this finding, regional variations 
in tension can shape the folding pattern and its correlation with the 
spatial organization of the connectome [52, 53]. Using computational 
modelling, it was demonstrated that differential growth of cortical 
sites drives folding, consistent with the folding geometry and stress 
distribution [51]. In summary, the tension-based hypothesis explains 
a major factor of the mechanisms behind cortical folding. 

The grey matter hypothesis is founded on the role of the 
subventricular zone as a precursor domain of a new type of radial 
glia cells, that is, the intermediate radial glia cells (IRGCs). The 
subventricular zone of the fetal hemisphere (Fig. 11.4) consists of 
an inner (isvz) and an outer (osvz) subventricular zone in human 
brains [54, 55]. The osvz contains neural stem and intermediate 
progenitor cells, becomes the predominant proliferative zone in 
the developing human brain, and expands considerably between 


gestational weeks 11-16 in humans [56], just before the beginning 
of cortical folding. Additionally, a self-amplifying progenitor cell 
has recently been found in the osvz. This IRGC generates a radially 
oriented scaffold, in addition to that formed by the classical bipolar 
radial glia cells (Fig. 11.4b). This way, the osvz can modify the trajec- 
tory of migrating proneurons. An experimentally induced reduction 
of proliferation in the osvz causes a reduced cortical surface area and 
reduced folding [55]. Thus, proliferation of IRGCs could play an im- 
portant role in the folding process of the fetal cortex [48, 55, 57] (for 
a recent review, see [58]). 

The subplate zone (SP) reaches its largest extension in the re- 
gion of the late-maturing and folding association cortices. During 
fetal development, transient cortico-cortical and callosal connec- 
tions are found in the SP zone, before they enter the cortical plate 
[59]. The long, enduring growth of the SP zone and the entire brain 
(Fig. 11.7a), the regional heterochronicity of cortical folding [45], 
and particularly the early connections in the SP zone may contribute 


to mechanical tensions, resulting in the late-appearing, numerous 
gyri in multimodal association regions. The differential growth of 
cortical layers may also contribute to cortical folding [60]. Since the 
growth of the supragranular layers exceeds that of the infragranular 
layers, folding could be a result of this differential growth pro- 
cess, but the regionally different proportions between supra- and 
infragranular layers have not been comprehensively studied and 
precisely related to regions of high or low cortical folding. Finally, 
previous reports do not support a simple relationship between 
gyrification and the total number of cortical neurons or the cortical 
volume. Instead, a recent study demonstrates a universal scaling be- 
tween cortical folding as a function of the product of the cortical sur- 
face area and the square root of cortical thickness in a large sample 
of mammalian gyrencephalic, as well as lissencephalic, brains [61]. 


REFERENCES 


1. Ladher, R. and Schoenwolf, G.V. (2005). Making neural 
tube: neural induction and neurulation. In: Rao, M.S. and 
Jacobson, M. (eds.) Developmental neurobiology. Kluwer 
Academic/Plenum Publishers, New York, NY. pp. 1-20. 

2. Wurst, W. and Bally-Cuif, L. (2001). Neural plate patterning: up- 
stream and downstream of the isthmic organizer. Nature Reviews 
Neuroscience, 2, 99-108. 

3. Wurst, W., Auerbach, A.B., and Joyner, A.L. (1994). Multiple 
developmental defects in Engrailed-1 mutant mice: an early 
mid-hindbrain deletion and patterning defects in forelimbs and 
sternum. Development, 120, 2065-75. 

4. Kawahara, A., Chien, C.B., and Dawid, I.B. (2002). The 
homeobox gene mbx is involved in eye and tectum development. 
Developmental Biology, 248, 107-17. 

5. Lumsden, A. and Krumlauf, R. (1996). Patterning the vertebrate 
neuraxis. Science, 274, 1109-15. 

6. Kostovic, I. (1990). Zentralnervensystem. In: Hinrichsen, K.V. 
(ed.) Humanembryologie. Springer-Verlag, Berlin. pp. 381-448. 

7. Meyer, G. (2007). Genetic control of neuronal migrations in 
human cortical development. Advances in Anatomy, Embryology, 
and Cell Biology, 189, 1-111. 

8. Levitt, P. and Rakic, P. (1980). Immunoperoxidase localization 
of glial fibrillary acidic protein in radial glial cells and astrocytes 
of the developing rhesus monkey brain. Journal of Comparative 
Neurology, 193, 815-40. 

9. Rakic, P. (1985). Limits of neurogenesis in primates. Science, 227, 
1054-6. 

10. Supèr, H., Soriano, E., and Uylings, H.B. (1998). The functions of 
the preplate in development and evolution of the neocortex and 
hippocampus. Brain Research. Brain Research Reviews, 27, 40-64. 

11. Rakic, P. (1988). Specification of cerebral cortical areas. Science, 
241, 170-6. 

12. Rakic, P. (1995). Radial versus tangential migration of neur- 
onal clones in the developing cerebral cortex. Proceedings of the 
National Academy of Sciences fo the United States of America, 92, 
11323-7. 

13. Flames, N., Pla, R., Gelman, D.M., Rubenstein, J.L., Puelles, L., 
and Marin, O. (2007). Delineation of multiple subpallial pro- 
genitor domains by the combinatorial expression of transcrip- 
tional codes. Journal of Neuroscience, 27, 9682-95. 

14. Wonders, C.P., Taylor, L., Welagen, J., Mbata, I.C., Xiang, J.Z., 
and Anderson, S.A. (2008). A spatial bias for the origins of 
interneuron subgroups within the medial ganglionic eminence. 
Developmental Biology, 314, 127-36. 


lS 


16. 


17. 


18. 


20. 


21% 


22. 


23. 


24. 


257 


26. 


Di 


28. 


29; 


30. 


Bil. 


32) 


CHAPTER 11 Neurodevelopment 


Wonders, C.P. and Anderson, S.A. (2006). The origin and speci- 
fication of cortical interneurons. Nature Reviews Neuroscience, 7, 
687-96. 

Gelman, D.M. and Marin, O. (2010). Generation of interneuron 
diversity in the mouse cerebral cortex. European Journal of 
Neuroscience, 31, 2136-41. 

Miyoshi, G., Hjerling-Leffler, J., Karayannis, T., et al. (2010). 
Genetic fate mapping reveals that the caudal ganglionic eminence 
produces a large and diverse population of superficial cortical 
interneurons. Journal of Neuroscience, 30, 1582-94. 

Lee, S., Hjerling-Leffler, J., Zagha, E., Fishell, G., and Rudy, B. 
(2010). The largest group of superficial neocortical GABAergic 
interneurons expresses ionotropic serotonin receptors. Journal of 
Neuroscience, 30, 16796-808. 


. Ma, T., Wang, C., Wang, L., et al. (2013). Subcortical ori- 


gins of human and monkey neocortical interneurons. Nature 
Neuroscience, 16, 1588-97. 

Gelman, D., Griveau, A., Dehorter, N., et al. (2011). A wide diver- 
sity of cortical GABAergic interneurons derives from the embry- 
onic preoptic area. Journal of Neuroscience, 31, 16570-80. 
Tekki-Kessaris, N., Woodruff, R., Hall, A.C., et al. (2001). 
Hedgehog-dependent oligodendrocyte lineage specification in 
the telencephalon. Development, 128, 2545-54. 

Paredes, J.A., James, D., Gil-Perotin, S., et al. (2016). Extensive 
migration of young neurons into the infant human frontal lobe. 
Science, 354, pii: aaf7073. 

Bupesh, M., Legaz, I., Abellan, A., and Medina, L. (2011). 
Multiple telencephalic and extratelencephalic embryonic do- 
mains contribute neurons to the medial extended amygdala. 
Journal of Comparative Neurology, 519, 1505-25. 

Yun, K., Garel, S., Fischman, S., and Rubenstein, J.L. (2003). 
Patterning of the lateral ganglionic eminence by the Gsh1 and 
Gsh2 homeobox genes regulates striatal and olfactory bulb 
histogenesis and the growth of axons through the basal ganglia. 
Journal of Comparative Neurology, 461, 151-65. 

Marin, O., Anderson, S.A., and Rubenstein, J.L. (2000). Origin 
and molecular specification of striatal interneurons. Journal of 
Neuroscience, 20, 6063-76. 

Pauly, M.C., Débréssy, M.D., Nikkhah, G., Winkler, C., and 
Piroth, T. (2014). Organization of the human fetal subpallium. 
Frontiers in Neuroanatomy, 7, 54. 

Garcia-Lopez, M., Abellan, A., Legaz, I., Rubenstein, J.L., Puelles, 
L., and Medina, L. (2008). Histogenetic compartments of the 
mouse centromedial and extended amygdala based on gene ex- 
pression patterns during development. Journal of Comparative 
Neurology, 506, 46-74. 

Morales-Delgado, N., Merchan, P., Bardet, S.M., Ferran, J.L., 
Puelles, L., and Diaz, C. (2011). Topography of somatostatin 
gene expression relative to molecular progenitor domains 
during ontogeny of the mouse hypothalamus. Frontiers in 
Neuroanatomy, 5, 10. 

Medina, L., Abellan, A., Vicario, A., and Desfilis, E. (2014). 
Evolutionary and developmental contributions for under- 
standing the organization of the basal ganglia. Brain, Behavior 
and Evolution, 83, 112-25. 

Brodmann, K. (1909). Vergleichende Lokalisationslehre der 
Großhirnrinde in ihren Prinzipien dargestellt auf Grund des 
Zellbaues. Barth, Leipzig. 

Barbas, H. and Garcia-Cabezas, M.A. (2015). Motor cortex layer 
4: less is more. Trends in Neuroscience, 38, 259-61. 
Garcia-Cabezas, M.A. and Barbas, H. (2014). Area 4 has layer IV 
in adult primates. European Journal of Neuroscience, 39, 1824-34. 








SECTION 2 The scientific basis of psychiatric aetiology and treatment 


335 


34. 


315), 


36. 


af 


38. 


39. 


40. 


41. 


42. 


43. 


44. 


45. 


46. 


47. 


48. 


Yakolev, P.I. and Lecours, A.R. (1967). The myelogenetic cycles 
of regional maturation of the brain. In: Minkowski, A. (ed.) 
Regional development of the brain in early life. Blackwell Science, 
Oxford. pp. 3-70. 

Miller, D.J., Duka, T., Stimpson, C.D., et al. (2012). Prolonged 
myelination in human neocortical evolution. Proceedings of the 
National Academy of Sciences of the United States, 109, 16480-5. 
Huttenlocher, P.R. (1979). Synaptic density in human frontal 
cortex—developmental changes and effects of aging. Brain 
Research, 163, 195-205. 

Huttenlocher, P.R. (1990). Morphometric study of human cere- 
bral cortex development. Neuropsychologia, 28, 517-27. 

Schafer, D.P., Lehrman, E.K., Kautzman, A.G., et al. (2012). 
Microglia sculpt postnatal neural circuits in an activity and 
complement-dependent manner. Neuron, 74, 691-705. 

Bialas, A.R. and Stevens, B. (2013). TGF-b signaling regulates 
neuronal Clq expression and developmental synaptic refine- 
ment. Nature Neuroscience, 16, 1773-82. 

Fourgeaud, L. and Boulanger, L.M. (2007). Synapse remodeling, 
compliments of the complement system. Cell, 131, 1034-6. 
Hong, S., Beja-Glasser, V.E, Nfonoyim, B.M., et al. (2016). 
Complement and microglia mediate early synapse loss in 
Alzheimer mouse models. Science, 352, 712-16. 

Glantz, L.A. and Lewis, D.A. (2000). Decreased dendritic spine 
density on prefrontal cortical pyramidal neurons in schizo- 
phrenia. Archives of General Psychiatry, 57, 65-73. 

Garey, L.J., Ong, W.Y., Patel, T.S., et al. (1998). Reduced dendritic 
spine density on cerebral cortical pyramidal neurons in schizo- 
phrenia. Journal of Neurology, Neurosurgery, and Psychiatry, 65, 
446-53. 

Sekar, A., Bialas, A.R., de, R.H., et al. (2016). Schizophrenia risk 
from complex variation of complement component 4. Nature, 
530, 177-83. 

Zilles, K., Armstrong, E., Schleicher, A., and Kretschmann, H.-J. 
(1988). The human pattern of gyrification in the cerebral cortex. 
Anatomy and Embryology, 179, 173-9. 

Armstrong, E., Schleicher, A., Omram, H., Curtis, M., and Zilles, 
K. (1995). The ontogeny of human gyrification. Cerebral Cortex, 
5, 56-63. 

Jockwitz, C., Caspers, S., Lux, S., et al. (2017). Age- and function- 
related regional changes in cortical folding of the default mode 
network in older adults. Brain Structure Function, 222, 83-99. 
van Essen, D.C. (1997). A tension-based theory of morphogen- 
esis and compact wiring in the central nervous system. Nature, 
385, 313-18. 

Kriegstein, A., Noctor, S., Martinez-Cerdeno, V. (2006). Patterns 
of neural stem and progenitor cell division may underlie 


49. 


50. 


eile 


52. 


59 


54. 


Bos 


56. 


57 


58. 


59. 


60. 


6l. 


62. 


63. 


64. 


evolutionary cortical expansion. Nature Reviews Neuroscience. 7, 
883-90. 

Dennerll, T.J., Lamoureux, P., Buxbaum, R.E., and Heidemann, 
S.R. (1989). The cytomechanics of axonal elongation and retrac- 
tion. Journal of Cell Biology, 109(6 Pt 1), 3073-83. 

van Essen, D.C. (2007). Cerebral cortical folding patterns in 
primates: why they vary and what they signify. In: Kaas, J.H. 
(ed.) Evolution of nervous systems. Elsevier, Amsterdam. pp. 
267-76. 

Xu, G., Knutsen, A.K., Dikranian, K., Kroenke, C.D., Bayly, P.V., 
and Taber, L.A. (2010). Axons pull on the brain, but tension does 
not drive cortical folding. Journal of Biomechanical Engineering, 
132, 071013. 

Hilgetag, C.C. and Barbas, H. (2006). Role of mechanical fac- 
tors in the morphology of the primate cerebral cortex. PLoS 
Computational Biology, 2, e22. 

Toro, R., Perron, M., Pike, B., et al. (2008). Brain size and folding 
of the human cerebral cortex. Cerebral Cortex, 18, 2352-7. 

Fietz, S.A., Kelava, I., Vogt, J., et al. (2010). OSVZ progenitors 

of human and ferret neocortex are epithelial-like and expand by 
integrin signaling. Nature Neuroscience, 13, 690-9. 

Reillo, I., de Juan, R.C., Garcia-Cabezas, M.A., Borrell, V. (2011). 
A role for intermediate radial glia in the tangential expansion of 
the mammalian cerebral cortex. Cerebral Cortex, 21, 1674-94. 
Lui, J.H., Hansen, D.V., and Kriegstein, A.R. (2011). Development 
and evolution of the human neocortex. Cell, 146, 18-36. 

Molnar, Z. and Clowry, G. (2012). Cerebral cortical development 
in rodents and primates. Progress in Brain Research, 195, 45-70. 
Zilles, K., Palomero-Gallagher, N., and Amunts, K. (2013). 
Development of cortical folding during evolution and ontogeny. 
Trends in Neuroscience, 36, 275-84. 

Kostovic, I. and Judas, M. (2010). The development of the 
subplate and thalamocortical connections in the human foetal 
brain. Acta Paediatrica, 99, 1119-27. 

Richman, D.P., Stewart, R.M., Hutchinson, J.W., and Caviness, V. 
(1975). Mechanical model of brain convolutional development. 
Science, 189, 18-21. 

Mota, B. and Herculano-Houzel, S. (2015). Cortical folding 
scales universally with surface area and thickness, not number of 
neurons. Science, 349, 74-7. 

Hansen, D.V., Lui, J.H., Parker, P.R., and Kriegstein, A.R. (2010). 
Neurogenic radial glia in the outer subventricular zone of human 
neocortex. Nature, 464, 554-61. 

Rakic, P. (2009). Evolution of the neocortex: a perspective from 
developmental biology. Nature Reviews Neuroscience, 10, 724-35. 
Nieuwenhuys, R. and Puelles, L. (2016). Towards a new neural 
morphology. Springer, Cham. 


roimaging technologies 


Woolrich, Mark Jenkinson, and Clare Mackay 




















Brain imaging provides a window into the living brain and is an 
increasingly essential tool for experimental medicine in psych- 
iatry. The inaccessibility of central nervous system tissues to biopsy 
and the need to study the ‘system, and not just the cell, make ad- 
vanced brain imaging central to uncovering the pathophysiology, 
discovering and evaluating new targets, developing and testing the 
efficacy of drugs, and developing biomarkers for diagnosis, patient 
stratification, and therapeutic monitoring. Here, we introduce the 
three technical domains by which we can assay brain structure and 
function: magnetic resonance imaging (MRI), molecular imaging 
(positron emission tomography (PET), and single-photon emission 
computed tomography (SPECT); electrophysiology [electroenceph- 
alography (EEG) ]; and magnetoencephalograpy (MEG). Each of the 
imaging modalities has its own strengths and limitations, and the 
modalities should be regarded as complementary, rather than com- 
peting. Typically, the signal that is detected is an average of tens of 
thousands of cells. Nevertheless, imaging techniques are sensitive 
to everything, from molecular concentrations up to whole system 
dynamics, making the essential link between the cell and the system. 
Table 12.1 gives an overview of the sensitivity of each imaging mo- 
dality to aspects of neurophysiology. 





One of the most widely used and flexible tools for neuroimaging 
studies is MRI. This imaging method is non-invasive, causes no 
tissue damage, and can provide high-quality images of brain 
anatomy, with excellent soft tissue contrast (structural MRI), or for 
measuring properties of axonal architecture in the white matter (dif- 
fusion MRI), or for detecting neuronal activity (functional MRI). 
Spatial resolution for MRI is in the order of a millimetre, and MRI 
scanners are readily available in both clinical and research settings. 


Principles of MRI 


MRI is based on magnetic interactions with specific nuclei, using the 
principle of nuclear magnetic resonance (NMR). Although NMR re- 
lies on properties of the nuclei within the atoms, it interacts in a way 
that does not harm tissue. 





Two types of magnetic fields are important in MRI: (1) strong 
static B, field; and (2) radiofrequency (RF) fields. The B, field is cre- 
ated by the large, superconducting coil that defines the strength of 
the scanner; for instance, a 3T scanner has a B, field of 3 tesla (T). 
Both 1.5T and 3T scanners are common in clinical environments, 
while 3T and 7T scanners are more common for research. Higher- 
field scanners are advantageous, as they can reduce noise and im- 
prove spatial resolution. 

RF fields are used to manipulate the magnetization of mol- 
ecules in the brain. The magnetization can then be detected 
and measured by the MRI scanner to produce images. For most 
neuroimaging MRI applications, the RF is tuned to manipulate 
hydrogen nuclei in water molecules because of their high abun- 
dance in all biological tissues. Structural, functional, and diffu- 
sion MRI work by manipulating the magnetization in different 
ways to make it sensitive to particular microscopic properties of 
water and its environment. 


Overview of acquisition and analysis 


MRI acquisitions can be optimized for studies by varying options, 
for example, the type (modality) of image (for example, functional, 
diffusion, etc.) and various parameters (for example, echo time). 
However, some noise and artefacts are common in MRI, caused by 
scanner imperfections or the physics/physiology of subjects such as 
head motion. Most MRI artefacts are rare, but some are common 
(for example, head motion, bias field), and analysis strategies are 
needed to reduce their effects. 

Almost all neuroimaging research studies involve groups of 
subjects, to infer results about the wider population, accounting for 
between-subject variation. Statistical analysis is usually conducted 
separately on each voxel (three-dimensional pixel), with correc- 
tions needed for the multiple statistical tests this requires, and re- 
sults are generated in the form of statistical maps that are displayed 
as coloured overlays on brain images. These analyses require that 
all subjects are spatially aligned (registered) together, for anatomical 
consistency. 


Structural MRI 


The most common MRI modality is structural imaging, which is 
routinely used in both clinical practice and research studies. It pro- 
vides images of brain anatomy, with excellent soft tissue contrast, 
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Table 12.1 The sensitivity of various imaging modalities in aspects of neurophysiology 








Neurotransmitters PET, MRS Radiotracers can probe dopamine and serotonin systems, including labels for 
synthesis capacity, receptors, and transporters. The glutamate system also has PET 
ligands. Both GABA and glutamate can be indirectly assayed with high-field MRS 

Protein aggregates PET In ageing and dementia, tracers for amyloid plaques are now commercially available 
and novel tracers are in development for tau and alpha-synuclein 

Action potentials EG/EEG Synaptic potentials in the brain give rise to measurable changes in electrical and 
magnetic fields that are measurable outside the surface of the head 

Metabolism PET, MRI PET radiotracers provide relatively direct measurement of blood flow, blood volume, 
and oxygen/glucose metabolism. MRI techniques, such as arterial spin labelling, can 
produce similar measures non-invasively 

Individual brain structures R Structural MRI typically has a resolution of 1 mm? and good tissue contrast for 
examining individual brain structures 

Tissue integrity R MRI provides contrast mechanisms to explore tissue integrity, including relaxometry 
and susceptibility mapping 

White matter connectivity R Diffusion-weighted or diffusion tensor imaging provides indirect estimates of white 
matter pathways 

Regional brain function RI, PET, M/EEG PET has largely been replaced by fMRI for studying brain function. fMRI provides 
good spatial detail, but at low temporal resolution (seconds). This is complemented 
well by the high temporal resolution (milliseconds) of M/EEG 

Brain networks RI, M/EEG Brain connectomics is a new and growing field, enabling the use of neuroimaging 














data to model brain network dynamics 


typical spatial resolution of 1 mm, and typical acquisition times of 
3-15 minutes. In clinical practice, radiologists examine these im- 
ages for signs of pathology for disease diagnosis. In neuroimaging 
research, images are either analysed directly to extract quantitative 
measures from them (for example, hippocampal volume) or used as 
part of other analysis pipelines (for example, for alignment in func- 
tional studies). 


Measurement principles 


Most structural images are based on three basic properties: proton 
density (PD), and T, and T, relaxation times. Each of these influ- 
ences the signal that is received from the hydrogen nuclei in water 
molecules. Proton density is effectively the density of water in tis- 
sues, and the received signal is proportional to this, although this 
alone creates weak tissue contrast for grey and white matter. 

The relaxation times T, and T, are related to different processes 
that affect magnetization within water molecules. Many factors af- 
fect T, and T, relaxation times such as molecular tumbling, chem- 
ical environment, geometry of nearby cells/structures, etc. Hence 


relaxation times are a complicated function of tissue properties but 
depend on tissue composition (for example, proportion of neuronal 
bodies vs axons). These differences in relaxation time provide su- 
perior tissue contrast for grey and white matter. 


Acquisitions 


Many types of structural MRI acquisition exist, with PD, T,- 
weighted, T,-weighted, and fluid-attenuated inversion recovery 
(FLAIR) being the most common (Fig. 12.1). This variety is possible 
because of the detailed timings of fields applied by the scanner, spe- 
cified by an MRI pulse sequence. These settings enable the influence 
of the T, and T, relaxation processes on the intensity to be separately 
adjusted. 

For example, for a PD image, the sequence is set such that T, and 
T, relaxation times have minimal effect and the primary contrast 
(intensity difference between tissues) is driven by the PD. For a 
T,-weighted image (which is the most commonly acquired struc- 
tural MRI scan), the sequence is set to enhance the effect of T, 
relaxation, while minimizing the effect of T, relaxation, although 





Fig. 12.1 Examples of the variety of structural images that can be acquired with MRI. From left: T,-weighted, T,-weighted, fluid-attenuated inversion 
recovery (FLAIR), FLAIR again, susceptibility-weighted gradient echo (SWI). Different types are sensitive to different tissue characteristics, particularly 
pathological lesions and tissues, and so it is common to get a range of structural images for each subject. 


PD weighting remains. Another commonly acquired structural 
image is FLAIR, which attenuates signal from the cerebrospinal 
fluid (CSF), to reduce CSF contamination and highlight patho- 
logical lesions. 


Limitations and artefacts 


Noise is common across all MRI modalities, and this constrains 
the accuracy of quantitative results and limits statistical power. 
Consequently, research studies typically require groups of 20 
subjects or more. 

Another common limitation is the partial volume effect (PVE) 
due to the finite spatial resolution in MRI—typically between 0.5 
and 1mm for structural images. At this scale, a voxel often contains 
several tissues (for example, in the cortical folds, voxels often con- 
tain grey matter and some CSF or white matter). The voxel intensity 
is then a mixture of tissue intensities, weighted according to the pro- 
portion of the voxel volume they occupy. This results in the borders 
of structures appearing blurry, limiting the precision of boundary 
localization and volume calculations. 

Magnetic resonance intensity values are arbitrarily scaled, and 
not quantitative. Thus, image analysis techniques are necessary to 
estimate quantitative information, for example, the volume of grey 
matter. 

It is essential to check for artefacts in MRI studies and keep in 
mind existing artefacts when interpreting results. One common 
artefact is bias field, caused by inhomogeneities in the RF fields, and 
results in large areas of the image being artificially brighter or darker 
than they should be. This artefact, unless extremely strong, is com- 
pensated for very well by analysis methods. 
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Analysis 


Several different types of analysis are done with structural MRI, 
some specific for studying anatomical changes and some useful for 
other experiments such as diffusion MRI, functional MRI, or other 
modalities like PET, MEG, and EEG. 

Registration is a fundamental analysis tool and involves aligning 
one image with another (for example, two images of the same sub- 
ject or different subjects). Although modern registration methods 
work well generally, they cannot overcome some fundamental bio- 
logical ambiguities such as aligning disparate folding patterns (for 
example, single vs double gyri, as is common in the cingulate). 

Segmentation is another fundamental analysis tool, which in- 
volves determining the location of anatomical tissues or structures 
in the image. The two main types of segmentation are tissue-type 
segmentation and structural segmentation. Tissue-type segmenta- 
tion aims to determine where the main ‘tissues’ (grey matter, white 
matter, and CSF) are in the image. The result of tissue-type segmen- 
tation is typically an image (or a map) where each voxel is assigned 
a tissue label. 

Structural segmentation can identify specific anatomical struc- 
tures (for example, hippocampus) or the grey matter cortex. Cortical 
segmentation, or modelling, usually outputs a surface, rather than a 
voxel-based, image, as this more accurately captures cortical folding 
and provides clearer ways of displaying the anatomy (Fig. 12.2). 
Notably, surface-based analysis of functional data is a powerful and 
growing alternative to voxel-based methods. 

Anatomical analysis usually measures differences in structural 
shape or volume. Cortical thickness is one quantitative measure, 
and analysing thickness differences is done between groups or with 
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Fig. 12.2 (see Colour Plate section) Examples of structural segmentation and analysis. Top left shows cortical segmentation (left hemisphere) where 
the inner and outer surfaces of the cortical grey matter are modelled, and from this, the cortical thickness can be calculated and analysed to show areas 
of differing thickness between groups (bottom left). Top right shows a set of subcortical structures (for example, caudate, hippocampus, brainstem, etc.), 
and bottom right shows an example of a shape analysis (hippocampus) displaying localized areas of difference in the shape between a patient and 


control group. 


Adapted from Jenkinson M, Chappell M, Introduction to Neuroimaging Analysis, Copyright (2017), with permission from Oxford University Press. 
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respect to a covariate of interest (for example, disability score). Such 
analyses use statistical methods to produce a map of significant 
results—normally displayed as a colour overlay (Fig. 12.2). 

Deep grey matter structure segmentations are typically summar- 
ized by volume measures or shape characteristics. In shape analysis, 
each localized region of the boundary is examined to find changes 
in position that relate to group differences or covariates of interest. 
This results in a map of significant results at the boundary of the 
structure (Fig. 12.2). 

Whole brain voxel-wise analysis can be used to find anatomical 
differences at any location, for example, voxel-based morphometry 
(VBM) [1]. This combines tissue-type segmentation and registra- 
tion to quantify the amount of ‘local’ grey matter volume for voxel- 
wise analysis (Fig. 12.3). 


Diffusion MRI 


White matter appears fairly uniform in structural MR images, 
whereas diffusion MRI provides richer information about micro- 
structure and anatomical connectivity. 


Measurement principles 


Axonal fibre bundles restrict how water molecules can diffuse, and 
this forms the basis of diffusion MRI. Both intracellular and extra- 
cellular water in a fibre bundle is less likely to encounter barriers 
when moving along the direction of the axons, as opposed to moving 
in other directions. Thus, maximum diffusion occurs along the dir- 
ection of the axons. 

To obtain a measurable signal, it is necessary for a large number of 
water molecules to behave in the same way, as a voxel is much bigger 
than an axon diameter. Therefore, only sizeable fibre bundles pro- 
duce detectable signals. The amount of diffusion, in all directions, 
is also affected by axon diameter, packing density, and myelination, 
and diffusion MRI is a sensitive marker of changes in these micro- 
structural properties. 


Acquisition 


Diffusion MRI uses special diffusion-encoding gradients (magnetic 
fields that change in strength along a particular direction) in order 


to measure the diffusion of water molecules. These gradients lead to 
a reduction in the signal from water that is diffusing in this encoding 
direction, and the amount of intensity reduction depends on the 
amount of diffusion. 

A single diffusion magnetic resonance image only provides infor- 
mation about diffusion in one direction. Therefore, many images, 
with many different encoding directions, are required to build up a 
full picture of the diffusion. Acquiring many images in a short time 
requires fast imaging sequences such as echo planar imaging (EPI). 
Using EPI acquisitions and other accelerations, such as parallel 
imaging methods [for example, sensitivity encoding (SENSE) or 
generalized autocalibrating partial parallel acquisition (GRAPPA) |] 
and/or simultaneous multi-slice methods, enables whole brain im- 
ages to be acquired in a few seconds or less. In neuroimaging re- 
search, the number of encoding directions acquired can be up to 
200, with acquisition times of around 5-15 minutes. Having a large 
number of encoding directions is required for tracing anatomical 
connections; however, this is not often used in clinical practice 
where a small number of directions (for example, 3-5) is often used 
for very short acquisitions. 


Limitations and artefacts 


Fast imaging sequences and diffusion encoding come with some 
drawbacks: lower spatial resolution (around 2-3 mm) and in- 
creased artefacts. Diffusion MRI artefacts include eddy current 
distortions and B, inhomogeneity distortions and motion, on top 
of artefacts that also occur in structural MRI. Many artefacts in- 
duce geometric distortions that require specialized pre-processing 
correction tools. 


Analysis 


Pre-processing steps are applied to correct for geometric distor- 
tions, though correcting for B, inhomogeneities requires separate 
scans (field maps or blip-up-blip-down pairs) to be acquired. After 
pre-processing, models of the diffusion process are fit to the inten- 
sities, to extract information about the physical diffusion of water 
molecules. One common model is the tensor model (as in diffusion 
tensor imaging or DTI), which assumes that the diffusion in different 





Fig. 12.3 (see Colour Plate section) Example of standard analysis techniques in patients with schizophrenia (a-c) and carriers of the VAL and MET 
alleles of the COMT gene (d). (a) VBM analysis with areas of significant grey matter reduction in schizophrenia relative to controls shown in red-yellow. 
(b) fMRI using a letter fluency task showing significant areas of reduced activity in schizophrenia, relative to controls. (c) Diffusion MRI-based TBSS 
(tract-based spatial statistics) analysis of the same patients with schizophrenia, relative to controls, showing reduced fractional anisotropy in the corpus 
callosum and forceps major. (d) Resting fMRI analysis of healthy VAL and MET allele homozygotes of the COMT gene, showing greater functional 


connectivity in VAL carriers, relative to METs. 


Fig. 12.3(a-c) reproduced courtesy of Tim Crow and Clare Mackay, Fig. 12.3(d) adapted from Neurolmage, 68, Tunbridge EM, Farrell SM, Harrison PJ, et al., Catechol-O- 
methyltransferase (COMT) influences the connectivity of the prefrontal cortex at rest, pp. 49-54, Copyright (2013), Elsevier Ltd. Reproduced under the Creative Commons 








Attribution License CC BY 3.0. 


directions can be modelled using a spatial Gaussian probability, 
often visualized by an ellipsoid. Each voxel is fit separately, resulting 
in a set of tensor parameters per voxel, that is, a separate map/image 
of each parameter is obtained. However, this model is not very ac- 
curate for voxels containing two or more fibre bundles running in 
different directions (for example, crossing fibres). 

From the tensor parameters, several key values are calculated such 
as mean diffusivity (MD) (average diffusion in all directions) and 
fractional anisotropy (FA), a measure of how diffusion varies with 
direction. The MD can be broken down into axial diffusivity (AD) 
and radial diffusivity (RD), which quantify the diffusion along the 
direction of the fibre bundle (AD) and perpendicular to it (RD). All 
of these quantities are surrogates of biological properties such as 
axon size, density, and myelination, but independent of direction. 
Changes in these quantities can help support or refute hypotheses 
relating to underlying biological changes, but they need to be inter- 
preted carefully. 

Analysing changes in DTI measures requires spatial alignment, 
but structural images are featureless within white matter, and there- 
fore, registrations are based on diffusion information (for example, 
using original diffusion or FA images). A widely used method for 
such analyses is tract-based spatial statistics (TBSS), which aligns 
the centre of major white matter tracts, providing results only in 
tract centres (the skeleton) where alignment is the most reliable 
(Fig. 12.3) [2]. 





CHAPTER 12 Neuroimaging technologies 


Tractography is an alternative analysis tool that traces out the 
pathways of the major axonal fibre bundles. Pre-processing is the 
same, but other models are sometimes used that can provide more 
precise estimates of the direction of the fibre bundle(s) and explicitly 
account for multiple fibres within a voxel. Information about the dir- 
ection of fibre bundles is then used to trace out pathways; starting 
at a seed point and then moving, in small steps, along the locally 
estimated direction of fibre bundles, allowing entire pathways to be 
mapped out (Fig. 12.4). 

Two main varieties of tractography algorithm exist: determin- 
istic and probabilistic. Deterministic tractography uses a single es- 
timate of direction (for each fibre) at each step. By using multiple 
seed points, a set of individual pathways, or streamlines, can be 
constructed, which are often displayed as an entire fibre tract. In 
probabilistic tractography, a probability distribution of directions is 
estimated at each voxel (for each fibre), in order to take into account 
noise and uncertainty. 

Tractography outputs can be used to estimate anatomical con- 
nectivity matrices (the connectivity between a set of grey matter 
areas) or to map out the spatial location of tracts (used in presurgical 
planning) or as a way of parcellating other structures. The latter 
case—connectivity-based segmentation—is able to parcellate areas 
of the brain on the basis of where they are most ‘strongly’ connected, 
for example, segmenting thalamic nuclei based on connections to 
the cortex. 
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Fig. 12.4 (see Colour Plate section) Examples of diffusion MRI and analysis results. Top row shows a structural image (left) for comparison, along with 
a non-diffusion-weighted image and a single diffusion-weighted image. Many of these EPl-based images need to be acquired in a diffusion MRI study. 
The two images on the right of the top row show DTl-based measures calculated from the diffusion-weighted acquisitions, which are often used as 
surrogate measures of white matter microstructure. The bottom row shows an example of estimated white matter tract directions on the left (colour 
coding based on direction: red = left-right, green = anterior-posterior, blue = inferior-superior), and the tractography results based on these (right) 
using either deterministic or probabilistic methods for tracing major fibre pathways. 

Adapted from Jenkinson M, Chappell M, Introduction to Neuroimaging Analysis, Copyright (2017), with permission from Oxford University Press. 
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Functional MRI 


Functional MRI (or fMRI) is sensitive to neuronal activity and has 
two main modes of use: task and resting state. In task fMRI, the 
participant performs a task in the scanner (for example, sensory, 
motor, cognitive), in order to determine the location and strength 
of brain activity related to the task. In resting state fMRI, no task is 
performed and the objective is to measure intrinsic characteristics of 
spontaneous brain activity such as functional connectivity between 
brain regions. 


Measurement principles 


fMRI measures neuronal activity indirectly via the BOLD 
(blood-oxygen level-dependent) effect, based on how activity 
changes the local concentration of deoxygenated haemoglobin. 
This is measurable because deoxygenated haemoglobin disturbs 
the magnetic field around it, reducing the signal from nearby 
water molecules. Notably, the haemodynamics lag considerably 
behind the electrical activity of neurons, with the signal peaking 
approximately 6 seconds after the electrical activity and taking 
approximately 20-30 seconds to return to the initial baseline 
level (Fig. 12.5). 


Basal state 


@ = HbO, 
© = Hbr 


Capillary 
bed 


ZN Venules 


Arterioles 


Acquisitions 


Measuring neuronal activity indirectly through the slow haemo- 
dynamics means that acquiring an image every few seconds is suf- 
ficient. This is typically achieved using gradient-echo EPI magnetic 
resonance images, tuned to detect signal changes caused by local 
magnetic field disturbances (which changes T,* relaxation). A typ- 
ical fMRI acquisition will have a spatial resolution of 2-3 mm and 
a temporal resolution [also repetition time (TR), the time taken to 
acquire one image] of 1-3 seconds. 


Limitations and artefacts 


The signal in fMRI is a surrogate haemodynamic measure of neur- 
onal activity, and precise timings in neuronal activity cannot be 
measured, making causality measurements extremely difficult. 
Furthermore, changes in fMRI signal between experimental 
or clinical conditions can be confounded by changes in the 
vasculature. 

The artefacts in {MRI and diffusion are similar because they both 
use EPI. However, fMRI uses gradient-echo EPI (spin-echo used in 
diffusion), which causes signal loss in the inferior frontal and tem- 
poral areas, due to the proximity of air-filled sinuses and air cells. 
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Fig. 12.5 Illustration of the BOLD (blood oxygenation level-dependent) signal used in fMRI. The top panel shows blood in a capillary within neuronal 
tissue that is in the basal (left) or activated state (right). Activity of the neurons increases both the cerebral blood flow (CBF) and the cerebral blood 
volume (CBV) in the tissue more than the oxygen extraction (CMRO.), resulting in a net decrease in the concentration of deoxygenated haemoglobin 
(Hbr). Decreased Hbr leads to a greater MRI signal, as it disturbs the local magnetic fields less. The bottom panel demonstrates the typical dynamics of 
this signal (the haemodynamic response function or HRF) in response to a very short stimulus at time zero. 

Adapted from Jenkinson M, Chappell M, Introduction to Neuroimaging Analysis, Copyright (2017), with permission from Oxford University Press. 


Head motion effects are also a major confound in fMRI, requiring 
the use of specialized analysis strategies. 


Analysis 


Pipelines for fMRI analysis start with pre-processing steps for redu- 
cing artefacts or improving the signal-to-noise ratio (SNR). Many 
pipeline versions exist but share common elements, for example By 
distortion correction, motion correction, slice-timing correction, 
spatial smoothing, and temporal filtering. Corrections for By dis- 
tortion and motion are broadly the same as for diffusion and still 
require an additional field map acquisition. 

Slice-timing correction is needed in fMRI, since the images are 
acquired a slice at a time (or several slices at a time for multiband 
acquisitions), and timing of the analysis and acquisition must 
match. Spatial smoothing aims to improve the SNR but reduces 
the effective spatial resolution, and is sometimes avoided in high- 
quality data to maintain resolution. Temporal filtering removes 
signals that do not match the characteristics of neuronal sig- 
nals such as very slow drifts caused by physical or physiological 
artefacts. 

For task fMRI, a model is constructed that incorporates stimulus 
timings and the known haemodynamic properties to create a pre- 
dicted response. These predicted responses to tasks are used in a 
multiple regression that is carried out separately in each voxel. The 
result is a spatial map showing brain areas that are significantly ac- 
tive (after correction for multiple statistical tests over many voxels) 
in response to particular tasks. The same framework can also be used 
to investigate how activation strength compares across differences in 
stimuli/tasks, groups, covariates, or experimental manipulations. 
For resting state fMRI, the two common forms of analysis are: net- 
work matrix analysis and independent component analysis (ICA). 
In network matrix analysis, functional connectivity, defined as any 
measure of statistical dependency, is computed between all pairs of 
time series (brain activity over time) extracted from all brain regions 
of interest to build up a network matrix or graph. In fMRI, the most 
commonly used metric of functional connectivity is correlation. 
These matrices can then be analysed in various ways, including 
using graph theory. The analysis of network matrices is of great 
interest and an active neuroimaging research area. 

InICA, the data set is decomposed into a set of components (similar 
to principal component analysis), and each one contains a spatial map 
and time series (Fig. 12.6). Components may relate to neuronal sig- 
nals, physiological noise, or artefacts. Each neuronally related com- 
ponent represents a set of brain regions that share, at least partially, a 
common time series. Importantly, certain components are reliably re- 
produced with recognizable spatial maps, often referred to (mislead- 
ingly) as ‘networks’ with specific names (for example, default mode 
network, dorsal attention network, etc.), and correspond well with the 
networks produced using a network matrix analysis approach. 

Network analyses can also be used to ask how networks change 
with task. In general, analysis and interpretation of networks are a 
relatively new field, and much remains to be understood about func- 
tional connectivity. Nonetheless, many applications already exist, 
including the use of network features as disease biomarkers. 


Other MRI modalities 


The three main MRI research modalities have been described in 
detail, but the versatility of MRI goes beyond these. For example, 
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Fig. 12.6 (see Colour Plate section) Illustration of resting state networks. 
[op panel shows ICA components corresponding to two networks 
motor and default mode in the first and second rows, respectively). 
These have spatial maps (left) that show the brain areas involved in 

each network (red-yellow), and time courses (middle) of fMRI signal 
uctuations that are common across all parts of that network. The power 
spectra of the time courses (right) show low frequency fluctuations, 
characteristic of slow neuronally induced haemodynamic changes. 
Bottom panel shows an example of the spatial map of a resting state 
network (default mode network) estimated from a group of subjects 

eft) and from individual subjects (right). Features from these intrinsic 
networks can be analysed for differences across clinical populations or 
experimental conditions. 

Adapted from Jenkinson M, Chappell M, Introduction to Neuroimaging Analysis, 
Copyright (2017), with permission from Oxford University Press; Bijsterbosch J, 
Smith SM, Beckmann CF, Introduction to Resting State fMRI Functional Connectivity, 
Copyright (2017), with permission from Oxford University Press. 
































magnetization transfer (MT) can be used to detect white matter 
lesions; susceptibility-weighted imaging (SWI) is used to detect 
microbleeds or iron deposition; and arterial spin labelling (ASL) 
magnetically tags blood as an intrinsic contrast agent, providing 
quantitative measures of perfusion and cerebral blood flow and 
volume. 

An MRI scanner can also be used to acquire information about 
molecular concentrations. Magnetic resonance spectroscopy (MRS) 
acquires signals from nuclei in molecules other than water, but 
their lower concentrations require reduced spatial resolution (sev- 
eral centimetres) to obtain sufficient signal, normally from one 
preselected region. Only signals from certain molecules can be sep- 
arated by MRS [for example, GABA, N-acetyl aspartate (NAA)], 
which is a limitation, although higher-field scanners (for example, 
7T) have greater specificity. 


Molecular imaging 


The vast majority of MRI-based research is based on contrasts that 
can be gleaned from manipulating the proton. The number of mol- 
ecules that can be probed by emission tomography is far greater, 
limited only by the creativity of radiochemists and the substantial 
resources required to develop new tracers. 
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How does it work? 


PET and SPECT both work on the same principle. A radiotracer is 
synthesized by adding a radioactive isotope to a molecule of interest. 
The tracer is injected into a participant who is then placed inside a 
PET or SPECT camera, and an image is formed that represents the 
rate of emission of positrons (PET) or gamma rays (SPECT) from 
a region of interest in the body. The number of PET isotopes is far 
greater than for SPECT, but SPECT technology is cheaper and more 
readily available (Fig. 12.7). 


Radioisotopes 


The radioisotopes most commonly used for SPECT are iodine-123 
and technetium. For PET, the commonly used isotopes are oxygen- 
15 (°O), carbon-11 ("C), and fluorine-18 (F), which are manu- 
factured in a particle accelerator (cyclotron). To develop a tracer, 
the radioisotope must be inserted into the molecule of interest in a 
way that does not impact the biology of the molecule and in small 
enough quantities that it does not become pharmacologically active 
in its own right. In biological compounds, it is relatively easy to insert 
(or exchange) an oxygen or carbon atom. However, the isotopes for 
1O and ''C have very short half-lives (2 minutes and 20 minutes, re- 
spectively), meaning that for "C, the cyclotron must be proximal to 
the scanner, and for "O, the isotope must be piped directly from the 
cyclotron to radiochemistry and into the participant who is already 
lying in the scanner, with no delay. With its longer half-life, "°F (half- 
life of 110 minutes) might be considered as the most convenient PET 
isotope, but the radiochemistry can be complex. 


Image acquisition and analysis 


Compared with the complex physics involved in MRI acquisition, 
PET and SPECT image acquisition are relatively straightforward. 
PET and SPECT ‘scanners’ are gamma cameras, with detectors ar- 
ranged in a ring around the object of interest. PET radioisotopes 
all have unstable nuclei that will emit protons that rapidly collide 


HO p MHz 
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with an electron, causing an annihilation that generates two gamma 
rays (photons) with a separation angle of 180°. Two photons will be 
detected, and a ‘line of response’ used to localize the source of the 
annihilation (Fig. 12.7). Data are assimilated from tens of thousands 
of events, and an image of the relative distribution of the radiotracer 
in the sample is built up. 

Analysis of SPECT images in clinical settings is either qualitative 
(radiologists report) or semi-quantitative whereby a signal intensity 
is obtained from the region of interest and compared with a reference 
region where the signal is assumed to be unaffected by the experi- 
mental manipulation. PET data are amenable to more sophisticated 
analysis techniques. The unit of analysis is the ‘binding potential, 
which is typically turned into a regional standard uptake value (SUVr) 
and can be mapped as either a static quantity obtained at a fixed time 
post-injection or a dynamic quantity with multiple image acquisitions 
showing both wash-in and washout of the tracer (Fig. 12.8). 

PET can be fully quantitative if acquired with an arterial line in 
situ. The arterial input function, combined with metabolite measure- 
ment, can be used to produce a full arterial model. However, PET 
data do not contain any anatomical information, requiring the PET 
signal to be registered to, or acquired with, a structural scan. Modern 
PET cameras are usually combined with computed tomography (CT) 
scanners (PET/CT), and recently it has become possible to combine 
PET with MRI (PET-MR). PET-MR has advantages relating to the 
superior tissue contrast of magnetic resonance relative to CT, but 
there are also methodological hurdles to overcome. In particular, the 
signal from the annihilation event is attenuated by passing through 
the skull, and this can be relatively easily mapped using CT (which 
is, in essence, a 3-dimensional X-ray). MRI is insensitive to bone, so 
attenuation correction requires a more complex solution. 


Radiotracers 


The main innovations in PET come from the development of ro- 
bust radiotracers by creating a ligand from a radioisotope and a 
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Fig. 12.7 How PET imaging works (similar for SPECT). The radioisotope is manufactured in a cyclotron and rapidly combined with the molecule 
of interest to create the tracer, which is injected into the patient. The patient is then positioned inside the PET scanner (gamma camera), which 


accumulates the data required to create the image. 
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Fig. 12.8 Signals obtained from semi-quantitative PET. Relative standard 
uptake units (SUVR) can be acquired from static data acquired at the 
optimal time post-injection (usually 40-90 minutes, depending on the 
tracer). Dynamic imaging requires long acquisition times but provides 
quantification of the pharmacokinetics of the tracer. 


molecule of interest. The most commonly used PET radiotracer is 
fluorodeoxyglucose (FDG), a glucose analogue, labelled with '*F, 
which is readily taken up by metabolically active cells. The resulting 
FDG-PET images represent regional glucose consumption (and 
thus metabolic activity), which is widely used in oncology because 
tumours are typically highly metabolically active. Both FDG and 
HYO have been used as indirect measures of cerebral blood flow 
changes induced by neuronal activity in cognitive tasks. However, 
with fMRI being non-invasive, cheaper, and logistically simpler to 
implement, task-related PET has largely been consigned to history. 
FDG continues to be a useful tracer for assessment of metabolism, 
with particular application in dementia and neurodegeneration. 


Neurotransmitter/receptor tracers 


Tracers have also been developed for investigating neurotransmitter 
systems. Two good examples relevant to psychiatric research are 
the dopamine system in schizophrenia and the serotonin system in 
mood disorders. For dopamine, the best established PET tracers (for 
example, ''C-raclopride and '*F-fallypride) bind to D2 receptors and 
are sensitive to changes in the concentration of dopamine through 
competitive interaction. These tracers are important both for mech- 
anistic studies (for example, reduced D2 receptors in schizophrenia 
[3]) and to investigate the pharmacokinetic effects of antipsychotic 
drugs (for example, receptor occupancy studies [4]). Tracers for the 
serotonin system include ''C-MDL-1009 for 5-HT2A receptors and 
“C-DASB for the 5-HT transporter and are helping to elucidate 
mechanisms of response to antidepressants [5]. 


Protein aggregate tracers 


Pathological aggregated proteins are hallmarks of several 
neurodegenerative diseases, including Alzheimer’s disease and de- 
mentia with Lewy bodies (DLB). By developing tracers sensitive to 
these aggregated proteins, it becomes possible to perform in vivo 
histology and vastly improve stratification for clinical trials [6]. 
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Fig. 12.9 (see Colour Plate section) Examples of protein aggregate 
tracers for Alzheimer's disease. 
Reproduced courtesy of Roger Gunn and Azadeh Firouzian. 


Beta-amyloid plaques and tau tangles are the hallmarks of 
Alzheimer’s disease (see Plate 6). PET with amyloid ligands is part 
of the leading edge in Alzheimer’s disease research [7] and is fast 
becoming the ‘standard’ in clinical trials for disease modification. 
Hot on the heels of amyloid PET are the tau tracers, several of which 
are now in development [8]. Much international effort is also being 
spent on developing tracers for alpha-synuclein (the aggregated pro- 
tein in Lewy bodies) and TDP-43 (hallmark of both fronto-temporal 
dementia and motor neuron disease) [6] (Fig. 12.9). 





Non-invasive electrophysiology: EEG and MEG 


MEG and EEG provide direct measures of neuronal activity, by 
recording the magnetic or electric fields that propagate outside of 
the head at millisecond-temporal resolution. As with fMRI, there 
are two main modes of use—task-based and resting state, both 
providing a non-invasive window on the brain, at the kind of fast 
subsecond timescales associated with typical cognition. 


How it works 


The different technologies used in MEG and EEG, and their rela- 
tionship with the underlying electrophysiology, result in contrasting 
benefits between the two approaches (summarized in Table 12.1). 


EEG 


In EEG, the electric currents emanating from brain activity are 
measured by electrodes in contact with the scalp surface, with ar- 
rays of up to 256 electrodes typically housed within a fabric cap 
(Fig. 12.4). However, the currents measured at the scalp are limited 
by the distortive effects of the intervening structures in the head, 
which severely hamper efforts to localize the signal sources precisely. 
This forms one of the chief motivations for using the alternative 
technique of MEG, since magnetic fields pass through the dura, the 
skull, and the scalp relatively unaltered. 


MEG 


The magnetic fields produced by the brain are tiny, with field strengths 
of approximately 10 fT. The highly sensitive superconducting quantum 
interference device (SQUID) allows for detection of these tiny fields. 
Sensor arrays provide whole head coverage via a helmet containing up 
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Fig. 12.10 Left: subject wearing a 64-channel (electrode) EEG cap. Right: subject doing a visual stimulation study in an MEG scanner housing 306 
sensors. 
Adapted from Jenkinson M, Chappell M, Introduction to Neuroimaging Analysis, Copyright (2017), with permission from Oxford University Press. 


to 306 sensors, enveloped in cooling liquid helium. The subject's head 
is positioned underneath the helmet (Fig. 12.10). 


What are we measuring? 


In both MEG and EEG, the measured fields come from neuronal ac- 
tivity at the ‘inputs’ of neurons—the membrane potentials (also known 
as the post-synaptic, or local, field potentials), rather than at the ‘out- 
puts’—the action potentials. Furthermore, the activity produced by 
a single neuron is far too small to be detectable; instead signals must 
sum up over tens of thousands of neurons. This requires a population 
of neurons to be both temporally synchronous and spatially aligned, 
with their dendrites pointing in the same directions, such that their 
individual membrane potentials add up, rather than cancel out. 
Fortunately, within the cortex, populations of pyramidal neurons tend 
to fire synchronously and are arranged with their dendrites tending to 
extend in the same direction, perpendicular to the cortical surface. 


Acquiring data 


For EEG to work successfully, care needs to be taken to ensure good 
conductance between the electrodes and the scalp. This is often 
achieved by either soaking electrodes in a conducting fluid prior to 
fitting the cap (a ‘wet’ cap) or using conducting gels placed on each 
electrode. 

MEG does not require contact between the sensors and the scalp; 
instead the subject’s head is placed as close as comfort allows to the 
inner surface of the helmet of the scanner. The magnetic field gen- 
erated by the brain is several orders of magnitude weaker than that 
of ambient electromagnetic noise, requiring scanners to be placed in 
magnetically shielded rooms (MSRs). 

With both EEG and MEG, it is important that the location of the 
sensors with respect to subject’s head is known, as this aids with map- 
ping activity in the brain (source reconstruction) or with comparing 
results across subjects. In EEG, the locations of the electrodes are 
measured, for example by using a stylus pen, the position of which 
can be triangulated in space, with respect to some anatomical land- 
marks (for example, the ears and nose). In MEG, the scanner detects 
small magnetic coils placed at known locations on the subject's head. 
These can also be used to correct for head motion. 


Data analysis 


Both MEG and EEG analyses involve large amounts of data that 
often need substantial computing power to extract interpretable in- 
formation from them. 


Artefact cleaning 


MEG and EEG recordings typically contain electromagnetic arte- 
facts due to movement of the head or eyes (including saccades and 
blinking) and skeletal and cardiac muscle electromagnetic activity. 
To help with identifying artefacts, a combination of surface electro- 
oculography (EOG), infrared eye tracking, and electrocardiography 
(ECG) is often used. These recordings can be used in conjunction 
with techniques like ICA to identify and remove artefactual compo- 
nents in sensor recordings. 


Source reconstruction 


Recorded EEG and MEG data at each sensor location represent dif- 
ferent weighted summations of the underlying brain activity. In many 
cases, this sensor-space description of neuronal activity may be suf- 
ficient to answer a particular neuroscientific question. Indeed, a large 
number of studies operate exclusively at this level (see Plate 7a). 
Mapping the sources of neuronal activity in the brain (source re- 
construction) is often desirable though and requires modelling the 
relationship between activity and measurable fields in the sensors. 
This model uses information about the location and geometry of 
different head structures (for example, scalp, skull, brain tissue) 
through which the fields pass and is inverted to estimate brain ac- 
tivity. However, the inversion is ill-posed, requiring additional con- 
straints such as assuming the brain activity is sparse or smooth. The 
resulting spatial maps of brain activity typically have lower spatial 
resolution than fMRI, in the order of several millimetres (Fig. 12.11). 


Experimental analysis 


As with fMRI, M/EEG data analysis comes in two main flavours: task- 
based and resting state. In task-based M/EEG, the experiments typ- 
ically consist of many repeats of trials. Data are extracted from each 
trial and time-locked to a common experimental event of interest 
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Fig. 12.11 (see Colour Plate section) (a) Sensor space MEG data presented as a two-dimensional (2D) topographic sensor map of contrasted 
oscillatory power (between 8 and 12 Hz) in a visual attention task. Power is lower in the contralateral (attending) hemisphere. (b) The same MEG data 
shown reconstructed into the source (brain) space and presented on a three-dimensional cortical map. (c) Time-frequency plot of oscillatory power 
in MEG sensors above the visual cortex following presentation of a visual stimulus. Changes in power can be resolved at sub-second resolution, with 
increases (synchronization) occurring in some frequencies and reductions (desynchronization) occurring in others. Vertical lines denote stimulus 


onset/offset. 


Reproduced from Practical Neurology, 14(5), Proudfoot M, Woolrich MW, Nobre AC, et al., Magnetoencephalography, pp. 285-285, Copyright (2014), with permission from 


British Medical Journal. 


(for example, the presentation of a visual stimulus; see Plate 1c), and 
then an average response to the event can be computed over trials. 
The average response of raw signals is referred to as an event-related 
field (ERF) in MEG, or an event-related potential (ERP) in EEG. 

An alternative to working with raw signals, and a particular 
strength of M/EEG, is to work with oscillatory brain activity using 
time-varying frequency representations of the data. This describes 
task-locked responses in terms of the amount of oscillatory power 
or phase locking that is present in different frequency bands (see 
Plate 1c). It is thought that oscillatory power indicates the level of 
synchrony in the underlying neuronal populations [9]. 

Resting state M/EEG network matrix analysis, or ICA, often pro- 
ceeds in a similar manner to fMRI [10, 11]. However, more sophisti- 
cated estimates of functional connectivity are needed to account for 
M/EEG being direct measures of neuronal activity. These are often 
based on frequency representations (for example, by looking at cor- 
relation in oscillatory power) or the phase locking of oscillatory 
cycles (for example, coherence) between brain areas. As with fMRI, 
functional connectivity analysis can also be carried out in task data. 


Example applications 


MEG and EEG are fundamental tools in both basic and clinical 
neuroscience research. They can be used to complement the spa- 
tial information available from fMRI, with exquisite temporal 
information. 

Basic neuroscience research with M/EEG has often corresponded 
to the same kind of exploratory brain mapping often carried out with 
fMRI, albeit extended to also include exploration of the time and fre- 
quency domains. Arguably, the most constructive use of M/EEG, 
and arguably functional neuroimaging in general, comes from more 
hypothesis-driven work. For example, one approach is to collapse 
over the spatial dimension and make full use of the temporal infor- 
mation, by asking when in time particular task-related variables can 
be decoded from the M/EEG data (for example, [12]). 

Clinically, EEG and MEG have found important roles in sleep 
disorders and epilepsy. In sleep research, EEG is used routinely in 


sleep laboratories to diagnose sleep disorders [13]. In epilepsy, EEG 
and MEG are used for epileptic source localization, in particular 
for the planning of subsequent surgery. MEG is particularly prom- 
ising in this regard, owing to the increased spatial information it has 
to offer, and shows close correlation to invasive studies of cortical 
activity [14]. 


How imaging is used in psychiatry research 


Neuroimaging is now a mainstay of neuroscience research and 
experimental medicine for mental health diseases and disorders. 
Examples of its use in specific areas are to be found elsewhere in this 
book and will include the following. 


Pathophysiology 


The pathophysiology of most psychiatric diseases and disorders re- 
mains to be fully elucidated, and imaging plays an important role 
in discovering mechanisms and potential targets. For example, 
schizophrenia is characterized structurally by increased ventricular 
volume and subtle grey matter reductions (for example, [15]), and 
fMRI reveals reduced brain activity in frontal areas (for example, 
[16]), although the latter is now regarded to be an oversimplifica- 
tion (for example, [17]). Structural MRI abnormalities, albeit more 
subtle, have also been found in mood disorders (for example, [18]). 
fMRI has also been employed to produce signatures of brain activity 
associated with specific clinical phenomena such as auditory hallu- 
cinations [19] and PTSD-like flashbacks (for example, [20]). 


Drug action and drug development 


Functional imaging, in particular PET and fMRI, has an important 
role to play in investigations of therapeutic drug action and target 
engagement. D2 receptor PET has long been used in receptor oc- 
cupancy studies to estimate the optimal dose of antipsychotics, so 
that they hit the sweet spot between maximum antipsychotic effect 
with minimal motor side effects (for example, [4]). In addition to 
molecular imaging studies, the mechanism of action of antidepres- 
sant drugs has been investigated using fMRI, demonstrating that the 
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neural effects of serotonin reuptake inhibitors are detectable after 
a single dose, despite the therapeutic effect taking 2-4 weeks (for 
example, [21]). In future, molecular imaging is expected to be an 
ideal companion technology, alongside cellular/molecular neuro- 
science, in the development of potential drug targets. In particular, 
radioligands can be co-developed alongside drug targets to act as 
in vivo validation tools and companion biomarkers. This work, in 
model organisms and in humans, has the potential to accelerate 
drug development and to enhance the portfolio of evidence required 
for partnering on a given compound or target. 


Imaging biomarkers 


A common goal of many neuroimaging studies in the context of psy- 
chiatric research is to develop novel biomarkers that will be useful 
for diagnosis, prognosis, stratification of participants for clinical 
trials, and surrogate endpoints for trials. Neuroimaging biomarkers, 
including amyloid PET and structural MRI, are now standard in 
trials for Alzheimer’s disease drugs, and hippocampal volumetry 
is on the verge of being used as standard in memory clinics to aid 
with both earlier diagnosis and differential diagnosis for patients 
with cognitive decline. Neuroimaging is further from being ready 
for standard clinical use in other psychiatric diseases and disorders, 
but inclusion in trials of novel therapeutics for, for example, mood 
disorders is ever more common. 


The future of neuroimaging 


Each of the imaging modalities described in this chapter are active 
areas of research, and new ways of acquiring and analysing data are 
so frequent that a ‘state-of-the-art’ description would be rapidly out- 
of-date. For PET, the main areas of innovation are in developing new 
radiotracers, as well as adapting to new combined PET-MR acqui- 
sitions. For MEG, the analysis methodology is rapidly improving 
and becoming more standardized. The future might involve the use 
of atomic sensors [for example, optically pumped magnetometers 
(OPMs)], which could allow the sensors to be placed much nearer 
to the surface of the head, giving an estimated 5-fold increase in sen- 
sitivity. For MRI acquisition, the innovations include acquiring data 
more quickly, with greater spatial resolution and with an ever greater 
number of contrast mechanisms that improve sensitivity to specific 
pathologies. Some of the most exciting innovations come from com- 
bining data across modalities (within and beyond neuroimaging) and 
from ‘big data’ applications, including scanning much larger popula- 
tions than ever before (for example, [22] and developing systems and 
analysis tools for sharing imaging data across cohorts and studies. 
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The project of linking anatomical features of the brain to distinct 
psychological and cognitive processes has been a core objective of 
cognitive and systems neuroscience for well over a century. A cen- 
tral goal has been the creation of brain maps that chart the localiza- 
tion of function across brain areas and systems [1]. The emphasis on 
building a cognitive anatomy largely defined by ‘place’ (topography) 
is increasingly supplemented by the recognition that cognitive pro- 
cesses are distributed and involve overlapping sets of often remote 
anatomical regions [2-4]. This complementary view emphasizes the 
need to map ‘relations’ (topology) or connectivity across the brain. 
The idea that connectivity is essential to understand how brain 
architecture supports cognitive processing underpins the emerging 
science of brain networks, or connectomics [5-8]. As originally 
defined [5], the connectome is a comprehensive map of the struc- 
tural connections linking different elements of the nervous system. 
These elements may correspond to individual (micro-scale) neurons 
or entire (macro-scale) brain areas. This structural map is funda- 
mental for a mechanistic account of functional brain activity and 
connectivity, with structural connections shaping and constraining 
the response patterns of neurons and brain regions. The study of the 
structural and functional networks of the human brain comprises 
the emerging field of “human connectomics. 

This chapter offers an overview of the core concept of the 
connectome. After a brief outline of the concept's historical roots 
and origins, the chapter proceeds to define and contrast the two 
major modes of structural and functional brain connectivity. This is 
followed by a survey of the principal findings to date on the network 
architecture of the connectome, its role in shaping brain dynamics 
and functional networks, and applications in studies of development 
and brain disorders. 














Mapping the connections linking neurons and brain regions has 
been a central goal of neuroscience for many decades—indeed 
compelling arguments for the importance of connectivity in brain 
function extend back centuries [9, 10]. The fundamental role of 
neuroanatomy in neurobiological accounts of brain function and 





dysfunction was recognized by many nineteenth-century neur- 
ologists and anatomists, among whom were Carl Wernicke and 
Theodor Meynert. Meynert was among the first to clearly articulate 
that anatomical connectivity establishes physical links among other- 
wise remote parts of the brain. Distinguishing tracts of ‘projection 
systems’ and ‘association systems’ as major constituents of the cere- 
bral white matter, he wrote that ‘the wealth of such fibres, and their 
variation in length, connecting as they do near and remote parts of 
the cortex, will suffice, without formulating an anatomical hypoth- 
esis, to unite any one part of the cortex to any other’ (page 150, ref 
[11]). In Meynert’s theoretical framework, disorganized fibre archi- 
tecture was an important factor contributing to brain and mental 
disorders. 

Modern studies of brain connectivity at first proceeded in model 
organisms such as the macaque monkey [12, 13] and other mam- 
malian species. Invasive, but highly sensitive, methods for tracing 
the trajectories and terminations of axonal pathways were deployed 
to chart projections between anatomically distinct and functionally 
specialized brain areas. Comprehensive repositories of individual 
reports on projection patterns were created for monkey [13, 14] 
and cat cortex [15] and were studied extensively in search of ana- 
tomical substrates that could account for, or predict, functional 
specialization [16]. The lack of connectivity data for humans trig- 
gered calls for developing new methods to map the connectivity of 
the human brain [17] and eventually led to a proposal for mapping 
human brain connectivity in its entirety—the human connectome 
[5]. Over the past decade, these calls have been answered through 
the creation of multi-site projects aimed at mapping human brain 
connectivity in large cohorts of human subjects [18], including 
subject samples across the lifespan and from clinical populations. 
The connectome has emerged as a core concept in modern systems 
and cognitive neuroscience, providing a conceptual framework for 
characterizing structure—-function relationships in the healthy and 
dysfunctional brain. 











Most complex systems in the natural, social, and technological 
world are composed of large numbers of interacting elements whose 
collective dynamics gives rise to global system states and determines 
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the evolution of these states through time. To understand the struc- 
ture or function of a complex system, it is often advantageous to de- 
compose the system into a set of nodes and edges, which can then 
be studied with the mathematical toolset of graph theory. This is 
the main idea behind mapping and analysing brain networks, or 
connectomes. There are two main modes of brain networks, derived 
from observations on anatomy (structural connectivity) or physi- 
ology (functional connectivity). The networks derived from meas- 
uring structural and functional connectivity share a set of common 
attributes. Both comprise collections of nodes (elements such as 
neurons or areas) and edges (their interrelations or connections). 
The arrangement of these nodes and edges relative to one another 
defines the network’s topology, which is formally described as a 
graph. Defining nodes and edges is absolutely fundamental for the 
creation of any brain network, and a necessary step for any subse- 
quent quantitative analysis of the structure of the graph. At the level 
of whole-brain human connectome networks, nodes correspond 
to the basic elements of parcellation of the cortical and subcortical 
grey matter into a set of areas. Parcellation schemes can be derived, 
based on a number of anatomical or functional criteria such as cyto- 
architectonics, myelination patterns, gene expression profiles, or 
patterns of anatomical or functional connectivity [19]. The edges 
linking the nodes record their anatomical or physiological relations 
and, depending on the nature of the specific measure and how it is 
recorded, may be binary (0/1) or weighted, symmetric or directed. 

Structural connectivity defines networks that record the pres- 
ence/absence, as well as the strength or density, of physical links be- 
tween neurons, neuronal populations, or brain areas. The totality of 
structural connections constitutes the connectome, as originally de- 
fined [5]. Human connectome networks derived from non-invasive 
imaging methodology comprise interregional projections that tra- 
verse the brain’s white matter and connect cortical and subcortical 
regions. Because of fundamental limitations, these imaging ap- 
proaches cannot resolve the directionality of projections and instead 
deliver measures of undirected connectivity. While most current 
accounts of human connectomes express connection weights solely 
in terms of the density or number of streamlines, it should be noted 
that the relationship of such simple measures to the true anatomical 
magnitude of a pathway is unclear [20] and that other indices that 
express connectional microstructure should be taken into account. 
White matter architecture can show signs of neuroplasticity on 
timescales of days to weeks [21] and exhibits characteristic changes 
in the course of development and ageing. 

Functional connectivity differs from structural connectivity in 
several important respects. Firstly, functional connectivity is gener- 
ally derived from neuronal or haemodynamic time series data and 
expresses patterns of statistical dependence among these time series 
[22]. Neuronal time series can be recorded with a broad array of 
techniques, from invasive multi-electrode recordings to whole brain 
electrophysiology or fMRI. Analytic techniques for extracting stat- 
istical dependence include information theoretic measures (such as 
mutual information or transfer entropy), spectral coherence, phase 
locking, and simple Pearson cross-correlation. The latter is currently 
widely used in human studies carried out with fMRI that track fluc- 
tuations in haemodynamic signals, either in response to specific task 
conditions or in a task-free ‘resting state. Unlike structural connect- 
ivity, patterns of functional connectivity can exhibit significant re- 
organizations over short timescales, as interactions among neurons 


and brain areas are continually modulated, reflecting changes in 
sensory inputs and tasks. Measurement of functional connectivity, 
especially with fMRI, is prone to numerous sources of physiological 
noise, including involuntary head motion which can introduce sys- 
tematic biases into functional connectivity estimates. 

Once nodes and edges have been defined and pairwise relations 
among all system elements have been summarized in a graph or 
network, a large range of analysis and modelling tools from graph 
theory can be deployed to characterize or compare network archi- 
tectures [23-25] (Fig. 13.1). Broadly, these measures can be divided 
into several classes that capture aspects of segregation, integration, 
and influence. Measures of segregation capture the extent to which 
the network is clustered and can be subdivided into separate net- 
work communities or modules. Examples are the clustering coeffi- 
cient, network motifs, and methods for detecting network modules. 
The latter are particularly important, as they deliver putative struc- 
tural and/or functional building blocks that have been the focus 
of much interest and research in cognitive neuroscience [26]. 
Measures of integration express the degree to which the network is 
globally interactive. A fundamental concept is that of path length, 
generally defined as the minimal distance (in terms of topology, 
that is the number of steps in a network, which may be unrelated to 
metric separation) between network nodes. In structural networks, 
the shorter the path length, the more directly two nodes can com- 
municate and exchange information. Across an entire network, this 
capacity for direct communication is expressed as the network's 
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Fig. 13.1 Elementary graph concepts and measures, illustrated on a 
schematic illustration of a simple network. (a) The network shown here 
consists of 12 nodes linked by undirected and unweighted (binary) edges. 
(b) A low-degree node (maintaining a single edge) and a high-degree 
node (maintaining five edges) are indicated. Nodes A and B are linked by 
a minimally short path comprising three edges or steps. (c) The network 
is shown with a partition into two modules. The node labelled ‘connector 
hub’ maintains edges with its own module, as well as across module 
boundaries. The node labelled ‘provincial hub’, while highly connected, 
only links to nodes in its own community. 








global efficiency. Measures of influence are useful for detecting 
those network elements that occupy central positions in the top- 
ology and may therefore be particularly important for the func- 
tioning of the system. Important nodes are, somewhat informally, 
designated as network hubs. Some measures of influence rely on 
local features of a network element (such as the node degree, that is 
the total number of distinct connections that each node maintains), 
while others take into account the global web of communication 
paths across the network (such as betweenness, that is the fraction 
of optimally short communication paths to which each network 
elements contributes). Another way to define influence and detect 
hubs is by evaluating the position of a node or edge relative to net- 
work modules. For example, highly connected nodes whose con- 
nections span multiple modules (and thus form bridges between 
them) are considered connector hubs that cross-link different com- 
munities. In contrast, highly connected nodes whose connections 
remain predominantly within one module are considered provin- 
cial hubs, with potentially important roles in linking members of a 
single community to each other. 


The network architecture of the connectome 


Connectome data can be assembled from a very broad range of 
techniques and from virtually any species with a nervous system. 
Important achievements to date have included the complete recon- 
struction of the synaptic network of the brain of Caenorhabditis 
elegans [27] as well as the creation of network maps of large portions 
of the central nervous system of the mouse [28] and rat [29]. 


CHAPTER 13 Theconnectome 


Most studies of the structural connectome of the human brain 
have utilized non-invasive imaging methodology and computa- 
tional reconstruction to infer the spatial arrangement and trajec- 
tories of white matter interregional projections [30, 31]. These 
trajectories are represented as ‘streamlines’ that comprise putative 
white matter tracts. The complete set of these tracts, in combination 
with parcellation of the grey matter, forms a network of nodes and 
edges that can be quantitatively analysed with measures and tools 
from graph theory (Fig. 13.2). While current diffusion imaging and 
tractography approaches have significant limitations and pitfalls 
[32], continual refinements are made to better capture complex fibre 
architecture [33] and develop more principled methods for model- 
based connectome inference [34]. Despite these ongoing develop- 
ments, several core findings on human connectome topology have 
proven to be both robust and reproducible. These include unique 
‘connectivity fingerprints’ of brain areas, a broad distribution of 
node degrees indicating the presence of network hubs, dense con- 
nectivity among hubs linking them into a connective core or ‘rich 
club, and distinct network communities or modules. 

Connectivity fingerprints refer to the patterns of inputs and out- 
puts (indistinguishable with diffusion imaging) maintained by each 
network node [35]. It is thought that these fingerprints are important 
for defining the functional specialization of nodes, as they determine 
the neuronal information to which each node has access, as well as 
the downstream targets to which its output signals are distributed. 
In all connectome studies carried out so far, anatomically distinct 
areas have been found to maintain a characteristic and unique con- 
nectivity fingerprint. Connectivity fingerprints are more similar 
among brain regions that contribute to similar functional domains 





Fig. 13.2 (see Colour Plate section) Different representations of the connectome. (a) A set of streamlines, derived from diffusion imaging and 
computational tractography. Red, green, and blue lines indicate putative white matter tracts running along the medial-lateral, anterior-posterior, 

and dorsal-ventral directions, respectively. (b) Nodes derived from a cortical parcellation are shown as red dots, connected by edges (blue lines) that 
correspond to the density of streamlines linking each pair of nodes. For clarity, the diagram shows the strongest edges only. (c) Nodes are arranged 

in the same anatomical positions as in panel (b), with blue-coloured nodes indicating high betweenness centrality, a measure of influence that is 
computed as the number of optimally short communication paths to which each node contributes. Node diameter is proportional to the number of 


subjects (0-5), for which a given node received high betweenness scores. 








Adapted from PLOS Biol., 6, Hagmann P, Cammoun L, Gigandet X, et al., Mapping the structural core of human cerebral cortex, e159, Copyright (2008), PLOS Biology. 


Reproduced under the Creative Commons Attribution License CC BY 4.0 
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or cognitive processes; conversely, distinct fingerprints can be taken 
as indications of functional specialization [36]. Connectivity fin- 
gerprints can be used for the purpose of parcellation, defining the 
boundaries between distinct brain areas that maintain distinct pat- 
terns of connectivity [19]. 

Network hubs may be identified on the basis of node degree (that 
is, the number of connections per node) or other measures of in- 
fluence. The distribution of node degrees in human connectome 
studies has invariably been reported as heavy-tailed, that is, strongly 
skewed towards nodes with low degree. Put differently, a larger 
number of nodes have relatively few distinct pathways, while a 
smaller number of nodes maintain a larger set of distinct projec- 
tions. Broad degree distributions, assuming a log-normal shape, 
have also been described in tract tracing studies in non-human pri- 
mates [37]. In the human brain, hub regions have been identified 
in parts of the orbitofrontal, lateral prefrontal, superior frontal, cin- 
gulate, and medial parietal cortex [38], with most of these regions 
previously classified as multimodal or transmodal, based on their 
diverse functional responses. 

A number of human connectome studies have shown that highly 
connected hubs are also densely connected to each other, more so 
than expected by chance. This aspect of network topology, called a 
‘connective core’ or a ‘rich club’ [39], suggests that hubs are pref- 
erentially connected to allow direct communication and sharing of 
information. Indeed, graph analysis demonstrates that a very large 
proportion of all short communication paths across the human 
connectome access some portion of the brain’s rich club and that 
damage to rich club nodes or edges has the potential to dispropor- 
tionately disrupt patterns of interregional information transfer [40]. 
An attractive hypothesis suggests that rich club organization offers 
a structural substrate for integrative processes that may underpin 
high-level conscious processing. In support of this notion, rich club 
connectivity has been found to interlink the set of widely distributed 
resting state networks that form the basic building blocks of func- 
tional connectivity [41]. Connections among rich club members 
may thus play important roles in the exchange of neuronal informa- 
tion across different sensory and task domains. 

Human connectome networks exhibit high clustering, with dense 
structural connections that define local communities or modules. 
These modules comprise nodes that tend to share not only dense 
interconnections, but also common sets of inputs and outputs, as 
well as common physiological responses and co-activation pat- 
terns. The concept of modularity has been especially important in 
studies of human functional networks. A large body of work has 
examined the topography and specialization of modules corres- 
ponding to resting state or intrinsic functional networks [42, 43]. 
These modules can be derived with a variety of decomposition or 
clustering techniques. Network-based detection of modules often 
utilizes an approach known as ‘modularity optimization’ [26], which 
attempts to maximize a quality metric that is designed to optimally 
partition a network into blocks that are internally densely and ex- 
ternally weakly connected. Partitioning of human resting state func- 
tional connectivity has resulted in maps of components/modules 
that are both highly reproducible across different subject cohorts 
and imaging parameters [44, 45], as well as sensitive to individual 
variations, including developmental stages and clinical status [46]. 
Importantly, network modules derived from fMRI time series re- 
corded in the resting state are highly similar to the spatial patterns of 


co-activation networks derived from task-evoked activations across 
large numbers of imaging studies [47, 48]. Hence, it appears that the 
architecture of functional networks at rest recapitulates or rehearses 
spatial patterns of co-activation that are engaged as the brain is chal- 
lenged in different task contexts. 


From the connectome to brain dynamics 


The stability and reproducibility of resting state functional net- 
works strongly suggest that these networks have an anatomical 
origin. Three lines of experimental evidence suggest that this is in- 
deed the case. Firstly, systematic comparison of patterns of struc- 
tural and functional connectivity recorded in non-human primates 
[49, 50], as well as in the human brain [51, 52], has shown that they 
are statistically strongly related. For example, among anatomic- 
ally directly connected node pairs, the strength of the anatomical 
projection is partly predictive of the strength of the corresponding 
functional connection [50, 51]. Secondly, studies of specific resting 
state functional networks have shown that their constituent compo- 
nents are interconnected by anatomical projections. For example, 
the anterior and posterior divisions of the default mode network 
are directly linked by white matter projections running along the 
cortical midline, and variations in these anatomical projections are 
correlated with variations in functional connectivity [53]. Thirdly, 
in rare instances, it has been possible to observe changes in func- 
tional networks immediately after experimentally induced dis- 
turbances of anatomical connections. For example, transection of 
interhemispheric cortical pathways in a human patient undergoing 
surgery for epilepsy resulted in functional disconnection of the two 
cortical hemispheres [54]. 

Important insights into the relationship between structural and 
functional connectivity have come from computational studies 
of connectome networks (Fig. 13.3). Simulations of neural mass 
models using a structural coupling matrix, based on the macro-scale 
connectome of the macaque cerebral cortex, generated functional 
connectivity that was significantly correlated with the structure of 
empirical functional networks [50, 55]. Similar results were obtained 
in simulations of the human connectome [51, 56], including the 
topography of resting state networks. Additionally, computational 
studies have demonstrated a number of important dynamic prop- 
erties of functional connectivity, including temporal fluctuations 
in network topology [55, 57], as well as the important roles of con- 
duction delays and noise in shaping resting brain functional con- 
nectivity [56, 58]. While some research agendas aim to create more 
realistic and complex neuronal simulations of functional brain dy- 
namics, other studies have attempted to model functional connect- 
ivity on the basis of simple dynamic processes such as diffusion [59] 
and spreading [60]. One example of this approach utilized a measure 
of ‘searchability, derived from studies of network navigability and 
search, to successfully predict the strength and pattern of functional 
connectivity in the human brain [61]. What virtually all computa- 
tional studies have shown is that functional connectivity results (at 
least in large part) from a complex combination of many (direct and 
indirect) communication events and dynamic influences that travel 
across the structural connectome. 

While most fMRI studies of functional connectivity in the resting 
state have examined the structure of functional networks that were 
estimated over relatively long recording sessions (lasting in the 
order of 5-10 minutes), recent work has focused on fluctuations 
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Fig. 13.3 (see Colour Plate section) Connectome-based computational models of functional connectivity. (a) The matrix at the top shows structural 
connectivity (SC) of directed projections among a set of macaque cortical areas. This connectome formed the coupling structure for a simulation of 
neural mass dynamics that generated synthetic fMRI time courses and a ‘modelled FC’ (functional connectivity) matrix (lower plot, right half). This 
computational analogue of macaque FC can be compared to empirical recordings of resting-state fMRI. The correlation between the empirical and 
modelled FC patterns is R = 0.55 [54]. (b) Human SC matrix (from [30]) and empirical FC (from [30]), as well as predicted FC (from [61]). The predicted 
FC was computed from an analytic model based on measures of communication in the structural graph SC. The correlation between the empirical and 


predicted FC patterns is R = 0.60 [61]. 


in functional connectivity that occur on shorter timescales (in the 
order of tens of seconds) [62]. These fluctuations can be detected by 
tracking individual functional connections over time [63] or by ex- 
tracting and clustering network states [64]. Both approaches require 
the formulation of appropriate statistical null models to exclude 
spurious detection of fluctuations that are due to measurement arte- 
fact or noise. While much of the work in this area is still ongoing, 
a convergent set of findings suggests that resting brain dynamics 
seems to reflect ongoing transitions of functional networks between 
states of high segregation and high integration [65]. Statistically sig- 
nificant fluctuations coalesce into consistent network patterns with 
high modularity, characterized by a de-coupling of task-positive 
from task-negative (default mode) networks [66]. Much of the ef- 
forts to characterize dynamic changes in functional connectivity are 
motivated by the possibility that such changes may offer a novel bio- 
marker for clinical disorders. 

Another important emerging area is the study of functional con- 
nectivity in relation to task and cognitive function [43, 67]. Several 
studies have shown that functional networks are rapidly reconfig- 
ured as the brain engages in different tasks, including in continuous 
sensorimotor tasks [68] or in switching between continuous mental 
operation such as working memory, arithmetic, or navigation [69]. 
A comparison of resting state and task-evoked functional connect- 
ivity estimated across a battery of different tasks [70] has shown 


the presence of an intrinsic functional connectivity state that per- 
sists across tasks and that is highly similar to functional connect- 
ivity emerging in the resting state. Task-evoked networks emerge on 
top of this intrinsic state and are associated with specific patterns of 
modulated functional connections. A different study showed that 
transitions between different cognitive tasks were accompanied 
by transitions between distinct functional connectivity states [71]. 
Using machine-learning approaches, these functional connect- 
ivity states could be used to infer cognitive operations in individual 
subjects. 





Development and clinical disorders 


Important applications of the connectome include the character- 
ization of changes in connectome topology across developmental 
stages and in clinical populations. These studies pose a number 
of methodological and analytic challenges. While many study de- 
signs rely on comparison of group-averaged connectivity data sets, 
the heterogeneity of brain structure and function across individ- 
uals may be more appropriately captured by looking at longitudinal 
changes in individuals across development or by employing dimen- 
sional designs that can associate continuous variation in behavioural 
and/or cognitive phenotypes with network markers. In addition to 
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issues of study design, developmental or clinical subject cohorts can 
present issues related to differential head motion and the associated 
measurement artefacts. 

Numerous studies have shown that structural and functional net- 
works exhibit characteristic changes across the human lifespan. For 
example, an early study carried out on participants ranging between 
the ages of 2 and 18 years old showed that human connectome net- 
works increase in density and become more efficient in terms of 
communication paths and progressively less modular (less highly 
segregated) with age [72]. These trends mirror similar tendencies in 
functional networks that were observed across a similar age range 
[73], with functional modules becoming less determined by spatial 
clustering and more coherent and integrated across remote regions 
of the brain. Looking across the human lifespan [74], several struc- 
tural connectome metrics related to network density and efficiency 
exhibit inverted U-shaped time courses, with early increases fol- 
lowed by decreases later in life. Rich club organization was present 
across the entire lifespan, with the most pronounced expression 
in early adulthood [75]. In parallel with these evolving structural 
network features, resting state functional networks exhibit develop- 
mental trajectories of their own, for example, by becoming internally 
less strongly coupled, while developing stronger functional links be- 
tween networks, especially among components of the somatomotor 
network, the dorsal attention network, and the saliency/ventral at- 
tention network [76]. Ongoing lifespan connectomics efforts are ex- 
tending the range of these observations to neonates and the elderly 
population. 

The fundamental rationale behind connectomics-based ap- 
proaches to brain disorders is 2-fold. Firstly, most of these conditions 
cannotbetraced to highly localized pathologies but are instead associ- 
ated with disturbances of network organization across extended parts 
of the brain [77, 78]. Secondly, connectomics offers an intermediate 
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phenotype, based on structural and functional circuits and connect- 
ivity that is interspersed between the domains of genetics and mo- 
lecular interactions at the lower end and human behaviour and social 
networks at the upper end of the scale [79] (Fig. 13.4). Structural 
brain networks not only allow information to become shared and in- 
tegrated in the course of healthy brain function, but they also pro- 
mote the spreading of disease processes. Examples are distributed 
functional changes resulting from brain damage due to an ischaemic 
attack [80], the progressive spread of degenerative conditions such 
as Alzheimer’s disease along anatomical pathways [81], or the global 
effects such as generalized seizures that are triggered by focal epi- 
leptogenic activity [82]. Several classic clinical concepts, among them 
diaschisis and cognitive reserve, may be reconceptualized in the con- 
text of brain networks and the connectome. Diaschisis [83], which 
implies functional disruption that occurs at a distance, for example 
after a stroke, may reflect the brain’s response to acute disturbances 
of connectome topology. For example, network approaches pre- 
dict that the loss of a set of nodes and edges may be accompanied 
by altered functional connectivity among remote (including contra- 
lateral) brain regions and systems [84]. The extent of this reorganiza- 
tion would likely depend on the topological status of the damaged 
nodes, with damage to more central nodes (network hubs) causing 
more severe and widespread disruption. Cognitive reserve [85], or 
the capacity of the brain to counter impairment with compensatory 
strategies, may depend on the network's ability to provide alternative 
functional configurations or paths to counter the loss of nodes and 
edges. This ability would allow the brain to configure alternative sets 
of nodes and edges to carry out equivalent (‘degenerate’ [86, 87]) cog- 
nitive or behavioural processes. 

A common theme in many clinical and translational applications 
of the connectome has been the role of important network elem- 
ents in the origin and progression of brain disorders. Network hubs 
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Fig. 13.4 Brain phenotypes are arranged in a hierarchy spanning several levels of organization, ranging from molecular processes to behaviour and 
social environment. Genetic and environmental factors contributing to brain disorders converge onto the connectome which occupies the position of 


an intermediate phenotype that bridges molecular and behavioural scales. 


Adapted from Br J Psychiatry, 194(4), Bullmore ET, Fletcher P, Jones PB, Why psychiatry can't afford to be neurophobic, pp. 293-295, Copyright (2009), with permission from 


Royal College of Psychiatrists. 


have caught significant attention, as they appear to play a causal role 
across a wide range of brain disorders. A meta-analysis of data from 
more than 20,000 subjects and 26 different disorders [88] has shown 
that many of these disorders were associated with hub-centric 
neuropathology, that is with lesions that predominantly involved 
topologically central elements of the hum an connectome. For ex- 
ample, hubs in the medial temporal lobe are among the first brain 
regions to exhibit pathological changes in Alzheimer’s disease [89], 
while frontal and temporal cortical hubs are implicated in a number 
of studies of schizophrenia [90]. It appears that the existence of net- 
work hubs in the connectome offers unique functional advantages, 
while also creating points of vulnerability—the benefits of hubs in 
information integration turn into risk factors, as their disruption 
has disproportionate effects on the individual's cognitive and behav- 
ioural capacities. 


The connectome refers to a comprehensive map of connections 
among elements of a neural system. In studies of the human 
brain, connectomics has so far largely focused on the macro-scale 
projections among distinct brain areas and their role in shaping 
functional connectivity during both rest and task-evoked activity. 
Computational models of the connectome have been instru- 
mental in elucidating potential network mechanisms that create 
functional connectivity from communication events in structural 
networks. Numerous applications of the connectome in devel- 
opmental and clinical studies have shown patterns of change in 
connectome topology across the lifespan, as well as the network 
basis of brain disorders. Future work will likely further illuminate 
the role of the connectome in the functioning of the healthy and 
diseased brain. 
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The concept of signalling from one neuron to another began to 
form by the end of the nineteenth century when it was recognized 
from the histological work of Golgi and Ramon y Cajal that nerve 
cells were discrete entities and linked by specialized contacts, for 
which Sherrington coined the term ‘synapse’ It took another half 
century for scientists to finally agree that information passes be- 
tween neurons principally through the movement across synapses 
of chemicals, and not electrical current. Today changes in chemical 
transmission at brain synapses are accepted as being key to both suc- 
cessful drug treatment and the cause of many forms of psychiatric 
illness. This chapter focuses on fundamental aspects of chemical 
transmission and describes some recent advances relevant to psych- 
iatry that may indicate the direction of future research. 

Seminal work on the autonomic nervous system and glandular 
secretions around the turn of the twentieth century was conducted 
by pioneering physiologists like John Langley and his student 
Thomas Elliott. It culminated in studies in the 1920s by Otto Loewi 
and then Henry Dale who identified what is often cited as the first 
neurotransmitter—acetylcholine—the chemical that was released 
on stimulation of the vagus nerve to inhibit the heart. Subsequently, 
in 1946, Ulf von Euler reported the identification of the mono- 
amine noradrenaline as the other key autonomic transmitter [1, 2]. 
Today evidence suggests that, in the brain (and peripheral nervous 
system), there are many tens, if not hundreds, of molecules that are 
involved in chemical transmission at synapses. These molecules in- 
clude acetylcholine and monoamines [noradrenaline, dopamine, 
and 5-hydroxytryptamine (5-HT; serotonin)], certain amino acids 
(especially glutamate and GABA), and peptides, as well as specific 
purines, trophic factors, inflammatory mediators (chemokines and 
cytokines), lipid-like agents (endocannabinoids), and even gases 
[nitric oxide (NO)]. Examples of molecules that serve neurotrans- 
mitter functions in the brain are listed in Table 14.1. This list is not 
exhaustive, and more are likely to be discovered. 














Typically, a molecule is classified as a neurotransmitter if it is local- 
ized in neurons, released from nerve terminals (and often soma and 





dendrites as well) on membrane depolarization, and exerts physio- 
logical and molecular effects through acting on postsynaptic recep- 
tors. However, the degree to which a particular molecule satisfies 
these criteria may vary. For example, the term ‘neurotransmitter’ 
was once commonly used to define those molecules that exert fast 
synaptic effects (glutamate, GABA), whereas molecules that ex- 
erted slower synaptic effects were often termed ‘neuromodulators’ 
(for example, monoamines and peptides). These distinctions are 
less useful today (and will not be used here) because it is recog- 
nized that many transmitter molecules are capable of exerting both 
fast and slow synaptic effects. For example, both glutamate and 


Table 14.1 Examples of neurotransmitters in the brain 


Amines Dopamine 

Noradrenaline 
5-hydroxytryptamine (5-HT) 
Histamine 

Melatonin 


Acetylcholine 


Amino acids y-aminobutyric acid (GABA) 
Glutamate 


Glycine 


Substance P 

Leu- and met-enkephalin 
Galanin 

Orexin 


Neuropeptides 


Adenosine 
Adenosine triphosphate (ATP) 


Purines 


Neurotrophic factors* Neurotrophins (for example, BDNF NGF) 
p P p 
Insulin-like growth factor (IGF) 


Vascular endothelial growth factor (VEGF) 


Cytokines’ Interleukin-1 (IL-1) 
Tumour necrosis factor a (TNFa) 

Chemokines’ CC chemokines [for example, interleukin-8 (IL-8)] 
CXC chemokines 

Endocannabinoids* Anandamide 


2-arachidonyl-glycerol (2-AG) 


Nitric oxide (NO) 
Carbon monoxide (CO) 


Gases* 


* Putative class of neurotransmitters. 
* Retrograde messengers. 


GABA exert fast and slow synaptic effects, depending on which 
of their receptors they interact with. Moreover, in contrast to the 
classical view of neurotransmission in which information passes 
from the presynaptic to the postsynaptic neuron in an ‘anterograde’ 
direction, it is now recognized that certain molecules transfer in- 
formation at a synapse in a ‘retrograde’ direction. In this case, the 
molecules are located in the postsynaptic neuron, and, when their 
synthesis is activated, the molecules diffuse back across the synapse 
to act presynaptically. 

The general principles of chemical transmission at central syn- 
apses are similar for most neurotransmitter molecules. The differ- 
ences in the detail will be illustrated by transmission mechanisms for 






Dendritic 


; release 
Somatic 


release 





Dendritic 






peptide 
synthesis 
2 High 
Low frequency 
frequency or burst firing 


— Synapse 


density 


Classic 
transmitter <SS 
RAN 
Low frequency: High frequency 
* burst firing 






CHAPTER 14 Neurotransmitters and signalling 


small neurotransmitters such as monoamines and amino acids, com- 
pared to larger neurotransmitters such as peptides (Fig. 14.1) [3]. 


Small neurotransmitters 


Typically, small neurotransmitter molecules are synthesized at the 
nerve terminal by one or a few enzymatic steps, and then pack- 
aged in small membrane-bound vesicles via vesicular transporters 
(proton-coupled), prior to release into the synapse. The latter is 
triggered via a calcium-dependent mechanism on the arrival of a 
depolarizing action potential; this release process, referred to as 








© Synaptic vesicle with classic transmitter 
Sey GPCR for peptide or classic tranmitter 
a Transporter for classic transmitter 


o lonotropic receptor for classic transmitter 


Fig. 14.1 Summary of the principal steps involved in chemical neurotransmission at CNS synapses. Neuropeptides are synthesized in the cell 
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body and then packaged in large, dense core vesicles (LDCVs) that are transported into axons and dendrites (1). Small ‘classic’ neurotransmitters 

for example, monoamines and amino acids) are synthesized at the nerve terminal and stored in synaptic vesicles, and released into the synaptic 

cleft. Evidence for the co-release of small transmitters is now clear (2). LDCVs contain proteolytic enzymes (convertases) that generate the active 
neuropeptide from the precursor. Neurotransmitter receptors are either of the G protein-coupled (metabotropic) or ligand-gated ion channel 
ionotropic) type and are present on cell soma, dendrites, axons, and nerve endings (1, 4). The small neurotransmitters are released during low- and 
high-frequency firing, whereas neuropeptides are preferentially released under burst or high-frequency firing (2-4). Small transmitters have reuptake 
mechanisms (transporters) at both the plasma membrane and the vesicle membrane (5), which terminate neurotransmitter action and allow recycling 
4). In contrast, neuropeptides are broken down by extracellular peptidases (6), and replacement occurs via axonal transport. Glial cells can express 
neurotransmitter receptors and transporters (7). Receptors are trafficked to and from the cell membrane by G protein-interacting proteins (8). 
Reproduced from The Lancet Neurology, 2(8), Hékfelt T, Bartfai T, Bloom F, Neuropeptides: Opportunities for drug discovery, pp. 463-72, Copyright (2003), with permission 
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exocytosis, involves a complex machinery of 20-30 presynaptic 
proteins. After release, the neurotransmitter diffuses across the 
synapse to interact with specific receptors localized on the mem- 
brane of the postsynaptic neuron to trigger electrical and/or bio- 
chemical changes. Small neurotransmitters are also released from 
the soma and dendrites of neurons, one purpose being to interact 
with presynaptic receptors that signal negative feedback to the 
neuron; these receptors are often referred to as autoreceptors, and 
they are usually also located on the nerve terminals. 

Typically, once released, the small neurotransmitters are select- 
ively taken up by another type of transporter (sodium-coupled) 
located in the plasma membrane of the nerve terminal or neigh- 
bouring cells (neurons or glial cells). This transport helps terminate 
transmission at the postsynaptic receptor, maintains low extracel- 
lular levels of the transmitter, and allows its reuse by the neuron. 
Transport into the nerve terminal also presents the transmitter to 
catabolic enzymes, monoamine oxidase in the case of monoamine 
transmitters, to generate biologically inactive metabolites. Rather 
differently, acetylcholine is broken down in the synapse by the en- 
zyme acetylcholinesterase, and then the metabolic product choline 
is transported back into the nerve terminal, allowing resynthesis to 
acetylcholine. 





Neurotransmitter transporters 


Advances in cloning technology have led to new discoveries re- 
garding the structural and pharmacological identity of transporters 
located on the plasma membrane, as well as vesicular transporters 
located inside the nerve terminal [4]. The latter transporters con- 
centrate transmitters in synaptic vesicles prior to release, and play a 
key role in determining the neurotransmitter phenotype of a neuron 
[5]. A summary of plasma membrane and vesicular transporters is 
given in Table 14.2. 


Table 14.2 Examples of neurotransmitter transporters 


Neurotransmitter Transporter 


Plasma membrane transporters 


Dopamine DAT 
Noradrenaline NET 
5-HT SERT 
GABA GAT-1 
GAT-2 
GAT-3 
BGT-1 (primarily in kidney) 
Glutamate EAAT-1 (GLAST1) 
EAAT-2 (GLT-1) 
EAAT-3 (EAAC1) 
EAAT-4 
EAAT-5 
Glycine GLYT-1 
GLYT-2 
Acetylcholine (choline) CHT 
Vesicular transporters 
Monoamines (dopamine, VMAT1 
noradrenaline, 5-HT, histamine) VMAT2 
GABA VGAT 
Glutamate VGLUT1 
VGLUT2 
VGLUT3 
Acetylcholine VAchT 


Plasma membrane transporters 


The solute carrier (SLC) superfamily of transporters is the second 
largest family of membrane proteins in the human genome (ap- 
proximately 52 families, around 400 individual members), after G 
protein-coupled receptors (GPCRs). These transporters selectively 
transport into and out of cells a large diversity of solutes, ranging 
from inorganic ions to amino acids and more complex molecules 
like haem. 

The SLC6 family comprises four subfamilies that form high- 
affinity and selective transporters for GABA, glycine, and neutral 
amino acids, as well as the monoamines dopamine (DAT), nor- 
adrenaline (NET), and 5-HT (SERT). The latter three transporters 
have been identified and sequenced, and investigated in detail at 
the molecular level. Their distribution within the brain closely 
follows the pattern of innervation of their respective monoamine 
transmitter. These transporters play a key role in the homeostatic 
control of monoamine transmission, as is evident in the striking ef- 
fects on monoamine function when these transporters are genetic- 
ally knocked out [6]. These transporters are also the site of action 
of many important psychotropic drugs, including psychostimulants 
such as amphetamines and cocaine, tricyclic antidepressant drugs, 
and SSRIs [4]. The drug-binding site and the precise molecular 
mechanism of transporter inhibition have been revealed at a high 
level of structural resolution [7, 8]. 

When monoamines are in excess at the synapse and extra-synaptic 
areas, their clearance may also be facilitated by the low-affinity, non- 
selective organic cation transporters OCT 1-3 and PMAT [9, 10]. In 
support of this idea, the regional brain distribution of OCT3 and 
PMAT overlaps largely with that for SERT, NET, and DAT. 

Molecular cloning techniques have uncovered genes that generate 
four highly homologous, but pharmacologically distinct, trans- 
porters for the inhibitory neurotransmitter GABA: GAT-1, GAT-2, 
GAT-3, and BTG-1 [6]. GAT-1 and GAT-3 are the most abundant 
and preferentially expressed by neurons and glial cells, respectively, 
whereas GAT-2 and BTG-1 are expressed extra-synaptically by both 
cell types. Although their significance is yet to be fully understood, 
these transporters display overlapping, but different, expression pat- 
terns in the CNS, suggesting distinct functional roles. The anticon- 
vulsant effect of tiagabine is likely mediated by blockade of GAT-1, 
and there is much scope for new GABA uptake inhibitors of as yet 
unclear utility. 

Glycine, another inhibitory amino acid transmitter, also has spe- 
cific transporters located preferentially on the plasma membranes 
of glial cells in the forebrain (GLYT1) and neurons of the hindbrain 
and spinal cord (GLYT2). Interestingly, glycine is a positive allo- 
steric co-modulator of glutamate NMDA receptors. Therefore, dis- 
ruption of GLYT1 glycine transport blockade may offer a means to 
facilitate the functioning of the NMDA receptor without incurring 
excitotoxic effects [4]. The antipsychotic potential of glycine trans- 
port inhibitors has been under investigation, because this action is 
associated with procognitive effects and the symptoms of schizo- 
phrenia appear linked to low NMDA receptor function [11]. GLYT2 
is expressed by glycinergic neurons and thought to be important for 
the delivery of glycine into nerve terminals for loading into synaptic 
vesicles. 

Four transporters for the excitatory amino acid neurotrans- 
mitter glutamate have been cloned: EAAT1 (excitatory amino 


acid transporter 1; synonym GLAST), EAAT2 (GLT1), EAAT3 
(EAAC1), EAAT4, and EAATS5 [5]. These transporters are located 
on both neurons (predominantly EAAT3/4/5) and glial cells (pre- 
dominantly EAAT1/2) and serve to maintain low extracellular con- 
centrations of glutamate, as well as provide a source of intracellular 
glutamate for metabolism. Pharmacological blockade of glutamate 
transport may lead to cognition enhancement and other poten- 
tially useful therapeutic effects but holds the risk of excitotoxicity, 
whereas pharmacologically enhanced EAAT expression appears to 
be neuroprotective [12]. EAAT inhibitors are currently in develop- 
ment, but few brain penetrant or selective agents have been identi- 
fied thus far. 


Vesicular transporters 


Vesicular transporters facilitate the movement of neurotrans- 
mitters from the cytoplasm to be concentrated within synaptic 
vesicles. This transport is often driven by a proton gradient es- 
tablished by an adenosine triphosphate (ATP)-dependent proton 
pump that acidifies the secretory vesicles. There are two hom- 
ologous vesicular monoamine transporters—VMAT2 is present 
in central and peripheral neurons and transports dopamine, 
noradrenaline, and 5-HT, as well as histamine, into vesicles, 
whereas VMAT1 is an integral protein in the membrane of secre- 
tory vesicles of peripheral neuroendocrine and endocrine cells. 
Reserpine is a blocker of VMAT and causes depletion of mono- 
amines, and the drug’s tranquillizer effects are directly linked to 
this action. Acetylcholine is loaded into synaptic vesicles by a dis- 
tinct transporter, VAchT. 

Three homologous vesicular transporters for glutamate 
VGLUT1, VGLUT2, and VGLUT3 have been identified and char- 
acterized. While all possess similar molecular properties, they have 
differential expression patterns, with VGLUT1 arising predomin- 
antly from cortical neurons, VGLUT2 from subcortical neurons, 
and VGLUT3 largely from midbrain neurons [5]. The vesicular 
GABA transporter is VGAT (also termed vesicular inhibitory 
amino acid transporter, VIAAT), and this transports GABA or gly- 
cine into synaptic vesicles. Loss of VGAT causes a drastic reduc- 
tion in the release of not only GABA, but also glycine, indicating 
that glycinergic neurons do not express a separate vesicular trans- 
porter for glycine. The vesicular nucleotide transporter (VNUT) is 
a recent member of this transporter family and functions to load 
synaptic vesicles with ATP. 

There is long-standing evidence that the release of one or more 
neuropeptides accompanies the release of classical small-molecule 
neurotransmitters (Fig. 14.1). Interestingly, recent studies based 
on the localization of vesicular transporters suggest that clas- 
sical small-molecule neurotransmitters are also co-released. For 
instance, findings that VGLUTS is localized in 5-HT-containing 
neurons and appears to be functional [13] raises the possibility 
that the release of glutamate contributes to the actions of 5-HT 
neurons that were previously attributed to 5-HT itself. This is not 
an isolated case. Neurons previously thought to release only glu- 
tamate, acetylcholine, dopamine, or histamine have been found 
to also release the major inhibitory neurotransmitter GABA [14]. 
Such evidence supports the emerging view that neuronal commu- 
nication based on using more than one classical neurotransmitter 
is prevalent throughout the CNS. 


CHAPTER 14 Neurotransmitters and signalling 


Neuropeptides 


Following the chemical identification of the neuropeptide substance 
P in 1971, evidence has accumulated that numerous peptides play 
neurotransmitter roles in the brain [3]. Some examples are shown in 
Table 14.3. The properties of peptidergic synapses are in many ways 
different from those of synapses that utilize small neurotransmit- 
ters. The neuropeptides comprise 3-100 amino acids and, together 
with other putative signalling peptides, such as growth factors and 
cytokines, are synthesized in the nucleus by deoxyribonucleic acid 
(DNA) transcription, followed by translation from messenger ribo- 
nucleic acid (mRNA) into precursor polypeptides (Fig. 14.1). These 
precursors typically undergo extensive post-translational processing 
that includes cleavage into smaller peptides by endopeptidases, as 
well as other enzymatic modifications. The precursor peptides usu- 
ally contain an N-terminal signal sequence that directs the trans- 
port of newly synthesized protein to the lumen of the endoplasmic 
reticulum, and then the Golgi complex where it is packaged into 
vesicles (termed ‘large, dense core vesicles’ due to their appearance 
under the electron microscope) that are transported along the axon 
to the synapse. This obviates the need for neuropeptide vesicular 
transporters. 

Proteolytic processing of a single precursor peptide often gener- 
ates not one, but a family of biologically active peptides, although 
the proteolytic steps may be tissue-specific. The opioid peptides pro- 
vide one of the best worked-out examples of this form of processing. 
Pro-opiomelanocortin (POMC) is a hypothalamic precursor opioid 
peptide whose structure contains sequences for adrenocortico- 
tropic hormone (ACTH), a-melanocyte-stimulating hormone (a- 
MSH), and B-endorphin. In the anterior lobe of the pituitary gland, 
POMC is processed to form ACTH, while in the intermediate lobe, 
POMC is processed to form a-MSH and f-endorphin. On the other 


Table 14.3 Examples of families of neuropeptides 
Opioid peptides Leu-enkephalin 
et-enkephalin 
Dynorphin 
B-endorphin 
Ociceptin 
Substance P 


eurokinin A 
eurokinin B 


Tachykinins 





Hypothalamic-releasing 
factors 


Thyrotrophin-releasing factor (TRH) 
Corticotrophin-releasing factor (CRF) 

Growth hormone-releasing hormone (GHRH) 
Somatostatin 


Cholecystokinin (CCK) 
Galanin 
nsulin 
eurotensin 
europeptide Y (NPY) 
Vasointestinal polypeptide (VIP) 


Gut-brain peptides 


Other peptides Bradykinin 

Calcitonin gene-related peptide 
elanin-concentrating hormone (MCH) 
elanocortin 

Orexin 

Oxytocin 

Vasopressin 
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hand, post-translational processing of the opioid precursor peptide 
proenkephalin gives rise to multiple copies of the pentapeptide met- 
enkephalin, as well as a copy of leu-enkephalin, while a third opioid 
precursor prodynorphin gives rise to dynorphin. In total, the three 
separate opioid peptide genes give rise to at least 18 endogenous 
peptides with opiate-like activity. 

Many proteolytic enzymes involved in the processing of neuro- 
peptides have been cloned and characterized, including prohormone 
convertases that produce striking phenotypic effects when genet- 
ically manipulated in mutant mouse models [15]. The therapeutic 
utility of pharmacological manipulation of neuropeptide synthesis 
and degradation in the brain has yet to be fully realized. However, 
the success of inhibitors of the prohormone convertase that 
synthezises angiotensin in the periphery [angiotensin-converting 
enzyme (ACE) inhibitors], for the treatment of hypertension, sets 
an important precedent. 

In addition to enzymic processing, another mechanism to gen- 
erate neuropeptide diversity is through alternative ribonucleic acid 
(RNA) splicing of a single gene. For example, in the case of tachyki- 
nins, alternative splicing of preprotachykinin gene A mRNA results 
in three splice variants which, after translation and post-translational 
processing, collectively generate the five biologically active peptides 
of the tachykinin family (including substance P). 

To date, there is little evidence that neuropeptides are cleared from 
the synapse by transporters in the plasma membrane, indicating 
that they are not recycled after release. Rather, evidence suggests 
that their action is terminated by peptidases located on extracellular 
membranes. Thus, replenishment of neuropeptides during high 
levels of synaptic activity is dependent on the proteolytic enzymes 
that generate the active peptides in the neurons. 

As noted, a feature of most, if not all, neuropeptides is their co- 
localization with classic neurotransmitters. Some of the best ex- 
amples include GABA/dynorphin co-localization in movement 
control pathways (striatonigral neurons), cholecystokinin (CCK)/ 
dopamine in reward pathways (mesoaccumbens neurons), and 
glutamate/substance P in pain pathways (dorsal root ganglion 
neurons). The functional significance of this co-localization is 
not fully clear, but evidence suggests that peptide release requires 
higher frequencies of neuronal discharge than classical transmit- 
ters, and once released, the neuropeptide either facilitates or op- 
poses the function of the co-localized transmitter [3]. In a recent 
example, co-localization between 5-HT and galanin in midbrain 
raphe neurons was investigated to reveal an action of the peptide on 
5-HT feedback mechanisms. This knowledge has been exploited to 
develop galanin ligands that are under development as novel anti- 
depressant strategies [16]. 


Neurotrophic factors 


Neurotrophic factors are brain peptides that were originally recog- 
nized for their role in supporting growth, differentiation, and sur- 
vival of neurons but today these molecules are thought to possess 
many of the properties of neurotransmitters, including neuronal lo- 
calization and release and an ability to modulate synaptic function. 
In addition, there is evidence that neurotrophic factors signal in a 
retrograde fashion (see later). Neurotrophic factors are currently 
named according to the action with which they were originally 


characterized [brain-derived neurotrophic factor (BDNF), nerve 
growth factor (NGF)], and they comprise many families [17]. 

Certain features distinguish neurotrophic factors from neuropep- 
tides. In particular, neurotrophic factors are larger molecules; for 
example, BDNF has a molecular size of 14 kDa, whereas neuropep- 
tides are typically much smaller peptides. Also, while neuropeptides 
signal via GPCRs, neurotrophic factors signal via direct activation of 
a class of transmembrane-spanning proteins called protein tyrosine 
kinases (Trk receptors), of which four types have been identified so 
far (TrkA, TrkB, TrkC, and p75). In some cases, the neurotrophic 
factor receptor and protein tyrosine kinase reside in the same pro- 
tein, while in other cases, the receptor recruits an intracellular pro- 
tein tyrosine kinase. Specific neurotrophic factors signal via specific 
protein kinases (for example, NGF-TrkA, BDNF-TrkB). Activation 
of the protein tyrosine kinase leads to the phosphorylation of pro- 
teins via their tyrosine residues and the triggering of signalling 
cascades that produce not only trophic effects, but also changes in 
synaptic transmission. 

Much recent interest in neurotrophic factors derives from find- 
ings that they regulate synaptic transmission in the adult brain and 
that neurotrophic factor expression can be modulated through 
interactions with monoamine and amino acid neurotransmitters. 
For example, repeated administration with monoamine-targeted 
antidepressants increases BDNF expression in animal models and 
depressed patients, whereas decreases in BDNF have been linked to 
depression. These findings have informed a popular hypothesis that 
changes in neural plasticity at many levels (both BDNF-dependent 
and non-dependent) are important to the symptoms of depression, 
as well as the relief of these symptoms by antidepressant drug treat- 
ment [18, 19]. A more advanced version of this hypothesis combines 
with neuropsychological theories to suggest that increased neural 
plasticity at the molecular and cellular levels arms key neural circuits 
with the capacity to process emotional information and re-establish 
positive emotional associations that result in improved mood [20]. 


Chemokines and cytokines 


Chemokines and cytokines comprise large families of homolo- 
gous small proteins (6-10 kDa) and differ from neuropeptides and 
neurotrophic factors in that they are key signalling molecules of the 
immune system. However, these molecules and some of their re- 
ceptors are also present in the brain in both glial cells and neurons, 
raising the possibility that they might also have neurotransmitter- 
like functions. Although the evidence is incomplete, data show that 
chemokine and cytokine molecules are synthesized in the brain and 
have several of the characteristics that define neurotransmitters, 
including interaction with receptors and modulation of release of 
other neurotransmitters or neuropeptides [21]. This suggests that 
chemokine and cytokine signalling may have a role in neuronal and 
glial cell signalling that is distinct from their role in inflammatory 
processes. On the other hand, it is quite clear that inflammatory and 
immunologic changes can trigger responses that play a role in CNS 
protection, but also a role in CNS injury. 

This local presence of chemokine and cytokine signalling pro- 
vides a route of communication between the immune system and 
the CNS, but there are many others, including several pathways 
through which peripheral inflammatory signals can be transmitted 
to the brain [22]. For example, cytokines may pass directly into the 
brain via leaky regions in the blood-brain barrier and also bind to 


peripheral autonomic nerves such as the vagus nerve, resulting in 
modulation of afferent CNS inputs. A dysfunctional interaction 
between the immune system and the peripheral inflammatory re- 
sponse, in particular, is presently considered an important con- 
tributor to the pathophysiology of a number of psychiatric disorders, 
including major depression. Moreover, the irrefutable evidence of 
immunological and CNS interactions has challenged the previously 
held belief of CNS immune privilege, such that autoimmune mech- 
anisms are emerging as a pathophysiology disease mechanism, as 
evidenced in recent discoveries of circulating NMDA receptor anti- 
bodies in some patients with schizophrenia. The development of 
drug tools and biologics (for example, antibodies) to target cytokine 
and chemokine mechanisms is an intense area of current neurosci- 
ence research. 


Retrograde messengers 


Whereas classical neurotransmitters and neuropeptides are gen- 
erally considered to signal in an ‘anterograde’ direction (that is, 
presynaptic to postsynaptic), it is now recognized that certain 
brain molecules signal information at a synapse in a ‘retrograde’ 
direction. They are released from the postsynaptic neuron to act 
on the presynaptic neuron. Molecules falling into this category in- 
clude certain neurotrophic factors, gaseous molecules, and lipid 
messengers. 


One example of a retrograde messenger is the gaseous molecule NO 
that is produced in neurons from the amino acid L-arginine by a 
neuron-specific isoform of NO synthase (NOS). Some of the first 
evidence that NO might function as a chemical messenger in the 
brain came from findings that activation of glutamate NMDA recep- 
tors in the cerebellum caused the release of a diffusible messenger, 
which was subsequently identified as NO [23]. The current thinking 
is that increased activity at glutamatergic synapses triggers in post- 
synaptic neurons an NMDA-mediated, calcium-dependent activa- 
tion of NOS. The resulting NO then diffuses back across the synapse 
to enhance presynaptic transmission. The latter occurs, at least in 
part, through NO acting on guanylate cyclase to increase the pro- 
duction of the second messenger cyclic guanosine monophosphate 
(cGMP). In postsynaptic neurons, NO also regulates certain protein 
kinase pathways and gene transcription factors, and changes cell sig- 
nalling events by S-nitrosylation. 

Since NOS is abundant and widely distributed in the CNS, NO 
signalling is likely to contribute to many brain functions. Indeed, on 
the basis of studies on the effects of NO donors and the pharmaco- 
logical and genetic modulation of NOS, increased NO production 
is associated with a range of CNS functions, including improved 
cognition and an associated induction and maintenance of synaptic 
plasticity, and NO may be neuroprotective under some conditions 
[24]. However, because excess NO has neurotoxic potential, and be- 
cause of the difficulty of delivering NO to the CNS without inducing 
side effects through the many actions of NO on peripheral tissues, 
the development of NO-based therapies for the treatment of CNS 
disorders has yet to reach fruition. 
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Another example of retrograde signalling is by endocannabinoids 
[25]. These are a recently discovered family of naturally occurring 
lipids (including anandamide and 2-arachidonoylglycerol) that 
interact with cell surface receptors targeted by the psychotropic 
agent A’-tetrahydrocannabinol (THC). The latter is the principal 
biologically active constituent of the cannabis plant [26]. In essence, 
endocannabinoids appear to be to THC and cannabinoid receptors 
what opioid peptides are to morphine and opiate receptors. 

The current thinking is that endocannabinoids are synthesized 
enzymatically on demand within the postsynaptic neuron and, 
once produced, diffuse across the synapse in a retrograde direc- 
tion. Endocannabinoids then suppress neurotransmitter release 
through activation of a presynaptic CB, receptor, which is the main 
type of cannabinoid receptor in the brain (analogous in terms 
of structure and function to opiate receptors, but quite distinct 
pharmacologically). 

The central actions of THC, including its psychotropic effects, 
nociception, increased appetite, and anti-emetic effects, are thought 
to be principally mediated by CB, receptors. Since CB, receptors 
have a powerful influence on synaptic transmission in the brain and 
have limited distribution in the periphery (although CB, receptors 
are abundant in the immune system), drugs targeting these recep- 
tors and/or the enzymes involved in endocannabinoid synthesis and 
metabolism have interesting therapeutic possibilities. Indeed, CB, 
receptor agonist preparations are currently prescribed as analgesic 
and anti-emetic agents. 


Neurotransmitter receptors 


Neurotransmitter receptors are located on the cell surface of both 
pre- and postsynaptic neurons and, as a general rule, can be divided 
into two main types; one activates an ion channel that is intrinsic 
to the receptor (ligand-gated ion channel—sometimes called an 
ionotropic receptor), and the other activates a guanosine triphos- 
phate (GTP)-binding protein which acts as a transducer between 
the receptor and the effector system (GPCR—sometimes called a 
metabotropic receptor). As an exception to this general rule, cer- 
tain trophic factors and cytokines directly activate protein tyrosine 
kinases, as noted above. In addition, steroid hormones signal in the 
brain by crossing the plasma membrane and activating receptors in 
the neuronal cytoplasm that translocate to the nucleus where they 
bind DNA and function as transcription factors. 

Ligand-gated ion channels typically comprise a multimeric 
plasma membrane receptor complex (4-5 subunits, each with four 
transmembrane-spanning domains) that gate the influx of ions to 
evoke fast changes in synaptic signalling. For instance, nicotinic, 
5-HT, GABA, and glycine receptors are pentameric in their sub- 
unit composition; these receptors are often termed ‘Cys-loop’ re- 
ceptors due to a loop of amino acid residues formed by a disulfide 
bond in the extracellular domain. GPCRs comprise a superfamily 
of single proteins (seven transmembrane-spanning domains) that 
evoke slower changes in synaptic signalling through the generation 
of second messengers and interactions with intracellular signalling 
pathways. 
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A remarkable advance in molecular neuropharmacology in the 
last 20 years has been the discovery of huge diversity in neuro- 
transmitter receptors. This complexity takes the form of not only 
several hundred of GPCRs [27], but also considerable heterogen- 
eity in ligand-gated ion channels produced through the assembly of 
multiple receptor subunits [28, 29]. Receptors were once classified 
according to their pharmacological properties, but today receptor 
classification is based on a combination of pharmacological, func- 
tional, and structural properties. It is now evident that most, and 
probably all, neurotransmitters have more than one receptor type. 
As a consequence of their receptor diversity, individual neurotrans- 
mitters are conferred multiple downstream signalling properties. 
Typically, as soon as neurotransmitter receptors are identified, their 
distribution within the brain (and other organs and tissues) is es- 
tablished, and then a combination of pharmacological and genetic 
approaches are used to obtain an understanding of their function, 
which can then aid the development of novel drug therapies. Some 
examples are shown in Table 14.4, and more detailed information is 
available elsewhere [30]. 


Ligand-gated ion channels 


The amino acids glutamate and GABA are, respectively, the principal 
excitatory and inhibitory transmitters in the brain and exert their fast 
synaptic effects via ligand-gated ion channels. Acetylcholine (nico- 
tinic receptors) and ATP (Px receptors) transmitters also signal fast 
transmission via ligand-gated ion channels. The 5-HT; receptor is 
the only ligand-gated ion channel among the many monoamine re- 
ceptors, and none exists for neuropeptides. 





Ligand-gated ion channels for glutamate 


Glutamate elicits fast excitatory effects by activating ligand-gated ion 
channels, and there are three types: a-amino-3-hydroxy-5-methyl- 
4-isoxazolepropionic acid (AMPA) receptors, NMDA receptors, 
and less abundant kainate receptors. These receptors are named ac- 
cording to their preferred synthetic agonist, and gate cations (so- 
dium, potassium, and calcium) with varying degrees of selectivity. 
Each receptor is assembled as a tetramer, which can comprise a com- 
bination of subunits, and this generates additional heterogeneity. 
For instance, AMPA receptors are formed from a combination of 
four subunits (GluA 1-4), and NMDA receptors from two subunits 
(GluN 1-2). There are a large number of naturally occurring variants 
of both AMPA and NMDA subunits generated through RNA editing 
and alternative splicing (Table 14.4). 

The pharmacological and functional significance of this complexity 
is not yet fully clear, although evidence suggests that different receptor 
assemblies may confer distinct pharmacological and biophysical 
properties on the receptor [30]. For example, recent data suggest that 
changes in AMPA receptor subunit composition cause differences in 
calcium ion permeability and change synaptic efficacy [29]. 

In addition to a glutamate-binding site, both AMPA and NMDA 
receptors demonstrate allosteric modulatory sites, a feature common 
to many ligand-gated ion channels. Thus, in addition to glutamate, 
AMPA receptors are sensitive to AMPAkines, which comprise a 
chemically diverse group of exogenous agents that act at separate 


chemically sensitive site on the receptor. The result is potentiated 
AMPA receptor ion channel function and associated procognitive 
effects in vivo [31]. Non-glutamate sites on the NMDA receptor 
include a site for magnesium ions that is the source of a voltage- 
dependent NMDA receptor block, which requires membrane de- 
polarization to open. In addition, the NMDA receptor has a positive 
allosteric modulatory site for glycine and D-serine, and another for 
polyamines such as spermidine. While glutamate is released from 
presynaptic terminals in a phasic, activity-dependent fashion, en- 
dogenous glycine, D-serine, and polyamines (likely arising from 
non-neuronal sources) are thought to act as extracellular modula- 
tors that are present at more constant levels. These allosteric sites 
are under investigation as possible sources of NMDA receptor 
modulatory agents that do not suffer the excitotoxic effects of agon- 
ists acting directly at the glutamate site [32]. Such agents include 
inhibitors of glycine reuptake and D-serine metabolism. 


Ligand-gated ion channels for GABA 





GABA elicits fast inhibitory effects by activating the GABA, receptor, 
which is a ligand-gated ion channel that is selectively permeable to 
chloride ions. GABA, receptors are formed from five subunits, of 
which there are at least 19 types (a1-6, 81-3, y1-3, p1-3, ô, £, m, 8). 
Studies co-expressing different GABA, receptor subunits in simple 
cells, such as frog oocytes, indicate the potential for several hun- 
dreds, if not thousands, of functional GABA, receptor subunit com- 
binations. However, findings on the distribution and abundance of 
GABA, receptor subunits in brain tissue indicate the likely presence 
of a, B, and y subunits in the vast majority of receptors, and that the 
number of naturally occurring types of GABA, receptors is of the 
order of ten or fewer [33]. As with the glutamate ionotropic recep- 
tors, GABA, receptors have a number of allosteric modulatory sites, 
and such sites are sensitive to a variety of pharmacological agents, 
including benzodiazepines, certain endogenous steroids, steroidal 
anaesthetic agents such as propofol, and alcohol (ethanol). 

GABA, receptors are influenced by the binding of benzodiazep- 
ines, such as diazepam, and ‘Z drugs, like zolpidem, which produce 
their anxiolytic and sedative effects via the benzodiazepine-binding 
site. Given the high chemical specificity of the benzodiazepine- 
binding site, there has been an extensive search for an endogenous 
ligand. This has led to the discovery of the ‘endozepine, a diazepam- 
binding inhibitor, which is a peptide with many of the properties that 
would be predicted of an endogenous ligand at the benzodiazepine- 
binding site. This includes secretion (possibly from glial cells) and 
modulation of the GABA, receptor, although both positive and 
negative modulatory effects have been detected [34]. 

Genetic and pharmacological approaches have been used to iden- 
tify the pharmacological significance of multiple GABA, receptor 
subtypes, and specifically to determine the functional significance 
of six variants of the a subunit, which is critical to the binding of 
benzodiazepines [28]. In particular, studies with point-mutated 
mice have revealed that the sedative effect of diazepam is mediated 
by al-containing GABA, receptors, whereas the anxiolytic action 
is mediated by a2/a3-containing GABA, receptors. Moreover, find- 
ings that ligands with selective actions at a2- and/or a3-containing 
GABA, receptors display anxiolytic activity at doses lower than those 
that cause sedation [35]. This raises the possibility of interesting 





future drug therapies, for example drug treatments for anxiety dis- 
order that have the anxiolytic effect of benzodiazepines, such as di- 
azepam, but lack their unwanted adverse effects [36]. Interestingly, 
a5-containing GABA, receptors may be an important site of action 
of alcohol. The GABA, receptor subunit(s) targeted by steroids to 
produce CNS inhibitory effects of these agents are currently under 
investigation. 


G protein-coupled receptors 


Almostall neurotransmitters, including glutamate and GABA, signal 
effects via GPCRs, and most neurotransmitters signal via more than 
one type of GPCRs. For example, the monoamine 5-HT possesses 
14 receptor subtypes (comprising seven receptor families 5-HT,_,), 
13 of which are GPCRs and one is a ligand-gated ion channel (5- 
HT;). Each 5-HT GPCR has high affinity and selectivity for 5-HT, 
but individually the receptors are pharmacologically distinct, arise 
from different (but homologous) genes, and are formed from dif- 
ferent protein sequences with different distributions and signalling 
effects [37]. Since several 5-HT GPCRs can co-localize at a single 
synapse, the signal received by a postsynaptic neuron may be quite 
complicated. This complexity for 5-HT can be seen in many other 
transmitters, including dopamine (D,_;), glutamate (mGluR, ,), 
noradrenaline (aias 24.3, Pı-3) endocannabinoids (CB,_,), and 
neuropeptides (‘Table 14.4). 

Typically, GPCRs comprise a single membrane protein with seven 
transmembrane-spanning domains, an N-terminus facing the extra- 
cellular space, a C-terminus facing the cytoplasm, and several intra- 
cellular transmembrane domain linking loops. The N-terminus of 
some GPCRs (mGluR,_,, GABA») contains the ligand-binding site, 
while for most GPCRs, the predicted ligand-binding site lies within 
the transmembrane domains. Both the C-terminus and the third 
transmembrane intracellular loops are phosphorylated by protein 
kinases, which can result in altered GPCR function, as well as traf- 
ficking to the plasma membrane. ‘The third intracellular loop is the 
main site of G protein interaction. 


Each G protein is a heterotrimer comprising a, B, and y subunits that 
dissociate on binding of the ligand to the GPCR. On dissociation, 
the a subunit binds GTP and, through intrinsic GTPase activity, dir- 
ectly regulates a number of specific downstream effector enzymes 
and ion channels. The B/y subunits are also biologically active and 
regulate some of the same effector proteins. 

There are four major types of G proteins—G,, G» G and G,— 
that produce the following ‘canonical’ signalling effects, respect- 
ively: activation of adenylyl cyclase, inhibition of adenylyl cyclase, 
activation of phospholipase C (PLC), and interaction with calcium 
ion and potassium ion channels. Changes in the activity of adenylyl 
cyclase result in altered intracellular levels of the ‘second messenger’ 
cyclic adenosine monophosphate (cAMP). Similarly, PLC alters 
intracellular levels of inositol triphosphate (IP;) and diacylglycerol 
(DAG). Altered levels of these second messengers trigger changes in 
the activity of specific signalling cascades and ultimately changes in 
physiological responses. 
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Table 14.4 Examples of neurotransmitter receptors 







































































Dopamine D; family (dopamine D4, D5) | Adenylyl cyclase (G,) 
D, family (dopamine D,, Adenylyl cyclase (Go) 
Ds, D4 

Noradrenaline a, family (O44, 8, D) Phospholipase C (G,) 
az family (daa, B, c) Adenylyl cyclase (Gyo) 
B family (B4, 2, 3) Adenylyl cyclase (G,) 

5-HT 5-HT, family (5-HTy,,5,5,.¢.£) | Adenylyl cyclase (Gyo) 
5-HT, family (S-HT 4, ,c) Phospholipase C (G,) 
5-HT3 Cation channel 
S-HT, Adenylyl cyclase (G,) 
5-HTs family (5-HTsa, B) ot certain 
5-HT, Adenylyl cyclase (G,) 
5-HT, Adenylyl cyclase (G, 

Acetylcholine M, (muscarinic) Phospholipase C (G,) 
M2 Adenylyl cyclase (Gio) 
M; Phospholipase C (G,) 
M, Adenylyl cyclase (Gyo) 
M Phospholipase C (G4 


5 
Nicotinic (a1-10, 81-4, 6, Cation channel 





E, Y) 

GABA GABA; (a1-6, 81-3, y1-3, Chloride channel 
01-3, ô, €, 71, O) Adenylyl cyclase (Gi) 
GABA; 

Glutamate AMPA (GluA1-4) Cation channel 
NMDA (GluN1, GluN2A-D, Cation channel 
GluN3A-B) Cation channel 


Kainate (GluK1-5) 

Group | family (mGluR,,s) 
Group II family (mGluR;_s) 
Group III family 

(MGIUR;, 6,7, 8) 


Phospholipase C (Gq) 
Adenylyl cyclase (G,,,) 
G 


Adenylyl cyclase (Gyo) 




















Tachykinin (including NK; Phospholipase C (G,) 
substance P) NK Phospholipase C (Gq) 
Nk, Phospholipase C (G,) 
Opioid ô Adenylyl cyclase (Gyo 
K Adenylyl cyclase (Gyo 
u Adenylyl cyclase (Gyo 
Galanin GALI Adenylyl cyclase (Gyo 
GAL2 Adenylyl cyclase (Go 
GAL3 Adenylyl cyclase (Gyo 
Adenosine Ay Adenylyl cyclase (Go 
A; family (Aga, 8) Adenylyl cyclase (G,) 
A3 Adenylyl cyclase (Gio, 
ATP P2X family (P2X; 7) Cation channel 
RA Phospholipase C (G,) 
P2Y, Phospholipase C (G,) 
P2Y, Phospholipase C (G,) 
PIM Phospholipase C (G,) 
PANT Phospholipase C (G,) 
PN Adenylyl cyclase (G;.) 
paver Adenylyl cyclase (Go) 
P2Y 14 Phospholipase C (G, 
Cannabinoid CBI Adenylyl cyclase (Gi) 
CB2 Adenylyl cyclase (G,,,) 





The indirect opening of ion channels in response to 
neurotransmitter-induced GPCR activation leads to direct effects 
(excitatory or inhibitory) on the electrical properties of neurons, 
albeit on a longer timescale than effects produced by ligand-gated 
ion channels. Almost all neurotransmitter classes are able to evoke 
changes in ion channel opening via GPCRs, and some may be clinic- 
ally important. For example, the a,-adrenoceptor-induced opening 
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of potassium ion channels on noradrenaline neurons causes a fall 
in noradrenergic activity and release, which may contribute to the 
anxiolytic and sedative properties of a,-adrenoceptor agonists such 
as clonidine. On the other hand, the 5-HT,, receptor-induced closing 
of potassium ion channels on cortical neurons causes an increase in 
cortical neuron activity and may underlie the psychotropic effects 
of lysergic acid diethylamide (LSD) and related hallucinogens [37]. 


GPCR regulation 


Recent discoveries of interactions between GPCRs and other intra- 
cellular proteins have led to a new understanding of how the recep- 
tors are regulated and trafficked to and from the plasma membrane. 
Studies commencing on the B-adrenoceptor have identified two 
families of regulatory proteins called B-arrestins and GPCR kin- 
ases (GRKs). Within seconds of being activated by an agonist, the 
GPCR is phosphorylated by a GRK on the C-terminal cytoplasmic 
tail and other intracellular domains. This phosphorylation promotes 
the interaction of B-arrestins with the GPCR, which limits the signal 
duration, and causes loss of sensitivity to agonist activation and then 
receptor internalization from the cell surface [38]. 

In addition to B-arrestins, the C-termini of GPCRs associate 
with a large variety of transmembrane or soluble proteins, termed 
‘GPCR-interacting proteins’ (GIPs). Some GIPs are themselves 
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Fig. 14.2 Diagrammatic representation of ligand-gated ion channel and G protein-coupled 


GPCRs that form homo- or heterodimers, while other GIPs are ionic 
channels, ionotropic receptors, and proteins that control GPCR 
trafficking [39]. One interesting example of a GIP is the molecule 
p11, which reportedly functions to traffic a 5-HT GPCR (5-HT),,) 
to the plasma membrane. Evidence suggests that p11 expression is 
reduced in the post-mortem brain of patients committing suicide 
and that mice with a genetic deletion of p11 have a depressive-like 
phenotype [40]. 


Second messengers 


The generation of the second messengers cAMP and DAG by 
adenylyl cyclase and PLC, respectively, leads to activation of pro- 
tein kinases that add phosphate groups to specific protein targets 
to change their activity and ultimately trigger diverse physiological 
responses (Fig. 14.2). Enzymes called phosphatases, which remove 
phosphate groups, oppose these signalling effects. Guanylate cyclase 
is a cytosolic enzyme which also generates a second messenger 
cGMP. As noted, guanylate cyclase is activated by NO to produce 
effects on presynaptic function. 

Based on molecular cloning studies, nine forms of adenylyl cyclase 
have been identified (I-IX), and each exhibits a distinct distribu- 
tion in brain and peripheral tissues [41]. The full implication of this 
complexity is not yet understood, but it suggests that regulation of 
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eceptors. Ligand-gated ion channels comprise multiple 


protein subunits that form a central pore in the plasma membrane. On binding of the neurotransmitter, this receptor mediates fast excitatory or 
inhibitory transmission, depending on whether the channel gates cations or chloride ions, respectively. G protein-coupled receptors comprise a single 
membrane-spanning protein. On binding of the neurotransmitter, this receptor mediates slow transmission by enabling the dissociation of the G 
protein into an a subunit monomer and a B/y subunit dimer, both of which may activate an effector enzyme to generate a second messenger. Also, the 
B/y subunit dimer may directly interact with ion channels. Second messengers may also indirectly modulate ion channels through phosphorylation by 


activating protein kinases. 


Reproduced from Nestler EJ, Hyman SE, Malenka RC, Molecular Neuropharmacology, pp. 64, Copyright (200 





), with permission from McGraw-Hill Education. 


cAMP formation varies, depending on the form of adenylyl cyclase 
expressed in neuronal cells. 

Both cAMP and cGMP are degraded by phosphodiesterases 
(PDEs), which are expressed in numerous forms (types 1-11) in 
brain and peripheral tissues [30]. At high concentrations, caffeine 
and related methylxanthines inhibit PDE and this action contributes 
to the pharmacological effects of these drugs. Much effort is being 
made to develop inhibitors that are selective for brain-specific forms 
of PDE. Rolipram inhibits all isoforms of PDE4; this drug showed 
promise as an antidepressant, but its clinical utility was limited by 
peripheral side effects. However, because PDE4 enzymes comprise 
a number of isoforms, an inhibitor of one isoform may lead to the 
development of an effective antidepressant without the side effects 
of rolipram. 

GPCR-induced activation of PLC causes the breakdown of 
phosphatidylinositol, resulting in the generation and recycling 
of the second messengers IP; and DAG, via the phosphoinositide 
cycle. Both IP, and DAG produce downstream signalling effects, IP, 
through the mobilization of intracellular calcium stores and DAG 
through activating a protein kinase. There are two major isoforms 
of PLC in the brain—f and y, the B isoform being predominantly 
responsible for mediating the effects of GPCRs linked to G,. 

After its formation, IP; is recycled via a series of dephos- 
phorylations to form inositol, which is used in the regeneration of 
phosphatidylinositol. Lithium, which is an important drug in the 
treatment of bipolar disorder, inhibits one of the enzymes involved 
in the recycling of IP; [inositol-1-monophosphatase (IMPase)] and 
causes inositol depletion at therapeutic concentrations. Because in- 
ositol does not easily enter the blood-brain barrier, brain inositol 
levels are thought to fall and the production of the second messen- 
gers diminishes. It is a popular hypothesis that inositol depletion is 
responsible for lithium’s clinical effects, but this remains unproven. 
The recent discovery that the organoselenium antioxidant com- 
pound ebselen inhibits IMPase has stimulated interest in the idea 
that this agent has potential for repurposing, and specifically for use 
in bipolar depression [42]. 

In addition to IMPase, lithium interacts (albeit often at high con- 
centrations) with a range of other signalling systems, including 
various ion channels, adenylyl cyclases, and protein kinases. For 
example, lithium inhibits glycogen synthase kinase-3B (GSK-36), 
which also provides a source of inositol in the brain; certain mood- 
stabilizing anticonvulsants, such as valproate, also have this effect 
[43]. This has encouraged the development of GSK-36 inhibitors 
for bipolar disorder. However, GSK-3B has a range of functions, 
including a role in trophic mechanisms, and the safety of GSK-3B 
inhibitors is currently uncertain. 


Downstream signalling cascades 


The activation or inhibition of second messenger signalling cascades 
by GPCRs can profoundly change the intracellular environment of 
the neuron by regulating the activity of protein kinases and other 
proteins, including gene transcription factors and even enzymes in- 
volved in the regulation of chromatin structure. Consequently, these 
cascades may regulate gene transcription and protein synthesis and 
activate multiple downstream effectors, including those that form 
the cytoskeleton or contribute to mechanisms underlying synaptic 
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plasticity. Such effects can induce long-lasting changes in neuronal 
function. Increasing evidence suggests that the neuroadaptive re- 
sponses to repeated psychotropic drug administration are under- 
pinned by changes in gene expression that result in the remodelling 
of neural circuit function and structure. This thinking has been ap- 
plied to explain a multitude of neuropharmacological mechanisms, 
ranging from compulsive use of recreational drugs to the therapeutic 
action of antidepressant and antipsychotic drugs. 

As an example, recent research has seen the evolution of a fas- 
cinating theory to explain the delayed onset of antidepressant effect 
of drugs like fluoxetine and imipramine that act to inhibit plasma 
membrane monoamine transporters. It supposes that elevated 
monoamine levels (through transporter blockade) trigger GPCR 
signalling cascades that activate gene programmes to enhance neur- 
onal survival and connectivity, the latter having being weakened be- 
cause of the adverse effects of stress and other environmental factors 
[18, 44-46]. Some of the key genes involved in this process include 
trophic factors such as BDNF, which may be a trigger for the pro- 
duction of newly formed neurons and many other forms of neural 
plasticity that leads to the overall strengthening of synapses and in- 
creased information transfer through key neural circuits relevant to 
mood control. This theory is a driving force for the development of 
pharmacological strategies for improved antidepressant therapies, 
even though our knowledge of the key molecules that are changed 
by antidepressants to bring about the relief of the symptoms of de- 
pression is far from complete. 





Non-canonical GPCR signalling 


In addition to the classical (canonical) GPCR signalling, it is now 
recognized that GPCRs signal via a diversity of less well-known 
(referred to here as non-canonical) pathways, some of which are 
dependent on ‘small G proteins’ and others which are G protein- 
independent. Small G proteins (termed because of their low mo- 
lecular weight and monomeric composition) comprise a protein 
superfamily, which also bind GTP and possess intrinsic GTPase ac- 
tivity. These G proteins function as molecular switches that control 
several cellular processes, ranging from vesicle trafficking and exo- 
cytosis (for example, Rab) to assembly of cytoskeletal structures (for 
example, Rho). Among the best characterized small G proteins are 
those that comprise the Ras family. Numerous types of cell signals, 
including those of most neurotrophic factors, converge on Ras and 
related proteins to regulate mitogen-activated protein (MAP) kinase 
pathways. Some GPCR signalling is independent of G proteins and 
often involves the recruitment of members of the arrestin family of 
proteins, which then trigger downstream pathways such as those 
involving MAP kinase. 

This diversity in GPCR signalling is highly relevant in the context 
of emerging experimental evidence that different agonists acting at 
the same receptor can elicit different signals. This phenomenon is 
sometimes referred to as biased agonism or ligand-dependent sig- 
nalling. Thus, agonists for a specific receptor may differ not only 
in terms of potency and efficacy, as given in classical pharmaco- 
logical accounts, but also in terms of the signal that they elicit. 
Interestingly, evidence of ligand-dependent signalling through 5- 
HT), receptors has relevance to the psychotropic effects of 5-HT 5, 
receptor agonists, some of which are hallucinogenic, but not all, and 








SECTION 2 The scientific basis of psychiatric aetiology and treatment 


it may also explain the long-lasting effects of LSD which are diffi- 
cult to explain on the basis of pharmacokinetics alone [47]. Biased 
agonism offers intriguing possibilities for therapeutic potential (for 
example, avoidance of adverse side effects), although this is yet to 
be explored. 


Concluding remarks 


Until recently, studies on the chemistry of synaptic neurotransmis- 
sion have focused on a relatively small number of neurotransmit- 
ters and a narrow group of proteins involved in neurotransmitter 
function, specifically neurotransmitter receptors, transporters, and 
enzymes which bring about neurotransmitter synthesis or degrad- 
ation. Today, powerful molecular and genetic approaches are being 
used to identify and understand new proteins and mechanisms in- 
volved in neurotransmitter function and control. So far, just a few 
tens of perhaps thousands of neurotransmitter-related proteins have 
been successfully targeted by pharmacological agents and trans- 
lated into important treatments of psychiatric disorder, but there is 
promise of many more such treatments to come. Moreover, this huge 
diversity of neurotransmitter-related proteins is now emerging as a 
large resource for studies of genetic risk factors of psychiatric dis- 
order and investigations of biological markers of illness diagnosis 
and progression and treatment outcome. 


FURTHER INFORMATION 


Alexander, S.P., Kelly, E., Marrion, N., et al.; CGTP Collaborators. 
(2015). The concise guide to pharmacology 2015/16: overview. 
British Journal of Pharmacology, 172, 5729-43. 

International Union of Basic and Clinical Pharmacology 
(IUPHAR) Committee. Guide to pharmacology (official data- 
base on receptor nomenclature and drug classification). http:// 
www.guidetopharmacology.org 

Kandel, E.R., Schwartz, J., Jessell, T., Siegelbaum, S.A., and Hudspeth, 
A.J. (2013). Principles of neural science (5th edn). McGraw- 

Hill, New York, NY. Further details are available at: http:// 
en.wikipedia.org/wiki/Principles_of_Neural_Science 

Nestler, E.J., Hyman, S.E., Holtzman, D.M., and Malenka, R.C. (2015). 
Molecular neuropharmacology: a foundation for clinical neurosci- 
ence. McGraw-Hill, New York, NY. 


REFERENCES 


1. Maehle, A.H. (2004). ‘Receptive substances’: John Newport 
Langley (1852-1925) and his path to a receptor theory of drug 
action. Medical History, 48, 153-74. 

2. Langmoen, I.A. and Apuzzo, M.L. (2007). The brain on it- 
self: Nobel laureates and the history of fundamental nervous 
system function. Neurosurgery, 61, 891-907; discussion 907-8. 

3. Hokfelt, T., Bartfai, T., and Bloom, F. (2003). Neuropeptides: op- 
portunities for drug discovery. The Lancet Neurology, 2, 463-72. 

4. Iversen, L. (2006). Neurotransmitter transporters and their im- 
pact on the development of psychopharmacology. British Journal 
of Pharmacology, 147(Suppl 1), 882-8. 

5. Fremeau, R.T., Jr., Voglmaier, S., Seal, R.P., and Edwards, R.H. 
(2004). VGLUTSs define subsets of excitatory neurons and suggest 
novel roles for glutamate. Trends in Neurosciences, 27, 98-103. 


10. 


HE 


12 


IB: 


14. 


15: 


16. 


WE 


18. 
19. 


20. 


21 


22: 


23. 


24. 


25- 


26. 


. Kristensen, A.S., Andersen, J., Jørgensen, T.N., et al. (2011). SLC6 


neurotransmitter transporters: structure, function, and regula- 
tion. Pharmacological Reviews, 63, 585-640. 


. Zhou, Z., Zhen, J., Karpowich, N.K., et al. (2007). LeuT- 


desipramine structure reveals how antidepressants block neuro- 
transmitter reuptake. Science, 317, 1390-3. 


. Coleman, J.A., Green, E.M., and Gouaux, E. (2016). X-ray 


structures and mechanism of the human serotonin transporter. 
Nature, 532, 334-9. 


. Daws, L.C. (2009). Unfaithful neurotransmitter trans- 


porters: focus on serotonin uptake and implications for anti- 
depressant efficacy. Pharmacology and Therapeutics, 121, 89-99. 
Courousse, T. and Gautron, S. (2015). Role of organic 

cation transporters (OCTs) in the brain. Pharmacology and 
Therapeutics, 146, 94-103. 

Javitt, D.C. (2012). Glycine transport inhibitors in the treatment 
of schizophrenia. Handbook of Experimental Pharmacology, 213, 
367-99. 

Rothstein, J.D., Patel, S., Regan, M.R., et al. (2005). Beta-lactam 
antibiotics offer neuroprotection by increasing glutamate trans- 
porter expression. Nature, 433, 73-7. 

Sengupta, A., Bocchio, M., Bannerman, D.M., Sharp, T., and 
Capogna, M. (2017). Control of amygdala circuits by 5-HT 
neurons via 5-HT and glutamate cotransmission. Journal of 
Neuroscience, 37, 1785-96. 

Tritsch, N.X., Granger, A.J., and Sabatini, B.L. (2016). 
Mechanisms and functions of GABA co-release. Nature Reviews, 
17, 139-45. 

Scamuffa, N., Calvo, F., Chretien, M., Seidah, N.G., and Khatib, 
A.M. (2006). Proprotein convertases: lessons from knockouts. 
The FASEB Journal,0 20, 1954-63. 

Ogren, S.O., Kuteeva, E., Hokfelt, T., and Kehr, J. (2006). Galanin 
receptor antagonists: a potential novel pharmacological treat- 
ment for mood disorders. CNS Drugs, 20, 633-54. 

Chao, M.V. (2003). Neurotrophins and their receptors: a con- 
vergence point for many signalling pathways. Nature Reviews, 4, 
299-309: 

Castren, E. (2005). Is mood chemistry? Nature Reviews, 6, 241-6. 
Martinowich, K., Manji, H., and Lu, B. (2007). New insights into 
BDNF function in depression and anxiety. Nature Neuroscience, 
10, 1089-93. 

Harmer, C.J., Duman, R.S., and Cowen, P.J. (2017). How do anti- 
depressants work? New perspectives for refining future treatment 
approaches. The Lancet Psychiatry, 4, 409-18. 

Rostene, W., Kitabgi, P., and Parsadaniantz, S.M. (2007). 
Chemokines: a new class of neuromodulator? Nature Reviews, 8, 
895-903. 

Miller, A.H. and Raison, C.L. (2016). The role of inflammation in 
depression: from evolutionary imperative to modern treatment 
target. Nature Reviews Immunology, 16, 22-34. 

Garthwaite, J. and Boulton, C.L. (1995). Nitric oxide signaling 

in the central nervous system. Annual Review of Physiology, 57, 
683-706. 

Calabrese, V., Mancuso, C., Calvani, M., Rizzarelli, E., Butterfield, 
D.A., and Stella, A.M. (2007). Nitric oxide in the central nervous 
system: neuroprotection versus neurotoxicity. Nature Reviews, 8, 
766-75. 

Lu, H.C. and Mackie, K. (2016). An introduction to the en- 
dogenous cannabinoid system. Biological Psychiatry, 79, 
516-25. 

Piomelli, D. (2003). The molecular logic of endocannabinoid sig- 
nalling. Nature Reviews, 4, 873-84. 


Ms 


28. 


297 


30. 


31. 


32. 


29: 


34. 


35, 


36. 


SVE 


38. 


Fredholm, B.B., Hokfelt, T., and Milligan, G. (2007). G-protein- 
coupled receptors: an update. Acta Physiologica, 190, 3-7. 
Rudolph, U. and Mohler, H. (2006). GABA-based therapeutic ap- 
proaches: GABAA receptor subtype functions. Current Opinion 
in Pharmacology, 6, 18-23. 


Schuman, E.M. and Seeburg, P.H. (2006). Signalling mechanisms. 


Current Opinion in Neurobiology, 16, 247-50. 

Alexander, S.P., Mathie, A., and Peters, J.A. (2007). Guide to re- 
ceptors and channels, 2nd edition (2007 revision). British Journal 
of Pharmacology, 150(Suppl 1), S1. 

Lynch, G. and Gall, C.M. (2006). Ampakines and the threefold path 
to cognitive enhancement. Trends in Neurosciences, 29, 554-62. 
Kemp, J.A. and McKernan, R.M. (2002). NMDA receptor path- 
ways as drug targets. Nature Neuroscience, 5(Suppl), 1039-42. 
McKernan, R.M. and Whiting, PJ. (1996). Which GABAA- 
receptor subtypes really occur in the brain? Trends in 
Neurosciences, 19, 139-43. 

Farzampour, Z., Reimer, R.J., and Huguenard, J. (2015). 
Endozepines. Advances in Pharmacology, 72, 147-64. 

Mohler, H. (2012). The GABA system in anxiety and depression 
and its therapeutic potential. Neuropharmacology, 62, 42-53. 
Skolnick, P. (2012). Anxioselective anxiolytics: on a quest for the 
Holy Grail. Trends in Pharmacological Sciences, 33, 611-20. 
Barnes, N.M. and Sharp, T. (1999). A review of central 5-HT re- 
ceptors and their function. Neuropharmacology, 38, 1083-152. 
Lefkowitz, R.J. (2007). Seven transmembrane receptors: some- 
thing old, something new. Acta Physiologica, 190, 9-19. 


shh 


40. 


41. 


42. 


43. 


44. 


45. 


46. 


47. 


CHAPTER 14 Neurotransmitters and signalling 


Bockaert, J., Roussignol, G., Becamel, C., et al. (2004). GPCR- 
interacting proteins (GIPs): nature and functions. Biochemical 
Society Transactions, 32(Pt 5), 851-5. 

Svenningsson, P., Chergui, K., Rachleff, I., et al. (2006). 
Alterations in 5-HT1B receptor function by p11 in depression- 
like states. Science, 311, 77-80. 

Cooper, D.M. (2003). Regulation and organization of adenylyl 
cyclases and cAMP. The Biochemical Journal, 375(Pt 3), 
517-29. 

Singh, N., Halliday, A.C., Thomas, J.M., et al. (2013). A safe 
lithium mimetic for bipolar disorder. Nature Communications, 4, 
11332: 

Gould, T.D. and Manji, H.K. (2005). Glycogen synthase kinase- 
3: a putative molecular target for lithium mimetic drugs. 
Neuropsychopharmacology, 30, 1223-37. 

Duman, R.S. (2002). Synaptic plasticity and mood disorders. 
Molecular Psychiatry, 7(Suppl 1), 829-34. 

Sharp, T. (2013). Molecular and cellular mechanisms of anti- 
depressant action. Current Topics in Behavioral Neurosciences, 14, 
309525) 

Duman, R.S., Aghajanian, G.K., Sanacora, G., and Krystal, J.H. 
(2016). Synaptic plasticity and depression: new insights from 
stress and rapid-acting antidepressants. Nature Medicine, 22, 
238-49. 

Wacker, D., Wang, S., McCorvy, J.D., et al. (2017). Crystal struc- 
ture of an LSD-bound human serotonin receptor. Cell, 168, 
377-89 e12. 





15 


Psychoneuroimmunology 
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Psychoneuroimmunology (also known as psychoneuroendocrino- 
immunology) is a growing scientific discipline that studies the 
complex bi-directional circuit between the nervous system and the 
immune system in health and pathological conditions. The study 
of these interactions requires a multi-disciplinary and convergent 
approach from psychology, neurosciences, immunology, pharma- 
cology, psychiatry, behavioural medicine, infectious diseases, endo- 
crinology, rheumatology, and other disciplines. 

The term was coined around the 1970s by R. Ader and N. Cohen, 
studying how episodes of stress and anxiety affect a person’s immune 
system and how this activation of the immune system affect mental 
processes and health in extension. 

Along history, a common place for psychoneuroimmunologists 
is the study of stress exposure and how individuals mount proper 
responses to cope with it. The mechanism/s implicated in the stress- 
induced modification of the immune system will be briefly discussed 
in this chapter, as well as the consequences in the structure and func- 
tion of the CNS, both at molecular and behavioural/cognitive levels. 
Some large meta-analytic studies indicate both activation and de- 
pression of the immune system after stress exposure, depending on 
the duration, intensity, and the kind of stressful event (for example, 
trauma or loss) [1]. 





The term stress has been widely used since a long time ago to define 
a range of situations and experiences like those in which somebody 
is suffering from pressure, anxiety, or sadness, for example, being 
its use correct in all of them. Physiologically, stress is a situation 
produced subsequent to experiencing an alteration in an organism's 
homeostasis mainly due to an aversive threat. W. Cannon and 
H. Selye made some of the initial discoveries in this field and were 
the first to use the terms stress and ‘stress response’ [2]. This re- 
sponse, constituted by a three-phase mechanism and defined by 
Selye as general adaptation syndrome, is the result of an adapta- 
tion necessary to allow the overcoming of situations in which an 





organism has to fight or flight to survive. Some of the physiological 
effects observed in this response are an increase in arousal and 
concentration capacity, faster cardiac rhythm, higher blood pres- 
sure, suppression of the digestive process, redirection of blood to 
muscles, and a reduction of sexual desire and the immune system 
activity, among others. All these co-operate to allow the organism 
to detect the danger and provide it with all the energy available, in 
order to achieve survival, while depriving the organism from en- 
ergy necessary for other processes. However, stress is a double-faced 
phenomenon, since while this fast and reversible response is essen- 
tial for survival, it may cause adverse effects when secretion of stress 
hormones is sustained. Indeed, very intense or long-lasting stress 
results in a new biological equilibrium that can be either beneficial 
(for example, exercise-induced conditioning of the cardiovascular 
system) or detrimental, causing damage or disease due to maladap- 
tation. In humans, the treat might be real or not. Although stress is 
not a disease in itself, continuous exposure to stressful stimuli has 
been clearly related with the onset, progression, or outcome of many 
psychopathological processes [3]. 

Stress response affects many different organs and systems, 
including the CNS. In the CNS, stress over the course of weeks causes 
reversible atrophy of hippocampal dendrites and apoptosis, whereas 
overexposure for months can cause permanent loss of neurons in 
rodents. Importantly, accumulating evidence of such stress-induced 
damage in the human brain has been presented [4], particularly in 
people with PTSD or major depression (that is, decreasing volumes 
in several brain areas). Studies carried out in animal models or in 
humans indicate that psychological stress induces a clear inflamma- 
tory response in the brain, accompanied by the release of cytokines 
and oxido-nitrosative mediators and the activation of several intra- 
and extracellular pathways [5, 6]. 





Although stress-induced inflammation occurs in the brain, there 
is much recent evidence indicating that stress induces a systemic 
inflammatory response, which suggests possible inflammatory- 
immune cross-talk between the brain and the periphery. The CNS 
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has long been considered an immune-privileged organ; however, 
this immune status is far from absolute. There are multiple neuro- 
immune pathways via which systemic inflammation reaches the 
CNS (Fig. 15.1). 

These could be directly related to behavioural (for example, 
sickness behaviour, social avoidance, anhedonia) and cognitive 
alterations (for example, memory disruption), which resemble 
psychiatric symptomatology. The relative relevance of each one 
may vary in the function of the type, duration, and severity of the 
stimuli. 


The humoral pathway: signalling through the 
blood-brain barrier 


The BBB structure is a complex histological, multi-layered struc- 
ture, formed by a thick, continuous glycocalyx (a complex struc- 
ture of proteoglycans and sialoproteins in the endothelium), 
non-fenestrated endothelial cells, linked by tight junctions (TJs), 
two basement membranes (vascular basement membrane and glia 
limitans), astrocytic end-feet, and two types of perivascular cells 
(pericytes and perivascular macrophages). All elements of this 
structure contribute to the functional BBB. 


NEUROINFLAMMATION 


Neuro-immune signalling through the BBB is called the ‘humoral 
pathway and occurs via two mechanisms: (1) energy-dependent 
transportation of immune signals by means of specific/non-specific 
transporters in the non-fenestrated endothelium of the BBB; and 
(2) the endothelium and perivascular-associated cells retaining the 
capacity to detect changes in circulating immune signals (cytokines) 
and initiating appropriate brain responses by increasing the synthesis 
and local release of signalling molecules, notably prostaglandins such 
as prostaglandin E, (PGE,). Release of cerebrovascular PGE, activates 
nearby catecholamine-containing neurons that project to the para- 
ventricular hypothalamic nucleus (PVN), regulating neuroendocrine 
responses to stress. Like PGE,, nitric oxide (NO) produced by the 
endothelium is also considered a second messenger implicated in the 
humoral pathway. 

In inflammatory conditions, passage of inflammatory cells 
(lymphocytes, neutrophils, and monocytes) across the BBB occurs 
primarily at the post-capillary venules in a process called cellular 
transmigration through a disrupted endothelium. Once leucocytes 
reach the perivascular space, a second step requires passage across 
the glia limitans to enter the brain parenchyma in a process regu- 
lated by perivascular cells (macrophages and pericytes). 
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Fig. 15.1 Multiple neuro-immune pathways by which systemic inflammation can reach the CNS. Possible functional, structural, and clinical 
consequences. BBB: blood-brain barrier; CVO: circumventricular organs; CP: choroid plexus; NTS: nucleus tractus solitarius, MDD, major depressive 


disorder; SCHZ, schizophrenia; BPD: bipolar disorder. 


Includes images adapted from Motifolio Drawing Toolkits (www.motifolio.com). 


Signalling through structures in the brain that lack a 
normal blood-brain barrier (circumventricular organs) 


The circumventricular organs (CVOs) can be functionally divided in 
two major types: (1) the sensory organs, including the area postrema 
(AP), the subfornical organ (SFO), and the vascular organ of the 
lamina terminalis; and (2) the secretory organs, mainly including 
the subcommissural organ (SCO) and the median eminence. The 
sensory organs detect and transmit peripheral immune signals to 
the CNS, and conversely, the secretory organs produce signals (that 
is, hormones) from the CNS to modulate the activity of peripheral 
organs and systems. The particular and extensive fenestrated vas- 
culature of these organs allows direct interaction between neurons 
and glial cells, which also form part of their structure and periph- 
eral blood flow. These organs also possess neuroanatomical connec- 
tions between them, as well as with other remote brain areas and 
structures involved in the regulation of the immune system such as 
the PVN and vagal sensory fibres. The exact contribution of each 
organ to transduce cytokine-dependent signalling is not known and 
may vary in function of the intensity of the stimulus. As occurs in 
the choroid plexus (CP), there are resident macrophage-like cells 
in the CVOs that respond to circulating stimuli by producing pro- 
inflammatory cytokines, and also in pathological conditions, there 
are increased numbers of CD45" leucocytes in the CVOs, suggesting 
the recruitment of inflammatory cells into the parenchyma of the 
CVOs, probably mediated by endothelial adhesion molecules. 


Signalling through the blood-cerebrospinal fluid 
barrier formed by the choroid plexus and meningeal 
arachnoid membrane (endothelium and meningeal 
macrophages) 


The CP is located in the brain ventricles and is formed by epithelial 
cells [with TJs and gap junctions (GJs) between them] that rest upon 
a basal lamina, and a second stromal structure consisting of central 
connective tissue and highly fenestrated vascularized tissue popu- 
lated by diverse cell types (fibroblasts, macrophages, and dendritic 
cells). Its main function is the production of CSF. 

It is accepted that CP response to acute peripheral stimuli is fast 
and strong but loses magnitude on chronicity. The CP is the pre- 
ferred site for initial transepithelial leucocyte trafficking from the 
periphery to the brain, because it contains resident immune cells 
(the function of CD# T cells is especially relevant in this regard) that 
produce pro-inflammatory cytokines and expresses major histo- 
compatibility complex (MHC) and leucocyte adhesion molecules. 
The CSF passes into the subarachnoid spaces surrounding the brain 
and spinal cord. Part of the CSF drains into blood via arachnoid villi 
in venous sinuses, but the CSF also drains from the cerebral sub- 
arachnoid space to cervical lymphatic organs. This pathway also 
allows traffic of antigen-presenting cells between the brain paren- 
chyma and regional lymph nodes. 


Direct recruitment of immune stimulus-primed 
peripheral immune cells to brain parenchyma 


Peripheral immune cells may act as important modulators of 
neuroinflammation. Under physiological conditions, several types 
of immune cells (monocytes, granulocytes, dendritic cells, T cells, 
and perivascular macrophages) of haematopoietic origin popu- 
late specific brain areas (vasculature, CP, and meninges) and exert 
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supportive and immunosurveillance actions. By contrast, in a state 
of chronic/uncontrolled neuroinflammation (for example, chronic 
exposure to stress), these cells are potentially detrimental due to 
their capacity to produce massive levels of pro-inflammatory medi- 
ators such as cytokines and chemokines. 

In the resolution phase of neuroinflammation after injury, re- 
cruitment of blood-derived cells (that is, macrophages) facilitates 
the resolution of the neuroinflammatory response by displaying an 
M2-like anti-inflammatory profile. As previously commented, these 
monocyte-derived macrophages enter the injured brain site through 
a designated barrier—the CP within the blood-CSF barrier, rather 
than through the breached BBB. 


Neural pathway 


The ‘neural pathway’ is mainly constituted by the activation of pri- 
mary sensory afferent nerves such as the vagus nerve by macro- 
phages, increased cytokine levels, or direct activation of the innate 
immune receptors Toll-like receptors (TLRs) by DAMPs (damage- 
associated molecular patterns) or PAMPs (pathogen-associated mo- 
lecular patterns). Inflammation signals reach the nucleus tractus 
solitarius, which interconnects with the dorsal motor nucleus where 
the majority of efferent vagus nerve fibres originate. 

The efferent component of the vagus nerve detects signals 
from the periphery, and through the release of acetylcholine 
(ACh) and consequent activation of a7 nicotinic ACh receptors 
(a7nAChRs) expressed in macrophages and T cells, the produc- 
tion of pro-inflammatory cytokines is markedly inhibited. In add- 
ition, stimulation of the vagus nerve induces activation of stress 
hypothalamus-pituitary-adrenal (HPA) axis, which results in 
glucocorticoid release by the adrenal cortex, with broad anti- 
inflammatory actions. This neuro-immune communication is 
termed ‘the inflammatory reflex. 

In conclusion, the brain and the immune system have mutual de- 
pendency. The brain regulates multiple organs, the immune system 
included, mainly at the level of lymphoid organs. On the other hand, 
immune cells help to maintain brain homeostasis and plasticity 
when brain-resident microglia, astroglia, and perivascular macro- 
phages are unable to cope with an exacerbated neuroinflammatory 
response. Neuro-immune interactions take place through different 
barriers, each with specific, and often complementary, functions. 
Elucidation of the multiple interconnections between them will 
improve the development of new approaches for the treatment of 
neurodegenerative and psychiatric diseases. 


Inflammation and depression (in animal models 


and humans) 





A biological basis for mood disorders was described as early as the 
fifth century Bc, when Hippocrates referred to melancholia as a con- 
dition associated with ‘aversion to food, despondency, sleeplessness, 
irritability and restlessness. Nevertheless, the fundamental patho- 
physiology of depression still stays elusive. In fact, different studies 
are indicating that, in some cases, more than 30% of depressed pa- 
tients fail to achieve remission despite multiple treatment trials [7]. 
Interestingly, mounting data indicate that inflammation may 
also play a role in the pathophysiology of major depressive disorder 
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(MDD). Given the accelerating development of biomarkers and 
treatments focused on the inflammatory response, there is tremen- 
dous promise that these advances, in addition to their relevance to 
general medicine, may have unique applications in psychiatry. 

Different epidemiological studies have demonstrated that MDD 
is associated with a higher prevalence of elevated markers of inflam- 
mation [8], and different meta-analyses have shown that the levels 
of cytokines and other indicators of an inflammatory scenario are 
elevated in the periphery of patients with MDD [9-11]. In add- 
ition, a recent meta-analysis indicated that elevated peripheral levels 
of inflammation are contributory to treatment resistance in MDD 
subjects [12]. Indeed, elevated levels of inflammatory markers pre- 
dict a poorer response to drugs against depression (antidepressants), 
and those who do not respond to AD treatments show persistently 
elevated inflammation. 

Thus, inflammation, both in the brain and in the periphery, is 
being presented as an element worth considering in the aetiology of 
psychiatric conditions such as MDD. This conception is grounded 
in the strongly activated inflammatory/immune response detected 
in these diseases [7, 13-15] and in that patients with MDD who are 
otherwise medically healthy have been repeatedly observed to have 
activated inflammatory pathways. In addition to those pathways are 
worth mentioning increased levels of pro-inflammatory cytokines 
and acute phase proteins and increased expression of chemokines 
and adhesion molecules. 

Long-term exposure to cytokines also has been shown to lead to 
marked behavioural alterations in humans. Thus, several studies in 
humans suggest that immune-targeted therapies may have clinical 
benefit. For example, medically healthy depressed patients who re- 
ceived the selective cyclo-oxygenase-2 (COX-2) (a key enzyme for 
the production of prostaglandins and other eicosanoids) inhibitor 
celecoxib, in combination with reboxetine, showed greater symp- 
tomatic improvement vs patients randomized to reboxetine plus 
placebo [16]. 

Antidepressant activity of anti-inflammatory therapy has also 
been observed in patients with autoimmune and inflammatory dis- 
orders. For example, in a large double-blind, placebo-controlled 
trial of the TNFa antagonist etanercept for the treatment of psor- 
iasis, participants who received etanercept exhibited significant 
improvement in depressive symptoms, compared with placebo- 
treated subjects, an effect independent of improvement in disease 
activity [17]. 

Thus, it seems reasonable to recognize that pro-inflammatory 
cytokines induce not only symptoms of sickness, but also true 
MDDs in physically ill patients with no previous history of mental 
disorders. 

Besides, clinical observations and epidemiological data have also 
demonstrated associations between suicide and inflammatory cyto- 
kines in the orbitofrontal cortex, a brain region involved in suicidal 
vulnerability. Interestingly, the perception of menace that directs 
suicidal individuals to consider suicide may trigger biological stress 
responses, including the inflammatory ones [18]. 

Aside from evidence of increased levels of inflammatory markers 
in patients with MDD, data from laboratory animals indicate that 
the administration of innate immune cytokines, including inter- 
feron (IFN)-a, or an immunological challenge induced by lipo- 
polysaccharide (LPS) injection (major component of the outer 
membrane of Gram-negative bacteria which induces the secretion 


of cytokines) leads to multiple behavioural changes that overlap 
with MDD, including depressed mood, anhedonia, psychomotor 
slowing, disrupted sleep, anxiety, etc. This is the so-called sickness 
behaviour [19, 20]. These findings are consistent with a vast litera- 
ture on laboratory animals, indicating that cytokine antagonists or 
anti-inflammatory agents can block the development of behavioural 
changes following immune activation. 

On the other hand, it seems fairly recognized nowadays that 
both genetic and environmental factors (such as stress) contribute 
to depression. Currently, experimental models of depression are 
based on exposure to stress, and it has been widely described that 
experimental stress induces pro-inflammatory actions mediated by 
glucocorticoids, catecholamines, glutamate, and other mediators re- 
leased by stress [5]. Thus, experimental data indicate that inflam- 
mation is an important component of MDD, and in particular, 
pro-inflammatory cytokines that are induced by injury and infec- 
tion, as well as by psychological stress, are implicated in depressive- 
like behaviour in rodent models. 

Some of the mechanisms that might be responsible for 
inflammation-mediated sickness and depression have now been 
elucidated. These findings suggest that the brain-cytokine system, 
which is, in essence, a diffuse system, is the unsuspected conductor 
of the ensemble of neuronal circuits and neurotransmitters that or- 
ganize physiological and pathological behaviour. Some of the evi- 
dence will be discussed in the next sections. 


Depression, serotonin, and cytokines 


Traditionally, the prevailing hypothesis is that depression is caused 
by some disorganization of brain serotonergic systems, perhaps a 
deficiency in serotonergic neurotransmission. This hypothesis is 
based largely on the observation that many effective drugs for de- 
pression affect serotonergic transmission. Also, serotonin receptor 
abnormalities are found in the brain of patients with MDD. While 
there is no doubt that drugs for depression increase synaptic levels 
of serotonin, noradrenaline, and (some of the drugs) dopamine, 
these compounds also suppress pro-inflammatory cytokine produc- 
tion (for example, IL-16, IL-6, TNFa) and stress hormone release 
and may additionally stimulate anti-inflammatory cytokine (for 
example, IL-10) release [21]. These data indicate again that neuro- 
immune and neuroendocrine system perturbations likely play an 
important role in the aetiology of depression. 


Depression, immune system activation, and 
indoleamine 2,3-dioxygenase 


As aforementioned, an immunological challenge, such as induced 
by LPS injection, leads to multiple behavioural changes that overlap 
with MDD. Thus, a factor which induces an immunological re- 
sponse (such as infection) and subsequent cytokine release could be 
implicated in the origin of this disease. 

Aninteresting hypothetical mechanism by which infections might 
induce depression relates to the metabolism of tryptophan and sero- 
tonin (5-HT). Infections induce IFN-y and TNFa, which are potent 
inducers of indoleamine 2,3-dioxygenase (IDO) in macrophages 
and certain other cells. This enzyme degrades tryptophan, so IDO 
enables the conversion of tryptophan to kynurenine. The resulting 
catabolism of tryptophan decreases its circulating concentrations, 
and lower plasma concentrations of tryptophan have been associ- 
ated with depression. It has been postulated that such a decrease may 


limit the availability of tryptophan to the brain, thus limiting sero- 
tonin synthesis and precipitating depression [22]. 

Furthermore, the induction of IDO through immune system acti- 
vation could also be playing a role in other theoretical mechanisms 
implicated in the pathophysiology of MDD such as excitotoxicity 
and glutamate metabolism. As it has been already mentioned, once 
activated by pro-inflammatory cytokines, IDO degrades trypto- 
phan. But IDO activation also generates potentially neurotoxic 
kynurenine metabolites, such as kynurenic acid and quinolinic acid, 
which could contribute to the pathogenesis of psychiatric diseases 
through their actions on the glutamate NMDA receptor (that is, 
kynurenic acid is an endogenous glutamate NMDA receptor an- 
tagonist, and quinolinic acid an endogenous glutamate NMDA re- 
ceptor agonist) [23]. 


Depression, bacterial translocation, and 
innate immunity 


Recent studies have postulated that increased gastrointestinal 
permeability with an increased translocation of LPS from Gram- 
negative bacteria may play a role in the pathophysiology of MDD. 
Specifically, it has been suggested that increased LPS translocation 
may mount an immune response, and thus an inflammatory status, 
in some patients with MDD and may induce specific sickness behav- 
iour symptoms [24]. 

Disruptions of the intestinal epithelium allow normally poorly 
invasive enterobacteria to exploit lipid raft-mediated transcytotic 
pathways or the enlarged spaces to cross the gut wall. Thus, loss of 
the epithelial barrier integrity caused by inflammation and/or stress 
may induce an increased bacterial translocation, provoking in- 
creased serum concentrations of LPS. And as it is well known, LPS 
causes induction of nuclear factor kB (NF-«B), the major upstream 
intracellular mechanism which regulates inflammatory and oxida- 
tive/nitrosative stress mediators, such as COX-2 and inducible NO 
synthase (iNOS), and also causes cytokine release through its union 
to the TLR-4 [6]. 

TLRs are one of main actors of innate immunity, and there are a 
good number of preclinical studies showing that TLRs are involved 
in depression-like behaviour induced by experimental models of 
MDD. Furthermore, there is increasing evidence from the clinical 
arena suggesting that TLRs are involved in the pathophysiology of 
MDD. Recently, TLR-4 mRNA expression in peripheral blood has 
been identified as an independent risk factor related to the severity 
of MDD. In addition, studies have shown that some elements of the 
TLR-4 signalling pathway were upregulated at mRNA level in per- 
ipheral blood mononuclear cells (PBMCs) of patients with MDD. 
And importantly, these peripheral alterations of the TLR-4 signal- 
ling pathway have been corroborated in post-mortem human brain 
tissue (dorsolateral prefrontal cortex) from depressed and suicidal 
patients [25-27]. 

Based on this, it is plausible to conclude that increased bacterial 
translocation is another pathway which may explain the inflamma- 
tory pathophysiology of MDD. Thus, the TLR-4 pathway may either 
primarily (increased translocation of LPS inducing inflammation) 
or secondarily (primary inflammation that may induce bacterial 
translocation) be involved in the inflammatory pathophysiology 
of MDD. 

Consequently, the study of possible bacterial translocation and 
the kind of bacteria present in an organism (microbiota) has a 
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crucial importance and represents an area of research in which fur- 
ther investigation is warranted. 

The psychoneuroimmunology approach to the study of the patho- 
physiology of MDD is crucial, as it could expose related biomarkers 
for the identification and monitoring of potentially responsive pa- 
tients. From this point of view, there are already compilations of 
common peripheral biomarkers measured in studies on MDD [28]. 

Among the inflammatory and immune response peripheral 
biomarkers, cytokines (for example, TNFa, IL-1, IL-6, IFN-y), 
C-reactive protein, neopterin (released by macrophages and con- 
sidered a marker of activation of cell-mediated inflammation), and 
tryptophan catabolites, as well as different elements of the IDO 
pathway, stand out. 

Bearing in mind that inflammation is often followed by oxida- 
tive damage, there are other peripheral biomarkers of oxidative and 
nitrosative stress to consider such as malondialdehyde (a product 
of lipid peroxidation), superoxide dismutases (SODs) (an important 
antioxidant defence in nearly all cells exposed to oxidative stress), 
and glutathione peroxidase and reductase. 


Inflammation and psychosis (in animal models 





and in humans) 


The idea about the existence of alterations of the innate immune 
system taking part in the aetiology and pathophysiology of psych- 
otic disease is long-standing but has been receiving renewed interest 
in the last years. Specifically, alterations of the innate immune 
system could contribute to the pathophysiology of these mental 
diseases, producing alterations in processes such as dopaminergic 
and serotonergic metabolism and transmission, neurotrophic factor 
levels, neurogenesis, and even behaviour and cognition. 

There is converging evidence from different fields of knowledge 
and approaches (genetics, pharmacology, brain imaging, clinical 
trials), both in experimental animal models and in human bio- 
logical samples from diverse organic compartments (serum, plasma, 
PBMCs, and post-mortem human brain tissue samples), as well as 
from subjects diagnosed with different degrees of psychotic disease. 

Most of the evidence supporting peripheral inflammatory changes 
in schizophrenia involves elevated plasma pro-inflammatory cyto- 
kine levels (that is, IL-16 and -6). These effects have been directly 
associated with certain clinical features such as cognitive impair- 
ment and brain volume loss or with negative symptoms. There is 
less evidence on alterations in the number, activity, and profile of 
peripheral immune cells in psychotic disease, but some authors have 
described T cell dysfunction in patients with schizophrenia and an 
imbalance of immune responses towards a major humoral [T helper 
2 (Th2)] response in plasma, correlated with a worse prognosis. In 
addition, recent studies have reported the existence of anti- NMDA 
receptor antibodies and surface dopamine-2 receptor antibodies in 
serum and CSF of subgroups of individuals with psychotic disease. 

Inflammatory evidence is also present in the brain. Abnormal 
CSF cytokine and leucocyte levels have been identified in the brain 
of schizophrenia patients. Post-mortem evidence for inflammation 
is growing. Brain microglial activation and abnormal lymphocyte 
levels have been suggested in post-mortem and PET studies, al- 
though novel, highly specific PET ligands for activated microglia 
are needed. Alterations in other immunity-related systems at the 
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CNS level have been also reported, such as the case of prostanoids 
(that is, prostaglandins and related lipid-derived compounds), 
endocannabinoids, and the tryptophan-degrading enzyme 
indoleamine 2,3-dioxygenase (IDO) generating potentially neuro- 
toxic kynurenine metabolites such as kynurenic and quinolinic 
acids. Schizophrenia manifest interrelated peripheral and central 
symptomatology. Determining the relationship between peripheral 
and central changes in inflammation-related molecules in subjects 
with a psychotic disorder could be useful not only for diagnosis and 
monitoring of the natural course of the disease (trait or state bio- 
markers, risk/protective factors), but also to reveal possible mech- 
anisms with aetio-pathophysiological relevance. 

One relevant question to address is the mechanism/s whereby 
inflammation leads to brain cell damage in psychotic disease. 
Some of the putative mechanisms involved include: (1) microglial 
overactivation or increased microglial cellular density; (2) oxida- 
tive/nitrosative stress; (3) uncontrolled activation of the HPA stress 
axis (hypercortisolism); (4) excitotoxicity and disrupted glutamate 
metabolism; (5) mitochondrial dysfunction and energy deficits; 
(6) reduced levels of neurotrophins (that is, BDNF, NGF); (7) im- 
paired neurogenesis; (8) apoptosis; (9) demyelination; (10) impaired 
glucose transport and metabolism; and (11) functional BBB break- 
down and ultrastructural abnormalities affecting a number of brain 
functions, including excitatory/inhibitory neurotransmission, stress 
and alarm phase responses, and synaptic plasticity. 

Another issue to elucidate is the possible origin of increased in- 
flammation in psychotic disease. There are multiple theories dealing 
with the origin of increased inflammation in schizophrenia. Five 
major categories exist: 


1. Inflammatory changes before birth (due to prenatal or maternal 
infections and/or obstetric complications). The common notion 
is that these factors ‘prime’ an immature fetal immune system 
that will remain impaired for a lifetime. This idea is gaining mo- 
mentum, and schizophrenia is now strongly related to anomalies 
in neurodevelopment. Currently, the use of maternal immune 
activation models in rodents is receiving increasing attention in 
translational psychiatry. These models are relevant because they 
reproduce in the progeny some functional and structural anom- 
alies in the brain, as well as behavioural alterations and cognitive 
deficits, characteristic of the pathophysiology of schizophrenia. 

2. Inflammatory changes in crucial stages of brain development 
and maturation (early adulthood) [29]. During early adulthood, 
the brain undergoes decisive changes, completing the myelin- 
ation of axon fibres and synaptic pruning in particular areas. 
These neurodevelopmental changes coincide with the matur- 
ation of dopaminergic neurotransmission in cortical structures 
and may be crucially affected by stimuli triggering potentially 
detrimental innate immune responses. 

3. Inflammatory lifetime changes produced by diverse environ- 
mental factors such as episodes of psychosocial stress, infections 
(viral reactivations, poor hygiene, periodontal problems), dietary 
deficiencies, alcohol, tobacco, metabolic and autoimmune dis- 
orders, medications, etc. In this regard, the family of TLRs have 
emerged as a possible mechanism involved. TLRs are the first 
line of defence against invading microorganisms and other im- 
mune stimuli. Their expression is modulated in response to 
pathogens and other environmental stresses. The most studied 


member of the family TLR-4 orchestrates neuroinflammation in 
animal models, and its expression is regulated by stress-induced 
bacterial translocation of gut microflora. Alterations in per- 
ipheral TLR activity have been described in schizophrenia and 
other psychiatric disorders such as depression. 

4. Inflammatory changes produced as a consequence of genetic sus- 
ceptibility due to the existence of hypo-/hyperpolymorphisms in 
certain immune/inflammatory-related genes as risk factors for 
this disorder. Genome-wide association studies have identified 
genes involved in the immune response (MHC, etc.) that cor- 
relate with schizophrenia diagnosis. 

5. Loss or impairment of protective function in at-risk cell types, 
such as parvalbumin-expressing cortical interneurons, in rela- 
tion to oxidative/nitrosative stress. 


Several studies have shown that peripheral inflammatory markers 
may vary with the clinical status of patients and can be used as pos- 
sible trait/state biomarkers for early diagnosis and also for moni- 
toring disease progression and treatment response. Excellent 
candidates are pro-/anti-inflammatory cytokines (IL-1, IL-6, and 
transforming growth factor-f), oxidative/nitrosative stress medi- 
ators [thiobarbituric acid reactive substances (TBARS), SOD], and 
even some lymphocyte phenotypes (CD4/CD8, CD56). The exist- 
ence of a ‘golden marker’ for the disease is highly improbable; ra- 
ther, studying complete and robust pathways with all elements of 
intracellular and intercellular pathways, including elements of bal- 
ancing mechanisms and their relationship with positive/negative 
symptomatology and cognition deficits, is an interesting approach. 

Besides studies for the search of biomarkers, inflammation can 
be pharmacologically modulated to improve the clinical picture. 
There are some completed and ongoing double-blind, randomized, 
placebo-controlled clinical trials with anti-inflammatory augmen- 
tation in schizophrenia, but the results are modest and only af- 
fecting specific subgroups of patients with schizophrenia. Based on 
this evidence, future studies should have a change of focus. Current 
studies propose the control of inflammation via direct inhibition 
of pro-inflammatory mediators, but pharmacological stimula- 
tion of antioxidant/anti-inflammatory pathways to potentiate the 
endogenous response against disease is emerging as a promising 
strategy. Complementary pharmacological studies have demon- 
strated an anti-inflammatory profile of some antipsychotics (that 
is, paliperidone, clozapine) with reduced expression and release 
of pro-inflammatory cytokine and stimulated production of anti- 
inflammatory cytokines such as IL-4, -10, and -17. 

We are still far from having a general idea of the role/s of innate 
immune system/inflammation in the aetiology and pathophysiology 
of psychotic disease, and consequently from offering an effective and 
safe treatment for patients based on the modulation of the inflam- 
matory response. There is still a long way to go, with some questions 
and limitations to resolve: 


1. The inflammatory state may vary, depending on the course of the 
disease. There may be compensatory mechanisms or exhaustion, 
so it is needed to study different states of psychotic disease, from 
prodromal stages to first episode to psychosis to full-blown, 
chronic schizophrenia. Efforts should be made to determine if 
symptomatic onset of the disease occurs in a vulnerable brain 
(immunologically/inflammation-primed months/years before 


symptoms manifest) or if a genetically prone subject develops 
symptoms that will increase the deleterious effects of inflamma- 
tion and/or infection. The search for ‘immunophenotypes’ is a 
nascent approach to take into account for the future. 

2. The inflammatory response is highly non-specific and is ac- 
tivated in response to multiple endogenous and exogenous 
factors in the context of psychiatric diseases [30]. Some of 
the parameters that need to be controlled and considered 
are: genetic/epigenetic susceptibility, disease state/duration, 
stress exposure, tobacco/cannabis use, body mass index, type 
and duration of antipsychotic medication, presence of ongoing 
infections, etc. 

3. The great degree of comorbidity between psychotic disease and 
inflammation-related diseases, such as depression, obesity, and 
diabetes, or psychotropic substance abuse, further complicates 
the scenario. 


The microbiota-gut-brain axis and 





immunopathogenesis of psychiatric diseases 


There is evidence indicating that there is a bi-directional communi- 
cation between the brain and the enteric nervous system, including 
the microbiota, which uses neural, hormonal, and immunological 
routes. 

Thus, the microbiota—gut-brain (MGB) axis would be constituted 
by acomplex system of communication between the microbiota, the 
gut, and the CNS, which would be able to regulate gastrointestinal, 
immune, and CNS functions. Among those paths of communica- 
tion between the gut and the CNS are the vagus nerve, the HPA axis, 
the cytokine network, and tryptophan absorption from the gut into 
the peripheral bloodstream, as well as the passage of short-chain 
fatty acids (SCFAs) into the circulatory system [31]. 

There are multiple pathways guiding the descending and as- 
cending directions of the MGB axis in health and disease. In par- 
ticular, from the gut to the CNS, studies have revealed that brain 
activity can be regulated through neural activation of neurons by 
the gut microbiome, and through endocrine (for example, release 
of serotonin), metabolic (for example, microbiota synthesis of 
neuroactive molecules), and immune (for example, systemic inflam- 
mation and CNS-infiltrating immune cells) pathways. 

The MGB axis is also a key player in neurodevelopment, and con- 
sequently it is plausible to consider that the microbiome, and in par- 
ticular early-life events during initial colonization and microbiota 
development, can determine general and mental health in later life, 
which could be playing a crucial role in psychiatric diseases with a 
neurodevelopmental component such as autism spectrum disorders 
and schizophrenia. 

Furthermore, in inflammatory bowel disease (IBD), comorbidity 
with mood disorders, such as depression and anxiety, is common, 
and chronic low-grade inflammation or immune activation that 
underlies the aetiology of IBD is also a driving risk factor in mood 
disorders [32]. Importantly, patients with schizophrenia quite often 
present with gut problems, and irritable bowel syndrome commonly 
co-occurs with psychiatric disorders. Thus, through diverse mech- 
anisms that can include bacterial translocation, the microbiota from 
the gut would be able to affect both local and distal locations within 
the host organism. 
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Stress, which is a main risk factor for psychiatric diseases, induces 
intestinal dysfunction and can increase intestinal permeability. But 
this is not the only factor to consider. Changes in microbiota may 
lead to brain dysfunction through the production of harmful bac- 
terial products that compromise the integrity of the intestinal tract 
(permeable intestine or leaky gut syndrome). Here appears a very 
important concept—the gut microbiota dysbiosis; shifts in micro- 
biota composition and functions from physiological/beneficial 
states to those pernicious to the host’s health are termed dysbiosis, 
and gut dysbiosis may exert a deleterious impact on brain func- 
tioning via the MGB axis. Moreover, it has been recently proposed 
that dysbiosis might contribute to abnormal brain development, 
immune regulation, and metabolic function in psychiatric diseases. 
However, the composition of the gut microbiota has yet to be fully 
characterized in psychiatric patients, and further research is very 
much needed. 


Hypersensitivity to food antigens 





Among the factors influencing the functioning of the MGB axis, 
there is one that shows up—hypersensitivity to food antigens. 
Several studies have found increased immune sensitivity to food- 
based antigens, namely gluten and milk casein, in psychiatric dis- 
eases and, in particular, in patients with schizophrenia [33]. 

Gluten-dependent disorders have been recently classified into 
three main groups, based on their pathogenesis [34]: autoimmune 
reactions, including coeliac disease; allergic reactions, including 
wheat allergy; and non-autoimmune and non-allergic (innate im- 
munity), which includes non-coeliac gluten sensitivity (NCGS) 
or intolerance. Schizophrenia has been repeatedly associated with 
both coeliac disease and NCGS [35]. Indeed, the average preva- 
lence of coeliac disease among patients with schizophrenia is higher 
than that estimated in the general population [36]. In fact, there 
are already meta-analyses showing significantly higher levels of 
some serum biomarkers of gluten hypersensitivity in patients with 
schizophrenia, although more research is very much needed in this 
field [37]. 

On the other hand, elevations in antibodies to milk casein have 
also been described in subjects with psychiatric diseases, mainly in 
patients with schizophrenia. The mechanisms mediating the associ- 
ation between heightened antibodies to gluten/casein and the risk of 
psychiatric disease are not fully understood. For example, in some 
individuals with a psychotic disorder, comorbidity with coeliac dis- 
ease/NCGS may disrupt intestinal permeability (that is, ‘leaky gut’), 
thereby allowing these antigens to enter the systemic circulation and 
generate a humoral immune response, which, in turn, would lead 
to psychotic symptoms. Indeed, gluten stimulates zonulin release, 
probably because it is mistaken for a microbial agent and therefore 
increases gut permeability in humans. 

In summary, there is growing evidence supporting that some of 
the key pathways that are dysfunctional in psychiatric diseases may 
also be regulated by the MGB axis. Microbiota-targeted therapeutic 
interventions may include dietary approaches, probiotics, prebiotics, 
anti-biotherapy, and faecal transplantation. Preclinical studies sug- 
gest that strategies manipulating the gut microbiota hold potential 
preventive applications in psychiatric disorders. Thus, this is a very 
promising field of research. At this time, human interventions in 
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psychiatry are still scarce and have focused on diet modification, 
probiotics, and antibiotics. Importantly, all these interventions are 
currently considered as adjunctive treatments to standard care (for 
example, drugs for depression, antipsychotics). 


Pharmacology 


We are still far from having ideally effective and safe treatments 
to offer to patients with affective disorders and psychosis. There is 
therefore a need for a change in the drug discovery strategy, mainly 
based on a better understanding of the pathophysiology, while, of 
course, taking into account the overwhelming experimental, epi- 
demiological, and clinical evidence suggesting that prolonged in- 
creases in the levels of pro-inflammatory and oxido-nitrosative 
mediators occur in depression, schizophrenia, and other neuro- 
psychiatric diseases. This fact justifies the ongoing clinical trials on 
adjunctive therapies with anti-inflammatory and antioxidant drugs 
in these diseases, including non-steroidal anti-inflammatory drugs 
and anti-cytokine and antioxidant compounds [38, 39] (that is, as- 
pirin, COX-2 inhibitors, N-acetylcysteine, omega-3 fatty acids, and 
others). Although none of these combinations have demonstrated 
to be clearly beneficial, much effort has to be made, from basic to 
clinical research, to address how inflammation and immune dys- 
function affect psychopathology. This would contribute to a better 
understanding of the disease mechanism and to the development of 
new effective interventions [40]. 
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The term functional genomics represents an approach to molecular 
biology that seeks to generate and exploit a wealth of diverse in- 
formation about the function of genes in order to understand the 
relationship between genotype and phenotype. As a genomics ap- 
proach, the methods employed usually involve efforts to systemat- 
ically survey every gene or gene product in the genome and thus 
provide a global view on aspects of gene function. Unlike the static 
genome, gene function is a dynamic process and thus varies in terms 
of both space (for example, cell type) and time (for example, devel- 
opment). For example, we may wish to know both which set of genes 
are switched on in a particular type of neuron and how this set of 
genes changes with age. 

As is often the case in science, our ability to measure is driven 
largely by technological development. For genomics approaches 
where scale is important, it is not just the development of technolo- 
gies that enables us to measure a particular feature, but also further 
developments that enable us to economically apply this measure- 
ment systematically across the genome. Importantly, our ability to 
measure at the scale of the genome fundamentally changes the na- 
ture of the questions we can ask, leading to a step change in dis- 
covery. For example, while a quantitative polymerase chain reaction 
assay allows us to detect changes in the expression of a pre-specified 
gene in diseased tissue, the advent of microarray methodologies en- 
abled us to measure the change in expression across thousands of 
genes simultaneously. Changing the question from ‘is that different?’ 
to ‘is anything different?’ enables us to discover disease-associated 
variation that we might not have thought to look for. This approach 
has been adopted across psychiatric disorders in recent years and 
will be covered in detail in the relevant chapters on genetics in this 
volume. It will form the prelude to understanding the variation in 
function that is likely to constitute the characteristics of disordered 
phenotypes. 

While functional genomics approaches are often simply scaling up 
the detection of a known phenomenon, the technologies developed 
can also extend our knowledge of the basic functional repertoire of 
the cell. For example, next-generation sequencing approaches en- 
abled the unbiased detection of all expressed RNA molecules in the 
cell, revealing the existence of tens of thousands of new genes that do 
not encode proteins and whose roles are currently being elucidated 





[1]. Nonetheless, while many genes express RNA molecules that are 
not templates for translation, the vast majority of our knowledge 
about the roles of genes comes from studying the functions of pro- 
teins, and thus our knowledge is focused towards the subset of genes 
that encode them. This is an illustration of one bias, and although 
genomics approaches are innately less biased, the biases in each ap- 
proach to measuring function should be considered when designing 
and interpreting studies, as discussed in the following sections. 

This chapter provides a gene-centric introduction to functional 
genomics, each section introducing the aspect of gene function 
being measured and then contextualizing variation in the context 
of neuropsychiatric disease studies (Fig. 16.1). Each section, namely 
‘Epigenomics, ‘Gene expression, ‘Proteomics, and ‘Functional an- 
notations, is, in themselves, the subject of dedicated textbooks and 
dozens of reviews, and thus this chapter provides only a succinct 
introduction to several large and rapidly growing fields of study. 











Epigenetic modifications are defined as reversible modifications 
of DNA or DNA-associated proteins which affect gene expres- 
sion without altering the DNA sequence. Epigenetics explains 
why two different cell types possessing exactly the same genetic 
code execute distinct gene expression programmes and diverge in 
their roles. 

There are two important classes of epigenetics modifica- 
tions: DNA methylation and histone modification. The transcrip- 
tional accessibility of DNA is determined by the condensation state 
of chromatin, which is formed from structural units called nucleo- 
somes, in which the DNA is wrapped around eight core histone 
proteins. Post-transcriptional modifications of these histones can 
alter the regional chromatin state between a transcriptionally ac- 
tive open euchromatic state and a transcriptionally silenced closed 
heterochromatic state, thereby impacting gene expression [2, 3]. 
Active and repressive histone marks delineate coding regions, non- 
coding regulatory elements including promoter regions near tran- 
scriptional initiation sites, and proximal and distal enhancers (Fig. 
16.2). In parallel to histone modifications, methylation of the DNA 
base cytosine within promoter and enhancer regions acts to repress 
transcription (Fig. 16.2). 
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Fig. 16.1 An overview of functional genomics. (a) Epigenomic analyses can indicate how poised different genes are being transcribed. Different 
chemical modifications of histones, the proteins that form the scaffold around DNA, and variation in the accessibility of the DNA to proteins that 
regulate and execute gene expression are associated with active or repressed genes (Fig. 16.2). (b) The variation in gene expression by cell type or 

with disease state informs on function. Where the expression pattern of genes are similar, it may reflect their co-ordinated regulation as part of the 
same biological process. (c) While correlated with gene expression level, the abundance of the protein product of a gene can vary due to translational 
regulation and degradation, while the functioning of a protein can be significantly altered by a range of post-translational chemical modification. 

A physical interaction detected between two proteins can indicate that they function together in a common role. (d) Functional genomics information, 
such as the co-ordinated expression of genes, the knowledge that their protein products physically interact, or large-scale information gathered from 
published literature can be used to infer sets of genes likely to function together in the same biological pathway (e). 








Using high-throughput approaches, the Roadmap and ENCODE 
projects have delivered genome-wide human epigenetic maps that 
highlight cell-specific regulatory elements, providing a valuable re- 
source for the interpretation of genetic variation [4, 5]. These maps 
reveal that a large majority of common genetic risk variants identi- 
fied do not affect the protein-coding sequence but are instead local- 
ized in non-coding regions regulating gene expression which these 
variants are likely to alter. Knowledge of the cell-specific utiliza- 
tion of these regulatory elements enables the identification of the 
cell type(s) likely affected by variants within them. For example, the 
108 genetic risk loci reported so far for schizophrenia (SCZ) are en- 
riched in brain-specific enhancer regions, suggesting these variants 
affect the expression of genes in the brain, but also within enhancer 
elements utilized by CD20/CD19 lymphocytes, in turn suggesting a 
link between SCZ and the immune system [6]. 

Inter-individual variation in DNA methylation found in fetal 
brain tissue coincides with SCZ risk loci [7]. However, epigenetic 
variation is dynamic, varying not just with the cell type, but also over 
the life of a cell. A study identifying regions of DNA that were dif- 
ferentially methylated in the prefrontal cortex between individuals 
with and those without SCZ found that these differentially methy- 
lated regions were associated with changes in DNA methylation 
that occurred during the transition from the second fetal trimester 


to postnatal life, implicating this transition as a key developmental 
point in the development of the disorder [8]. 


Gene expression 


The spatiotemporal expression profile of a gene is particularly in- 
formative. Knowing when (for example, in development, in disease 
progression) and where (in which cell types) a gene is switched 
on, or expressed, is key to understanding both its normal and its 
aberrant functioning (Fig. 16.1b). Genomic technologies that 
simultaneously report the expression of thousands of different 
RNA molecules have advanced rapidly and include microarrays 
that detect pre-specified RNA molecules and RNA-seq that can 
sequence the entire population of RNA molecules without pre- 
selection. However, for neuropsychiatric disorders, access to the 
relevant living tissue sample or cell population remains challen- 
ging. Nonetheless, using post-mortem brains, detailed maps of 
spatiotemporal gene expression are now available. For example, the 
BrainSpan atlas of the Developing Human Brain has examined 42 
brains to provide the gene expression profiles of up to 16 targeted 
cortical and subcortical structures across the full course of human 
brain development [9]. 
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Fig. 16.2 Epigenetics marks and gene regulation. Different sets of epigeneti 
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c modifications, or marks, are associated with three states of gene 


regulation. The epigenetic marks can map to the full gene body or only the 5’ part or 3’ part to the promoter region or to a (distal or proximal) 
enhancer [98]. (a) Histone modifications (methylation and acetylations marks) for a transcriptionally active gene. (b) DNA methylation and histone 
H3 methylation are found in the promoter and enhancer region and surrounding the TSS in transcriptionally repressed genes. (c) Bivalent chromatin 
domain defined by the simultaneous presence of histone modifications associated with both gene activation and repression [99]. 


Knowledge of a gene’s expression profile helps the causal inter- 
pretation of genetic variation observed in patients with neuropsychi- 
atric disorders. In SCZ, for example, newly arising genetic variants 
that were observed in patients, but not in either parent, and that af- 
fected protein-coding sequence tended to affect genes that were ex- 
pressed prenatally in the dorsolateral prefrontal cortex (DFC) and 
the hippocampus, with both regions previously implicated in SCZ 
[10]. Gene expression variation can also be detected directly in the 
analyses of post-mortem patient brain tissue. Large-scale studies of 
hippocampal tissue obtained from SCZ, bipolar disorder (BPD), and 
major depressive disorder (MDD) patients have found evidence for 
decreased expression of genes involved in antigen processing, GABA 
signalling, and endocytosis [11]. More specifically for BPD, but also 
reported in some SCZ studies, an upregulation of metallothionein 
genes has been observed in the DFC, while reductions in neuropep- 
tide gene expression has been associated with psychosis [12, 13]. 

Notably, there is little consistency in the observed variation in 
gene expression for the same disorder across different regions of 
the brain. Moreover, even within the same region, there is con- 
siderable variability among post-mortem gene expression studies, 
likely due to experimental variation, the relatively small numbers 


of samples used, and the effects of varying patient medications. For 
post-mortem studies on neuropsychiatric disorders, and especially 
so for neurodegenerative disorders, it is also unclear what point in 
the progression of a disease is being studied, and thus whether any 
observed variation in gene expression relates to the upstream causes 
or the downstream consequences. Genetic and epigenetic evidence 
discussed above points to key prenatal or early postnatal aetiological 
events for many neuropsychiatric disorders which may be undetect- 
able in adult post-mortem brain tissue. For neurodegenerative dis- 
orders, onset precedes the diagnosis by many years, making it hard to 
discern whether upregulated processes, such as neuroinflammation, 
are contributing causally to the pathology or are instead protective 
mechanisms, a vital distinction when targeting therapeutic inter- 
ventions. Furthermore, the cell type composition can change as a 
consequence of a disorder, for example with the death of neurons in 
neurodegeneration, which can then dominate the comparison with 
healthy tissue [14]. Accordingly, it is reassuring when variation in 
gene expression in post-mortem tissue can be linked to predisposing 
genetic variation. For example, genetic variation in CACNAIC is 
associated with an increased risk of BPD, while the expression of 
this gene has also been found to be downregulated in BPD patients’ 


brains [15, 16]. In other cases, the genetic support can be indirect; 
TYROBP has been found to be upregulated in the brain of late-onset 
Alzheimer’s disease patients and interacts with the Alzheimer’s 
disease-risk gene TREM2 [17, 18]. 


Co-expression networks 


The expression of multiple genes can be co-ordinated within the cell, 
for example in order to deliver all the required subunits necessary 
to form a functioning protein complex or to put in place all of the 
enzymes required at each step in a biosynthetic pathway. Thus, the 
co-expression of genes, that is genes whose variation in expression 
levels is correlated, is a commonly used approach to divine genes 
that might function together with a common purpose [19]. The de- 
gree of co-expression is measured largely as the correlation coeffi- 
cient of those gene expressions, as measured at multiple instances, 
for example their correlated expression across multiple tissues over 
time during a biological process such as development/ageing or 
disease progression, or simply from repeated recordings over mul- 
tiple replicates of the same experiment. Often, co-expression is rep- 
resented in a network diagram, with genes depicted as nodes and 
where connections, or edges, between nodes represent the fact that 
the connected genes are co-expressed (Fig. 16.1d). A group of genes 
connected together within a network through their correlated ex- 
pression patterns is often referred to as a co-expression module [20]. 
As compared to variation in a single genes expression, examining 
the variation of a group of co-expressed genes that form a module is 
both more robust to noise affecting the variation of individual genes 
and also enables the study of variation of the shared functionality 
captured by the module. Furthermore, following the reasoning that 
genes that are co-expressed are more likely to function together, 
genes within a module whose functions are unknown can be in- 
ferred from those genes with known functions [21]. This process of 
inferring function is commonly termed ‘guilt by association’ and is 
an often used approach for filling in gaps in knowledge in functional 
genomics (for example, see the section on protein-protein inter- 
actions in this chapter). 

Study designs for exploiting gene co-expression methods to under- 
stand disease have taken one of two approaches. In the first, gene co- 
expression modules may be formed from healthy individuals and 
then changes in expression level of multiple genes within a module 
within diseased tissue are used to infer up- or downregulation of that 
pathway. Large-scale gene expression data sets, suchas the BrainSpan 
database, enable the identification of genes whose expression pat- 
terns are co-ordinated across regions of the brain and throughout 
development, enabling the construction of gene-co-expression 
modules, capturing spatiotemporally co-ordinated gene modules in 
the healthy developing brain. Mapping mutations observed in SCZ 
patients onto the genes within these modules identified that these 
mutations often disrupted module functioning during fetal devel- 
opment within dorsolateral and ventrolateral prefrontal cortex [22]. 
Within Alzheimer’s disease, co-expression analyses of post-mortem 
tissue have recurrently identified disrupted modules of genes associ- 
ated with decreasing synaptic plasticity, while those associated with 
immunity and microglia increased [23, 24]. As noted, care must be 
taken in the interpretation, as a decrease in the expression of genes 
across a cell type-specific pathway might be due to the loss of the 
cell type expressing that pathway, rather than the downregulation of 
genes within that pathway within the relevant cell type. 
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In an alternative approach, studies can consider differential co- 
expression, which looks for changes in the co-ordination of gene 
expression that suggest that the regulatory relationships between 
genes that were co-expressed in healthy tissue have been altered in 
the diseased tissue [25]. For Alzheimer’s disease, this approach to 
identify the rewiring of gene regulation in disease found that many 
gene modules involved in immunity and microglia function were 
differentially regulated in patient post-mortem brain tissue [24]. 


Proteomics 


Proteins are the enactors of the central dogma, and our efforts to 
measure the expression of genes are largely in order to estimate the 
abundances of these genes’ protein products. The relationship be- 
tween mRNA abundance and protein abundance is confounded by 
several factors, including the rates of translation and protein deg- 
radation, both of which are highly variable, depending upon the 
cell state. Nonetheless, in a steady state, studies suggest that gene- 
gene variation in protein abundances within a cell are primarily 
determined by those genes’ relative mRNA expression. Specifically, 
upwards of 40% of the variance in protein abundances can be ex- 
plained by mRNA levels [26-28]. After accounting for the delay be- 
tween gene expression and protein synthesis, variation in mRNA 
levels explain around 80% of the change in protein abundance when 
a cell transitions between states [29, 30]. While this correlation is 
encouraging, protein activity is not solely determined by abundance. 
Dynamic post-translational modifications, such as cleavage or re- 
versible covalent additions of chemical side groups or lipids, are key 
regulators of their activity that are not captured by the transcrip- 
tome. For example, as described earlier, acetylation and methylation 
of histone proteins significantly modify their functional association 
with DNA. Furthermore, identification of the cellular location(s) of 
a protein and its molecular partners can reveal much about its role. 

Compared to reading the transcriptome, measuring protein 
abundance is far more challenging due to the diverse physiochem- 
ical properties of the constituent amino acids. Nonetheless, over 
the last 10 years, advances in mass spectrometry-based approaches 
have enabled the simultaneous estimation of the abundances of over 
10,000 proteins. A clear example contributing to the understanding 
of neuropsychiatric disorders is the proteomic profiling of the neur- 
onal post-synaptic density (PSD) from the human neocortex [31]. 
While some of the identified PSD proteins were known at the time of 
the study to have roles in neurological disorders, subsequent genetic 
studies identifying genes found to be disrupted in individuals with 
SCZ have found that many of these genes’ proteins are found in the 
PSD, giving a clear indication of the aetiology [32, 33]. 

As with transcriptomics, the differential abundances of proteins 
within post-mortem neuronal tissue yield insights into the causes 
and consequences of neuropsychiatric disease. For the neuropsychi- 
atric disorders SCZ, BPD, and MDD, these studies have revealed 
hundreds of proteins differentially abundant in patient brain tissue. 
In common, a meta-study identified a small number of proteins in- 
volved in forming and maintaining the myelin sheath, supporting 
a role for oligodendrocyte dysfunction in these disorders [34-36]. 
For SCZ more specifically, variation in the abundance of 14-3-3 
mediated-signalling proteins was found, for BPD variation in mito- 
chondrial function, and for MDD in oxidative phosphorylation 
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pathways [34]. As with transcriptomics, there is significant variation 
between individual studies and between the particular brain regions 
studied, but for proteomics, this is likely amplified by the current 
variability in multiple reaction monitoring-based proteomic meth- 
odologies [28, 37]. 


Protein-protein interactions 


Knowing which proteins physically interact with each other helps 
to identify groups of proteins that are functioning together in the 
same biological process. The set of protein interactions within the 
cell is commonly referred to as the interactome. These interactions 
can be transient, for example a receptor reversibly binding a ligand, 
or obligate such as the persistent binding between subunits of a 
stable protein complex. The causal implication in a disease of mul- 
tiple genes whose protein products interact provides insight into 
the perturbed molecular process(es), and the functional concord- 
ances between implicated genes give added confidence to their role 
in the same disease. There are many methods to experimentally 
determine protein-protein interactions (PPIs) [38-40]. In vitro 
methods often involve the use of a bait or tagged protein that is 
drawn out of a cell extract, along with any other proteins that are 
directly or indirectly attached to it [for example, tandem affinity 
purification and mass spectrometry (TAP-MS), affinity chromatog- 
raphy, co-immunoprecipitation]. In vivo methods include the Yeast 
2 Hybrid (Y2H) approach. Here, a transcription factor is cleaved 
into a binding domain and a separated activating domain, and each 
of a pair of query proteins is fused to one of the two domains. If 
the two query proteins interact, the two transcription factor do- 
mains are brought close to each other and transcription is initiated. 
However, current methods for detecting PPIs are prone to noise and 
high rates of false positives, for example due to artefactual inter- 
actions detected between proteins that do not reside in the same 
cellular compartment, and thus corroborative support is often 
sought to validate PPIs [41, 42]. 

An early example of the utility of PPIs in providing insights into 
neuropsychiatric genetics is an interactome study performed for the 
Disrupted in Schizophrenia 1 (DISC1) protein [43]. Proteins that 
interacted with DISC1 were identified using Y2H assays, and then a 
subset of these interactors were selected and their interactors simi- 
larly identified. Many of the proteins within the extended DISC1 
interactome had roles in synaptic-associated functions such as 
cytoskeletal stability and organization and intracellular transport, 
thereby implicating DISC1 in these processes through ‘guilt by as- 
sociation whereby the functionality of a protein is inferred through 
the known functions of the proteins with which it interacts. 

PPIs detected in large- and small-scale studies have been col- 
lated within several large, publicly available databases [for example, 
human protein reference database (HPRD) [44], BioGrid [45]). 
Although fewer than 100,000 interactions are currently known, 
which represents a fraction of the estimated complete interactome 
[46], these experimentally determined interactions can be supple- 
mented by computationally predicted interactions to provide a key 
resource for identifying common functions between genes impli- 
cated in neurological disease [39]. For example, genome-wide as- 
sociation (GWA) studies have been phenomenally successful in 
identifying regions of the genome that are associated with the risk 
of developing SCZ. However, these regions often contain many 
genes, and so identifying which of these genes is more specifically 


associated with disease risk remains a current bottleneck in trans- 
lating these genetic findings into actionable drug targets. By exam- 
ining the interactions of the proteins expressed by genes within risk 
loci identified with SCZ GWA studies with the proteins of genes that 
have been implicated in SCZ in past studies, one study identified 
interactions between 76 GWA genes and 25 previously implicated 
SCZ genes [47]. Such studies are able to both propose the causally 
associated gene within these GWA regions and identify common 
functions among the sets of interacting proteins, here for example 
synaptic plasticity, neurotransmission, and inflammation, that may 
be disrupted in the disease. 

The vast majority of human genes express not just one transcript 
but instead can selectively include or exclude different exons, to gen- 
erate multiple alternatively spliced transcripts. Alternatively spliced 
transcripts are expressed in a tissue-specific manner, with many 
genes expressing a dominant transcript in a particular tissue. Of par- 
ticular relevance, tissue-specific splicing are most often observed in 
brain cells with roles in both development and ageing [48-54]. By 
varying the protein sequence, the set of resulting protein isoforms 
expressed by a single gene can demonstrate remarkable variation that 
impacts on function. For example, by varying the inclusion of inter- 
action surface, protein isoforms can interact with alternative sets of 
proteins, changing the topology of the network [50, 55, 56]. Such 
tissue-specific, isoform-specific interaction networks are important 
in understanding cell type-specific gene function, and therefore in 
understanding the functional impact of genetic variation underlying 
neuropsychiatric disorders [57]. 


Literature annotations 


Every day, hundreds of research articles are published, with each art- 
icle reporting the results of molecular and/or genetic experiments, 
adding to a body of knowledge about the function of a single gene 
or a small number of genes. We are unable to read them all and we 
are unable to remember all that we have read, and what we can re- 
member is highly biased. There is a huge diversity in experimental 
approaches, reflecting the many questions that might be asked. For 
example, these articles might report on a study that localizes a pro- 
tein to a particular cellular structure, identifies a role for a protein 
within a specific biological function, or details the phenotypic con- 
sequences arising from the protein’s disruption within a cellular 
model, a model organism, or a human patient. However, in order to 
ask questions about groups of genes, such as whether a set of genes 
found to be disrupted in patients with a particular disease are asso- 
ciated with a particular function, the information about gene func- 
tion, the language of function, must be consistent between genes. 


Functional annotations 


There have been several attempts to organize the wealth of heter- 
ogenous information about the function of genes and their ex- 
pressed proteins. Most commonly, this involves the creation of an 
ontology, a defined vocabulary that describes a set of entities, or 
properties, and the interrelationships between them. The most suc- 
cessful and commonly used in molecular biology is Gene Ontology 
(GO) [58, 59]. Here, the properties of gene products, most com- 
monly proteins, are defined within three ontologies. A protein’s ac- 
tion at the molecular level, such as binding or catalysis, is described 


within Molecular Function ontology; where it is located within (or 
outside) a cell, it is described with Cellular Component ontology, 
while terms within the more abstractly defined Biological Process 
ontology are used to describe the higher-level functioning of a pro- 
tein such as signal transduction or regulation of gene expression. 
For example, if we consider the key metabolic enzyme malate de- 
hydrogenase, its Molecular Function terms include oxidoreductase 
activity; its Cellular Component terms include mitochondrion, 
while its Biological Process terms include tricarboxylic acid cycle. 
The relationships between terms within the ontology allow proteins 
to logically acquire additional annotation terms. For example, the 
GO molecular function ontology defines that the mitochondrion 
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is a cytoplasmic part, which is an intracellular part, and thus a gene 
whose protein is assigned with the GO property of mitochondrion 
logically inherits both the GO properties of cytoplasmic part and 
intracellular part. A fraction of the GO annotation tree for the 
Biological Process term ‘positive regulation of transmission of 
nerve impulse’ is illustrated in Fig. 16.3 where the inherited anno- 
tation terms can be seen. 

While GO successfully captures a huge amount of information 
about human gene function, unarguably more than any other 
single information resource, users exploiting GO functional an- 
notations should be aware of the evidence for a functional anno- 
tation. Each GO annotation is associated with an evidence code, 
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which can be broadly split into those codes derived from published 
experiments and those derived by computational predictions. As 
of writing, almost 16,000 out of a total of 22,000 human protein- 
coding genes are annotated with at least one GO annotation that 
has been derived from published experiments. However, for al- 
most 5000 human protein-coding genes, only computationally 
predicted functional evidence is available, which should be viewed 
with less confidence. For 1500 genes, GO provides no functional 
evidence at all, experimental or inferred. Furthermore, while 
there is thus some functional information ascribed to over 20,000 
protein-coding genes, this evidence is likely very incomplete for 
all but a small fraction of genes, with genes of more interest to 
the community (for example, those involved in disease) receiving 
far more experimental attention, and thus acquiring more func- 
tional annotations, than those attracting less interest. It is often 
necessarily assumed in functional genomics that genes that are not 
annotated with a particular function do not perform that func- 
tion, but this assumption is flawed: The absence of evidence is not 
evidence of absence. Compounding this, there is a strong bias to- 
wards publishing positive, rather than negative, findings, in this 
case showing what a gene does, rather than what it does not do. 
Accordingly, more systematically collected functional evidence, 
such as a genome-wide survey of gene expression, can provide a 
more robust, albeit significantly less functionally detailed, infor- 
mation source. Despite these concerns, GO remains the current 
standard for ascribing gene function, and almost all the functional 
associations of genes with disease described in this chapter, for ex- 
ample sets of genes whose expression or protein abundances are 
found to be different or dysregulated in diseased tissue, have been 
identified from the GO annotations of these genes. 


Pathway databases 


Although GO aims to capture the function of a gene, it does not 
embody well the functional relationships between genes. While GO 
will annotate several genes as belonging to the same biological pro- 
cess, for example the tricarboxylic acid cycle, it does not inform on 
the order in which the enzymes in this fundamental energy cycle 
act, which is key to understanding the biological process. To ad- 
dress this, pathway databases have been created that map out the 
constitutive genes, describing a set of interacting genes/products 
that together perform a particular biological function. Historically, 
the most well-used of these is the Kyoto Encyclopaedia of Genes 
and Genomes (KEGG), which attempts to map out the literature- 
reported relationships between genes within a diverse collection 
of metabolic and cellular processes, including processes associated 
with specific diseases [60]. More recently initiated databases, for ex- 
ample Reactome [61], attempt to more systematically map out the 
molecular operations of gene products into a hierarchical level of 
detail, similar to that described previously for GO. However, as dis- 
cussed for GO annotations, pathway annotations are very incom- 
plete and suffer from significant ascertainment biases. Furthermore, 
the biological processes that these pathway concepts seek to describe 
are often active under specific conditions and/or within particular 
cell types [62], information that is often missing. Nonetheless, from 
a deterministic perspective, where the aim is to understand the con- 
sequences of a genetic variant, understanding the relationship of 
that gene to the biological process(es) in which it participates will 
clearly aid in the interpretation. 


Model organism databases 


Although far beyond the remit of this chapter to discuss the utility 
of model organisms for individual neuropsychiatric disorders (for 
reviews, see [63-70]), the ability to determinedly introduce genetic 
variation into an organism is a robust approach to understanding 
the relationship between genotype and phenotype. The evolutionary 
conservation of the protein-coding sequence of orthologues be- 
tween two species implies that the function of these genes is simi- 
larly conserved between those species, and thus knowledge of the 
gene in one species can be transferred to the other. This conjecture 
holds best between unique 1:1 orthologous genes, defined as genes 
in two species that have been maintained without duplication since 
those species diverged from a cenancestor [71-73]. While the func- 
tional genomics resources described in this chapter thus far are fo- 
cused on molecular and cellular annotations, from the expression of 
genes through to the functional roles of their protein products, often 
it is not immediately obvious how the disruption ofa cellular process 
might manifest a complex behavioural abnormality. Thus, model or- 
ganisms offer a more direct description of the influence of genetics 
and environment upon behaviour. 

Due to their ready genetic manipulation, easy laboratory manage- 
ment, and study scalability, genotype/phenotype relationships have 
been extensively explored in the lower model organisms, namely 
the fruit fly Drosophila melanogaster, the nematode Caenorhabditis 
elegans, and the yeast Saccharomyces cerevisiae. From a functional 
genomics perspective, the knowledge amassed about the function of 
individual genes from studying these model organisms has contrib- 
uted to a genome-wide body of knowledge that has been collated for 
each model into community-supported databases (namely FlyBase 
[74], WormBase [75], Saccharomyces Genome Database [76], and 
Zfin [77]). Much of this knowledge forms a major contribution to 
functional and pathway annotation resources. 

From a neuropsychiatric perspective, rodent models are pre- 
eminent in the study of relevant disorders, due both to their more 
comparable neuroanatomy and to behaviours that are more compar- 
able to neuropsychiatric presentations. From a functional genomics 
perspective, the systematic collation from published reports by the 
Mouse Genome Database (MGD) of the phenotypes resulting from 
determined genetic manipulation of several thousands of mouse 
genes offers an excellent resource for neurological associations [78]. 
Currently, the MGD holds phenotypic associations for the unique 
orthologue of over 9000 human genes and reports this phenotypic 
information within structured ontology, enabling computational 
analyses [79]. As with other literature sources of gene functional in- 
formation, the original studies upon which these data are based were 
hypothesis-led, and thus only a specific set of phenotypes would 
have been examined in each study and reported on. Nonetheless, 
aetiological insights into neuropsychiatric disorders can be robustly 
obtained, for example an excess of genes whose orthologues’ disrup- 
tion in the mouse yielded synaptic dysfunction has been identified 
among genes found to be deleted or duplicated in individuals with 
autism [80, 81]. Of note, for functional genomics approaches, sig- 
nificant efforts are ongoing by the International Mouse Phenotyping 
Consortium (IMPC) to systematically pass each mutant mouse line 
through a series of standardized phenotyptic tests of broad relevance 
to human disease, with results from the first 1750 knockout mouse 
models recently reported [82-85]. A key criticism of functional 


annotation resources amassed from individual studies is that where a 
gene is not annotated with a particular function, researchers might as- 
sume that that gene is not associated with that function when, in fact, 
the association has never been tested. Thus, although the phenotyping 
of these models is inevitably incomplete, the IMPC’s efforts to anno- 
tate gene function are hugely powerful, as the systematic testing also 
provides evidence that a phenotype is not associated with a given gene. 


Future perspectives 


Functional genomics is a particularly technology-driven field. New 
methods are frequently published, enabling new properties of the 
genome to be assayed or known properties to be examined on an 
increasing scale. The overwhelming volume of biological data being 
generated consequently challenges bioinformaticians to develop new 
analytical techniques able to integrate these diverse data types and 
make these data coherently available for researchers to query. Thus, 
while the promises of new technologies are often noisily trumpeted, 
our organization and standardization of data, for example gene func- 
tion within GO or organismal phenotype within the Mammalian 
Phenotype Ontology or the Human Phenotype Ontology, are also 
important in gaining new biological understanding. 

Several current technologies are promising to revolutionize mo- 
lecular biology and functional genomics and are worth drawing at- 
tention to in the context of neuropsychiatric disease. An outstanding 
problem in the study of the brain is accessibility to brain tissue. As 
discussed, post-mortem tissue may better reflect the consequences of 
disease, rather than yield insights into the causes. The development 
of induced pluripotent stem cells (iPSCs) is enabling researchers to 
create cultures of patient-derived neurons in the laboratory [86-88]. 
Here, an accessible non-neuronal tissue is sampled from a patient 
and then these cells are coerced, through a cocktail of bioagents, 
into a pluripotent state, able to be subsequently driven down a cell 
lineage towards a neuronal cell identity. While there are many sig- 
nificant differences between iPSC-derived neurons and native adult 
neurons, these cell models are promising significant insights into 
neuropsychiatric disorders, for example, mimicking the differential 
lithium responsiveness of bipolar patients [89, 90]. 

While epigenomic, transcriptomic, and proteomic studies have 
yielded many insights into neuropsychiatric disease, a criticism 
already discussed when comparing healthy and diseased tissues is 
that differences in the underlying cell types that form these tissues 
are obscured when measuring across the whole tissue combined. 
Recently developed single-cell approaches allow the epigenetic and 
transcriptomic profiles of individuals cells to be obtained, enabling 
the cell type to be determined from the cell type-specific marker 
genes and particular and specific populations of cells to be examined 
[91-93]. Elsewhere, our ability to manipulate the genetic sequence 
in disease models to introduce or remove, and thus explore the ef- 
fects of, disease-predisposing mutations has been revolutionized in 
terms of ease and scale by the development of the clustered regu- 
larly interspaced short palindromic repeat (CRISPR)/Cas system, 
a prokaryotic antiviral defence mechanism whose constituent pro- 
teins are remarkably adept at introducing genome edits [94-96]. By 
combining these approaches, we might, for example, derive a cul- 
ture of neuronal cells via iPSCs from a patient. If we have a can- 
didate disease-predisposing genetic variant in that patient, then we 
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could edit the genome of the cells in a parallel culture to remove this 
variant and then compare the effect of this variant on the individual 
neuronal cell types (neurons, glial cells). This experiment is certainly 
not trivial, but remarkably it is now possible. 

While these, and many other, developments are exciting, we should 
resist the urge to generate data simply because we can. These new tech- 
niques bring substantial challenges to the bioinformatics interpret- 
ation of the data, for example huge variability in cell models and high 
noise in the single-cell readouts [97], and thus answers may not simply 
drop out. We must make sure that our applications of functional gen- 
omics are both well powered and hypothesis-led and, in particular, 
that we increase the informatics, mathematic, computational, and 
statistical expertise required to interpret these fantastical experiments. 
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Cognitive neuroscience is the scientific discipline concerned with 
understanding the neural mechanisms involved in supporting 
human mental function. The discipline therefore rests on a reduc- 
tionistic stance, or at least on the assumption that there are prin- 
cipled relationships between the objective, physical brain and the 
subjective, functional mind. The overarching aim is to build mech- 
anistic bridges connecting the mind to the workings of the organ 
from which it arises and to the behaviours that arise from it. 

Cognitive neuroscience has two primal roots. The first consisted in 
the observation that damage to the brain can lead to highly selective 
deficits in high-level human cognitive behaviours such as reasoning 
and comportment [1], language production and comprehension 
[2, 3], the recognition of complex objects [4-6], and the compre- 
hension of spatial relations [7, 8]. The second was the development 
of rigorous quantitative scientific experimental methods to identify 
and measure elemental processes in human behaviour [9-11]. The 
two roots came together in the elegant research in neuropsychology 
and behavioural neurology, which used rigorous psychophysics and 
cognitive psychology to understand associations and dissociations 
in elemental cognitive functions in individuals with different pat- 
terns of brain damage [12-14]. 

A third root was being established at around the same time but 
took longer to mature—the development of methods to measure ac- 
tivity in the human brain. Early attempts were made to measure blood 
circulation, based on the belief that it was linked to brain activity 
[15-17]. The coupling between brain activity and local circulatory 
changes was subsequently scientifically proven [18] and eventually 
supported the development of effective methods to measure haemo- 
dynamic responses using PET [19-21] and fMRI [22]. A comple- 
mentary set of methods was developed to measure the electrical 
signals of the brain directly [23, 24]. Once established, modern-day 
brain imaging and neurophysiological methods provided a signifi- 
cant impetus for the establishment of cognitive neuroscience. 

The term ‘cognitive neuroscience’ was famously coined in a 
New York taxi in the late 1970s when a renowned neuroscientist 
(Michael Gazzaniga) and cognitive psychologist (George Miller) 
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were on the way to a dinner meeting aimed at promoting studies into 
how the brain enables the mind (https://www.cogneurosociety.org/ 
background/). The name stuck, and the field thrives. 

The enormous scope of cognitive neuroscience prevents a com- 
prehensive review. The chapter therefore focuses on highlighting 
the emerging principles of the brain—mind relationship and the ever 
improving methods with which to investigate them. The conceptual 
and methodological breakthroughs provide a foundation for an ex- 
perimental medicine approach into understanding the neural basis 
of psychiatric conditions and their treatment. 
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The neural mechanisms that contribute to mental function (and 
dysfunction) can be investigated at various levels of organization. 
Starting from the most basic unit, it is possible to look for contrib- 
uting factors linked to molecules, biochemical cascades, cellular 
structures (for example, synapses), cells, local microcircuits, brain 
regions, large-scale brain networks, behaviour of individuals, social 
groups, and culture. There is no leading candidate for the most rele- 
vant level of organization. Instead, it is clear that factors at any of 
these levels can provide important insights into understanding the 
brain-mind relationship. 

As an example, consider the case of human memory. The prop- 
erties of the NMDA receptor molecule are essential for enabling 
changes in synaptic strength as a function of association between 
the activities of two interconnected neurons [25, 26]. Donald Hebb 
[27] had famously proposed such a mechanism to be able to sup- 
port long-term associative learning in the brain. Similar, though less 
technically detailed, proposals had also been put forth earlier [28]. 
Biochemical cascades, such as those involving calcium/calmodulin- 
dependent protein kinase II (CaMKII) are essential for transforming 
the calcium signal entering via the NMDA receptor channel into 
stable changes in synaptic transmission and structure [29]. The syn- 
apse serves as the unit of change, and different algorithms for plas- 
ticity may be manifest at different types of synapses, with different 


consequences for memory. For example, the rules for plasticity 
differ at the synapse between Schaeffer collaterals between CA3 and 
CA1 pyramidal neurons vs at the synapse between the mossy fibres 
from the granule cells of the dentate gyrus and the CA3 pyramidal 
neurons. What learning properties arise is highly dependent on the 
way both principle cells and interneurons are interconnected in the 
local microcircuitry of brain regions (for example, entorhinal cortex, 
dentate gyrus, CA3, and CA1) [30, 31]. For example, in the CA3 re- 
gion, the sparse pattern of potent connections between mossy fibres 
and pyramidal cells, together with the dense pattern of recurrent 
connections among CA3 pyramidal neurons, has been proposed to 
support sequential learning and auto-associative memory [32, 33]. 
Different brain areas are essential for supporting different types of 
memory. The hippocampus is essential for forming new long-term 
episodic memories [12, 34], whereas the cerebellum is required for 
certain types of sensorimotor forms of conditioning [35]. These 
brain areas do not operate in isolation, but within large-scale net- 
works of brain regions, in which activity injected by other brain 
areas can modulate the qualities and strength of memories. For ex- 
ample, the prefrontal cortex, which is linked to the hippocampal re- 
gion, can influence the integrity of source and temporal context of 
memories [36-38]. None of these neural insights would have come 
about without the observations of human behaviour. The patterns 
of dissociation among affected vs preserved memory functions in 
individuals after brain damage were essential to reveal the plurality 
of memory and to guide its taxonomy. Social, and ultimately cul- 
tural, factors strongly influence what individuals end up encoding 
and storing in memory [39, 40], revealing the fact that memory is 
biased towards items and events that are relevant to individuals, and 
therefore proactive and adaptive, rather than passive and impartial. 

Given that all levels of organization are informative, research at 
different levels is equally valid. The ambition is to abstract and for- 
malize the relevant computations at different levels and incorporate 
their contributions as we move to increasing scales. This is not yet 
possible with any certainty of capturing the true essence of all bio- 
logical mechanisms involved, though the use of biophysical and 
mathematical models is a powerful way to test and refine ideas about 
computations at different levels. For the empirical scientist con- 
ducting research at a given level, the most prudent way forward is to 
recognize the important constraints and influences brought to bear 
by the other levels of organization. In the case of human cognitive 
neuroscience, most of the methods provide access to macroscopic 
levels of organization, ranging from brain areas to human behaviour. 
However, these can also be deployed to understand neural changes 
as a result of molecular manipulations or changes, as in the case of 
genetic risk factors or pharmacological interventions. 


Structure and function 


Atall levels of brain organization, structure and function are bound. 
Thus, in order to understand neural mechanisms of cognition, one 
must consider the anatomy, physiology, and pharmacology. On a 
macroscopic scale, the anatomical pathways and connections be- 
tween areas provide the scaffolding on which neural computations 
unfold. Furthermore, the anatomical pattern of connections to 
and from a given brain area is a primary determinant of its func- 
tional role [41, 42] for contemporary partitioning of human brain 
anatomy. The nature, intensity, and timing of physiological signals 
exchanged within and among brain areas through the anatomical 
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highways carry the contents of neural information. These deter- 
mine our behaviours and our mental experience. The carriers of the 
physiological signals are neurotransmitters and neuromodulators 
acting through their various receptors. The main excitatory neuro- 
transmitters in the brain—glutamate and GABA—are responsible 
for the bulk of neuronal communication in the human brain, but 
neuromodulators such as dopamine, serotonin, noradrenaline, and 
acetylcholine play essential roles in shaping the specificity, gain, 
and plasticity of the communications. Thus, when considering the 
neural basis of psychological functions, it is essential to consider 
the anatomy, physiology, and pharmacology. As a corollary, when 
attempting to understand psychological or psychiatric disorders, 
it is necessary to understand the interplay of these three essential 
ingredients. 


Networks 


The intuitive approach to understanding brain-mind relationships 
remains stuck in phrenology [43]. We gravitate towards under- 
standing the contributions of individual brain areas to complex cog- 
nitive functions, and many scientists continue to focus on individual 
areas. But it is clear that brain areas are not independent functional 
modules within the brain. Instead, they are interconnected within 
large-scale networks. Functional specializations of individual brain 
areas come together and interact within networks. 

Networks can be considered effective functional units in their 
own right, the integrity of which correlates with complex human 
psychological functions most directly [3, 44, 45]. Functional brain 
networks are supported by the patterns of anatomical connectivity 
in the brain but are also assembled flexibly according to the con- 
text of the incoming stimulation and task demands. Some groups 
of brain areas have a strong and close allegiance with one another, 
working closely together to support given functions. Examples 
would be the left hemisphere-dominant language network for 
speech comprehension and production [46, 47] and the right 
hemisphere-dominant network for the control of spatial attention 
[48, 49]. However, new allegiances can be formed flexibly and spe- 
cific brain areas may be co-opted, depending on the nature of in- 
formation or computations required for the current task. Indeed, 
brain areas need not be exclusively dedicated to one network and 
may contribute to multiple networks that make use of their elem- 
ental computational functions. One example is the inferotemporal 
cortex, noted to contribute to networks involved in visual, working 
memory, and semantic tasks [50-53]. 

Large-scale networks have important and distinctive properties. 
Within networks, the categorical differences between the func- 
tions of contributing brain areas can become blurred and difficult 
to pinpoint. The network architecture can provide resilience to 
loss of function by providing compensatory mechanisms, unless 
critical or multiple nodes are damaged. When behavioural deficits 
occur, these can result from lesions that damage critical brain areas 
within the network or from lesions that disrupt their connections 
to other regions. Because of the multiple specialized nodes and 
their interconnections, behavioural symptoms can be varied and 
dissociable [49, 54]. 

The network pattern of organization is recapitulated at lower levels 
of organization. Individual neurons are embedded within func- 
tional ensembles and circuits. In many cases, neuronal ensembles 
may be assembled flexibly, though constrained by their anatomical 
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connectivity. At an even finer grain, active synapses form dynamic 
network structures that can support specific patterns of communi- 
cation and plasticity [55]. 


Functional specialization 


Although it is no longer sensible to think of brain areas as being 
fully responsible of delivering specific mental capacities in a phreno- 
logical sense, it is clear that brain areas contribute specialized func- 
tions to the networks within which they are embedded. The shift 
from functional localization to functional specialization may appear 
subtle, but it is fundamental. Functional specialization of brain areas 
is determined by multiple factors: the pattern of inputs it receives 
from its various afferent regions, the neuromodulatory influences, 
the mappings of functional units within the region, the intrinsic 
microcircuitry that determines how information is transformed 
locally, and the pattern of outputs to its various efferent regions. 
Though we have made tremendous progress in sketching the func- 
tional contributions of many brain areas, our knowledge of their 
functional specialization is largely still incomplete. 

Functional specialization occurs at different levels of organiza- 
tion. Previously, we considered how brain areas have specialized 
functions, but the same is true at higher and lower levels of or- 
ganization. At higher levels of organization, we have functional 
brain networks that perform different transformations of their 
characteristic patterns of stimulation. Higher still, in humans, we 
observe specialization at the level of brain hemispheres. Though 
the precise origins and determinants of hemispheric specializa- 
tion remain mysterious, certain brain networks and functions dis- 
play patterns of hemispheric dominance across the population. 
Language, for example, tends to be left hemisphere-dominant 
[2, 15], whereas spatial attention tends to be right hemisphere- 
dominant [48, 56, 57]. Anatomical differences, such as the 
torque [58], accompany these functional differences, and disrup- 
tion during the development of normal hemisphere dominance 
has been suggested as a possible cause for disorders of mental 
health [59]. 

At lower levels of organization, functional specialization occurs 
within subregions of brain areas, and at cellular and molecular levels. 
Let us focus on the hippocampal system as an illustration. In the 
main afferent to the hippocampus proper—the entorhinal cortex— 
spatial responses of neurons are organized in a grid-like fashion, 
which has been proposed to provide a scaffolding for constructing 
maps of spatial relations out of external stimulation [60]. Within in- 
dividual regions of the hippocampus and closely related structures, 
neurons display different types of functional specialization, coding 
for spatial locations within the environment [61, 62], interval dur- 
ations [63], or head direction [64]. Subcomponents of hippocampal 
cells are also specialized such as the giant synapses between the 
dentate mossy fibres and the ‘thorny excrescences’ on the proximal 
dendrites of the CA3 neurons and the exuberant pattern of recur- 
rent collaterals within the CA3 region itself. At the molecular level, 
the NMDA receptor channel, densely packed in some hippocampal 
subregions, provides a unique molecular logical ‘AND’ gate, sensing 
the coincidences of presynaptic and post-synaptic activity, and is 
thus well suited for gating synaptic plasticity [25, 26, 65]. The com- 
putational contributions and implications of these various levels of 
specialization are not fully understood and remain an area of ac- 
tive research. Interestingly, cognitive neuroscientists are beginning 


to explore whether some of these specializations, which have been 
observed within the context of spatial navigation and memory in 
animal models, may also contribute to the mapping of other types 
of functional relationships, and therefore underpin cognition more 
broadly [66-68]. 


Concurrent processing 


Another hallmark of network organization is the rich, multidir- 
ectional, and concurrent flow of information among its various 
nodes. It is intuitive to think of the brain as operating on simple sig- 
nals extracted from the external stream of information in an orderly 
way, reconstructing increasingly more complex and abstracted fea- 
tures and dimensions, and stitching these together into cohesive 
episodes. But, at best, this is an oversimplification. For example, 
although there is evidence for increasingly integrated and sophisti- 
cated information being coded along sensory hierarchies, it is also 
well known that information processing does not progress neatly 
in series through these hierarchies. There are many bifurcations, 
shortcuts, and re-entrant pathways of information. In vision, the 
best studied case, multiple paths of visual signal are established al- 
ready at the first relay. Activity flows from the retina both to the 
lateral geniculate nucleus of the thalamus and to the superior col- 
liculus. In the cortex, massive interconnections among visual areas 
are well documented [42, 69]. Bifurcations result in massive con- 
current processing. Processing at multiple specialized brain areas 
proceeds concurrently to extract information about the colour, 
motion, identity, and spatial relations among items. Analysis in 
these areas cannot be said to be fully parallel, since many inter- 
connections exist, with the possibility of mutual interaction and 
cross-talk during concurrent analysis. Re-entrant pathways are also 
abundant, so that higher-order areas receiving early signals can in- 
fluence stimulus processing at hierarchically lower-order areas. For 
example, in vision, prefrontal areas have some of the earliest laten- 
cies of stimulation and may influence lower-order areas [70]. One 
interesting observation, for example, is that there are many times 
more feedback connections between V1 and the lateral geniculate 
nucleus than there are feed-forward connections [71]. Information 
flow in the reverse hierarchical direction may be as important as 
that in the forward direction [72]. 

The computational architecture combining functional special- 
ization with distributed, concurrent processing is highly efficient, 
greatly speeding up perceptual analysis and providing some fault 
tolerance through compensatory pathways. However, it also poses 
a major problem, known as the ‘binding problem. How does the 
brain correctly integrate the various attributes of the individual 
objects of perception (or thought) to form a cohesive episode? To 
date, no specific brain repository of fully integrated objects fit for 
conscious experience has been identified. Most theories posit that 
the binding problem is solved through the co-ordination of activity 
across large-scale networks in the brain [73, 74]. These theories are 
often also invoked to explain the neural basis of perceptual aware- 
ness [75, 76]. However, it is fair to say that we still lack an under- 
standing for the exact way in which activity is co-ordinated and 
integrated throughout brain networks. The questions become par- 
ticularly thorny when one considers the different timings for infor- 
mation to reach the brain from the different sensory modalities and 
remembers that we experience the world dynamically as it changes 
and we move through it. 


Proactive and dynamic modulation 


Our conceptualization of the brain is evolving. Instinctively, we have 
considered the brain as an organ to mirror, or at least construct a 
good internal model of reality. In this interpretation, the brain is 
usually a passive organ that reacts to external stimulation, creating 
a mental percept or construct anew, from scratch, at each moment. 
Yet, advances in research reveal the brain to be of a very different 
character. It is a complex dynamical system, ever restless, full of con- 
tent used proactively and dynamically to interpret incoming signal 
perturbations to guide adaptive behaviour. 

From the very first measurements of electrical signals in the 
human brain, this organ was found to be full of structured activity 
[23]. Far from idle, brain activity had rhythms that changed in inten- 
sity and frequency with the individuals’ state or pursuit. With the ad- 
vent of modern haemodynamic brain imaging methods, researchers 
observed that, even at rest, the brain alternates among activated 
functionally specialized networks [77, 78]. Contemporary electro- 
physiological methods with good spatial and temporal resolution 
reveal rapid transitions, with multiple network states per second 
[79, 80]. At the level of neuronal ensembles, the ebbing and flowing 
of rhythmic fluctuations in voltage levels is thought to affect neur- 
onal excitability, influencing the timing of neuronal spiking and, ul- 
timately, their communication [81-83]. 

The earliest psychophysical studies of human behaviour also cast 
strong doubt on a reactive and bottom-up construction of percep- 
tion and cognition. For example, when multiple stimuli compete 
for attention, the resulting percept is strongly dictated by where one 
wilfully chooses to focus, independently of where one is looking 
[10]. Over the years, experimental psychology has taught us that, 
contrary to how it seems, we do not take in the complete and con- 
tinuous stream of events unfolding around us. Instead, we sample at 
most a handful of items at any given moment. What ends up being 
perceived, and subsequently remembered, depends largely on our 
current task goals, motivations, memories, and emotions [84]. These 
internal factors influence perception and cognition proactively and 
dynamically. By combining behavioural studies with measurements 
of brain activity, cognitive neuroscientists have been charting the 
brain networks and mechanisms involved in the top-down control 
of information processing in the brain. Anticipatory signals based 
on internal states (for example, task goals or memories) carry infor- 
mation about multiple attributes (for example, locations, features, or 
timings) of expected relevant events. These, in turn, influence mul- 
tiple stages of sensory and motor processing to determine the ob- 
jects of perception that occupy the mind or guide action [85]. 

This continual orchestration of human perception and cognition 
by internal factors has important repercussions for understanding 
and treating mental disorders. The delicate regulation and optimiza- 
tion of our mental experience is achieved through fine orchestration 
of activity in multiple brain areas by large-scale networks involved 
in attention, memory, motivation, and emotion—often working to- 
gether. Biased patterns of memories or deficits in motivation can 
lead to suboptimal or distorted pickup of external information from 
the environment, for example by prioritizing and selecting items 
that reinforce negative expectations. In this way, cognitive and emo- 
tional biases can compromise the very first exchanges with reality. 
Furthermore, because these selected items then come to occupy the 
mind and guide action, they fuel a cognitive vicious cycle that can 
become difficult to break. 
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Research into how disruption of proactive top-down control by 
various types of internal factors contributes to disorders of mental 
health has great promise. Some researchers are beginning to in- 
vestigate how mood disorders compromise attention to emotional 
stimuli [86, 87]. The results are promising and are already leading 
to some useful cognitive interventions [88, 89]. However, the op- 
portunities remaining to be explored are vast. To make incisive 
progress, it will help for cognitive neuroscientists to expand their 
inquiry of attention-related mechanisms to more ecological contexts 
and to consider the influence of internal factors highlighted by the 
clinical condition. 


The last few decades have produced methods for investigating 
the human brain that would have seemed unimaginable when re- 
searchers first started empirical investigations of the brain-mind 
relationship. Each of the methods has its characteristic strengths 
and limitations, and so none is superior to others. In order to make 
advances in understanding how the brain supports cognition, it is 
always prudent to keep methodological limitations in mind and to 
employ multiple methods in a complementary fashion, to overcome 
limitations and guide accurate interpretation. A similar approach 
should be used to develop biomarkers and treatments for disorders 
of mental health. 


Psychophysics 


Psychophysics is a key term introduced to launch the empirical study 
of mental functions and to establish psychology as a scientific discip- 
line [90]. Through psychophysics, the researcher investigates how 
systematic alterations in physical stimulation change an individual's 
private perceptual experience and behaviour. The use of the term 
is often confined to studies of perceptual systems, but the method- 
ology applies equally well to investigations of higher-order cognitive 
functions, such as attention, memory, and emotion, through the sys- 
tematic manipulation of stimulation. 

Careful psychophysical experimentation should be brought to the 
forefront of understanding the causes of mental health disorders. 
Their primary and defining symptoms are psychological functions, 
yet many of the scientific efforts to develop biomarkers and treat- 
ments include only coarse psychological assessments or sidestep 
cognition altogether. Instead, few approaches would make greater 
inroads to understanding mental disorders than careful psycho- 
logical testing, enabling reliable, sensitive, and selective quantifica- 
tion of specific cognitive functions. 

The establishment of experimental psychology as a scientific 
field has transformed our understanding of the landscape of mental 
functions. The constituent elements of cognition are distilled with 
increasing granularity, and their dissociations, associations, and 
mutual influences charted. We no longer think of cognitive domains, 
such as memory, language, executive control, or emotion, as mono- 
lithic constructs defined in folk psychological terms. Each of these 
domains comprises functions with characteristic input streams, 
functional variables, behavioural consequences, and relationships 
to other psychological functions. 

To dissect the psychological mechanisms of granular cognitive 
functions, experimental variables are manipulated systematically to 
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measure behavioural consequences. In addition to behavioural re- 
sponses that explicitly tap into the variables of interest, it is also pos- 
sible to use indirect behavioural measures, which are altered by the 
implicit use of the cognitive construct in question. Implicit meas- 
ures can be very sensitive and circumvent problems related to stra- 
tegic responding by participants. For example, rather than asking 
individuals about their reactions to emotional stimuli, it is possible 
to measure their response times to targets that follow emotional vs 
neutral stimuli [91] or to name the colour in which emotional vs 
neutral words are written [92]. The implicit association task [93] 
is a good example of how implicit measures can reveal instinctive 
psychological attitudes of prejudice, which would be difficult to as- 
sess through explicit judgements. Most behavioural tasks still rely 
on unitary responses, such as a button response related to detec- 
tion, discrimination, or choice of a stimulus. In addition to accuracy, 
response times often add valuable information about the quality of 
performance. It is possible, however, to obtain much more detailed 
measures of behavioural performance by using continuous responses 
in behavioural tasks and by recording the precision and trajectories 
of responses [94, 95]. For example, rather than indicating whether a 
particular stimulus held in working memory was on the left or right 
side of the screen, participants can place the stimulus at its remem- 
bered location on the screen. In this way, over trials, it is possible 
to verify the precision of memory by the dispersion of responses 
around the correct value, to detect whether participants swap the 
location of items in mind, or to test whether the response trajec- 
tory is influenced by the presence of other stimuli that may have had 
particular salience or significance [96, 97]. In addition to relying on 
manual responses, recording eye movements, body movements, fa- 
cial expressions, and muscle contraction can further add valuable 
sensitivity to the content and transformation of mental operations. 
The precise delineation of boundaries among functions, as well as 
their full characterization, is still a living science and subject to re- 
finement, but current knowledge is sufficiently mature to make sig- 
nificant contributions to the study of mental health. 

Our growing insights into psychological functions have not made 
their way very effectively into the batteries of psychological tasks 
used for diagnosing mental functions. The challenge is to develop 
a time-efficient set of tasks that are sensitive, selective, reliable, val- 
idated, and easy to administer and quantify. The requirement to use 
well-validated tasks keeps us tethered to old ways. Some of the most 
popular neuropsychological tasks combine many psychological 
functions of different sorts, making interpretation at the granular 
level problematic. Take, for example, the Wisconsin Card Sorting 
Task (WCST), used for diagnosing frontal lobe dysfunction. On 
each trial, participants view three items that can vary along multiple 
dimensions—colour, shape, and number. They have to learn the rule 
determining which two items go together. After a series of trials, the 
categorization rule changes without instruction. Participants must 
understand that the rule has changed, learn and use the new rule, 
and inhibit using the previous rule. This simple task therefore re- 
quires verbal comprehension, attention to individual features, cat- 
egorization, learning sensorimotor correspondences, rule switching, 
inhibition, working memory for the active rule, sustained attention, 
and more. Most of the favourite neuropsychological tasks, like the 
WCST, are blunt or, at best, messy tasks from which to interpret the 
precise functions that may be compromised. But they are also quick 
and powerful methods for revealing a deficit in the first place, and 


over the years, their interpretation has become a science in its own 
right [98]. 

Effective behavioural testing is essential. The quality of infer- 
ence about a given molecular biomarker, brain dysfunction, or 
experimental treatment for a mental disorder will depend heavily 
on the quality of behavioural testing. Researchers argue about the 
best neuropsychological tasks, or combination of tasks, to chart the 
mental landscape of individuals. The use of a consistent set of tasks 
is important in order to link research findings from various labora- 
tories around the world. For this reason, efforts are increasingly 
made to harmonize the use of neuropsychological tasks across re- 
search centres, with groups of researchers coming together to thrash 
out what constitutes the most effective task battery [99]. But even 
these distilled test batteries still combine old-fashioned and complex 
tasks that do not isolate and quantify relevant psychological func- 
tions with a high level of granularity. Given the state of experimental 
psychology, much cleaner and more detailed assessments should be 
possible. 

So how do we strike the right level of innovation vs validation? It 
would be foolish to throw away the old faithful tests, with their many 
decades of validation. It seems equally foolish not to benefit from the 
power of precise psychological phenotyping that is currently pos- 
sible. Multiple components should be considered for striking the 
right balance. The first step is to acknowledge the importance of cog- 
nitive and behavioural phenotyping. Occasionally, major research 
initiatives or trials of mental disorders use only perfunctory meas- 
ures of cognition such as administering one of the standard cogni- 
tive batteries. What is the logic in downgrading the importance of 
obtaining high-quality psychological data relative to obtaining other 
body-based measures? Could this be a lingering prejudice against 
psychological science? If so, it is high time that we leave this behind. 
Another contributor could be the lack of access to high-quality cog- 
nitive neuroscience, and for this, more cross-fertilization between 
clinical and basic science training would be highly effective. 

Once the importance of sophisticated psychological phenotyping 
is accepted, it will prove exciting to realize the enormous break- 
through potential that can be achieved through its incorporation into 
large-scale studies. At this juncture, multiple research approaches 
should be pursued and interrelated. We should continue to use 
standardized and well-validated batteries of proven neuropsycho- 
logical tasks. These are effective at revealing the neuropsychological 
deficits and tether new studies to the established literature, thus al- 
lowing for comparisons. In parallel, we should use contemporary 
cognitive neuroscience to zoom in on potentially compromised 
areas in much greater detail. In many cases, it will be worth investing 
the time to obtain reliable and precise measures through extensive 
psychological testing that enable the characterization and quantifi- 
cation of cognitive functions at a sufficiently granular level for map- 
ping onto functional brain networks. Granular psychological testing 
should be ported to digital platforms, which enables testing on mas- 
sive scales, and therefore greatly accelerates their validation and the 
development of norms. These tests can be delivered in an engaging, 
game-lack fashion, in short testing bursts. The opportunity to con- 
duct testing over multiple sessions, such as in a daily fashion, has the 
added benefit of producing measures of variability, rates of learning, 
and functions of amelioration or decline. Developing games to 
measure and track mental health during healthy lifespan develop- 
ment and in the context of disorders is becoming a major enterprise 


in digital health [100-102]. Finally, we should start using digital 
technology to measure psychological functions from ongoing nat- 
ural behaviour in the real world. Patterns and variations in speech, 
response times, balance, movement, navigation, social contexts, 
sleep, for example, are likely to provide much more information than 
constrained psychological tests or games. To understand how the 
various variables within any one of these behaviours map onto cog- 
nitive and brain systems, however, it will be necessary to correlate 
performance variables during natural behaviour and carefully de- 
signed experimental tasks in the first instance. 


Neuropsychology 


The first major insights about the relationship between the human 
mind and brain came from observations of behavioural and cogni- 
tive deficits after brain damage. Cases of Phineas Gage [1], Leborgne 
(Tan) [2, 15], and HM [12 ,34], for example, were pivotal to situating 
higher-order comportment, language production, and episodic 
memory in the brain. 

Interpreting the functional role of a brain area from a lesion is 
complicated. The first complication is that naturally occurring le- 
sions, such as after a haemorrhagic stroke, do not follow the bound- 
aries of functional brain areas. Multiple areas may be compromised, 
and portions of individual areas may be spared. In many cases, 
therefore, it is impossible to know what exact functional areas have 
been affected and to what extent. Importantly, lesions also often 
damage white matter, breaking connections between other, often re- 
mote, brain areas. 

Lesion location aside, there exist other complications. Psycholo- 
gical functions arise through the co-ordinated activity of large-scale 
brain networks. In this context, it is important to consider functional 
inputs and outputs, connections among areas, and possible redun- 
dancies or alternative pathways to achieve given functions. It is not 
possible therefore to conclude that the lesioned area performed the 
lost function. Instead, one can only infer that the brain cannot sup- 
port that function in the absence of that area. Rather than the com- 
promised psychological function under observation, a lesioned area 
could be performing a necessary upstream or downstream function. 
It is also not possible to conclude that the lesioned brain area does 
not contribute to a function that remains unaffected, since the given 
function may also be (or become) supported by an alternative set of 
brain areas. Lesions to the white matter may disconnect brain areas 
from their critical inputs and/or outputs. Some have argued that 
major neurological syndromes result from disconnections caused 
by lesions to key white matter hubs for inter-areal connections, such 
as neglect [103], aphasia [44], or amnesia [104]. 

To complicate matters further, the effects of lesions change over 
time. Damage to a given area may change the pattern of inputs 
or outputs to and from other areas, thus changing the levels of 
effective stimulation at these distant sites. Loss of normal func- 
tion at these sites can result in neurological damage and atrophy. 
When studying the effects of chronic lesions, therefore, it is im- 
portant to consider how other areas and networks have been af- 
fected through diaschisis [105, 106]. Complementing the erosion 
of function in damaged and interconnected areas, there can also 
be compensatory plasticity in preserved and complementary 
brain areas. 

Despite all its shortcomings and complications, neuropsychology 
is an essential method for mapping the functional architecture of 
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the human mind and for understanding how brain areas and net- 
works support its various functions. Associations and dissociations 
among psychological functions affected by different brain lesions are 
invaluable for identifying the natural kinds of psychological func- 
tions, separating those that are strongly interdependent from those 
that are naturally segregated in the brain. Patterns of dissociation 
have broken down the previously monolithical constructs into com- 
plex systems of constituent functions. Double dissociations are an 
especially powerful method for identifying neurally independent 
psychological functions [107]. These have helped us chart the rela- 
tive independence between syntax and semantics [108], perceptual 
and motor learning [109-111], and perception for guiding recog- 
nition and for guiding action [112]. In addition to carving the psy- 
chological landscape at its joints, neuropsychology is also essential 
to linking psychological function to brain structure. Disruption 
of a psychological function after damage to a particular area pro- 
vides confirmation that this area plays a necessary and causal role 
in supporting the given function. Although it can be problematic 
to infer the psychological function performed by a given brain area 
based solely on neuropsychological observation in the first place, for 
the reasons given, neuropsychological observation is the ultimate 
testing ground for hypotheses concerning the causal contribution of 
a brain area to a psychological function. 

Most of the neuropsychological literature considers focal brain le- 
sions resulting from haemorrhagic or ischaemic strokes. However, 
it is also possible to apply neuropsychological methods to under- 
stand psychological deficits that result from dysfunctions in brain 
networks in neurodegenerative and neuropsychiatric conditions. 
One may argue that network-level neuropsychology is more natural 
and appropriate, given that psychological functions arise from ac- 
tivity in large-scale networks in the first place. Furthermore, pat- 
terns of neurodegeneration and network dysfunction in psychiatric 
conditions tend to follow the patterns of intrinsic brain connectivity. 
Patterns of degeneration and dysfunction within networks can be 
graded along their core nodes and occasionally display focal points. 
This can reveal interesting focal patterns of relative specialization 
within large-scale networks. An example is the breakdown of pri- 
mary progressive aphasia into agrammatic, semantic, and logopenic 
subtypes, which compromise different aspects of language after 
pathology weighted to lateral frontal, medial temporal, or lateral 
temporal regions, respectively [113]. 

Though, in principle, investigating the cognitive deficits after 
network-level dysfunction in neuropsychiatric conditions using a 
neuropsychological approach should be straightforward, in reality, 
it is very challenging. Some of the reason for the difficulty is that it 
has taken longer to accept that psychiatric conditions have neural 
causes. Most efforts in developing neuropsychological methods 
therefore occurred in the context of neurological disorders. The psy- 
chological tests and batteries we have are therefore most sensitive 
to the psychological consequences of damage in neurological con- 
ditions, and especially stroke. Efforts should be increasingly dedi- 
cated therefore to develop test batteries that are sensitive to patterns 
of cognitive and affective breakdown in psychological and psychi- 
atric conditions [114]. The other major difficulty is in identifying 
the patterns of damage in brain structure, connectivity, and func- 
tion in psychiatric cases. Without reliable markers of brain dysfunc- 
tion, the power of the neuropsychological approach is diminished. 
Identification and quantification of neural markers are likely to 
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improve substantially, as methods for imaging the human brain 
develop to provide increasing sensitivity to alterations in cortical 
microstructure, neuromodulation, fine-scale structural connect- 
ivity, and integrity and vitality of activity in functional networks. 


Brain stimulation 


Transcranial magnetic stimulation (TMS) is a non-invasive method 
that can simulate the transient effect of a brain lesion [115, 116]. By 
briefly passing a current through a coil of wires, forming a high- 
field magnet with a very focal gradient, a strong and rapidly fluctu- 
ating magnetic field is formed, which generates electrical currents 
in the underlying neural tissue and affects the membrane potential 
of nearby neurons. The result is a brief stimulation of action po- 
tentials and interference with the natural pattern of regional neur- 
onal activity. In some cases, TMS elicits positive effects such as the 
generation of sensations or of motor responses by stimulation of 
primary sensory or motor areas. Most of the time, however, the re- 
sult of TMS pulses is to interfere with ongoing neural activity, thus 
acting as a brief virtual lesion. Virtual TMS lesions have the advan- 
tage of being spatially and temporally circumscribed and targeted 
under experimental control. TMS can be applied in repetitive trains 
or as single pulses, with varying consequences on the excitability 
of the targeted region [117]. Because magnetic fields decay rapidly, 
only superficial cortical brain regions can usually be targeted. The 
method therefore avoids issues linked to diaschisis and reorganiza- 
tion of function that follow from chronic stroke lesions. However, 
stimulating one brain area using TMS may also influence activity 
across the whole network of regions to which the stimulated area 
is functionally interconnected. By using TMS, it is also possible 
to get evidence about whether a brain area is causally involved in 
the psychological function under investigation. By using multiple 
stimulators, or by combining TMS with brain imaging and re- 
cording methods, it also becomes possible to investigate how brain 
areas causally influence activity in other brain regions and how 
these modulatory connections, in turn, modify behaviour [118]. 
Increasingly, other related brain stimulation methods are being 
developed, which are capable of modifying cortical excitability in 
subtler and different ways (for example, transcranial direct cur- 
rent stimulation, transcranial alternating current stimulation, and 
transcranial random noise stimulation) [119]. In addition to their 
use in testing the causal involvement of brain areas and circuits in 
particular psychological functions, these methods are being in- 
creasingly used to induce plasticity in specific areas and circuits 
within rehabilitative contexts [120]. 


Brain imaging 


Methods for imaging structure and function in the human brain in 
vivo and non-invasively have revolutionized the study of psycho- 
logical function and dysfunction. 

The first major technique to be developed was PET [19, 20, 77]. 
PET measures the concentration of a radioactive substance at dif- 
ferent locations within the brain. PET scanners use arrays of de- 
tectors to sense the coincidence of gamma rays emitted upon the 
annihilation of positrons emitted from a radioactive substance. 
PET can resolve signals at the level of functional brain areas, but 
its spatial resolution is limited by the dispersion of positrons from 
their original source. The temporal resolution of PET is dictated by 
the half-life of the specific radioactive substance being measured. 


Some commonly used substances decay very rapidly and thus have 
a relatively high temporal resolution, while others decay much more 
slowly and thus have a low temporal resolution. 

By using radioactively labelled tracers that mimic or alter the 
function of neurotransmitters or neuromodulators [for example, 
6-'*F-fluorodopa (FDOPA)], it is possible to investigate neuro- 
pharmacological dysfunctions related to psychiatric conditions. 
PET and the related method of SPECT are the best methods for 
investigating neuropharmacological alterations associated with 
psychiatric conditions. The growing availability and specificity of 
tracers make it possible to study neuropharmacological param- 
eters with ever increasing specificity. In the case of the dopamine 
system, for example, it is possible to measure synthesis [for example, 
6-'SF-FDOPA and 6-’%F-fluoro-l-m-tyrosine (FMT)] and transport 
(for example, ‘'C/'*F-labelled tropane analogues) of dopamine and 
to isolate multiple dopamine receptor subtypes in the nigrostriatal, 
mesolimbic, and mesocortical systems (for example, ‘'C-SCH 23390 
for D1 receptors or ''C-raclopride for D2/3 receptors). 

By using tracers linked to glucose metabolism [for example, 2- 
deoxyglucose (2-DG] or blood flow (for example, 15-02), it is 
possible to investigate patterns of brain activity correlated with psy- 
chological states or functions. Neuronal activity is metabolically 
costly, leading to local increase in glucose metabolism and blood 
flow. Tracers of glucose metabolism offer the most direct marker of 
brain activity using brain imaging methods. 2-DG is a glucose ana- 
logue that cannot be metabolized. It is taken up by cells with high 
metabolic demands and remains trapped. The half-life of 2-DG 
is about 2 hours, and images are typically acquired over 20 min- 
utes. Therefore, although 2-DG imaging provides a good proximal 
measure of neuronal activity, the time resolution is insufficient for 
resolving modulations of brain activity within psychological tasks. 
Measures of blood flow using PET made up some of the first images 
of brain performing different activities [121]. The most commonly 
used tracer is 15-O2, which is absorbed into water in blood. It has 
a short half-life (about 2 minutes), allowing for multiple measure- 
ments (usually 8-12) to be taken from one individual during one 
experimental session, with a temporal resolution of approximately 
30 seconds. This method enabled researchers to quantify and com- 
pare brain activity within brain regions and to correlate it across 
task conditions and groups of participants. Haemodynamic-based 
imaging with 1502-PET ushered in cognitive neuroscience as we 
know it today. 

PET-based haemodynamic brain imaging was soon to be super- 
seded by fMRI. Most fMRI experiments measure the BOLD signal, 
which mainly reflects changes in blood flow associated with neur- 
onal activity [22]. The development of the BOLD signal combined 
many pieces of knowledge about the nature of the MRI signal and its 
susceptibility to local distortions in the magnetic field. Distortions 
in the field by the deoxygenated form of haemoglobin [122] lead to 
signal loss, so that the resulting signal strength is proportional to the 
ratio of oxygenated-to-deoxygenated haemoglobin [123]. The net 
increased blood flow into an active brain region leads to an overall 
increase in the proportion of local oxygenated blood, and therefore 
of the MRI signal [124]. In addition to being a superb method for 
investigating brain activity, MRI image sequences can also be used 
to measure different aspects of brain structure and connectivity (see 
Chapter 12). This one method—MRI—therefore provides various 
complementary modalities of brain imaging. 


fMRI offers much better spatial and temporal resolution than 
1502-PET, and it allows for many more images to be taken within a 
given experimental session, thus providing much greater reliability 
and flexibility. The theoretical spatial resolution of MRI is unlimited, 
though, in practice, the resolution is limited by coupling between 
the active neuronal pool and the local source of blood flow, as well 
as by the requirement to pool over space to increase signal strength. 
MRI images can be acquired very rapidly, and methodological de- 
velopments are constantly pushing the time requirements down 
[125]. However, ultimately, the temporal resolution of fMRI is con- 
strained by the haemodynamic function linking neuronal activity 
to subsequent changes in the level of oxygenated blood linked to 
the influx of blood and adjustments to other haemodynamic param- 
eters. Whereas it is possible to obtain a magnetic resonance image 
within the time frame of local field potentials related to synaptic ac- 
tivity (tens of milliseconds), the haemodynamic response function 
lags far behind, tracking brain activity with a delay of seconds. Even 
though the HRF is much slower than its driving impetus by neuronal 
activity, it is still possible to individuate the responses elicited by dif- 
ferent events occurring in rapid succession, if these are appropri- 
ately timed and intermixed [126]. Furthermore, the extent to which 
haemodynamic responses are suppressed by successive stimulation 
can provide an index of overlap in the neuronal populations acti- 
vated by these events [127]. MRI-based imaging has been highly suc- 
cessful, and the method is constantly improving, with new hardware 
advances, imaging sequences, and analysis tools. Arguably, MRI has 
played a major role in the paradigm shift from a view of the brain as a 
reactive organ with phrenological units to that of the brain as a pro- 
active dynamical system of complex networks. Initial studies using 
fMRI in the 1990s focused primarily on brain areas [128, 129]. Soon 
researchers began investigating the relationships between activity in 
different brain areas and refining measures of their functional con- 
nectivity and interaction [130-132]. Changes in brain activity and 
functional connectivity by pharmacological manipulations further 
nuanced our appreciation of modulatory functional interactions in 
the brain [133]. MRI then enabled the observation that functional 
brain networks are spontaneously active, even during periods of rest 
[77, 78]. Resting state networks (RSNs) emphasize the dynamic and 
active nature of the brain. 

Advances in analysis techniques have substantially increased our 
statistical power and resolution to study the human brain across 
spatial scales. Going beyond univariate analyses of changes in the 
magnitude of regional BOLD responses, multivariate analyses can 
compare patterns of subtle signal variation across voxels [134, 135]. 
Furthermore, computational approaches are increasingly applied to 
understand the relationship between patterns of brain activity and 
their representational content [136], and to arbitrate among com- 
peting models about the nature of neural coding supporting psy- 
chological functions [137]. At the network level, analyses based on 
graph theory and network science are increasingly used to estimate 
parameters related to the patterns and strength of connectivity 
among various functional nodes on multiple scales [138, 139]. 


Electrophysiological recordings 


Electrophysiological methods are the most direct way to measure 
human brain activity. Rather than relying on the relationships be- 
tween brain activity and metabolic demands and between meta- 
bolic demands and blood flow, electrophysiological methods pick 
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up unmediated electrical correlates of neuronal activity. In doing so, 
these methods have the ability to register changes in brain activity 
with high temporal fidelity. 

Electrical signals generated in the brain were first measured from 
the exposed animal brain by Richard Caton (1875) [140]. He de- 
signed an ingenious voltage-sensitive mechanism to move mirrors— 
a reflecting galvanometer—and demonstrated systematic changes in 
the pattern of reflected light upon variation of light stimulation to 
the eye. Some decades later, Hans Berger (1929) [23] developed the 
‘electroencephalogram” (EEG), a non-invasive method for recording 
electrical signals originating from the human brain. The EEG meas- 
ures fluctuations in voltage over time through electrodes placed on 
the scalp and a reference electrode. Using his method, Berger de- 
scribed the characteristic frequencies of voltage changes recorded 
from the human brain in different functional states (for example, 
sleep, relaxation, intellectual effort, administration of cocaine) and 
neurological conditions (for example, epilepsy). 

Although the EEG records voltage directly, the signals available 
at the scalp are a macroscopic and distorted summary of activity 
over large populations of neurons. Voltage signals at the scalp ori- 
ginate mainly from the summation of synaptic potentials of syn- 
chronously active neurons that are well aligned spatially [141, 142]. 
Because voltage decays logarithmically with distance, the measures 
are heavily biased towards neuronal populations that are close to the 
active electrode. The orientation of the active neuronal population 
relative to the electrodes also influences the polarity and amplitude 
of the signals. The resulting signal therefore reflects the spatial sum- 
mation of co-active neurons, biased by the degree of co-alignment of 
contributing neurons, as well as the orientation of the active region 
within the brain. The skull and scalp further strongly blur the signals 
before they are recorded [143]. Given these principles, the EEG re- 
cordings tend to be biased towards picking up excitatory synaptic po- 
tentials from pyramidal neurons in the neocortex [141, 144]. Signals 
at the scalp have been estimated to reflect the activity in thousands 
to millions of co-activated neurons [144]. The precise polarity and 
amplitude of the signals recorded at any given scalp electrode are 
heavily dependent on the location of the reference electrode. 

Magnetoencephalography (MEG) is a more recent electrophysio- 
logical method, which uses superconducting quantum interference 
devices (SQUIDs) to record the magnetic fields that accompany the 
neuronal voltage signals [145, 146]. Magnetic fields have different 
properties to voltage potentials, which confer greater spatial reso- 
lution to the recordings. The underlying origin of the signal meas- 
ures with EEG and MEG is the same—summed synaptic potentials 
over large populations of well-aligned, co-active neurons, but some 
important details differ in the signals that are measured. 

MEG sensors detect local changes in the magnetic field, so the 
recordings are reference-free. Magnetic fields are unaffected by the 
conductivity of the skull and scalp, eliminating the problem with 
blurring and resulting in much sharper gradients of activity meas- 
ured from the scalp. Magnetic fields decay more sharply than elec- 
trical fields with distance, further focusing the activity measured to 
superficial neocortical sources. Furthermore, the sensors are more 
sensitive to neuronal activity in sulci, resulting in magnetic fields 
that are tangentially oriented to the scalp. Altogether, MEG there- 
fore provides more spatially resolved measures of active neuronal 
populations. The main drawback of MEG, relative to EEG, is its re- 
liance on expensive superconducting technology and on the scarce 
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resource of liquid helium. However, new generations of sensors 
are being developed to measure MEG with more accessible tech- 
nologies such as optically pumped magnetometers (OPMs) [147], 
hybrid quantum interference devices [148], and nitrogen vacancy 
magnetometers [149]. 

Compared to haemodynamic imaging methods, EEG and MEG 
have poorer spatial resolution. Their spatial sensitivity comes from 
the distribution of voltage or magnetic-field gradients at the scalp, 
which can be sampled with dense arrays of sensors. It has been 
known since Helmholtz (1853) [150] that the problem of estimating 
the sources of signals from within a three-dimensional volume from 
a pattern on a two-dimensional surface is ill posed. The problem is 
mathematically underdetermined and impossible to solve uniquely. 
An infinite number of possible configurations of sources can 
account for any given surface pattern. Mathematical impossibilities 
aside, increasingly powerful methods of source reconstruction are 
being developed, which incorporate knowledge about brain struc- 
ture and physiology, convergent findings from methods with high 
spatial resolution, and analytical methods to quantify likelihoods of 
different possible solutions [151-154]. 

The direct and time-resolved measures of brain activity from 
MEG and EEG (M/EEG) can be used to address many important 
questions relevant to psychiatry [155]. As in Berger’s original use of 
EEG, it is possible to extract useful information just from the raw 
M/EEG signal. At rest, brains display characteristic patterns of ac- 
tivity within different frequency ranges, thought to reflect the levels of 
excitability in different neural circuits and on different spatial scales 
[156]. Disruptions in synaptic function or in the interactions among 
brain areas within large-scale networks associated with psychiatric 
conditions can lead to systematic alterations in the characteristic 
frequencies and power of oscillatory activity. Abnormal patterns of 
oscillations have been linked to conditions such as schizophrenia 
[157], autism spectrum disorder [158], and Alzheimer’s disease 
[159]. In addition to characterizing the frequency profile of activity 
in raw M/EEG, it is also possible to resolve activity in functional 
networks from the raw MEG signal when individuals are at rest [79]. 
Investigating the strength in RSNs and connectivity among con- 
stituent regions can provide a powerful method to investigate net- 
work dysfunctions associated with psychiatric disorders or their risk 
factors [160, 161]. New analytical methods that identify which func- 
tional network is most likely to be active at a given time point enable 
researchers to quantify the dynamics within functional RSNs [162]. 
Using such methods, it becomes possible to compare the vitality 
of networks in psychiatric conditions, for example by determining 
whether the dwell time of networks indicates excessive rigidity or 
instability in network states 

In addition to general measures of brain activity at rest, M/EEG 
provide rich information about the brain’s response to specific 
events within tasks. Traditionally, brain activity linked to a percep- 
tual stimulus, a cognitive operation, or a motor response has been 
studied using event-related potentials (ERPs) or fields (ERFs). These 
are averages of waveforms triggered by several repetitions of such 
events, which reinforce the aspects of the signal that are systemat- 
ically related to the event and average away other artefacts and un- 
related brain activity. The resulting waveforms have characteristic 
patterns of peaks and troughs, known as ‘components. Components 
are defined by their latency, amplitude, voltage topography over 
the scalp, and functional modulation by experimental variables 


[141, 163]. Their relations to the underlying neural events can be 
complex. In many cases, there may be no specific single intracranial 
component, but instead multiple overlapping neural processes that 
give rise to a macroscopic component at the scalp. These are there- 
fore best understood as sources of controlled observable variability 
[164] that provide rich, dependent variables to study information 
processing in the brain during that time period. 

Using ERPs and ERFs, it is possible to investigate changes in in- 
formation processing in the brain on a millisecond-by-millisecond 
time frame. Such temporal resolution can reveal whether a given 
psychiatric condition affects early perceptual pickup of informa- 
tion or only later deliberative processes. For example, ERP studies 
have pointed to early visual deficits in conditions such as schizo- 
phrenia [165]. Such conclusions could not be derived from haemo- 
dynamic imaging studies showing alterations in the visual cortex, 
since it would not be clear whether the modulation came from 
early visual processing or from late feedback modulation of visual 
areas by re-entrant activity after extensive processing in other areas. 
Another nice feature of electrophysiological recording methods is 
that brain responses can be studied without requiring participants 
to respond. In this way, it is possible to study the extent to which 
patients with different conditions process irrelevant, distracting 
stimuli. Such studies can reveal deficits in inhibiting irrelevant infor- 
mation [166] or exaggerated engagement with irrelevant emotional 
stimuli [167, 168] in different psychiatric and psychological condi- 
tions. By focusing on ERPs and ERFs at various stages of processing 
and by careful experimental design, it is possible to investigate how 
different psychiatric conditions affect attentional capture, emotional 
processing, semantic access and integration, memory retrieval, etc, 
arguably in a more specific and direct way by using brain imaging. 

More recently, an increasing number of measures and approaches 
are being developed to derive and analyse brain signals linked to 
event processing from raw M/EEG. By relying on trial-by-trial fluc- 
tuations in stimulus and behavioural parameters, it is possible to 
analyse task-related brain signals without having to average brain 
activity [169], thus providing even greater sensitivity to identify 
deficits within particular stages of information processing. In add- 
ition, task-based analyses are increasingly separating the M/EEG 
signal into its various frequency components. Changes in the in- 
trinsic oscillatory rhythms induced by stimuli or cognitive oper- 
ations can suggest alterations in circuit-level activity or connectivity 
in psychiatric conditions such as schizophrenia and autism spec- 
trum disorders [170, 171]. 

Because MEG and EEG provide a direct measure of brain activity, 
unmediated by haemodynamic parameters, they are well suited for 
investigating pharmacological effects. The powerful combination 
of pharmacological manipulations with electrophysiology remains 
under-exploited but is likely to play a major role in investigating 
pharmacological contributions to psychiatric conditions, as well as 
developing drug-related treatments [172, 173]. 


Biomarkers 


By combining neuropsychological batteries and focused cognitive 
testing with non-invasive measures to image and record brain ac- 
tivity, one dramatically increases the likelihood of identifying rele- 
vant biomarkers associated with risk or early stages of psychiatric 


disorders. It becomes possible to enhance characteristic cognitive 
profiles with sensitive and selective quantitative measures of speed, 
accuracy, and variability in specific, relevant cognitive domains. 
These rich psychological measures can be accompanied by vari- 
ations in the structure and function of brain areas, their structural 
and functional connectivity, the strength in activation of large-scale 
brain networks, the vitality and temporal characteristics of net- 
works, markers of neuronal integration, and communication, the 
speed and strength of neural responses at various stages of infor- 
mation processing, and altered patterns of information processing 
at various stages. 

Two general approaches can be considered when developing 
psychiatric biomarkers. The various behavioural and neural meas- 
ures can be used in a data-driven way to pull out combinations 
of measures that are predictive of psychiatric risk, condition, or 
recovery. New multivariate decoding and machine-learning mod- 
elling methods are increasingly powerful in identifying combin- 
ations of factors associated with particular outcomes. While this 
chapter strongly endorses the inclusion of factors directly related 
to cognitive and brain structure and function in biomarker devel- 
opment, it does not exclude the utility of other factors linked to 
molecular and cellular characteristics. The multivariate decoding 
approach can easily assimilate all such factors to find the most 
promising combinations of factors, thus generating high-level 
composite biomarkers. Much progress can be made in patient 
stratification, diagnosis, and treatment by using data-driven bio- 
markers, even without a clear understanding of the mechanisms 
that link each cognitive or brain factor to the outcome in question. 
Additionally, these data-driven biomarker candidates can become 
subjects of enquiry in their own right, yielding new mechanistic 
hypotheses to be explored. 

The other approach is the more traditional, hypothesis-driven 
experimental-medicine model. In this case, candidate mechanisms 
of cognitive or brain deficits can be investigated directly. By ex- 
panding the methods with which these are investigated, it becomes 
possible to gain a fuller mechanistic understanding of how a hy- 
pothesized deficit is expressed at different levels of analysis, as well 
as to learn about the breadth of implications of a given deficit and 
how best to develop treatments. 

Good examples of successful hypothesis-driven approaches to 
psychiatric conditions are found in the context of mood disorders. 
In the context of depression, Harmer and colleagues [174, 175] have 
investigated the role of cognitive emotional biases in mediating the 
effects of drugs for depression (often referred to as antidepressants). 
By using a sensitive battery of cognitive tests to assess emotional rec- 
ognition and emotional biases, the researchers have suggested that 
monoamine reuptake inhibitors influence emotional and social cog- 
nition directly, which, in turn, re-dress the mood disorder over time. 
The proposed mechanism explains why clinical actions of drugs for 
depression are delayed. By combining cognitive and pharmaco- 
logical studies with brain imaging methods, the researchers have 
revealed the neural systems affected by drugs affecting different 
neuromodulatory systems and have been able to develop effective 
means to predict individual responses to antidepressant treatment. 

A similar experimental medicine approach is being developed in 
the context of bipolar disorder, which is also benefiting from data- 
driven discoveries [176]. The guiding hypothesis for this programme 
of research came as a combination of data-driven and clinical 
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observations. Long-term prospective weekly monitoring of mood 
levels by patients [177, 178] revealed that individuals with bipolar 
disorder display high levels of mood instability, rather than the text- 
book pattern of alternating discrete episodes of mania, depression, 
and euthymia. Such results from remote monitoring of mood states 
complement clinical observations and conventional questionnaire 
findings [179, 180], and also provide rich quantitative data for mod- 
elling with machine-learning methods, as well as with novel math- 
ematical techniques [181-183]. These findings raised the intriguing 
possibility that mood instability may provide a central feature that 
contributes to bipolar disorder. It occurs in individuals at high risk 
for bipolar disorder [184], predicts its onset [185], occurs during 
the prodrome of the disorder [186, 187], and is independently as- 
sociated with poor prognoses [188-190]. Mood instability may also 
contribute to other mood disorders, such as borderline personality 
disorder [191], and may further do so in different ways. 

A cognitive neuroscience experimental-medicine programme 
of work has been launched to characterize the pattern of mood in- 
stability associated with the risk for bipolar disorder, to investigate 
how mood instability is associated with changes in particular cog- 
nitive functions and to reveal how altered brain network dynamics 
may contribute to mood instability and cognitive deficits [176]. If 
successful, such a programme of research will identify effective bio- 
markers that will enable the stratification of individuals for trials to 
investigate the effects of mood-stabilizing drugs, as well as predict 
and measure their efficacy within individuals. By building on this 
approach, a double-blind, randomized, placebo-controlled study is 
investigating changes in mood, cognition, and brain network dy- 
namics after 6 weeks of lithium treatment in participants with bi- 
polar disorder and mood instability [192]. 

The field can get caught up in debating whether data-driven or 
hypothesis-driven approaches are the way forward. The answer is 
simple. They are both useful and can work together effectively, in 
complementary ways. Ultimately, hypothesis-driven research is 
essential for developing a deep and nuanced understanding of the 
mechanism. However, the approach requires building on good ini- 
tial assumptions and ideas. In any scientific field, there is a danger 
of building experimental edifices on false starts. The additional 
challenges in psychiatry related to the lack of simple and tangible 
phenotypes for grounding the research exacerbate the problem. 
Data-driven discoveries can help set scientists back on track for 
better hypothesis-driven research. The process can be iterative, and 
information can flow in both directions, with data-driven findings 
informing hypothesis-driven work and results from focused experi- 
ments contributing to the pool of data for further mining. 
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Age is an acknowledged confounder in psychiatric research, most 
apparent in younger and in older patient groups. There is hardly 
any study that does not address this variable, usually by matching 
controls and patients by chronological age, by separating psychi- 
atric disciplines into age groups (viz. child and adolescent or old age 
psychiatry), or by entering the chronological age as a covariate into 
regression analyses. Time as a physical category, whether defined 
as multiples of 9,192,631,770 cycles of radiation (the transition be- 
tween two energy levels of the caesium-133 atom at rest at a tem- 
perature of 0 K (= 1 second [1]) or by the emptying of an hourglass, 
clearly has no direct mechanistic impact on a person's biological age, 
nor does our perception of time [2], although chronological time is 
used as a proxy of ageing. It is understood, moreover, that there is 
good ageing and bad ageing, depending on genetic and non-genetic 
factors, which add variability to the association of biological changes 
with the linear progress of time. This chapter will attempt to identify 
the time-dependent, that is, cumulative, processes that interfere with 
the function of the body and, in particular, manifest with changes in 
behaviour and experience. It will cover genetic and epigenetic mech- 
anisms, the intermediate metabolic changes associated with ageing, 
as well as changes at a system level that increase the allostatic load 
and lead to overload. We will pay particular attention to age-related 
changes in the brain that are likely to be associated with psychiatric 
disease and that are becoming more and more amenable to exam- 
ination with in vivo imaging techniques. Empirical biology of psy- 
chiatric ageing will require a chain of evidence from predisposing 
variables, longitudinally throughout life to observable brain changes 
that are associated with changed behaviour and psychiatric illness. 
We will illustrate this principle with a few examples. 











Chronological age is the best predictor of many chronic dis- 
eases such as atherosclerosis, type 2 diabetes, most cancers, and 
neurodegenerative diseases [3]. But how does the passing of time af- 
fect the human body? There are many different pathways underlying 





ageing, but the common denominator is that they affect the accu- 
mulation of damage in DNA, proteins, and lipids over time, and thus 
in organelles, cells, and tissues [4]. This somatic body cell clock de- 
termines the fate of individual organisms over weeks, months, and 
years and is based on mechanisms similar to the germ cell molecular 
clock that allows us to reconstruct the history of species over hun- 
dreds of millennia [5]. It is the balance of two processes that defines 
how quickly damage occurs—on the one side, external and internal 
processes that cause cellular damage, and on the other side, systems 
for maintenance, repair, and turnover. 

One example of internal processes associated with age-related 
damage is the production of reactive oxygen species (ROS). ROS 
are a normal by-product of cell energy metabolism and are mainly 
produced in the mitochondria, the power generator of the cell. ROS 
are chemically very unstable and cause oxidative damage to DNA, 
proteins, and lipids. This damage, in turn, causes malfunctioning of 
cells, and ultimately cell death if cells cannot recover [6]. However, 
there are many systems in the cell to combat oxidative damage, from 
antioxidants that act as scavengers of ROS to DNA repair mechan- 
isms and degradation pathways for faulty proteins [7]. Accumulation 
of oxidative damage in cells is the cause of ageing, according to the 
‘free radical theory’ of ageing [8]. However, there is conflicting evi- 
dence on the involvement of ROS in shortening or extending the 
lifespan [9]. 

Pathways that play a role in the pace of ageing have been ex- 
tensively studied in the four main model organisms for ageing re- 
search: the budding yeast Saccharomyces cerevisiae, the nematode 
worm Caenorhabditis elegans, the fruit fly Drosophila melanogaster, 
and the house mouse Mus musculus musculus/domesticus. There 
seem to be converging pathways in these model organisms and hu- 
mans that not only lead to an extension of the lifespan, but also an 
improvement of health during ageing—an increased ‘health span’ 
[4]. One of the best studied ways of extending life in model organ- 
isms is to introduce dietary restriction. This led to the discovery 
that pathways involved in nutrient sensing are important players 
in ageing. These pathways are regulated by different proteins: target 
of rapamycin (TOR), adenosine monophosphate-activated protein 
(AMP) kinase, sirtuins, and insulin/insulin-like growth factor 1 
(IGF-1) [9]. On the one hand, these pathways affect cellular metab- 
olism, which intuitively makes sense; when food is scarce, the or- 
ganism needs to be able to enter a standby mode of energy-saving 


to support only those processes that are vital for survival [10]. On 
the other hand, there is also a link between these pathways and cel- 
lular damage because they affect the production of antioxidants and 
proteins involved in cellular maintenance and repair mechanisms 
[9]. While it is difficult to establish links between dietary restriction 
and ageing in humans, studies in rhesus monkeys have shown that 
it affects ageing in non-human primates as well. Dietary restriction 
leads to improved insulin sensitivity and inflammatory profiles and 
protects from age-related diseases such as sarcopenia, cardiovas- 
cular disease (CVD), type 2 diabetes, neoplasia, endometriosis, and 
brain atrophy [11]. 


Cellular brain ageing 


The brain is particularly susceptible to cellular damage because 
of its limited capacity for regeneration and its high metabolic de- 
mand, which leads to increased ROS production [12]. Many 
neurodegenerative diseases have in common the fact that post- 
mortem studies show increased oxidative stress and reduced levels 
of antioxidants in brain regions affected by pathology [6]. Whether 
this is cause or effect of pathology and the contribution of oxidative 
stress to brain ageing per se, however, is a matter of intense research 
[12]. Nevertheless, nutrient-signalling pathways provide a link be- 
tween cellular damage and the brain. They not only play a role in the 
periphery, but also mediate signalling in the brain. For example, in- 
sulin receptors can be found in the hippocampus and their function 
depends on insulin transport from the periphery to the brain [13]. 

One of the direct effects of dietary restriction on the brain is 
increased production of neurotrophic factors and neurogenesis in 
the hippocampus [14]. The hippocampus is one of the few areas in 
the adult human brain that show integration of newly generated 
neurons into brain circuits. While most studies on neurogenesis 
have been conducted in rodents, it is reasonable to assume that 
adult neurogenesis in the human hippocampus serves similar pur- 
poses, supporting memory processes and contributing to brain 
plasticity. Neurogenesis decreases with age, so measures that con- 
tribute to increased neurogenesis are thought to be protective 
[15, 16]. Another mechanism of brain plasticity that decreases with 
age is synaptic plasticity. It has been extensively studied in non- 
human primates where age-related cognitive decline is more asso- 
ciated with alterations in synaptic connectivity, particularly highly 
plastic synapses, than loss of neurons per se [17]. Similar effects 
have been shown in humans, and research into Alzheimer’s disease 
has shown a high degree of synapse loss, rather than neuron loss 
at early disease stages, which correlates with cognitive symptoms 
[18, 19]. Therefore, the loss of synaptic plasticity and connectivity 
plays a role in the ageing human brain as well. Insights into the 
mechanisms driving ageing and longevity in humans have focused 
on two main contributors: genetics and lifestyle and other envir- 
onmental factors. 


Genetics and epigenetics 


The simple concept that some people ‘look young’ for their age, or 
vice versa, shows that chronological age, the time that has passed 
since birth, and biological age, in this case defined by features such 
as smoothness of the skin, posture, etc., are not necessarily the 
same. Research into genetic factors that drive the pace of ageing 
have focused on two extreme ends of the spectrum: those who show 
accelerated ageing phenotypes due to progeroid syndromes and 
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centenarians who show exceptional longevity. One example of a 
progeroid syndrome is Werner syndrome, which is characterized by 
the development of age-related diseases such as osteoporosis, CVD, 
and cancer from the age of 20. Research into the genetics of many of 
these progeroid syndromes has shown the importance of DNA re- 
pair and maintenance in ageing [20]. The debate of whether or not 
familial forms of neurodegenerative diseases, such as Alzheimer’s 
disease, constitute an ‘accelerated ageing’ phenotype later in life 
[20] is beyond the scope of this chapter. It is striking, however, that 
individuals with exceptional longevity seem to escape age-related 
disorders, including neurodegenerative disorders, altogether and 
live mostly disease-free until late in life [21]. Longevity clusters 
in families point towards a link between genetics and lifespan. 
Additionally, studies have shown that centenarians do not lead a 
healthier life than the rest of the population, which makes it more 
likely that it is their genetic makeup that protects them from the de- 
velopment of age-related diseases [21]. However, genome-wide as- 
sociation studies (GWAS) have not yielded very promising results, 
apart from an association between the apolipoprotein E (APOE) 
e2 allele and longevity, which is most likely due to the link between 
a reduced risk of Alzheimer’s disease in carriers of this allele [21]. 
Some studies have also reported that centenarians are more likely to 
carry protective alleles for age-related diseases such as cancer, CVD, 
and type 2 diabetes. However, this appears to depend on the sample 
source of centenarians because other studies found that centen- 
arians were healthy, even though they carried risk genes for these 
diseases [21]. More targeted genetic approaches have also shown 
associations between longevity and genes that are associated with 
ageing pathways such as forkhead box O3A (FOXO3A) and IGF-1 
signalling, genomic stability, in particular telomere protection, and 
genes that encode antioxidants [21]. The contribution of mitochon- 
dria and oxidative stress may implicate mitochondrial DNA. For 
example, two-thirds of Japanese centenarians were found to carry 
a mitochondrial gene variant that is possibly associated with de- 
creased mitochondrial leakage of ROS [22]. For an updated list of 
genes associated with ageing, see The Ageing Gene Database avail- 
able at: http://genomics.senescence.info/genes/. 

One problem with the genetic approaches described is that it is 
probably too simplistic to assume that all centenarians have the same 
genetic profile. One could hypothesize that it is again the balance of 
processes that cause damage vs repair and maintenance pathways 
that will determine the lifespan in humans. While some centenar- 
ians live long and healthy lives because they are resistant to damage 
in the first place, others profit from systems that efficiently repair 
damage and allow them to be resilient. 

It is not necessarily only the genotype that determines longevity, 
but also which parts of the DNA are actually transcribed. Epigenetic 
modifications, such as histone modifications, DNA methylation, and 
chromatin remodelling, change with age and provide a link between 
genetics and lifestyle or environmental factors that affect ageing. 
Members of the sirtuin family of proteins, for example, have been 
extensively studied as epigenetic factors that influence longevity in 
model organisms [23]. However, it is unclear whether these proteins 
also play a role in human longevity [21]. Studies in monozygotic 
twins have shown that lifestyle or environmental factors contribute 
to epigenetic differences between twins that increase with age [24]. 
These factors and the mechanisms underlying their effects on ageing 
are the subject of extensive research. 








SECTION 2 The scientific basis of psychiatric aetiology and treatment 


Environmental and lifestyle factors 


Studies in model organisms, as mentioned previously, have already 
provided links between metabolism and ageing. While it is diffi- 
cult to determine the effects of dietary restriction in randomized 
controlled trials in humans, several short-term studies have shown 
beneficial effects, for example a reduction in risk factors for athero- 
sclerosis, that are similar to those observed in animal studies [4]. 
There are strong links between hypertension, dyslipidaemia, and 
oxidative stress, with studies showing increased ROS production 
and decreased antioxidant activity in CVD patients. Indeed, some 
antihypertensive drugs and cholesterol-lowering statins probably 
act, at least partly, via decreasing ROS production [25]. As CVD 
can be seen as a sign of ‘unsuccessful’ ageing, factors that affect 
CVD risk, such as diet [26], will also, in turn, affect mechanisms 
of ageing. Physical activity affects ageing, possibly partly by acting 
on CVD risk. Even at low levels, physical activity decreases mor- 
tality and thus leads to more ‘successful’ ageing [27]. On a mech- 
anistic level, there is an association between physical activity and 
leucocyte telomere length, which indicates the effects on path- 
ways that are important for genomic stability [28]. Physical ac- 
tivity also has direct effects on the brain where it leads to increased 
hippocampal neurogenesis [15]. Another link between the environ- 
ment and ageing is the immune system. Ageing is associated with 
upregulated inflammatory markers, possibly because the immune 
system has to deal with recurrent or chronic systemic infections 
and imbalances between pro- and anti-inflammatory networks 
[29]. Following an injury or infection, the immune response to sys- 
temic insults is not restricted to the periphery, but also affects the 
brain. This becomes evident in the associated behavioural symp- 
toms of sickness such as lethargy and poor concentration [30]. 
Systemic inflammation accompanies many chronic diseases such as 
rheumatological conditions and CVD. There is also a link between 
metabolism and the immune system, because increased periph- 
eral insulin levels are associated with increased pro-inflammatory 


markers in the cerebrospinal fluid (CSF) [31]. Chronic inflamma- 
tion leads to an increased state of activation of the brain immune 
response that accompanies ageing and might, in turn, contribute 
to neurodegeneration [32]. Not surprisingly, altered immune re- 
sponses have been described consistently in depressive disorders 
later in life [33]( see also Chapter 16). Another important lifestyle 
factor that affects brain ageing is stress, which will be discussed in 
the next section. 


Stress and ageing (allostatic load) 


From first principles, our chance of developing physical and psycho- 
logical illness increases with time. Any change or challenge within 
the internal or external environment, no matter how minor it may 
be, requires temporary adaptations of the internal milieu and a 
shift in its homeostasis (a process termed allostasis). The repeated 
demand for allostasis over time can lead to syndromes that are as- 
sociated with peripheral organ damage, as well as structural and 
functional brain changes [34], which, in turn, further damage the 
dysregulated stress response system. These maladaptive responses 
reflect the ‘allostatic load’ (AL) and ‘overload’ [35], physiological 
dysregulation, and the wear-and-tear of the body, thus representing 
biological ageing [36, 37] (Fig. 18.1). Common psychological stress, 
if perceived negatively, rather than as an incentive to strive, is said to 
accelerate biological ageing and precipitate the onset of age-related 
diseases [38]. Individual variability in the progressive loss of the 
ability to deal with stress and the development of age-related distress 
[39] may also reflect differences in individual potential for brain 
plasticity and resilience [40]. 

Chronic stress-related changes accumulate across the lifespan, 
and challenges faced in later life are associated with intermediate 
composite markers of AL such as the metabolic syndrome [41]. 
‘These, in turn, increase the risk of pathological brain changes and 
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Fig. 18.1 The secondary stress marker ‘allostatic load’ with predictors and behavioural and brain outcomes. Three versions of computing the 
allostatic load from abnormal physiological measures and cited based on the underlying papers [70-75]. BMI, body mass index; BP, blood pressure; 
CRP, C-reactive protein; DBP, diastolic blood pressure; GGT, gamma-glutamyltranspeptidase; GLC, glucose; HbA1c, glycated haemoglobin; HDL, 
high-density lipoproteins; HR, heart rate; IL-6, interleukin 6; LDL, low-density lipoproteins; PR, pulse rate; SBP, systolic blood pressure; WHR, 


waist-to-hip ratio. 


age-related diseases such as depression [42], impaired cognitive 
function [43, 44], and Alzheimer’s disease [45]. 


Composite measures of biological ageing 


The release of stress hormones and their antagonists, along with 
pro- and anti-inflammatory cytokines, triggers the cardiovascular, 
metabolic, and inflammatory systems through the HPA axis and 
the autonomic nervous system to shift their operational ranges in 
order to sustain allostasis (Table 18.1). While this is adaptive in the 
short run, prolonged activity results in an increase of AL markers, 
which predict stress-related illness. Markers include cardiovas- 
cular, metabolic, and inflammatory measures, such as summarized 
in the Framingham risk scores [46] (see also Fig. 18.1), metabolic 
syndrome [47], or AL index [41], and are better predictors of phys- 
ical, mental, and cognitive health and mortality risk than individual 
component measures [48, 49]. 


Table 18.1 Composite measures of biological ageing 


Allostatic Metabolic 
load index | stroke risk syndrome 
[41] [46] [47] 


Framingham 





Cardiovascular Systolic BP X X 
Diastolic BP X 
CVD 


Atrial fibrillation 


Left ventricular 
hypertrophy 


BP medication X 
Diabetes X X 


Diabetes X 
medication 


Metabolic 


Fasting glucose X 


Waist X 
circumference 


Weight X 
Fat mass 
% body fat X 


Serum X X 
triglycerides 


HDL X 
LDL 
Cholesterol 
BMI 
HbA1C 


os E | 25 | 28 | 28 


C-reactive 
protein 


Immune 


Interleukin-6 X 
Other Age 
Sex X X 
Smoking 
Cortisol X 


BP, blood pressure; BMI, body mass index; CVD, cardiovascular disease; HbA1C, 
glycated haemoglobin; LDL, low-density lipoprotein. 

Allostatic load index below is based on markers available in the Whitehall II study 
[139, 140]. 
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The effects of psychosocial stress have been most extensively 
studied in association with cardiovascular health and disease risk 
in the context of the workplace [50]. Mid-life hypertension is as- 
sociated with an increased risk of late-onset depression [49] and 
dementia [51]. It has been linked to atrophy in both occipital and 
frontal cortical regions [52] and in the hippocampus [53] in cogni- 
tively healthy older adults. Consistently high blood pressure across 
adulthood also predicts structural brain changes in the deep white 
matter in healthy older adults [54]. Multifactorial cardiovascular and 
stroke risk scores, such as the Framingham Coronary Heart Disease 
[55] and stroke risk [46] scores, are associated with cognitive decline 
[56, 57] and structural brain changes such as grey matter volume 
reduction [58] and white matter hyperintensities [59]. In turn, pat- 
terns of structural brain changes discriminate between those with 
high vs those with low coronary heart disease risk, as measured by 
the Framingham Coronary Heart Disease Risk score, even after 
taking into account the genetic risk for Alzheimer’s disease [60]. 

Metabolic syndrome [47] is a combination of cardiovascular and 
neuroendocrine factors. It is defined by a number of biomarkers: in- 
creased abdominal circumference, increased levels of triglycerides, 
decreased levels of high-density lipoprotein (HDL) cholesterol, ele- 
vated blood pressure (including prehypertension), and fasting glu- 
cose levels (prediabetes). A diagnosis of the syndrome is given if at 
least three of these markers are above their defined thresholds. The 
effects of workplace psychosocial stress have been shown to mani- 
fest as metabolic syndrome [61], besides adverse cardiovascular 
health, as mentioned in this chapter. In turn, metabolic syndrome 
is associated with a risk of stroke and CVD mortality [62], telomere 
shortening [38], and poor mental and cognitive functioning [42, 
43, 63]. Our understanding of the association between metabolic 
syndrome and brain changes, however, is limited [64]. Reduced 
cortical thickness in distinct areas of both hemispheres, as well as 
volume reductions in the right nucleus accumbens of middle- to 
older-aged participants with metabolic syndrome, have been docu- 
mented [65]. Vascular brain damage in the form of periventricular 
white matter hyperintensities and subcortical white matter lesions 
in middle-aged individuals [64], reduced white matter integrity in 
fronto-temporal regions [66], and silent brain infarction in older in- 
dividuals [64] have also been reported. 

The exact combination of biomarkers that define the AL index 
is subject to debate [41], as is the way it is computed [67, 68], but 
it tends to include a combination of stress hormones, inflamma- 
tory, cardiovascular, metabolic, dyslipidaemia, and neuroendocrine 
markers. In combination, they are better predictors of health out- 
comes and mortality than on their own [69]. Chronological age is a 
strong predictor of AL increase, and a combination of psychosocial 
factors, including those at work, predict high levels of the index 
[70]. A recent systematic review of 16 cross-sectional studies found 
that occupational stress was positively associated with AL index, in 
spite of the lack of consensus on its computation and the hetero- 
geneity in study methods and quality [71]. Low socio-economic 
status is associated with higher AL index across the lifespan [72], 
which is not surprising, given that the prevalence of stressors is 
higher in low socio-economic groups. A recent community-based 
prospective cohort study of middle-aged adults found that material 
possessions (car and house ownership) and smoking, but not other 
health behaviours, such as physical activity or alcohol consumption, 
or psychological factors, mediated the relationship between low 
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socio-economic status and high AL index [73]. Sex differences are 
present in the association of both socio-economic status and lifestyle 
with AL index—low occupational position and alcohol abstinence, 
compared to moderate drinking, is associated with higher AL index 
levels in women, but the opposite is true for men. Education level, 
physical activity, and low salt intake seem to be protective against 
high AL in both sexes [74]. The AL index is a significant predictor of 
cognitive and physical decline, CVD, and mortality in older adults 
[48]. In a birth-year cohort, Booth et al. [75] reported a negative as- 
sociation of AL index with total brain and white matter volumes, and 
no association with grey matter volume in participants aged 73 years. 
In addition, the index was negatively associated with general cogni- 
tive ability, processing speed, and knowledge, but not with memory 
or non-verbal reasoning. Although it is argued that combinations 
of physiological markers of wear-and-tear are better predictors of 
brain structure and function than any one of the AL markers [48], 
the mechanisms in which primary and secondary markers come to- 
gether to predict cognitive decline are yet to be understood [76]. 


Several brain structures have a central role in the regulation of the 
stress response ( ). These regions include the hippocampus, 


amygdala, and prefrontal cortex. They are important effectors, re- 
sponding to mediators of allostasis, as well as targets of allostasis, 
so that chronic dysregulation can result in AL and damage to cer- 
tain brain structures. Furthermore, they can become central to 
such dysregulation, stress-related vulnerability, and pathological 
plasticity [34]. Identifying the precise mechanisms that lead to 
the wear-and-tear of specific brain structures may lead to effective 
interventions [37]. 

The subcortical limbic brain structures, the hippocampus and 
amygdala, process information arising from less developed brain 
regions in the diencephalon (such as the hypothalamus) and 
myelencephalon (parts of the brainstem), along with higher cor- 
tical areas, particularly in the prefrontal cortex, to regulate physio- 
logical and behavioural responses of allostasis [34]. The formation 
of synapses and long-term potentiation to facilitate memory can 
make these responses adaptive. However, neuronal damage caused 
by chronic elevated glucocorticoid levels can lead to (hippocampal) 
neuronal death via further dysregulation of glucocorticoid secretion 
[77]. Although a lack of longitudinal studies makes it challenging to 
determine the mechanism for this, for example, higher sensitivity 
to glucocorticoids in PTSD may contribute to hippocampal atrophy 
[78]. Individuals with smaller hippocampi, on the other hand, may 





(see Colour Plate section) Negative association of vascular risk (Framingham Stroke Risk Score) averaged over 20 years from 
mid- to later life and grey matter density (GMD) in members of the Whitehall II cohort (N = 405). Images were analysed using FSL-VBM, 
an optimized voxel-based morphometry (VBM) protocol (for more details, see [140]. Using randomized and correcting for multiple 
comparisons, a voxel-wise general linear model (GLM) was applied between average Framingham Stroke Risk Scores and GMDs, correcting 
for age and sex. Significance threshold was set at P <0.05, using the threshold-free cluster enhancement (TFCE) method. Significant negative 
association is present in the right cerebral cortex: in the medial temporal lobe, temporal pole, planum polare (a), and post-central gyrus (b). 


A = anterior; R = right. 
Image courtesy of Dr Enikő Zsoldos. 


be more vulnerable to developing PTSD [79]. In older adults, a con- 
tinuous rise in resting glucocorticoid levels with age seems to be as- 
sociated with loss of hippocampal volume and memory decline [40]. 
Changes in white matter microstructure are also present with ele- 
vated diurnal and reactive glucocorticoid levels in elderly males [80]. 


Psychosocial and severe stress 


Common stressful psychosocial and environmental factors, such 
as socio-economic status and workplace stressors, are predictors of 
cardiovascular outcomes [81, 82] and mortality [83]. Although at- 
tempts to relate specific stressors to specific medical problems have 
been unsuccessful, stress remains an underlying theme in the de- 
velopment and course of virtually all physical illness [84]. Socio- 
economic status both in childhood and adulthood is associated with 
chronic stress [85] and has been linked with negative health out- 
comes and mortality in later life [86]. Composite measures of AL are 
often associated with low social status in childhood and adulthood, 
capturing the long-term effect of chronic stressors, which have a 
higher prevalence in low socio-economic strata [37]. There is some 
evidence for the negative association between psychosocial stress 
in childhood and adulthood [87, 88] and shorter telomere length, 
which may mediate impaired health in adulthood. Socio-economic 
status in childhood, however, is not unequivocally associated with 
telomere length [87]. Negative circumstances in both early and adult 
life independently predict higher AL index, and so does negative 
emotional response to stressful life events [89, 90]. There is some 
evidence that chronic perceived stress and the number of stressful 
life events is associated with decreased grey matter volumes in the 
limbic and prefrontal areas in adults without specific psychopath- 
ology [91-93]. However, studies like these are scarce and difficult 
to interpret. In principle, mild or moderately aversive conditions 
during childhood can shape an individual to be optimally adapted 
to similar conditions later in life [94] but can also potentiate the ef- 
fects of further negative events and circumstances—albeit current 
evidence for this only exists in men [89]. 

Crises or unexpected traumatic events exerting acute, sudden, or 
intense stress not only overwhelm coping mechanisms, but may also 
be associated with psychopathology, such as PTSD [95] or major 
depression [96]. Extreme life stressors can affect brain structures 
related to allostasis, such as the hippocampus, amygdala, anterior 
cingulate cortex, and medial frontal gyrus [93], especially in people 
with depression [36] and PTSD [78]. In people with major depres- 
sion or PTSD, childhood neglect or trauma can result in structural 
and functional brain changes [97]. However, psychopathology itself 
may already cause these [98, 99]. 


Not all stress is bad: potentials for resilience and 
the slowing of ageing 


Mild or moderately aversive conditions during childhood can shape 
an individual to be optimally adapted to similar conditions later 
in life [94]. The benefits of mild and limited stressors that result in 
physiological benefits (so-called ‘eustress’ or ‘hermetic stress’) are 
well documented in laboratory animal studies and can be supportive 
of successful biological ageing [100, 101]. The relationship between 
the animal's response to stress and the rate of ageing is extremely 
complex, and successful translation into human experiments are 
rare [102]. While the relationship between stressful circumstances 
(such as daily hassles, workplace stress) and physiological pathways 
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is plausible, demonstrating and measuring this aspect of biological 
ageing, independent of disease pathology, is challenged by labora- 
tory and ethical limitations [103]. Healthy eating, including cal- 
oric restriction, physical exercise, and gene expression in response 
to certain stressors, are recommended targets of future biological 
ageing studies [103]. The advent of longitudinal cohort studies and 
non-invasive multimodal imaging techniques presents a window of 
opportunity to bridge the gap between basic and clinical scientific 
enquiries. 

There is some evidence that cumulative adverse circumstances 
across the lifespan predict physiological dysregulation, measured by 
the AL index [89]. There is also evidence that this relationship is me- 
diated by the negative emotional response to stressful events [90]. 
Stress mindset, the extent to which an individual thinks a stressor 
is debilitating or enhancing is said to be instrumental in the mani- 
festation of the stress response and the acceleration of biological 
ageing [104]. A key feature of dysfunctional stress adaptation is loss 
of resilience, which manifests in the form of anxiety and depression. 
(Mal-)adaptive stress-related brain plasticity, such as dendritic re- 
modelling or shrinkage of dentate gyrus-CA3 pyramidal neurons in 
the hippocampus or medial prefrontal cortex, dendritic growth in 
the amygdala, neuronal replacement, and synapse turnover, if re- 
versible, that is, prior to permanent excitotoxic damage [105], can 
underpin treatment or slowing of age-related changes [106]. 


Ageing and the brain 


Structural changes with ageing 


That brains change with age has been observed for a long time in the 
general population. Until recently, this may even have contributed 
to the clinical neglect of brain imaging in the assessment of patho- 
logical changes in higher age groups [107]. Brain size changes with 
age, declining by 1% every 5 years from the ages of 20 to 80 [108]. 
Correspondingly, certain domains of cognition, such as memory, 
reasoning, phonemic and semantic fluency, but not vocabulary, de- 
teriorate from age of 45 at different speeds [109] (see also [110]). 
Distinct aspects of frontal lobe structure mediate age-related dif- 
ferences in fluid intelligence and multitasking [111]. Structural 
and functional differences in the medial prefrontal cortex underlie 
the known distractibility and suppression deficits in ageing [112], 
known to everybody who has played ‘Taboo with older and younger 
relatives. This family game requires one player to describe a con- 
cept to all other co-players, without using certain given keywords 
that are commonly linked to the concept, that is, the player has to 
suppress the ‘obvious’ responses (the ‘performance cut-off’ appears 
to be in the 30s [113]). Objective markers of ageing, such as leuco- 
cyte telomere length, have been found to be associated with total 
and regional brain volumes, in particular the hippocampus, amyg- 
dala, and inferior temporal region, in a large population-based 
cohort [114]. Not only the grey matter, but also the white matter, 
deteriorates with age [115]. Widespread age-related differences in 
brain microstructure are demonstrated by quantitative MRI, with an 
anterior—posterior gradient, for example, more changes in the genu 
than in the splenium corporis callosi [116]. The aetiology of these 
changes is not entirely clear, but an important contribution must be 
from vascular ageing that is also thought to contribute significantly 
to the development of dementia in Alzheimer’s disease [117, 118]. 
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Fig. 18.3 (see Colour Plate section) Associations of large vessel elasticity (from pulse wave velocity data supplied by Eric Brunner, University College 
London) with mean diffusivity in white matter (a, b, c: negative association) and grey matter density (d: positive association) (N = 444). Model: sex, 
education, mean arterial pressure, alcohol, antihypertensive medication, chronic illness, ethnicity, social class, and FRS (for image acquisition and 


analysis, see [140]). 


Image courtesy of Dr Sana Suri. 


Increases in 24-hour systolic blood pressure are associated with ar- 
teriolar fragility of the cerebral white matter in people aged over 65. 
The clinical relevance of such abnormalities in asymptomatic and 
moderate cardiovascular risk populations is unclear [119]. 

Figs. 18.2 and 18.3 demonstrate the associations of vascular risk 
(Framingham stroke risk score) and large vessel elasticity with grey 
matter density (N = 405). Alternative mechanisms underlying age- 
related white matter changes include excitotoxicity and mitochon- 
drial dysfunction [120]. Also, less decrease over 6 years in C-reactive 
protein, that is, persistent inflammatory reaction, was significantly 
associated in octogenarians with poorer white matter integrity in 
the dorsal and temporal superior longitudinal and uncinate fasciculi 
(N = 276 [121]). Figs. 18.4 and 18.5 summarize the negative associ- 
ations of grey matter intensity and white matter integrity (fractional 
isotropy) with age. 


Brain reserve in ageing 


Atrophic or regressive brain changes are, however, only half the 
story [122]—the brain retains plasticity and is able to adapt to 


damage by reorganization and recruitment of alternative circuits 
for impaired tasks [123, 124]. A number of mechanisms have been 
described. Functional activity tends to shift anteriorly with advan- 
cing age (‘posterior to anterior shift’ [125]). This may explain the 
devastating effect of additionally impaired executive function on 
patients with primary temporo-parietal lesions in Alzheimer’s dis- 
ease. Particularly within frontal lobes, activation that is lateralized 
in younger people (as, for example, for verbal and spatial working 
memory) tends to be distributed more symmetrically with advan- 
cing age [126]. This Hemispheric Asymmetry Reduction in Old 
Adults (HAROLD) model [127] has support in the domains of epi- 
sodic, semantic, and working memory, perception, and inhibitory 
control. The large resting brain network that is active when no task is 
engaged (default mode network) and suppressed during tasks tends 
to be disrupted with age, in that the link between anterior and pos- 
terior sections is weakened [128, 129]. Such changes are associated 
with poor white matter integrity and cognitive function. While such 
anterior and posterior uncoupling already occurs in the absence of 
amyloid [129], it is emphasized in those with amyloid deposits in, 
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(see Colour Plate section) Widespread negative association of grey matter density (GMD) with age in members of the Whitehall II cohort 
(N = 405). Images were analysed using FSL-VBM, an optimized voxel-based morphometry (VBM) protocol. Using randomized and correcting for 
multiple comparisons, a voxel-wise general linear model (GLM) was applied between age and GMD, correcting for sex and socio-economic status 
defined by employment grade. Significance threshold was set at P <0.05, using the threshold-free cluster enhancement (TFCE) method [140]. 


A = anterior; R = right. 
Image courtesy of Dr Enikő Zsoldos. 


for example, the posterior cingulate gyrus, whether dementing or 
not [130, 131] ( J; 

Cognitive impairment associated with such changes is therefore 
likely to be due to disruption of brain reserve, rather than damage to 
networks primarily involved in certain tasks. Apart from functional 
connectivity documented by PET or resting fMRI, white matter in- 
tegrity, as recorded by T,-weighted MRI and DTI, is likely to con- 
tribute to brain functional reserve [132]. 

While correlational studies are relatively easy to come by and may 
reflect a relatively static or pre-existing brain reserve, the mechan- 
isms of brain plasticity supporting cognitive function are more dif- 
ficult to identify. Valkanova et al. [124] cite 36 studies employing 
a variety of training modalities such as juggling, working memory 
training, meditation, learning abstract information, and aerobic 


exercise. There were training-related structural changes, increases 
or decreases in grey matter volume, a combination of increases and 
decreases in different brain regions, or no change at all. There was 
increased white matter integrity (fractional anisotropy) following 
training, but other patterns of results were also reported. Apart from 
study-dependent variations in outcome, a number of questions 
arise in the interpretation of such results. Are changes in the grey 
or white matter structure simply due to use, or are they associated 
with learning as such? What are the underlying neural correlates of 
learning, the temporal dynamics of changes, the relations between 
structure and function, and the upper limits of improvement? How 
can gains be maintained [124]? If and how exactly use and training 
of brain-related skills can contribute to counteract the effect of 
ageing therefore still awaits clarification. 





(see Colour Plate section) Widespread negative association of white matter integrity [fractional anisotropy (FA)] with age (in blue overlaid on 
green white matter skeleton) in members of the Whitehall II cohort (N = 395). Images were analysed using FSL-TBSS, an optimized tract-based spatial 
statistics (TBSS) protocol. Using randomized and correcting for multiple comparisons, a voxel-wise general linear model (GLM) was applied between 
age and FA, correcting for sex and socio-economic status defined by employment grade. Significance threshold was set at P <0.05, using the threshold- 


free cluster enhancement (TFCE) method [140]. A = anterior; R = right. 
Image courtesy of Dr Enikő Zsoldos. 
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(see Colour Plate section) Default mode networks generated from 323 Whitehall II participant scans (MoCA >25), showing split between 
frontal (top) and posterior (bottom) components in older people (mean age 70 years) (for method, see [140)}). 


Image courtesy of Dr Sana Suri. 


It is tempting to assume that it might be possible to prevent age- 
related changes in brain structure and function and cognitive de- 
cline by targeting the mechanisms of brain ageing. Indeed, an effect 
on ageing per se might have knock-on benefits for the host of dis- 
eases associated with the ageing process. There are several strat- 
egies that are under investigation. Some are aimed at slowing brain 
ageing with physical exercise, which has been shown in animal 
models to have beneficial effects on neurogenesis and cerebrovas- 
cular health [133]. Cognitive exercise is another preventive strategy 
that is thought to increase resilience to age-related brain changes 
[134]. Several pharmacological studies are also under way that 
target some of the molecular pathways of ageing, such as the TOR 
or IGF-1 pathways, using repurposed drugs that have already been 
approved for treatment of other diseases, for example the diabetes 
drugs metformin and pioglitazone [135,136]. In mouse models, it 
has been shown that blood transfusions from young mice might lead 
to rejuvenation of older mouse brains [137], and translation into hu- 
mans is being tested. However, there is currently not enough evi- 
dence to show that any of these strategies can prevent or slow down 
brain ageing in humans. 


With the advent of sophisticated in vivo imaging methods, brain 
ageing can now be investigated. Naturally, the first available studies 
are cross-sectional, while large longitudinal studies will be a lifetime 


in coming. Hypotheses regarding molecular, cellular, and system-level 
ageing allow for us to start interrogating the available data in a specific 
fashion. Such multiple testing will always suffer from ‘residual con- 
founding’—all samples collected for imaging will be selected in some 
fashion (willingness to participate, able to enter an MRI scanner), 
even if they are picked at random from the general population (which 
most are not). The large number of possible factors contributing to 
ageing, as well as secular changes in the nature and pattern of caus- 
ation, will keep researchers busy for decades to come [138]. 
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The emergence of deep brain stimulation (DBS) as an intervention 
for treatment-resistant psychiatric disorders is the result of an evo- 
lution of understanding of brain function and disease mechanisms 
across a variety of disciplines within neuroscience and medicine. 
Refinement of brain imaging strategies has increasingly linked spe- 
cific symptoms or behaviours with discrete brain regions. Highly 
sophisticated neuroscience techniques, such as optogenetics, have 
allowed for an increased understanding of neurocircuitry in awake 
and behaving animals and the development of more sophisticated 
translational animal models linking brain and behaviour to human 
disease. Refinement of neurosurgical techniques led to its expan- 
sion into neuropsychiatric disorders, most notably movement dis- 
orders. The success of lesioning techniques in Parkinson's disease 
gave rise to the use of high-frequency DBS at the same target sites, 
with the initial conception of DBS as a reversible lesion (Fig. 19.1). 
As these various disciplines evolved in parallel, there is no single 
rationale for using DBS to treat psychiatric disorders. (See Table 
19.1 and Fig. 19.2 [1] for a summary of targets.) For example, use of 
DBS in the region of the internal capsule for obsessive-compulsive 
disorder (OCD) and depression largely grew out of the history of 
performing anterior capsulotomy and other surgical lesions for se- 
vere, treatment-resistant cases. Targeting the subcallosal cingulate 
cortex (SCC) with DBS was the direct extension of neuroimaging 
findings demonstrating SCC hyperactivity in depression and sub- 
sequent normalization of activity following successful treatment, 
irrespective of treatment modality. Consideration of the nucleus 
accumbens (NAC) and medial forebrain bundle (MFB) as DBS 
targets relied heavily on the work in animal models to understand 
limbic circuitry and reward, implicating the NAC and associated 
monoaminergic inputs in drive, motivation, and positive reinforce- 
ment, all of which are important aspects of nearly any psychiatric 
syndrome. 

Regardless of the target or rationale, all of the data reviewed here 
represent experimental attempts to treat a subset of patients for 
whom standard treatments have failed (OCD, depression, eating 
disorders, addiction) or who suffer from an illness for which there 
is no treatment that significantly modifies the disease trajectory 
[Alzheimer’s disease (AD)]. Treatment resistance is defined differ- 
ently for each disorder, although typically it includes non-response 











or loss of response to multiple medication categories and a valid- 
ated psychotherapy trial. DBS for OCD does have the CE Mark ap- 
proval for use in Europe. In the United States, DBS is not US Food 
and Drug Administration (FDA)-approved for any psychiatric dis- 
order, although the FDA did grant a humanitarian device exemption 
(HDE) for use in OCD. (For comparison, DBS is FDA-approved for 
Parkinson's disease and essential tremor, and there is an HDE for 
dystonia.) The current state of DBS for psychiatric disorders is there- 
fore still very early and largely experimental, driven by the absence 
of any effective treatment in this subset of patients with severe and 
treatment-refractory illness. With continued research on brain cir- 
cuit abnormalities in other conditions, the breadth of potential ap- 
plications of targeted neuromodulation is likely to increase. 
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ye-compulsive disorder 


OCD affects approximately 2% of the population. It is a chronic psy- 
chiatric disorder, with an estimated 10-20% of affected individuals 
considered treatment-resistant. Standard treatment includes SSRIs, 
the tricyclic drug clomipramine, and cognitive behavioural therapy 
(CBT). Symptoms typically fall into themes such as obsessions, 
checking, contamination, symmetry, harming, and hoarding. Most 
patients diagnosed with OCD will also meet criteria for another 
psychiatric disorder at some point, with major depressive disorder 
being the most frequent. 

The cortico-striato-thalamo-cortical (CSTC) loop has been 
widely implicated in the pathology of OCD (Fig. 19.3) [2]. This 
loop includes the lateral orbitofrontal cortex (OFC) projecting, via 
fibres of the anterior limb of the internal capsule (ALIC), to the head 
of the caudate, in turn projecting to the globus pallidus and on to 
the thalamus, which, in turn, projects back to the OFC via fibres of 
the inferior thalamic peduncle (ITP). In imaging studies of OCD, 
hyperactivity in the OFC and caudate has been commonly demon- 
strated and decreases with medication or psychotherapy. Previously, 
anterior capsulotomy and gamma knife capsulotomy have been 
used to interrupt fibres reciprocally connecting the mediodorsal 
thalamus and prefrontal cortex. DBS for treatment of OCD devel- 
oped from this understanding of the neural circuitry underlying 
disease pathology, as well as from evidence of effectiveness of other 
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Fig. 19.1 Radiographs showing DBS leads implanted in the brain (left), with subcutaneous lead extensions connecting the brain leads to the internal 


pulse generator implanted subcutaneously in the chest wall (right). 
Image courtesy of Dr. Helen Mayberg. 


neurosurgical procedures that have improved OCD symptoms ei- 
ther intentionally or serendipitously. 


Anterior limb of the internal capsule, 
ventral striatrum/ventral capsule 


The most frequent DBS target for OCD is the ventral portion of the 
internal capsule, extending into the ventral striatum. The anatom- 
ical target of the first DBS for OCD was the ALIC, following the lit- 
erature on anterior capsulotomy lesions. Initial data came from 15 


Table 19.1 Psychiatric disorders and DBS targets 


Depression Subcallosal cingulate 

ucleus accumbens 

Ventral capsule/ventral striatum 
edial forebrain bundle 

nferior thalamic peduncle 

Lateral habenula 

Obsessive-compulsive disorder Ventral capsule/ventral striatum 

ucleus accumbens 

edial forebrain bundle 

Subthalamic nucleus 


nferior thalamic peduncle 


Anorexia Subcallosal cingulate 
ucleus accumbens 

Obesity Lateral hypothalamus 

Addiction ucleus accumbens 


Subthalamic nucleus 


Alzheimer's disease Fornix/hypothalamus 





ucleus basalis of Meynert 


patients across four studies and case reports [3]; 45% of 11 patients for 
whom data were tracked showed at least 35% improvement from their 
baseline Yale-Brown Obsessive Compulsive Scale (Y-BOCS) score, 
the threshold most commonly used to define treatment response. 
Collaboration among four work groups at this target demonstrated the 
effect of a learning curve, such that over time, the target shifted pos- 
teriorly to include the ventral internal capsule, caudate nucleus, and 
NAC [ventral capsule/ventral striatum (VC/VS)]. Results from 34 pa- 
tients receiving high-frequency stimulation for 36 months showed a 
62% response rate. Imaging and animal studies suggest that this more 
posterior position may more effectively stimulate fibres of the CSTC 
circuit. In the most recently published cohort of patients, 4 of 6 were 
responders at 1 year [4]. Full effects of stimulation typically take about 
3 months to evolve and plateau. Reports from long-term observational 
cohorts describe a sustained benefit from VC/VS DBS in OCD. 

Stimulation-related side effects are common, at least transiently 
during stimulation parameter testing. Hypomania affects 35-65% 
of patients, resolves with changes in parameters, and usually is not 
severe. Transient sensory effects have also commonly been reported. 
Other reported effects include: euphoria, giddiness, anxiety, panic 
attacks, sadness, and stimulation-induced contralateral smile. 

DBS at VC/VS appears to be differentially effective on specific 
symptom subtypes. DBS was 100% effective (n = 5) for predominant 
obsessions and checking, 56% effective (n = 5/9) for symmetry and 
ordering symptoms, and 45% effective (n = 5/11) for cleanliness and 
washing symptoms. 


Nucleus accumbens 


The rationale for targeting the NAC directly, rather than as part of 
the VC/VS approach, stems from evidence of reward system dys- 
function in OCD, as well as its role as a relay structure between 
limbic and cortical areas [5]. Hyperactivity of the NAC specifically, 
or more generally of limbic activity relative to cortical activity, may 
lead to fixed thought/action patterns overcoming more flexible and 
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Fig. 19.2 Deep brain stimulation targets for psychiatric indications. ALIC, anterior limb of the internal capsule; VS, ventral striatum; BST, bed nucleus 
of the stria terminalis; ITP, inferior thalamic peduncle; NAc, nucleus accumbens; STN, subthalamic nucleus; MFB, medial forebrain bundle; SCC, 


subcallosal cingulate. 


Adapted from Curr Psychiatry Rep., 13(4), de Koning PP, Figee M, van den Munckhof P, et al. Current status of deep brain stimulation for obsessive-compulsive disorder: a 
clinical review of different targets, pp. 274-82, Copyright (2011), with permission from Springer Science Business Media, LLC; Biol Psychiatry, 76(12), Riva-Posse P, Choi KS, 
Holtzheimer PE, et al., Defining critical white matter pathways mediating successful subcallosal cingulate deep brain stimulation for treatment-resistant depression, pp. 963-9, 


Copyright (2014), with permission from Society of Biological Psychiatry. 


appropriate behaviour patterns. Disruption of this aberrant activity 
with NAC DBS could thus restore behavioural flexibility [5, 6]. 

Fourteen patients have received right unilateral NAC stimulation; 
19 patients have received bilateral NAC DBS. A positive effect of 
right unilateral NAC stimulation in three of the first four patients 
reported led investigators to undertake a larger and more rigorous 
trial, though only one patient in ten met the response criteria of 35% 
Y-BOCS improvement after 1 year of stimulation. Five additional 
subjects demonstrated at least 25% improvement [7]. Three case re- 
ports of bilateral NAC DBS describe 38-52% improvement after 15- 
24 months of stimulation. In the largest study of NAC DBS for OCD, 
9 of 16 patients (56%) were treatment responders after 8 months of 
stimulation [8]. Further improvement was seen when CBT was sub- 
sequently added, despite the fact that patients had failed CBT prior 
to DBS [9]. Importantly, four patients with ego-syntonic symptoms 
of perfectionism, a need for symmetry, reassurance-seeking, and 
hoarding averaged only a 10% improvement on Y-BOCS, providing 
additional information about the differential effects of this treat- 
ment on different clinical subtypes of OCD. 

Similar to VC/VS stimulation, hypomania is the most commonly 
reported side effect. Transient olfactory phenomena and increased 
libido were also reported. In the right unilateral cohort, three pa- 
tients experienced transient agitation due to stimulation. Notably 
across studies, discontinuation of stimulation was associated with 
a rapid and poorly tolerated return of negative mood beyond base- 
line depression ratings, disproportionate to the return of OCD 


symptoms. This reversed rapidly with resumption of stimulation. 
The necessity of dose adjustments or evidence of long-term stability 
of response are not yet reported. 


Medial forebrain bundle 


The MFB is a white matter tract that connects the ventral tegmental 
area and NAC. Based on the rapid antidepressant effects observed 
with MFB DBS for depression (see next section), two patients with 
OCD underwent MFB DBS surgery [10]. Both were reported to have 
acute improvement in affect and compulsions. While one patient 
met and maintained >35% improvement in Y-BOCS from the first 
month through the first year of stimulation, the other did not meet 
response criteria until month 12 of stimulation. In both cases, tran- 
sient increased heart rate and oculomotor effects typical of stimula- 
tion at this target were observed, but hypomania was not. 


Subthalamic nucleus 


DBS of the subthalamic nucleus (STN) is an accepted treatment 
for Parkinson’s disease. In a 2002 report on two patients with 
Parkinson's disease and comorbid OCD who were treated with STN 
DBS, Parkinsons symptoms improved, as did obsessions and com- 
pulsions, with Y-BOCS reductions of 58-64% 2 weeks after surgery 
and remaining improved over 1 year later. A third, independent case 
report described similar outcomes. 

In a randomized, double-blind trial of STN DBS for patients 
with severe OCD without Parkinson's disease, 12 of 16 patients 
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Fig. 19.3 Cortico-striato-thalamo-cortical loop. DLPFC, dorsolateral prefrontal cortex; LOFC, lateral orbitofrontal cortex; GPi, globus pallidus internus; 
VAmc, ventral anterior thalamic nucleus magnocellular portion; VApc, ventral antrerior thalmic nucleus parvocellular portion; DMpc, dorsomedial 
thalamic nucleus parvocellular portion; MOFC, medial orbitofrontal cortex; SNr, substantia nigra pars reticulata; VTA, ventral tegmental area. 
Reproduced from Neuroscience and Biobehavioral Reviews, 32(3), Kopell BH, Greenberg BD, Anatomy and physiology of the basal ganglia: implications for DBS in psychiatry, pp 


408-22, Copyright (2008), with permission from Elsevier Ltd. 


randomized to treatment (75%) experienced improvement of at 
least 25% after 3 months of active stimulation. Three of eight patients 
in the sham stimulation group responded similarly [11]. Transient 
mania or hypomania was reported in 6 of 16 (38%). Other side ef- 
fects were anxiety, dyskinesias/motor effects, and depressed mood 
and suicidal ideation, which occurred in one patient. Although 
most adverse events were transient, serious adverse events occurred 
in 65% of implanted patients. No long-term efficacy data are avail- 
able. That OCD symptoms improved rapidly in Parkinson's disease, 
unlike the slower course in OCD patients, across DBS targets likely 
reflects the pathophysiology of the syndrome in each illness. 


Inferior thalamic peduncle 


The ITP was conceived of as a DBS target for OCD because of its loca- 
tion in the CSTC loop, connecting the thalamus to the OFC. A positive 
experience with ITP DBS in a single patient with MDD who experi- 
enced good antidepressant effect for years following DBS surgery, and 
the fact that many pharmacological and surgical treatments for MDD 
also treat OCD, also influenced a proof-of-concept study of ITP DBS. 
Unlike other DBS targets for OCD, the ITP has the advantage of being 
well defined anatomically and readily identifiable neurosurgically, 
based on electrophysiological properties. Six patients (100%) with se- 
vere, treatment-resistant OCD responded to ITP stimulation, with a 
50% average decrease in Y-BOCS scores over 1 year [12]. Improvement 


was largely stable after 3 months of stimulation. Three patients were 
still enrolled in the study 3 years after implantation, all having main- 
tained responder status. No side effects were reported for stimulation 
on the target contact. However, stimulation above the target, near the 
fornix, induced confusion in one patient, which resolved when stimu- 
lation at that contact was stopped. Additionally, stimulation below the 
target, at or near the hypothalamus, routinely induced anxiety and 
dysautonomia. Hypomania was not observed at this target. 

Across targets, there is clear evidence that high-frequency stimu- 
lation of the orbitofrontal basal ganglia loop system can improve 
some symptoms of OCD. There has yet to be a systematic study 
comparing responders to non-responders at a given target or a com- 
parison of the different targets, although OCD symptom subtype 
appears to be important. PET studies of implanted patients showing 
increased perfusion in the OFC, subgenual anterior cingulate, stri- 
atum, pallidum, and thalamus [13] support CSTC modulation gen- 
erally as underlying the DBS effect; however, there is not yet a clear 
understanding of which pathways within the CSTC network me- 
diate either clinical efficacy or side effects. 


Major depressive disorder 


MDD is a chronic illness with an episodic course. International 
surveys have estimated the lifetime prevalence of MDD to be 8- 
12% typically, although this varies by country, with the lifetime 


prevalence as high as 17% in the United States. An estimated 27% 
of individuals will have three or more episodes, while approximately 
20% will experience a chronic, unremitting course. An estimated 
one-third of patients demonstrate some degree of treatment resist- 
ance. Definitions of treatment-resistant depression (TRD) include 
failure to respond to antidepressant medications of adequate dose 
and duration from multiple pharmacologic classes and/or ECT. The 
risk of a suicide attempt in patients with MDD is estimated to be 
15%, with a 2-12% risk of suicide completion, a risk that increases 
7-fold with lack of remission or recurrence of a major depressive epi- 
sode [14], highlighting the mortality risk in TRD particularly. 

The CSTC loop system has also been implicated in the underlying 
pathology of depression, but involving different cortical compo- 
nents than in OCD. Volumetric and functional abnormalities have 
been observed in the SCC and ventromedial and dorsolateral pre- 
frontal cortices (DLPFC), as well as the amygdala, hippocampus, 
and habenula [15]. Studies in non-human primates show white 
matter connections from medial prefrontal regions to visceromotor 
output regions, such as the hypothalamus and periaqueductal grey, 
to limbic areas, including the amygdala and NAC, and to midline 
thalamic structures. Functional imaging studies have shown over- 
activity in the SCC and amygdala, and underactivity in the DLPFC, 
NAC, and ventral striatum—abnormalities reversed with anti- 
depressant treatment. DBS for depression has been guided by these 
neuroimaging findings, as well as by an understanding of CSTC and 
reward system anatomy and function from anatomic and animal 
behaviour studies and from earlier experience with neurosurgical 
interventions for depression and OCD. 


Subcallosal cingulate cortex 


The first brain target for DBS for depression, and the one with the 
most evidence to date, is the SCC. The rationale for targeting the 
SCC neurosurgically was functional neuroimaging evidence of 
pathological hyperactivity of this area in depressed subjects and 
subsequent normalization of activity with antidepressant treat- 
ment. With prevailing theories of the mechanism of DBS being a 
functional inhibition of the target region, it was hypothesized that 
inhibition of a hyperactive SCC would enable release from a patho- 
logically negative mood state. Further, connections between the 
SCC and the insula, hypothalamus, and brainstem would enable 
changes in neurovegetative symptoms, while connections between 
the SCC and the medial prefrontal, orbitofrontal, cingulate, and sub- 
cortical structures would allow for changes in cognitive, hedonic, 
and motivational symptoms of depression. 

A total of 77 depressed patients (70 MDD, 7 bipolar type 2) from 
eight clinical sites have been reported to date, most in the context 
of open-label stimulation experiments, with follow-up of at least 
6 months and most for 1 year [16-22]. Clinical response across 
all studies has been defined as a 50% decrease in the Hamilton 
Depression Rating Scale (HDRS-17), with a score of <8 defining 
clinical remission. Combined results of all open-label studies dem- 
onstrate a response rate of 53% at 6 months, 47% at 12 months, 
69% at 24 months, and 60% at 36 months. Remission rates are 
27% at 6 months, 30% at 12 months, 39% at 24 months, and 40% at 
36 months. Thus, response/remission is maintained over time and 
appears to continue to improve over years. 

Retrospective analyses of response outcomes, compared to 
final target placement within the SCC grey matter, were unable to 
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attribute better outcomes to a more specific location within the SCC 
[23]. More recently, visualization and targeting of specific white 
matter tracts passing through, and adjacent to, the SCC have been 
associated with SCC DBS treatment response [24]. 

Across research groups, it has repeatedly been observed that even 
sustained antidepressant effects are lost in response to device failure 
or naturalistic depletion of the battery, evidence that ongoing stimu- 
lation is required, even years after implantation. There has been one 
attempt to study this as a double-blind trial in five SCC DBS pa- 
tients who were in stable remission over >3 months [25]. They were 
randomized to 3-month blocks of active or sham stimulation in a 
crossover design. During active stimulation, four patients remained 
in remission, while one patient lost remission but maintained anti- 
depressant response. During sham stimulation (discontinuation), 2 
of 5 patients maintained remission, one patient lost remission but 
maintained response, and two patients relapsed into a major de- 
pressive episode. One subject withdrew from the study during the 
sham phase due to relapse severity. These results support observa- 
tions that discontinuation of SCC stimulation typically results in a 
gradual return of depression symptoms over a period of weeks; a 
pattern distinct from the rapid change in negative mood described 
with discontinuation of stimulation at other targets. 

Stimulation-related side effects are uncommon, and no mania 
or hypomania has been reported. On the other hand, there are re- 
ports of acute effects of intraoperative stimulation, including an 
increased sense of calm, increased interest, and ‘lightness. Recent 
studies suggest that these effects are actually predictable with careful 
intraoperative testing, and their presence identifies the site for long- 
term stimulation. Refined analyses of the link between white matter 
tract location, stimulation-induced acute effects, and long-term effi- 
cacy are now emerging [24, 26]. Of the 77 patients described in these 
reports, there were six suicide attempts (8%) and three completed 
suicides (4%). 


Ventral capsule/ventral striatum 


Stimulation of the VC/VS for treatment of depression extended from 
earlier studies of VC/VS DBS for OCD and from observations that 
depressive symptoms decreased in these patients. This region of the 
internal capsule contains fibre tracts connecting the SCC to the thal- 
amus, and the ventral striatum is linked to motivation and reward. 
Thus, this target is positioned to affect the neurocircuitry underlying 
some of the key symptoms of depression. 

The initial open-label series described outcomes in 15 depressed 
patients (14 MDD, 1 bipolar type 1) from three clinical sites [27]. 
Response rates (decrease in HDRS-24 >50%) were 50% at 3 months 
and 47% at 6 months; remission rates (HDRS-24 <10) were 20% at 3 
and 6 months. Response and remission at the last follow-up (mean 
23.5 months) were 53% and 33%, respectively. In five subjects with 
at least 36 months of follow-up, three had responded and two had re- 
mitted, suggesting that, as with the SCC target, response is maintained 
over time. Subsequently, a double-blind, sham-controlled study ran- 
domized 30 patients to either active (n = 16) or sham (n = 14) stimula- 
tion for 16 weeks, followed by an open-label stimulation continuation 
phase [28]. After the 16-week blinded treatment phase, three subjects 
in the active stimulation group vs two subjects from the sham stimu- 
lation group met response criteria, failing to demonstrate efficacy of 
VC/VS stimulation. After 8 months of open-label stimulation, 20% 
of subjects met response criteria and 13% were in remission. This 
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increased to 23.3% response and 20% remission at 24 months. No 
subject who met responder criteria maintained response for the dur- 
ation of the follow-up period. In an effort to address study design 
weaknesses brought to light by the double-blind trial, another group 
performed a double-blind crossover trial following an open-label op- 
timization phase [29]. Twenty-five subjects at two sites underwent 
VC/VS DBS. Sixteen subjects participated in a blinded crossover 
phase. The open-label response rate was 40% (similar to the initial 
open-label reports), and there was a significant difference in depres- 
sion rating scores between the active and sham arms of the crossover 
phase, strengthening the evidence that the antidepressant effect was, 
in fact, due to stimulation. However, the optimization period was 
prolonged (52 weeks) and the crossover phase was foreshortened, 
because 75% of subjects had to be prematurely crossed over due to 
increased depressive symptoms, often within 1 day of stimulation 
discontinuation, highlighting the relatively rapid return of depressive 
symptoms with discontinuation of stimulation at this target. 

Intraoperative stimulation in some cases produced improved 
mood, decreased anxiety, and increased awareness, but also in- 
creased anxiety, sweating, speech perseveration, and facial motor 
effects. Outside of the operating room, two episodes of hypomania 
were reported in a patient with bipolar disorder, while one uni- 
polar depression patient experienced mania, which resolved with 
a change in medication and stimulation parameters. During the 
blinded phase of the later study, irritability, suicidal ideation, hypo- 
mania, and mania were described in 1-3 active stimulation subjects 
but were not observed in any of the sham stimulation subjects. In the 
70 subjects described in the VC/VS reports, there were nine suicide 
attempts (13%) and one completed suicide (1%). 


Nucleus accumbens 


The NAC’s central role in mediating motivation in reward is key 
to its appeal as a DBS target for both OCD and depression. If an- 
hedonia is a defining symptom of depression, modulating hedonic 
centres makes theoretical sense. The NAC is known to be involved 
in reward processing and reward-seeking behaviours, as well as in 
drive and motivation. Dysfunctional reward processing has been de- 
scribed in patients with depression, as has abnormal activity of the 
ventral striatum. Further, the NAC connects with motor, memory, 
and emotion processing centres, which provides a mechanism by 
which NAC stimulation may produce antidepressant effects beyond 
the modulation of anhedonia. 

A single study of 11 patients reported a 46% response rate and 
30% remission rate at 1 year, with response maintained for up to 
4 years [30]. Although patients reported being unable to detect when 
the stimulation was turned on, initial stimulation induced prompt 
positive behavioural changes and decreased depression scores. Side 
effects included erythema, anxiety, sweating, paraesthesiae, hypo- 
mania, and, in one case, psychotic symptoms. All such side effects 
resolved with parameter changes. At times, stimulation parameter 
changes were associated with acute (2 weeks), but not long-term, 
mood improvement. Discontinuation of stimulation produced a 
rapid return of depressive symptoms. One suicide attempt and one 
completed suicide were reported. 


Medial forebrain bundle 


From the experience with NAC DBS and the anhedonia/reward 
modulation rationale, attention turned to the MFB. The superolateral 


branch of the MFB (sIMFB) connects the ventral tegmental area and 
the NAC. It crosses under the thalamus and joins the ALIC, pro- 
jecting to the OFC and DLPFC. Animal and imaging studies have 
supported the MFB as a DBS target. In addition, hypomania that 
has been seen with STN stimulation for Parkinson's disease has been 
linked to current spread to the MFB, and electric field modelling has 
suggested that current spread from the SCC, VC/VS, or NAC may 
reach the sIMFB. 

In the first pilot study of seven patients given MFB DBS for TRD, 
six patients experienced an improvement in depressive symptoms 
within the first 2 days of stimulation [31]. After 1 week, four patients 
experienced 250% improvement in depression rating scores. After 
12 weeks, six patients met this response criteria, four of whom add- 
itionally met criteria for remission. In most patients, response was 
maintained once achieved. The longest follow-up reported for these 
patients to date is 33 weeks. All patients had acute intraoperative 
effects suggestive of improved mood and engagement. At higher 
stimulation amplitudes, all patients experienced blurred vision and 
strabismus, which limited stimulation parameters. No hypomania 
was reported in this sample. Notably, the only non-responder in 
this cohort had suffered an intraoperative haemorrhage, essentially 
lesioning the sIMFB bundle. 

An early report on replicability of these findings demonstrated 
that 3 of 4 subjects implanted with MFB DBS showed acute 
intraoperative effects, as well as a 50% reduction in depression 
scores at 1 week [32]. At 6 months, two subjects continued to be 
responders. Acute intraoperative effects and visual adverse events 
were as described in the initial study. Long-term follow-up and ran- 
domized controlled studies are not yet reported. 


Other targets 


In addition to these targets for DBS for depression, two additional 
regions have been targeted, each with a single case report. As de- 
scribed in the OCD section, one patient was successfully treated 
with DBS targeting the ITP. The other area targeted is the lateral 
habenula (LHb). The habenula is a small, midline glutamatergic 
structure that is well positioned to influence the brain regions im- 
plicated in depression symptoms and the neurotransmitter systems 
modulated by current pharmacotherapies. Convergent lines of evi- 
dence support a role for the LHb in negative reward prediction sig- 
nalling, a possible role in risk-avoidant behaviours, and link LHb 
hyperactivity with depressed mood states [33]. The single patient 
implanted at this target had a complicated, but ultimately successful, 
treatment course. 

Across targets, open-label stimulation has been shown to improve 
symptoms in severe and refractory depression. Failure of large-scale, 
double-blind trials to replicate these effects warrants consideration 
of alternate clinical trial designs to better account for the nature 
and time course of DBS treatment effect. There is clear evidence of 
target-specific differences in the time course of treatment and dis- 
continuation effects. Suicide is always a concern in the treatment of 
TRD, but this risk is not exacerbated by DBS. 


Eating disorders 
Anorexia nervosa 


Anorexia nervosa (AN) consists of persistent food restriction, an in- 
tense fear of gaining weight or behaviour that interferes with weight 
gain, and a disturbance in self-perceived weight or shape. Lifetime 


prevalence for women is estimated to be 1-4%, with a ten times 
greater prevalence in women than in men. It has the highest fatality 
rate of any psychiatric disorder, with 5.1 deaths per 1000 person- 
years. One in five AN deaths is from suicide [34]. It also has a high 
rate of comorbidity with mood and anxiety disorders—68% for 
MDD and 26% for OCD according to one estimate. 

The first reports of DBS for AN were in patients treated for MDD 
or OCD. One woman treated with SCC DBS for MDD, who had 
comorbid AN, maintained a relatively stable body mass index 
(BMI) after DBS surgery, even despite depressive relapses. An in- 
crease in disordered eating was successfully treated behaviour- 
ally, and no further interventions related to weight in the 3 years 
of follow-up were needed. A woman with AN who received ALIC 
DBS for OCD reported feeling differently about food after DBS. She 
ate more and was less distressed about weight, caloric intake, and 
sweets. Her BMI after surgery and through 3 years of follow-up was 
in a healthy range. The authors did not report on OCD symptom 
improvement. 

In the first case series of DBS specifically to treat AN, the SCC 
DBS target was chosen, based on imaging studies showing similar- 
ities in SCC activity in AN and MDD. In addition, treating mood 
and anxiety comorbidities in patients with AN improves outcomes 
over treating weight alone. Six adult women with 3-37 years of 
AN were treated with SCC DBS [35]. All women completed a be- 
havioural treatment programme preoperatively to improve their 
weight and to be healthy enough for surgery. After 9 months of 
stimulation, three of six women had maintained their weight at 
or above their preoperative baseline. In the other half, weight had 
decreased to near their historical baseline. The three patients who 
had maintained healthier weights all had comorbid MDD, and all 
experienced significant improvement in MDD symptoms. These 
three women also had comorbid OCD, and two were OCD re- 
sponders as well. Of the four total women who had a depression 
rating scores in the severe range, all were MDD responders. Five 
of the six women met criteria for OCD at baseline, of whom three 
were OCD responders. After 6 months of stimulation, PET imaging 
demonstrated decreased SCC and medial frontal activity, decreased 
insula activity, and increased parietal activity, similar to findings in 
SCC DBS for MDD. 

In two Chinese samples, NAC lesions (n = 6) and NAC DBS (n = 2) 
were performed in adult women with a BMI of <14.5, and NAC DBS 
in an additional four women aged 16-17 with a BMI of <13.3. In all 
cases, BMI improved to 18-22, and comorbid depression, anxiety, 
and obsessive-compulsive symptoms improved. Weight changes 
tended to stabilize after 6-12 months of stimulation. No serious ad- 
verse events were reported. 


Obesity 


While DBS for AN would seem a logical clinical extension of the use 
of DBS, given the rates of comorbid depression and OCD and the 
similarities in imaging findings between these illnesses, the rationale 
for the use of DBS for disorders of overeating comes primarily from 
animal models of appetite, homeostasis, and self-regulation. Lesions 
and high-frequency stimulation of the lateral hypothalamus (LHA) 
decrease food intake in animal models. Given rising obesity rates 
worldwide, including an estimated 6.6% of persons in the United 
States having a BMI of >40, it is perhaps no surprise that innovative 
solutions are being sought to address this epidemic. 


CHAPTER 19 Development of brain stimulation 


The first individual to undergo DBS implantation specifically for 
obesity was reported by Hamani and colleagues in 2008 [36]. In 
the course of their intraoperative exploration of the hypothalamic 
target, they serendipitously elicited remarkable memory effects. Six 
months of high-frequency stimulation (130 Hz) did not result in 
any weight change. With low-frequency stimulation (50 Hz) over 
5 months, the patient lost 12 kg, without intentional changes to diet 
or exercise. However, after this weight loss, the patient began to turn 
the stimulation off in the evenings due to a desire to eat, and he re- 
gained the weight. Following this, three patients were implanted 
with DBS in the LHA, with outcomes reported for up to 39 months 
[37]. Stimulation raised the resting metabolic rate in two patients. 
Chronic stimulation on optimized settings for 9-11 months resulted 
in a modest weight loss of 12.3% and 16.4%. The third patient did 
not demonstrate an altered resting metabolic rate and experienced 
negligible weight loss over 16 months. Experientially, patients re- 
ported that the best contact settings resulted in a decreased urge 
to eat, which returned when stimulation was discontinued. Acute 
stimulation effects included nausea, feeling hot or flushed, anxiety or 
panic, and increased arousal or activity. Chronic stimulation did not 
result in any changes in nutritional, hormonal, or neuroendocrine/ 
neuropeptide changes. Psychological testing showed no worsening 
of binge eating, cognitive restraint, increased hunger, or worsening 
of body image or quality of life. No significant adverse events were 
reported over an average follow-up period of 35 months. 


Addiction 


Substance use disorders (SUDs) have high prevalence and high re- 
lapse rates, despite several proven and effective treatments. In animal 
models of addiction, behavioural changes have been reported after 
stimulation of the NAC, LHb, medial prefrontal cortex, and STN. 
Anecdotally, STN DBS for Parkinson's disease has been described 
to alleviate impulsivity and behavioural manifestations of dopamine 
dysregulation syndrome (hypomania, gambling, hypersexuality, 
punding), as well as drug use [38]. Retrospective observations in pa- 
tients receiving NAC DBS for Tourette syndrome describe decreased 
use of nicotine [39]. 

Despite highly developed animal models to explain the neuro- 
biology of addictive behaviours and behaviour changes in response 
to stimulation in these animals, the results in human case series are 
less compelling. Patients selected for DBS, to date, include the most 
chronic and treatment-resistant patients, all of whom had prior 
treatment, including multiple admissions to rehabilitation pro- 
grammes, agonist therapies (such as methadone), and negative re- 
inforcement treatments such as disulfiram. In NAC DBS for opiate 
dependence in two patients, levomethadone dose and opiate crav- 
ings were reduced over a period of 1 year; however, concurrent use 
of other drugs continued, and one patient relapsed on heroin [40]. 
In five patients who received NAC DBS for alcohol dependence, all 
reported decreased craving; however, only two patients remained 
abstinent [41]. In a single-case report on NAC DBS for cocaine de- 
pendence [42], after 2.5 years of chronic stimulation, the patient 
markedly decreased cocaine use but was not abstinent. As expected, 
given NAC DBS effects in other disorders, in the cases described, 
decreased depressive symptoms and possible parameter-dependent 
hypomania were reported. 

Persistent use of substances despite reduced cravings high- 
lights the biggest challenge for trying to treat addiction with 
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neuromodulation of a discrete brain target. While NAC DBS may 
reduce cravings, SUDs involve more complex behaviours than just 
reward system dysfunction, with long-standing habits and psycho- 
social factors that make abstinence difficult and which cannot be 
captured in animal models. 


Alzheimer’s disease 


AD is a progressive dementia marked by a gradual, but progressive, 
decline in memory and cognition, associated with accumulation 
of beta amyloid plaques and neurofibrillary tangles in the brain. 
The prevalence of dementia is roughly 24 million cases worldwide 
and rising, with approximately 70% of cases attributed to AD. The 
highest rates are in North America and Western Europe. Current 
treatments do not significantly alter disease progress. 

The first series of six patients with mild AD to undergo DBS fol- 
lowed an unexpected intraoperative finding that stimulation near 
the fornix evoked vivid memories and improvement in verbal re- 
call and spatial associative learning [36]. Based on this incidental 
finding in an individual without cognitive impairment, the same 
group performed DBS at the same fornix/hypothalamic target in 
six individuals with mild AD [43]. The fornix is a large white matter 
tract connecting the hippocampus and the medial temporal lobe. 
Lesions here disrupting the Papez circuit are known to produce 
memory deficits, underscoring its importance in memory func- 
tion. Of the six patients implanted, only two had the intraoperative 
stimulation-related recall events demonstrated by the index case. 
The primary outcome measure was the cognitive subscale of the 
Alzheimer’s Disease Assessment Scale (ADAS-cog). On average, 
the ADAS-cog score increased by 4.2 points after 1 year of stimu- 
lation, an indication of worsened cognition and disease progres- 
sion. However, as the expected rate of decline may be closer to 6-7 
points per year, this outcome may represent a slowing of expected 
disease progression. The absence of symptoms of hypothalamic 
dysfunction or serious adverse events prompted the undertaking 
of a larger double-blind, sham-controlled trial of 42 patients [44]. 
While there was comparable worsening of cognition after 1 year of 
active or sham stimulation, subanalysis revealed that AD patients 
aged <65 years receiving active stimulation had worse cognitive 
scores, compared with those receiving sham stimulation. Patients 
aged >65 years did appear to have some benefit from active vs sham 
stimulation. 

A second group of six patients with mild to moderate AD under- 
went DBS targeting the nucleus basalis of Meynert [45]. It should 
be noted that the specific subregion being targeted—the Ch4 
subdivision—was sometimes not able to be stimulated directly 
due to regional anatomy, thus demonstrating one of the challenges 
of DBS, that is maximizing benefit while minimizing side effects, 
a particular challenge in small brain areas where undesirable 
stimulation in nearby regions is unavoidable. Stimulation settings 
chosen by this group included low-frequency stimulation (20 Hz), 
in an effort to exert a potentially excitatory effect on target neurons 
and stimulate acetylcholine release. No acute stimulation effects 
were reported. After 11 months of stimulation, the patient group 
experienced an average increase of 3 points on ADAS-cog. This 
again indicates worsening of cognition but may represent slowing 
of expected disease progression. No serious adverse events were 
reported. 


There are unique challenges to pursuing DBS within psychiatry. 
Psychiatric illnesses are complex syndromes with biological and 
psychological drivers, currently defined categorically, without bio- 
logically informed criteria. The complexity of human behaviour 
cannot be reduced to a single circuit, any more than it can be re- 
duced to a single neurotransmitter. Our current understanding of 
focal neuromodulation is better suited to treat specific symptoms, 
rather than broad diagnostic syndromes. 

Unlike in DBS for Parkinson's disease, where treatment effect on 
motor symptoms can be fully realized almost immediately, clinical 
effects of stimulation on psychiatric symptoms appear to generally 
take weeks to months, with benefits continuing to accumulate over 
months to years. This time course does seem to be target-specific, 
with MFB DBS producing more rapid symptom improvement, but 
not disease-specific or procedure-specific, with similar response tra- 
jectories reported for OCD and MDD following DBS and surgical 
lesions. Understanding the time course of treatment response has 
implications for understanding the mechanism of action of DBS, as 
well as for timing supportive interventions such as psychotherapy. 

As reviewed here, much of the experience with DBS for psychi- 
atric disorders to date comes from individual case studies and series. 
Long-term studies suggest that there is good reason to be optimistic 
about DBS as a treatment for severe, treatment-refractory psychi- 
atric illness; however, traditional double-blind, sham-controlled 
clinical trials have been disappointing. Small sample size and sub- 
optimal control for placebo effects are frequent criticisms of DBS 
trials in psychiatry. Small samples exacerbate the difficulty of 
designing optimal dose-finding trials, as not only are there many 
parameters to adjust (pulse width, current, frequency), but the target 
itself may also have considerable inter-individual variability, pre- 
cluding the use of standard co-ordinates to guide surgical targeting. 
One strategy to mitigate concerns about placebo effects is the use of 
a blinded, staggered-onset of chronic stimulation. The advantage of 
this approach is the ability to evaluate for a response to sham stimu- 
lation without losing statistical power by withholding the interven- 
tion entirely from a portion of the subjects. Blinded discontinuation 
designs similarly preserve statistical power. The other advantage of 
a blinded discontinuation design is that it allows for optimization 
of stimulation parameters to occur at the beginning of stimulation. 
Discontinuing stimulation only after a period of stability has been 
achieved strengthens the argument that a decline in response is due 
to the loss of stimulation. The drawbacks to this approach, however, 
include difficulty in disambiguating whether symptom emergence 
is truly a return of the illness state vs a stimulation withdrawal syn- 
drome, and also the inability of many subjects to tolerate abrupt dis- 
continuation in some, but not all, of the proposed targets. Reports 
from multiple targets, most notably the NAC and VC/VS, describe 
dramatic worsening of symptoms in a short time after discontinu- 
ation, including accidental discontinuation. Such symptoms can 
include suicidal ideation. These challenges highlight the need for 
thoughtful trial design that takes into account the circumstances 
particular to DBS as a procedure, as well as the typical patterns of 
response to stimulation and discontinuation at specific targets. DBS 
remains a promising intervention for treatment-resistant psychi- 
atric disorders, not only due to its clinical application potential, but 


also because the discrete modulation of neural circuits sheds light on 
the underlying neurobiology of these complex illnesses. 
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Adherence to treatment is crucial for achieving optimal outcomes 
[1-3]. Non-adherence to medicines prescribed for mental health 
conditions is thought to be a common reason for poor treatment 
outcomes. Reviews of the prevalence of non-adherence in mental 
health disorders estimate that 25-60% of medicines prescribed for 
these conditions are not taken as advised [4-6]. Non-adherence is 
not, of course, unique to psychiatry. It is ubiquitous in medicine, 
with the WHO estimating that about half ofall medicines prescribed 
for long-term conditions are not taken as prescribed [6]. The conse- 
quences of non-adherence are great, both for the individual, society, 
and the health system [7]. In psychotic disorders, such as schizo- 
phrenia, poor adherence is associated with a 3.7 times higher risk 
of relapse, compared to those who are adherent [8], a greater risk of 
hospitalization and suicide [7], and an almost three times increase 
in external service costs [9]. In mood disorders such as depression or 
bipolar disorder, the statistics are similar. Patients who discontinue 
their medication early have a higher risk of relapse, hospitalization, 
and suicide [10]—in a 1-year period, those who were non-adherent 
to mood stabilizers had an almost four times higher risk of hospi- 
talization than those who were adherent [11], with associated costs 
being four times or more in non-adherent vs adherent individuals 
[12, 13]. 

These figures are, however, reported from published clinical 
studies, and we should be cautious how we interpret or extrapolate 
these results, as non-adherence rates and associated consequences 
vary widely between individuals, across different mental health 
conditions, and according to how adherence is defined and meas- 
ured. Nevertheless, non-adherence is an issue which needs to be 
addressed by those providing, receiving, or funding health care. It 
not only entails a waste of resources, but also a missed opportunity 
for health gain [3]. It is the missing link between effective treatment 
and outcomes in psychiatric care [1, 2]. In a world where resources 
are limited, it is imperative that we address non-adherence if we are 
to bridge the gap between treatment efficacy and clinical outcome 
[3, 6]. 

Yet despite the vast amount of adherence research published in 
the last few decades, non-adherence remains a key challenge in 





clinical practice. Adherence and non-adherence have been defined 
in various ways, with multiple terms used to describe a similar con- 
cept [14], ranging from the traditional paternalistic view of ‘com- 
pliance’ with orders from health care providers to ‘adherence’ with 
an agreed plan between the patient and the health care team [6]. 
Currently, adherence is defined by the WHO as ‘the extent to which 
a person's behaviour (such as taking medication) corresponds with 
the agreed recommendations from a healthcare provider. More re- 
cently, this overarching concept has been refined by considering 
three stages of adherence: initiation, implementation, and persist- 
ence [14]. Non-adherence can occur at each of these stages. The pa- 
tient may not initiate treatment (termed primary non-adherence), 
or they may fail to take the prescribed regimen correctly—where 
the medication is not taken at the right time, dose, or frequency, or 
the treatment may be discontinued prematurely (non-persistence). 
Non-adherence may be used as an umbrella term for each of these 
specific behaviours [15]. Recent research in schizophrenia seems 
to suggest that not only is adherence important per se, but the pat- 
tern of non-adherence may also be significant—different patterns 
of medication-taking have been associated with different outcomes 
[2, 16]. 

Achieving true adherence, where the patient and the provider 
agree to a particular treatment plan, is challenging in psychiatry, as 
there are unique mitigating factors such as mental health legislation, 
stigma, discrimination, social isolation, lack of insight, comorbid 
substance abuse, and cognitive impairment to consider [7, 17]. 


Challenges in psychiatry 


Although non-adherence is common to all health conditions where 
long-term adherence is key to outcomes, there are unique challenges 
in psychiatry. Firstly, patients commonly do not perceive themselves 
to be ill. The WHO International Pilot Study on Schizophrenia 
found that lack of insight occurred in 98% of patients and was the 
most common symptom of schizophrenia [18]. This is similar in pa- 
tients with bipolar disorder [19]. This lack of illness insight affects 
not only adherence, but also engagement with health services in 
general [20, 21]. People who do not perceive themselves to be unwell 
fail to recognize the need for treatment and care, and as such, poor 
insight has frequently been associated with non-adherence [8, 20]. 
Many treatment benefits in psychiatry are silent and long-term, 
which further reinforces the lack of insight. The ‘no symptoms, no 
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problem view of health is intuitive; thus, adherence is particularly 
challenging when there is minimal insight and understanding of the 
illness [20, 21]. Moreover, non-adherence may not lead to an im- 
mediate deterioration—relapses may not occur for months, or even 
years, after stopping medication [21]. Likewise, treatment benefits 
are often delayed—drugs for depression or psychosis usually take 4- 
6 weeks before a perceivable benefit is attained. Together, these fac- 
tors reinforce the perception that medication may not be necessary. 
As such, early discontinuation of medication is acommon manifest- 
ation of non-adherence in psychiatry [21], as treatment is stopped 
when patients judge that the condition has improved. 

Secondly, even when patients do gain illness insight, they may 
not have the capacity to adhere to treatment [2]. This may be due to 
cognitive impairment that can accompany mental health disorders, 
the illness itself, or their psychosocial circumstances, which make 
establishing a regular medication-taking routine or affording long- 
term medication difficult [2, 22]. Comorbid intellectual disability or 
substance misuse are other important factors to consider in psych- 
iatry which can influence treatment adherence [20]. 

Shared decision-making and therapeutic alliance between the 
clinician and the patient have been shown to be important medi- 
ators of adherence—adherence is likely to be higher when there is a 
positive, trusting clinician-patient relationship [20, 23]. However, 
this can be difficult to achieve in psychiatry when treatment is 
started in a compulsory treatment setting and when there is mental 
health legislation governing this process [24]. In a survey of 104 
individuals with schizophrenia, over one-third reported fear of 
coerced treatment as a barrier to seeking treatment; the use of re- 
minders or warnings about the consequences of non-adherence 
were perceived negatively as pressures to adhere and were reported 
as increasing barriers to treatment [25]. Lastly, stigma associated 
with mental health conditions has consistently been shown to be 
an adherence barrier [22]. This is common across all mental health 
conditions, with patient perceptions of stigma influencing treat- 
ment behaviours and adherence negatively [26]. These unique 
challenges need to be considered if we are to promote adherence 
in psychiatry. 


Interventions to improve adherence 


in psychiatry: what lessons can be learnt 
from trials of adherence support? 





What interventions are effective? A review of 20 years 
of adherence research 


Although the problem of non-adherence is well recognized globally, 
there continues to be no effective solution [27]. A vast amount of 
research have examined the reasons for non-adherence, and many 
adherence interventions have been tested in RCTs; yet a sustainable 
and effective solution remains elusive. In the most recent Cochrane 
review of adherence interventions, 182 published RCTs were in- 
cluded [27], of which 29 were on psychiatric disorders—primarily 
in schizophrenia and related psychotic disorders, followed by de- 
pression. The quality of these RCTs were low; of these 29 RCTs, only 
one [28] was considered to have a low risk of bias. Even when all 
studies are taken into account, the effects of these interventions on 
adherence are mixed and non-consistent. 


There is a general lack of convincing evidence to support one par- 
ticular intervention over another; where one type of intervention 
demonstrates positive effects on adherence, the same intervention in 
a similar population will report negative results. For example, com- 
pliance or adherence therapy—a cognitive behavioural intervention 
using techniques from motivational interviewing to discuss adher- 
ence—was reported to improve adherence in patients with psychosis 
in two studies [29, 30] but failed to demonstrate any effect in two 
other studies [31, 32]. The wide variability of studies, in terms of 
study populations, conditions, setting, intervention design, and ad- 
herence and outcome measurement, make it difficult to draw com- 
parisons between the different types of interventions. Even when 
effects are seen on adherence, the benefits are relatively small and 
effects on clinical outcomes frequently not assessed. Questions on 
the sustainability and feasibility of implementation into practice also 
remain. Many of the psychiatric adherence interventions tested are 
typically very complex—involving multiple resources and intensive 
follow-up—something that may not be able to be delivered in a real- 
world clinical setting. Furthermore, details on the actual content and 
delivery of the interventions are commonly not described in suff- 
cient detail to replicate in practice. These limitations that currently 
exist with adherence intervention literature make it difficult to de- 
sign effective interventions in practice; there is a need for more in- 
novative approaches to adherence. 


Lessons learnt from research—there is no ‘one size fits all’ 
for adherence interventions 


Adherence interventions which have been tested in psychiatry in- 
clude adherence therapy, education, telemedicine, shared decision- 
making, family therapy, and pharmacist-led education or medicines 
management [27]. As the literature highlights, interventions and 
outcomes are varied and many different approaches have been tried 
to address non-adherence. There are learnings that can be gained 
from these 20 years of adherence intervention research. Although 
there is no evidence to support one particular intervention over an- 
other, interventions which have been successful in improving ad- 
herence usually consist of: (1) multiple components; (2) ongoing 
support tailored to address barriers to adherence; and (3) delivery 
by health care professionals, with additional support from the family 
or peers [33-36]. Clues can also be provided from adherence studies 
that investigate relationships between particular patient or illness 
factors and adherence; although such findings provide little infor- 
mation on how to design effective interventions, they can be useful 
to help identify groups at risk of non-adherence. For example, pa- 
tients who lack a support system or have comorbid substance abuse 
are at a higher risk of non-adherence [20, 21]. This can help target 
and prioritize adherence interventions to people who need it the 
most, particularly in limited-resource settings. 

However, to design effective interventions on an individual level 
in practice, one must look beyond population studies and focus 
on the individual. Study findings may not be easily extrapolated to 
daily practice. An intervention that may be successful within a par- 
ticular study population may not be effective for a particular patient. 
Although some interventions have been able to improve medication 
adherence, the exact components of the interventions which con- 
tributed to its effectiveness are often not known [27]. The lack of 
detailed, consistent reporting and measurement makes it difficult to 
draw definitive conclusions to apply in practice. Previous reviews 


of interventions to improve adherence to psychotropic medicines 
echo these findings [33-35]. As Kane et al. describes—the idea of a 
‘standardized, universally valid and reliable’ approach to promoting 
adherence is a myth—individualized interventions tailored to the 
individual are likely needed to improve adherence [21]. 


Non-adherence: towards behaviourally 





intelligent solutions 


Adherence is a behaviour, not a trait 


With the vast amount of adherence research, it might seem a wonder 
why non-adherence still exists. Yet the problem might lie in the 
approach that has been taken in the past when investigating non- 
adherence—the limitations of this approach have been discussed 
previously in Section 2. Prior to the 1970s, much of the research fo- 
cused on the notion of the ‘non-adherent patient —where attempts 
to explain non-adherence used ‘easily identified and quantifiable di- 
mensions’ such as patient, regimen, or illness characteristics [37]. 
Yet the ‘non-adherent patient’ is a myth, as most of us are non- 
adherent some of the time [15]. Associations observed between 
sociodemographic variables, such as gender and age, and adherence 
are neither clear nor consistent [15]. 

Non-adherence is therefore best understood as a variable behav- 
iour, rather than a trait characteristic. For example, studies have 
reported ethnicity to be a significant predictor of antipsychotic 
medication non-adherence [38, 39]; yet other literature reports that 
ethnicity is not a consistent predictor [20]. Likewise, various per- 
sonality traits have been linked with non-adherence. Extraversion 
was significantly associated with non-adherence in one study [40], 
but not another [41]. While understanding some of these factors is 
important, focusing on sociodemographic factors alone will not fix 
non-adherence. This is because non-adherence is a feature of the 
way the individual interacts with their treatment, rather than any 
particular characteristic of the patient themselves. Non-adherence 
varies not only between individuals, but even within the same in- 
dividual over time. Non-adherence does not arise from irrational 
or misguided behaviour—more often than not, the patient goes 
through a knowledge process and the way they act come from the at- 
titudes they develop about their condition and medicines, and their 
personal health experiences [21]. 

Indeed, patient attitudes, illness perceptions, and health beliefs 
have been demonstrated to be one of the strongest predictors of ad- 
herence [42-44]. Beck et al. investigated factors influencing adher- 
ence in patients with schizophrenia or schizoaffective disorder and 
found that patients who viewed antipsychotics as necessary for their 
treatment had higher adherence; conversely, those who had nega- 
tive attitudes about medicines in general and were concerned about 
antipsychotics had poor adherence [43]. The authors called for ad- 
herence interventions to focus on treatment attitudes, rather than 
general education about the illness. These findings make sense when 
adherence is viewed as a health behaviour—attitudes and beliefs 
are more likely to be important mediators of behaviour. Adherence 
interventions should therefore focus on behaviour change principles 
and techniques. New strategies need to build on this concept of ad- 
herence as a health behaviour to allow effective interventions to be 
developed. Learnings from years of research show that there is no 
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‘adherent personality type and a ‘one size fits all’ adherence inter- 
vention. There is a need to develop more effective ways of tailoring 
support to meet the needs of individuals if we are to improve adher- 
ence in a sustainable fashion. 


Adherence is a product of motivation and ability 


In order to tackle this large-scale problem of non-adherence, it is 
important to understand why non-adherence occurs from an in- 
dividual patient perspective. Firstly, non-adherence may be inten- 
tional (for example, when we decide not to take the treatment or to 
take it in a way which differs from the recommendations) and/or un- 
intentional (for example, when we want to follow the recommenda- 
tions but lack the capability or opportunity to do so). The easiest 
way to think of this is to consider adherence behaviour as being 
two-pronged—patients do not adhere to treatment because: ei- 
ther (1) they do not want to or (2) they are not able to. How non- 
adherence arises therefore relates to two components which drive 
the behaviour, respectively—motivation and ability [15]. This ability, 
in turn, is affected by the individual’s environment—both internal 
factors (for example, knowledge and physical capability to take the 
medication on time) and external factors (for example, aspects of 
our environment affecting access to treatment such as not having 
easy access to a pharmacy) [15, 45]. 

This forms the basis of the Perceptions and Practicalities Approach 
(PaPA) to supporting optimal adherence [15], which has been ap- 
plied to designing interventions in psychiatric disorders [2] and in 
the National Institute for Health and Care Excellence (NICE) medi- 
cines adherence guidelines [46]. 


Perceptual or practical barriers to adherence: a 
Perceptions and Practicalities Approach to designing 
patient-centred adherence interventions 


The PaPA approach to adherence support derives from an analysis 
of the types of reasons why people do not take their medicines 
(Fig. 20.1). Recognizing that adherence is a product of motivation 
and ability, PaPA stipulates that adherence interventions should 
address both perceptual factors (for example, beliefs about the 
illness and treatment) as well as practical factors (for example, 
the ability to remember the medicine, establish a daily routine, 
and organize or pay for medication supply) [46]. Adherence 
support should be tailored to the needs of the individual using 
a menu-based approach where specific intervention components 
are selected to address specific perceptual and/or practical bar- 
riers. The approach also takes account of the social and environ- 
mental factors that provide the context around the interaction of 
the patient with the treatment [15] (Fig. 20.1). The importance 
of external factors (for example, social and environmental fac- 
tors) is described in Michie et al’s COM-B conceptual framework 
for key determinants of behaviour [47]. COM-B describes three 
components—Capability, Opportunity, and Motivation—which 
act together to influence behaviour. Capability and opportunity 
reflect a person's ability; opportunity can also comprise the ex- 
ternal factors that make the behaviour possible or prompt it, and 
thus relates to how environmental and social factors might influ- 
ence adherence. Together, these relate similarly to the motivation- 
ability paradigm described previously. 

Based on PaPA, adherence support targeted at the level of the in- 
dividual should be tailored to address the specific perceptions (for 
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Fig. 20.1 Figure depicting the interaction between perceptual and practical factors on adherence behaviour. 


Reproduced from Horne R, Weinman J, Barber N, et al., Concordance, adherence and compliance in medicine taking: Report for the National Co-ordinating Centre for NHS Service 
Delivery and Organisation R & D (NCCSDO), Copyright (2005), with permission from National Institute of Health Research. 


example, beliefs and emotions) and practicalities (for example, cap- 
ability and resources) influencing a persons motivation and ability 
to adhere, respectively [15, 46]. As shown in Fig. 20.1, perceptions 
and practicalities can overlap. For example, motivation may help 
the individual overcome limitations in capability and opportunity, 
which might, in turn, influence motivation—hence, the model is de- 
picted as a Venn diagram, rather than two discrete circles (Fig. 20.1). 


Adherence is a complex behaviour with multiple determinants— 
both internal and external (see Fig. 20.2 for a summary). The ‘in- 
ternal’ factors influencing motivation and ability may be moderated 
by ‘external’ variables, such as the quality of communication be- 
tween the patient and the health care provider [23], and by the wider 
societal contexts such as access to treatment and mental health 
legislation [24]. 
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Fig. 20.2 Conceptual map of determinants of adherence. 


Reproduced from Horne R, Weinman J, Barber N, et al., Concordance, adherence and compliance in medicine taking: Report for the National Co-ordinating Centre for NHS 
Service Delivery and Organisation R & D (NCCSDO), Copyright (2005), with permission from National Institute of Health Research. 


When considering adherence interventions, various aspects of 
this adherence behaviour can be targeted. Motivation and ability 
can be considered separately to help design adherence interventions, 
based on what factor or factors are driving the behaviour. For ex- 
ample, interventions to improve a patient's ability to adhere (such as 
improving access to treatment) will fail if the patient does not want 
to take the medication (such as when the patient has already decided 
against the treatment). Understanding what drives a patient's deci- 
sion to adhere or not to adhere is key to addressing non-adherence. 

The following sections explore these two drivers of non- 
adherence—perceptions and practicalities—in greater detail. 


Perceptions: the role of general and specific beliefs about 
medication in influencing decisions about treatment 


Beliefs about medicines can havea significant impact on adherence— 
simply providing information is often insufficient to achieve ad- 
herence to psychotropic medication [21, 35]. The literature on the 
effects of education and information provision on adherence shows 
that giving information about treatment benefits and harms may 
not impact on decisions to start and continue medication, even 
though knowledge is increased [48]. One study investigating ad- 
herence to tricyclic drugs found that giving treatment leaflets to pa- 
tients alone, with no counselling, had no effect on adherence [49]; 
this is supported by a later systematic review which found educa- 
tion interventions demonstrated no clear benefit on adherence and 
depression outcomes [33]—an association also seen in other psy- 
chiatric conditions [43, 50]. Although the quality of the studies do 
not allow definitive conclusions, together these suggest that infor- 
mation provision and increasing knowledge alone do not overcome 
non-adherence [15]. 

Qualitative studies show that many people seem to hold proto- 
typic beliefs about medicines and their capacity to produce harm, 
as well as benefit, and beliefs about the appropriateness of doctors’ 
prescribing of medicines [51, 52]. These beliefs exist even before a 
person takes the medication. A review of knowledge and beliefs held 
by members of the public about psychotropics found that views were 
generally very negative; this contrasted with positive views about 
medication for physical disorders [53]. Many were concerned about 
dependence, lethargy, and brain damage [52, 53], perceiving psycho- 
tropics as harmful, addictive substances that should not be taken for 
long periods of time but tend to be overprescribed by doctors [51]. 

These beliefs can influence how information is interpreted by pa- 
tients, what their experiences are, and how they act as a result of this 
information [15, 53, 54]. Indeed, negative attitudes and beliefs to- 
wards medicines used in psychiatry appear to be stronger predictors 
of non-adherence than demographic or psychosocial factors [20, 
42-44]. These attitudes are linked to wider concerns about scientific 
medicine, a lack of trust in doctors, and an increasing interest in al- 
ternative or complementary health care [51, 53]. People also seem to 
vary in their perceptions of personal sensitivity to medicines [55], 
with some being more concerned than others about their response 
to medication. 

These general beliefs influence the way in which people evaluate 
a specific medication prescribed for a particular condition [51]. 
These beliefs can affect a person’s initial expectations of the out- 
come of taking a medication, as well as how any subsequent events 
are interpreted—for example, whether symptoms experienced are 
attributed to the illness or the medication [56]. These beliefs may 
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even influence the clinical outcome directly via the ‘placebo/no- 
cebo effects of active drugs—terms describing the phenomenon of 
having beneficial or harmful effects occur when people have posi- 
tive or negative expectations about the medication, respectively 
[57]. Beliefs can also be specific for a particular medication—for 
example, concerns around weight gain and extra-pyramidal side 
effects with dopamine antagonists [21, 58] have been linked with 
poorer adherence. 


Perceptions: the role of perceived necessity and concerns 
in influencing decisions about treatment 


Once a person has started a new medication, they will begin to form 
particular beliefs and attitudes towards the treatment, based on their 
initial and subsequent evaluation of the medication. This evalu- 
ation process is captured by the Necessity-Concerns Framework 
[51]. The framework suggests that the motivation to start and per- 
sist with treatment is influenced by the way the individual judges 
their personal need for the treatment, relative to their concerns about 
potential adverse effects. For example, a 2-year prospective study 
of 254 patients recovering from first-episode psychosis found that 
the risks of non-adherence was 1.75 times higher in those whose 
belief in the need for treatment was less, and 2.88 times higher in 
those who thought medication was of low benefit [59]. Similarly, 
in a study of 223 individuals prescribed medication for bipolar dis- 
order, the odds of having poor adherence was twice as high in those 
who had stronger concerns about the negative effects of medication 
than those who did not—a finding that was independent of mood 
state, illness, and demographic characteristics [44]. Other studies 
involving patients from a wide range of other conditions have con- 
sistently found similar results—that poor adherence is related to 
doubts about personal need for medication and concerns about po- 
tential adverse effects [60]. 

Perceived necessity of a treatment is, however, not related to be- 
liefs about treatment efficacy. Although views about medication 
efficacy are likely to contribute to perceived need, the two are not 
synonymous. For example, perceived necessity can be influenced by 
illness beliefs—a patient might believe that a treatment is effective 
but may not perceive a personal need for the treatment. A common 
situation where this might occur is when the patient lacks illness 
insight. In this case, the patient may not believe they need any treat- 
ment, regardless of its perceived efficacy. Beck et al. explored the 
beliefs about drugs for psychosis in 150 outpatients and found that 
patients who were aware of their illness were more likely to adhere 
due to a greater perceived necessity of the treatment [43]. Conversely, 
a patient might perceive a strong need for a treatment, even though 
they believe it is only moderately effective—for example, if it is the 
only treatment that is available or acceptable to the patient. This may 
be seen with ‘natural’ remedies such as vitamins or herbs [53] where 
the patient may express a strong need to take these, instead of the 
prescribed medicine, despite believing these are only moderately 
effective. 

In terms of perceived concerns, there is much overlap in the type 
of concerns that patients report about medicines, regardless of the 
medication type. The experience of symptoms as medication ‘side 
effects’ and the disruptive effects of medication on daily living and 
quality of life are commonly reported concerns [56, 61]. Many pa- 
tients receiving regular medication who have not experienced ad- 
verse effects worry about possible problems in the future—a view 
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that may be related to beliefs that regular medication use can lead 
to dependence or accumulation within the body and corresponding 
long-term effects [51, 53]. Concerns also relate to the meaning that 
being on regular medication has for the individual and their sense 
of self or identity. Taking a daily treatment may be an unwelcome 
reminder of their illness, which may have a negative impact on how 
they view themselves or perceive how they are seen by others. This 
can be further exacerbated by the stigma associated with taking 
mental health medication itself [22]. In these circumstances, non- 
adherence might be seen as an implicit strategy to minimize the im- 
pact on their sense of self [62]. 

These necessity beliefs and concerns can influence adherence 
separately and in combination, and the effects may be through ex- 
plicit and implicit processes. For example, in some situations, non- 
adherence could be part of a deliberate strategy to minimize harm by 
taking less medication. Alternatively, it might simply reflect the fact 
that patients who do not perceive their medication to be important 
are more likely to forget to take it. The impact of perceptions of treat- 
ment on adherence may also influenced by beliefs about adherence 
behaviour itself such as whether or not strict adherence to medica- 
tion is needed to achieve the desired outcome. 


Practicalities: enhancing capability and opportunity 
for adherence 


Beyond addressing patient perceptions is the need to address factors 
that determine a patient's ability to adhere. Forgetting is the most 
commonly reported practical reason for medication non-adherence 
[63]. This may be due to the cognitive effects associated with mental 
health disorders, as well as a lack of routine and an erratic lifestyle 
[21]. Reminder systems or medication organizers, such as pill boxes, 
may be useful, though reported effects are typically modest [64]. 
Linking medication-taking to specific environmental cues may be 
more effective than a repeated reminder to help reinforce habits and 
routine. For example, placing the medication near the toothbrush, 
so that taking the medication becomes linked to an existing habit, 
may be useful. However, this is susceptible to changes in the envir- 
onment or routine such as going on holiday [65]. 

Linking medication-taking to specific environmental cues may 
also be useful [66]. This involves planning with the patient how and 
when they are able to take their medication. Turning a patient's in- 
tention to take medication (for example, ‘I will take my medicine’) 
into a more specific plan (for example, ‘I will take my medicine im- 
mediately after I brush my teeth every morning’) increases the like- 
lihood of the behaviour being performed [66]. 

Simplifying the regimen and reducing unnecessary polypharmacy 
is also important. Complex regimens with a high dosing frequency 
or complicated instructions for medication-taking can lead to poor 
adherence [67]. Reducing the dosing frequency to once daily can 
improve the patient's ability to adhere by making the treatment less 
intrusive and more convenient. The use of long-acting injections 
may also promote adherence through this mechanism and reduce 
the chances of forgetting treatment [68], though patients may have 
negative perceptions of long-acting injections as these are frequently 
associated with coercive treatment [69]. Strategies to improve ad- 
herence by changing formulation or dosing are therefore likely to 
be effective only if perceptual barriers to adherence have been ad- 
dressed [43]. Involving patients in treatment decisions is important 
to achieve ongoing adherence—patients who were prescribed at 


least one medication that had been requested by the patient in a 
psychiatric advance directive had higher adherence at 12 months— 
with the odds of adhering being 7.8 times higher—than those who 
did not receive medications as per their advance directive [70]. To 
achieve adherence, the clinician must therefore aim to elicit the 
patient’s perspective about treatment—including their beliefs and 
concerns—and ensure that decisions about treatment are informed 
by fact, rather than misperceptions [17]. Offering a medication 
choice can be an effective method of involving the patient in pre- 
scribing decisions—even as simple as involving the patient in the 
choice of dosage form can be useful [17, 70]. Medication cost and ac- 
cess to health services and medication may be other factors to con- 
sider when addressing practical barriers to adherence [17]. 


So ... you took all your medicines? Methods 





of assessing adherence 


Measuring non-adherence is a complex issue. While self-report is 
the most practical and convenient method in clinical practice to as- 
sess adherence, reports are subjective and often inaccurate. Patients 
often overestimate adherence [71], yet objective measures have their 
own shortcomings (Table 20.1). There is a need to remove the nega- 
tivity surrounding medication non-adherence to encourage honest, 
non-judgemental communication between the patient and the 
health care provider. In clinical practice, ‘detoxifying non-adherence 
and allowing sufficient time in the consultation to discuss barriers 
to treatment are necessary first steps to improve the assessment of 
adherence [17, 46]. This may be facilitated by opening up discus- 
sions about adherence in a non-judgemental way and explaining the 
reasons for the discussion. It is helpful to focus the discussions on a 
specific time period such as ‘in the past week and asking about spe- 
cific medication-taking behaviours such as skipping or changing the 
dose or stopping medication [46]. Patients should be encouraged 
to discuss freely their adherence behaviours and barriers in clinical 
practice—objective adherence measurement will become less of an 
issue when this occurs. Until then, however, there remains great 
interest in measurement, and multiple methods to assess adherence 
exist (Table 20.1). 


PaPA-—a 3-step process towards effective 


adherence interventions: tailoring support 
to individual needs 





Informed adherence in practice 


Shared decision-marking with the patient and informed choice 
should be a key facet of clinical practice and adherence. For inter- 
ventions to be effective, equitable, and efficient, one must facilitate 
informed choice [15]. A patient can be considered to have made an 
informed choice if they can demonstrate knowledge of relevant in- 
formation about the treatment and then act according to their be- 
liefs. This concept of informed choice has been extended to informed 
adherence [72] where evidence-based medicine is used to guide ini- 
tial treatment recommendations. The recommendations should be 
presented to patients in a way that takes account of their individual 
beliefs and preferences, and any incompatibilities between their 
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Table 20.1 Summary of adherence measurement methods 


Direct methods 
Direct methods provide evidence that the patient has actually taken the medication and are therefore the most accurate [77]. 





Detection of the drug or Laboratory testing of drug levels Levels are easily quantifiable Invasive 
metabolite in biological in bodily fluids Can provide data on dose-response Low patient acceptability 
fluids Tests can be costly 


Time-consuming 

Difficult to interpret 

Can only be used if a blood level test 
exists for the medicine 

Can exhibit large inter- and intra- 
patient variability 

Sampling times need to be accurate 

Does not provide information on 
adherence patterns 


Detection of adjunct An additional readily detectable, Levels are easily quantifiable Not relevant to medicines used 
biological markers but inert, stable, and non- Can provide data on dose-response in psychiatry, as no biological 
toxic, substance is added marker exists 
to the ingested medication, Similar disadvantages to detection of 
either directly into the drug drug in biological fluids 


formulation or taken as an 
adjunct to therapy 


Direct observation of Patient is observed while they Provides direct proof of medication ingestion Intrusive 
patient ingestion of take the dose in front of Allows direct interaction with the patient, which Resource-intensive as requires 
treatment another person, either a health can reveal other aspects of their lifestyle and repeated health care provider visits 
care provider or a family environment which affect their adherence Susceptible to patient manipulation 
member (for example, patient can hide 
medication in the mouth and feign 
ingestion) 


Indirect methods 
The majority of adherence measures used are indirect, which measure adherence using patient- or third-party-generated information. These methods are 
unable to determine whether or not actual medication-taking occurred but are generally more acceptable to patients due to their non-invasive nature. 




















Patient self-report Patient reports on their own Simple Lack of objectivity—high risk of 
adherence behaviour through Cost-effective inaccurate reports of adherence 
patient interviews, diary cards, Convenient Often overestimates adherence due 
journals, calendars, surveys, Most common method used in the literature and © patient's desire to be viewed 
or validated adherence in practice positively by others 
questionnaires and rating Does not require any extra planning or resources Risk of recall bias, especially if data 
scales High patient acceptability reported retrospectively 

Easy to use Relies on accuracy of patient 
Can provide detailed information on adherence records/reports 
patterns and patient awareness of behaviour if Dependent on patient's ability and 
obtained from diaries, journals, or interviews willingness to disclose information 
Encourages active patient involvement Patient responses can be affected 
Facilitates provider-patient discussions by how questions are asked and 
relationship with interviewer 

Medication counts Physical count of the number of Commonly used alternative to self-report Lack of detailed information on the 
doses that remain ina patients Simple, economical patterns of usage over time 
medicine bottle, delivery Enables medication use to be monitored without Wide variation in accuracy—patients 
device, or other medication the patient may combine multiple refills in same 
management system, after Been shown to be more accurate than self-report container, use multiple containers, or 
a pre-determined period of and prescription refill data, being aware of the share medication with others 
monitoring, and compares this parameter being measured Difficult to determine the dates of 
with the expected number of Allows some detection of changes in adherence, treatment period 
doses that would remain if the depending on frequency of counts eed for patients to return 
patient had taken the medicine medication containers or canisters 
exactly as prescribed Patients may suspect adherence is 

being monitored 

Prescription refill records Electronic records of prescription Objective data o standard method of data 
claims made by a pharmacy Easily accessible interpretation and analysis 
or a manual process of Readily available in most cases Data availability may be delayed 
prescription review Reduces risk of patients being aware of monitoring Only a proxy measure—dispensed 

Allows large-scale population analyses medicines may not be picked up 

Enables ease of patient follow-up or taken as prescribed 

Can determine trends over prolonged periods Patients may use more than one 
pharmacy 


(continued) 
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Table 20.1 Continued 





Electronic adherence 
monitoring 


Use of electronic monitoring 
devices to record medication 
use through monitoring of 
the opening of medication 
bottles, dispensing of drops, 
or depression of canisters for 
inhaled medication 


Objective 


Reliable 


personal beliefs and the prevailing evidence should be resolved by 
non-judgemental discussion [54]. 

One approach to achieving informed adherence is to consider the 
following ‘PaPA-based approach to adherence support’ in any con- 
sultation about treatment: 


1. Facilitate an honest and open discussion. The discussion should 
aim to normalize non-adherence and allow patients to report 
non-adherence and express doubts and concerns. This allows as- 
sessment of adherence in a non-judgemental way. Effective com- 
munication is important. Factors such as mental state, health 
literacy, language barriers or visual or hearing impairment may 
need to be considered to ensure effective communication. 

2. Communicate the necessity of treatment. In psychiatry, many pa- 
tients do not believe treatment is necessary, as they do not per- 
ceive themselves to be ill. Discuss with the patient what their 
understanding is of the reason for treatment. Explain the condi- 
tion and how the treatment will influence this, considering the 
aims of the treatment and what the patient themselves hope to 
achieve. Focus on how the patient may benefit from the treat- 
ment, taking into account the individual motivations the patient 
may have, which may not be directly related to the illness. 

3. Elicit and address any concerns raised about the treatment. Use 
open-ended questions to encourage patients to discuss and ask 
about their condition and treatment. Find out what the patient 
knows, believes, and understands about their treatment before 
starting or changing a medicine. Often these concerns centre on 
dependency and side effects of weight gain and lethargy and how 
this will affect their daily lives. Discuss and agree a plan of action 
to manage these concerns with the patient. 

4. Minimize any practical barriers to adherence. It is helpful to dis- 
cuss how the patient will fit the medication into their daily rou- 
tine and remember to take the medication. Identify any barriers, 
and agree a plan of action with the patient. 


This approach ensures that both the perceptual barriers (neces- 
sity/concern beliefs) and practical barriers are addressed. Previous 
interventions have had limited effects, partly because either they 
have not addressed all these factors or the intervention has not been 
individualized to the patient. Many have focused on single causal 
factors, whereas adherence is best seen as a complex health behav- 
iour with multiple determinants—both internal and external (see 
Fig. 20.2 for a summary). By using this approach, interventions can 
be tailored to the individual while achieving informed adherence. 


Not reliant on patient self-interpretation 
Accurate—non-biased data 


Less prone to patient deception 

Provides detailed information about 
medication taken 

Allows adherence patterns to be monitored 


Considered as the ‘gold standard’ of adherence 
measurement 


naccurate data recordings or data 
loss can arise due to device 
malfunction 

Devices can be lost or damaged 

ay require patient and practitioner 

training 

o confirmation of ingestion 

Presence of device may change 

patient behaviour 

Costly 

Ethical considerations with 

monitoring of behaviour 

Patients may suspect monitoring 





Practical considerations in intervention design 


When designing and implementing adherence interventions in 
practice, three dimensions of the intervention need to be optimized 
for success. This can be remembered as the “3 components to be- 
haviour change’ or ‘3CBC’—content, channel (delivery vehicle), and 
context. 


Content 


This is the basic substance of the intervention and how the spe- 
cific barriers and enablers of adherence are addressed. Approaches 
should be tailored to address both the perceptual factors influencing 
motivation to initiate and persist with treatment, as well as facilitate 
the ability to adhere, for example by addressing any capacity and 
resource limitations. The PaPA model described is one method that 
can be used to ensure all aspects of adherence are addressed. 


Channel 


Adherence support should occur, not just at the start of treatment, 
but also during treatment review, as perceptions, abilities, and ad- 
herence can change. For psychiatric disorders, support should ex- 
tend beyond the prescribing consultation to ongoing medication 
counselling and review. The increasing use of e-technology (such as 
smartphone apps) offers the prospect of additional channels to com- 
plement practitioner-delivered support [73]. However, despite the 
plethora of technology and digital solutions available, there is, as yet, 
little evidence for their efficacy. A recent systematic review found 
mobile and electronic interventions to be feasible and acceptable in 
patients with serious mental illness; however, effects on adherence 
and outcomes are yet to be determined [74]. Applying these prin- 
ciples to develop theory-based content might improve their effect- 
iveness and utility. 


Context 


Context considers how appropriate prescribing and adherence sup- 
port is facilitated by wider contextual factors, such as media repre- 
sentations of treatment and ease of access to treatment. Examples 
of such strategies include allocating appointments to patients with 
minimal delay and ensuring medicines are readily accessible and af- 
fordable [17]. Organisational and service delivery issues may also 
impinge on the opportunity to access treatment and support ser- 
vices [75]. The impact of community treatment orders is another 
contextual factor to consider, particularly as these are widely used in 


many countries yet the benefits of these on adherence and outcomes 
remain controversial [76]. 


Medication non-adherence is an age-old problem that has existed 
since the time of Hippocrates; yet it remains a significant problem 
facing psychiatry today, leading to many lost years of health and 
livelihood. Despite decades of research, advances made have had 
limited effect on addressing this issue. The new approach taken in 
the last few decades where medication adherence is understood as 
a variable behaviour, rather than a trait characteristic, has shown 
great potential in paving the way towards an effective interven- 
tion. An individualized approach, where each person’s unique be- 
liefs, capability, and motivation are taken into account, should be 
the foundation of every adherence intervention undertaken today. 
Adherence is everybody’s issue—we must endeavour to address this 
in our everyday practice with every patient we meet. Every small 
step we take towards targeting this complex behaviour is one step 
closer to building the missing link between psychiatric treatment 
and outcome. Only then can we begin to see the gains from the med- 
ical advances that our predecessors have achieved many years and 
decades ago. 
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This chapter focuses on two main issues: (1) the core dimensions 
that are part of systems of classification that define what is meant 
when a person is said to have an intellectual (learning) disability, 
and how these systems have developed with time; and (2) the be- 
havioural and mental health needs of people with intellectual dis- 
abilities (ID) and how such needs might be best conceptualized. The 
characterization and classification of a group of people as having ID, 
and including such a classification within the taxonomy of mental 
disorder, has not been without controversy. The reasons for this re- 
late to the key elements of any classification system—is it valid, and 
is it reliable? Does it inform and clarify our understanding? To what 
end is it being used? Does it enhance those it so classifies, or does 
it demean and contribute to stigma and marginalization? When it 
comes to illness or disease, accurate diagnosis based on agreed diag- 
nostic criteria is the cornerstone of epidemiological and aetiological 
research, enabling informed treatment development that can then 
be generalized to others with similar illnesses. However, an ID is 
not an illness, and identifying someone as having an ID is not fun- 
damentally a diagnosis because, on its own, the term provides no 
information about the cause, pathophysiology, or likely prognosis. 
Rather ‘intellectual disability’ serves as an umbrella term for a set of 
‘core dimensions’ that describe what is an extremely heterogenous 
group of people. In addition to the ways we are all different, children 
and adults with ID also differ extensively in other ways, such as in 
the cause, nature, and degree of their disabilities and in the pres- 
ence or absence of secondary sensory, physical, and/or psychiatric 
comorbidities and/or a pattern of early developmental character- 
istics of the autistic spectrum disorder. The assessment of children 
and adults with ID in clinical practice requires that these various 
strands are integrated into a coherent and comprehensive formula- 
tion, which, in turn, informs intervention. 











The initial development of modern systems of classification for 
‘mental retardation, as it was known then, was reviewed in a spe- 
cial edition of the American Journal of Psychiatry in 1972 [1]. The 
systems referred to used varied terms, and their use and some of 


e dimensions of intellectual disabilities 





the actions that resulted from such labelling would certainly now 
be seen as unacceptable. However, with time, thinking has devel- 
oped, and the core dimensions have been reshaped and expanded 
to better reflect the complexity and interactive nature of the concept 
of ‘intellectual disability, together with a better understanding of 
the reasons for, and potential pitfalls of, any system of classifica- 
tion. In the 1900s, in the UK, the characterization and classification 
of people with ID was considered to be necessary to facilitate the 
segregation of people deemed to be harmful to the population as 
a whole, or even a major source of criminality, by virtue of their 
abnormal genetic endowment. The eugenics movement took these 
ideas to be axiomatic, and they persisted well into the twentieth 
century. The work of Jack Tizard and others transformed thinking 
about how people with ID should be supported, leading to the ac- 
ceptance that people with ID should be, and could be, supported 
outside of institutions. The 1960s saw the emergence of the con- 
cepts of normalization and social role valorization as the principles 
that should guide the support of people with ID. Within Europe, 
North America, and other high-resource countries, these chan- 
ging approaches led to developments in legislation and in policy 
and practice. With the development of new ways of conceptualizing 
ID, there was a move away from seeing the nature and extent of the 
disability as exclusively based in the individual, rather than seeing 
a person's disability as a consequence of interactions between some 
innate limitations, the influence of the past and present social and 
family environments, and importantly educational and life oppor- 
tunities. Additionally, the nature and extent to which society itself 
accepted and responded to people with disabilities could ameliorate 
the disadvantages experienced by those affected. 

These social and attitudinal changes occurred during a period 
in which there were also major advances in areas such as genetics 
and neurosciences, and in understanding the major environmental 
causes of ID. The work, for example, of Lionel Penrose and the pub- 
lication in 1938 of his book the Biology of Mental Defect identified 
the causes of ID [2] (Penrose 1949). The normal human chromo- 
some complement of 23 pairs of chromosomes was identified in 
the 1950s, and trisomy 21, as the cause of Down's syndrome, was 
identified shortly afterwards. The sequencing of the human genome 
was undertaken in the late 1900s and early twenty-first century, 
and many abnormalities of chromosome number (aneuploidies), 
chromosomal rearrangements, DNA copy number variations, and 
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single-gene mutations have been described in association with the 
presence of developmental delay and ID [3]. 

The recognition of specific environmental causes of ID, as diverse 
as maternal iodine deficiency, fetal alcohol syndrome, or rubella 
embryopathy, has all had very clear public health implications in 
terms of primary prevention. In LMICs, it is recognized that varied 
environmental and social factors impact on the early cognitive and 
socio-emotional development of children [4]. These different issues 
are primarily the province of public health, child health, and mater- 
nity services. However, in addition, as findings from studies in the 
1970s onwards identified that children and adults with ID have high 
rates of behavioural problems and mental ill health [5, 6], so the need 
to address such issues became increasingly apparent. It is primarily 
for this reason that there is a focus on ID in a textbook of psych- 
iatry. Given research evidence about such comorbidities and with 
the earlier changes in attitudes and policy, the role of psychiatrists, 
in particular, has altered beyond recognition from being medical 
superintendents of large long-stay institutions to being members of 
community-based interdisciplinary teams undertaking much more 
focused and nuanced interventions. 


Classification of intellectual disabilities 





Given the heterogeneity of people with ID and the potential com- 
plexity of the health and support needs of people with ID, what is 
clear is that no one system of classification is fully adequate and 
which system is employed, if any, critically depends on the context 
and the reason for its use. If the question is about the nature and type 
of support to be offered, this will be better characterized through 
the lens of a more interactive and dynamic model, in which bar- 
riers to that person’s full participation and inclusion in society are 
the focus of enquiry. However, if the question is about the cause of a 
child’s significant developmental delay, then the focus will be more 
on an accurate description and characterization of his/her unique 
features—the phenotype, in order to identify the cause. The former 
approach helps guide educational and service planning and support 
and is based on what is described as a social model of disability, and 
the latter is based around a biomedical model. In each of these, the 
‘core dimensions’ will overlap, yet differ in terms of emphasis, as 
each system of classification has a different purpose. 

Historically, there has been a tension between the social and bio- 
medical ways of conceptualizing the needs of people with ID, and 
in papers advocating these different perspectives, the language and 
concepts used and the conclusions drawn at times appear irrecon- 
cilable. However, such debate has unnecessarily polarized each per- 
spective, and there is a need for a more nuanced understanding [7]. 
While those from different professional backgrounds may empha- 
size a different perspective, there is necessarily a coming together 
of these perspectives, thereby providing the means for structuring 
our thinking about the needs of people with ID and, in turn, how 
we might respond to these needs. Diagnostic and classifications sys- 
tems, such as the Diagnostic and Statistical Manuals (DSM) of the 
APA, first published in 1952, and now as DSM-V [8], and the WHO’s 
International Classification of Diseases (soon to be ICD-11), have been 
developed and modified over time, and newer systems have been 
developed, which seek to better reflect the nuances required of sys- 
tems of classification. These aim to both characterize the nature and 


extent of a persons impairment and disability on the one hand, and 
on the other, to be structured enough to be used reliably to compare 
findings across countries and over time. These include the American 
Association on Intellectual and Developmental Disabilities’ manual 
Intellectual Disability: Definition, Classification, and Systems of 
Support [9] and the WHO’s Classification of Impairments, Disabilities 
and Handicaps [10] and its successor the WHO's International 
Classification of Functioning, Disability and Health [11]. 

Although, in some areas, the use of systems for classification may be 
subject to criticism, there are at best very positive benefits to an indi- 
vidual being assessed as having an ID. These include: the prospect of 
going on to identify the exact cause of a person's ID; awareness of the 
potential for, and the identification and treatment of, associated health 
problems; and access to specialized educational support and additional 
financial support through the benefits system. It is against this back- 
ground that the defining of what it means to have an ID and the core 
dimensions that are part of such a definition need to be considered. 
For the paediatrician, the question may be why this child is develop- 
mentally delayed. For the geneticist advising the parents, the questions 
are: is there a genetic cause, and what is the implication in terms of 
recurrence risk? For teachers and educational psychologists: what is 
the nature of the child’s intellectual impairment, and how and in what 
way can education enhance the child’s development and acquisition 
of skills? For public health practitioners: what is the extent and nature 
of need at a population level, and are there causes that are potentially 
preventable? For the psychiatrist, clinical psychologists, and commu- 
nity nurses: what is the relationship between the nature and extent of 
any impairments and disabilities and the emotional and behavioural 
difficulties that may have brought them in contact with specialist child, 
adolescent, or adult mental health services? Classification should have 
a defined purpose, and because the reasons may differ, so then the as- 
sessments and classification systems used will vary. 

The diagnostic criteria are set out in DSM-V [8], Chapter V of ICD- 
10 [12], and the American Association of Intellectual Disabilities’ 
Intellectual Disability; Definition, Classification and Systems of 
Support (AAIDD-11) [9]. There are three essential components to 
all these definitions: (1) evidence of significant intellectual impair- 
ment assessed using an established and valid assessment and nor- 
mally considered to be present when there is a score of less than two 
standard deviations below the mean; (2) evidence of significant im- 
pairment in adaptive functioning, given the age and cultural back- 
ground of the person concerned; and (3) these features having their 
origins in childhood, with delays in, or an inability to reach, specific 
well-recognized developmental milestones and educational attain- 
ments, usually with evidence of delay in the first 5 years of life. The 
exact nature, extent, and severity of the delay will vary across people 
considered to have an ID but may involve delay in one or more of the 
following areas: gross motor, sensorimotor, language, and adaptive 
behaviours. Revisions of the standard classification systems have 
led to a change in terminology, and ‘intellectual developmental dis- 
order’ (IDD) is the term now used. 


Intellectual developmental disorder (DSM and ICD 





systems) (American Psychiatric Association, 2013) 


The structure of the classification within DSM-V has been shaped 
around developmental and lifespan considerations and within a 


cultural context that recognizes the dimensional nature of psychi- 
atric disorder. It encompasses how factors within an individual's en- 
vironment in which a person lives influence whether a particular 
symptom has functional significance or not, thereby moving beyond 
a simply diagnosis. In DSM-V, IDD is included in a section headed 
‘Neurodevelopmental disorders, which also includes communica- 
tion disorders, autism spectrum disorders (without distinguishing 
between Asperger’s syndrome and autism), attention-deficit/hyper- 
activity disorder (ADHD), specific learning disorders, motor dis- 
orders, and other neurodevelopmental disorders. The focus is not 
primarily one of aetiology, but rather of quantifying the extent of 
disability by defining the level of intellectual impairment and listing 
the range of possible adaptive functions that might be impaired. 
The definition makes it explicit that the onset is within the devel- 
opmental period and that IDD is the final common pathway of a 
number of potential aetiologies. Significant sub-average intellectual 
function is defined as an intelligence quotient (IQ) of 70 or below 
(using standard IQ tests). IQ is also used to help determine the level 
of ID (mild, moderate, severe, or profound). Adaptive functioning 
has to be measured against what would be expected for a person of 
that age, and the social and cultural experience of the person has to 
be taken into account. The Wechsler Scales for IQ and the Vineland 
Adaptive Behaviour Scales or the revised Adaptive Behaviour Scales 
of the American Association for Mental Retardation for character- 
izing functioning are established instruments for the measurement 
of these abilities for which there are normative data for comparison. 

Bertelli et al. (2016) [13] have reported on development work 
undertaken to inform ICD-11, specifically whether or not ID should 
be classified as a health condition. They also proposed the use of 
the term ‘intellectual developmental disorder, defined as ‘a group 
of developmental conditions characterized by a significant impair- 
ment of cognitive functions, which are associated with limitations 
of learning, adaptive behaviour and skills. They also suggested that 
the primary category, replacing that of ‘Mental retardation, would be 
‘Neurodevelopmental disorders. IDD would be a subset of this pri- 
mary category and encompass ‘a broad grouping of heterogeneous 
developmental conditions which result from significant interference 
with the growth and maturation of the brain during its early devel- 
opmental phases including the prenatal and perinatal periods, in- 
fancy, childhood, and extending into adolescence’ 

These developments in classification systems still use frameworks 
developed for the classification of illness and are thus unsatisfactory 
when it comes to ID. However, they do have value in bringing con- 
sistency and a degree of rigour to the classification process. Thus, 
it can be reasonably assumed when the classifications are properly 
used that there is a degree of reliability to the conclusion that an 
individual has an ID, and it will not have been based simply on ap- 
pearance or educational abilities, but rather will have taken into 
account evidence for a delayed and atypical pattern of development 
and the continuing presence of intellectual and functional impair- 
ments. Depending on circumstances, the next question might well 
be whether there is a single major cause for such developmental 
delay (genetic or environmental) or whether that is unlikely and a 
combination of factors has contributed to a person's atypical devel- 
opmental history. Given that these approaches have their limitations 
in terms of characterizing the possible multiple factors that may re- 
sult in disability, other means of classification with more extensive 
core dimensions have been proposed. 


CHAPTER 21 Core dimensions of intellectual disabilities 


Impairments, disabilities, and handicaps 


(World Health Organization, 1980) 





In 1980, the WHO proposed a system of classification that attempted 
to overcome the limitations of other earlier methods of classification 
and, most importantly, aimed to guide intervention. Box 21.1 sum- 
marizes the terms. In this system, ID can be conceptualized at dif- 
ferent levels. At the level of ‘impairment; in the case of ID, the organ 
system involved is the CNS. It is the impairment of this system for 
genetic, chromosomal, or environmental reasons that primarily af- 
fect the acquisition of developmentally related skills and the ability 
to learn. For the paediatrician or geneticist, a key task is to identify 
the reasons for any abnormality of brain development, and therefore 
intellectual impairment. This may have treatment implications, may 
guide prognosis, and most importantly may help to make sense of 
the disability for the parents of those affected. It may also have im- 
portant implications for genetic counselling. 


Organization's definitions 
ties, and handicaps 


of impairments, disab 





Impairment 

e Is any loss or abnormality of psychological, physiological, or anatom- 
ical structure or function. 

e Represents deviation from some norm in the individual's biomedical 
status. 
e Is characterized by losses or abnormalities that may be temporary or 
permanent. 

e Includes the existence or occurrence of an anomaly, defect, or loss 

in a limb, organ, tissue, or other structure of the body, or a defect in a 
functional system or mechanism of the body, including the systems 
of mental functioning. 

e Is not contingent upon aetiology. 





Disability 

e Is any restriction or lack (resulting from impairment) of ability to per- 
form an activity in the manner or within the range considered normal 
for a human being. 
e Is concerned with compound or integrated activities expected of the 
person or of the body as a whole such as represented by tasks, skills, 
and behaviours. 
e Is the excesses or deficiencies of customarily expected activities and 
behaviour, which may be temporary or permanent, reversible or irre- 
versible, and progressive or regressive. 
e Is the process through which a functional limitation expresses itself as 
a reality in everyday life. 














Handicap 
e Is a disadvantage for a given individual, resulting from an impairment 
or a disability that limits or prevents the fulfilment of a role that is 
normal for that individual. 
e Places some value upon this departure from a structural, functional, 
or performance norm by the individual or his or her peers in the 
context of their culture. 
e Is relative to other people and represents discordance between the 
individual's performance or status and the expectations of his or her 
social/cultural group. 
e Is asocial phenomenon, representing the social and environmental 
consequences for the individual stemming from his or her impair- 
ment and disability. 


Source: data from World Health Organisation, Definitions of Impairments, Disabilities 
and Handicaps, Copyright (1980), World Health Organisation. 
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The associated disability is the effect of the impairment on a 
person's ability to learn and acquire the new skills that come with 
development. These, in turn, enable the acquisition of increasingly 
advanced skills necessary for an independent life. The exact nature 
and extent of the disability may not only include ID, but also phys- 
ical and sensory disabilities. The extent to which a given impairment 
results in a loss of function (disability) may well be influenced by the 
extent and nature of interventions such as special education or, for 
example, the correction of hearing loss through treatment and/or 
the use of a hearing aid. 

The final level—that of ‘handicap’—is a result of an interaction be- 
tween the disability and the extent to which support is available or 
environmental adjustments made at a societal level. It is a measure of 
disadvantage. The extent of disadvantage can be ameliorated through, 
for example, the presence of support to enable individuals to go out, 
or environmental modifications (for example, wheelchair ramps) that 
diminish the impact of physical disabilities. Such interventions or en- 
vironmental modifications maximize independence and thereby re- 
duce disadvantage by ensuring that the impact of any given disability 
on an individuals independence and quality of life is minimized. 
Shakespeare (2006) [14] has argued that such a structure as this model 
helps to bring together the biomedical and social models of disability. 


Functioning, disability, and health (ICF) (World 





Health Organization, 2001) 


This new system of classification to replace the 1980 version was 
published by the WHO in 2001. The focus switched from a system 


that was seen as just characterizing the negative to a system of clas- 
sification that also emphasized the positive—what someone is able 
to do, rather than just what he/she could not do. While the context 
in this chapter is about people with ID, this system of health clas- 
sification was developed by the WHO to enable the characteriza- 
tion of ‘health domains’ across the whole population and therefore 
should be seen to have universal application. Part 1 is the means 
whereby body functions, structures, and activities and participation 
can be characterized. Part 2 relates to ‘contextual factors, whether in 
the environment or the individual. Fig. 21.1 illustrates the relation- 
ship between the different components of this classification system. 
The authors emphasize that it enables a multi-perspective approach 
and that it can be used by different disciplines as ‘building blocks 
for users who wish to create models and study different aspects of 
this process. Part 1 is divided into organ systems. With respect to 
people with ID, the sections on mental functions and structures of 
the nervous system may be of particular relevance, but these may be 
compounded by secondary disabilities, consequent upon abnormal- 
ities in structure and function of other organ systems such as those 
associated with sensory impairments. All of these categories are ex- 
tensively subdivided. In Part 2, the focus is very much on the specific 
personal circumstances and the characteristics of the individual en- 
vironment, including the health system and support available. 

This system seeks to do two things. Firstly, it aims to provide a re- 
liable structure for the description of the complex effects of ill health 
and those factors that might moderate its impact, thereby enab- 
ling accurate comparisons across countries, between cultures, and 
throughout the lifespan. Secondly, it aims to provide a more com- 
prehensive framework to aid intervention, one that moves beyond 
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Fig. 21.1 The World Health Organization's International Classification of Functioning, Disability, and Health model of intellectual disability (WHO ICF 


model). 


Adapted from World Health Organisation, International Classification of Functioning, Disability and Health, Copyright (2001), with permission from World Health Organisation. 


the single word or brief phrase of classification systems, such as in 
the ICD and DSM, to a more structured and meaningful description 
of an individual's strengths and difficulties. In this respect, it seeks to 
incorporate aspects of the biomedical and social models of disability 
and an understanding of the person within the context of his or her 
support network and culture. 

This more dynamic and interactive concept of what it means to 
have an ID and what might contribute to the extent of social inclu- 
sions and well-being of that individual is illustrated in the American 
Association on Intellectual and Developmental Disabilities model of 
disability (or the AAIDD-11 model). The emphasis is on the mod- 
erating effects of individual past and present circumstances and the 
wider national policy that seeks to ensure that the ‘mismatch be- 
tween competency and environmental demands’ is minimized and 
people with ID are fully included in society. The way in which these 
formulations influence the approach to treatment is described at 
greater length in Chapter 35. 





Prevalence and aetiology of intellectual disabilities 


As argued, the classification systems described in this chapter are 
not diagnostic. In addition, these different ways of conceptualizing 
ID emphasize that the paths to functional impairments are not 
simply to do with biology; there are interactions with respect to the 
extent and nature of the statutory provision of education, opportun- 
ities in adult life, and societal attitudes, all of which impact on ability 
and quality of life. Particularly in early life, the biomedical perspec- 
tive is fundamentally about understanding the reasons for identi- 
fied problems at birth or in childhood, and in this regard, there have 
been significant advances. Furthermore, the cause of a person’s ID 
may also inform understanding of behaviour problems and mental 
ill health, as they affect people with ID and it has become increas- 
ingly recognized that this may be an important aspect to psychiatric 
assessment. 

Two broadly distinct groups were identified through population- 
based studies such as the Aberdeen children’s cohort study of the 
1950s [15]. Firstly. there are children who have a definite or likely 
genetic abnormality or specific environmental causes with major 
effects on subsequent development. Secondly, there are those for 
whom there is no obvious single major cause for an early develop- 
mental delay and subsequent ID. Here, the small effects of many 
genes combined with social disadvantage may be crucial. The broad 
difference between these two groups are summarized in Box 21.2. 
This ‘two group’ perspective also illustrates the difficulty in arriving 
at a true prevalence of ID in any given community, given the fact 
that more than an IQ of below 70 is required when determining if 
someone should be considered to have an ID. The distribution of 
IQ in the general population is near normal, with a skew to the left, 
given the presence of specific neurodevelopmental disorders that are 
generally associated with a downward shift in IQ. Given this, a figure 
of 2-2.5% is estimated as the proportion of the population with an 
IQ of below 70. The problem then is to know how many of this group 
truly should be considered to have an ID. To address this issue, 
there have been studies of the administrative prevalence of people 
with ID, that is the percentage of any given geographic population 
known to ID services. Figures of between 0.4% to just under 1% are 
arrived at, depending on a number of factors, including whether 
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Box 21.2 Differences between biologically determined and 


subcultural ID 





Biological 

e Moderate/severe impairment. 

e Significant impairment in adaptive functioning. 

e Equal distribution across families of different socio-economic status. 
e Parents and siblings usually of normal intelligence. 

e Dysmorphic characteristics common. 

e Other impairments and disabilities common. 

e Neglect unusual. 


Subcultural 





ild or borderline impairment. 

e Minor or no impairment in adaptive functioning. 

e More common in families of lower socio-economic status. 
e Intellectual ability impaired in family members. 

e Dysmorphic characteristics unlikely. 

e Other impairments and disabilities unusual. 

e Neglect more common. 














active screening for people with ID in the population was attempted, 
whether it was the number of people already known to ID services, 
whether children and adults were included, and specific geographic 
factors such as levels of deprivation. Studies indicate that there is 
a peak in prevalence in childhood and that there may be signifi- 
cant regional variations in any given country. Examples of studies 
from different countries include that of Sondenaa et al. (2010) [16] 
in Norway; McConkey et al. (2006) [17] in the island of Ireland; 
Larson et al. (2001) [18] in the United States; and Wen (1997) [19] 
in Australia. In an important study in the United States, Fujiura 
(2003) [20] estimated that there were a further 1.27% of people with 
mild ID who had very substantial needs and were effectively falling 
through the net of ID services. A detailed consideration of the epi- 
demiology of ID is presented in Chapter 22. 

The striking feature about the ID population is its heterogeneity, 
both in terms of the nature and severity of disability and also in the 
presence, or not, of individuals whose ID is due to one of many pos- 
sible single major causes. Making up this population of people, there 
are those with specific environmental effects, such as fetal alcohol 
syndrome, very low birthweight, congenital infections, and maternal 
iodine deficiency (worldwide, of major significance—see [21]), and 
those with chromosomal and single-gene disorders which either 
arise de novo or are inherited. Down’s syndrome due to trisomy 21 
and fragile X syndrome due to X-linked FMR-1 mutations are two 
examples that are relatively common. Other neurodevelopmental 
syndromes due to copy number variations have been identified [22], 
and advances in genetics using microarray technologies (some- 
times referred to as molecular karyotyping) readily enable the 
identification of copy number variants of various sizes, including 
chromosomal rearrangements, deletions, and duplications [23]. 
Some of these are indicative of well-recognized chromosomal de- 
letion neurodevelopmental syndromes (for example, Williams 
syndrome, cri-du-chat, etc.). Others with previously unrecognized 
chromosomal abnormalities are still to be properly characterized 
by phenotypy. Whether such copy number variants should be con- 
sidered to be pathogenic or a normal polymorphic variation needs 
to be judged using established databases such as DECIPHER [24]. 
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With improvements in DNA sequencing technology, identification 
of specific gene mutations is now more readily available, for example, 
detecting de novo or inherited gene mutations such as in the TSC 1 
and TSC 2 genes that result in the tuberous sclerosis complex. These 
technologies make possible the characterization and subsequent 
identification of an increasing number of neurodevelopmental syn- 
dromes of genetic origin associated with ID. The specific causes of 
ID will be considered in more detail in Chapter 23. 

The assessment of a child or an adult with ID referred to child and 
adolescent mental health services or to adult mental health or spe- 
cialist services because of problem behaviour, and the subsequent 
formulation, will include any identified major cause of a person’s ID 
such as Down's syndrome. As identification of genetic and environ- 
mental causes of ID develops and, with it, our understanding of their 
specific impact on brain development and functioning, so interven- 
tions for mental health and behaviour problems across the lifespan 
will only become more nuanced and hopefully more effective. 


Core dimensions of disorders of behaviour 


and mental ill health in children and adults 
with intellectual disabilities 





For health professionals working in mental health or specialist ser- 
vices for children, adolescents, or adults with ID, the primary focus 
of their work will be on prevention and treatment of secondary 
disabilities that may arise in the form of challenging behaviours or 
mental ill health. In people with ID, the development of challenging 
behaviour may be for a number of reasons, including: being an in- 
dication of the development of a comorbid physical or mental dis- 
order, for example, mania presenting with irritability and aggression; 
changes in the environment and/or in the support provided; or the 
onset of a physical illness. In contrast, behaviours, such as self-injury 
or aggression, if long-standing, may be best understood from a de- 
velopmental or behavioural perspective. In addition, the cause of a 
person’s ID may also inform our understanding of the reasons for 
particular behavioural or mental health problems that occur. Well- 
known examples include the excessive eating behaviour and risk 
of psychopathology associated with Prader-Willi syndrome [25], 
the high rates of Alzheimer’s disease affecting people with Down’s 
syndrome [26], anxiety disorders affecting people with Williams 
syndrome [27], and severe self-injurious behaviour in people with 
Lesch-Nyhan and Smith-Magenis syndromes [28]. This observa- 
tion of an association between specific neurodevelopmental syn- 
dromes and a characteristic behaviour was described by Nyhan as 
‘the behavioural phenotype of organic genetic disease’ [29]. Such ob- 
servations have very significantly altered our understanding of the 
aetiology and pathophysiology of such behaviours and psychiatric 
disorders. 

Conceptually, comparative studies across different neuro- 
developmental syndromes have challenged the orthodoxy of ap- 
plied behavioural analysis and have required the development of 
more complex models, which recognize the role of the syndrome- 
specific developmental profiles and brain mechanisms in the aeti- 
ology of syndrome-specific behaviours (see, for example, [30] 
and [31] on Williams and fragile X syndromes, respectively; and 
[32]). Oliver and colleagues, for example, have reported very dif- 
ferent behavioural and developmental profiles across different 


neurodevelopmental syndromes, and they have gone on to propose 
specific models to account for their occurrence and maintenance 
[33]. Similarly, studies on Prader-Willi syndrome have proposed 
specific mechanisms to account for the different components of the 
behavioural phenotype [25] and the high rates of psychotic illness 
reported. The latter develops in early adult life but primarily affect 
those with the chromosome 15 maternal uniparental disomy (UPD) 
subtype [34]. These observations indicate the importance not only of 
developmental and bio-psychosocial perspectives, but also of having 
an understanding of the impact of particular patterns of brain de- 
velopment on behaviour and the risk of comorbid mental disorders. 
This diverse information informs intervention. 

The task of assessment under such circumstances is the bringing 
together of what is known about the person and his/her behaviour 
with models of understanding that have been informed by research. 
This process of assessment and formulation is illustrated in Fig. 21.2 
(see also [35] for discussion). 

Core dimensions to the understanding of challenging behav- 
iours therefore include establishing how best to conceptualize the 
problem that has led to the referral (for example, aggressive behav- 
iour, self-injury, apparent loss of skills, etc.). The conceptual models 
include: (1) that of applied behavioural analysis, with behaviours 
being identified as having a cause and a function (such as demand 
avoidant or attention maintained) and being shaped by various con- 
tingencies and occurring in the context of certain internal or ex- 
ternal setting conditions (for an overview, see [36] O’Reilly et al, 
2016); (2) the developmental model, whereby such behaviours (for 
example, repetitive behaviours) are seen as normally a self-limiting 
characteristic of typical development and their continuation into 
later life as a consequence of delayed or an atypical developmental 
trajectory and/or such behaviours may be syndrome-specific (a 
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Fig. 21.2 Developing single or mixed models of understanding that 
inform intervention. 
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Table 21.1 Assessment of behavioural and comorbid psychiatric disorders in people with ID 


Through history and observation, the 
identification of specific ‘functions’ for behaviours 
and the predisposing, precipitating, and 
maintaining factors and internal and external 


Applied Behavioural Analysis (ABA)—also referred to 
as ‘functional analysis’, explains the presence of adaptive 
and challenging behaviours in terms of learning theory 

and the presence of specific contingencies that increase 


the probability of such behaviours setting conditions 


Developmental—specific patterns of behaviour are 
explained as a manifestation of atypical and/or delayed 
development or as a direct consequence of the specific 
cause of that person's ID (the presence of a ‘behavioural 
phenotype’) 


Biomedical—a diagnostic approach that seeks to 
account for challenging behaviour or cognitive 
abnormalities or abnormalities of mental state on 

the basis of the co-occurrence of an additional 
developmental (for example, autism spectrum disorder, 
ADHD) or acquired comorbidity (for example, affective 
disorder, dementia, or physical illness) 





referra 


behavioural phenotype); and (3) the possibility that such prob- 
lems may be integral to, or a consequence of, the development of 
comorbid physical or psychiatric illness. These different perspec- 
tives and their implications are summarized in Table 21.1. They are 
not necessarily distinct. One example may be a person who has a 
history of a particular pattern of development characteristic of an 
autistic spectrum disorder and who finds change difficult to tolerate 
and anxiety-provoking, who also develops a comorbid illness that 
additionally impacts on behaviour. 

In this context, the assessment of a person with an ID who has 
exhibited particular problematic behaviours (often referred to as 
challenging behaviour) is to provide an understanding of such be- 
haviours in order to inform intervention. The core components of 
any assessment will include history-taking from the patient and/or 
an informant, a mental state examination and, where appropriate, 
a physical examination, investigations, and often structured obser- 
vation over time. The core dimensions will include not only infor- 
mation on the nature, extent, and causation of the person’s ID, as 
described earlier, but also information on the onset and course of 
the index problem, a diagnostic assessment for the presence, or not, 
of comorbid mental or physical illness, and observations to indicate 
whether or not specific factors might precipitate, predispose to, or 
maintain such behaviours. 

The history is crucial—whether some particular behaviour (for 
example, aggression) is long-standing or new and of recent onset 
is likely to lead to very different explanations. If long-standing, it 
may relate more directly to the nature of the person’s develop- 
mental disability, including emotional dysregulation [37] and en- 
vironmental triggers. If the behaviour is new and of recent origin, 
then the possibility that it is a manifestation of the development 
of a secondary comorbid physical or psychiatric disorder is more 
likely. The diagnosis of comorbid psychiatric disorders, such as 
ADHD, severe anxiety, psychotic illness, affective disorder, or de- 
mentia, can be problematic where there is limited language devel- 
opment, and information from informants may be crucial. Specific 
modified diagnostic criteria have been developed, such as the DC- 
LD [38] and DM-ID [39] (Fletcher et al 2016), to inform the diag- 
nostic process. These modified diagnostic criteria seek to balance 


Through history-taking and observations, the 
index behaviours are identified as similar to 
those that are usually self-limiting and occur 
as part of normal development or have been 
shown to occur in excess in those with a specific 
neurodevelopmental disorder 


Through history-taking and additional 
assessments and investigations, the identification 
of established signs and symptoms of comorbid 
psychiatric or physical disorders that the 

history indicates are likely to account for the 
development of the index problem that led to 





The development of alternate strategies of support 
seeking to reinforce positive behaviours and 
develop adaptive responses to triggering events, 
and modifying predisposing and maintaining 
factors and setting conditions 


The development of specific interventions 
informed by other models that seek to reduce the 
severity and frequency of such behaviours and/ 
or interventions that are syndrome-specific and 
informed by knowledge of that syndrome 


The treatment of the specific developmental or 
acquired comorbid disorder using interventions 
known to be effective, including, for example, 
established psychological or pharmacological 
interventions 











the tension between making them more applicable to the assessment 
of a population of people with impaired language development on 
the one hand, and on the other, ensuring that such modified cri- 
teria are still valid in terms of the condition they define. While for 
affective disorders, it may be possible to infer a depressed or manic 
state on the basis of observation or informant reports (tearfulness, 
agitation, over-activity, etc.), this is more problematic when it comes 
to the presence, or not, of thought disorder, hallucinations, or delu- 
sions. Illnesses such as schizophrenia are manifestations of disorders 
of brain mechanisms that underpin thought and perception, and if 
these have not fully developed, it remains uncertain whether they 
can be further impaired. 

Based on the earlier work of Sovner, Hurley et al. (2016) [35] 
have summarized the specific problems that may lead to diag- 
nostic uncertainty when assessing a person with an ID. These in- 
clude: the baseline exaggeration of symptoms (such as self-injury) 
in times of stress; intellectual distortion in terms of the person’s 
understanding of the questions being asked about his/her mental 
state and an inability to appreciate the implications of the question; 
psychosocial masking whereby the symptomatology that occurs 
may be best understood within a developmental framework but 
is misconstrued as a manifestation of mental illness; and cognitive 
disintegration whereby a person with an ID may become grossly 
disorganized and apparently psychotic due to a lack of cognitive 
reserve, resulting in a more severe response to the development of 
a mental illness. 

Being aware of these problems and taking a detailed develop- 
mental and longitudinal history in order to establish firstly a devel- 
opmental diagnosis (ID, autistic spectrum disorder, ADHD) and 
secondly the presence, or not, of a comorbid diagnosis (mood dis- 
order, dementia, etc.) is essential. However, a wider diagnostic ap- 
proach also draws upon other conceptual models of understanding 
such as those of applied behavioural analysis. 

The purpose of assessment in the context of concerns about a 
person’s behaviour is therefore to arrive at an understanding and to 
develop a formulation that sets out how the behaviour or problem 
in question is best understood in terms of these models. This then 
guides the interventions that follow. These may be as varied as 


213 








SECTION 3 Intellectual disabilities 


reducing anxiety by improving understanding and reducing the un- 
predictability of the social environment through the use of visual 
support, the development of strategies for defusing and best man- 
aging challenging behaviour, and the treatment of comorbid phys- 
ical illness or mental ill health. Formulations should acknowledge 
the uncertainties that exist and ensure appropriate observations en- 
able a systematic evaluation of the outcomes of a specific approach. 
Documentation of the assessment and the thinking that led to a par- 
ticular intervention is essential and of great value, as it allows others 
now or in the future to better understand how a person's problems at 
that time were conceptualized. 





Conclusions: the integration of perspectives 


The core dimensions that make up the diagnostic criteria that de- 
fine what is meant by the term ‘intellectual disability’ have been 
set out at the beginning of this chapter. The diagnostic criteria do 
not do justice to the complexity of the issues and have been sup- 
plemented by different ways of thinking that have emphasized the 
multi-dimensional and interactive nature of how we understand 
the link between impairments in brain development, the impact 
on function, and, in turn, the effect on quality of life and well- 
being. It is rarely sufficient simply to state that someone meets the 
criteria for having an ID. There is the need to consider the barriers 
that are limiting choice and opportunity. In recent years, there 
has been a bringing together of social and biomedical models of 
ID, appreciating that each has its place. Health and social support 
needs cannot be separated readily, as each depends on the other. 
With advances in our understanding of the causes of ID, it has also 
become clear that genetically determined neurodevelopmental 
syndromes have their own specific atypical developmental trajec- 
tories that may also include the emergence of particular patterns 
of maladaptive behaviours or an increased risk for developing 
comorbid mental and physical disorders that require treatment in 
their own right. 

While ID falls within the group of disorders known as ‘mental 
disorders, the health and social care needs of the majority of people 
with ID will not be met through child or adult mental health ser- 
vices, but instead in childhood by paediatric, child development, 
and family support services, and in adult life, by social services 
and primary care. However, for those whose mental state or be- 
haviour causes concern and brings them in contact with specialist 
services for people with ID or with mental health services, there 
is an additional layer of assessment. The core dimensions of any 
assessment includes, but is not limited to, only a biomedical diag- 
nostic approach. The application of different theoretical perspec- 
tives leading to a wider diagnostic perspective is required that 
seeks to understand why this problem has affected an individual 
in a specific way at this time. It is this process of assessment, often 
requiring expertise from different disciplines and information not 
only from the person him-/herself, but also his/her family and paid 
support providers and observations over time, which inform inter- 
vention. These interventions are often directly with the family or at 
the interface with social care and, as has been illustrated, may well 
take many forms. 
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Intellectual disabilities are not uncommon. A meta-analysis of 52 
studies reported the prevalence of intellectual disabilities to be 
10.37/1000 population [1]. Prevalence varied according to age, in- 
come group of the country of origin (with higher rates from low- 
income countries), and study design. In high-income countries, 
rates for all ages combined were 9.2/1000, with the highest rates in 
child/young person populations only at 18.3/1000, and the lowest 
rates in adult-only populations at 4.9/1000. Careful understanding 
of the nuances is needed though to interpret these findings. Twenty- 
five of the studies included in the meta-analysis did not provide their 
age range; a further two did not report their observation period, and 
some studies were outliers in their findings. 

An earlier review of studies between 1960 and 1987 is of lesser 
relevance to today’s population, given cohort effects, almost all were 
studies in childhood/youth, and some provided very limited meth- 
odological information [2]. 

Studies of prevalence are, of course, challenging to conduct, as 
they ideally require intelligence to be tested on whole populations 
and therefore would incur considerable resources and costs. Other 
approaches can provide useful information, for example: 
































e Studies of administrative samples, such as people known to local 
authorities, which account for the majority of studies. These are 
the people who are making demands upon services, so the infor- 
mation is useful, but there are likely to be some people with intel- 
lectual disabilities not included in these samples and some people 
who do not have intellectual disabilities within the sample. For ex- 
ample, a rate of 4.3/1,000 aged 16+ was reported from Welsh local 
authorities as being in receipt or in need of intellectual disabilities 
services [3]. 

Studies of people with a record of intellectual disabilities in their 
general practitioner medical records. In high-income countries, 
people are likely to have been assessed once their developmental 
delay was reported, so a record is likely to exist. However, this is 
complicated by the multiple and changing terminology in use over 
time. For example, a rate of 5.4/1000 patients aged 18+ was re- 
ported from an English database of 451 practices [4]. 


e 





e Other secondary analysis of data routinely collected for other 
purposes. These have the attraction that large and whole-country 
samples can be analysed at relatively little costs, but findings reflect 
the original definitions used and the ways the data were collected. 
For example, a rate of 4.9/1000 aged 16+ (self-/proxy report) was 
reported from an analysis of Scotland’s 2011 census [5], and a rate 
of 23/1000 school-aged children/young persons (teacher report) 
was reported from Scotland’s 2015 pupil census [6]. 


Population intelligence approximates to a normal distribution, 
with a mean intelligence quotient of 100 and a standard deviation 
of 15 points. A statistical definition of intellectual impairments is 
an intelligence quotient less than two standard deviations from the 
mean, that is, <70. Simplistically, this would suggest that 2.3% of the 
population have intellectual impairments. However: 


e The greatest deviation from the normal distribution of intelligence 
is at the extreme ends. 

e A test error of 5 points is recognized—and a normal distribution 
would place 2.5% within the range of 70-75, so this can greatly in- 
fluence identified prevalence. 

e There is the Flynn effect (overly high scores due to out-of-date test 
norms). 

e Intellectual disabilities is a social construct, not simply a statistical 
measure of impairments; the definitions of intellectual disabilities 
in the standard classificatory manuals ICD-10 [7] and DSM-5 [8] 
also include the functional requirement of the need for additional 
support. People learn throughout their lives, and so a child with an 
intelligence quotient of 69 will need additional educational sup- 
port at school but, as an adult, may gradually acquire life skills 
to live independently, maintain relationships, hold employment, 
raise children, and not identify with intellectual disabilities nor 
be identified by others as having intellectual disabilities or put de- 
mand on services designed for people with intellectual disabilities. 

e In countries that provide additional educational support for chil- 
dren with intellectual disabilities, there is a clear advantage to 
having the label, and flexibility in its use is beneficial for children 
with abilities that are a little above an intelligence quotient of 70 
(‘borderline intellectual disabilities). 
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e People with intellectual disabilities experience premature death; 
hence, the proportion of the population with intellectual disabil- 
ities progressively falls within older age groups. 

Some children are not identified as having mild intellectual dis- 
abilities until they attend school, so rates are lower in pre-schoolers 
than in school-aged children. 


Cohort effects can influence rates over time, for example, the zika 
virus epidemic in South America in 2015-2016 causing micro- 
cephaly, immunization, iodine, maternal smoking, and alcohol 
use, changes in termination rates of children with Down syn- 
drome (currently falling, but could possibly increase with the 
introduction of first-trimester diagnosis) [9], improved antenatal, 
perinatal, and neonatal health care, increased survival of very 
low-birthweight infants, identification and treatment of meta- 
bolic causes of intellectual disabilities like phenylketonuria, better 
childhood education, access to cardiac surgery for children with 
Down syndrome, improved lifestyles, and access to health care. 

e Migration and clustering (for example, congregate care and col- 
onies) can influence spatial patterning at local levels and can be 
influenced by economic factors and local policy. 

e Prevalence of mild intellectual disabilities is influenced by many 
cultural and societal factors that determine whether a mild 
learning impairment is likely to result in a functional disability, 
contributing to geographic differences. 

Clearly, there is an intellectual gradient across the population, and 

the ‘cut-off’ of 70 is purely arbitrary. Indeed, the accepted ‘cut-off’ 

was changed in 1973, substantially affecting rates. 


Despite these complexities, there are some reasonably consistent 
findings across studies: 


e In high-income countries, about 5/1000 adults have intellectual 
disabilities, falling to about 2/1000 over the age of 65 years. 

e Prevalence is higher in children and young people than in adults. 

e Prevalence is higher in boys/men than girls/women; across the 
whole of Scotland, 58% of people with intellectual disabilities are 
male [5]. 

è Prevalence is higher in low-income countries than in high- and 
middle-income countries for multiple reasons, including life- 
style and health care, although in areas that are less driven by 
technology, having mild intellectual impairments may be less 
disabling. 

e The great majority of people with intellectual impairments have 
mild intellectual impairments (6 mild impairment: 1 moderate 
to profound impairments), and the majority with intellectual im- 
pairments in whom this is disabling have mild, rather than severe, 
intellectual disabilities. 


Causes of intellectual disabilities 





People with intellectual disabilities have some characteristics in 
common such as needing additional educational support at school, 
finding it hard to manage money and bills without help as an adult, 
and having difficulties remembering the temporal sequencing of 
events. Every person with intellectual disabilities is also unique. 
Each child inherits a range of genetic information from both their 
parents which is not shared with other children with intellectual 


disabilities, and their environment and experiences shape their de- 
velopment, interests, likes, and ambitions. So even people with a 
clear genetic cause for their intellectual disabilities, such as Down 
syndrome, are unique from all other persons with Down syndrome, 
while sharing some characteristics. 

Identifying the cause of intellectual disabilities is undertaken by 
paediatricians and clinical geneticists. 


Genetic factors 


Genetic studies indicate intelligence is highly heritable and can itself 
be conceptualized as a spectrum of syndromes [10]. Additionally, 
there are many genetic causes of intellectual disabilities, and recent 
and ongoing studies have found numerous copy number variants 
associated with developmental disorders, with the challenge of iden- 
tifying which are clinically relevant. As described in Chapter 23, 
genetic conditions include: 


e Chromosomal anomalies. Examples are trisomies—Down syn- 
drome (trisomy 21), Patau’s syndrome (trisomy 13), Edwards’ 
syndrome (trisomy 18); autosomal deletions, for example, cri-du- 
chat syndrome (terminal deletion of chromosome 15), Williams 
syndrome (deletion on chromosome 7), and Prader-Willi syn- 
drome and Angelman’s syndrome (deletion on chromosome 15 
or uniparental disomy); and sex-linked conditions, for example, 
fragile X syndrome. 

e Autosomal and sex-linked recessive conditions (particularly 
in communities with high rates of consanguinity). Examples 
are phenylketonuria; homocystinuria; galactosaemia; lipid 
disorders—Tay-Sachs disease, Gaucher disease, Niemann-Pick 
disease; and mucopolysaccharidoses—Hunter’s disease and 
Hurler’s disease. 

e Autosomal and sex-linked dominant conditions. Examples in- 
clude tuberous sclerosis and neurofibromatosis. 


Regarding the prevalence of individual conditions in childhood: 


e 1/800 has Down syndrome. 

e 1/3600 boys and 1/4000-6000 girls have fragile-X syndrome. 
e 1/7500 has Williams syndrome. 

e 1/10,000 has Cornelia de Lange. 

e 1/10,000-25,000 has Prader-Willi syndrome. 

e 1/15,000 has Angelman syndrome. 

e 1/20,000 has Smith-Magenis syndrome. 

e 1/50,000 has cri-du-chat syndrome. 


In addition to causing intellectual disabilities, physical condi- 
tions, and influencing trajectories, these genetic conditions can 
cause a range of other cognitive, behavioural, and mental health 
problems, for which the term ‘behavioural phenotype’ is used. 
Behavioural phenotypes have attracted particular research attention 
and can inform clinical assessments. Examples include high rates 
of dementia in middle-aged and older adults with Down syndrome 
[11, 12]; affective psychosis in Prader-Willi syndrome [13, 14]; self- 
injurious behaviour in Smith-Magenis syndrome [15]; and depres- 
sion in phenylketonuria [16]. While these behaviours/mental ill 
health are genetically driven, it is important to avoid therapeutic ni- 
hilism, as interventions may help the individual. Additionally, some 
genetic syndromes effect physiological differences, which influence 
treatment choices, for example, the low rates of heart disease and low 
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blood pressure in Down syndrome, differing immunology causing 
high rates of thyroid disorders and other immunological disorders, 
and different mechanisms in dementia aetiology (an amyloid 
model), compared to typical Alzheimer disease. The length of this 
chapter precludes a detailed consideration of behavioural pheno- 
types, other than highlighting that clinicians do need to identify if 
their patient/client has a recognized genetic condition, and its asso- 
ciated physiological patterns. 


Antenatal factors 


Antenatal factors include: 


e Teratogenic drugs and toxins. Examples include alcohol. 

e Infections. Examples include toxoplasmosis, rubella, cytomegalo- 
virus, herpes simplex, syphilis, zika virus, and other infections. 

e Fetal growth retardation. Examples include placental dysfunction 
and hypoxia. 

e Endocrine. Examples include iodine deficiency. 


Perinatal factors 


Perinatal factors include: 


e Birth injury, especially in premature and low-birthweight infants. 
e Kernicterus. 
e Infections. 


Postnatal factors 


Postnatal factors include: 


e Infections, encephalitis. 
e Toxins. 

e Brain tumours. 

e Head injury. 

e Starvation. 


Extreme prematurity has been shown to account for 17% of cases 
of intellectual disabilities; together, gestational age and birthweight 
centile have been reported to account for 26.6% of intellectual dis- 
abilities [17]. The month of conception (January-March conception, 
compared with summer conception) has been reported to account 
for 15% of intellectual disabilities, postulated to be related to vitamin 
D or infections at the critical first trimester stage of development 
[18]. Fetal alcohol syndrome is underdiagnosed. 


Mental ill health 


Two recent systematic reviews reported that mental ill health is 
more common in children, young people, and adults with intellec- 
tual disabilities than in the general population [19, 20]. Problem 
behaviours, such as aggressive, self-injurious, and destructive be- 
haviour, are very common in people with intellectual disabilities 
and do not have an obvious comparator in the general population. 
Widely reported prevalence rates have been given in view of dif- 
ferences in populations studied (some from mental health services 
and so their rates are biased and inflated), methods used to identify 
mental ill health, the types of conditions included within the re- 
ported mental ill health (particularly whether or not problem be- 
haviours and/or autism are included), the diagnostic criteria used 


(ICD-10 and DSM-5 under-report mental ill health in this popu- 
lation if strictly applied, especially for problem behaviours), and 
whether studies are reporting point or lifetime prevalences or fail 
to indicate which [21]. 

It is not surprising that mental ill health is more common in people 
with intellectual disabilities, compared with the general population, 
in view of complex biological factors, psychological and social dis- 
advantages, and additional developmental factors [22]. 

Population-based studies in children and young people with in- 
tellectual disabilities reported the prevalence of mental ill health, 
including problem behaviours, ranging from 30% [23, 24] to 50% 
[25]. A robust UK study reported a rate of 36% in 641 children and 
young people (aged 5-16 years) with intellectual disabilities, com- 
pared with 8% of 17,774 children without intellectual disabilities 
in the same surveys: the children and young people with intellec- 
tual disabilities accounted for 14% of all children with mental ill 
health [26]. 

Population-based studies in adults with intellectual disabilities 
reported the prevalence of mental ill health, excluding problem be- 
haviours, ranging from 14.5% (when also excluding ADHD, autism, 
dementia, and personality disorder, people aged 65 and over, and 
people with severe intellectual disabilities [27]) to 43.8% (adults 
with moderate to profound intellectual disabilities only [28]). The 
largest adult population-based prevalence study, in which each 
person was individually assessed, included 1023 adults with intel- 
lectual disabilities [29]. It reported a point prevalence of mental ill 
health of 40.9%, or 28.3% excluding problem behaviours, and used 
more robust methods than previous smaller studies [30, 31]. 

Some types of mental ill health are more common in people with 
intellectual disabilities, including schizophrenia [32, 33], bipolar 
disorder [34], dementia (particularly in adults with Down syn- 
drome), but also in adults with intellectual disabilities of other or 
unknown causes [35, 36], autism [26, 37, 38], ADHD [26], and pica. 
Prevalence rates of mental ill health in children and young people 
with intellectual disabilities are reported to be higher than for other 
children and young people for 27 out of 28 ICD-10 categories [26]. 
Depression and anxiety are common in people with intellectual dis- 
abilities, but probably not more so than in the general population 
[34, 39]. 

Problem behaviours are very common in the population with 
intellectual disabilities. In a large-scale, population-based study of 
adults aged 16+ years, 22.5% were reported to have problem behav- 
iours [29], and of those, 10% had aggressive behaviour [40] and 5% 
had self-injurious behaviour [41]. 

The incidence of mental ill health in adults, excluding problem 
behaviours, has been reported to be 12.6% over a 2-year period— 
8.3% for affective disorders, 1.7% for anxiety disorders, and 1.4% 
for psychotic disorders [42, 33]. The incidence of dementia has also 
been reported for older adults with intellectual disabilities NOT 
due to Down syndrome and found to be considerably higher than 
for the age-matched general population. At the age of 65 years or 
older, the standardized incidence ratio for dementia was 4.98 [43]. 
Regarding problem behaviours, the 2-year incidence of aggression 
was reported as 1.8%, and of self-injury 0.6% [40, 41]. Full remission 
of psychosis after 2 years was only 14.3% [33], aggression 27.7%, 
and self-injury 38.2% [40, 41]. These findings suggest that while 
incidences are higher than those found in the general population, 
much of the current high prevalence of mental ill health is due to 
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enduring disorders, rather than new episodes, though research on 
this is limited in quantity. 

Studies on common types of mental ill health using general popu- 
lation longitudinal cohorts show high and enduring rates of depres- 
sion and anxiety in adults with intellectual disabilities, compared 
with the general population [44-46]. These studies have the attrac- 
tion of drawing direct comparisons with the general population 
but included few people with intellectual disabilities, and of those, 
most had mild learning disabilities and very few severe intellectual 
disabilities, in view of their design and also with differential loss to 
follow-up of people with intellectual disabilities, so findings may be 
biased. 

Longitudinal studies have also reported on the mental ill health 
of children and young people with intellectual disabilities. In 
an Australian cohort study, children and young people aged 4- 
19.5 years were followed over four waves of data collection over 
14 years. High rates of psychopathology levels were reported, with 
hyperactivity more prominent at younger ages and persisting for 
longer in children/young people with more severe degrees of intel- 
lectual disabilities. Emotional disorders emerged later in childhood 
[47-49]. Similar findings have been reported from longitudinal 
studies in children with intellectual disabilities/borderline intellec- 
tual disabilities (excluding those with more severe intellectual dis- 
abilities or with additional sensory or physical disabilities) in the 
Netherlands [50, 51]. 


Neurodevelopmental disorders cluster 


Gillberg coined the term ‘ESSENCE (Early Symptomatic Syndromes 
Eliciting Neurodevelopmental Clinical Examination) to describe 
this [52]. He defined ESSENCE as major problems in: motor skills; 
general development; speech and language; social interaction 
and communication; behaviour; hyperactivity or impulsivity; 
hypoactivity; inattention; and sleep or feeding difficulties. Genetic 
data also increasingly support clustering of neurodevelopmental 
(including epilepsy) and mental health problems [53]. 


Physical ill health, disabilities, and 


multi-morbidity 





Additional physical ill health and disabilities are common in people 
with intellectual disabilities. Indeed multi-morbidity is typical 
for people with intellectual disabilities [4, 54], and hence too is 
polypharmacy. This has implications, as it adds complexity to mental 
and physical health assessments (for example, distinguishing be- 
tween complex partial seizures, depression, and anti-epileptic drug 
side effects), in assessments that are also challenging due to commu- 
nication needs, impairment of understanding, and visual and hearing 
impairments, all of which are common in people with intellectual 
disabilities. This probably contributes to the under-recognition of 
mental ill health that occurs in this population. It also means that 
there are more disease-disease, drug—disease, and drug—drug inter- 
actions to take account of when managing conditions. For example, 
postural problems and deformities (common in people with cere- 
bral palsy and people with profound intellectual disabilities) impact 
upon gastro-oesophageal reflux disease (GORD), which is extremely 
common in people with intellectual disabilities, more so the more 
severe their intellectual disabilities, and can cause anxiety. GORD 


occurs in about 50% of adults with intellectual disabilities; a con- 
sequence is that drugs to manage osteoporosis cannot be tolerated, 
and osteoporosis is common in this population. Many psychotropic 
drugs, commonly prescribed for people with intellectual disabilities, 
lower the seizure threshold, and epilepsy is common. People with in- 
tellectual disabilities may not be able to self-report drug side effects 
and are reliant on others observing these; hence, pharmacovigilance 
is essential. Anticholinergic burden due to polypharmacy of drugs 
with these side effects is an issue for people with intellectual disabil- 
ities, with potential negative side effects such as further impairment 
of cognition [55]. 

Long-term conditions are more common for adults with intel- 
lectual disabilities, compared to the general population. Children, 
young people, and adults with intellectual disabilities have higher 
rates of epilepsy (25% [56]), visual impairment (50%), hearing im- 
pairment (40%), impacted cerumen, GORD (50% [57]), dysphagia 
[58], constipation, diabetes, thyroid dysfunction, osteoporosis, 
contractures, mobility and balance impairments, injuries, eczema, 
xerosis, obesity, and heart failure, compared with the general popu- 
lation [4, 54, 59, 60]. Asthma is also reported to be more common 
in people with intellectual disabilities and may be due to obesity, but 
it is possible that some of this is a misdiagnosis of reflux pneumon- 
itis or aspiration pneumonia. In some cases, the excess physical ill 
health burden relates to the person's underlying cause of intellectual 
disabilities (for example, thyroid dysfunction and Down syndrome), 
but lifestyle and environmental factors and suboptimal support and 
health care are also important contributors. Some problems pre- 
dispose to others. For example, psychotropic drugs (prescribed to 
about 20% of the adult population with intellectual disabilities) [61] 
can increase diabetes risk, as can obesity which is common [62], and 
sedentary lifestyles, also common [63]). 

In view of the shorter life expectancy of people with more se- 
vere intellectual disabilities and those with syndromal causes for 
their intellectual disabilities, older adults with intellectual disabil- 
ities have different characteristics, compared with younger adults. 
Older adults as a group have milder levels of intellectual disabilities 
and lesser quantities of additional physical ill health. The profile of 
their health needs changes, as they have lower rates of the physical ill 
health and disabilities associated with severe intellectual disabilities 
but start to acquire physical ill health associated with ageing. In ex- 
treme old age, the health characteristics of people with intellectual 
disabilities becomes more like those of the general population. 


People with intellectual disabilities do not live as long as other 
people; and life expectancy is shorter, the more severe the person's 
intellectual disabilities. A recent systematic review included 27 
studies and found that although life expectancy has improved in 
recent decades, it is lower, compared with the general population, 
by about 20 years, with no evidence of any closure of the inequality 
gap [64]. More severe intellectual disabilities and/or additional 
comorbidities were associated with the shortest life expectancy. 
Standardized mortality rates showed a greater inequality for 
women than for men, for reasons that are unknown. The main 
causes of death differed from the general population, with respira- 
tory disease the most common, then circulatory diseases (with 
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greater congenital, and lesser ischaemic, disease compared with 
the general population). Cancer was less common, compared 
with the general population, and the cancer profile differed from 
that in the general population. 

Specific syndromes can also shorten life expectancy, including 
death in utero and in infancy and childhood. Life expectancy for 
people with Down syndrome has improved markedly over the last 
50 years, with access to treatments for congenital heart disorders and 
improved surgical techniques and post-operative care accounting 
for much of this [65], but is still reported to be 30 years less than in 
the general population. Down syndrome has been reported to occur 
in 1.2/1000 pregnancies, of which 78.1% are live births. Survival at 
1 year for live births in 1995-1999 was 91.6% [66], and 85% are es- 
timated to survive to 10 years [67]. The proportion of people with 
Down syndrome reduces in older cohorts; 75% survive to 50 years, 
50% to 58.6 years, and 25% to 62.9 years [68]. 

The shorter life expectancy of people with intellectual disabil- 
ities does not just relate to syndromal causes of death and multi- 
morbidity. Some deaths are potentially avoidable, being amenable 
to good-quality care. A confidential inquiry reviewed 247 deaths of 
people with intellectual disabilities, finding that 22% were aged less 
than 50 years [69]. Avoidable deaths from causes that could have 
been amenable to good-quality health care occurred in 37%, com- 
pared with only 13% of the general population [69]. A further large- 
scale study (16,666 people with intellectual disabilities—656 deaths, 
compared with age-, gender-, and practice-matched controls, 
n= 113,562—1358 deaths) also found high rates of deaths amenable 
to good-quality health care at 37.0%, compared with 22.5% in the 
general population [70]. The authors also pointed out the standard 
definition of amenable deaths they used did not include some types 
of death that could be considered amenable to health care and which 
they found occurred more commonly in people with intellectual 
disabilities, including deaths from urinary tract infections and aspir- 
ation pneumonitis [70]. Hence, these disturbing figures are actually 
an undercount of the deaths amenable to good care that people with 
intellectual disabilities experience. 

Improving health care for people with intellectual disabilities 
needs to become a priority for clinicians, service commissioners, 
and policymakers. 
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The complexity of causes 


Intellectual disability (ID) can follow many biological, environ- 
mental, and psychological events. Most do not directly, or inevit- 
ably, lead to ID. Genetic causes may be hereditary or non-hereditary 
and may or may not produce specific syndromes. It is no longer easy 
to separate genetic from non-genetic causes; for example, some en- 
vironmental causes, such as lead, contribute to ID and can produce 
heritable epigenetic changes. In other disorders, environmental fac- 
tors are more prominent or perhaps more easily measured. This is a 
rapidly changing area, as advances in genetic methodology and thr 
ability to share large samples have greatly increased the identifica- 
tion of genetic determinants of severe ID [1]. 

It is presumed that abnormality in prenatal brain development 
accounts for many, if not most, cases of ID. Neurological symptoms 
during the neonatal period are strongly associated with prenatal de- 
velopmental disturbances. Placental insufficiency may lead to mal- 
nutrition, intrauterine growth retardation, and prematurity. One of 
the most important examples of impaired prenatal brain develop- 
ment is that of fetal alcohol spectrum disorders (FASD). Research 
on this very common cause of ID has focused on several neuro- 
transmitters, insulin resistance, alterations of the hypothalamic- 
pituitary-adrenal (HPA) axis, oxidative stress, and epigenetic 
factors [2]. Other toxic substances, such as lead, not only contribute 
to intellectual impairment, but may also produce heritable epigen- 
etic changes [3-5]. 

As discussed elsewhere, ID frequently coexists with other med- 
ical, neurological, and psychiatric disorders, which may further 
restrict interactions with the environment and further delay the de- 
velopment of the individual. 


Classifying causes 

It is difficult to establish the prevalence of ID, as perhaps 70% of 
those so classified have relatively mild disturbance. Rates are higher 
among very low-birthweight infants, with substantial variations 
in rates of mild ID by socio-economic status, but similar rates for 





severe ID [6]. There are many possible bases for classifying causes. 
Here too, the issue is complex with respect to underlying brain ab- 
normalities. Both genetic and non-genetic forms of ID share cellular 
and cortical neurophysiological pathogenetic signatures that result 
in anomalous function of pyramidal neurons [7]. 

Classification by timing may be more successful, although here 
too pre-existing risk factors, such as low iodine in certain geographic 
regions, complicate recommendations for screening for thyroid dys- 
function [8, 9]. Thus, screening of environmental, maternal pre- 
natal, and neonatal measures may all be indicated (for example, folic 
acid deficiency). Separating a group with known genetic syndromes 
has been useful to date, but because these syndromes are prolifer- 
ating and often overlap substantially, it may be unclear in the future 
how helpful these will be as primary classifiers as the number of syn- 
dromes continue to increase. 

This chapter is divided into four sections: 


1. ID in which there are well-documented external prenatal envir- 
onmental factors. 

2. ID with salient genetic causes. 

3. ID treatment and prevention. 

4. [D-associated syndromes in which multiple clinical manifest- 
ations are apparent, in addition to ID (see Chapter 22). 


The section on treatment and prevention focus on the nature and 
timing of treatment and preventive efforts. 








External prenatal factors 


Fetal alcohol exposure 


Fetal alcohol spectrum disorder (as it is commonly called in the lit- 
erature) is one of the main causes of ID worldwide and the leading 
non-hereditary cause of ID in Western geographic regions, with 
1% of children in the United States affected. It is now identified in 
DSM-V through a specifier and is no longer a separate entity in ICD- 
11. Part of the reason for this diagnostic change is the variability 
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in, and non-specificity of, neurological and physiognomic features. 
Research has focused on a broad array of physiological abnormal- 
ities, as well as epigenetic pathways [2]. Current treatment involves 
support to the families with symptom-focused cognitive and psychi- 
atric remediation [10]. Future efforts must prepare for better strat- 
egies for the prevention of this major societal problem. 


Congenital hypothyroidism 


The essential role of thyroid hormone in CNS maturation has been 
clearly demonstrated, and the critical period for this CNS depend- 
ency extends from fetal life to 24-36 months of age [9]. The benefit 
of early treatment was apparent, and with the availability of con- 
genital hypothyroidism (CH) screening through assays for thy- 
roxine and thyroid-stimulating hormone and an adaptation of these 
methods to the filter paper blood spots used for phenylketonuria 
(PKU) screening, CH screening is now a standard of care. The ma- 
jority of children who were treated early experience normal growth 
and normal-range IQ values [11]. Widespread micronutrient defi- 
ciencies exist with pregnant women and their children under 5 years 
at highest risk. Iron, iodine, foliate, and zinc deficiencies are the 
most widespread and all contribute to poor growth and intellectual 
impairments [12]. 


External postnatal factors 
Brain injury 


Traffic accidents are a major source of brain trauma. Here too, mul- 
tiple factors are involved in recovery [13, 14]. 

Meningitis 

Meningitis after the neonatal period can result in ID, particularly 
in severe cases. Lead is a potent neurotoxicant, with demonstrated 
effects on the brain of children and adults. There is a documented 
association between early life exposure to lead and impaired cogni- 
tive function in children. These impairments are influenced by the 
degree of exposure, indicated by childhood blood lead levels and 
sociodemographic factors, with impairment generally falling in the 
milder range [15, 16]. Early treatment is of great importance, along- 
side environmental monitoring. 


Salient genetic causes 


Twin and family studies in various populations and patient cohorts 
revealed a strong genetic basis, with 30-90% heritability, depending 
on the diagnostic category and study design [17, 18]. Similar to IQ, 
mild ID (50 < IQ < 70) is heritable and caused by many genes of 
small effect [17, 19-22]. However, a major cause of severe ID could 
be rare, non-inheritable de novo point mutations [23-25], along 
with environmental factors [26, 27]. 

Genetic causes of ID are thought to account for 25-50% of ID 
cases and are more frequently observed in the group of severe ID, 
accounting for up to 65% [28-33]. Genetic causes of severe ID in- 
clude chromosome aneusomies (aneuploidy and translocation), 
chromosome structural abnormalities, and monogenic disease. 
Unfortunately, ID displays extreme genetic locus heterogeneity, 
and about 50% of ID still remain without any molecular diagnosis 
[34]. ID shares some genetic mechanisms with the broader group of 


neurodevelopmental disorders [30]. The combination of novel tech- 
nology, such as next-generation sequencing (NGS), and increased 
biological understanding is rapidly increasing the diagnostic yield 
of genetic tests in ID and improving the usefulness of genetic test 
results for patients and families involved [35-37]. In addition, it is 
providing possibilities for carrier testing and prenatal screening, as 
well as new targets for treatment [25]. 


Chromosome aneusomies 


As chromosome aneusomies are the most common known causes 
of ID, several new methods, based on high-density microarray tech- 
niques or NGS followed by karyotype analysis, have been devel- 
oped and widely used over recent years to increase detection rate 
of subtle aneusomies [21, 38-40]. These causes explain up to 15% of 
severe cases [41, 42]. Trisomy 21, or Down’s syndrome, is the most 
common chromosomal aneuploidy associated with ID, with an es- 
timated frequency of 1.5 in 1000 pregnancies and accounting for 6- 
8% of all ID. Other common chromosomal abnormalities include X 
chromosome aneusomies and a variety of cytogenetically balanced 
and unbalanced translocations. The most frequent X chromo- 
somal aneusomies are XXY (Klinefelter’s syndrome), XO (Turner's 
syndrome), and XYY syndrome, accounting for 1% of severe ID. 
There is also a wide range of recurrent sub-chromosomal deletions 
or duplications such as Prader-Willi and Angelman syndromes 
(15q11.2-q13 deletion), Williams—Beuren syndrome (7q11.23 dele- 
tion), Smith-Magenis syndrome (17p11.2 deletion), and di George 
syndrome (22q11.2 deletion). The wide application of high-density 
microarrays increases the identification of copy number variants 
(CNVs) associated with ID. Approximately 15-20% of patients with 
ID had microdeletions and duplications revealed by array-based 
CNV analyses [39, 43-45]. These CNVs include both rare de novo, 
as well as rare inherited, mutations. It is anticipated that clinical 
overlap will make their syndromal phenotypic distances less useful, 
as more genetic events are identified. 


Single-gene causes 


It has been estimated that mutations in more than 700 different 
genes may cause ID [25]. These mutations include X-linked, auto- 
somal dominant, and autosomal recessive ID and can be inherited 
or arise through de novo events. Until NGS methods came into use, 
only a handful of autosomal ID-associated recessive genes were dis- 
covered, possibly due to limitations of cytogenetic and sequencing 
technology and the absence of large families with autosomal forms 
of ID [25]. Most metabolic disorders belong to autosomal recessive 
forms of ID (ARID), caused by single mutated genes that disturb 
metabolism by deficient enzyme activity. These diseases include 
PKU and homocystinuria. For the last decade, over 300 genes have 
been identified for ARID, mostly by homozygosity mapping using 
single-nucleotide polymorphism (SNP) microarrays in large con- 
sanguineous families and subsequent follow-up of candidate genes 
by Sanger sequencing [46, 47]. In outbred populations, ARID may 
account for about 10-20% of the cases, while autosomal recessive 
disorders were found to be up to 3-fold more frequent among in- 
bred, compared to non-inbred, cases [48, 49]. 

Autosomal dominant genes cause tuberous sclerosis, myotonic 
dystrophy, Gorlin syndrome, neurofibromatosis type 1, Apert syn- 
drome, Menes syndrome, and Huntington’s disease. The trio-based 
exome sequencing design allowed the identification of dominant 
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de novo mutations (heterozygous) or small indels associated with 
severe, sporadic non-syndromic ID, accounting for 13-55% of pa- 
tients with high locus heterogeneity [17, 35, 50]. Exome sequencing 
also rapidly identified autosomal dominant sporadic syndromes as- 
sociated with ID such as Schinzel-Giedion syndrome [51], Kabuki 
syndrome [52], and Bohring-Opitz syndrome [53]. 

For many years, research into the molecular causes of ID has fo- 
cused on the X chromosome because males are affected more often 
than females [54-56]. Mutations in more than 100 X-linked genes 
are now known to cause ID, accounting for 10-12% of males with 
ID, with none of these genes individually explaining more than 0.1% 
of ID [57-59]. The most well-known example is fragile X syndrome, 
which is caused by expansion of a CGG repeat in the FMR1 gene 
(fragile X mental retardation 1) and is the second most common 
genetic cause of intellectual disability, accounting for about 0.5% of 
ID, after Down syndrome [60]. 

The genetic causes of ID are rapidly being uncovered and will be 
essential for genetic counselling, diagnostics, and treatment in the 
future. Microarrays for mutation screening of known disease genes, 
as well as exome and whole-genome sequencing, will likely become 
essential tools, both for clinical diagnostic purposes and research. 


Treatment and prevention 


Screening/prevention/preatment 


Newborn screening—tests that can be done within the first hours 
or days of life and have a potential for preventing severe health 
problems— has evolved from simple blood or urine tests to a com- 
prehensive and complex system, detecting over 50 different condi- 
tions, including those relevant to ID such as CH and several amino 
acid deficiencies. The newborn blood spot taken from a heel stick is 
mandated in most of North America and Europe [61]. 

Table 23.1 shows the timing and nature of screening for genetic 
disorders and those with prominent environmental factors. Box 
23.1 lists the diagnostic examinations for presumed or established 
intellectual disabilities. 


Environmental/social assessment 


FASD—prenatal counselling of the mother and family, family coun- 
selling, and prenatal care. 


Table 23.1 Aetiology of intellectual disability based on time and 
mechanism of the injury to the central nervous system and history 
for timing of diagnosis and treatment 


Genetic causes: chromosomal, CNV, Family history: recurrent 
single gene, single base pair miscarriages, consanguinity 
Mitochondrial disorders 


External prenatal factors: infections, 
maternal illness, for example, 
hypothyroidism, micronutrient deficiency 


Gestational history: maternal 
infection, trauma, drugs, alcohol, 
exposure 


Postnatally acquired disorders 

Infection 

Brain trauma 

Psychosocial, for example, environmental 
stimuli 


Social history 
History of meningitis, rubella 
Trauma, toxicity, for example, lead 





Box 23.1 Diagnostic examination of presumed or established 





intellectual disability 


e General examination for dysmorphic features. 

e Neurological, ophthalmological, audiological, cardiological, and 
neuropsychological assessments. 

e Full blood count, thyroid function. 

e Genetic screening (Table 23.1). 

e Blood/urine. 

e Neuroimaging. 

e Neurophysiological EEG. 











Prenatal screening/maternal assessment 


There have been potentially revolutionary changes to prenatal diag- 
nosis and screening and include both imaging and tissue diagnoses. 
The discovery of cell-free fetal DNA in maternal plasma opened up 
a new frontier in the quest for a non-invasive prenatal screening 
strategy [62], since validated in several studies [63, 64]. Future 
population studies will be essential to evaluate these data, as the 
overall cumulative prevalence of many genetic risk markers is not 
established. 


Newborn/infant tests and treatments (for example, thyroid) 


Newborn screening for CH—awareness of an iodine-poor environ- 
ment and CH screening at birth now permit levothyroxine therapy 
by 4-6 weeks of life, which represents the standard of care. 


Syndromes causing intellectual disability 





Classic syndromes are usually associated with chromosomal abnor- 
malities and large structural variations, since this type of alteration 
is more likely to lead to additional phenotypic presentations other 
than ID alone. This section deals with some of the syndromes that 
increase to have ID. In syndromic ID, patients present with one 
or multiple clinical features or comorbidities, in addition to ID. 
However, symptoms of some syndromes may be so subtle that they 
are extremely difficult to diagnose, unless the features are looked for 
specifically in the context of a known genetic defect previously asso- 
ciated with these features. Thus, the distinction between syndromic 
ID and non-syndromic ID is often blurred [30]. 

Table 23.2 describes the most well-known syndromes causing ID. 

Tuberous sclerosis occurs in 1 in 7000 people, with a majority 
arising from spontaneous mutations to 9q34 and 16p13. This syn- 
drome is characterized by a triad of epilepsy, intellectual deficiency, 
and a characteristic facial skin lesion. 

Turner's syndrome occurs in 1 in 10,000 female births and is caused 
bya loss or an abnormality of one X chromosome in women. Ninety- 
nine per cent of affected fetuses miscarry. Affected women exhibit 
normal intelligence and verbal abilities, with significant variation. 
They show deficits in spatial perception, visual motor integration, 
affect recognition, visual memory, and attention. Furthermore, they 
tend to display hyperactivity and distractibility during childhood, 
poor social skills, immature social relationships, and low self-esteem 
in adolescence. Dysmorphic features include: a webbed neck, a low 
hairline at the rear of the head, widely spaced nipples, and multiple 
pigmented naevi. Some have cardiovascular abnormalities. 
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Table 23.2 Most prevalent syndromes associated with intellectual disability 





Klinefelter’s 
syndrome 


XXX syndrome 


XYY syndrome 


Down's syndrome 


Velo-cardio-facial 
syndrome 


1 in 660 


live births 
(possibly 
increasing) 


1 in 1000 
female births 


1 in 1000 live 
male births 


1 in 2000 at 
maternal 
age of 30 

1 in 100 at 


maternal age 


of 40 


1 in 2000 
people 


phenotypic males 


47,XXX resulting from 


primary non-disjunction 
of a maternal or paternal X 


chromosome during meiosis. 


Only 40 cases of 48,XXXX 
reported 


Risk increases with advanced 


maternal age 


Primary non-disjunction of 


the Y chromosome; 10% 
have mosaic 46,XY/47,XYY 
chromosome complement. 
Offspring with two Y 
chromosomes are rare 


Additional chromosome 


21. Rarer cases caused by 
chromosomal translocation 
or mosaicism 


Microdeletions at 22q11 (90% 


of cases arise de novo; 10% 
inherited) 


IQ score ranges from 60 to 130, with greater 
impairment to verbal IQ. High rate of 
learning disability, speech and motor 
delays, and impairment in language 
processing. Adults may have increased 
impulsiveness and rate of antisocial 
behaviour 


Affected women have IQs of between 80 
and 90, with increasing severity of IQ 
deficits corresponding to additional 
X aneuploidies. Women with XXXX 
syndrome have IQs of 55-75 

Developmental delay; motor and speech 
delays; higher risk of cognitive deficits and 

earning disabilities in school-age years. 

Some experience relatively poor short- 

erm auditory memory 

Under-activity and withdrawal have been 

reported; slowed emotional development 


Affected men have lower mean intelligence 

scores, but with large overlap with the 

normal distribution 

Poor social adaptation, distractibility, 

hyperactivity, temper tantrums, and 
speech and language problems are 
relatively common in childhood 

Increased frequency of XYY men among 
inmates in special prisons 

Balance and co-ordination minimally 
compromised 








Adults have moderate ID 

Affected children have some degree of 
specific speech and language delay 

Stubbornness and obsessional features 
over-represented 


Learning disability, as well as speech and 
language problems are common 

Difficulties with social interaction. Anxiety, 
social withdrawal, and other disorders 
have also been reported 


Classification Genetic abnormality Behavioural and cognitive features Physical features 


Additional X chromosome(s) in 


Small testes; hypergonadotropic 
hypogonadism; azoospermia; infertility; 
increased luteinizing hormone 
and follicle-stimulating hormone; 
hypogonadism; gynaecomastia 

Below-average height, weight, and head 
circumference at birth; accelerated 
growth and increased height from 
3 years onwards 

Increased risk of diabetes and metabolic 
syndrome, abdominal obesity, 
autoimmune diseases, osteopenia, 
cryptorchidism, decreased penile size 
(children), congenital malformations (for 
example, clinodactyly), cardiovascular 
abnormalities, breast cancer, 
osteoporosis, mediastinal cancers 


Hypotonia, low birthweight, and small 
head circumference in newborns. 
Increased height in adulthood, with a 
low body mass index, epicanthal folds, 
hypotonia, and clinodacty| 

Unimpaired fertility, but possible 
premature ovarian failure and recurrent 
spontaneous abortions 

Seizures, renal, and genitourinary 
abnormalities 





Increase in body and leg length between 
years 4 and 9. As adults, most are over 
10 cm taller than their fathers 

Sexual development and fertility are 
unaffected 





Muscular hypotonia that typically 
improves with development, short 
stature in adulthood, with characteristic 
facial appearance, eyes sloping upwards 
and outwards, wide nose bridge and 
unusually shaped head, transverse 
crease on the arm, large cleft between 
first and second toes, relatively short 
upper arms 

Thyroid abnormalities, heart abnormality, 
gastrointestinal abnormality, changes in 
blood cells 


Physical features include cardiac 
abnormalities, including ventriculoseptal 
deficits, pulmonary stenosis, and cardiac 
outlet abnormalities 

Prominent nose with broad bridge 
and squared tip, small head, or small 
lower jaw; ocular abnormalities; cleft 
palate; short stature and long, thin, 
hyperextensible fingers 


Table 23.2 Continued 


Classification Genetic abnormality Behavioural and cognitive features Physical features 


1 in 3000 births Associated with 17q11.2, which Speech or language abnormalities, learning 


Neurofibromatosis 


type 1 
Fetal alcohol 0.33 per 1000 
syndrome’ births 


Fragile X syndrome 0.3 in 1000 


Congenital 1 in 4000. 
hypothyroidism Occurs more 
commonly in 
females 
Duchenne 1 in 4000 male 
muscular births 
dystrophy 


“No longer separate in DSM-V. 


Angelman syndrome occurs in 1 in 10,000 births and is associ- 
ated with deletions at the 15q11-q13 region of maternal origin (see 
Prader-Willi syndrome). Affected patients exhibit severe ID, mark- 
edly delayed motor milestones, little speech development, and over- 
activity associated with short attention span. Physical features include 
hypopigmentation, a small head, characteristic face with wide mouth, 


produces neurofibromin 
and is involved in regulating 
cell division and tumour 
suppression; 50% of all new 
cases arise in unaffected 
families due to high 
spontaneous mutation 


disability, specific developmental disorders 

involving reading writing, or numeracy 
Distractibility, impulsiveness; visuospatial 

abnormalities, lack of co-ordination 


Exposure to alcohol at any stage Usually mild to moderate ID, but may 


of fetal development inhibits 
N-methyl-D-aspartate 
receptors, which mediate 
post-synaptic excitatory 
effects of glutamate, and 
affects cell proliferation and 
causes cognitive impairment 


X-linked trinucleotide repeats 
eventually halt function of 
the FMR1 gene 

Healthy individuals have 
around 50 repeats; carriers 
with 50-100 repeats are 
considered to have the 
pre-mutation, and carriers 
with over 230 repeats are 
considered to have the full 
mutation 


Majority of cases caused 
by deficiency of iodine in 
mother or infant. Some 
cases due to mutations 
in genes involved in 
thyroid development and 
hormonogenesis such as 
PAX8, TSHR, TSHB, DUOX2, 
TG, TPO, and SLCA5 


X-linked recessive, with 30% of 
cases due to new mutations 

Deletions, duplications, and 
mutations at Xp21 result in 





failure to produce dystrophin, 


a protein component of 
muscle tissue 


be severe 

Reduction in attention span, over-activity, 
irritability in infancy, and co-ordination 
problems 


Mild to moderate ID. Severity of ID in women 
depends on the random proportion of cells 
with X inactivation. Pre-mutation carriers 
are intellectually unimpaired 

Problems with attention and concentration 
may be disproportionate to the 
severity of ID 

Speech and language delay; 
disorganized speech 

Some degree of social impairment. Men with 
fragile X syndrome are usually affectionate, 
without the aloof quality typical of autism 

Self-injury; stereotyped behaviour, for 
example, hand flapping 


If untreated, even mild cases may lead 
to failure of cognitive development, 
resulting in ID 

Sleep disorder 


Some affected people have IQs within 
normal range 

Specific reading disorder and learning 
disability 

Low mood, anxiety, and social abnormalities 
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Diagnosis usually requires two or more 

of the following symptoms: six or more 
ight brown skin lesions of >5 mm in 
diameter before puberty or 15 mm after 
puberty; two or more neurofibromas or 
one plexiform neurofibroma; freckling 
of the inguinal or axillary regions; two or 
more Lisch nodules; optic nerve glioma; 
bony lesions; a first-degree relative with 
he disorder 
Non-enhancing hyperintensities on MRI 
ocated at the cerebellum, basal ganglia, 

brainstem, and thalamus 
Tumours from connective tissue of nerve 

sheaths 

















Facial dysmorphology: thin upper lip 
and smooth philtrum, small jaw, 
low-set abnormal ears, and palate 
abnormalities. Growth retardation, 
skeletal abnormalities, for example, 
deformed ribs and sternum, spinal 
curvature, dislocated hips, fused or 
webbed or missing fingers or toes, 
limited joint movement, small head 

Heart abnormalities and urinary tract 
anomalies 

Small brain with abnormally arranged cells 


Large testes apparent after puberty; 
long face; high-arched palate; large 
forehead, ears, and lower jaw; hyper- 
extensible joints, flat feet; cardiovascular 
abnormalities; ear infections; cataracts 


Severely affected children have a 
characteristic appearance, with a puffy 
face, large and protruding tongue, dry 
and brittle hair, low hairline, and low 
muscle tone 

Constipation and jaundice 


Typical onset is from 2 to 6 years. 
Progressive muscle weakness, affecting 
the pelvis, upper leg, and upper arm first 

Respiratory muscles and cardiovascular 
muscle abnormalities may occur later 

Severity is initially greater in the legs and 
torso, eventually moving up to the arms 
and respiratory muscles 


‘hooked’ nose, prominent lower jaw, widely spaced teeth, and tongue 
protrusion. Voluntary movements are jerky, and gait is ataxic with 
stiff legs. Affected patients tend to enjoy social and physical contact, 
mouthing objects, and display a fascination with water. 

PKUoccurs in 1 in 10,000 live births and is caused by a mutation at 
12q22-24.1, which results in deficiency of the enzyme phenylalanine 
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hydroxylase. Affected patients exhibit moderate to severe learning 
disability, displaying deficits in mathematical, visuospatial, and lan- 
guage skills. Physical features include a small head, blond hair, blue 
eyes, and eczema. Behavioural features include over-activity, self- 
injury, irritability, and marked social impairments. They are often 
affected by tremor and movement disorders. 

Rett’s syndrome occurs in 1 in 10,000 women and is associated with 
a mutation at Xq28, which is usually lethal in males but may persist 
in a small number of surviving males. Affected patients exhibit se- 
vere learning disability. Physical features include cessation of devel- 
opment, lack of muscle tone, scoliosis, spasticity, reduction in brain 
size with reduced cortical thickness, reduced neuronal branching, 
and depigmentation of the basal ganglia. Behavioural features in- 
clude immobility at infancy, loss of skills from around 18 months 
onwards, stereotyped movements, over-breathing or cessation of 
breathing, sleep disturbances, and withdrawal. 

Williams syndrome occurs in 1 in 15,000 infants and is associated 
with a deletion at 7q11.3. Affected children have moderate or severe 
learning disability. Behavioural features include difficulty feeding, ir- 
ritability, constipation, and failure to thrive at infancy. Affected chil- 
dren display social disinhibition, over-activity, poor concentration, 
eating and sleeping abnormalities, abnormal anxiety, poor peer rela- 
tionships, and abnormal hearing sensitivity. Physical features include 
prominent cheeks, wide mouth, flat nasal bridge, growth retardation, 
high serum calcium concentrations, and kidney and heart lesions. 

Smith-Lemli-Opitz syndrome mostly affects males, occuring 
in 1 in 30,000 live births, and is due to abnormalities at 11q12- 
13. Affected patients have moderate to severe learning disability. 
Physical features include growth retardation of the fetus during 
pregnancy, small head, drooping eyelids, squint, forward-facing 
nostrils, small lower jaw, abnormalities of external genitalia, hypo- 
spadias, undescended testes, cleft palate, finger abnormalities, and 
abnormalities of almost all major organ systems. Behavioural fea- 
tures include aggressive and self-injurious behaviour. 

Cri-du-chat syndrome occurs in 1 in 35,000 births and is associ- 
ated with deletions at the 5p terminal. Affected patients have mild to 
severe intellectual deficiency, with markedly delayed language devel- 
opment. Behavioural features include hyperactivity, feeding difficul- 
ties, and an abnormally high-pitched cry at infancy. Physical features 
include ear abnormalities, round face with widely spaced slanting 
eyes, small head, flat broad nose, small lower jaw, and premature 
greying of hair. They sometimes have an asymmetrical face, cleft lip 
or palate, curved fingers, hernias, and orthopaedic abnormalities. 

Prader-Willi syndrome occurs in 1 in 40,000 live births and is as- 
sociated with a deletion at 15q11-q13 of paternal origin. Physical 
features include hypotonia, short stature, small hands and feet, a 
characteristic pattern of facial appearance, and lack of sexual devel- 
opment at adulthood. Behavioural features include feeding problems 
in infancy, overeating in early childhood, insatiety, sleep abnormal- 
ities, lower threshold for loss of temper, scratching or picking at skin, 
insistence on routines, and compulsive behaviours. Obesity may re- 
sult due to overeating. 

Smith-Magenis syndrome, occurring 1 in 50,000 births, is associ- 
ated with deletions at 17p11.2 and commonly results in moderate 
ID. Behavioural and cognitive features include placid newborns, 
difficulty with feeding, hyperactivity, self-injury, self-hugging, mid- 
line handclapping, sleep disorder, absence of rapid eye movement 


(REM) sleep, and insensitivity to pain. Physical features include flat- 
tened mid face, abnormally shaped upper lip, short hands and feet, 
single transverse palmar crease, abnormally shaped or placed ears, 
and sometimes a high-arched palate or protruding tongue. Otitis 
media and squint are common. 

De Lange syndrome occurs in 1 in 60,000 live births and is associ- 
ated with mutations in chromosome 5 at the NIPBL gene. It results 
in severe learning disability, with very limited speech. Physical fea- 
tures include hearing impairments, gut malformations, congenital 
heart defects, gastrointestinal pain, growth retardation, well-defined 
arched eyebrows meeting in the middle, long and curled eyelashes, 
small nose with forward-facing nostrils, down-turned mouth with 
thin lips, and small or shortened limbs. Behavioural features include 
self-injury, autistic features, and pleasurable responses to vestibular 
stimulation. 

Mucopolysaccharidoses are autosomal recessive (except for 
Hunter’s syndrome, which is X-linked) that occur in around 1 in 
100,000 to 1 in 500,000 births. ID ranges from absent to severe. 
Physical features include gargoylism, hepatosplenomagaly, joint 
stiffness, eye abnormalities, and short statures. Behavioural and 
cognitive features include sleep problems, aggression, over-activity, 
restlessness, and anxiety. 

Rubinstein-Taybi syndrome occurs in 1 in 125,000 live births and 
is associated with microdeletions at 16p13.3. Patients have mod- 
erate learning disability. Physical features include congenital heart 
defects, urinary tract abnormalities, decreased height, inadequate 
weight at infancy, small head, beaked or straight nose, downward- 
slanting eyes, stiff gait, and thumbs or first toes with broad terminal 
phalanges, sometimes with an angulation deformity. Behavioural 
cognitive features include constipation, friendly disposition, pro- 
pensity to self-stimulatory activities, and reduced attention span. 

Lesch-Nyhan syndrome occurs in 1 in 380,000 births and is an X- 
linked recessive trait which results in hyperuricaemia. It can also be 
caused by complete and partial deletions, insertions, and duplica- 
tions of Xp26q27. Patients with this syndrome have mild to severe 
ID. Physical features include hypotonia, hyperreflexia, and clonus 
during the first year, growth retardation, and gout. Behavioural 
features include compulsive behaviours, self-injury, spasticity and 
choreo-athetoid movements at 9 months, dystonic movements, dys- 
arthria, and athetoid and movement abnormalities. 
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The advent of a model of disorders of intellectual development/in- 
tellectual disability that emphasizes inclusiveness and participation 
in society for affected individuals brought with it a greater emphasis 
on considerations of the influence of contextual factors that facili- 
tate or impede functioning within society (see also Chapter 21). The 
WHO revision of the International Classification of Diseases (ICD), 
the most widely used health classification worldwide, was approved 
by the World Health Assembly in May 2019. The focus of the revi- 
sion of the ICD Mental and Behavioural Disorders chapter led by 
the WHO Department of Mental Health and Substance Abuse is the 
incorporation of current scientific evidence in a set of diagnostic 
guidelines that will maximize clinical utility and global applicability 
[1, 2]. Consistent with the overarching goal of the classification re- 
vision to enhance clinical utility, the ICD guidelines for disorders of 
intellectual development were written to facilitate more pervasive, 
efficient, and accurate identification of affected individuals, with the 
ultimate goal of enhancing access to appropriate services in varied 
settings globally [3]. 

Disorders of intellectual development is a term proposed for the 
first time in ICD to replace the label of mental retardation that ap- 
peared in ICD-10. Although intellectual disability is frequently used 
and indeed appears in DSM-5 [4], inclusion of the word ‘disability’ 
in the label would be at odds with the approach promoted by the 
WHO Family of International Classifications that distinguishes 
between health conditions classified in the ICD from the conse- 
quences of those conditions (that is, disabilities) classified in the 
ICF. The ‘Disorders of intellectual development’ title also reflects 
that the onset of the condition is during the developmental period 
and focuses on the broad construct of ‘intellect’ that is well under- 
stood in policy circles, as well as among researchers and clinicians 
[3]. Disorders of intellectual development is classified among other 
neurodevelopmental disorders, and its guidelines are used in con- 
cert with the guidelines for autism spectrum disorder to characterize 
the degree of impairments in intellectual functioning in individuals 
affected by that disorder. 

Disorders of intellectual development are defined by three es- 
sential features in ICD: (1) significant limitations in intellectual 





functioning generalized to various domains (for example, percep- 
tual reasoning, working memory, etc.); (2) significant limitations 
in adaptive behaviour, which is the set of conceptual, social, and 
practical skills; and (3) onset of both of these limitations during the 
developmental period. ICD-11 preserves the existing four levels of 
severity that appear in ICD-10 (that is, mild, moderate, severe, and 
profound), largely due to evidence that increasing severity is asso- 
ciated with a number of important support needs, including choice 
of living arrangements [5]. ICD-11 will propose an approach that 
requires consideration of both intellectual and adaptive behaviour 
functioning to determine the level of severity. Each level of severity 
is defined by the expected range of performance on measures of in- 
tellectual and adaptive behaviour functioning, with a caveat stipu- 
lating that differentiation between severe and profound levels on the 
basis of intellectual functioning is unreliable and therefore should be 
guided by findings on measures of adaptive behaviour skills. There is 
also a qualifier to indicate a provisional diagnosis that is used when 
a comprehensive assessment has not been, or cannot be, performed 
(for example, sensory limitations) or if a child is 4 years or younger. 
Poor reliability of the assessment of intellectual and adaptive behav- 
iour functioning in young children preclude a definitive assignment 
of severity. 

In recognition of the fact that, in many settings worldwide, par- 
ticularly in low- and middle-income countries, patients are unlikely 
to be assessed by mental health or developmental specialists, the 
ICD-11 guidelines also include a set of tables that outline concrete 
behavioural indicators of intellectual and adaptive behaviour func- 
tioning at each severity level expected of individuals falling into 
one of three age ranges (early childhood, childhood/adolescence, 
and adulthood) [6]. These tables were developed primarily to assist 
non-specialists to determine the level of severity of a disorder of in- 
tellectual development. Although the use of standardized, normed 
measures of intellectual and adaptive behaviour functioning is pref- 
erable for determining the level of severity, the WHO recognizes 
that availability of such measures in local languages in many global 
settings is limited. Thus, the behavioural indicator tables provide an 
alternative means of establishing severity on the basis of observed 
or informant-reported behaviours. This system represents a con- 
siderable improvement over the idiosyncratic determination of 
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severity or overreliance on the use of a provisional diagnosis, prac- 
tices that limit patients’ access to appropriate services. Clinicians are 
instructed to assign the level of severity on the basis of the level at 
which the majority of the individual’ intellectual ability and adap- 
tive behaviour across all three domains fall, using various sources of 
data and clinical judgement. 


Co-occurring disorders in ICD-11 


Chapter 24 has outlined the common comorbidities seen between 
intellectual disability and other psychiatric disorders. The ICD-11 
proposals for the disorders of intellectual development guidelines 
provide users with comprehensive guidance on the detection of com- 
monly co-occurring disorders in affected individuals. Identification 
of these co-occurring conditions is of vital importance in under- 
standing the needs for, and developing, an appropriate treatment 
plan for affected individuals. Therefore, ICD-11 proposals outline 
guidance on the more commonly comorbid conditions, including 
autism spectrum disorder (ASD), attention-deficit/hyperactivity 
disorder (ADHD), and various other mental and behavioural 
disorders. 

ASD frequently presents with limitations in intellectual and adap- 
tive behaviour functioning, and when both are found to be two or 
more standard deviations below the mean, a diagnosis of ASD with 
intellectual impairment is assigned co-concurrently with a disorder 
of intellectual development qualified according to the determined 
level of severity. However, in these circumstances, the ICD-11 guid- 
ance indicates that determination of the level of severity of the in- 
tellectual disability should be more reliant on the conceptual and 
practical domains of adaptive behaviour functioning, giving less 
weight to social skill abilities, because social communication deficits 
are a hallmark feature of ASD and better accounted for by that 
diagnosis. 

Rates of co-occurring ADHD in individuals with a disorder of 
intellectual development diagnosis are significantly higher than in 
the general population (for example, [7]). When inattention and/or 
hyperactivity-impulsivity are significantly affected in an individual 
with a disorder of intellectual development beyond what is expected, 
based on age and the level of intellectual functioning, a co-occurring 
diagnosis of ADHD can be assigned. 

A variety of other mental and behavioural disorders are known to 
occur at the same, or perhaps higher, rates among individuals affected 
by disorders of intellectual development, as compared to the general 
population (for example, mood disorder, anxiety and fear-related 
disorders, impulse-control disorders, and psychotic disorders) [8]. 
ICD-11 guidelines caution clinicians to ensure that assessments are 
conducted with methods that are appropriate to individuals’ level of 
intellectual functioning and to be aware that self-report is less re- 
liable among affected individuals and may lead to underreporting 
of co-occurring psychopathology. Therefore, observable signs or re- 
ports by caregivers or individuals close to the person being assessed 
may provide a more accurate understanding of the presence of co- 
occurring ICD-11 mental and behavioural disorders. 


The general approach to management of persons with an intel- 
lectual disability is educational and psychosocial, with use of 


specific appropriate management interventions for mental health 
and behavioural problems when needed. Psychiatric comorbidities 
in persons with an intellectual disability are often under-detected. 
Comorbidities are typically suspected when patients exhibit changes 
in behaviour, particularly disruptive behaviour. The Royal College 
of Psychiatrists (2016) [9] has recently summarized three broad 
situations in which people with an intellectual disability and mental 
health or behavioural problems might come into contact with pri- 
mary or secondary care: (1) the presence of challenging behaviour 
that is not associated with a mental disorder; (2) the presence of 
challenging behaviour that is associated with symptoms that meet 
the diagnostic criteria for a mental disorder; and (3) the presence 
of challenging behaviour that is associated with some psychiatric 
symptoms that do not quite fulfil the diagnostic criteria for a mental 
disorder. In many circumstances, changes in behaviour can be at- 
tributed to physical illness, difficulties in communication, or envir- 
onmental changes, or can be a reaction to stressful circumstances 
or frustration. The clinical assessment should consider all the afore- 
mentioned factors, and the diagnostic formulation that serves as 
the basis for a management plan should reflect not only the severity 
and known aetiology of the intellectual disability, but also any co- 
occurring mental and physical disorders, as well as information 
gathered about psychosocial stressors and behavioural problems. If 
the clinician considers the presence of a physical health problem as 
the main factor explaining the behavioural symptoms, appropriate 
intervention for the physical conditions should follow promptly. 

Proper identification of co-occurring disorders is essential in 
addressing the needs of affected individuals. When assessing for 
co-occurrence, clinicians should be familiar with the concept of 
diagnostic ‘overshadowing, whereby emotional and behavioural 
symptoms or frank psychopathology are misattributed to the fea- 
tures of the disorder of intellectual development, rather than a 
separable disorder. Overlooking co-occurring disorders deprives af- 
fected individuals of potentially helpful interventions. A mitigating 
strategy is to use modified diagnostic criteria sets to differentiate 
symptoms of disorders of intellectual development from those better 
attributable to co-occurring conditions [for example, Diagnostic 
Manual-Intellectual Disabilities (DM-ID) [10]]. A general guiding 
principle in the assessment of mental and behavioural disorders in 
individuals with a disorder of intellectual development is that clin- 
icians employ methods that are appropriate to the individual's level 
of development and intellectual functioning, which may require 
a greater reliance on signs in combination with collateral reports. 
Reporting of internal states is more difficult for affected individuals, 
particularly those with more severe forms of disorders of intellectual 
development. 


Psychosocial interventions 


Psychosocial interventions that target behavioural and emotional 
difficulties, as well as diagnosable co-occurring mental and behav- 
ioural disorders, can be effective tools in the overall management 
plan for individuals with intellectual disability. Formulation and 
implementation of a treatment plan, even when it is meant to re- 
duce or eliminate disruptive behaviours (for example, self-injurious 
or aggressive behaviours) should be undertaken, with the goal 
of improving affected individuals’ quality of life and, in doing so, 
respect the principles of equality, non-discrimination, and self- 
determination. Intervention selection that promotes the acquisition 
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ofacademic skills, beneficial habits, and general development should 
be based on the results of a functional behaviour analysis and often 
include psychoeducation, skills training, positive behavioural sup- 
port (for example, differential and non-contingent reinforcement, 
altering triggers for undesirable behaviour, minimal use of aversive 
strategies, and reactive strategies such as the use of distraction [11]), 
carer involvement and training, and, in children, token economies 
[12]. Furthermore, when appropriate, cognitive behavioural therapy 
may be indicated. Many of the same psychosocial interventions that 
apply to redressing adaptive skill deficits among affected individuals 
are relevant to the management and modification of challenging be- 
haviours and co-occurring mental and behavioural disorders. This 
approach views all systems of supports as ultimately serving to im- 
prove human functioning outcomes such as socio-economic status, 
health status, and subjective well-being of the individual [13, 14]. 

Challenging behaviours refer to those that interfere with the af- 
fected individual's quality of life, including their ability to function 
in social, academic, and employment contexts. They typically refer 
to self- and other oriented aggression, destruction of property, and 
behavioural stereotypies [15]. In a large population-based study 
conducted in Sweden, behaviour problems, which are considered to 
be less severe forms of challenging behaviours not necessarily re- 
quiring intervention, were observed in 62% of affected individuals 
[16]. Typically, co-occurring challenging behaviours vary according 
to the level of severity of the disorder of intellectual development 
and with age [17-19]. Although prevention strategies that anticipate 
the need for establishing effective communication strategies, modi- 
fications to environmental stimuli, and awareness of consequences 
that might reinforce challenging behaviour are preferable [20], usu- 
ally consultation requests focus on existing behaviours that signifi- 
cantly interfere with the patient’s functioning or are affecting the 
ability of the individual to function within specific contexts. 

Optimal management strategies often require the co-ordination 
of services across multiple disciplines that include, but are not 
limited to, psychiatry, psychology, occupational therapy, speech and 
language therapy, genetic counselling, and nursing. In addition to 
these professionals, parents, teachers, supervisors, and other care- 
givers should be consulted to provide their perspectives on the 
individual's functioning within the various contexts that they fre- 
quent. Although some challenging behaviours may generalize 
across contexts, some may not. Furthermore, modification of con- 
textual factors (for example, specific stimuli in the environment) in- 
dependent of other management strategies may reduce or eliminate 
challenging behaviours. 

A comprehensive diagnostic assessment typically involves inte- 
gration of data from genetic testing, physical assessment, interviews 
of the patient and his or her caregivers, and consultation with multi- 
disciplinary team members. Furthermore, challenging behaviours 
are most effectively addressed after a functional behaviour assess- 
ment has been conducted. The rationale for functional behaviour 
assessments emerges from research demonstrating that challenging 
behaviours serve a functional role for the individual and become 
established through operant principles of contingent reinforcement. 
Therefore, an explanation of the relationship between antecedent fac- 
tors, challenging behaviours, and consequences provides a rational 
basis for intervention. Indeed, meta-analyses consistently support 
the effectiveness of functional behaviour assessments in improving 
treatment outcomes across all severity levels over those that do not 


include such evaluations (for example, [21]). Challenging behav- 
iours may be found to function as social positive reinforcers (for ex- 
ample, to obtain attention), social negative reinforcers (for example, 
to avoid having to complete a task), or sensory reinforcers (for ex- 
ample, repetitive self-stimulatory behaviours) [22]. Thus, interven- 
tions to reduce the frequency of challenging behaviours can centre 
on removing antecedent factors, altering identified reinforcers, or 
modifying reinforcement schedules. Early intervention to address 
challenging behaviours is beneficial in preventing these from be- 
coming entrenched in the individual’s repertoire. 


Addressing comorbid psychopathology 


Behavioural management strategies predicated on the results of a 
thorough functional behaviour assessments are also helpful in ad- 
dressing co-occurring externalizing disorders (for example, oppos- 
itional defiant disorder). Supplementing clinician-administered 
behavioural management with parent/caregiver skills training is 
often more effective than direct intervention with the affected in- 
dividual (for example, Stepping Stones Triple-P, [23, 24]). With 
respect to emotional disorders, there is emerging evidence that psy- 
chological treatments, in particular cognitive behavioural therapy, 
may be as efficacious in addressing mood, anxiety, stress-related 
disorders (that is, PTSD) in people with disorders of intellectual de- 
velopment as in the general population [25, 26]. The ability to en- 
gage in cognitive behavioural interventions depends, in part, on the 
capacity of the individual to understand the relationship between 
internal states, behaviours, and cognitions, a task that is more likely 
achievable for those with mild to moderate forms of disorders of in- 
tellectual development. 


Pharmacological interventions 


In general, the indications for the use of psychotropic drugs in this 
population are the same as in populations with normal intellectual 
functioning. In clinical practice, the administration of psychotropic 
agents in patients with intellectual disability is very common. A re- 
cent population-based cohort study of adults with intellectual dis- 
abilities in the UK showed that almost half were taking some form 
of psychotropic drug and that the proportion of participants with 
intellectual disability being prescribed psychotropic drugs exceeded 
the proportion with recorded mental illness [27]. Rather, many were 
prescribed these drugs to address challenging behaviours. 

Psychiatric drug treatments are generally effective at the same 
doses and for the same indications than in the general population 
without this condition. Once a medication is selected on the basis of 
being properly indicated, the prescriber is confronted with a series 
of additional challenges. The most salient of these are: the coexist- 
ence of medical comorbidities, the concomitant use of other thera- 
peutic substances, issues related to adherence to the medications, 
and the need to carefully monitor the impact of the pharmacological 
interventions in terms of efficacy and side effects in patients who 
may have communications problems. 

There is no clear evidence that those with intellectual disability 
have more side effects when prescribed psychiatric treatments [28]. 
However, affected individuals have higher rates of physical health 
comorbidities and premature mortality [29]. Major health risk fac- 
tors, such as obesity, metabolic syndrome, and diabetes, are highly 
prevalent in this population [30]. The monitoring of patients with 
intellectual disability receiving psychotropic drugs should consider 








SECTION 3 Intellectual disabilities 


not only the elevated risk of specific drugs, such as dopamine sero- 
tonin antagonists that increase the risk of metabolic syndrome 
and extra-pyramidal symptoms, but also assess medical status, 
dietary regimen, changes in the environment, and polypharmacy. 
Furthermore, with greater severity of intellectual impairment, sen- 
sory deficits and neurological comorbidities (for example, epilepsy) 
are very frequent, increasing the risk of severe side effects at lower 
doses, as well as special sensitivities to the neurological side effects 
of these drugs. 

Involvement of the caregiver should be incorporated in any man- 
agement plan, because adherence, as well as reporting of response 
to treatment and side effects, often relies on their co-operation. 
Their prejudices against medication may be a contributing factor in 
cases of non-adherence to the prescription. The likely effects of the 
intervention on the patient’s and carer’s quality of life should also 
be considered. During the required monitoring period after the 
prescription of psychotropic drugs, efforts should be made to so- 
licit and incorporate objective information from the carers, as well 
as the patients. As a general rule, the lowest possible dose should be 
prescribed, with the physician assessing the need for treatment con- 
tinuation once the target symptoms have improved or when there is 
non-response and/or side effects that have an impact on the patient's 
functioning and quality of life. 


Use of dopamine antagonists/partial agonists 


Dopamine antagonists/partial agonists (meaning the so-called typ- 
ical and atypical antipsychotics) are used frequently in the manage- 
ment of a variety of presenting symptoms, even when the evidence 
for efficacy is not available. Concerns about inadequate prescribing 
of these agents have been raised, in particular, in relation to the 
management of challenging behaviours [31]. While there is limited 
evidence that suggests that dopamine antagonists may be effective 
in treating behavioural disturbances in adults with intellectual dis- 
ability with comorbid autism [32], evidence supporting the use of 
such medication outside this clinical situation is lacking [33]. The 
NICE guidelines (2015) [34] recommend that clinicians consider 
dopamine antagonists to manage challenging behaviour only if 
‘psychological or other interventions alone do not produce change 
within an agreed time; or treatment for any coexisting mental or 
physical health problem has not led to a reduction in the behaviour 
or the risk to the person or others is very severe (for example, be- 
cause of violence, aggression or self-injury)’ (National Institute for 
Health and Care Excellence, 2015). 

Use of dopamine antagonists for the pharmacological treat- 
ment of aggression specifically is controversial. Once properly 
indicated, the follow-up of the patient should take anticipate that 
adverse effects may be more likely due to the higher incidence of 
comorbid conditions (for example, neurological disorders) and 
should consider that some patients may have an elevated baseline 
cardio-metabolic risk profile. Dopamine/serotonin antagonists 
have less risk of side effects like dyskinesia, but they have been as- 
sociated with weight gain, metabolic syndrome, and increases in 
prolactin level. In practice, use in the intellectual disability popu- 
lation may be at lower doses than for other groups of psychiatric 
patients and is not necessarily associated with worse metabolic pro- 
files. However, prolactin elevation (most marked with amisulpride 
and risperidone) is common and often associated with secondary 
hypogonadism in women [28]. Hypogonadism may, in turn, lead to 


bone loss in a population already at risk for osteoporosis and frac- 
tures. Management of hypogonadism is indicated, although there is 
currently too little awareness of this risk, which should be screened 
for more regularly. In cases with treatment-resistant psychosis, clo- 
zapine could be considered, but only when blood test monitoring 
can be ensured. 


Use of drugs for depression 


Selective serotonin reuptake inhibitors (SSRIs) are the most com- 
monly used medication in patients with intellectual disability, both 
for depressive disorders and anxiety disorders. They are also fre- 
quently prescribed for the management of obsessive-compulsive 
symptomatology. During the first few weeks of treatment, some pa- 
tients report nervousness and agitation. Changes in behaviour (for 
example, increased aggression, self-injury, repetitive behaviour) 
may indicate adverse effects or a switch to a manic phase [35]. The 
prescription of these drugs should start with a low dose, and in- 
creases should be slower than in the population without intellectual 
disability. Treatment guidelines based on evidence collected in clin- 
ical populations without intellectual disability also stress the need 
for monitoring emerging suicidality and suicide attempts during 
treatment with antidepressants in children and adolescents with de- 
pression [36, 37]. Because the potential for drug interactions with 
SSRIs, the clinician should be vigilant when using these agents sim- 
ultaneously with other drugs that are metabolized by cytochrome 
P450 enzymes. 


Benzodiazepines 


These agents should only be used for a limited period of time, 
and long-term treatment with these agents should be avoided. 
Paradoxical reactions to benzodiazepines (disinhibition) can mani- 
fest in this clinical population in the form of impulsivity, aggression, 
or rage. 


Mood-stabilizing drugs 


In addition to their use in bipolar disorder and acute mania, lithium 
and anti-epileptic drugs have been shown to be effective in the man- 
agement of affective instability, which can manifest in the form of 
irritability, rage, and outburst of aggression [38]. Many patients with 
intellectual disability also have seizures that may increase the be- 
havioural problems. As in the general population, therapy must be 
guided by regular serum monitoring, as lithium and anti-epileptic 
drugs are correlated with therapeutic and toxic effects. 

There are ongoing concerns among users, carers, health care pro- 
viders, and researchers that psychotropic drugs are too often used 
inappropriately in people with intellectual disability. It is not un- 
usual to observe overuse of psychotropic drugs to treat challenging 
behaviour, excessive dosages, and polypharmacy. The off-label use 
of psychotropics is common practice in this area of medicine, as is 
the fact that once psychotropic drugs are prescribed for a particular 
challenging behaviour, they are incorporated in the long-term man- 
agement of the case, without proper consideration as to whether 
long-term treatment is indicated. Current evidence suggests that 
comprehensive and personalized functional assessment of behav- 
iours in their context with an overarching goal of optimizing in- 
tegration and normal functioning leads to better outcomes. With 
these assessment data, it is easier to develop a treatment plan that 
is amenable to psychological and environmental management of 
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mental illness and challenging behaviour, which, in turn, is likely 
to reduce or, in some cases, obviate the need for presciption of 
psychotropic drugs. 


Revised diagnostic guidelines, such as those in the forthcoming 
ICD-11, have focused on enhancing clinical utility and international 
applicability. These characteristics of the classification are expected 
to improve early and accurate diagnosis of disorders of intellec- 
tual development, thereby permitting a greater number of persons 
to be eligible for treatment. Although effective psychosocial and 
pharmacological interventions are available for the management of 
challenging behaviours and comorbid psychopathology in affected 
individuals, these often depend on specialized behavioural interdis- 
ciplinary teams, a luxury in most settings across the world. Future 
research should focus on improving universal accessibility of bio- 
psychosocial services across the lifespan through training of non- 
specialist staff, as well as carers, in evidence-based approaches such 
as positive behavioural support. 
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Autism spectrum disorder (ASD) is a neurodevelopmental disorder 
that manifests during the early developmental period of childhood. 
This disorder is behaviourally defined by impairments in social 
interaction and communication, in combination with restricted and 
stereotyped patterns of behaviour and/or interest. There is, however, 
a diverse range of symptom presentation and severity among diag- 
nosed individuals, including a range of cognitive ability that spans 
IQ scores representative of profound intellectual disability through 
to average and well above-average levels of intelligence. As a result, 
in less severe cases, symptoms may not become fully apparent until 
later stages of life when social demands surpass the individual's 
limited abilities. ASD is a new diagnostic label, chosen by the 
American Psychiatric Association for the 2013 release of the fifth 
edition of their Diagnostic and Statistic Manual (DSM-5) [1]. This 
label was designed to provide a singular diagnosis to individuals 
who would have previously been categorized by one of the perva- 
sive developmental disorders (PDDs), as defined in the fourth edi- 
tion of the DSM [2, 3] and the WHO’s counterpart to DSM-IV—the 
tenth edition of the International Classification of Diseases (ICD- 
10) [4]. Resultantly, the previously unique PDD diagnoses of aut- 
istic disorder (childhood autism in ICD-10), Asperger’s syndrome, 
childhood disintegrative disorder, and PDD-not otherwise specified 
(PDD-NOS) (PDD-unspecified in ICD-10), are no longer present as 
diagnostic options for new cases based on DSM-5 criteria but can be 
‘grandfathered’ in by those who were already holding one of these 
diagnoses prior to this categorical shift. 

The ASD section of this textbook will examine and discuss this 
disorder from a variety of perspectives. This will include chapters 
addressing current understandings of treatment planning, preven- 
tion and epidemiology, genetics, imaging, and management of ASD. 
Chapter 25, however, will provide an overview of ASD, covering 
topics of disorder history, prevalence estimates, demographic pat- 
terns, clinical features, diagnostic criteria, and differential diagnosis, 
and close with a discussion on the criticisms and concerns sur- 
rounding the DSM-5 classification of ASD and expectations for how 
this disorder will be defined in the forthcoming eleventh edition of 
the ICD (ICD-11). 


Leo Kanner, an Austrian-born child psychiatrist, working at Johns 
Hopkins University, is credited with the first clinical portrayal of 
ASD in his 1943 paper entitled Autistic disturbances of affective con- 
tact [5]. In this paper, Kanner details the behaviour of 11 children 
displaying a pattern of symptoms which included intense resistance 
to change, insistence on ‘aloneness, and impairments in both com- 
munication skills and social interaction. Meanwhile, in 1944, inde- 
pendent of the work being conducted by Kanner, Hans Asperger (a 
psychiatrist working in Austria) produced his own clinical accounts 
of four cases of what he referred to as autistic personality disorder in 
his paper Die Autistischen Psychopathe im Kindesalter [6]. Similar to 
what was being described by Kanner, Asperger notes impairments 
in social and emotional relationships, abnormalities in speech and 
non-verbal communication, and restricted, but intense, focus on 
special interests. The primary difference between their accounts is 
that Asperger additionally discusses the presence of extraordinary 
skills in particular cognitive domains (for example, mathematics) 
in the children he was observing. Unfortunately, due to the fact that 
his work was published in German, Asperger’s clinical findings did 
not become widely disseminated in English-speaking regions until 
nearly 40 years later. 

The first widely utilized label for this disorder (‘early infantile 
autism’) was provided in a 1956 paper co-authored by Kanner and 
Leon Eisenberg [7]. In this paper, the authors proposed that two core 
features define early infantile autism: ‘obsessive insistence on same- 
ness’ and ‘extreme self-isolation. Kanner had observed that parents 
of the patients visiting his clinic presented with similar social disin- 
terest and emotionally distant characteristics to those expressed by 
their children. Based on these observations, Eisenberg and Kanner 
proposed that emotional deprivation, as a result of this distant and 
‘cold’ parenting, could play a role in the development of early in- 
fantile autism. However, they also went on to express that emo- 
tional deprivation could not solely account for the development of 
early infantile autism, as it was their belief that this disorder was 
a state that was present at birth and, as such, was likely to be the 
result of a combination of biological, psychological, and social fac- 
tors. Unfortunately, it was their declaration regarding the potential 
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role of socially detached, ‘cold’ parenting that got the most atten- 
tion initially, and eventually resulted in what becomes known as 
the ‘refrigerator mother’ theory of autism. In the 1960s, this theory 
was furthered in publications by Bruno Bettelheim [8, 9], who, for 
many years, was considered as one of the leading experts in the field 
of autism, resulting in the ‘refrigerator mother’ theory becoming 
widely disseminated and accepted by many in the lay population, as 
well as in medical fields. 

Beginning in the late 1960s, and more significantly in the 1970s, 
research in the field of autism moved away from the clinical descrip- 
tions and/or theory presentations that previously dominated the 
landscape of autism literature, and began applying more scientif- 
ically stringent methodologies to the study of this disorder. Works 
produced during this period presented serious challenges to the psy- 
chogenic theories of autism (for example, the ‘refrigerator mother 
theory), by providing empirically supported evidence for a biological 
and neurological basis of the disorder [10, 11]. In addition, evi- 
dence was produced against the then-held position that autism was 
a form of childhood schizophrenia, with comparative studies clearly 
distinguishing the two disorders by their associated symptoms, phe- 
nomenology, age of onset, and family history [12, 13]. Following 
these scientific advances, ‘infantile autism’ was introduced as a 
formal diagnostic category by the American Psychiatric Association 
in the 1980 release of DSM-III [14]. Infantile autism was classified 
as a PDD, a term developed specifically for DSM-III, and defined as 
being characterized by pervasive and severe impairment in multiple 
basic functions, including socialization and communication. 

In 1981 [15], Lorna Wing introduces the findings of Hans 
Asperger to the English-speaking world and describes ‘Asperger’s 
syndrome, a milder manifestation of the clinical symptoms of in- 
fantile autism, often paired with strong cognitive abilities in fields 
such as mathematics, and excluding the significant language devel- 
opment delays associated with autism. In DSM-IV [2] and DSM- 
IV-TR [3], the association between this Asperger’s syndrome and 
autism (then labelled as ‘autistic disorder’) was officially recognized 
by the American Psychiatric Association, with the decision to ex- 
pand the diagnostic category of PDDs to include five disorders that 
were thought to be related to, or share, many of the same features 
of one another: autistic disorder, Asperger’s syndrome, Rett’s syn- 
drome, childhood disintegrative disorder, and PDD-NOS. Following 
this expansion, this group of disorders (with the exception of Rett’s 
syndrome) began to be discussed under the umbrella term of ‘autism 
spectrum disorders’ in much of the autism literature. 

In 2013, the fifth, and most recent, edition of the DSM (DSM-5) 
[1] was released and was met with a great deal of controversy re- 
garding its alterations to the diagnostic categorizations of autism and 
its related disorders. In DSM-5, the previously distinct categories of 
autistic disorder—childhood disintegrative disorder, Asperger's syn- 
drome, and PDD-NOS—have been reconceptualized as a singular 
‘autism spectrum disorder’ and are defined by more rigid diagnostic 
criteria. This change was made in an effort to provide a more unified 
and rigorous definition of what autism is; however, it was met with 
a number of concerns and criticisms from medical professionals, 
as well as autism advocacy groups (see Concerns and criticisms of 
DSM-5 classification of autism spectrum disorder, p. 242, which 
contains further discussion on this topic), including negative im- 
pacts on the sense of community and identity that had been gained 
through specific diagnostic labels and potential issues with impaired 


children not meeting the more stringent diagnostic criteria and 
losing access to critical services as a result. To address the former 
of these concerns, while Asperger’s syndrome and PDD-NOS are 
no longer diagnostic options for clinicians within the framework of 
DSM-5, it was decided that a ‘grandfather’ clause would be intro- 
duced, allowing these labels to remain in individuals who received 
their diagnosis prior to this change. Meanwhile, to address the con- 
cerns over diagnostic sensitivity, DSM-5 introduced a new condi- 
tion called social communication disorder, intended to provide a 
label for children without the repetitive and stereotyped behaviour 
components of ASD but who do express significant social/commu- 
nicative impairments. 


Prevalence estimates 


The first study to present data regarding an estimated prevalence 
rate for autism was performed by Victor Lotter in 1966 [16]. To pro- 
vide this estimate, Lotter screened the entire population of 8- to 10- 
year-old children from a county in the south east of England (78,000 
children in total) for behavioural markers of early infantile autism, 
as then defined by Eisenberg and Kanner [6]. The findings of this 
screening resulted in an estimate that autism was present in 4.5 of 
every 10,000 children. From this point until as late as the 1990s, 
autism was considered to be a rare condition, with subsequent pub- 
lished prevalence reports suggesting rates of 2-5 per 10,000 children 
[17, 18]. Following the inclusion of ‘Infantile autism’ in DSM-III in 
1980 [14], and the subsequent expansion of diagnostic criteria in 
the 1987 release of the revised DSM-III-R [19], reported epidemio- 
logical studies began documenting dramatic increases in the esti- 
mated rates of prevalence for this disorder [20-24, 29]. 

Beginning in the early 2000s, the Center for Disease Control and 
Prevention (CDC) began collecting large data samples of 8-year old 
children from across the United States and producing highly de- 
tailed prevalence reports for autism based on their findings every 
2-4 years. The most recent of these reports was released in 2014 and 
documented findings from a sample of 363,749 children (roughly 
9% of the total population of 8-year olds in the United States) that 
was collected across 11 different sites in 2010 [20]. Based on the find- 
ings of this report, the overall prevalence of ASD (based on DSM 
IV-TR diagnostic criteria for autistic disorder, Asperger's syndrome, 
or PDD-NOS) for children aged 8 years in the United States is 147 
per 10,000 (or 1 in 68). That figure was 29% higher than their pre- 
vious estimate based on data from 2008 [22], 64% higher than their 
estimate based on data from 2006 [23], and a staggering 123% higher 
than their estimate based on data from 2002 [24]. In combination 
with earlier prevalence studies, the findings produced by the CDC 
suggest a continuous and exponential growth pattern in ASD preva- 
lence over the past 4-5 decades (Fig. 25.1). 

This trend of dramatic increases in the reported prevalence of 
ASD has led some to the conclusion that these findings are evidence 
of an ‘autism epidemic’ and, as a result, has encouraged increases 
in research focused on identifying potential environmental fac- 
tors to help explain this phenomenon. However, a majority of prac- 
titioners and researchers within the ASD field have issued caution 
against interpreting these findings in this manner and instead sug- 
gested that the majority of these increases can be explained by a com- 
bination of alternative factors, including: broadening of diagnostic 
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Fig. 25.1 Time trend of estimated autism prevalence rates (number of children per 10,000). Prevalence estimates for the reported years are from the 
following studies: 1966 [16], 1979 [18], 1982-1989 [21], 1999 [29], 2007 [24], 2009 [23], 2012 [22], 2014 [20]. 


criteria, increased availability of diagnostic services, increased dis- 
order awareness, and diagnostic transference/substitution of children 
who may have previously received an alternative diagnosis (for ex- 
ample, intellectual disability). Though the combination of these fac- 
tors may be able to account for a large portion of these increases over 
the years, there may yet be true increases in ASD incidence as well. As 
such, continued work in this area, and in the identification of poten- 
tial environmental factors associated with an increased ASD liability, 
is warranted. 

Of note, there is frequently a discrepancy between the current high 
prevalence rates of ASD among samples of children and the generally 
lower observed prevalence rates in adults. For some, this discrepancy 
has been viewed as support to the idea that there has been, in fact, an 
increase in ASD incidence in recent decades. In contrast to this pos- 
ition, however, it is becoming more apparent that these discrepancies 
are likely the result of less accurate diagnostic and identification prac- 
tices in the past [25, 26]. Providing support to these beliefs, and per- 
haps one of the more compelling findings in opposition to the ‘autism 
epidemic’ theory, a recent large community survey estimated that 
prevalence rates of ASD for adults in the UK should be roughly 1.1% 
(essentially the same as those found in samples of children), but many 
of these adults are currently misdiagnosed or completely unidentified 
[27]. Regardless of whether there has been a true increase in incidence 
of ASD or not, recent reports suggest that, of all mental disorders, ASD 
is now the leading cause of disability in children below 5 years of age, 
affecting somewhere between 1% and 2%, or 52 million children [28], 
and a roughly equivalent proportion of adults [27] across the globe. 


Demographics 


One thing which has remained consistent since the earliest epi- 
demiological studies of ASD is a disproportionate gender ratio. 
From Kanner’s first cohort in 1943 [4] to the first reported prevalence 
study in 1966 [17], and all the way through to the most recent CDC 
report released in 2014 [20], autism has been notably more prevalent 
among males than females, with estimated ratios of 2.5-5:1 [20, 30]. 


A number of different hypotheses have been proposed to account for 
the presence of this gender difference. This includes ascertainment 
bias or altered phenotypical expression in females, resulting in issues 
with underdiagnosis [31, 32], increased susceptibility of males due 
to elevated prenatal and early postnatal testosterone levels [33], or 
the existence of a female protective effect requiring a greater aetio- 
logic load in girls before ASD behavioural symptoms will manifest 
[34]. At the moment, however, no definitive explanation for why 
ASD is more prevalent in males than females has emerged. Based on 
current estimates [20], the male:female gender ratio for ASD is 4.5:1, 
with a slight increase in this discrepancy when specifically exam- 
ining individuals without co-occurring intellectual disability (4.9:1). 
Unlike gender ratios, one aspect of the demographics of individ- 
uals with ASD that has changed over the years is intellectual ability. 
In particular, over the last decade or so, there have been notable in- 
creases in individuals identified with ASD with average or above- 
average intellectual ability. Likely a result of improved awareness and 
detection of the often subtler behavioural manifestations of ASD in 
this more intellectually able cohort, recent findings suggested that 
46% of children with an ASD diagnosis have IQ scores in the average 
to above-average range (>85), 23% in the borderline range (IQ = 71- 
85), and 31% considered to be in the range of intellectual disability 
(IQ <70). Finally, estimated prevalence rates of ASD tend to be higher 
among white/Caucasian children, compared to other ethnic/racial 
groups [20]. However, much of this variation can be accounted for by 
the significantly higher rates among white children of ASD diagnoses 
without co-occurring intellectual disability (a diagnosis that can be 
more difficult to recognize) and thus is likely more suggestive of the 
influence of socio-economic variables on the quality and quantity of 
diagnostic and treatment services available to families than ofa true 
discrepancy of ASD prevalence between ethnic/racial groups [35]. 


Clinical features and diagnostic criteria 





Although there is a general consensus that there is a strong gen- 
etic and biological basis to ASD, currently no definitive biological, 
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neurological, or genetic markers for this disorder have been iden- 
tified. As a result, clinical definitions of ASD remain fundamen- 
tally behavioural and historical, reliant on observations (by the 
clinician or caregiver) of characteristic problems of social inter- 
action, communication, and unusual behaviours which are of early 
onset (hence the importance of history). Complicating the task 
of diagnosis (and probably consistent with emerging genetic re- 
search [36]), it is clear that a broader spectrum of condition(s) are 
involved [37]. 

For the past two decades, the approach of ICD-10 [4] and DSM- 
IV [2, 3] have been generally concurrent, and partly as a result of 
this international diagnostic consistency, there has been an explo- 
sion of knowledge and research. This has recently changed with the 
rather different approach adopted in DSM-5 [1] which focuses on a 
rather narrower view of autism, in the more classic sense of Kanner’s 
autism, resulting in many more cognitively able or less ‘prototypical’ 
cases being excluded from an ASD diagnosis [38]. Further compli- 
cating the matter are the ‘grandfather’ clauses (adopted, in part, as a 
result of criticism of the increased diagnostic stringency) that allow 
older cases with ‘well-established’ DSM-IV diagnoses to retain their 
diagnosis. 

Current diagnostic criteria for ASD, as defined by DSM-5, ne- 
cessitate the presentation of impairments in the following two 
areas [1]: 


1. Persistent deficits in social interaction and social communica- 
tion (including impaired social-emotional reciprocity, deficits 
in non-verbal social communicative behaviours, and impaired 
ability to develop, maintain, and understand age-appropriate 
social relationships). 

2. Restricted and repetitive patterns of activities, behaviours, or 
interests (including repetitive or stereotyped motor movements, 
speech, or use of objects; inflexibility with regard to routine ad- 
herence, insistence on consistency, or ritualized patterns of be- 
haviour; highly restricted interests that are atypical in focus or 
intensity; and hypo- or hyper-reactivity to sensory input or atyp- 
ical interest in sensory features of their environment). 


For a diagnosis to be received, the presence of these symptoms 
must be present during a child’s ‘early developmental period’ and re- 
sult in clinically significant impairment in some critical area of func- 
tioning (for example, occupational, social, academic). The first of 
these requirements, however, includes the caveat that explicit symp- 
toms of impairment may not become fully apparent until in later 
stages of life when social demands surpass an individual's limited 
abilities [1]. 

The DSM-5 approach has been extensively critiqued [38], and 
a recent meta-analysis has confirmed the consensus that it repre- 
sents a move to a more stringent diagnostic concept [39], some- 
what undercutting the significance of the name change from the 
previous category of ‘Pervasive developmental disorder’ to ‘Autism 
spectrum disorder. Although perhaps intended to provide a diag- 
nostic option to those individuals not captured by this more strin- 
gent diagnostic approach, the inclusion of a new communication 
disorder—social communication disorder—was introduced in 
DSM-5. However, the introduction of social communication dis- 
order presents with its own issues, given the very small amount of 
work supporting such a concept and the complexities involved in 
its application [40, 41]. 





Differential diagnosis 


ASD must be differentiated from other developmental disorders— 
most frequently from intellectual disability (which co-occurs in 
roughly 30% of cases of ASD [20]) and language/communication 
disorders. In intellectual disability without ASD, developmental 
skills (including social ones) are generally relatively evenly devel- 
oped (or impaired), as compared to the more typical pattern of 
marked areas of strength (non-verbal abilities) and weakness (verbal 
and social abilities) seen in ASD. The tasks of differential diagnosis 
can be most complicated in individuals with severe or profound in- 
tellectual disabilities, as differences in relative areas of strength and 
weakness can be less discernible at these levels of cognitive impair- 
ment. Conversely, for more cognitively able individuals, such as 
those with a DSM-IV diagnosis of Aspergers syndrome, receiving 
a new diagnosis of ASD may become an unnecessarily complicated 
task due to the grandfather clause component of DSM-5 [1]. In 
communication disorders, social vulnerabilities are usually not par- 
ticularly prominent and the desire to communicate often is clearly 
present. The new diagnosis of social communication disorder allows 
for a diagnosis when social and communication problems (but not 
restrictive behaviours) are present, but not as severe as in the more 
classic ASD [41]. 

Some individuals who go on, in later adolescence or young adult- 
hood, to develop schizophrenia may exhibit social oddities earlier 
in life, similar to, but not typically as severe as, those seen in ASD. 
Occasionally, severe social anxiety problems may initially be con- 
fused with ASD; however, these cases will generally not present with 
the restricted and repetitive behaviours and interests found in ASD. 
Conversely, the unusual (often due to intensity or repetitive nature) 
behaviours or interests of individuals with obsessive-compulsive 
disorder may be taken to suggest ASD; however, in these cases, 
the social and language communication skills will be preserved. 
Sometimes, in cases of severe neglect, a range of symptoms sug- 
gestive of ASD may be present; however, in these cases, a history 
of neglect will often be observed and the social vulnerabilities will 
begin to abate once appropriate care is provided [42]. 


Concerns and criticisms of DSM-5 classification 





of autism spectrum disorders 


Both Kanner’s original description of early infantile autism [5, 7] 
and Asperger’s description of autistic personality disorder [6] have 
been somewhat modified over time. For Kanner’s autism, Rutter’s 
1978 synthesis of subsequent work [43] focused on both social and 
language/communication deficits of a specific kind (not just ex- 
plained by overall developmental delay) associated with problems in 
dealing with change and ‘insistence on sameness. Similarly, Wing's 
1981 paper [15] introduced, in large part, Asperger's clinical re- 
ports to the English-speaking world, but also modified and broad- 
ened it in important ways in her definition of Asperger’s syndrome 
[44]. Kanner’s autism was officially recognized in DSM-III [14] 
and, up until the release of DSM-5 [1], maintained this traditional 
grouping of diagnostic features/criteria. Of note was the inclusion of 
other disorders with autism in the PDD category in DSM-IV [2, 3], 
including Asperger’s syndrome, and the convergence of ICD-10 [4] 


and DSM-IV definitions of these categories. For autism (or autistic 
disorder/childhood autism, as were then the labels), the DSM-IV/ 
ICD-10 definition proved quite flexible, with many possible com- 
binations of criteria resulting in a diagnosis. 

A number of overall changes impacted the DSM-5 review process. 
Several decisions were made at a programmatic level (for example, 
to avoid having an entire child section, to change approaches to 
multi-axial classifications and ‘not otherwise specified’ conditions) 
that impacted the process. A further consideration was the decision 
to primarily rely on data from commonly utilized ASD diagnostic 
instruments, rather than on field trial data, to provide the new defin- 
itions for disorder criteria. Early on, the decision was made to move 
to a new diagnostic label—autism spectrum disorder (although 
note the singular term)—and avoid finer-grained classifications like 
Asperger's syndrome, etc. The use of modified codes and measures 
of severity were incorporated and were meant to provide the im- 
portant detailed information otherwise lost with the unification of 
the previously distinct disorders under a singular label. The decision 
to eliminate Asperger’s syndrome was most controversial, given that 
despite continued disagreements about the best diagnostic approach 
to use, research on that condition had increased significantly and 
meta-analyses were just beginning to appear that showed important 
differences from more classic Kanner’s autism, which can have crit- 
ical impacts on effective treatment options and strategies [45, 46]. 

Factor analysis of an extensive data set using measures of both cur- 
rent functioning and past history was utilized to come to the decision 
to adopt a ‘two-factor’ approach for the ASD diagnostic criteria in 
DSM-5. The two factors, determined by this analysis, included one 
set of monothetic criteria (that is, all four features must be present) 
focused on social communication problems, and a second set of poly- 
thetic criteria (that is, two of four features must be present) focused 
on restricted, repetitive, or atypical interests and behaviours. Post hoc 
analyses of the data suggested this to be a well-functioning criteria 
set. However, some problems have quickly become evident. In the 
first place, the combinations of criteria that resulted in a diagnosis of 
ASD were drastically reduced. Secondly, the inclusion of a new diag- 
nostic feature associated with hypo- and hypersensitivity to sensory 
input did not appear well justified, as a similar feature had produced 
little predictive value in DSM-IV field trials [47]. In the latter, a factor 
analysis had shown that two-, three-, or five-factor solutions worked 
reasonably well (the two-factor solution paralleling the DSM-5 ap- 
proach, and the five-factor solution providing additional sub-factors 
based on the repetitive behaviour category) [47]. It should be noted, 
however, that interpretations of factor analyses can be particularly 
susceptible to some significant issues, particularly if important data 
are not included and thus are unable to influence the factors being 
produced (see [48] for a discussion of the uses and misuses of factor 
analysis). Some recent studies have now questioned aspects of the 
factor solution proposed for DSM-5 [49, 50]. 

As a practical matter, it was quickly apparent that the approach 
adopted would disqualify many individuals from a diagnosis— 
particularly those who were younger or more cognitively able (see 
[51] for a recent meta-analysis). As a result, a decision was made 
to ‘grandfather’ in those cases with well-established previous diag- 
noses, thus resulting in both DSM-IV and DSM-5 diagnostic cat- 
egories being in effect simultaneously. Some aspects of the approach 
adopted by DSM-5 are welcome. In fact, the current diagnostic 
definition is, in many ways, more consistent with Kanner’s classic 
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definition of autism than the broader autism spectrum adopted in 
DSM-IV, which has a considerable body of support from both gen- 
etic and psychological research [38]. As noted previously, the in- 
clusion of a new communication disorder—‘social communication 
disorder’—was not well justified and, as a practical matter, may not 
succeed in its purpose of capturing those individuals who will fall 
outside the more stringent DSM-5 diagnostic criteria for ASD, as 
it was not developed to be convergent with previous diagnoses of 
Asperger's syndrome or PDD-NOS. 


Expectations for ICD-11 


The eleventh edition of the WHO’s International Classification of 
Diseases (ICD-11) is currently being developed and likely will be 
approved in 2018. A major focus of this effort has been the desire to 
make the manual (or at least the clinical version) more user-friendly. 
There has been some attempt to encourage the use of a standard 
template for clinical decisions and diagnostic guidelines [52]. On 
balance, it likely will have areas of convergence with DSM-5, with 
some potential differences as well. A draft version is available, and 
efforts to evaluate it are now under way [53]. It seems likely that 
the ICD-11 diagnosis will certainly include individuals with ASD in 
DSM-5; at this point, the questions centre more around the approach 
to grouping certain conditions, for example, cases with regression 
or the more prototypical Asperger’s phenotype, and whether the 
‘grandfathering’ rule of DSM-5 will also be adopted. 

It should be noted that, in addition to official systems like ICD and 
DSM, adoption of the Research Domain Criteria (RDoC) approach, 
proposed by the National Institute of Mental Health, attempts to cut 
across traditional definitions and focuses on domains of neurobiology 
and behaviour [54]. The RDoC approaches are organized around sev- 
eral domains of functioning and various approaches of units of study 
[55]. The latter can range from genes to behaviour to self-report. This 
approach has been extensively critiqued [56, 57] but has stimulated 
new lines of enquiry in fields such as co-occurring conditions in trad- 
itional approaches. As such, it may have some utility for clinical and 
treatment studies in which the focus or target behaviour(s) cut across 
transitional diagnostic categories [54, 58, 59]. 
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Autism spectrum disorder (ASD) is a neurodevelopmental disorder 
characterized by difficulties with social communication, and the 
presence of restrictive interests, repetitive behaviours, and sensory 
symptoms [1]. Symptoms must be apparent from early childhood 
for a diagnosis to be made. ASD is 3-4 times more common in boys 
than girls, which likely represents a combination of biological differ- 
ences and under-identification of girls in community settings. The 
name ‘autism spectrum disorder’ reflects the huge variability in the 
degree and nature of the symptoms that people with ASD experience 
and the likely heterogeneity in their underlying causal mechanisms. 
While some individuals with ASD have highly successful profes- 
sional and personal lives, others may never live independently, have 
a job, or form strong peer relationships. Developing new treatment 
options for individuals with ASD requires determining the mechan- 
isms that underlie the emergence, consolidation, and maintenance 
of core symptoms throughout the lifespan. This chapter will describe 
the approaches used to study the basic mechanisms underlying ASD, 
review our current understanding of the processes that contribute to 
symptoms, and indicate the value of this work for identifying new 
treatment targets. 



































When searching for treatment targets, we need to identify the bio- 
logical or neurodevelopmental pathways that cause the emergence 
or maintenance of clinical symptoms of ASD. Such causal mech- 
anisms need to be distinguished from secondary or compensatory 
processes and be separated from differences related to co-occurring 
symptoms (like anxiety or ADHD), in order to target core domains 
of ASD symptomatology. We must understand how mechanisms 
relate to both strengths and weaknesses in ASD, in order to avoid 
unwanted effects of targeted treatments. To address these pressing 
goals, researchers have taken a range of approaches that I shall 
summarize now. 





Case-control studies 


The vast majority of efforts to delineate the mechanisms under- 
lying ASD symptoms rely on comparing neurocognitive function in 
people with and without ASD. Any difference in the ASD group is 
usually interpreted as atypicality and considered to be a candidate 
causal mechanism. Such ‘case-control’ designs can provide a rapid 
assessment of whether a particular measure is likely to have value for 
further investigation. However, they have many limitations. Firstly, 
the interpretation of group differences always relies on the selection 
of an appropriate control group, but finding the ‘perfect’ control 
group is very difficult (and perhaps impossible). Groups may differ 
on both core and associated symptoms (for example, anxiety and 
depression); examining whether variables associated with case/con- 
trol differences are dimensionally related to autism symptomatology 
can mitigate this concern but require large group sizes. Dealing with 
IQ is similarly problematic; uneven cognitive profiles in individuals 
with ASD can make matching difficult, and the considerable neuro- 
biological heterogeneity in children with general developmental de- 
lays can limit our understanding of the differences we see in the ASD 
group. There is also a degree of genetic overlap between develop- 
mental disability and ASD [2], so we could be ‘controlling’ for vari- 
ation in which we are interested. An alternative option is to contrast 
how performance on a neurocognitive task relates to variables, such 
as chronological age, mental age, or social communication skill, in 
both the ASD and comparison groups. This ‘developmental trajec- 
tories’ approach has been used to show that face-processing skills 
co-vary with broader social communication skills in the same way 
in children with and without ASD, such that children with ASD were 
showing patterns resembling those in younger typically developing 
children [3]. This approach is very promising but is yet to be widely 
adopted. In light of such limitations, research has moved away from 
traditional case-control designs and towards other approaches. 


Trait-based approaches 


Trait-based approaches, like the Research Domain Criteria (RDoC) 
framework, address some of the limitations of case-control de- 
signs [4]. Moving beyond the clinical diagnosis, the RDoc speci- 
fies a number of domains that can be described at multiple levels of 
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analysis (from genetics to behaviour). Focusing on particular key like ASD. This can be challenging, because neural functions may be 
domains within heterogenous participant groups may be fruitful less specialized or differentiated in early infancy and the mapping 
in identifying the neural mechanisms that are associated with vari- between brain function and cognitive process may be different [9]. 
ation in a particular symptom cluster. Since most genetic risk for A further challenge to fully implementing trait-based approaches is 
ASD lies with common variation [5] and genes carrying risk for the relative fragility of our current neurocognitive batteries. Many 
ASD co-vary with typical variation in social communication skillsin lack acceptable basic psychometric properties like test-retest reli- 
childhood [6], studying the neurocognitive underpinnings of ASD- ability, or context insensitivity, which may be a critical limitation 
related traits in the general population could be very fruitful. Given to their use in indexing individual differences in a clinical context. 
the developmental nature of ASD, studies in infancy are particu- | One way to mitigate these concerns is to select tasks for investigation 
larly important. Indeed, we recently showed that infant measures that produce robust replicable responses in individuals. However, 
of social attention that predict later ASD also co-vary with parental will these be sensitive to ASD-related differences? A number of 
ASD-related traits in a large group of typically developing infants [7] recent studies have linked ASD to increased intra-individual vari- 
(Fig. 26.1). Such studies provide great potential for understanding ability in responses [10, 11] or greater idiosyncrasy in functional 
the neurodevelopmental processes that mediate between genetic activation patterns [12, 13]. If ASD is characterized not by stably 
risk and phenotypic expression of ASD-related traits. different (trait-like) neurocognitive function, but by greater ‘state’ 

The RDoc framework is not without limitations. The develop- variation in activation patterns, simple tasks that achieve high test- 
mental aspects of the framework need elaborating [8], which is crit- retest reliability in individuals with ASD may actually be those that 
ical to increasing its applicability to neurodevelopmental disorders are least likely to identify deficits. In general, the field may need to 
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Fig. 26.1 Higher parent social anxiety (SADS, social anxiety, and distress) and lower social comfort (COMQ, Social Competence Questionnaire) 
relate to shorter peak look to faces (a, c), but not objects (b, d) in 6- and 12-month-old infants. Shorter peak looks to faces have been previously 
associated with later ASD. 


Reproduced from J. Child Psychol. Psychiatry, 58(3), Jones EJH, Venema K, Earl RK., et al., Infant social attention: an endophenotype of ASD-related traits? pp. 270-281, 
Copyright (2017), with permission from John Wiley and Sons. 
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develop more sophisticated neurocognitive methodologies rooted 
in neurobiology to find metrics that circumvent these issues. This 
may include a shift of approach from static cognitive traits towards 
identifying markers of state-related changes in brain responses. 


Longitudinal prospective studies 


Longitudinal studies of children with ASD allow the identification 
of neural mechanisms that predict later symptom variation. In a re- 
cent example, Neuhaus and colleagues showed that more norma- 
tive neural responses to faces at the age of 3 years predicted greater 
improvement in core ASD symptoms between the age of 3 and 
adolescence [14] (Fig. 26.2). Although these associations do not 
prove causality, they provide a more powerful way of identifying 
good candidate mechanisms to subject to more rigorous testing. 
Prospective studies of infants at familial risk for ASD go one step 
further by allowing investigators to identify mechanisms that may 
underpin symptom emergence. Such studies follow groups of infants 
with older siblings with ASD, who have a 20% chance of developing 
ASD themselves [15]. Researchers use a battery of measures to as- 
sess early neural, cognitive, and behavioural development in these 
infants, which can then be examined in light of their ASD outcome 
status from toddlerhood (Fig. 26.3). Recent reviews of this literature 
highlight the accumulation of behavioural autism symptoms in the 
second year of life, preceded by more subtle, but apparently more 
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domain-general, atypicalities [16, 17]; early reports from parent- 
mediated interventions for high-risk infants suggest that some of 
these domains can be successfully targeted [18-20] (Fig. 26.4). 


Genetics 


Given the high heritability of ASD, genetic information is critical to 
building causal models of symptom emergence. Advances in tech- 
nology have rapidly expanded our ability to extract and interpret infor- 
mation about genetic variation associated with ASD, although there is 
still much to discover [21]. However, turning genetic information into 
meaningful hypotheses about causal mechanisms remains challenging. 
Many genetic studies still use coarse phenotypes like the presence or 
absence of a clinically determined ASD diagnosis, which shares many 
of the limitations of case-control studies of neurocognitive measures 
described (including the common failure to consider co-occurring 
syndromes like depression, ADHD, or anxiety). Further, the path be- 
tween genotype and phenotype is not static but emerges through a 
developmental interaction between the gene and the environment on 
the substrate of the developing brain (Fig. 26.5). Substantial compen- 
satory and adaptive mechanisms exist to buffer early disruptions, and 
these likely include epigenetic modifications that produce significant 
changes in gene expression over development [22]. In order to under- 
stand how genetic variation underpins ASD symptom emergence, pro- 
spective studies are required that measure genetic variation, epigenetic 
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Fig. 26.2 ADOS social affect symptom trajectories over time by N290 response (median split) to neutral faces. Time point 1 = 3 years, 2 = 6 years, 


3 =9 years, 4 = 14 years. 
Reproduced from J. Autism Dev. Disord., 46(7), Neuhaus E, Jones EJ, Barnes K, et al., The Relationship Between Early Neural Responses to Emotional Faces at Age 3 and Later 
Autism and Anxiety Symptoms in Adolescents with Autism, pp. 2450-2463, Copyright (2016), with permission from Springer Science Business Media New York. 


regulation, the developing brain, and emerging behaviouralsymptoms X, tuberous sclerosis, or Cohen’s syndrome [23]. Animal models 
over time. of some of these conditions are available, allowing clearer links to 

A complementary approach to incorporating information about be drawn between human research and basic neurobiology and al- 
genetics into models of ASD emergence is to study children with lowing identification of putative treatment targets. However, medi- 
genetic syndromes associated with high rates of ASD like fragile cations targeted at mechanisms in conditions like fragile X produce 
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Fig. 26.3 Prospective infant sibling studies using broad batteries of measures to assess the emergence of ASD over developmental time. Note: N290 
responses were modelled as continuous predictors. For ease of visualization, lines shown represent symptom trajectories for participants divided on 
median value of N290 responses. 

Image courtesy of Emily Gold. 
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Fig. 26.4 Effects of parent-mediated intervention on time to habituate to faces (a) in infants at high familial risk for ASD. Infants who received the 
Promoting First Relationships (PFR) intervention showed greater developmental improvements in habituation time than infants who only received 
assessment and monitoring. Data from low-risk control infants are shown as solid (longitudinal) and dotted (cross-sectional) lines. 


Adapted from Autism Res., 10(5), Jones EJH, Dawson G, Estes A, et al., Parent-delivered early intervention in infants at risk for ASD: Effects on electrophysiological and 
habituation measures of social attention, pp.961-972, Copyright (2017), with permission from John Wiley and Sons. 
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Fig. 26.5 Symptoms of ASD emerge through a complex interaction between genes and environment, which interact in the context of the child's 


developing body and brain. 
Image courtesy of Emily Gold. 


impressive effects in rodents but have yet to show promise in human 
trials [24]. The significant limitations of mouse and rat models in 
modelling social and communication domains may play a role. 
Further, little is known about which of the neurodevelopmental 
paths underlying symptoms of ASD in genetic disorders are shared 
with children with idiopathic ASD. Fortunately, a number of inves- 
tigators have recently begun longitudinal prospective studies of in- 
fants with conditions like tuberous sclerosis or neurofibromatosis 
type 1, which will go some way to addressing these questions. 


Current understanding of the mechanisms 





underlying ASD symptoms 


The rapidly expanding literature on ASD constantly produces new 
hypotheses about its underpinning mechanisms. I have selected 
three explanations in the following sections that illustrate the neces- 
sity of both theories derived from ‘top-down clinical studies of indi- 
viduals with ASD and those with a strong emphasis on genetics and 
neurobiology. I will point to areas of interface between these levels of 
explanation, which hold promise for the eventual reconciliation of 
these frameworks. However, it is important to note that there may or 
may not be one ‘final causal path’ to autism symptoms—it is probable 
that autism represents the combination of several risk pathways that 
happen to co-occur in some individuals, and the number of possible 
combinations may be many and varied (the ‘autisms’ [25]). There 
is very unlikely to be one single explanation for autism, and indeed 
the following accounts contain many different specific routes to dif- 
ferences in a very broadly defined area of neurocognitive function. 


Social attention, motivation, and affiliation 


Social communication impairment is one of the most distinctive fea- 
tures of ASD. ‘Social first’ theories propose that children with ASD 
find social stimuli less engaging than they should (for a variety of 


possible reasons), which may compromise learning about the social 
world and the ongoing development of the social brain [26-28]. 
Routes to diminished engagement may include a failure to assign 
reward value to social stimuli, altered responsivity to complex and 
unpredictable stimuli, or difficulties with specific aspects of social 
cognition. Children may experience one or more of these specific dif- 
ficulties, but the common end result posited is a gradual withdrawal 
from the social world. Given that self-directed experience is likely 
very important in social communication development, children’s 
initial difficulties are compounded by this social withdrawal. 


Evidence for the role of social attention 


A range of evidence exists for the early emergence of differences 
in social attention. While ‘innate’ social subcortical orienting may 
be intact [29], putatively ‘cortical’ aspects of social attention are 
altered early in infants with later ASD. By 6 months, infants with 
later ASD show altered temporal profiles of attention engagement 
with faces [30] (Fig. 26.6), reduced interest in speaking faces [31], 
reduced interest in social scenes [32], and declining interest in the 
eyes [33]. Differences in social behaviours like joint attention accu- 
mulate through the second year, resulting in a profile of a gradually 
decreasing interest in other people that matches that hypothesized in 
developmental accounts [34]. Taken together, data from prospective 
studies indicate that differences in social attention are observed 
from early development of infants with later ASD. One critical next 
step is to establish whether such differences are simply a signature of 
ASD unfolding [35] or whether they additionally contribute to later 
emerging social communication problems. 

Studies of newly diagnosed toddlers confirm differences in social 
attention. For example, when viewing naturalistic social scenes, tod- 
dlers with ASD pay less attention to other peoples activities [36] 
and show reduced interest in faces and people when speech and eye 
contact are introduced [37]. Further, toddlers with ASD show re- 
duced attention to stimuli depicting biological motion [38, 39], a 
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Fig. 26.6 Shorter (b) and delayed (e) peak looks to faces relate to later autism at 6 months in high-risk infants. Data taken from a habituation paradigm 


illustrated in (a) and (d). 


Adapted from J. Neurodev. Disord., 8(7), Jones EJH, et al., Reduced engagement with social stimuli in 6- month-old infants with later autism spectrum 
disorder: a longitudinal prospective study of infants at high familial risk, Copyright (2016), Jones et al. Reproduced under the Creative Commons 


Attribution License CC BY 4.0. 


Image courtesy of Emily Gold. 


highly conserved ability typically present from birth [40]. By the 
time of diagnosis, differences in social attention are accompanied 
by alterations in face processing. Expert face processing relies on ex- 
perience with faces in early development [41], and thus, early alter- 
ations in face processing could represent downstream consequences 
of diminished social attention. For example, pre-schoolers with 
ASD show altered neural responses to familiar faces [42] and faces 
displaying emotional expressions [43] that predicted symptom tra- 
jectories into adolescence [14], indicating broad disruption to social 
brain systems. Toddlers with ASD also take longer to learn about 
faces [44] and show delays in neural responses to face familiarity 
that are in line with their broader social delays [3] (Fig. 26.7). Of 
note, basic face-processing components like the N290 appear typical 
in earlier development [30, 45]. Taken together, disruptions in social 
attention may be associated with downstream effects on aspects of 
social cognition by the time of diagnosis. 

Moving forward, the potential role of social attention in the 
emergence of ASD makes it a suitable target for developing proxy 
markers of treatment outcome. This may be particularly important 
in developmental populations in which symptoms may not have 
fully emerged. However, the likelihood that there is heterogeneity in 
the processes that underlie alterations in social attention means that 
we will likely need a range of ‘biomarkers’ if we wish to target the 
underlying neural systems. Measures like eye-tracking responses to 
complex social scenes may be closer to the symptoms used to diag- 
nose ASD and may thus show shared alterations across a broader 
range of individuals. In contrast, EEG or fMRI studies may target 


systems that are only altered in a subset of individuals and may thus 
be more suited to identifying meaningful subgroups of participants. 
Whether or not these subgroups require different treatment ap- 
proaches remains an open question and relates to whether or not 
alterations in particular neurocognitive systems can be linked to 
particular aspects of molecular pathophysiology. It may be EEG- or 
f{MRI-derived subgroups can be used to define measures that are 
used to test the effects of treatments on the systems that appear al- 
tered in that individual, but the underpinning pathophysiological 
alterations may not fall into the same categories. For example, one 
might determine that social reward networks are underpinning 
social difficulties in one subgroup, while differences in social cogni- 
tion are most problematic for another. The same treatment might be 
tested, but its effects on social reward used as the primary outcome 
in one group and its effects on social cognition as the primary out- 
come in another. Alternatively, a treatment may specifically target 
social cognition, in which case participants may be screened on the 
basis of neurocognitive measures for trial entry. New efforts to use 
EEG to identify participants for clinical trials are an important step 
in testing such ideas. 


Treatment targets 


Although neurodevelopmental studies with humans do not examine 
the primary distal causes of autism (genetic and environmental), 
they can identify developmental processes that serve to canalize 
or consolidate development down an atypical path. Targeting such 
processes may provide valuable ways of ameliorating later symptom 
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Fig. 26.7 Relationship between the Nc event-related potential familiarity effect (response to familiar minus unfamiliar face) and (a) chronological 
age and (b) socialization age, taken from the Vineland Adaptive Behavior Scales. Relationship between the N290 event-related potential peak latency 
and (c) chronological age and (d) socialization age. Findings show that in toddlers with autism, responses to faces develop in line with social, but not 


chronological, age. 


Adapted from Child Dev. 82(6), Webb SJ, Jones EJ, Merkle K, et al. Developmental Change in the ERP Responses to Familiar Faces in Toddlers with Autism Spectrum Disorders 
Versus Typical Development, pp. 1868-1886, Copyright (2011), with permission from John Wiley and Sons. 


trajectories. Joint attention is a construct that illustrates the potential 
for developing causal models of ASD emergence that can be used to 
develop new treatments. Joint attention refers to the ability to ‘co- 
ordinate attention between interactive social partners with respect 
to objects or events in order to share an awareness of the objects or 
events’ [46]. Most typically developing infants are able to share at- 
tention, follow another’s direction of attention, and direct another’s 
attention by pointing themselves by around 12 months. Early joint 
attention skills predict individual differences in later language de- 
velopment [47], consistent with the idea that shared attention scaf- 
folds early language development. Toddlers with ASD show marked 
impairments in aspects of joint attention [46, 48] that relate both to 
concurrent and future language skills [49]. Building on this research, 
Connie Kasari and colleagues have developed programmes targeted 
at joint attention, in addition to other symbolic play and engagement 
skills [Joint Attention, Symbolic Play, Engagement and Regulation 
(JASPER)], and shown significant effects on social communication, 
emotion regulation, and language in pre-schoolers [50, 51]. Indeed, 
JASPER is one of two interventions recommended by NICE in the 
UK as evidence-based. The progress in this research field highlights 
the power of a developmental approach. 

Social engagement (broadly defined) has been the target of other 
successful behavioural intervention approaches for children with 
ASD. For example, Dawson and Rogers have developed compre- 
hensive and intensive programmes (the Early Start Denver Model) 


that also feature inter-personal exchange and shared engagement 
as a core target; these have improved both communication skills 
[52] and neural responses to social stimuli [53] in young children 
with ASD. Green and colleagues have developed the less intensive 
Preschool Autism Communication Trial (PACT) model, which fo- 
cused on increasing the engagement and reciprocity of interactions 
between children and their parents [54]. This intervention can pro- 
duce long-term improvements in symptoms in children with ASD 
[55]; improvements are likely mediated through changes in parental 
synchrony and child initiations [56]. Plasticity in early development 
has the potential to be even greater, and so researchers have also 
developed parent-mediated programmes for infants at high familial 
risk for ASD. These typically focus on boosting sensitivity to early 
communicative cues, encouraging the child’s efforts to enter the 
social world. Though replication is needed, early results are prom- 
ising and suggest there may be effects on emerging autism symp- 
toms [18, 20] and the neurocognitive correlates of social attention 
[19]. One key limitation of psychosocial trials is that parents and 
caregivers are aware of treatment status, making the use of caregiver 
questionnaire measures (the most common outcome measures used 
in pharmaceutical trials) very problematic. Blinded behavioural or 
neurocognitive outcome assessments are thus critically important 
in this field. However, the relevance of many of these measures to 
everyday functioning for individuals has not been demonstrated. 
Ongoing efforts to develop new objective biomarkers for social 
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cognition [57] and analysis approaches for common diagnostic 
assessments that are more sensitive to treatment change [58] will 
facilitate progress in this area. Further, testing whether changes in 
social attention/social cognition predict longer-lasting change in 
social functioning will be an important test of causal frameworks 
and will indicate whether such measures could form proxy outcome 
measures for future trials. 

Pharmacological treatment targets in the domain of social cog- 
nition have also been identified. One promising line of investiga- 
tion is the neuropeptides oxytocin and vasopressin. These are highly 
conserved mediators of complex social cognition and behaviour 
and, with appropriate caution, could thus prove valuable transla- 
tional targets [59, 60]. Several trials of the effects of oxytocin have 
been reported in people with autism, with some predicted effects 
on social cognition and behaviour [61-63]. One challenge to be ad- 
dressed is the mode of oxytocin administration, since effects on the 
brain of intranasal administration (the most common method) may 
be limited to high doses and within certain time windows after ad- 
ministration [64]. The developmental window during which oxy- 
tocin is administered may also be critical. For example, mice with 
a mutation in Cntnap2 display deficits in social behaviour and re- 
duced oxytocin; daily application of oxytocin improved later social 
engagement, but this was most pronounced when treatment oc- 
curred earlier in development [65]. These results are promising, but 
in wildtype mice, chronic application of oxytocin developmentally 
can produce compensatory mechanisms that reduce sociability in 
adulthood [66]. Such results illustrate the importance of considering 
compensatory systems and the need for individualized treatment 
approaches. Vasopressin (which differs in only two amino acids 
from oxytocin) has been less well investigated, but in addition to 
core functions in regulating blood pressure, it also has an important 
function in mediating social behaviours [4]. Vasopressin enters the 
CNS after intranasal administration and can modulate brain net- 
works involved in processing emotional information [67]. Thus, 
a number of clinical trials targeting vasopressin in ASD are under 
way (https://clinicaltrials.gov). Interestingly, while some investiga- 
tors are testing the effects of increasing levels of vasopressin (http:// 
med.stanford.edu/clinicaltrials/trials/NCT01962870), others have 
reasoned that suppressing vasopressin activity may be more appro- 
priate [68]. Taken together, neuropeptides show some promise as 
translational targets for further work. 


Inhibition/excitation balance 
Evidence 


Network and clustering-based approaches to genetic data have iden- 
tified common functions that appear to be compromised by a range 
of risk variants for ASD, which include neural signalling and de- 
velopment, synaptic transmission, chromatin remodelling, neural- 
glial signalling, and transcriptional regulation [69]. Examination 
of expression patterns of risk genes by brain region and develop- 
mental stage has identified transcriptional regulation and synaptic 
development during pre- and early postnatal development as loci 
of alteration, with concentrations in superficial cortical layers and 
glutamatergic synapses [70]. Taken together, such work indicates 
(perhaps unsurprisingly) that ASD is related to differences in the 
formation and specialization of brain networks. One specific line 
of investigation has focused on disruptions in either glutamate or 


GABA-ergic systems, which could lead to alterations in the co- 
ordination of inhibition/excitation in the brain [71]. The high co- 
occurrence of epilepsy in individuals with ASD forms one line of 
evidence for this hypothesis [72], in addition to the high rates of 
ASD in disorders that disturb glutamatergic or GABA-ergic func- 
tioning like 15q11-13, fragile X, or neurofibromatosis type 1 [73]. 
Alterations in GABA and glutamate levels have been identified in in- 
dividuals with ASD using magnetic resonance spectroscopy, though 
findings are somewhat inconsistent and limited by lack of spatial 
precision [74, 75]. Emerging evidence from stem cell models impli- 
cate an over-production of GABA-ergic neurons [76]. Theoretical 
models indicate the important role of GABA in maintaining brain 
networks [77], and evidence from animal models suggests that 
GABA or glutamatergic dysfunction could contribute to mainten- 
ance of concurrent ASD symptoms [78]. Thus, a range of evidence 
indicates that GABA/glutamate systems may be appropriate treat- 
ment targets for targeting current symptomatology. 

Developmental actions of GABA and glutamate are also critical 
to consider. GABA has well-defined roles in controlling processes, 
including cell proliferation, neuroblast migration, dendritic matur- 
ation, and synapse elimination [79, 80]. Activity-dependent GABA 
signalling is thought to be critical in optiziming the balance between 
excitation and inhibition in the developing cortex [79] and has a 
critical role in shaping cortical-sensitive periods [81]. Such sensi- 
tive periods are important in tuning brain responses to the faces and 
voices infants encounter in their environment, a process called per- 
ceptual narrowing [82]. Disruptions to GABA-ergic signalling pro- 
cesses could thus impact early experience-dependent specialization 
of the social brain, potentially resulting in ASD symptoms. Further, 
a wide body of research indicates that GABA initially has a depolar- 
izing function, with a switch to a hyperpolarizing function some- 
where around the end of the first postnatal week in rodents, though 
there has been some debate [83, 84]. Other neurotransmitter sys- 
tems with significant developmental roles include serotonin, which 
has also been implicated in autism and is particularly involved in 
early development of sensory systems [85]. Understanding how key 
neurotransmitter systems contribute to the emergence of autism 
symptoms will be assisted by longitudinal prospective studies of 
populations of human infants with genetic syndromes associated 
with alterations in key neurotransmitter systems. 


Treatment targets 


A number of therapeutic targets have been identified that may 
modulate GABA or glutamatergic functioning. For example, 
acamprosate is a GABAa agonist and a glutamate antagonist, which 
has shown some benefit in open-label trials [86]. The GABAb agonist 
arbaclofen has successfully reversed symptoms in animal models of 
fragile X syndrome, but human trials have been disappointing [24]. 
Vigabatrin inhibits the breakdown of GABA and increases GABA 
concentration; its use for seizures in infants with tuberous sclerosis 
has been associated with some improvement in ASD symptoms in 
later development [87]. Gabapentin increases GABA concentra- 
tion and is sometimes used to manage disruptive behaviours in 
adults with ASD [88]. Other anti-epileptic drugs like topiramate, 
levetiracetam, and lamotrigine have been trialled, with limited ef- 
fects on core symptoms [89]. D-cycloserine is a partial glutamatergic 
agonist, which has had limited efficacy for core symptoms [90] but 
may support long-term effects of targeted social skills training in 
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children with ASD [91]. Bumetanide enhances GABA-ergic inhib- 
ition and again has shown some promise [92]. Targeting the sero- 
tonin system, SSRIs like fluoxetine and citalopram have been trialled 
in older populations, with mixed results [93]. Clearly, further work 
on turning promising genetic findings into drug targets is required. 
One key step is to develop better neurocognitive markers of targeted 
neurotransmitter systems that can be used as outcome measures in 
clinical trials. 


Oxidative stress and immune activation 
Evidence 


Emerging evidence suggests a role for maternal immune activa- 
tion/inflammation and oxidative stress in causing ASD [94-96]. 
Prenatal risk factors for ASD include infection, maternal gestational 
diabetes and medication use, exposure to significant pollution, 
neonatal anaemia, meconium aspiration or exposure, and respira- 
tory distress [97-99], many of which cause some degree of hyp- 
oxia and/or a strong maternal immune response in pre- or early 
postnatal development. Mutations or alterations in mitochondrial 
functioning have also been linked to ASD [100], which may com- 
promise energy available to neurons. With regard to inflammation, 
epidemiological studies indicate correlations between autism and 
familial history of atopic diseases like asthma, eczema, allergies, 
and food intolerance, all associated with inflammatory responses 
[96]. Prenatal immune activation may produce changes in methy- 
lation (particularly in the prefrontal cortex) that could contribute 
to later ASD symptoms [101]. Effects of oxidative stress or inflam- 
mation may also depend on the presence of particular risk genes 
in influencing outcome [102, 103]. Interestingly, gene expression 
studies show that individuals with ASD exhibit upregulated expres- 
sion of genes associated with inflammatory responses that are cor- 
related with downregulation of synaptic transmission genes [104]. 
Developmental analyses suggest that these inflammatory responses 
may be secondary consequences of early dysregulation associated 
with changes in synaptic function or brain activity across the first 
decades of life [22]. However, such changes may still contribute to 
maintenance of core and associated symptoms of autism and thus 
represent appropriate targets for intervention. 


Treatment targets 


There have been some promising signals of the antioxidant N-acetyl 
cysteine (also involved in regulation of extracellular glutamate) 
during adolescence in animal models of schizophrenia [105]. Early 
trials in ASD are not promising [106] but may have selected insuf- 
ficiently sensitive outcome measures (questionnaire-based, rather 
than mechanistic, as in the animal model); application at a prodromal 
stage of the disorder may be more effective but, of course, would be 
associated with significant ethical hurdles. Other investigators have 
shown promising effects of sulforaphane (in combination with other 
compounds), a component of broccoli that also has significant anti- 
oxidant properties; effects were apparently reversed when the par- 
ticipants stopped taking the treatments [107]. Concerns have been 
raised about the unusually small ‘placebo’ response in the control 
group, although one of the key measures is not commonly used in 
ASD trials (the Social Responsiveness Scale). There have also been 
some supportive findings in animal models [108], suggesting that 
this kind of approach may be worthy of further investigation. 


Linking top-down and bottom-up models 


To date, attempts to link the literature derived from top-down dis- 
sections of the clinical difficulties experienced by children with ASD 
and emerging ‘bottom-up’ findings from genetic studies and animal 
models have been relatively exploratory. There are domain-general 
explanations of ASD within the cognitive literature that may map 
more readily onto the emerging neurobiological literature; these in- 
clude ideas about neural variability [11], reduced influence of prior 
information on current processing [109], or alterations in weighting 
of errors within predictive coding frameworks [110]. However, while 
these explanations map relatively clearly onto some of the perceptual 
experiences of individuals with ASD, they generally provide rela- 
tively underspecified accounts of both core social communication 
symptoms of ASD. Moving forward requires rigorous experimental 
and theoretical and computational approaches to understanding the 
interfaces between these literatures. Nonetheless, there are some 
potential points of convergence. Firstly, effects of early widespread 
neuronal disruption on sensitive periods in early brain specializa- 
tion may be one possibility for why the difficulties in ASD are so 
pronounced in the social domain. Experience-dependent learning is 
clearly very important in the development of social communication, 
since infants are not born knowing what language they will speak 
or what faces they will see; many ASD-associated genes converge 
on processes like neural signalling and development, synaptic trans- 
mission, activity-dependent chromatin remodelling, neural-glial 
signalling, and transcriptional regulation that would be expected 
to affect experience-dependent specialization. Higher-order associ- 
ation ‘nexus’ regions that are critical in social processing may be par- 
ticularly vulnerable to altered inhibition/excitation balance, because 
they integrate signals from several regions [111]. Gene expression 
studies are also beginning to reveal degrees of anatomical specifi- 
city in observed changes that may help guide human neuroimaging 
studies; for example, disordered cortical patterning of gene expres- 
sion has been observed in frontal and temporal cortices, critical to 
the social brain network. Greater refinement of genetic, neurobio- 
logical, and human phenotyping studies may converge on mean- 
ingful explanations of how apparently domain-general alterations 
produce the relatively specific symptoms experienced by individuals 
with ASD. 

An alternative possibility is that the disruptions associated with 
highly penetrant genes or environmental risk factors may not dir- 
ectly cause the social symptoms associated with ASD. Some of the 
more penetrant genetic variants may simply cause the brain to work 
less effectively, which only produces symptoms of ASD when on a 
genetic background that predisposes children towards poorer social 
skills [112]. Similar hypotheses have been advanced for environ- 
mental factors like maternal immune activation [113]. Indeed, most 
copy number variations associated with ASD have also been associ- 
ated with a wide range of other conditions. Genetic evidence suggests 
that copy number variations and common variation associated with 
ASD may affect different biological processes [114]. Larger muta- 
tions may alter the degree to which neurodevelopmental trajectories 
are buffered against environmental or other genetic perturbation 
[115, 116], with specificity then linked to those perturbations. If 
this is the case, trying to figure out how mutations that apparently 
have widespread effects lead to relatively specific symptoms may bea 
fruitless endeavour. Indeed, robust ‘autistic-like’ phenotypes in mice 
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with fragile X syndrome mutations are only seen on a limited range 
of genetic backgrounds [117], potentially critical to understanding 
the failure of recent trials inspired by mechanisms affected by the 
fragile X syndrome mutation. However, such accounts need to ex- 
plain why some genetic syndromes like Williams syndrome are not 
associated with high rates of ASD. 

Thirdly, it is critical to understand the role of adaptation in 
shaping the ASD phenotype. The brain is an organ of adapta- 
tion at multiple spatial and temporal scales [111]. These adaptive 
processes could be relevant to ASD in a number of ways. Firstly, 
deficits in adaptation could directly contribute to ASD. For ex- 
ample, homeostatic processes act to regulate inhibition/excita- 
tion ratios, and disorders/insufficiency of homeostasis [118] or 
autoregulation in general [71] have been posited as unifying frame- 
works in ASD. Neurotransmitter systems like GABA or neuro- 
peptides like vasopressin may thus need to be balanced within 
an optimal range, rather than generally up- or downregulated by 
treatment. Secondly, alterations that occur during particular time 
windows could cause long-term effects but be rapidly masked by 
subsequent compensatory processes [119]. In this case, adaptive 
processes would make it impossible to identify the original causes 
of ASD in the mature brain. Thirdly, adaptive processes may pro- 
duce symptoms of ASD that represent optimal functioning for 
that individual brain, given its early processing constraints [111]. 
If this is the case, boosting adaptation may actually result in more 
symptoms of ASD. Finally, adaptive or compensatory processes 
may not be specific to ASD but may produce co-occurring symp- 
toms. For example, recent gene expression studies indicate that 
immune-related markers emerge over the first two decades of life 
[22]. Increased cytokine and other inflammatory responses have 
been linked to depression [120], and so these responses may relate 
(as cause or consequence) to the common co-occurrence of con- 
ditions like depression in individuals with ASD. The likely crit- 
ical role of adaptation indicates that pharmaceutical development 
will require us to disentangle adaptive or compensatory responses 
from primary causal pathways. 


Although the number of clinical trials of novel therapies for ASD 
is increasing, there has been much discussion of how to accelerate 
the pace of translation to the clinic [121, 122]. We need better meas- 
ures with which to stratify participants for inclusion in trials and to 
measure the effects of those trials on the processes we think are im- 
paired. This involves moving from group-level neuroimaging meas- 
ures that are inconsistent across studies and have low reliability to 
identifying robust proxy biomarkers of treatment success. Such bio- 
markers should be developed not only from theoretical models of 
what we think underpins autism symptoms, but also from bottom- 
up measures of the pathophysiological processes implicated by gen- 
etic work and animal models. We also need translatable biomarkers 
that are sensitive to the same systems across species and can move 
with a drug from preclinical to clinical work; public-private part- 
nerships can be valuable in this endeavour [123]. Further, we need a 
deeper understanding of how genetic and pathophysiological vari- 
ation maps onto brain circuits and then onto emergent cognitive 
functions during development, and how this shapes final common 


pathways to symptom development. This will require emphasis on 
longitudinal studies that measure brain and behavioural develop- 
ment in genetically characterized populations, coupled with com- 
putational models and use of large normative data sets on gene 
expression and circuit function. Finally, we need to understand 
the complex interactions between different neurotransmitter and 
neuropeptide systems over developmental time, including the role 
of compensatory changes, and an appropriate target may be to 
achieve balance, rather than to uniformly boost or suppress activity 
in particular systems. Despite these challenges, new insights mean 
that treatment development for ASD is poised to enter a new frontier 
of translational opportunity. 
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The epidemiology of autism has generated much interest recently, 
mainly because of the reported dramatic increases in the prevalence 
of autism in the last few decades and the controversy surrounding 
vaccines and autism. These reports have led to speculation of the 
causes and intensive study of the changing epidemiology of autism 
in some countries. This chapter provides a background to the epi- 
demiology, particularly the global burden of disease and changes in 
the prevalence of autism in the last few decades and the risk factors 
that may contribute to these changes in epidemiology. In addition, 
financial burden and premature mortality are discussed. 











The epidemiology of autism is complicated by the difficulty in 
determining its onset and the changes in the criteria used for 
the diagnosis over time. Autism is widely considered as a life- 
long condition that occurs in genetically susceptible individuals, 
who may be exposed to other influences either in utero or after 
birth. The diagnosis of autism is rarely made before 18 months 
of age, although parents will often have concerns before that age. 
The age of diagnosis varies considerably, being influenced by the 
presence of symptom severity (particularly earlier in those with 
greater intellectual disability), socio-economic status (later in 
those with lower socio-economic status), parental concerns, and 
geographical region since this is determined by public awareness, 
expertise, and facilities for diagnosis [1]. Thus, determination of 
the onset is difficult, which makes calculating the incidence (new 
cases per population per unit time) problematic. The incidence 
is a more robust epidemiological measurement than prevalence 
(number of cases per population at point in time), particularly 
when examining the secular trends and identifying risk fac- 
tors. However, incidence measurements may underestimate 
the burden of a condition if there is premature mortality (see 
Premature mortality, p. 267) or if there is spontaneous remission 
(this is rare in autism, since the diagnosis is stable throughout 
life, although the severity of symptoms may change). Most of 
the epidemiological studies have reported prevalence, since they 
are conducted on referral databases (which are susceptible to 





bias, particularly ascertainment and assessment bias) or cross- 
sectional surveys with differing methodologies (postal surveys, 
community door-to-door surveys). Comparison between cross- 
sectional surveys are problematic, since the results are influenced 
by different methodologies, including identifying all the people 
within the denominator, tools used to screen for autism, age of 
the subjects, expertise of the final diagnosis, and diagnostic cri- 
teria. Comparison of the subjects identified with autism from 
surveys and established databases of autism may show significant 
discrepancies [2]. Birth cohorts may be more informative but are 
expensive and difficult to conduct. 





The initial studies measuring the prevalence of autism in the 1960s 
and 1970s reported low frequencies, with prevalences from 0.7 to 4.6 
per 10,000, mainly using Kanner’s description of the diagnosis [3] 
(Table 27.1). The DSM-III criteria published in 1980 [4] was used in 
most studies of that decade, with prevalences reported from 2.0 to 
13.8 per 10,000 (Fig. 27.1). In this era, the comparison between the 
studies was complicated by use of varying methods for case ascer- 
tainment, different age groups studied, and some studies using the 
Kanner’s criteria, Lorna Wing's triad [5], or Rutter’s definition [6]. In 
the 1990s, the prevalences reported were considerably higher, with 
a prevalence of 60/10,000 reported from Sweden [7]. The analysis 
of the secular trends was complicated by the use of ICD-10 [8] and 
the introduction of DSM-IV published in 1994 (Table 27.1). From 
the year 2000, the reported prevalence of autism was consistently 
and considerably higher, ranging from 11.0 to 157 per 10,000. Most 
of these studies used either DSM-IV or ICD-10 criteria for autism. 
Since 2010, the prevalence of autism in some countries has been re- 
ported as even higher, with a prevalence of 140/10,000 in the United 
States across all age groups [9] and 260/10,1000 in South Korean 
children [10]. 








Since the original delineation of autism by Kanner in 1943 [3], the 
conceptualization of autism has undergone significant changes 


Table 27.1 Studies measuring the prevalence of autism in the 1960s and 1970s 


Kanner (1943)' Rutter and Schopler | Wing and Gould DSM-IV‘ 
(1978)? (1978)? 


Social 
communication 


Language 


Behaviour 


1. Profound lack of 
affective contact 


2. Repetitive, 
ritualistic 
behaviour, which 
must be of an 
elaborate kind 


1. Impaired social 
development 
which is out-of- 
keeping with the 
child's intellectual 
level 


2. Delayed and 
deviant language 


development that 


also has certain 
defined features 
and is out of 
keeping with the 
child's intellectual 
level 


W 


.'Insistence on 
sameness’, 
as shown by 
stereotyped 
play patterns, 
preoccupations, 
or resistance to 
change 


il. Impairment 
in social 
communication 


2. Impairment 
in verbal and 
non-verbal 
communication 


3. Repetitive, 
ritualistic 
behaviour 


1. Lack of 
responsiveness 
to others 


2. Language 
absence or 
abnormalities 


3. Resistance 
to change or 
attachment to 
objects 
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1. Impairment in 
reciprocal social 
interaction (at 
least two froma 
list of five items, 
comprising 
specified clinical 
examples) 


2. Impairment 
in verbal and 
non-verbal 
communication 
(at least one from 
a list of six items) 


3. Markedly 
restricted 
repertoire of 
activities and 
interests (at least 
one from a list of 
five items) 


261 








A) Qualitative impairment in social 
ee as manifested by at least 
two of the following: 

1. Marked impairments in the use of 
multiple non-verbal behaviours 
such as eye-to-eye gaze, facial 
expression, body posture, and 
gestures to regulate social 
interaction 

2. Failure to develop peer 
relationships appropriate to the 
developmental level 
3. Lack of spontaneous seeking to 
share enjoyment, interests, or 
achievements with other people, 
for example, by lack of showing, 
bringing, or pointing out objects 
of interest to other people) 

4. Lack of social or emotional 

reciprocity (note: in the 

description, it gives the following 
as examples: not actively 
participating in simple social 

play or games, preferring solitary 

activities or involving others 

in activities only as tools or 

‘mechanical aids) 





(B) Qualitative impairments in 

communication, as manifested by at 

least one of the following: 

1. Delay in, or total lack of, 

he development of spoken 

anguage (not accompanied 

by an attempt to compensate 

hrough alternative modes of 

communication such as gesture 

or mime) 

2. In individuals with adequate 

speech, marked impairment in 

he ability to initiate or sustain a 

conversation with others 

3. Stereotyped and repetitive use 

of language or idiosyncratic 

anguage 

4. Lack of varied, spontaneous 
make-believe play or social 
imitative play appropriate to the 
developmental level 





(C) Restricted, repetitive, and 
stereotyped patterns of behaviour, 
interests, and activities, as 
manifested by at least two of the 
following: 

1. Encompassing preoccupation 
with one or more stereotyped 
and restricted patterns of interest 
hat is abnormal either in intensity 
or focus 
2. Apparently inflexible adherence 
o specific, non-functional 
routines or rituals 
3. Stereotyped and repetitive 
motor mannerisms (for example, 
hand or finger flapping or 
wisting, or complex whole body 
movements) 

4. Persistent preoccupation with 
parts of objects 








(continued) 
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Table 27.1 Continued 


Kanner (1943)! Rutter and Schopler | Wing and Gould DSM-IV® 
(1978) (1978) 





Other features Other features 4. Absence of 4. In a total of at A total of six (or more) items 
include: schizophrenic least eight from from (A), (B), and (C), with at least 
3. An anxiously features among the 16 two from (A) and one each from 
obsessive desire for items listed (B) and (C) 
the preservation (III) The disturbance is not better 
of sameness in the accounted for by Rett's disorder 
child's routines and or childhood disintegrative 
environment disorder 


4. A fascination for 
objects, which 
are handled with 
skill in fine motor 
movements 

5. Mutism or a kind 
of language that 
does not seem 
intended for 
inter-personal 
communication 

6. Good cognitive 
potential shown 
in feats of 
memory or skills 
on performance 
tests, especially the 
Séguin form board 





Onset Kanner also 4. Onset before 5. Onset before (Il) Delays or abnormal functioning 
emphasized the 30 months 30 months in at least one of the following 
onset from birth or areas, with onset prior to age of 
before 30 months 3 years: 


(A) Social interaction 

(B) Language as used in social 
communication 

(C) Symbolic or imaginative play 


Source: data from Nervous Child, 2, Kanner L, Autistic disturbances of affective contact, pp. 217-250, Copyright (1943); Rutter M, Diagnosis and definition. In: Rutter M, Schopler E, [eds], 
Autism: A Reappraisal of Concepts and Treatment, pp. 1-25, Copyright (1978), Plenum Press; J Autism Child Schizophr., 8(1), Wing L, Gould J, Systematic recording of behaviors and skills 
of retarded and psychotic children, pp. 79-97, Copyright (1978); Spitzer RL, Gibbon M, Skodol AE, et al., DSM-III. Diagnostic and Statistical Manual of Mental Disorders (Third Edition), 
Copyright (1989), American Psychiatric Press; Perry S, Allen F, Clarkin JC, DSM-III-R Casebook of Treatment Selection, Copyright (1996), Routledge; DSM-IV-TR: Diagnostic and Statistical 
Manual of Mental Disorders (Diagnostic & Statistical Manual of Mental Disorders), 4th Revised edition, Copyright (1994), American Psychiatric Association. 
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Fig. 27.1 (see Colour Plate section) Prevalence of autism since the 1960s to 2015 according to the diagnostic criteria. 


[11]. The definition has changed from a discrete disorder, based 
upon Kanner’s description, to the concept that autism is part of a 
spectrum of disorders characterized by impaired social communi- 
cation with differing degrees of severity. More recently, autism has 
been considered as a continuum, in which the diagnosis depends 
upon the number and severity of autistic traits. These differences 
in the concepts of autism have largely brought about the changes 
in the diagnostic criteria but have been influenced by changing 
conceptualization of mental health disorders and the complexity 
of the genetic basis. 

The changes in the diagnostic criteria have had a major impact 
on the epidemiology of autism, from subjects with intellectual 
disability defined in Kanner’s original description to DSM-IV and 
ICD-10 which are much more inclusive, encompassing higher- 
functioning individuals such as those with Asperger’s syndrome. 
Thus, the prevalence of autism in London in the UK was three 
times higher with the reclassification of cases by ICD-10 criteria, 
compared to those defined by Kanner’s criteria in the original 
study [12]. Many of these additional cases are a result of diag- 
nostic substitution, in which cases of autism were previously diag- 
nosed as other conditions. In the 1960s and 1970s, many cases 
of autism would have been classified as intellectual disability. In 
an analysis of special education requirements across the United 
States, Shattuck showed that many children initially diagnosed 
with intellectual disability were subsequently diagnosed with 
autism, and the increase in prevalence of autism was associated 
with a reduction in intellectual disability during this period [13]. 
In the UK, analysis of the General Practitioner Research Database 
demonstrated that the incidence of autism from the 1988 birth co- 
hort rose from 4/10,000/year to 25/10,000/year in the 1997 birth 
cohort, and again this increase was attributed to changes in the 
diagnostic criteria [14]. 

Increased public and medical awareness of autism is likely 
to have increased the prevalence of autism over time. Parents 
of children with communication difficulties are more likely to 
take their children for assessment and ask about the diagnosis of 
autism. The formation of advocacy groups, for example Autism 
Speaks in the United States and the National Autism Society in 
the UK, has certainly increased public awareness and possibly led 
to an increase in referrals for assessment. Furthermore, parents 
are more likely to have their children assessed for autism now- 
adays, since in many Western countries, there is an increase in fi- 
nancial, psychological, and therapeutic support for families with 
autism than previously. Health care workers are more likely to 
entertain the diagnosis, and autism is now an established com- 
ponent of undergraduate teaching for health and educational 
professionals. Autism diagnostic services has increased, which, 
although they may be self-servicing, have arisen for perceived 
needs in communities. There has been an increase in national 
(Denmark [15]), medical [16], and educational [13, 17] regis- 
tries in the 1990s. However, interpretation of the secular trends 
in these registries is problematic, since the reports often do not 
account for the many confounding factors such as increasing 
population, decreasing age of diagnosis, or changes in referral 
patterns and availability of services. 
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Changes in biological risk factors and in the 


prevalence of autism 





The changes in a number of biological risk factors may have influ- 
enced the secular changes in the prevalence of autism. Increasing 
paternal age is associated with an increased risk of developing 
autism [18], and the paternal age has increased. Some drugs, for 
example sodium valproate, or alcohol use during pregnancy are 
associated with autism, and these may have contributed to the in- 
crease in the 1980s and 1990s. The influence of these factors in the 
last decade is likely to have decreased, given the increased awareness 
of the effects on the fetus [19]. The incidence of perinatal factors, 
for example prematurity, associated with autism has increased, and 
survival of affected neonates has improved [20]. Migration is asso- 
ciated with autism in some countries, for example Sweden, and this 
has increased in the last few decades [21]. In contrast, the incidence 
of other risk factors associated with autism, for example rubella and 
thalidomide exposure, has decreased. 


Prevalence of autism across the world 


The global burden of autism has not yet been defined accurately. 
Although there are many epidemiological studies from North 
America, Europe, and Japan, there are few studies from the re- 
mainder of the world. Thus, in a recent review of mental disorders 
of children, there were no data on any disorders in 124 of the 187 
countries [22]. In particular, there were no studies on the preva- 
lence of autism in Africa, South and Central Asia, Central and Latin 
America, and Eastern and Central Europe. In another review, the 
median prevalence from the available studies was 17 (range 3-94) 
per 10,000 in 36 studies conducted since 2000, with the median 
prevalence of all pervasive disorders as 62 (range 1-189) per 10,000 
from 32 studies during the same time period [23]. There was no sig- 
nificant difference between these estimates between the studies con- 
ducted in Europe, North America, and Western Pacific. 


Global burden of autism 


The Global Burden of Disease (GBD) was set up in the 1990s to com- 
pare the burden between countries and monitor secular trends. The 
GBD is usually presented in disability-adjusted life years (DALYs), 
which is the sum of years lost to life (YLLs) + years lived with dis- 
ability (YLDs). The latter, in turn, is derived from incidence x dis- 
ability weight. The disability weight in the 1990s was determined 
by expert opinion, but in the most recent estimations, the disability 
weight has been derived from interviews conducted across the world 
[24]. However, these interviews did not assess the disability weights 
associated with autism. 

The global burden of autism, as measured with DALYs, was 
337.8 million in 1990 and had increased by 38% to 467.6 million in 
2010; however, this increase could be attributed to the population 
growth, with no increase in DALYs per capita [25]. Standardized by 
age and sex, this equated to 111 DALYs per 100,000 population [95% 








SECTION 4 Autism spectrum disorders 


uncertainty interval (UI) 77-154], but with 170 DALYs per 100,000 
males (95% UI 119-237) and 50 DALYs per 100,000 females (95% 
UI 35-68). 


The male predominance of autism was recognized by Kanner, and 
subsequently confirmed by later studies. Overall the ratio of males 
to females is 4.2:1 [26], but it depends upon the phenotype, in par- 
ticular the degree of intellectual disability. As the intelligence quo- 
tient decreases, the ratio between males to females also decreases. 


Autism has high heritability, with twin studies suggesting that herit- 
ability ranges from 64% to 91% [27], with the estimate partially ex- 
plained by the phenotype being studied. However, the genetic basis 
of autism is complex, with multiple genes involved by a variety of 
abnormalities such as copy number variants, common genetic vari- 
ants, etc. It is likely that a multitude of genetic factors influence the 
susceptibility of an individual to develop autism, but there may be 
environmental factors that influence the genetic propensity [28]. 
The complex genetic basis of autism is outside the scope of this 
chapter, and the reader is referred to some excellent reviews [29-31]. 

There are a number of risk factors associated with autism, 
including parental acquired risk factors, and intrauterine, perinatal, 
and postnatal risk factors. Increasing paternal age (>50 years) is as- 
sociated with a small, but significant, risk (relative risks of 1.5) of 
autism [18, 32, 33]. Advanced maternal age (>40 years) is also asso- 
ciated with an increased risk, but the relative risk is smaller than that 
with older fathers. In addition, young mothers (<20 years) have an 
increased risk [18]. However, given the relatively low relative risks 
and the inconsistency between studies, the importance of these fac- 
tors is not well established. 

Exposure to intrauterine infections, drugs, and other insults is 
also associated with autism. Rubella infection during pregnancy was 
the first infection to be associated with the development of autism. 
Initial studies suggested that 4-7% of affected fetuses develop autism 
[34], and the risk is increased in those children who had other fea- 
tures of congenital rubella syndrome. Further studies report the 
prevalence may be as high as 12% [35]. Other infections, such as 
cytomegalovirus [36], influenza, and parvovirus, have been docu- 
mented with autism, but the epidemiological data are weaker than 
those with rubella. Likewise, some parasitic infections, such as toxo- 
plasmosis and tick-borne infections, have weak and inconsistent as- 
sociations with autism [37]. 

In a meta-analysis of papers published by 2007, maternal infec- 
tion during pregnancy (OR 1.18; 95% CI 0.76-1.83), vaginal infec- 
tions (OR 0.49; 95% CI 0.22-1.09), and maternal fever (OR 1.24; 
95% CI 0.76-2.04) were not associated with autism [38]. However, 
when the analysis was limited to the four studies that controlled for 
multiple covariates or used sibling controls; exposure to intrauterine 
infections was significantly associated with autism (OR 1.82; 95% CI 
1.01-3.30). Two recent population-based studies from Denmark and 
Taiwan have similar results. In the Danish study, admission to hos- 
pital due to maternal viral infection in the first trimester [adjusted 


hazard ratio (aHR) 2.98; 95% CI 1.29-7.15] and maternal bacterial 
infection in the second trimester (aHR 1.42; 95% CI 1.08-1.87) was 
associated with autism in the children, but no association was found 
between any maternal infection and autism (aHR 1.14; 95% CI 0.96- 
1.34) [39]. In the Taiwanese study of 4184 children with autism, two 
or more outpatient visits for genital infection [adjusted odds ratio 
(aOR) 1.34; 95% CI 1.12, 1.60] and bacterial infection (aOR 1.24; 
95% CI 1.06, 1.43) were associated with autism [40]. 

More recently, considerable interest has been generated by im- 
munological activation during pregnancy. Maternal autoantibodies 
to fetal brain proteins are associated with the development of autism 
[41]. Immunoglobulin G (IgG) reactivity against fetal brain proteins 
was found in the plasma of 7 of 61 American mothers (11.5%) of 
children with autism, but not in 62 mothers of typically developing 
children (p = 0.006) or 40 mothers of children with non-ASD devel- 
opmental delay (p = 0.04) [41] . Further studies have identified that 
the antibodies are against fetal brain tissue, and not adult brains. 
In addition, in a systematic review, maternal autoimmune diseases 
were associated with autism (pooled OR 1.34; 95% CI 1.23-1.46), 
including maternal thyroid disease [42]. 

The association between vaccines and the development of autism 
has been controversial, with a highly publicized study discredited 
[43]. Vaccination of children occurs at the ages when children are 
most likely to first show the features of autism. Thus, it is not sur- 
prising that many people think that there is an association between 
vaccination and autism. Since the original, but discredited, publica- 
tion suggesting a link between mumps, measles, and rubella vaccin- 
ation and the development of autism, there has been considerable 
work to assess the association with this vaccine. The most robust 
studies have not found an association, in particular, the analysis of 
the Danish national database [15] and a case-control study based 
upon the British General Practice Register [44]. 

In addition, a meta-analysis of a number of vaccines have not 
found an association between any of these vaccines and autism [45]. 
The cohort data revealed no relationship between vaccination and 
autism (OR 0.99; 95% CI 0.92-1.06), and there was no relationship 
between autism and the measles, mumps, and rubella vaccine (OR 
0.84; 95% CI 0.70-1.01) or the adjuvants used such as thimerosal 
(OR 1.00; 95% CI 0.77-1.31) or mercury (OR 1.00; 95% CI 0.93- 
1.07). The case-control data found similar results. Although the 
immunological activation induced by vaccination may lead to CNS 
manifestations, currently there is no evidence to support vaccin- 
ation is associated with autism. 

Other maternal factors thought to be associated with autism in- 
clude gestational diabetes, nausea/vomiting, and bleeding during 
pregnancy. Gestational diabetes is associated with autism in a 
number of studies, and a recent retrospective longitudinal cohort 
study in California in the United States found that the risk of autism 
was associated with maternal type 2 diabetes (birth year aHR 1.33; 
95% CI 1.07-1.66) and gestational diabetes diagnosed at 26 weeks 
or earlier (1.42; 95% CI 1.16-1.75) adjusted for maternal age, parity, 
education, household income, race/ethnicity, history of comorbidity, 
and sex of the offspring [46]. In a meta-analysis using a random ef- 
fects model to calculate the summary effects, nausea/vomiting (1.48; 
95% CI 1.03-2.14) and bleeding during pregnancy (1.81; 95% CI 
1.14-2.86) were associated with autism [38]. 

Prematurity (gestational age <37 weeks) and low birthweight 
(<2.5 kg) often occur together, and without accurate estimates 
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Fig. 27.2 (see Colour Plate section) Associations between autism and estimates of exposure to individual traffic-related and criteria air pollutants. 

PM, , particulate matter <10 um in diameter; PM,5, particulate matter <2.5 um in diameter; NO,, nitrogen dioxide; NO, nitrogen oxide; O}, ozone; CO, 
carbon monoxide. Exposure measured during developmental windows: PC, peri-conceptual; 1, trimester 1; 2, trimester 2; 3, trimester 3; P, pregnancy; Yr 
1, first postnatal year. We recalculated parameters to reflect a change in the exposure comparison, to be consistent with other comparisons in the figure, 
involving calculations assuming that parameters were normally distributed. 


Reproduced from Curr Probl Pediatr Adolesc Health Care, 44(10), Kalkbrenner AE, Schmidt RJ, Penlesky AC, Environmental chemical exposures and autism spectrum disorders: a 
review of the epidemiological evidence, pp. 277-318, Copyright (2014), with permission from Mosby, Inc. 
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Fig. 27.3 (see Colour Plate section) Associations between autism and estimates of exposure to pesticides. 

DE metabolites, diethyl phosphate metabolites of organophosphate pesticides; DM metabolites, dimethyl phosphate metabolites of organophosphate 
pesticides. Exposure measured during developmental windows: CNS, a priori period of central nervous system development (7 days pre-fertilization 

o 49 days post-fertilization); a period of development (26-81 days post-fertilization); 1, trimester 1; 2, trimester 2; 3, trimester 3; P, pregnancy; Yr 1, 

irst postnatal year. Other results not reported as measures of association with confidence intervals included pesticides that were determined not to be 
associated with increased autism risk and were not included in subsequent analyses from Roberts et al. 138: pesticide classes (cholinesterase inhibitors, 
copper-containing compounds, fumigants, avermectins, halogenated organics, N-methyl carbamates, pyrethroids, and thiocarbamates) and individual 
pesticide compounds (1,3-dichloropropene, chloropicrin, cypermethrin, fenarimol, methyl bromide, norflurazon, bromacil acid, chlorpyrifos, 
dazomet, glyphosate, molinate, oxadiazon, bifenthrin, diuron, metam-sodium, myclobutanil, and paraquat). 


Reproduced from Curr Probl Pediatr Adolesc Health Care, 44(10), Kalkbrenner AE, Schmidt RJ, Penlesky AC, Environmental chemical exposures and autism spectrum disorders: a 
review of the epidemiological evidence, pp. 277-318, Copyright (2014), with permission from Mosby, Inc. 

















of gestational age and monitoring growth during the intrauterine 
period, it is difficult to differentiate between the risks associated with 
these factors. Although prematurity has been shown to be associ- 
ated with autism in some studies, it was not significantly associated 
with autism in a meta-analysis [47]. A meta-analysis of studies has 
shown that intrauterine growth retardation is also associated with 
autism, with risk ratios from 1 to 2 [47, 48]. Other perinatal fac- 
tors associated with autism identified in the meta-analysis include 
multiple births [summary effect size 1.77 (1.23-2.55)], being born in 
summer [1.14 (1.02-1.26)], breech presentation [1.81 (1.21-2.71)], 
cord complications [1.50 (1.00-2.24)], fetal distress [1.32 (1.09- 
2.12)], birth injury [4.90 (1.41-16.94)], maternal haemorrhage [2.39 
(1.35-4.21)], low Apgar score at 5 minutes [1.67 (1.24-2.26)] [47]. 

There is considerable interest in the relationship between toxins 
and autism. In a rigorously conducted review on autism and toxins, 
the authors selected papers on the following criteria: population- 
based human (epidemiological) research, robust statistical tools, 
individual-level data on autism diagnoses, environmental-chemical 
measurements with exposures around conception or during preg- 
nancy or the first postnatal year, and with valid comparison, including 
appropriate sample selection and accounting for confounders and 
adequate sample sizes to generate precise measures of association 
[49]. In this study, there was an association between autism and 
measures of mixed air pollutant exposures and diesel particulate 
matter (Fig. 27.2) and different-size air pollutants and nitrogen di- 
oxide, but not with ozone or carbon monoxide. Associations were 
stronger for exposures in the third trimester of pregnancy and the 
first year of life, compared to earlier in pregnancy. In this review, 
there was a suggestion that volatile organic compounds, for ex- 
ample benzene and heavy metals (lead, mercury, manganese), may 
be associated with autism, but the ORs were not high or consistent 
across studies. The evidence for an association between a number of 
pesticides and autism was more convincing (Fig. 27.3). There was 
little evidence of the association between polychlorinated biphenyls, 
flame retardants, non-stick chemicals, or biphenol-A; evidence for 
some phthalates are more suggestive [49]. 


Premature mortality 


There has been little research on the premature mortality associated 
with autism. Standard mortality ratio (SMR) is used to compare 
mortality of a condition with the background population adjusted 
for age and sex. In early studies, the SMR of autism ranged from 1.9 
in Denmark to 2.6 in the United States. 

Ina recent update of the Danish study, the SMR was 1.93 (CI 1.26- 
2.82), with the SMR higher for females (SMR 4.01; CI 1.73-7.90) 
than for males (SMR 1.57; CI 0.93-2.48) [50]. During the 45 years 
of follow-up, the SMR varied between 1.42 and 1.98, while the SMR 
for females varied between 3.17 and 5.22. The most common causes 
of the 26 deaths in this cohort were unnatural causes (suicide = 2, 
suffocation = 2), epilepsy (8 had epilepsy, with 4 stating that epilepsy 
contributed to their death), and infections. 

In an analysis of a population database in Sweden, the authors 
compared 27,122 ASD probands, diagnosed between 1987 and 
2009, with sex-, age-, and county of residence-matched con- 
trols (n = 2,672,185) [51]. They found that 706 (2.60%) of people 
with autism died, compared to 24,358 controls (OR 2.56; 95% CI 
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2.38-2.76). Mortality occurred in most diagnostic categories, with 
the patterns of mortality influenced by sex and intellectual disability. 


Financial costs 


Autism, as a life-long disorder, is associated with significant costs to 
the family and society. It was estimated that the annual total costs in 
2011 associated with autism was about £3.1 billion in the UK and 
250 billion USD in the United States, assuming 40% have intellectual 
disability [52]. The presence of intellectual disability increases the 
costs up to five times, compared to those without intellectual dis- 
ability, depending upon age. The discounted lifetime costs of autism 
without intellectual disability were £0.92 million in the UK and 
1.43 million USD in the United States, while those of autism with 
intellectual disability were £1.5 million in the UK and 2.44 million 
USD in the United States [52]. These costs are likely to be underesti- 
mates, since autism is often not diagnosed in older people. Thus, one 
forecast suggests that the total ASD-attributable costs in the United 
States will rise to over 450 billion USD by 2025 [53]. 


Conclusions 


The burden of autism is well established in North America and 
Europe, but data from other continents, particularly Africa and 
Asia, are limited. The apparent increase in the prevalence of 
autism in North America and Europe is likely to be caused mainly 
by an increase in awareness and a change in diagnostic criteria. 
The interaction between genetic susceptibility to autism and 
environmental factors needs further examination. There is no 
evidence that autism is associated with vaccines, but its relation- 
ship with immunological responses needs further examination. 
Autism is associated with premature mortality and engenders 
considerable costs. 
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Autism spectrum disorder (ASD) has long been recognized as a 
heritable disorder, based on family and twin studies. Monozygotic 
twin concordance rates have been reported to be approximately 60% 
for the full syndrome and 90% for the broad spectrum. In contrast, 
dizygotic twin concordance has been reported to be relatively low, 
approximately 3-15%, depending on the diagnostic criteria used. 
These data support the conclusion that the observed familial clus- 
tering is largely the result of genetic factors and translate into an esti- 
mate of heritability that places ASD among the most strongly genetic 
of all neuropsychiatric conditions [1-7]. 

Establishing high heritability has encouraged the search for gen- 
etic variants that are enriched in ASD. Early insights into ASD risk 
genes came from recognition that children with rare syndromes of 
known genetic causes (for example, fragile X syndrome, tuberous 
sclerosis complex, Angelman syndrome) have high rates of ASD 
diagnosis, ranging from approximately 10% (Duchenne muscular 
dystrophy) to 80% (Phelan-McDermid syndrome) [8-10]. Initially, 
there was great hope that genetic linkage and candidate gene as- 
sociation studies would complement these syndromic observa- 
tions and rapidly yield a wealth of new insights about ASD genetic 
susceptibility loci. 

Linkage analysis assesses the probability that a given pheno- 
type and particular genetic markers are transmitted together from 
one generation to the next. Association studies typically investi- 
gate one or a number of known, common genetic polymorphisms 
that lie within or near pre-determined candidate genes of interest, 
comparing allele frequencies in cases vs controls. Such association 
studies have been popular over the last few decades, owing, in part, 
to the practicalities of subject recruitment and greater theoretical 
power to detect common susceptibility variants of relatively small 
effect, compared to linkage studies [11-14]. However, linkage and 
candidate gene association studies in ASD, and in nearly all com- 
plex neuropsychiatric disorders, have yielded very few reprodu- 
cible findings. This is likely due to previously underappreciated 
factors such as locus and allelic heterogeneity, clinical heterogen- 
eity, and differing inclusion criteria, inadequate control for popu- 
lation stratification, and insufficiently powered study cohorts, 
especially when studying common alleles with small effect sizes 
[15-17]. 





Due to these limitations, and coinciding with microarray techno- 
logical advancements that delivered affordable high-density geno- 
typing platforms, about 10 years ago, there was a shiftin methodology 
from candidate gene association to genome-wide association studies 
(GWAS), simultaneously testing hundreds of thousands to millions 
of common single-nucleotide polymorphisms (SNPs) for associ- 
ation with disease. Querying SNPs throughout the genome elimin- 
ates the need to previously select candidate variants, allowing for 
hypothesis-neutral investigations. This method also yields data that 
allow rigorous matching for ancestry between cases and controls, 
an aforementioned confounder in earlier association studies. The 
primary challenge in GWAS is the large number of independent 
comparisons performed, requiring a widely accepted genome-wide 
significance threshold of p <5 x 10°. Given the heterogeneity of com- 
plex disease, large sample sizes are required to achieve this level of 
significance. Nevertheless, this technique led to renewed excitement 
in the field around the potential for variant and risk gene discovery. 
However, in ASD, none of the candidate genes emerging from earlier 
association or linkage studies have reached the statistical threshold 
for genome-wide significance using GWAS [18, 19], suggesting ei- 
ther that these candidate genes had initial false-positive associations 
or that the current GWAS cohorts lack sufficient statistical power 
to detect common variant association. Nevertheless, efforts on this 
front continue; recent data suggest that one promising way to reduce 
genetic heterogeneity in ASD, and thereby improve statistical power, 
under the GWAS framework may be to focus analysis on inter- 
mediate core phenotypes that accompany the disorder [20]. 














To date, the most fruitful studies for identifying risk genes in ASD 
have been those focusing on rare, large-effect variants in the protein- 
coding regions (exome) of the genome. In addition to the genes im- 
plicated in rare syndromic forms of ASD, rare point mutations in the 
NLGN3X and NLGN4X genes (Neuroligin 3X and 4X) were the first 
replicated findings in non-syndromic (idiopathic) ASD [21, 22]. 
Soon after, driven by technological advances that allowed for the 
detection of genome-wide rare de novo mutations (detected in the 
probands, but not the parents), several studies reported an increased 
rate of de novo copy number variants (CNVs) in ASD and confirmed 


risk loci identified by the clustering of such variants in certain gen- 
omic regions among unrelated individuals with ASD [23-39]. 

Similarly, de novo coding single nucleotide variants (SNVs) and 
insertion—-deletions (indels) have been found by whole-exome 
sequencing studies to contribute to ASD risk. In particular, likely 
gene disrupting (LGD) de novo variants (for example, nonsense, 
splice site, frameshift) are enriched in ASD and have proven to be a 
powerful avenue for identifying individual risk genes by the finding 
of multiple such variants in the same gene in unrelated individuals 
[40-45]. 

Owing mainly to de novo genotyping and sequencing studies, 
dozens of high-confidence ASD risk genes have been discovered, out 
of an estimated target of 500-1000 genes underlying ASD risk [46]. 
Arecent analysis of all available de novo whole-exome sequencing and 
CNV data from >5000 families in the Autism Genome Project and the 
Simons Simplex Collection identified 65 ASD risk genes and six CNV 
regions [32]. Several online databases are updated regularly with the 
latest findings regarding ASD risk genes and loci, supported by evi- 
dence from ongoing sequencing and genotyping studies [47-50]. 

Based on currently available data, there is an important contri- 
bution to ASD risk from rare de novo and rare inherited variation. 
Rare genetic variants are believed to cause approximately 10-30% 
of ASD [32, 46, 51], and each individual variant can confer signifi- 
cant risk on its own, for example, making an individual 30-50 times 
more likely to develop ASD. Despite the fact that all of the ASD risk 
genes and loci to date have emerged from rare variant studies, the 
largest component of genetic risk in ASD is believed to derive from 
common genetic variants of additive effect. When considered as a 
whole, the contribution of common genetic variation is estimated 
between 15% and 50% [52-54]. Yet, the increase in ASD risk by an 
individual common variant is very small, on the order of 5-10%, and 
no common risk loci have yet been definitively identified. To find 
common variants of small effect, future efforts will need to study 
even larger cohorts than those included in studies to date. 


Stepping towards an understanding 





of ASD biology 


Now that there are clear, successful avenues for risk gene discovery 
in ASD, it is possible to begin asking whether identified genes are in- 
volved in common biological processes, whether they are expressed 
in particular brain regions, during certain developmental time 
periods, and whether their gene products are enriched in certain cell 
types. Already, we are seeing that ASD risk genes are more likely 
to be expressed in cortical pyramidal neurons during mid-fetal de- 
velopment [55, 56]. Furthermore, gene set enrichment analyses are 
showing that ASD risk genes and loci are converging onto a smaller 
number of biological processes, including chromatin remodelling, 
transcriptional regulation, synaptic functioning, Wnt and MAPK 
signalling, and interactions with the FMR1 (fragile X mental retard- 
ation 1) gene [24, 27, 30, 41, 42, 57]. In this way, variant discovery 
provides an important foothold for identifying risk genes and for 
beginning to understand ASD neurobiology. However, it is critically 
important to study the downstream consequences of the identified 
variants to refine our understanding of disease pathophysiology and 
pave the way for earlier diagnosis, therapeutic interventions, and 
novel treatments. 
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Importance of modelling ASD variants 


Once candidate genes have been identified, the most important and 
challenging next step is gaining an understanding of how disruption 
of these risk genes affects basic processes of nervous system develop- 
ment, resulting in behavioural dysfunction. That is, identification of 
ASD-associated genes has the potential to provide a critical window 
into neurobiological mechanisms underlying ASD. For example, 
some of the earliest insights into the neurobiology of ASD came from 
animal models of monogenic syndromes that are associated with an 
increased risk of ASD, including fragile X syndrome (FXS) and tu- 
berous sclerosis complex (TSC). Here, ‘knockout’ mouse models 
lacking the function of the genes that are causative for these syn- 
dromes provided the first evidence for pathophysiological mechan- 
isms underlying ASD and revealed possible pathways for therapeutic 
targeting. However, recent large-scale whole-exome sequencing 
efforts of affected individuals have led to a rapidly expanding list 
of ‘high-confidence’ ASD risk genes, which has made modelling 
ASD-associated variants increasingly challenging. While these risk 
genes represent seemingly disparate functions, ranging from ion 
channels and cell adhesion molecules to chromatin remodellers and 
transcription factors, there is emerging evidence that they converge 
on common mechanistic pathways such as synapse function, chro- 
matin modification, transcriptional regulation, Wnt signalling, and 
targets of the fragile X mental retardation protein (FMRP) [42, 57- 
60]. In this way, these studies have revealed pathways that are likely 
to play a key role in ASD. Moreover, to keep pace with the increased 
rate of gene discovery, scientists are now employing new cellular, 
molecular, and computational strategies, which allow for the sim- 
ultaneous analysis of multiple risk genes and are beginning to shed 
light on fundamental questions in ASD neurobiology. For example, 
these studies aim to elucidate the specific developmental stages and 
neuronal cell types, that is, when and where ASD risk gene func- 
tion is most critical for brain development. Another objective of 
functional studies is to leverage the growing number of risk genes 
to uncover neurobiological pathways that can serve as targets for 
novel pharmacotherapies. Therefore, recent studies modelling ASD- 
associated variants are playing a pivotal role in advancing the field 
from risk gene discovery to the elucidation of neurodevelopmental 
mechanisms and targeted pharmacological treatments. 


Insights from models of monogenic 





ASD-associated syndromes 


Early insights into the neurobiology of ASD came from animal 
models of monogenic syndromes that are associated with increased 
autism risk, as mentioned. FXS is the most common inherited cause 
of ASD and is a notable example of how modelling the loss of func- 
tion of a single gene can result in the identification of novel biological 
pathways and potential therapeutic targets [61]. Studies of mice 
lacking the function of the FMRI gene, which is disrupted in FXS, 
led to the metabotropic glutamatergic receptor (mGluR) theory of 
FXS. Specifically, mouse knockouts of FMRI were found to exhibit 
an increase in a form of synaptic plasticity—long-term depression 
(LTD), which is dependent on protein synthesis and mGluR acti- 
vation [62]. Therefore, excessive mGluR signalling and LTD were 
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proposed as a potential mechanism underlying abnormal structural 
and behavioural phenotypes in mice lacking FMR1 function [62], 
including altered dendritic spine structure in adult cortical neurons, 
hyperactivity, and learning deficits [63, 64]. Indeed, multiple studies 
have shown that inhibition of mGluR signalling is able to reverse 
physiological, structural, and behavioural abnormalities in animal 
models of FXS, ranging from the mouse to the fly [65-70] (reviewed 
in [71]), leading to clinical trials of mGluR antagonists [72]. While 
the efficacy of these agents has yet to be demonstrated in human 
patients with FXS, modifications in clinical trial design, such as ad- 
ministering the medication to younger patients for a longer duration 
and developing more precise efficacy measurements, may improve 
outcomes [73]. 

TSC, which is caused by heterozygous mutations in the TSC1 or 
TSC2 genes, is another example of a syndrome associated with an 
increased risk of ASD that led to the identification of an important 
signalling pathway with relevance to ASD. Specifically, these genes 
encode proteins that function as negative regulators of the mam- 
malian target of rapamycin (mTOR) pathway, which is involved in 
protein synthesis and cell proliferation [74]. This discovery led to 
the hypothesis that inhibitors of mTOR, such as rapamycin, might 
serve as potential therapeutic agents. Interestingly, rapamycin was 
found to reverse learning and social deficits in mouse models of TSC 
[75, 76], paving the way for clinical trials of this class of medications. 
In addition, mutations in the PTEN gene (Phosphatase and Tensin 
Homolog), which also negatively regulates mTOR, are associated 
with an increased risk of ASD and macrocephaly [77-80], providing 
further evidence for this pathway in the neurobiology of ASD. 
Rapamycin was also found to reverse macrocephaly and behavioural 
abnormalities, including anxiety and social deficits, in a conditional 
mouse knockout in which PTEN function was disrupted in post- 
mitotic neurons [81]. Clinical trials are currently under way to in- 
vestigate the effect of mTOR inhibitors on cognition in children and 
adolescents with PTEN mutations (http://www.clinicaltrials.gov). 

Other examples of monogenic syndromes associated with ASD 
that have shed light on relevant molecular mechanisms include Rett 
syndrome, caused by mutations in the MECP2 gene (Methyl-CpG 
binding domain-2), and neurofibromatosis type 1, caused by disrup- 
tion of the NF1 gene (Neurofibromin 1) [61]. In addition, with the in- 
creased rate of gene discovery in idiopathic ASD from whole-exome 
sequencing, together with more detailed clinical characterization of 
individuals carrying variants in specific risk genes, there is growing 
evidence that each of these ASD-associated genes may represent 
a subtype of ASD with shared clinical features [57]. For example, 
recent studies have suggested that the high-confidence ASD genes 
CHD8 (Chromodomain Helicase DNA Binding Protein 8), DYRK1A 
(Dual Specificity Tyrosine Phosphorylation Regulated Kinase 1A), 
POGZ (Pogo Transposable Element Derived with ZNF Domain), and 
GRIN2B (Glutamate Ionotropic Receptor NMDA Type Subunit 2B) 
may represent distinct ASD syndromes [82-85]. Therefore, animal 
models of each of these genes have the potential to reveal biological 
pathways that are uniquely disrupted due to gene loss and to identify 
specific pharmacological targets, providing a ‘personalized medicine’ 
approach to developing new treatments. For example, point muta- 
tions in the SHANK3 gene (SH3 and multiple ankyrin repeat domains 
3), which encodes a post-synaptic scaffolding protein at excitatory 
synapses, as well as deletions of the chromosome region 22q13.3, 
which includes this gene, are associated with Phelan-McDermid 


syndrome, which is characterized by ASD, intellectual disability, 
seizures, and dysmorphic features [86]. Physiological and motor 
deficits in mice in which SHA NK3 function was disrupted were res- 
cued by treatment with insulin-like growth factor 1 (IGF-1) [87], 
which promotes synaptogenesis [88]. IGF-1 was also found to re- 
verse deficits in excitatory signalling in an induced pluripotent stem 
cell (iPSC) model of Phelan-McDermid syndrome [89] and rescued 
abnormal phenotypes in mouse and iPSC models of Rett syndrome 
[90, 91]. These preclinical studies led to a pilot clinical trial of nine 
individuals with Phelan-McDermid syndrome, in which IGF-1 
showed some benefit in treating social deficits and repetitive behav- 
iours, though larger studies are needed [92]. 


Modelling ASD risk genes in animal systems 





Animal models offer distinct advantages for the functional analysis 
of ASD risk genes. Firstly, animal models allow for the investiga- 
tion of the role of risk genes in neural circuits in a live, behaving 
organism, which is not possible using in vitro approaches [93]. 
Secondly, risk gene function can be studied along a developmental 
trajectory, including embryonic stages in animal models [94], which 
is critical for understanding the role of ASD risk genes, given that 
many of these genes are expressed embryonically. Thirdly, animal 
knockouts provide a platform for identifying novel pharmacological 
pathways and studying the in vivo effects of drugs targeting these 
pathways [93]. Moreover, emerging cellular and molecular tech- 
nologies are expanding the range of scientific questions that can be 
addressed using animal models. For example, a recently developed 
method CLARITY allows for high-resolution, three-dimensional 
imaging of intact mouse brains [95]. Other new techniques, such as 
optogenetics and genetically encoded calcium indicators, allow sci- 
entists to assess the circuit-level and behavioural effects of activating 
a subset of neurons (engineered to express a light-activated channel) 
[96-98] and to visualize neural activity in an awake, behaving 
animal [99, 100]. Further, CRISPR (clustered regularly interspaced 
short palindromic repeats)/Cas9 technology, a highly efficient and 
flexible method for generating targeted mutations [101], has revo- 
lutionized the ability to model ASD-associated mutations in both 
in vivo and in vitro systems. In addition, there is increasing interest 
in using smaller, less complex organisms, such as the zebrafish and 
Drosophila, to model ASD risk genes, given their experimental tract- 
ability for conducting large-scale phenotyping analyses and high- 
throughput pharmacological screens [70, 93, 102]. While these 
systems are less conserved than mice, in comparison to humans, at 
the genetic and structural levels, it is anticipated that future studies 
will capitalize on the advantages of multiple animal models for the 
functional analysis of ASD genes. 

At the same time, there are clear limitations to modelling ASD- 
associated genes in animal systems. Firstly, it is impossible to fully 
recapitulate human behaviours in animals, given the limits of face 
validity [93, 94, 103]. Secondly, pharmacological candidates iden- 
tified in animal models may not translate to effective treatments 
in humans. The example of mGluR antagonists in FXS highlights 
these challenges, including identifying the optimal age, duration of 
medication administration, and outcome measurements in human 
studies, which are not easily translated from preclinical studies 
[73]. Thirdly, animal systems cannot fully recapitulate the genetic 


diversity of an individual, which may shape clinical presentation. 
Despite these drawbacks, animal models provide an important in 
vivo approach for elucidating the role of ASD risk genes in basic 
mechanisms of neurodevelopment and circuit-level function. 


Human induced pluripotent stem cell modelling 





of ASD risk genes 


Another approach to modelling ASD-associated mutations is the 
use of human iPSCs. The scientific basis for iPSC generation was 
reported in a landmark study which demonstrated that adult human 
dermal fibroblasts could be reprogrammed into a pluripotent 
state by four transcription factors: Oct3/4, Sox2, Klf4, and c-Myc 
[104]. These cells could, in turn, be differentiated into any cell type, 
including neural cells. Protocols have been developed to differen- 
tiate iPSCs into specific neural populations, including electrically 
active GABAergic, glutamatergic, and dopaminergic neurons; astro- 
cytes; and oligodendrocytes [105, 106]. This approach also enables 
the analysis of patient-derived cells during the course of neural dif- 
ferentiation, from iPSCs to neural progenitor cells (NPCs) to ma- 
ture neurons. In addition, a series of assays can be performed on 
these cells, including transcriptional profiling studies; analyses of 
neuronal morphology; synapse formation and function; and neur- 
onal migration, differentiation, and proliferation—providing a rich 
database for assessing the effect of patient mutations on cellular 
and physiological phenotypes. Further, neural cells can be grown 
as three-dimensional organoids, which, to some extent, can recap- 
itulate features of early human brain development and allow for 
the analysis of simple network activity [105, 107]. To model ASD- 
associated mutations using iPSCs, scientists have adopted two main 
experimental strategies: (1) generating iPSCs from cells obtained 
from individuals with ASD carrying a mutation in a particular risk 
gene; and (2) genetically engineering iPSCs generated from the cells 
of an unaffected individual using the CRISPR/Cas9 system to intro- 
duce a mutation identified in an individual with ASD. The former 
approach captures the genetic background of the patient, which is 
highly relevant to ASD, where clinical presentation may be influ- 
enced by a combination of rare de novo and transmitted variants, 
along with common variants, while the latter method allows sci- 
entists to compare ASD-associated mutations in an isogenic back- 
ground, providing a readout of the effects of the variants alone on 
cellular phenotypes. Another approach utilizes morphological ana- 
lysis and transcriptional profiling of iPSCs generated from indi- 
viduals with idiopathic ASD (unknown aetiology), which has also 
provided insights into neurobiological mechanisms in ASD [108]. 
There are several notable advantages of using iPSCs for the func- 
tional analysis of ASD risk genes. Firstly, unlike animal models, 
iPSCs allow for the analysis of cellular phenotypes in the genetic 
background ofa patient, which can play a critical role in shaping the 
clinical presentation [109]. Secondly, multiple sequence variants or 
large CNVs found in an individual with ASD are difficult to model 
in animals but are present in patient-derived iPSCs [110]. Thirdly, 
the use of iPSCs offers key advantages over studies of post-mortem 
tissue [110, 111], which can be confounded by environmental or 
treatment effects over the lifetime of an individual and cannot pro- 
vide electrophysiological or other functional information. Fourthly, 
iPSCs are amenable to high-throughput pharmacological screens 


CHAPTER 28 Genetics of autism spectrum disorders 


to identify compounds that reverse cellular or electrophysiological 
phenotypes associated with a particular variant, which can com- 
plement drug screens in animal models where the same risk gene 
has been disrupted. Consistent with the goals of personalized medi- 
cine, patient-derived iPSCs provide an individualized platform for 
the discovery of potential new treatments. Nonetheless, limitations 
of iPSC modelling include the inability to assess the effect of muta- 
tions on circuit-level or behavioural phenotypes in an intact nervous 
system. In addition, generating iPSCs, NPCs, and mature neurons is 
a time-intensive process, such that it may take 2 months to repro- 
gramme source cells into iPSCs and differentiate iPSCs into NPCs, 
and 3-4 months to generate neural cells. Further, because repro- 
gramming cells involves inducing genome-wide epigenetic changes 
[112], it is important to analyse epigenetic patterns before and after 
iPSC generation, though iPSCs from individuals with Angelman 
and Prader-Willi syndromes were found to retain the appropriate 
patterns of DNA methylation [113]. Taken together, iPSC model- 
ling of ASD-associated variants has enabled scientists to, at least par- 
tially, overcome the challenge of obtaining living human brain tissue 
for study, revolutionizing the functional analysis of ASD risk genes 
at the cellular and molecular levels. 


ASD risk gene network analysis 





While ASD risk genes may be associated with distinct clinical fea- 
tures that define subtypes of the disorder [57], there is evidence that 
many of these genes converge on common neurodevelopmental 
pathways that provide critical insights into ASD biology, as men- 
tioned [42, 57-60]. Despite the important mechanistic insights 
gained by modelling individual risk genes, one at a time, in animal 
or iPSC models, given the rapid rate of ASD gene discovery, this 
single-gene approach is becoming increasingly challenging. Here, 
advances in computational tools, which allow for the simultan- 
eous analysis of hundreds of ASD-associated genes using a sys- 
tems or network-based approach, are beginning to illuminate 
common neurodevelopmental mechanisms across these genes [60]. 
Specifically, studies have harnessed the BrainSpan transcriptome 
data set, which includes human gene expression data from the brains 
of 57 typically developing individuals and encompasses multiple 
anatomical regions and a range of developmental stages, from early 
fetal stage to late adulthood [114, 115], to elucidate when and where 
ASD risk genes might play critical roles in the developing brain [60]. 
In one study, researchers constructed gene co-expression networks 
around high-confidence ASD genes utilizing the BrainSpan data set 
and assessed these networks for enrichment of an additional set of 
probable ASD-associated genes, identifying mid-fetal glutamatergic 
projection neurons in cortical layers V-VI as a point of convergence 
[56]. Another study used weighted gene co-expression network ana- 
lysis to construct co-expression networks using the BrainSpan data 
set and mapped ASD-associated genes onto these networks [116]. 
This study found enrichment of ASD risk genes in glutamatergic pro- 
jection neurons in superficial cortical layers II-IV, though a weaker 
enrichment signal was also detected in deep layers [116]. Therefore, 
these types of analyses provide a spatiotemporal readout of when 
and where ASD genes are likely to function in brain development, 
which is expected to inform future functional studies of these genes 
in model systems [60]. Taken together, network-based approaches, 
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Fig. 28.1 Schematic representation of the pathway from the identification of risk genes to the discovery of biological mechanisms in ASD. Risk 

genes associated with ASD are identified by a variety of techniques: genome-wide association studies, linkage analysis, chromosomal microarray (to 
identify CNVs), and whole-exome/genome sequencing (to identify sequence variants). Functional analysis aims to bridge genetics and neurobiology 
by determining the impact of variants on biological systems through in vivo and in vitro approaches such as animal model systems and induced 
pluripotent stem cells. Network analysis aims to accomplish this through computational approaches such as gene co-expression studies. The discovery 
of biological mechanisms is expected to identify novel targets for pharmacological intervention in ASD, the ultimate goal of this research. 


along with in vivo and in vitro models of ASD-associated genes, are 
anticipated to advance our understanding of the basic neurobiology 
of ASD and provide a path towards the development of mechanism- 
based treatments. Fig. 28.1 summarizes the pathway from the iden- 
tification of risk genes to the discovery of biological mechanisms 
in ASD. 


Clinical genetics testing in ASD 


Although the genetic contribution to ASD is well established, there 
has been much debate over the years as to the most appropriate la- 
boratory tests to pursue in individuals with ASD. Clinical genetics 
testing is an important component of the medical evaluation of all 
patients with a diagnosis on the spectrum, from severely to mildly 
impaired, for a number of reasons. As has been described, there is 
substantial overlap between ASD and some genetic syndromes such 
as FXS and TSC. The identification of a known genetic syndrome 
allows the clinician to provide anticipatory guidance regarding 


medical and developmental trajectories to caregivers. This is espe- 
cially critical if there is additional organ involvement. For example, 
FXS can also cause vision problems, such as strabismus and ambly- 
opia, and hypotonia. The benign tumours which characterize TSC 
can disrupt the function of not only the brain, but also multiple other 
vital organs such as the heart, lungs, liver, and kidneys. Establishing a 
genetic aetiology can help in obtaining needed care and services and 
avoid the pursuit of further, unnecessary diagnostic tests. Another 
important reason for clinical genetic testing is that the identification 
of variants and whether they are de novo or inherited through the 
testing of family members can have implications for genetic counsel- 
ling. The parents of an affected child may want to know their chances 
of having subsequent affected children, and aunts and uncles may 
want to know if a variant is being passed through a family. 

The American College of Medical Genetics and Genomics 
(ACMG) has provided the most detailed guidelines for clinical gen- 
etics testing in ASD [117]. These guidelines are structured as two 
tiers of tests, with the first tier expected to have higher diagnostic 
yield. Iffirst-tier tests are negative, second-tier tests may be pursued. 


After completion of a three-generation family history to identify a 
potential mode of inheritance for the disorder, clinical geneticists 
should determine if a known genetic syndrome is present, guided 
by a thorough history and physical examination which includes a 
detailed dysmorphology exam. If a specific syndrome or metabolic 
disorder is suspected, targeted testing should be performed. Ifa syn- 
drome that is firmly associated with ASD is diagnosed, the ACMG 
does not consider further testing necessary. These syndromes include 
22q11.2 deletion, Angelman, CHARGE, FXS, Prader-Willi, Rett, 
and TSC, among others [117]. In the absence of a known syndrome, 
chromosomal microarray analysis (CMA) should be obtained. 
Either through array-comparative genomic hybridization (aCGH) 
or an SNP array, CMA will detect chromosomal deletions and du- 
plications. This test has replaced karyotypes since the International 
Standard Cytogenomic Array Consortium has concluded that CMA 
has a higher yield than karyotyping [118]. DNA testing for FXS is re- 
commended for all male patients and for those female patients with 
clinical features of FXS or a positive family history. If all first-tier 
tests are negative, clinical geneticists can consider specific gene tests, 
such as looking for variants in MECP2 and PTEN, and brain mag- 
netic resonance imaging (MRI) if indicators are present. 

The ACMG estimates that, for idiopathic ASD, a minimal clin- 
ical genetics evaluation consisting of CMA and FXS testing will have 
a combined diagnostic yield of approximately 10-15%. A compre- 
hensive evaluation, guided by the two-tiered approach, is estimated 
to identify an aetiology in 30-40% of individuals with ASD [117]. 
Although the ACMG does not currently recommend specific gene 
testing in idiopathic ASD other than for FXS, MECP2, and PTEN, it 
is expected that ASD gene panels will become increasingly valid as 
researchers continue to dissect the genetic aetiology of ASD, as dis- 
cussed in this chapter. 
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Autism spectrum disorders (ASD) encompass a group of life-long 
neurodevelopmental conditions that are characterized by deficits 
in social communication, social reciprocity, and repetitive and 
stereotyped behaviours and interests [1]. ASD are typically diag- 
nosed during early childhood using behavioural observations and/ 
or clinical interviews, with clinical symptoms persisting across the 
human lifespan in roughly 90% of all cases [2]. Over the last decade, 
prevalence estimates for ASD have consistently been rising—from 
about 2-4 individuals per 10,000 children in the 1960s and 1970s 
to about 1 in 68 children in 2014 (that is, roughly 1.4% of the gen- 
eral population) [3]. While this increase in prevalence may partially 
be due to modifications to diagnostic criteria, improved assessment 
tools, and/or increased awareness, the rise in prevalence also high- 
lights the growing need for developing novel pharmacotherapies 
and behavioural interventions designed to effectively target the core 
symptoms of ASD. Yet, developing treatment and interventions for 
ASD remains a challenge. ASD is accompanied by a large degree of 
inter-individual variability, both phenotypic and causative, which 
has so far hampered large-scale clinical trials. For example, genetic 
studies implicate more than 100 different genetic and genomic loci 
in ASD [4], which, in turn, interact with environmental factors to 
give rise to complex behavioural phenotypes. Consequently, there 
is a large degree of heterogeneity in the time course, severity, and 
profile of symptoms expressed by ASD individuals. Current research 
efforts are thus directed towards disentangling the large degree of 
complexity associated with ASD, and to establish homogenous 
subgroups or ‘strata of individuals with a common genetic and 
(neuro)biological make-up. Here, neuroimaging studies employing 
magnetic resonance imaging (MRI) are absolutely crucial, as they 
offer unique insights into the anatomy and functioning of the brain 
in vivo. 

In this chapter, we review and critically discuss ‘state-of-the-art’ 
in vivo neuroimaging findings examining ASD, with a particular 
focus on studies investigating brain structure and connectivity. 
Firstly, we will review the findings of neuroimaging studies exam- 
ining early brain development in ASD. These studies predominantly 
focus on the neurodevelopmental trajectory of global brain meas- 
ures that may underpin regional differences in brain anatomy typ- 
ically observed during late childhood and adolescence in ASD. We 











will then examine how the atypical developmental trajectory during 
early brain development may lead to differences in brain connect- 
ivity in ASD, which are accompanied by atypical patterns of cortical 
gyrification. Last, we will present the findings of a growing number 
of neuroimaging studies examining differences in the neurobiology 
of ASD between men and women. 














It is well established that ASD is accompanied by an atypical devel- 
opment of the brain and developmental perturbations to the forma- 
tion of the brain’s micro- and macro-circuitry. Early cross-sectional 
studies show that the brain of toddlers with ASD is—on average— 
larger in total volume than the brain of typically developing (TD) 
children between the ages of 2-4 years [5]. This increase in total 
brain volume is paralleled by an increase in head circumference, 
which has been reported in several cross-sectional and longitu- 
dinal studies (for example, [6]). The early enlargement of the brain 
seems to disappear between 5 and 6 years of age, when develop- 
mental trajectories of ASD individuals and TD controls intersect 
[7]. After this age, no significant increase in total brain volume is 
typically reported in ASD. Taken together, these early findings led 
to the suggestion that the brain in ASD may undergo an atypical 
trajectory of maturation, which is characterized by a period of in- 
creased growth during toddlerhood, followed by a period of reduced 
or arrested growth during childhood, and possibly a period of accel- 
erated decline in total brain volume over the remaining lifespan [8] 
(Fig. 29.1). These early cross-sectional studies have been confirmed 
by more recent longitudinal studies of brain development in ASD. 
For example, a recent investigation by Lange et al. (2015) examined 
atypical brain development in ASD from early childhood into ado- 
lescence and found: (1) significantly increased brain volumes in 
young children with ASD; (2) intersecting growth curves between 
10 and 15 years of age; and (3) significantly decreased brain volume 
during adolescence [9]. While there is some variability with regard 
to the reported age range, when growth curves intersect, these find- 
ings suggest that the size of the brain in ASD lies within the ‘normal’ 
range during early adolescence, when between-group differences 
in total brain volume are minimal. However, how this relates to the 
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Fig. 29.1 Differences in the neurodevelopmental trajectory of brain 
maturation in ASD individuals and TD controls across the human 
lifespan. 





severity of autistic symptoms is currently unknown. Furthermore, 
despite the absence of significant between-group differences in total 
volume around adolescence, it is likely that the atypical develop- 
mental trajectory of brain maturation prior to this time interferes 
with the development of large-scale neural systems, which reflects 
regional, rather than global, differences in ASD. 

Typical brain development occurs in a highly ‘orchestrated’ 
fashion where the development of phylogenetically younger cor- 
tical areas (for example, higher-order association areas in frontal 
and temporal lobes) builds on the development of phylogenetic- 
ally older brain regions (for example, occipital and parietal regions) 
[10]. However, evidence suggests that frontal-temporal regions 
mature earlier and faster in ASD individuals than in TD controls 
[11]. These early deviations from the typically synchronized devel- 
opmental trajectory would therefore also be expected to impact on 
the development of brain connectivity. For example, a study by Wolff 
et al. (2012) examined structural brain connectivity in 6-month-old 
infants at high genetic (that is, familial) risk of ASD (for example, 
siblings of individuals with a confirmed diagnosis of ASD). In high- 
risk infants who receive a diagnosis of ASD at the age of 2 years, 
perturbations in the development of white matter fibre tracts are al- 
ready visible at 6 months, relative to low-risk infants without ASD 
[12]. Moreover, a more recent study by the same research group also 
reported a significant increase in area and thickness of the corpus 
callosum, which structurally connects the two hemispheres of the 
brain, in infants with ASD scanned at 6 months of age [13]. Taken 
together, these studies suggest that the early atypical development 
of the cortical grey matter in ASD is accompanied by differences in 
white matter fibre tracts that can be observed as early as 6 months 
of age—that is, before the first symptoms typically manifest (around 
1-2 years of age). Yet, there remains some debate on the particular 
time point during early development when grey matter differences 
first arise in ASD. For example, no significant increase in head cir- 
cumference has been reported in ASD during the first year of life 
[14]. There are two studies reporting an enlargement of the brain 
in 2-year olds [15, 16], but direct evidence of a significant enlarge- 
ment of the brain during the first year of life remains missing. It thus 


remains unclear whether the early brain overgrowth in toddlers with 
ASD reflects an ongoing neurodevelopmental process, which is ac- 
companied by an accelerated rate of growth during early childhood, 
or—alternatively—constitutes the end result of an early develop- 
mental perturbation prior to the age of 2 years. 


Early brain overgrowth in ASD—a biomarker or 





simply ‘the tip of the iceberg’? 


Despite the large body of evidence supporting the original reports 
of an early overgrowth of the brain in ASD, it remains contentious 
how relevant macrocephaly is for ASD and whether an early increase 
in total brain volume may potentially be used as an early biomarker 
for the condition. For instance, while the brain in ASD may—on 
average—be larger than the brain in TD individuals, macrocephaly 
(that is, an increase in head circumference of >2 standard deviations 
above the average for that age, or larger than the 98th percentile) 
is only observed in about 20% of all individuals with ASD across 
studies [14]. This implies that in about 80% of ASD individuals, sig- 
nificant clinical enlargement of the head or brain is absent. Thus, 
early markers of total brain volume or head circumference may not 
reflect a neurobiological mechanism that is common to all individ- 
uals on the ASD spectrum. As pointed out by Lainhart et al. (2006), 
it may therefore be that the early brain overgrowth occurs in a spe- 
cific neurobiological subtype (or ‘stratum’) of ASD individuals only 
and may hence be used for stratification, rather than for diagnostic 
purposes. Alternatively, macrocephaly may represent the ‘tip of the 
iceberg; that is a general tendency towards an increased head and 
brain size in ASD individuals [17]. In this case, an increase in head 
size may be indicative of the risk for ASD, rather than of the con- 
dition itself. A number of recent studies also call for a revision of 
population norms, which are used to define what is clinically ‘ab- 
normal. For example, a study by Raznahan et al. (2013) suggested 
that the increased head circumference in ASD might reflect a bias 
in population norm, rather than a replicable pattern of dysregulated 
brain growth [18], particularly when examining patient populations 
in the mental health setting. In future investigations, it will thus be 
of importance to account for variations in population norms, which 
may affect the size of expected effects when comparing psychiatric 
populations with normative populations. 

Head circumference is also correlated with various epidemio- 
logical measures such as biological sex, age, weight and height, and 
genetic ancestry. Such non-ASD measures may therefore also im- 
pact on population norms, and hence significant between-group 
differences, and so need to be accounted for in the statistical model 
[19]. Based on these findings, there is also some evidence to suggest 
that the early brain overgrowth in ASD may reflect an early pattern 
of general physical (that is, somatic) overgrowth, rather than being 
specific to the brain, and particularly in boys with the condition [20]. 
Taken together, these studies highlight the importance of consid- 
ering wider contextual issues when interpreting the findings of trad- 
itional studies reporting early brain overgrowth in ASD. Overall, 
therefore, it seems that measures of total brain volume and/or head 
size may offer neither sufficient sensitivity (that is, ability to detect 
an effect or disease if it is present) nor specificity (that is, ability to 
detect the absence of an effect or disease ifit is not present) to be em- 
ployed as a biomarker for ASD. 


Brain structure and functioning across later 


childhood and adolescence in ASD 





While the neurodevelopmental trajectory of brain maturation 
during early development is well characterized in ASD, less is known 
about the trajectory of brain development during late childhood and 
adolescence. In contrast to the differences in total brain volume ob- 
served during early life, brain development during later childhood 
and adolescence in ASD seems to be dominated by an accelerated 
age-related decline in volume. Typically, the pre-pubertal increase 
in grey matter volume is followed by a post-pubertal loss, resulting 
in an inverted U-shaped trajectory across the human lifespan [21]. 
There is evidence to suggest that the typical attrition of grey matter 
volume after reaching its peak during early adolescence is acceler- 
ated in ASD individuals during adolescence and adulthood. For ex- 
ample, the longitudinal study by Lange et al. (2015) noted previously 
found an accelerated decrease in total grey matter in ASD individ- 
uals after the age of approximately 15 years, which was accompanied 
by a reduced growth rate of total white matter [9]. These findings 
are in agreement with a previous longitudinal study by Hardan et al. 
(2009) in a smaller sample, who also reported a significantly greater 
decrease in total grey matter volume during late childhood and early 
adolescence in ASD 8- to 12-year olds over a period of 30 months 
[22]. Notably, the study by Hardan et al. (2009) also reported that 
the accelerated decrease in total grey matter volume is accompanied 
by significant reductions in cortical thickness. This is of importance, 
as total grey matter volume is, by definition, a product of two sep- 
arate neuroanatomical features, namely cortical thickness and sur- 
face area, both of which: (1) are mediated by different sets of genes; 
(2) have a different phylogeny; and (3) represent distinct aspects of 
the cortical architecture (for example, [23]). Accelerated age-related 
cortical thinning has also been reported by cross-sectional studies 
investigating regional, rather than global, differences (for example, 
in temporal and parietal brain regions) [24, 25]. The age-related 
decrease in cortical thickness also seems to be accompanied by a 
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decrease in surface area. For instance, a more rapid decline in sur- 
face area in ASD with increasing age has been reported on the global 
[26], as well as regional, level [27]. Taken together, these findings 
suggest that the trajectory of brain maturation in ASD differs across 
distinct stages of development, which are dominated by an acceler- 
ated increase in volume, followed by an accelerated age-related de- 
cline in adulthood (Fig. 29.1). 

Moreover, while neuroimaging studies investigating early brain 
development in ASD have predominantly focused on age-related 
differences in total brain volume, studies examining the neurobio- 
logical underpinnings of ASD during late childhood and/or ado- 
lescence mostly focus on regional differences in brain anatomy 
and functioning. Using more ‘fine-grained’ techniques such as 
voxel-based morphometry (VBM) or functional magnetic res- 
onance imaging (fMRI), it is also possible to link specific autistic 
symptoms and traits to neuroanatomical or functional abnormal- 
ities in individual brain regions. Overall, the core components of 
wider neurocognitive systems that mediate the cluster of behav- 
ioural symptoms typically observed in ASD are well established. 
By large, these include: (1) fronto-temporal and fronto-parietal re- 
gions; (2) the amygdala-hippocampal complex; (3) the cerebellum 
and basal ganglia; and (4) the anterior and posterior cingulate cor- 
tices (for a review, see [28]) (Fig. 29.2). As might be expected, many 
of these brain regions include the set of brain regions that mediate 
functions related to social cognition and emotional processing, 
which are also commonly termed the ‘social’ and ‘emotional’ brain 
[29]. The core components of the ‘emotional brain’ include: (1) sub- 
cortical regions such as the amygdala, nucleus accumbens, and 
hypothalamus; and (2) cortical regions such as the orbitofrontal 
cortex (OFC), the anterior cingulate cortex (ACC), and the medial 
prefrontal cortex (mPFC) (reviewed in [30]). Furthermore, the ex- 
tended neural systems mediating social-emotional processing in- 
clude the superior and anterior temporal lobes (STL and ATL), the 
posterior cingulate cortex (PCC), the temporo-parietal junction 
(TPJ), the somatosensory cortex, and the intraparietal sulcus (see 
also [29]). Many of these regions have also been found to be atypical 








Fig. 29.2 The neurocognitive systems underlying ASD. Core ASD regions (orange areas) include the ventromedial prefrontal cortex (VMPFC), the 
orbitofrontal cortex (OFC), the superior temporal gyrus (STS), the amygdala, and the anterior cingulate cortex (ACC). Extended ASD regions (green 
areas) include the inferior frontal gyrus (IFG), the posterior parietal cortex (PPC), the cerebellum, the hippocampus, the fusiform gyrus, the pons (PN), 
the basal ganglia (BG), the thalamus, and the supplementary motor area (SMA). 

Adapted from Trends Neurosci., 31(3), Amaral DG, Schumann CM, Nordahl CW, Neuroanatomy of autism, pp. 137-45, Copyright (2008), with permission from Elsevier Ltd. 
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in ASD not only structurally (as described previously), but also 
functionally. For example, fMRI studies examining deficits in theory 
of mind [31], face processing [32], perception of biological motion 
[33], self-referential cognition, and empathy [34] in ASD have re- 
ported functional abnormalities in many of the regions that mediate 
social and emotional processing. 

In addition, there is large degree of overlap between the neural 
systems implicated in ASD and the neural circuitry mediating 
obsessive-compulsive disorder (OCD), which highlights the close 
symptomatic link between ASD and OCD. For example, a study by 
Russell et al. (2005) showed that individuals with ASD and OCD 
had similar frequencies of obsessive-compulsive symptoms, des- 
pite at a lower level of symptom severity, with only somatic ob- 
sessions and repeating rituals being more common in the OCD 
group [35]. In OCD, obsessive-compulsive symptoms have been 
linked to the so-called cortico-striato-thalamo-cortical (CSTC) cir- 
cuitry (also known as the frontostriatal or corticostriatal model), 
which comprises the thalamus, the OFC, ACC, and striatum [36]. 
Neuroanatomical differences in these regions have also been re- 
ported in ASD. For instance, individuals with ASD show an increase 
in the rate of growth of the caudate nucleus, one of the two integral 
parts of the striatum, which was observed between 10 and 12 years 
of age [37]. Moreover, the increased rate of growth in the caudate 
nucleus was significantly correlated with the severity of repetitive 
behaviours in ASD, thus suggesting that the neurodevelopmental 
trajectory of the caudate nucleus may offer a neurobiological marker 
for stereotypic and repetitive behaviours in ASD [37]. Comparison 
between the neurobiological underpinnings of OCD and ASD 
shows that the cortical abnormalities observed in ASD may not be 
unique to, nor causal for, the condition. Rather, similarities and/or 
differences in neurobiological phenotypes may be associated with 
similarities/differences in clinical symptom profiles. New research 
frameworks, such as the Research Domain Criteria (RDoC) initiative 
(http://www.nimh.nih.gov/research-priorities/rdoc/index.shtml) 
that considers mental disorders along dimensions of symptoms and 
neurobiological measures, rather than distinct categories, will thus 
be crucial for future investigations aimed at establishing distinct and 
common genetic or molecular pathways across disorders. 


Atypical brain connectivity in ASD 


The early perturbations to the developmental trajectory of brain mat- 
uration in ASD are not only likely to yield anatomical and functional 
differences in isolated brain regions, but have also been suggested 
to interfere with the formation of the brains neurocircuitry— 
commonly known as the human ‘connectome’ [38]. However, the 
notion of atypical brain connectivity in ASD is complex, involving 
abnormalities with the grey and white matter (for example, [39]), 
and different levels of integration [40]. Consequently, there is sig- 
nificant heterogeneity in analytical frameworks used to examine 
brain connectivity in ASD, and in reported results. Overall, how- 
ever, connectivity analysis methods may be subdivided into: (1) ap- 
proaches aimed at investigating structural brain connectivity; and 
(2) approaches aimed at examining functional brain connectivity 
(Fig. 29.3). 

Structural connectivity (that is, the existence and degree of struc- 
tural white matter connection in the brain) has so far mostly been 


examined using voxel-wise analysis of the white matter and dif- 
fusion tensor imaging (DTI). For instance, VBM studies showed 
that individuals with ASD have spatially distributed reductions in 
regional white matter volume during childhood [41], adolescence 
[42], and adulthood [43]. The results of such voxel-wise analyses 
are, however, inherently difficult to interpret due to methodological 
limitations (for example, issues related to image registration and/or 
segmentation) and cannot also unequivocally be linked to particular 
fibre tracks. For localization purposes, techniques such as DTI are 
better suited, as they allow the three-dimensional reconstruction of 
particular fibre tracks in order to examine tract-specific measures 
of structural white matter connectivity. In ASD, DTI studies have 
reported neuroanatomical differences in a number of white matter 
tracts that include the corpus callosum [44], the arcuate fasciculus 
[45], the limbic pathways [46], and the amygdala-hippocampal 
complex [47]. A number of DTI studies have also reported differ- 
ences in the basal ganglia [48], the cerebellum [49], and the cortico- 
spinal tract [50]. In many of these tracts, reports point towards a 
reduced degree of structural white matter connectivity in ASD, and 
this contributed to the notion of ASD being a ‘neurodevelopmental 
disconnection’ syndrome [51] that is characterized by deficits in 
global (that is, large-scale) brain connectivity. Such deficits in con- 
nectivity have also been observed on the functional level. 
Functional connectivity is generally defined as the degree of ‘tem- 
poral coherence’ between two neurophysiological signals and can be 
quantified, based on correlations between fMRI or EEG time series 
acquired in the presence of a stimulation paradigm or while the brain 
is ‘at rest. Early {MRI studies examining functional connectivity 
during task performance suggested that the brain of individuals with 
ASD may exhibit a reduced degree of long-distance functional con- 
nectivity, and this has been reported during executive functioning 
[52], sentence comprehension [53], and working memory tasks [54]. 
However, more recent studies show that patterns of both hypo- and 
hyperconnectivity can be observed when combining findings across 
studies [55]. Taken together, this indicates: (1) either that findings are 
inconsistent across studies; or (2) that the concept of atypical brain 
connectivity is more complex than originally assumed. A similar 
picture of mixed results comes from studies examining resting-state 
fMRI connectivity in ASD—with researchers arguing in favour of 
under-connectivity [56], over-connectivity (for example, [57]), or 
unique patterns of both under- and over-connectivity, depending 
on the particular set of brain regions examined [58]. It is important 
to note, however, that measures of functional connectivity are very 
sensitive to even minor methodological differences, and this most 
likely underpins a significant proportion of the variability in find- 
ings. For example, Thai et al. (2009) highlighted the influence of dif- 
ferences in activation response between ASD individuals and TD 
controls, which may affect the degree of functional connectivity ob- 
served within and across groups, in addition to the selection of par- 
ticular regions of interests and general constraints in temporal and 
spatial resolution [59]. Choices in some aspects of image processing 
can also affect the results [55]. Finally, there is evidence to suggest 
that the size (and sign) of the effects of connectivity analyses are age- 
dependent and may therefore vary as a function of the particular 
developmental stage under investigation [60]. Such methodological 
factors thus need to be accounted for in future studies examining 
functional connectivity in ASD. So far, however, the large variability 
in findings across studies has hampered the interpretability of the 
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Fig. 29.3 (see Colour Plate section) Schematic illustration of the underlying concepts behind atypical structural and functional brain connectivity 


in ASD. 


results, and a consistent overarching concept that can explain atyp- 
ical functional brain connectivity in ASD has yet to emerge. 


Atypical cortical gyrification in ASD—a retrospective 





window into early cortical expansion 


The notion of atypical brain connectivity has also been discussed 
in light of recent studies examining cortical gyrification in ASD. 
As the cortex expands during development, it eventually needs 
to fold to fit an increasing surface area into the restricted space of 
the skull. Based on the evidence of early brain overgrowth in ASD 
(see previous section), which seems to be driven by an accelerated 
expansion of the cortex [61], it is therefore likely that the brain in 
ASD not only differs in structure and connectivity, but also in shape 
(that is, geometry). More specifically, an accelerated and more pro- 
nounced expansion of the cortex is expected to be associated with 
a higher degree of cortical folding, as well as atypical patterns of 
cortical gyrification. Yet, studies investigating measures of cortical 
geometry, rather than volume, are rare, and the few existing studies 
have led to variable findings. Most of these studies are based on the 
so-called gyrification index (or GI) [62]. This measures the degree 


of cortical folding (Fig. 29.4). Using the GI, Hardan et al. (2004) 
examined gyrification in a coronal slice of the prefrontal cortex in 
ASD. They reported that children and adolescents with ASD showed 
greater prefrontal cortical folding than TD controls, while no signifi- 
cant between-group differences were observed in adults [63]. This 
early report thus agrees with the notion that early expansion of the 
cortical surface in ASD might result in increased cortical folding. 
The findings of more recent investigations examining cortical 
gyrification in ASD are, however, variable. Most of these recent 
studies utilized high-resolution surface reconstructions of the left 
and right hemispheres of the brain that can subsequently be used 
to estimate the degree of gyrification at each location on the cor- 
tical surface. For example, the so-called local gyrification index (/GI) 
measures gyrification at more than 300,000 different locations (that 
is, vertices) where the /GI at a given vertex is computed as the ratio 
between the surface of a circular patch on the outer smooth sur- 
face of the brain and the surface of the corresponding patch on the 
inner pial surface (Fig. 29.4). Hence, similar to the original GI, the 
IGI at a point reflects the amount of cortex buried within the sulcal 
folds in the surrounding area [64]. Studies utilizing the IGI have so 
far reported: (1) significant increases in gyrification in bilateral pos- 
terior brain regions in males with ASD, compared to TD controls 
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Fig. 29.4 (see Colour Plate section) Computation of the gyrification index (GI) as the ratio between the inner (i.e. pial) surface of the brain and the 
outer smooth contour. The GI can be calculated within brain slices in 2D (a) and at each location on the cortical surface in 3D (b). 


(12-23 years of age) [65], and significant reductions in the /GI (2) in 
the left supramarginal gyrus in males aged 8-40 years [66] and (3) in 
the right inferior frontal and medial parieto-occipital cortices in 
children with ASD [67]. Last, atypical cortical gyrification in ASD 
has been demonstrated, using a variety of alternative metrics of cor- 
tical folding, including measures of sulcal morphometry [68], sulcal 
depth [69], and gyral complexity [70]. Thus, there is considerable 
variability in findings reported by existing neuroimaging studies, 
with regard to both the degree of atypical gyrification and the par- 
ticular regional pattern of gyrification in ASD. 

While these divergent findings can be partially explained by differ- 
ences in sample size, participant demographics, and analytical tech- 
niques, evidence also suggests that patterns of cortical gyrification 
are highly variable across individuals, even in normative popula- 
tions, and that both genetic and non-genetic factors contribute to 
the formation of cortical gyri [71]. For example, Kates et al. (2009) 
examined the amount of GI concordance in monozygotic twin 
pairs, in which only one had a diagnosis of ASD, and in typically 
developing unrelated controls. It was found that cortical folding pat- 
terns across most brain regions are highly discordant within mono- 
zygotic twin pairs, although children with ASD and their co-twins 
both exhibited increased cortical folding in the right parietal lobe 
[72]. This finding is in contrast to conventional measures of brain 
anatomy (for example, total or regional brain volumes), which are 
highly concordant between twins and might thus be largely genetic- 
ally determined [73]. Moreover, cortical folding is not significantly 
correlated with total brain weight or volume, or with body weight 
and length, which are under strict genetic control [62]. Measures 
of gyrification thus seem to be particularly sensitive to environ- 
mental (that is, non-genetic) factors and reflect a degree of plasticity 
that is independent of overall brain size [74]. Thus, to elucidate the 


contribution of genetic and non-genetic factors to brain develop- 
ment in ASD, various cortical features, including those of cortical 
gyrification, should be examined to account for the large amount of 
phenotypic inter-individual variability typically noted between in- 
dividuals with ASD. 


Neuroimaging studies examining sex 


differences in ASD 





As mentioned in the introduction, ASD is accompanied by a large 
degree of inter-individual heterogeneity. Recent research efforts are 
thus directed towards establishing genetically and/or biologically 
homogenous subgroups of individuals that might, in turn, benefit 
from the same—or a similar—approach to treatment. One way of 
reducing the phenotypic heterogeneity associated with ASD is to 
stratify individuals by demographic characteristics such as age, 
intellectual ability, and also biological sex. An increasing number 
of studies are therefore now directed at establishing whether the 
neurobiology of ASD significantly differs between men and women. 
However, studies examining females with ASD are relatively rare. 
So far, most of our understanding of the neurobiological under- 
pinnings of ASD comes from studies examining male partici- 
pants exclusively. For instance, it is estimated that there are about 
eight structural, and 15 functional, neuroimaging studies in males 
for every one neuroimaging study examining females [75]. The 
overrepresentation of studies examining males is primarily due to 
the increased prevalence of ASD in males, with an average male- 
to-female prevalence ratio of approximately 4:1 [76]. Thus, females 
with ASD are relatively rare and hence more difficult to recruit than 
males. The reasons for the male-preponderant prevalence of ASD is 


largely unknown. However, it has been suggested that there are sex 
differences in the clinical phenotype of ASD, which may impede the 
diagnosis in females. For example, males with ASD have been re- 
ported to show more ‘externalizing’ behavioural problems (that is, 
actions directed towards the external environment) than females. 
Such externalizing behaviours include aggressive behaviour, hyper- 
activity, reduced pro-social behaviours, and repetitive/restricted be- 
haviours and interests [77]. Females with ASD, on the other hand, 
have been suggested to show more ‘internalizing’ behaviours such as 
anxiety and depression and/or other affective symptoms [78]. Also, 
females with ASD may have a greater ability to camouflage difficul- 
ties, particularly in the social domain [79]. Taken together, such dif- 
ferences in terms of the clinical phenotype of ASD may impact on 
our ability to diagnose the condition in females, which will, in turn, 
affect prevalence estimates. Consequently, little is currently known 
about the neurobiology of ASD in females. 

In terms of brain anatomy, some of the early neuroimaging studies 
suggest that the degree of cortical abnormality observed in girls with 
ASD may be significantly greater, as compared to boys. For instance, 
Bloss and Courchesne (2007) reported a larger deviation from same- 
sex TD controls among girls vs boys with ASD, in temporal grey and 
white matter volumes and in grey matter volume of the cerebellum 
[80]. Girls with ASD have also been reported to differ more robustly 
in amygdala volume from TD girls between the ages of 1 and 5 years, 
in comparison to the differences between boys with ASD and TD 
boy controls [81]. While there are also reports of a similar degree 
of cerebral enlargement for both boys and girls (for example, [82]), 
findings across studies suggest that girls may need to deviate more 
from their respective normative population than boys in order to 
demonstrate a clinical ASD phenotype. This observation also sup- 
ports aetiological models proposing that while females are likely to 
carry a higher genetic load for ASD than males, females may also 
have a higher ‘threshold’ (that is, minimum liability sufficient to 
cause ASD) for reaching affection status than males (reviewed in 
[83]). Alternatively, female-specific protective factors that may in- 
clude environmental, biological, and/or epigenetic mechanisms may 
counteract the genetic liability for ASD in females overall, so that a 
comparable threshold would affect a smaller proportion of females 
in comparison to males (see also [84]). In this case, one would ex- 
pect the cortical pathology of ASD to be significantly modulated by 
biological sex. 

There are few imaging studies to date that have examined the de- 
gree to which biological sex modulates the regional anatomy of the 
brain, and of those that are available, most are based on assessing 
sex-by-diagnosis interactions within a general linear model (GLM). 
For instance, Lai et al. (2013) performed a VBM analysis and found 
significant sex-by-diagnosis interactions in several white matter 
clusters that also showed a significant difference in white matter 
volume between males and females [85]. Notably, no significant sex- 
by-diagnosis interactions were reported when examining regional 
grey matter volume. Similarly, Beacher et al. (2012) reported signifi- 
cant sex-by-diagnosis interaction in: (1) total white matter volume; 
(2) regional grey matter volume of the right inferior parietal lobe; 
and (3) diffusion measures in the corpus callosum, cingulum, and 
corona radiata [86]. Last, a recent study examined local gyrification 
in males and females with ASD and TD controls—reporting that the 
degree of gyrification in the ventromedial/orbitofrontal cortices was 
significantly modulated by biological sex [64]. Overall, these studies 
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suggest that the neuroanatomy of ASD differs significantly between 
males and females and that biological sex attenuates the cortical 
pathology of ASD. Treating biological males and females as distinct 
genetic and/or biological subgroups may thus reduce the large de- 
gree of phenotypic heterogeneity that is characteristic for ASD and 
also help us better examine the neurobiological underpinnings that 
undermine risk and resilience for neurodevelopmental conditions. 


Over the last two decades, human neuroimaging studies have 
played a crucial role in establishing the cortical mechanisms that 
underlie the cluster of behavioural deficits typically observed in 
ASD. Most importantly, it is known from neuroimaging studies 
that ASD is accompanied by an atypical developmental trajectory of 
brain maturation, which leads to differences in brain anatomy and 
connectivity. In turn, insights arising from the examination of the 
brain in vivo can inform genetic and/or molecular investigations, 
to help elucidate the underlying genetic and/or molecular mechan- 
isms of ASD. This also demonstrates that the role of neuroimaging 
in mental health research is transitioning into being an integral part 
of ‘translational research, which integrates findings across multiple 
disciplines and translates them into the real-world clinical set- 
tings (that is, ‘from bench to bedside’). For instance, in the future, 
neuroimaging approaches may be used to assist (for example, in 
combination with clinical interview) the diagnosis/stratification of 
ASD into more biologically homogenous subytpes and/or to pre- 
dict response to treatment. Here, it will be crucial to shift the analyt- 
ical focus from the group-level statistical inference to the individual 
(that is, case) level—for example, using multivariate pattern clas- 
sification (MVPC) techniques (for example, [87]). Moreover, such 
techniques make it possible to take into account complex and multi- 
variate biological features and are hence particularly well suited to 
characterize conditions with a complex genetic and neurobiological 
architecture such as ASD. However, the application of these ap- 
proaches in the clinical setting remains a vision for the future, and 
there is currently a lack of studies validating models established in 
the research (that is, investigative) setting on independent clinical 
validation samples. If successful, however, these new approaches 
may one day prove invaluable in helping to diagnose, treat, and 
characterize ASD. 
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Autism spectrum disorder (ASD) is characterized by pervasive im- 
pairments in reciprocal social communication and restricted and re- 
petitive behaviours and interests (RRBIs). These characteristics are 
typically evident early in life, and while the course is variable, im- 
pairments persist across the life course. Diagnostic terminology has 
recently changed in the US DSM-5 [1], in which childhood autism, 
Asperger's syndrome, atypical autism, and pervasive developmental 
disorders are all subsumed under the umbrella category of ASD. 
This change occurred because of inconsistency in the use of different 
diagnostic subtypes within ASD [2] and also in recognition of the 
broad spectrum of severity, both across and within individuals over 
their life course. Life course variability in the manifestations of ASD 
may reflect changes emanating from within the individual, such as 
acquisition of language, or the implementation of environmental ac- 
commodations such as an ASD-oriented education or employment. 

While the diagnostic criteria for ASD are based on the core symp- 
toms of social communication and RRBIs, co-occurring disorders 
of development and mental health are the rule. In relation to cogni- 
tion, approximately half of those with ASD also have an intellectual 
disability (ID), as defined by an intelligence quotient (IQ) of below 
70 [3], and over half of those with normal non-verbal intelligence 
have a specific language impairment [4]. Motor co-ordination prob- 
lems are more loosely defined, but population studies indicate that 
three-quarters of children with ASD score in the impaired range 
on structured assessments [5]. Psychiatric disorders are also highly 
prevalent, with aggregate rates in the region of 70% [6-9]. A heter- 
ogenous range of medical conditions are also common in ASD, 
with about 15% of people having a causal medical/genetic disorder 
[10, 11] and a further proportion having coexisting, but not causal, 
conditions such as epilepsy. Finally, people with ASD have impair- 
ments in adaptive function, the competence to undertake a range 
of everyday activities, including personal care, communication 
with others, and organization, and these impairments exceed that 
expected on the basis of their IQ. This impacts significantly on the 
degree of independence achieved by people with ASD across the full 
IQ range. Hence, the comprehensive management and treatment of 





people with ASD requires a wide-ranging approach to a diverse set 
of conditions. 

Because ASD is a developmental disorder starting in early life 
but continuing throughout life, the manifestations often vary con- 
siderably and the goals of parents/carers and people with ASD 
often change over time in relation to treatment goals (Fig. 30.1). 
In early childhood, following diagnosis, there is often a focus on 
interventions that will minimize or eradicate core ASD symptoms, 
including the development of effective communication strategies 
and optimization of learning. In later childhood, co-occurring 
mental health problems are more obvious. The importance of 
peers and the challenges experienced in initiating and maintaining 
these relationships are more evident. During later adolescence 
and adulthood, concern may focus on improving adaptive func- 
tion and increasing independence (both typically lagging behind 
learning), and widening participation in education, employment, 
or community life. Although some behaviour problems may 
ameliorate, often anxiety depression become more prominent. In 
early childhood, ASD is often viewed by parents as a condition to 
‘eradicate’; however, many adults living with ASD embrace their 
autistic identity but want help with specific concerns, such as em- 
ployment and other aspects of the ‘neurotypical’ environment, and 
with managing their mental health [12]. This changing perspective 
means that clinicians and services need to be patient-centred and 
flexible in their approach, as well as help to shape realistic goals 
and targets. 

Many individuals with ASD, including those with average to 
above average intellectual ability, need support throughout their 
lives. The ethos in adult mental health services is to enable patient 
independence and ensure confidentiality. However, people with 
ASD are frequently anxious about health appointments and find it 
difficult to tolerate busy waiting rooms. Their difficulties with com- 
munication, emotional literacy, and perspective-taking can interfere 
with their ability to give a full and accurate account of their symp- 
toms and experiences. Their need for support in health care appoint- 
ments is recognized in the National Institute for Health and Care 
Excellence (NICE) guidelines for adults with ASD [13]. Throughout 
this chapter, we refer to ‘patients and their families, recognizing that 
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in childhood, this is typically parents and in adulthood, it may in- 
volve other family members or carers. 


General approach to treatment and 


management of ASD 





The focus of this chapter is on the management of both core symp- 
toms of ASD and co-occurring psychiatric conditions. These can be 
extremely variable, and a personalized approach is required. 


The autism team 


Because of the range of core and co-occurring symptoms, clinical 
input is needed from multiple disciplines within health, as well as 
from education, employment, social care, and the voluntary sector. 
Furthermore, as ASD is a chronic condition, services should rec- 
ognize that the needs of people with ASD are long term and that 
models of care developed for episodic conditions may not meet their 
needs. In England, therefore, the NICE has recommended that every 
local authority should have an ‘ASD team, which may be virtual, but 
where the relevant professionals can be brought together to consider 
and manage the needs of individual patients [14]. 

The diversity of needs mandates that health services are multi- 
disciplinary in their composition. When younger children are in- 
volved, paediatric input is essential to consider underlying and 
associated medical/genetic disorders. Neurological expertise should 
be available across the lifespan. From mid childhood onwards, psy- 
chiatrists and psychologists are core members of the autism team, 
in relation to the diagnosis and management of ASD and additional 
mental health problems. Speech and language therapists are also es- 
sential to work with ASD patients with varying levels of verbal com- 
munication. Occupational therapists play an important role in the 
management of motor impairments and the sensory atypicalities 
that are often distressing and impairing. Occupational therapists 
may also provide useful input in relation to employment. Usually 
outwith the clinical team, but essential to meeting the needs of pa- 
tients, are close links with education, including special education, 
and social care. 


Principles of intervention 
Building on strengths 


The cognitive profiles and learning styles of people with ASD, while 
diverse, are often different from those of non-autistic, typically 
developing people of the same age and ability level. Often these dif- 
ferences are viewed as impairments that will interfere with effective 
treatment, but a neurodiversity perspective would instead highlight 
the strengths to be enhanced and drawn upon. These may include 
good visuo-spatial memory, attention to detail, and willingness to 
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follow rules and instructions carefully. In developing an intervention 
plan, individual strengths should be systematically elicited through 
interviews, questionnaires, and cognitive assessment of the patient. 


Identifying and prioritizing intervention targets and goals 


ASD is a severe and pervasive disorder that affects every aspect of 
life. As discussed in the following sections, while there is emer- 
ging evidence that some interventions may improve core symp- 
toms, these are far from curative, and the overwhelming majority 
of people with ASD continue to experience significant impairment. 
Treatment of co-occurring mental health problems typically results 
in less amelioration than might be expected in other patient groups. 
The reasons for this reduction in effectiveness remain uncertain. 
One possibility is that co-occurring conditions have a different aeti- 
ology in ASD, and hence the target(s) or underpinning mechanisms 
for intervention are different. Another possibility, particularly for 
non-pharmacological treatments, is that people with ASD are not 
able to use the intervention in the way intended because of ASD- 
related differences such as cognitive strategies. Alternatively, it is 
possible that the true effect of such interventions is similar in people 
with ASD, but the effects are difficult to detect and measure in the 
context of their ASD. Regardless of the reason, a general observation 
is that people with ASD often remain significantly impaired in their 
functioning, following a course of evidence-based intervention. 
Therefore, clinicians should work with patients to set realistic goals. 

An additional challenge is the typical presentation of co-occurring 
conditions may mean that patients and their families have multiple 
treatment targets. The establishment of a sequence of priorities 
should consider which symptoms are most impairing, most likely to 
respond to interventions, and likely to have a knock-on impact on 
other symptoms. 


Monitoring of treatment benefits and adverse effects 


The co-occurrence of core ASD symptoms and additional mental 
health problems may hinder the ability of patients and family in- 
formants to identify improvement in target symptoms. A first step 
is psychoeducation about which of the patient’s symptoms are re- 
lated to core ASD symptoms and which to co-occurring disorders. 
Treatment planning should include a discussion of which symptoms 
are targeted by specific interventions. In monitoring treatment bene- 
fits, structured scales, whether standardized or bespoke, are helpful 
for both patients and clinicians. Autistic patients frequently find it 
useful to rate their symptoms on a Likert scale, which provides the 
clinician with quantitative information about improvement. A base- 
line measure should be obtained before initiating treatment. 
Similarly, structured and systematic monitoring for possible 
adverse effects is essential. The impairments experienced by ASD 
patients in communication, emotional literacy, and verbalizing 
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physical sensations may mean that they do not recognize these and 
volunteer this information at clinical reviews. At the same time, pa- 
tients with ASD may have more day-to-day variability, which can 
affect the assessment of benefit and also lead to misattribution of 
variability as adverse effects. The use of routine monitoring meas- 
ures can help to disentangle variability from treatment-related bene- 
fits and adverse effects. 


Specific considerations for pharmacological interventions 


Over and above the principles outlined, patients with ASD may be 
more sensitive to psychoactive medications. This means that they 
should be commenced on low doses, titrated more gradually to a 
therapeutic dose (which is sometimes lower than for other patient 
populations), and monitored more frequently. It is good practice in 
all patients to use as few medications as possible to manage symp- 
toms. This is even more the case in ASD where drug interactions 
could be more likely to precipitate adverse effects. Hence, every 
attempt should be made to optimize the dose of individual medi- 
cations. Notwithstanding this, many patients require multiple medi- 
cations and a distinction should be made between the use of multiple 
drugs for a single symptom complex (polypharmacy) and the need 
to use drugs targeting different, co-occurring symptoms, which may 
be essential to a comprehensive approach to management. 

Particularly in younger patients and those with ID, it can be dif- 
ficult to undertake routine medical investigations, such as blood 
tests, that are normally expected for the initiation and monitoring of 
medication. In discussion with parents or legal guardians, the clin- 
ician should make a judgement and recommendation about the rela- 
tive risks and benefits of an intervention when routine monitoring is 
difficult or impossible. It is also often possible to be opportunistic in 
obtaining investigations when the patient is undergoing other med- 
ical procedures. 


General considerations for psychological interventions 


As with other groups, patients with ASD and their families are keen 
to have access to psychological interventions. Clinicians are some- 
times concerned about the appropriateness of talking treatments 
with patients whose communication skills are impaired. However, 
strategies for adaptation, for example modified cognitive behav- 
ioural therapy (CBT), are now available and have been evaluated 
in clinical trials, with positive effects, as described in the following 
sections. Before commencing psychological work, it is helpful to 
undertake a cognitive assessment to gain a better understanding of 
the patient’s cognitive level. Clinicians may be misled by patients’ 
‘chattiness’ into assuming they have good receptive language when, 
in fact, their abilities are significantly compromised. The principles 
of adapted CBT for people with ASD include establishing emotional 
literacy, using visual materials to support the work, both in sessions 
and as part of the intervention, and additional practice in multiple 
settings to aid generalization. Autistic patients may continue to 
benefit into adult life from family members as co-therapists, as often 
used with younger children. 


Other non-pharmacological interventions 


Clinicians should be aware of the importance of other modalities 
of support or intervention. While there is increasing interest in 
neuromodulation, there is at present no evidence for its use in ASD. 
However, education for youth and appropriate supported (where 


necessary) employment for adults are among the most important 
ways of optimizing life chances and improving quality of life. The 
acquisition of educational levels and adaptive skills is often slower in 
people with ASD but continues into adult life. In recognition of this, 
many countries have legislated that people with ASD have the right 
to state-supported education past the age mandated for typically 
developing people. Clinicians often have a key role in highlighting 
elements of the educational or employment environment that will 
foster development and well-being. 





Specific treatments and interventions 


The following sections on treatment are organized by symptom 
domain, and within each section, interventions are considered ac- 
cording to type. Where evidence is stratified or limited by age, this 
is indicated. 


Treatment of core symptoms of ASD 


Although the diagnosis of an ASD requires the presence of both 
social communication impairments and restricted RRBIs, there is 
evidence that these domains are underpinned by separate, correl- 
ated genetic architecture [15]. Furthermore, the current evidence 
suggests that the effects of interventions are relatively specific. 
Psychological interventions have focused on social communica- 
tion and learning, with little evidence of transfer of effects to RRBIs. 
Conversely, the limited evidence for pharmacological interventions 
suggests that may be principally useful in managing RRBIs. 


Social communication 


At present, there is no good evidence that pharmacological interven- 
tions improve social communication, although a number are being 
investigated. While there is evidence suggesting secondary im- 
provement in social communication when co-occurring disorders, 
for example attention-deficit/hyperactivity disorder (ADHD) [16] 
or irritability [17, 18], are treated, this is likely to be mediated by 
improvement in the original target symptoms, rather than having a 
direct effect. They should not be used in the absence of the primary 
symptoms. 

There is emerging evidence for psychological interventions aimed 
at improving social communication in young children, from tod- 
dlers through the early school years. In evaluating the evidence in 
relation to psychological interventions, it is important to focus on 
studies with objective and/or blinded outcome measures, as un- 
blinded parent reports are subject to bias and placebo effects [19]. In 
relation to parent-mediated interventions, a randomized controlled 
trial (RCT) in the UK that aimed to improve parental sensitivity to 
their children’s behaviour failed to find a significant effect on child 
social communication at the end of the trial but did find an effect 
on parental sensitivity [20]. Interestingly, a follow-up 5-6 years 
later demonstrated significant benefit for the treatment group on 
a blinded outcome measure of social communication (but not lan- 
guage level) [21]. Several RCTs in the United States have reported 
similar findings [22-24]. 

There is also emerging evidence for the benefits of structured 
social skills programmes, whether parent-mediated [25-27] or 
school-based [28]. This evidence applies from early to mid child- 
hood, for example age 5-6 years, through to adolescence. 
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Applied behavioural analysis (ABA) is a popular approach with a 
focus on intensive behavioural work to help autistic children acquire 
specific skills. Early evaluation suggested that while specific skills 
might be learnt, the intervention did not generalize to meaningful 
improvements in ASD [29]. In line with the early findings, a struc- 
tured variation—the Early Start Denver Model (ESDM)—has been 
more carefully evaluated recently, with follow-up findings at 1 and 
2 years. This revealed improvements in the treated group in IQ and 
adaptive function, but notin ASD symptoms [30], with these gain no 
longer being significant at the 4-year follow-up [31]. Studies com- 
bining classical ABA and joint attention training approaches have 
reported benefits for proximal target symptoms, but it is unclear 
whether these effects generalize to outcomes more closely linked to 
real-life functioning [32, 33]. 

In adults, observational studies suggest that behavioural or social 
learning approaches may be helpful, but the evidence from RCTs is 
very weak, with only one study which failed to find significant benefit 
of a social learning programme [34]. There are a range of other inter- 
ventions to consider for adults with ASD which may benefit their 
adaptive function and quality of life, without specifically improving 
social communication skills. These include structured life skills pro- 
grammes for those who have difficulty with organization or social 
isolation and CBT-based programmes to reduce the risk of victim- 
ization [35]. However, the evidence base is weak. 

Research shows that the following treatments are not of benefit, 
exclusion diets have no benefit, and the once-touted treatments 
of secretin, hyperbaric oxygen, and chelation therapy may be 
harmful [36]. 


Restricted and repetitive behaviours and interests 


Despite early findings suggestive of benefit, selective serotonin re- 
uptake inhibitors (SSRIs) should not be used to treat RRBIs. Early, 
but small, studies of fluoxetine in children [37] and adults [38] 
and fluvoxamine in adults [39] were superseded by a large trial of 
citalopram in youth, which failed to identify beneficial effects and 
demonstrated adverse effects in the treated group [40]. These ag- 
gregate findings, along with a few other studies, were combined in 
a Cochrane review that did not support the use of SSRIs for RRBIs 
in ASD [41]. As for social symptoms, both risperidone [42] and 
aripiprazole [43, 44] significantly reduced stereotypic behaviour in 
secondary analyses where autistic children were selected, based on 
high levels of irritability. The interpretation of these findings should 
be that where high levels of irritability are present, such drugs may 
have wide-ranging benefits. There are no trials suggesting psycho- 
logical interventions directed at RRBIs are useful. 


Sensory symptoms 


The recent revision of DSM-5 included for the first time sensory 
atypicalities as part of the diagnostic criteria [1]. They are sub- 
sumed under the domain of RRBIs. Sensory atypicalities include 
both hyper- and hyposensitivity, as well as unusual interests to one 
or more domains of sensation. The study of sensory atypicalities has 
been held back by a lack of good measures of sensory experiences 
and behaviours. Recommendations from occupational therapists 
include the use of stimuli that provide alternative sensations such as 
squeezy balls, weighted blankets and vests, eye protectors, and mas- 
sage. There is weak (because of small sample size and non-blinded 
outcomes) RCT evidence for the benefit of occupational therapy 


in one structured intervention of 30 sessions [45] and similarly for 
massage [46]. Although they have been popular, the evidence on 
weighted vests does not support their use [47]. 


Treatment of common co-occurring mental disorders 


As highlighted previously, co-occurring psychiatric disorders are 
the rule in people with ASD. The reasons for this are only partly 
understood. Twin studies indicate that co-occurrence with other 
neurodevelopmental disorders, such as ADHD, tics/Tourette’s syn- 
drome, and ID is predominantly due to shared genetic risk factors 
[48]. This does not apply to co-occurring emotional symptoms 
where shared genetic factors are very small [49]. For the latter group 
of symptoms, it is more likely that features of ASD, impairments 
in cognitive processing, including social cognition, and executive 
function are mediators for emotional disorders [50]. Differences in 
behavioural style may render people with ASD more likely to ex- 
perience adverse environmental and life events, including bullying 
[51], and they may have greater difficulty in adaptive responses 
when they have occurred. Others have argued that the high level 
of co-occurring psychopathology merely reflects inaccuracies in 
diagnostic boundaries and that these should be broader and more 
inclusive to reflect the patterns of observed symptoms. A benefit of 
conceptualizing these symptom domains as co-occurring disorders 
is that it encourages clinicians to identify and treat them, drawing 
upon the wider intervention evidence base in the absence of ASD- 
specific studies. 


Attention-deficit/hyperactivity disorder 


Population studies suggest 30-50% of children with ASD meet cri- 
teria for ADHD [8, 9]. Diagnostic classification systems prior to 
DSM-5 precluded the diagnosis of ADHD (hyperkinetic disorder 
in the International Classification of Disease) in the context of ASD; 
hence, much of the relevant research refers to people with high levels 
of ADHD or hyperactivity symptoms. For all practical purposes, 
the patients included in pharmacological trials would meet formal 
diagnostic criteria for ADHD or hyperkinetic disorder and they are 
referred to in the following sections as having ADHD for ease of 
discussion. 

There are RCTs supporting the benefits of methylphenidate, 
atomoxetine, and guanfacine in youth with ASD and ADHD. 
Methylphenidate has been evaluated in four RCTs, with a meta-analytic 
effect size of 0.66 [52] and about half being classified as responders 
[53]. This contrasts with effect sizes in the region of 0.8-1.0 in typically 
developing youth with ADHD [54] and 80-90% of responders [55]. 
Furthermore, rates of adverse effects are higher, with 18% in the largest 
study being withdrawn due to adverse effects in one study [53]. 

Atomoxetine has been evaluated in a small [56] and medium- 
sized [57] trial. Both reported significant improvements with active 
treatment over placebo, but effect sizes were not reported. In the 
larger study, only 21% were responders under the usual criteria (vs 
9% in the placebo group) [57], compared to 57% and 25%, respect- 
ively, in the smaller study [56]. However, rates of adverse effects were 
low, with only two withdrawals across both studies. A two-by-two 
RCT (atomoxetine vs placebo, parent training vs no psychological 
intervention) showed benefits for all three active arms (atomoxetine 
plus parent training, atomoxetine alone, and parent training alone), 
with effect sizes of between 0.57 and 0.98, compared to no treatment. 
The active arms could not be distinguished statistically [58]. 
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Most recently, guanfacine has been evaluated in ASD and ADHD 
in a medium-sized study showing a large effect size (>1), with 50% 
of the active group classed as responders vs 9% of the placebo group 
[59]. Adverse effects leading to withdrawal were present in 13%, and 
lethargy early in treatment was prominent. 

The only RCT evidence for psychological intervention comes 
from the atomoxetine/parent training study described, in which the 
effect size against no active treatment was 0.6 [58]. In contrast to the 
pharmacological trials, this was not a blinded outcome, as parents 
were the recipients of the intervention, as well as the informants on 
ADHD, so the magnitude should be viewed with caution [60]. 

In summary, there is good evidence for pharmacological inter- 
ventions in ADHD among youth with ASD. However, the effects are 
mostly smaller than in typically developing youth and the rate of 
adverse effects is higher. Therefore, careful monitoring is required 
when prescribing. Different classes of medication should be tried 
sequentially if benefits are inadequate or the drug is poorly tolerated. 
There are no adult studies to draw upon; therefore, extrapolation 
from the experience in children and adolescents is necessary. The 
findings of parent training highlight the importance of including 
psychological approaches to intervention and is consistent with the 
NICE update on ADHD, in which it is recommended that parent 
training is offered to all parents of children with ADHD [61]. 


Anxiety and obsessive-compulsive disorders 


Rates of anxiety disorders vary widely across studies, from about 
30% to 80%, depending on populations and measurement [62]. 
With regard to pharmacological interventions, there have been no 
RCTs targeting anxiety disorders among patients with ASD. Trials 
of both fluoxetine and citalopram failed to show an impact on ob- 
sessional symptoms [37, 40], and these trials were likely measuring 
RRBIs, rather than obsessive-compulsive disorder (OCD). 

A meta-analysis of eight RCTs of adapted cognitive behavioural 
therapy (CBT) for anxiety in children and adolescents indicates 
benefits, with an aggregate effect size of 1.2 [63]. Child-reported 
outcomes from five studies revealed a lower meta-analytic effect size 
of 0.68. As above, these large effects should be viewed with caution, 
as the informants were not blind to treatment allocation and four of 
the trials used waitlist controls. In addition, the samples are selected 
to include youth of higher intellectual ability. In adults, the evidence 
for CBT is more anecdotal. One medium-sized trial of CBT for OCD 
in adolescents and adults with ASD failed to find a treatment effect 
against generic anxiety management [64]. The sample sizes was 
probably not large enough for a comparison against an active inter- 
vention, and both the rate of response (45% vs 20%) and effect sizes 
(0.33 vs 0.05) were suggestive of superiority for CBT that warrants 
further investigation. 

When possible, CBT should be the first-line treatment. 
Pharmacologically, in the absence of ASD-specific data, treatment 
should follow that used in typically developing individuals, with the 
provisos described in relation to low doses and gradual titration. 
Medication should be considered when CBT is ineffective or unsuit- 
able because of patient characteristics, including anxiety that is too 
severe for such an approach. 


Depression 


Rates of depression appear to be low in children with ASD [9] but 
affects up to 50% of adults [7]. There is even less evidence base for 


ASD-specific treatment of depression. One extremely small study of 
fluoxetine in adults failed to find an effect but was underpowered to 
reach any conclusions [65]. There is a similar lack of evidence in rela- 
tion to psychological interventions for depression. Treating depres- 
sion should follow recommendations for people without ASD, but 
when psychological interventions are used, they should be adapted 
to be ASD-specific. 

Tics 

Tics are common in ASD [8, 9] but importantly need to be dif- 
ferentiated from mannerisms, stereotypies, repetitive behaviours, 
and complex rituals. This usually requires direct observation. As in 
typically developing people, tics should only be treated when they 
are impairing and then the aim of treatment should be to manage 
impairment, rather than eradicate the tics. Psychoeducation is 
important, as families and teachers often do not appreciate the 
involuntary nature of tics. In individuals without ASD, a recent 
meta-analysis of RCTs reported good to moderate quality evi- 
dence for the benefits of the alpha-2 adrenergic receptor agonists 
clonidine and guanfacine and dopamine antagonists/partial agon- 
ists [66]. This review also judged the trials for habit reversal to be 
of good quality and demonstrating efficacy. These findings should 
guide intervention in ASD. 


Behaviours that challenge 


One of the most common and impairing presentations among 
people with ASD is that of ‘behaviours that challenge others’ or 
‘challenging behaviour’. Over the years, a number of terms have 
been used to reflect these behaviours, which comprise aggressions 
directed at the self or others and severe non-compliant behaviour, 
frequently with high levels of irritability. These behaviours may vary 
in their intensity and frequency but are of concern when they im- 
pact on an individual's ability to participate in everyday activities 
such as family life, school attendance, employment, or leisure activ- 
ities. Such behaviours are common in people with ASD and more 
frequent than in other conditions causing similar impact on intel- 
lectual functioning [67, 68]. Behaviours that challenge have been 
conceptualized in different theoretical perspectives, which are not 
mutually exclusive. In the comprehensive approach to their manage- 
ment, all need to be considered. 

Social learning models of behaviours that challenge view these 
as maladaptive learnt responses. Applied behaviour analysis (ABA) 
aims to identify the causes and responses that are driving its re- 
inforcement. There is an extensive ABA literature of individual pa- 
tients and case series demonstrating how learning principles can 
be used to break the cycle of these behaviours. Although there is a 
lack of group-based RCTs, the literature includes examples demon- 
strating the causal role of contingent response, in which the response 
to a challenging behaviour is modified, leading to a reduction in the 
behaviour, followed by a return to the original contingencies, which 
leads to a return of the unwanted behaviour. All clinicians working 
with these patients demonstrating challenging behaviour should 
have a good understanding of functional behavioural analysis in 
order to take a history that elicits antecedents, behaviours, and con- 
sequences. ABA in its classical form is undertaken by psychologists 
and behaviour modification therapists who make use of a range of 
operant conditioning principles; it can be time-consuming but is an 
important treatment modality for behaviour that challenges. 
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Fig. 30.2 Conceptual framework linking psychiatric disorders to 
behaviour that challenges. 


Behaviours that challenge may also arise as a consequence of con- 
ditions causing pain or discomfort. Painful conditions are more 
common in ASD, either due to the high rates of causal or associated 
medical conditions. Even higher-functioning autistic individuals 
often have difficulty identifying bodily sensations and linking them 
to their emotional state and behaviour. Hence, these should always 
be considered, especially when there is a sudden increase in challen- 
ging behaviour. 

Behaviour that challenges may also be an external manifestation 
of an underlying psychiatric disorder (Fig. 30.2). The presentation of 
psychiatric disorders may be atypical and more difficult to identify 
because of poor communication skills and emotional literacy. For 
example, in autistic individuals, anxiety may present as irritability 
[69]. The link between different psychiatric disorders and behav- 
iours that challenge has been demonstrated in adults with ID [70]. 
Clinicians should assess for psychiatric disorders that may underpin 
behaviours that challenge. 

In the absence of identifying another psychiatric disorder, there 
is evidence for treating behaviour that challenge described as ‘ir- 
ritability with dopamine antagonists/partial agonists, most not- 
ably risperidone and aripiprazole. Both drugs have robust acute 
and medium-term benefits [71, 72]. However, they may also have 
significant adverse effects, which include weight gain and somno- 
lence, and for risperidone, raised prolactin levels [73, 74]. Clinical 
impressions that aripiprazole may have fewer adverse effects are not 
at present confirmed by research; a small head-to-head comparison, 
which was likely underpowered, failed to detect differences in rates 
of short-term adverse effects [75]. 

A trial comparing risperidone alone vs risperidone and parent 
training in children revealed significantly greater improvement in 
adaptive function in those receiving the combination interven- 
tion, and medication doses were marginally (not significantly) 
lower than in the medication only arm [76]. Based on all these 
findings, in conjunction with clinical consensus, NICE recom- 
mends that behavioural treatment should be the first-line inter- 
vention for behaviour that challenges. Medication should only be 
first line when circumstances do not permit a behavioural inter- 
vention, and where this is the case, a behavioural intervention 
should be added as soon as possible. Conversely, if a behavioural 
intervention is inadequate, medication augmentation should be 
considered. Although medication retains its effectiveness in the 


medium term, there are real concerns about long-term safety. 
Ideally, the use of antipsychotic medication should be seen as an 
opportunity to develop other strategies for behavioural manage- 
ment, with the aim of reducing or eliminating medication, wher- 
ever possible. 


Sleep problems 


Although sleep problems are very common in people with ASD, 
particularly those with associated ID [77], research on their man- 
agement is very limited. Sleep onset and maintenance are the most 
common problems. The causes are likely multifactorial; there is 
emerging evidence for biological differences in melatonin produc- 
tion and circadian patterns. Co-occurring medical disorders may 
play a role, and sleep apnoea should be considered. Psychiatric dis- 
orders, such as anxiety, depression, and ADHD, are all linked to sleep 
problems in people without ASD, and these are of higher prevalence 
in ASD. People with ASD often have maladaptive bedtime routines 
that cause or exacerbate sleep problems. 

One trial of melatonin in children with developmental disabilities 
included a large proportion with ASD [78]. The protocol required 
that eligible families first receive a manual on behavioural manage- 
ment of sleep. Following this, over one-third of children no longer 
met inclusion criteria, highlighting the benefits of psychosocial 
intervention. Among those completing the trial, parent diaries in- 
dicated a benefit of melatonin for sleep latency and total sleep time, 
while actigraphy on a subsample confirmed the effect on sleep la- 
tency only. Recommendations from NICE place priority on behav- 
ioural management strategies, particularly the implementation of 
a sleep hygiene programme. Where this is insufficient, melatonin 
can be considered but should only be extended when there is clear 
benefit and then for a limited time period [36]. 


Conclusions and research directions 





ASD is a severe, pervasive, and lifelong disorder. Although there is 
an increasing understanding of its aetiology, this has only begun to 
translate into theoretically derived approaches to intervention. In 
relation to core symptoms of ASD, most promising at present are 
psychosocial interventions aimed at improving social communica- 
tion. A better understanding of their long-term impact is required. 
Pharmacological approaches to core symptoms have recently fo- 
cused on molecules implicated in biological pathways that appear to 
be different in people with ASD, but there is insufficient evidence to 
recommend their use in clinical practice. A programme of clinical 
trials is needed to escalate the rate of progress. 

Co-occurring psychiatric disorders are exceedingly common in 
ASD and form an important consideration in the comprehensive 
management of ASD. There is a good evidence base for the treat- 
ment of ADHD and ‘irritability’ or behaviour that challenges, and 
some evidence to guide practice for anxiety disorders. However, for 
many other psychiatric disorders, there is a lack of therapeutic re- 
search in ASD, leaving clinicians reliant on the evidence base for 
patients without ASD. Because of the differences in biology and cog- 
nitive make-up, such extrapolation may be inappropriate. There is 
an urgent need for treatment trials in affective and sleep disorders. 
Different intervention modalities should be considered, both on 
their own and in combination. 
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The development of new, reliable, sensitive, and unbiased ob- 
jective outcome measures should aid the progress and interpretation 
of clinical trials. Finally, environmental interventions, including 
those based in education and employment, have the potential to 
produce large gains in well-being and quality of life. 
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The concept of attention-deficit/hyperactivity disorder (ADHD) 
arose from neurological formulations but does not entail them, and 
the modern definition describes a set of behavioural traits. A con- 
dition consisting of pervasive and persistent inattention and/or im- 
pulsiveness is validated by its predictive power. It is associated with 
the presence of brain changes and genetic inheritance, as well as 
treatment response and prognosis [1]. The historical evolution of 
the concept is described by Taylor [2]; it began with the idea that 
some behavioural problems in children arose, not from social and 
familial adversity, but from subtle changes in brain development. 
Nowadays, however, the definition comes from specific and observ- 
able behaviour traits without assumption of cause. Attention Deficit/ 
Hyperactivity Disorder (ADHD) in DSM-5 [3] and ‘Hyperkinetic 
Disorder in ICD-10 [4] describe a constellation of overactivity, im- 
pulsivity, and inattentiveness. 

These core problems often coexist with other difficulties of 
learning, behaviour, or mental life, and the coexisting problems may 
dominate the presentation. This coexistence, to the psychopatholo- 
gist, emphasizes the multi-faceted nature of the disorder; to the soci- 
ologist, a doubt about whether it should be seen as a disorder at all; 
to the developmentalist, the shifting and context-dependent nature 
of childhood traits. For clinicians, ADHD symptoms usually need to 
be disentangled from a complex web of problems. It is worthwhile 
to do so because of the strong developmental impact of ADHD and 
the existence of effective treatments. Public controversy continues, 
but professional practice in most countries makes ADHD one of the 
most commonly diagnosed problems of child mental health. 








Hyperactivity 


The idea of hyperactivity is an excess of undirected movement. It 
is not totally dependent on context and cannot be reduced to non- 
compliance; physical measures of activity level have indicated that 
it is higher in children with ADHD than in controls, even during 


eractivity disorder 





sleep [5]. It is, however, partly dependent upon context—it is often 
inhibited by a novel environment, creating a pitfall for the inexperi- 
enced diagnostician who may exclude it incorrectly because it is not 
manifest during observation at a first clinic visit. It may not be shown 
in situations where high activity is expected such as the games field 
or energetic play. The key situations where it is evident are familiar to 
the child and where calm is expected, such as visiting family friends, 
attending church, at mealtimes, during homework, and—often the 
most troublesome—at school, during class. 


Impulsiveness 

Impulsiveness means action without reflection—often described 
as a failure to ‘stop and think. The term covers premature, unpre- 
pared, and poorly timed behaviours—such as interrupting others 
and giving too little time to appreciate what is involved in a school 
task or a social situation. 

It can also refer to several neuropsychological changes such as 
failure to inhibit inappropriate responses, aversion to delay, and 
speed/accuracy tradeoffs. These are indeed more common in children 
with ADHD and may well underlie some of the observed behaviour 
changes, but they are not constant and are not part of the diagnosis. 

DSM 5 sets out a list of observable behaviours that together de- 
fine a dimension of hyperactivity-impulsiveness. They can be 
summarized as: 


e Often fidgeting. 

e Often leaving seat. 

e Often running or climbing inappropriately. 
e Often unable to play quietly. 

e Often unable to be still when required. 

e Often talking excessively. 

e Often blurting out answers prematurely. 

e Often having difficulty in waiting. 

e Often interrupting or intruding on others. 


Six of these nine problems need to be present—or five of them for 
people aged over 17—as a criterion for the diagnosis. Experienced 
clinicians will sometimes find it infeasible, or even misleading, to 
make a pedantic count and may be guided by the overall impression 
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of an unmistakeable deviation from what is expected for an indi- 
vidual of given chronological and developmental age. 


Inattentiveness 


Inattentiveness means disorganized and forgetful behaviour— 
short-sequence activities, changing before they are completed, with 
lack of attention to detail and failure to correct mistakes. These are 
behavioural observations, not psychological constructs. On a cog- 
nitive level, ‘attention deficit’ is not always an accurate description; 
the performance of affected children does not always fade with time 
on a task any more than that of ordinary people, and the presence 
of irrelevant information (‘distractors’) does not necessarily worsen 
their performance disproportionately to that of other people [6]. 
Cognitive changes of several kinds are indeed seen more frequently 
than in typically developing people. These changes go beyond atten- 
tion and executive function [7]; the recognition of inattentiveness 
depends on descriptions and observations of behaviour, rather than 
on tests of performance. 

DSM-5 provides a list of nine possible features that are often pre- 
sent to excess. They can be summarized briefly as: 


e Lack of close attention to details; careless mistakes. 
e Difficulty in sustaining attention on tasks or play. 

e Not listening when spoken to. 

e Not following instructions or finishing tasks. 

e Difficulty organizing tasks and activities. 

e Avoiding tasks that require effort. 

e Losing things that are needed for activities. 

e Distractible by outside stimuli. 

e Forgetfulness in daily activities. 


Six of the problems should be present for the full diagnosis (or five 
in adult life). Care is needed to avoid confusion with oppositional be- 
haviour. For example, ‘not following instructions’ is intended to imply 
that instructions are forgotten or not attended to, but the behaviour 
can also be shown for reasons of wilfulness and defiance. Careful de- 
scription, or witnessing the behaviours complained of, is necessary 
to identify the quality of disorganization. The recognition of the dis- 
ability therefore needs an experienced professional. The diagnosti- 
cian also needs to recognize that the behaviours can be modified by 
the context. Tasks with a strong incentive, or that are inherently re- 
warding and engaging, may well elicit a much improved performance. 
Good focus on a shoot-em-up computer game, or good self-control at 
the first visit to the office, should not be taken to exclude the diagnosis. 


Further requirements for diagnosis 





Developmental inappropriateness 


Inattention and impulsive over-activity both need to be judged in 
terms of what is clearly excessive for the individual’s developmental 
level. Either is sufficient for a diagnosis; when both are present, then 
the combination should be noted. In the previous version of DSM- 
IV, there was an expectation that predominantly inattentive, pre- 
dominantly impulsive, and mixed subtypes should be distinguished. 
A longitudinal study, however, has found that they do not behave 
as subtypes over time—one tends to turn into another. They are, 


accordingly, now referred to only as ‘presentations. Nevertheless, 
clinical experience suggests that there are important differences in 
children who are not just ‘predominantly, but exclusively, inatten- 
tive. Dreamy children, who are sluggish, rather than excitable, in 
temperament can still have enough problems in their attention that 
their education is seriously compromised. They will need much 
more focus on educational progress, and less on risk for antisocial 
development, than their disruptive contemporaries—and indeed 
much more psychological research than has yet been forthcoming. 


Pervasiveness and persistence 


Pervasiveness across situations and over time is another important 
part of the clinical picture. One is trying to identify a trait in the 
person, not simply a quality of the environment that elicits restless- 
ness. The extent of pervasiveness is the main distinction between the 
DSM-5 definition of ADHD and the ICD-10 definition (currently 
under revision) of hyperkinetic disorder (HKD). The latter specifies 
that all three of the cardinal features—impulsiveness, over-activity, 
and inattentiveness—should be present, pervasively and persist- 
ently, across home, school, or work, and in other situations. HKD is 
therefore, in effect, a subtype of ADHD [8]. 

Disability 

Impairment in social, academic, or occupational functioning is also 
a necessary condition for the full diagnosis. The necessity for this is 
somewhat controversial. The extent to which the features of ADHD 
impose a disability is very variable. Careful parents and responsive 
teachers can help a child to achieve their potential, in spite of the 
struggle to focus and self-organize. Conversely, a highly academic 
school may impose demands for achievement that entail a real dis- 
ability for children who have only minor difficulties in concentrating. 
Furthermore, in comorbid cases, it can be hard to tell whether the 
impairment is due to the ADHD itself or to the coexisting difficulties 
(see Associated features, p. 302). 

The level of impairment of function that is required for the diag- 
nosis accounts for a good deal of the great variation in prevalence be- 
tween different studies and different places [9] (see Chapter 33). A full 
assessment will separate caseness and impairment, and accept both as 
important in making the formulation and clinical plan. 


Associated features 


Many other behavioural changes characterize some children with 
ADHD. They are, for instance, often irritable and their emotions can 
flash very rapidly when provoked. They may sleep badly (and this, in 
turn, can contribute to poor concentration). They can be aggressive 
to other people and non-compliant to authority. They can also be 
charming, humorous, inquisitive, and intuitive. None of these, how- 
ever, are either constant in ADHD or confined to those with ADHD. 
They are worth noting, but they do not make the diagnosis. 


Modifying features 


Inattention and impulsive over-activity both need to be assessed in 
light of the effects of age, gender, and developmental level. 
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The first 3 years 


A difficult temperament’ in early childhood includes over-activity 
and poor self-regulation and can have a harmful effect on parent- 
child relationships. Some children do indeed present a whirlwind 
of activity, but the normal range is very great and the concept of 
inattentiveness is hard to apply at this age. The exploration of the 
world by toddlers is typically short-lived, but intense. A diagnosis of 
ADHD would be insecure. 


Age 3-6 years 


ADHD behaviours are clearly recognizable by this age, and there 
is a strong likelihood of persistence into the school years [10]. It 
can still be difficult to distinguish between behaviour problems at 
home resulting from impulsiveness and those from headstrongness 
and anger. The presence or absence of inattentiveness is therefore 
important for diagnosis, but the issue is often not pressing because 
parent training is an effective intervention for behaviour problems 
at home and should be available for parents with children at risk, 
without waiting for a formal diagnosis. 


Age 7-11 years 


Attention span increases during the early school years, and most 
children with ADHD will be able to sustain interest for some 3 
minutes—more, if the activity is unusually attractive. Hyperactivity 
is reflected in running and climbing in situations that expect calm. 
School and peer demands make ADHD behaviours impairing; the 
tolerance of families and culture at large help to determine whether 
ADHD is seen as a problem, and this is a very common age for re- 
ferral and diagnosis. Hyperactivity (as opposed to inattentiveness 
alone) becomes important in generating aggressive and antisocial 
behaviour and delinquency. The extent to which there is a poor 
social outcome depends upon both genetic and environmental 
influences [11], including gene-environment interactions such 
as that found in a study in which a catechol-O-methyltransferase 
(COMT) gene polymorphism, together with low birthweight, 
predicted the development of antisocial symptoms in those with 
ADHD [12]. 


Age 12-18 years 


During adolescence, there is maturing in the abilities of self-control. 
Attention span may well have risen to 10 minutes at a time, but the 
demands of classrooms will typically expect much more. Academic 
failure becomes increasingly salient in the young people's lives. 
Hyperactivity may now be reflected only in fidgeting and a strong 
feeling of a need to be ‘up and away. Emotions can become unregu- 
lated in most teenagers, but especially so in those with ADHD. Some 
children with ADHD will become more reflective, but the demands 
for self-control rise as well, and so the young people still tend to be 
more impulsive and inattentive than their peers. Indeed, even those 
who had never been clinically diagnosed or referred for help were 
still four times as likely to merit a psychiatric diagnosis by the time 
they were 17 years old in a longitudinal population study [13]. Those 
who continue to show hyperactivity are at increased risk for other 
problems, notably aggressive and antisocial behaviour, delinquency, 
and motor traffic accidents. Substance misuse may be appearing, 
largely attributable to coexisting problems of conduct and social 
dysfunction. 


Adult life 


By adult life, most will no longer meet full diagnostic criteria for 
ADHD, but most will retain some functional impairment related to 
hyperactivity and the impairment is present at lower levels of the 
number of criteria than is the case in chidhood [14]. DSM-5 there- 
fore adjusted the number of symptoms expected for diagnosis, from 
six to five. Inattention may now be reflected in incomplete assign- 
ments, procrastination, and mind-wandering. Over-activity may be 
confined to an inner feeling of restlessness. Impulsiveness may now 
be a matter of recklessness and poor decision-making. Emotional 
lability is often a major part of the presentation and is closely linked 
to the level of the ADHD problems themselves [15]. 

There is also an expectation that the symptoms should have 
started in childhood, that is, before the age of 12 years. They may 
well not have been diagnosed at the time, and indeed they may 
not even have been impairing. Bright children may have been able 
to achieve satisfactory academic progress, and those without sig- 
nificant oppositional problems may well have fitted harmoniously 
into family life. ADHD features may only have become impairing 
when adult life presented greater demands for responsibility and 
scheduling. Recent research from a longitudinally studied co- 
hort has identified adults who presented ADHD features but 
did not show them when they were studied in their childhood 
[16]. These ‘adult-onset’ people were different from those whose 
ADHD had been continuous throughout development, in ways 
that cast doubt on whether they could be considered as showing 
a neurodevelopmental disorder—they were less likely to have af- 
fected family members and cognitive deficits, and more likely to 
have used cannabis and other drugs. 

A key difficulty in current knowledge is that of making an ac- 
curate diagnosis when adults present for the first time [17]. Adults 
may be mistaken in their recall of their childhoods, and the sub- 
jective aspects of their condition may be rather different from those 
that other people observe. The account of somebody who knows 
the patient well—perhaps a spouse or a work partner—is very de- 
sirable but does, of course, need interpreting in light of their own 
interests in a diagnosis. People who knew the individual as a child 
can be very helpful informants, for example through a telephone 
interview, and there may be school records available. Simply the 
giving of a diagnosis comes as a relief to some who have puzzled 
over the reasons for their failures and can liberate problem-solving 
approaches 


Gender 


Females are less likely than males to be rated as over-active, im- 
pulsive, or inattentive during childhood. The ratio is usually about 
2-3:1 in population surveys, and those girls who are affected are 
no less likely than affected males to become impaired in social and 
academic functioning [18]. Diagnostic rates in series of children re- 
ferred to clinics usually yield much wider ratios, and it may well be 
that boys are preferentially identified in the community as needing 
help. Affected girls are often more troubled by inattention than over- 
activity and may often suffer more than boys from impairment of 
peer relationships. In current knowledge, the same criteria should 
be applied to both genders. Clinicians, however, should guard 
against dismissing girls who are not disruptive as being unaffected 
by ADHD. 
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Developmental level 


The diagnosis should only be made when the problems are out of 
keeping with the developmental level. Intellectual disability is very 
likely to include problems of concentration. Psychometric assess- 
ment is therefore advised before assuming that attentiveness is 
disproportionately affected in such cases or that this deficit is re- 
sponsible for poor achievement. Over-activity/impulsiveness is usu- 
ally assessed with reference to what would be expected in a typically 
developing child whose chronological age would correspond to the 
developmental age of the child being assessed. The validity of this 
approach deserves the attention of researchers. 


Diagnostic problems 


Description of the symptoms makes them sound easy to recog- 
nize, and indeed the problems are usually very salient, disruptive to 
other people, and common causes of referral to health and special 
education services. Nevertheless, there are pitfalls in the diagnosis, 
making it necessary for a specialist assessment to be undertaken be- 
fore the diagnosis is given. 


Differentiation from normality 


The behaviours of ADHD are continuously distributed in the 
population (see Chapter 33). The level that is considered normal 
or acceptable will vary from one culture to another and from one 
rater to another. To be diagnosed, it should be excessive not only 
for the child’s age but also for the developmental level, present 
across situations and time, and giving rise to significant distress or 
dysfunction. 


Confusion of cardinal and associated features 


Many behavioural problems—such as temper tantrums, sleepless- 
ness, aggression, and disobedience to adult authority—are common 
in children with ADHD and may be the key reasons for presenta- 
tion. It is easy to make the mistake of diagnosing ADHD when only 
disruptive behaviour is present. Some of the confusion comes from 
the ambiguity of ‘impulsiveness. Behaviours, such as calling out in 
class and interrupting others, can indeed come from a difficulty in 
holding oneself back, but they can also represent deliberate flouting 
of the rules. Direct observation can usually make the distinction— 
watching either the children tackling tasks requiring them to stop 
and think in the clinic or their natural behaviour in the classroom. 
Inattentive behaviour also helps to make the diagnosis of ADHD 
and is less confounded by oppositionality. 


Reliance on non-expert judgements 


Judgements about whether behaviours occur ‘often’ and are atypical 
demand considerable familiarity with the usual range of variation. 
The diagnostician will acquire this in the course of training and ex- 
perience; experienced teachers will be excellent judges, but inexperi- 
enced or overstressed parents may identify the problems at a low 
level of hyperactive behaviour or suppose that an abnormal level is 
only to be expected in childhood. It is usually helpful to obtain a de- 
tailed behavioural account, rather than rely on an overall judgment 
of ‘over-activity’ or ‘failure to concentrate’ 

Contradiction between sources may occur and lead to argu- 
ments between parents and teachers. This may be due to different 


expectations, the emotional relationship of raters with the child, or 
children behaving very differently in contexts that vary in the de- 
mands placed on the children. The clinician needs to understand the 
full context of the way involved adults describe the child. 


Difficulties in recognition in the presence 
of coexisting problems 


It is commonplace for children whose problems meet the criteria for 
ADHD to show other patterns of disturbance as well. This is often, 
confusingly, called ‘comorbidity’—confusing because it implies that 
the other pattern is a distinct disorder, which is only one of the ex- 
planations for coexisting problems. Clinicians need to understand 
the relationships for two reasons, so that they do not make or miss 
the diagnosis of ADHD and so that they can make good strategies 
for treating ADHD in the presence of other disorders and other dis- 
orders in the presence of ADHD. 


Conduct and oppositional disorders 


The most common association, and the best researched, is with 
conduct and oppositional disorders. Nearly half the children with 
hyperactive behaviour in a community survey showed high levels 
of defiant and aggressive conduct as well, but the associations of 
the two problems were different, with hyperactivity (but not con- 
duct problems) being associated with delays in motor and language 
development [19]. When both ADHD and conduct problems were 
present, the combined condition (‘hyperkinetic conduct disorder’ in 
ICD-10) showed the associations of both disorders. ADHD is there- 
fore not to be diagnosed by the absence of conduct disorder features, 
but by the clear presence of the core problems of inattentiveness and 
disorganization. 


Tourette's disorder and multiple tics 


A different kind of differential is presented by children with Tourette's 
disorder. Their motor restlessness may indeed represent the coexist- 
ence of ADHD but can result directly from tics. If a child’s tics are 
very frequent and numerous, then their repetitive and stereotyped 
nature may not be apparent and they may be seen simply as restless 
fidgetiness. Again, direct observation of the pattern of over-activity 
is the key. When there is doubt, filming the child and subsequent 
slow-motion review may make repetitive patterns evident. 


Autism spectrum disorders 


Children with autism have clear and characteristic impairments 
of language, communication, and social development. Spectrum 
disorders, however, can raise diagnostic challenges. Children with 
ADHD alone often show language delays (usually of an expressive 
nature with oversimple utterances, by contrast with the receptive 
difficulties and idiosyncratic patterns of autism). Their attention dif- 
ficulties may make them unresponsive to the overtures of others in a 
way that can simulate the social obliviousness of people in the spec- 
trum of autism, and they are often friendless—not because of lack of 
interest in others, but because of the capacity of hyperactive behav- 
iour to irritate other people. Indeed, attention problems can extend 
to perseveration on certain activities, such as video games, which 
may be mistaken for the restricted interests of autism. All these fac- 
tors can lead to ADHD being mistaken for autism, but the reverse 
can happen too. There are other reasons for over-active behaviour 
in autism. Firstly, stereotyped patterns of driven over-activity can 
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be seen; they are not disorganized or impulsiveness and are often 
made worse by change and novelty (which usually reduce the over- 
activity of ADHD). Secondly, episodic bursts of extreme activity can 
be seen and may be best regarded and treated as catatonic. Thirdly, 
akathisia may result from neuroleptic medication, or irritable rest- 
lessness from anticonvulsants, and it will be necessary to establish a 
clear history that ADHD has been a persistent trait. 


Attachment disorders 


Reactive attachment disorder (RAD) may share with ADHD a dis- 
inhibited style of relating to other people (an unreserved, but shallow, 
making of social contact). Children with RAD, however, tend to be 
controlling rather than disorganized, and vigilant rather than in- 
attentive, and inattention and impulsiveness are not cardinal features 
of RAD—so it is not difficult to recognize both patterns when pre- 
sent in an individual child. The confusion in practice often comes 
from theoretical misconceptions. Those caring for neglected or aban- 
doned children may consider that the diagnosis of ADHD cannot be 
accurate because the cause of the children’s problems is clearly to be 
found in their early deprivation. The causal pathway may indeed be 
that of neglect (though genetic inheritance and fetal damage also need 
considering); but ADHD is a descriptive category, not an explanatory 
one. If the pattern of ADHD is present, it still needs recognizing—not 
least because the cause of the ADHD behaviour does not seem to de- 
termine the response to stimulant medication and children who have 
encountered neglect or abnormal early attachment may still have 
their ADHD problems reduced by medication. 


Bipolar disorders 


Both ADHD and manic conditions are characterized by over- 
activity, overtalkativeness, a sensation of whirling thoughts, and 
often irritable mood. The distinction is made by the presence in bi- 
polar disorder of episodicity, euphoria, and grandiosity. 

In all these differential diagnoses, the principle is to establish that 
the individual shows not only over-active or inattentive behaviour, 
but also the specific pattern of ADHD. Experienced judgement may 
be required, and the practice of diagnosing on the basis of ques- 
tionnaire scores alone carries risks of both overidentification and 
missing cases. 

In adult life, there are still more possibilities for misdiagnosis. 
The most common reasons for uncertainty are in distinguishing 
from atypical bipolar disorder and the effects of substance misuse. 
‘Personality disorder’ is sometimes applied, and indeed ADHD 
shares with personality disorders a long-standing trait quality but 
can also be a more precise way of describing the difficulties pre- 
sented. Differentiation from the normal range of variation can be 
difficult in the absence of clear standards. The task of the diagnos- 
tician is harder when adults are presenting for the first time if only 
self-report is available; the self-description of hyperactivity may be a 
form of self-depreciation. 


Methods of recognition 


Rating scales 


Questionnaire ratings by parents or teachers are very useful for 
screening purposes and monitoring interventions. Many are avail- 
able; the most famous are those from Conners which yield several 


different scoring systems [20], derivatives such as the lowa Conners, 
and the Strengths and Weaknesses of ADHD-symptomsand Normal- 
behavior (SWAN) and Swanson, Nolan, and Pelham (SNAP) scales 
[21]. Other scales were developed directly from the DSM criteria 
such as the ADHD Rating Scale [22] and the Vanderbilt ADHD 
Rating Scale [23]. Furthermore, several general-purpose scales in- 
clude subscales comprising problems of over-activity and inatten- 
tion such as the widely used Child Behavior Problems Checklist 
(CBCL) [24] and the Strengths and Difficulties Questionnaire 
(SDQ) [25]. They are particularly suitable for initial screening and 
triage. Free access from the Internet is available for SNAP-IV and 
SWAN (http://www.myadhd.com/snap-iv-6160-18sampl.html) and 
SDQ (http://www.sdqinfo.org). 

Many of these rating scales have similar strengths and weaknesses 
[26]. In group studies, they give a reasonably good discrimination 
between people with a clinical diagnosis of ADHD and controls 
from the ordinary population, with an effect size of about two 
standard deviations, implying a substantial overlap. This discrim- 
ination is probably somewhat better for the special-purpose ADHD 
scales than for the general-purpose scales. Test-retest reliability is 
typically around 0.7; all are sensitive to the effects of medication. 
Agreement between teachers and parents is typically rather low. The 
SWAN scale was developed with a wider range of scores for each 
item (including both ‘far above average’ and ‘far below average’)— 
so it will yield a more Gaussian distribution in a population survey 
than most other instruments. All scales, used as a population screen, 
leave a fair number of individuals misclassified and are liable to 
many false positives. The questionnaires are best seen as a first stage 
of screening, not as defining casehood. 

For adults, a variety of self-report measures have been devel- 
oped and are reviewed by Davidson [27]. The Wender Utah scales 
were particularly important in the early conceptualization of adult 
ADHD [28], but—like other rating scales—they tend to overidentify. 
An Adult ADHD Self-Report Scale (ASRS-v1.1) [29] is a symptom 
checklist developed for the WHO. It can be downloaded from: http:// 
www.caddra.ca/pdfs/Guidelines_ENG_ChangesJuly2012.pdf 


Informant interview 


A detailed interview with parents (or other main caregivers) is the 
most informative single method for assessing children and ado- 
lescents. The aim is to go beyond their overall ratings of problems 
to an understanding of what actual behaviours are the basis for 
ratings. It is helpful to obtain a description of key situations such 
as getting ready for school, playing with toys, drawing, mealtimes, 
homework, and family excursions. Parents will usually have very 
good recall of attention and activity control, even if their know- 
ledge about the normal range is limited. Professional judgement 
can then be applied to whether those descriptions indicate abnor- 
mality of development. 

Standardized interview schedules are also available. Some are 
highly structured and can be delivered by non-clinicians such as the 
Diagnostic Interview Schedule for Children (DISC) [30]. Others 
are more interactive, can be delivered flexibly, and expect the inter- 
viewer, rather than the respondent, to be the specifier of what is 
rated (as described previously). Examples of these ‘semi-structured’ 
instruments would be the Child and Adolescent Psychiatric 
Assessment (CAPA) [31] and the Schedule for Affective Disorders 
and Schizophrenia for School Age Children (K-SADS) [32]. They 
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are time-consuming, sometimes too much so for routine clinical 
practice, but can ensure a full cover of salient features. 

Interviewing should, of course, go beyond the making of a single 
diagnosis. It should include other problems, social and intellectual 
development, emotional life, and any physical problems. It can also 
be helpful—when possible—to interview teachers, not only to gain a 
full impression of the child in the classroom, but also to be seen to do 
so lest teachers form the impression of a less-than-comprehensive 
evaluation. 


Psychiatric interview 


Interview with the child is valuable for the observation of attention 
and social interaction that it yields, and for understanding a child’s 
view of their predicament. Children, however, are not good wit- 
nesses about their own concentration and impulse control, and even 
affected adults are not good at describing themselves in these terms. 
The experience of ADHD is usually one of suffering the reactions 
evoked from other people, or one of repeated failure. Adults often 
describe an experience of whirling and interrupted thoughts (in the 
absence of manic features), and some children will say the same, es- 
pecially if treatment has enabled them to make a comparison with 
another way of being. 


Observation 


Direct observation of the child in the clinic, or during psychometric 
testing, can give a vivid idea of the nature of their problems in at- 
tending. Do they take time to appreciate what they are being asked 
to do? Is the tempo of their activity increased? Can they slow down 
when asked? Are they distracted by the setting? Do they complete 
tasks or play activities? Similarly, observation of play and task per- 
formance in natural settings can be valuable in confirming or quali- 
fying the accounts of informants. The effect of the presence of the 
observer needs to be taken into account in making judgements. 


Investigations 


Assessment needs not only to distinguish ADHD from related dis- 
orders, but also to consider whether the ADHD pattern may result 
from remediable causes. The anamnestic history is by far the most 
productive investigation. It should include whether hearing prob- 
lems have been excluded by previous testing (and if not, an expert 
assessment should be arranged), and any injuries or diseases po- 
tentially damaging to the brain. The strengths and weaknesses of 
the family environment need to be assessed; they may dictate the 
choices of treatment. Physical examination should be sufficient to 
detect congenital anomalies, skin lesions, and motor abnormalities 
that can be the pointers to a neurological cause. Psychometric as- 
sessment is desirable whenever there are problems at school, both to 
generate an idea of developmental level against which the ‘develop- 
mental inappropriateness’ of behavioural symptoms can be judged 
and to detect barriers to learning that may be the reason for inatten- 
tiveness. Special physical investigations are not routinely necessary. 
EEG often yields evidence of immaturity, but this does not advance 
assessment much and is not routinely indicated. It is valuable in the 
investigation of epilepsy and in the rare cases when deterioration of 
function suggests the possibility of a degenerative disorder. Blood 
tests should be planned only on the basis of history and examination 
but may include tests of thyroid function, lead (in high-lead areas), 
chromosomal integrity (including fragile-X probe) when there is 


other evidence of developmental delay, and specific DNA tests when 
there is clinical suspicion of a phenotype such as that of Williams 
syndrome. 


Communicating the diagnosis 


Unlike most psychiatric conditions, a diagnosis of ADHD is often 
sought by parents and welcomed by them. Its image of being a phys- 
ically caused neurological disease is often perceived as a relief from 
the stigma of mental disorder. On the other hand, the media con- 
troversy over whether it is a ‘real’ disorder and over the use of con- 
trolled drugs leaves some parents confused and fearful. 

Assessment on the principles described will have led to an in- 
dividual formulation of the nature and causes of the impair- 
ment. Extended explanation is worthwhile in the longer term. An 
oversimple description in terms of a chemical deficiency in the brain 
may seem a useful starting point but can lead to unrealistic expect- 
ations for treatment and frustration with the doctor or, worse, with 
the child. A model of chronic disability is in keeping with the evi- 
dence from longitudinal studies but needs to be modulated by the 
good outcome for some children, the improvement for most, and 
the ability of warm and encouraging parenting to reduce the risks for 
antisocial behaviour in later childhood and adolescence. 

Children’s understanding of their problems is also worth a good 
deal of effort. Little research has so far addressed the issue, but it 
is important to their ability to cope. They need to know that their 
problem is understood, that treatments are available, that they can 
influence their outcome by their own actions, and that the people 
around them understand all this and can be encouraging. Positive 
role models are useful—some successful sports stars, performers, 
politicians, and business people have outed themselves as having, 
and sometimes using, ADHD. Explanations need to be repeated, as 
the young people mature and expect a fuller and more interactive 
discussion. 

Explanation is often needed by teachers as well. They may need to 
revise their expectations of the level of challenge with which the child 
can cope, and for some, frustration can lead to antagonism towards 
the child’s family. If they already see ADHD as a neurological disease, 
then the frequent observations of changeability in the children and 
of the ability to cope sometimes with difficult tasks may make them 
reject a neurological cause—and, with it, the diagnosis and validity 
of drug treatment. They may need to know that physical and psycho- 
logical factors can both enter into the child’s presentation and that 
the effect of medication does not depend on the aetiology. 

Explanation often leads on to basic advice about helping the 
children’s development. The first steps with parents are to establish 
whether there is already a framework of frequent warm interactions 
and effective ways of giving instructions and following up children’s 
actions with consistent patterns of reward or loss of reward. If this 
does not already exist, then a parent training group is often helpful. 
Both a supportive atmosphere and the teaching of skills in behav- 
iour modification seem to be necessary. The target behaviours for 
modification are often the ones most troublesome to parents— 
disobedience and aggression—rather than restlessness or inatten- 
tiveness specifically. 

Liaison with schools should include advice on the severity of 
the problem and the intensity and nature of extra help that will 
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be required. Teachers will often be able to share good practice in 
classroom management. One of the principles is to maintain good 
stimulus control, for instance by having the affected child at the front 
of the class under the teacher’s eye. Another is to find opportun- 
ities for the children to let off physical energy (they can sometimes 
be used as messengers between classrooms) and to learn in short 
chunks. Variety and interest in the material to be learnt or under- 
stood are useful. Transitions between activities in the classroom are 
often the time for children to become disorganized, and the child 
with ADHD should be the first to change activity, with the teacher’s 
supervision. Individual attention is probably the most effective re- 
source in the classroom, but it is also very demanding—a classroom 
assistant may help to achieve it. Star charts for younger children and 
token economy systems for older ones are often recommended but 
usually depend upon the system used for the rest of the class. 

In summary, this section has presented a picture of ADHD and a 
severe form—HKD—as disabilities that change with development 
and are often accompanied by other problems that can mask it or 
themselves be masked by it. They are rewarding challenges for diag- 
nosis and treatment in adulthood, as well as during childhood and 
adolescence. 
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Although the first description of an ADHD-like disorder dates 
back nearly 250 years to 1775 [1], the aetiology of ADHD and the 
basic mechanisms underlying the disorder are still relatively poorly 
understood. Hypotheses about mechanisms have been defined at 
different levels, including cognitive, neurophysiologic, and mo- 
lecular theories and models. 


£ 
J 


Cognitive f 
As explained in Chapter 31, ADHD is a behaviourally defined 
disorder. However, cognitive deficits have been studied as an in- 
tegral part of it. Individuals with ADHD have, on average, a 7- to 
12-point lower IQ than non-ADHD matched controls [2]. This as- 
sociation seems to be partly explained by shared genetic risk fac- 
tors between ADHD and IQ [2]. Initial theories claimed ADHD to 
be due to a single core cognitive deficit, for example in sustained 
attention [3], non-optimal regulation of the energetic state, and 
selected problems in activation [4], weak inhibitory control, that is, 
the capacity to voluntarily inhibit or regulate prepotent attentional 
or behavioural responses [5] or delay aversion and/or altered sensi- 
tivity to reward [6]. However, from more recent work in both chil- 
dren and adults, it is clear that ADHD is characterized by deficits 
in relatively independent cognitive domains, and that there is a 
large heterogeneity in the cognitive deficits across individual pa- 
tients (for example, [7, 8]). Executive functioning deficits are seen 
in visuo-spatial and verbal working memory, inhibitory control, 
vigilance, and planning. Reward dysregulation in patients with 
ADHD is reflected in suboptimal and more risky decisions and 
preference for immediate, compared with delayed, rewards. Other 
domains impaired in ADHD are temporal information processing 
and timing, speech and language, memory span, speed processing, 
response time variability, arousal/activation, and motor control 
[1]. Although most ADHD patients show deficits in one or two do- 
mains, some have no deficits, and very few show deficits in all do- 
mains. Attempts to identify subgroups with homogenous cognitive 


unctioning in ADHD 


profiles have revealed four cognitive subtypes in a community 
sample of control children and among subjects with ADHD [9]. 
This was replicated in a sample of adults with ADHD and adult 
controls [10]. It supports the view that at least part of ADHD’s cog- 
nitive heterogeneity is nested within normal variation. It is, how- 
ever, unclear whether these subtypes predict treatment response or 
course. It is also unclear whether cognitive deficits cause ADHD 
symptoms, though cognitive deficits appear to moderate the devel- 
opment of the clinical phenotype [11, 12]. 

Neuroimaging findings in ADHD 

Chapter 35 summarizes our current understanding of the brain 
structural and functional correlates of ADHD (see also [1, 13]). 
Briefly, alterations have been observed in virtually all neuroimaging 
modalities applied to the study of the ADHD brain, including 
structural and fMRI, EEG, and MEG. Grey matter structure, both 
cortical (thickness and surface area) and subcortical (volume), 
seems to be less affected in adults than in children with ADHD 
[14, 14a]. In functional activation studies in children with ADHD, 
hypoactivation relative to comparison subjects was observed 
mostly in systems involved in executive function (fronto-parietal 
network) and attention (ventral attentional network). Significant 
hyperactivation in ADHD relative to comparison subjects was 
observed predominantly in the default, ventral attention, and 
somatomotor networks. In adults, ADHD-related hypoactivation 
was predominant in the fronto-parietal system, while ADHD- 
related hyperactivation was present in the visual, dorsal attention, 
and default networks [15]. Most central to the mechanistic models 
of ADHD are MRI findings showing structural differences between 
patients with ADHD and normally developing individuals in the 
basal ganglia, especially the different components of the striatum 
(for example, [14]) and a potential structural and/or functional 
dysregulation of fronto-subcortical-cerebellar connections, which 
are of importance in controlling attention and salience thresh- 
olds, motor behaviour, inhibitory control, and response to reward 
[1, 13]. 
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Atypical brain activity underlying multiple cognitive processes 
and resting state activity has further been reported in EEG and 
ERP studies. Abnormal ERP activity has been found in both chil- 
dren and adults with ADHD, in association with attentional allo- 
cation, inhibition, response preparation, and error processing [16]. 
Preparation—-vigilance measures were also predictive of remission 
of ADHD over adolescence into adulthood [16]. Similarly, atypical 
patterns of quantitative EEG frequency have been observed, mostly 
as increased power of low-frequency theta activity and/or decreased 
power of fast beta activity [17]. Excessive theta/beta ratio cannot 
be considered a reliable diagnostic measure of ADHD but may be 
useful as a prognostic measure [18]. 

Importantly, the brain alterations seen in ADHD are heter- 
ogenous, with individual patients showing different patterns of al- 
terations and individual neuroimaging findings having very limited 
effect sizes. As a result, neuroimaging has not found its way into 
diagnostic protocols for ADHD, nor has it been predictive for treat- 
ment planning [19, 20]. 


Molecular neurobiology of ADHD 


Knowledge about the molecular neurobiology of ADHD has so far 
largely relied on serendipity and coincidental findings, for example 
from medication studies and work on animal models. Additional 
evidence for the involvement of particular pathways comes from 
genetics as well as first metabolite biomarker studies. For example, 
a comprehensive meta-analysis of potential biomarkers found sev- 
eral measures, specifically noradrenaline, monoamine oxidase, 
3-methoxy-4-hydroxyphenylethylene glycol, and cortisol, to be 
significantly altered in blood and urine of drug-naive/drug-free 
patients with ADHD, compared to healthy individuals [21]. Some 
of the metabolites were also associated with symptom severity of 
ADHD and/or the response to ADHD medication. 


Monoaminergic neurotransmission pathways in ADHD 


The serendipitous finding that methylphenidate (MPH) treats 
ADHD symptoms originally directed research into the role of dopa- 
minergic neurotransmission in the aetiology of the disorder. This ef- 
fort was soon extended to include noradrenaline pathways, given the 
realization that MPH is not selective to the dopamine synapse, but 
also blocks noradrenaline transporter function. Later, serotonergic 
neurotransmission has also been found to be involved. 


Dopamine 


The neurotransmitter dopamine is involved in regulation of motor 
activity and limbic functions, but also in attention and cognition, es- 
pecially executive functioning [22] and reward processing [23-25]. 
It is a key contributor to behavioural adaptation and to anticipatory 
processes necessary for preparing voluntary action following in- 
tention [26]. Thus, the function of dopamine maps well onto those 
domains implicated by the signs and symptoms observed in people 
with ADHD. Further, dopamine circuit dysfunction has been impli- 
cated in ADHD by different experimental evidence [27]. 
Dopamine-producing cells are localized in the midbrain 
substantia nigra pars compacta and the ventral tegmental area. 
From there, three projection pathways can be distinguished: the 
nigrostriatal pathway, which originates from the substantia nigra 
and projects to the dorsal striatum (caudate nucleus and pu- 
tamen); the mesolimbic pathway, which projects from the ventral 


tegmentum to limbic system structures, in particular the ventral 
striatum (nucleus accumbens), hippocampus, and amygdala; and 
the mesocortical pathway also originating in the ventral tegmental 
area, which projects to the cerebral cortex (the medial prefrontal 
areas, in particular) [28]. 

As indicated previously, the dopamine transporter—which is the 
most important molecule in the regulation of dopamine signalling 
in most areas of the brain—is the main target of stimulants like 
MPH and also dexamphetamine, the most frequently used pre- 
scription drugs for the treatment of ADHD symptoms. Blockade 
of the dopamine transporter leads to an increase in dopamine con- 
centration, particularly in parts of the basal ganglia that have the 
highest expression of the transporter, that is, the striatum [29, 30]. 
This effect is due to the blockade of the transporter molecule in 
the case of MPH [31] and due to both transporter blockade and 
stimulation of dopamine release/block of breakdown through 
monoamine oxidase in the case of dexamphetamine [32]. The 
dopamine transporter protein (DAT) and its gene DAT1 (official 
name SLC6A3) have thus received the most attention in research of 
mechanisms underlying ADHD. In animal models, knockout of the 
DAT1 gene produces elevated dopaminergic tone and hyperactivity 
in the mouse [33]; the latter is also observed upon knock-down of 
the dopamine transporter in the fruit fly Drosophila melanogaster 
[34]. The neonatal 6-hydroxy-dopamine lesioned rat model also 
implicates the dopaminergic system in ADHD-like behaviour [35]. 
Neuroimaging studies of the dopamine transporter in humans with 
PET suggest increased dopamine transporter activity in ADHD, 
compared with healthy individuals [36, 37], and evidence for de- 
pressed dopamine signalling has been inferred from alterations in 
dopamine receptors seen in PET [38]. Disturbed dopamine signal- 
ling has also been suggested by findings of genetic studies. Genetic 
polymorphism in the 3’-regulatory region of the DATI gene has 
been the subject of most studies. Meta-analyses have supported 
significant associations of this genetic variation in the gene, albeit 
different versions of the gene in children and adults with ADHD 
[39-41]. Furthermore, an analysis of genetic variants in a larger 
group of genes involved in ADHD suggested an association of this 
set of genes with the severity of symptoms in children with the 
disorder [42]. 


Noradrenaline 


Noradrenaline signalling is intimately linked to the dopamine system 
because noradrenaline is a downstream product of the metabolism 
of dopamine. Noradrenaline neurotransmission regulates important 
higher cognitive functions, such as working memory and inhibitory 
control, primarily through its projections originating in the locus 
coeruleus and innervating multiple areas of the cortex, the thalamus, 
and cerebellum [27]. Innervation of the prefrontal cortex (PFC) by 
noradrenaline pathways is thought to be particularly important for 
understanding ADHD. Noradrenaline and dopamine signalling are 
intimately linked in the PFC, that is, they influence each other in 
optimizing PFC performance in cognitive tasks [43-45]. A role of 
noradrenaline in ADHD is implied by the inhibition of the noradren- 
aline transporter (NET) by MPH and dexamphetamine (in addition 
to DAT) [27, 44]. Moreover, atomoxetine, a selective NET inhibitor, 
is effective in the treatment of the cardinal symptoms of ADHD and 
some of its comorbidities [46, 47], as are several other prescription 
drugs with noradrenergic, but not dopaminergic, properties, like 
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guanfacine and clonidine [27]. While this is potent evidence that 
altering noradrenaline signalling can ameliorate the symptoms of 
ADHD, less evidence is available to link it to ADHD aetiology per se. 
This may primarily be due to the concentration of research effort on 
the dopaminergic pathways and the large overlap between dopamine 
and noradrenaline synthesis and function. No animal models for 
ADHD based on altering genes involved directly in noradrenaline 
signalling have yet been described, but many models actually impli- 
cate both dopamine and noradrenaline neurotransmission circuits 
[48]. PET of the NET has been inconclusive, thus far [49]. Genetic 
studies of anumber of noradrenaline receptors and the NET have not 
produced convincing evidence for the involvement of these genes ei- 
ther [40, 50]. 


Serotonin 


Serotonin is involved in regulating mood and emotion, and also 
plays an important role in inhibition, one of the executive cognitive 
deficits observed in ADHD [51]. The neurons of the raphe nuclei 
in the midline of the brainstem are the main source of serotonin 
in the brain. Axons of neurons in the higher raphe nuclei spread 
out to the entire brain, with strong projections, for example, into the 
PFC, while axons originating in the lower raphe nuclei project to the 
cerebellum and spinal cord (see also Chapters 11 and 14). Serotonin 
signalling is known to affect the regulation of other neurotransmit- 
ters, including that of dopamine, which may occur through sev- 
eral mechanisms [52]. Neurotransmission through serotonin was 
first implicated in ADHD based on paradoxical calming effects of 
MPH observed in a mouse model lacking DAT [53]; the drug was 
shown to act by blocking the serotonin transporter in the absence 
of DAT [54]. Also other animal models with altered serotonin sig- 
nalling show ADHD-like symptoms and inattention, as well as 
hyperactivity [51]. In humans, studies have reported reduced levels 
of peripheral serotonin in patients with ADHD (for example, [55]), 
but other studies did not find such effects [51]. The exact role of 
serotonin in ADHD still has to be defined in humans. Serotonin 
neurotransmission may modulate the severity of ADHD symptoms, 
rather than being related to ADHD onset [42]. Other theories sug- 
gest that it may be the comorbidity, especially with antisocial behav- 
iours (conduct disorder, obsessive-compulsive disorder, aggression) 
and mood disorders (major depression and/or anxiety), rather than 
the core symptoms of ADHD, which is influenced by serotonin [51]. 
Genetic studies of the contribution of the serotonergic system to 
ADHD have not been fully convincing. However, ina meta-analysis, 
the serotonin receptor gene HTR1B and the gene encoding the sero- 
tonin transporter (SLC6A4, 5-HT'T, SERT) have been implicated in 
the disorder [40]. 

Gene-environment interactions may explain some of the ob- 
served inconsistency across studies, as the effect of stress on ADHD 
symptoms seems to be influenced by genetic variation in the sero- 
tonin transporter gene [56]. A recent analysis of a gene set re- 
lated to serotonergic neurotransmission suggests that variation in 
serotonergic genes may be associated with disease severity [42]. 
Tryptophan depletion, which causes reduction in brain 5-HT syn- 
thesis, was associated with increases of aggression, inattention, 
and impulsivity [51]. A retrospective pilot study on the adminis- 
tration of precursors of serotonin and dopamine led to promising 
results in 85 children and adolescents with ADHD [57]. However, 
in spite of this supportive evidence for serotonergic involvement in 


ADHD, findings from clinical trials with SNRIs, such as venlafaxine 
and duloxetine, in adults with ADHD are rather mixed (for review, 
see [51]). 


Other neurotransmission pathways in ADHD aetiology 
Glutamate 


Glutamate is the most abundant excitatory neurotransmitter in the 
human CNS and is involved in many neuronal functions, including 
synaptic transmission, neuronal migration, excitability, plasticity, 
and long-term potentiation [58] (see Chapter 14). The fronto-striatal 
circuits implicated in impulsivity and compulsivity are notable for 
their relatively rich glutamatergic receptor density. Glutamatergic 
projections from the various frontal subregions (orbitofrontal, 
infralimbic cortex, and prelimbic cortex) to the striatum (and vice 
versa) play a key role in the regulation of various compulsive be- 
haviours. Glutamate receptor proteins are expressed on the surface 
of neurons in such a way that they can only be activated from the 
outside, so glutamate exerts its neurotransmitter function from the 
extracellular fluid. Glutamate levels are regulated by releasing glu- 
tamate to the extracellular fluid and then removing glutamate from 
it. There are no enzymes located extracellularly that can degrade glu- 
tamate, so low extracellular concentrations require active reuptake. 
Several families of glutamate receptor proteins have been identified 
and classified as NMDA, AMPA, kainate, and metabotropic recep- 
tors [59]. Most, if not all, cells in the nervous system express at least 
one type of glutamate receptor. 

Several candidate genes within the glutamatergic system have been 
associated with ADHD. For instance, associations have been found 
for variation in the GRIN2B gene with both inattention and hyper- 
activity symptoms in ADHD [60]. A genome-wide study investi- 
gating rare variants found overrepresentation of variants belonging 
to the metabotropic glutamate receptor genes in several ADHD co- 
horts [61]. An analysis of a glutamate gene set showed significant 
association with the severity of hyperactivity/impulsivity of patients 
with ADHD [62]. Proton magnetic resonance spectroscopy (MRS) 
studies suggest a possible increase in Glx (a combination of glu- 
tamate, glutamine, and GABA) in the striatum across ADHD, OCD, 
and ASD, and further, an increased Glx signal in the anterior cingu- 
late cortex in children with ADHD and ASD, but a lower Glx signal 
in adults with ADHD and ASD. This suggests neurodevelopmental 
changes in fronto-striatal glutamatergic circuits across the lifespan 
[63]. Glutamatergic agents, such as memantine, an antagonist of the 
NMDA receptor, are of potential value in the treatment of impul- 
sivity in children and adolescents, including ADHD [64, 65], but 
large-scale positive trials have not yet been published. 


Histamine 


Histamine is one of the key neurotransmitters regulating arousal 
and attention. The cell bodies of histamine neurons are found in 
the posterior hypothalamus, in the tuberomammillary nuclei. From 
here, these neurons project throughout the brain, including to the 
cortex, through the medial forebrain bundle. Histamine neurons in- 
crease wakefulness and prevent sleep [66]. In addition, histamine is 
an important agent in (neuro)immune reactions. Interest in the role 
of histamine in ADHD has resulted from observations that allergies 
have an increased incidence in people with ADHD. Indeed, a re- 
cent meta-analysis showed that children with ADHD are more likely 
to develop asthma, allergic rhinitis, atopic dermatitis, and allergic 
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conjunctivitis than healthy individuals [67]. Similarly, children with 
allergies appear to have higher ADHD symptom ratings than non- 
affected children [68]. The histamine H3 receptor subtype is mainly 
distributed in the CNS and functions as a presynaptic autoreceptor 
(that reduces histamine release) and a heteroreceptor (that regulates 
the release of other neurotransmitters). Histamine H3 receptor ant- 
agonists and inverse agonists increase the release of brain histamine 
and other neurotransmitters. The H3 receptor antagonists have been 
shown to promote arousal in various species, without the psycho- 
motor activation seen with stimulants [69]. Potent histamine H3 re- 
ceptor antagonists are currently being developed and tested for the 
treatment of ADHD [70]. 


Nicotinic acetylcholinergic system 


Nicotinic acetylcholine receptors are receptor proteins that re- 
spond to the neurotransmitter acetylcholine. Nicotinic receptors 
also respond to drugs, including the nicotinic receptor agonist 
nicotine. Nicotine use has been associated with improvement 
in cognition, attention in particular, in different animal species, 
healthy human volunteers, and patients with ADHD [71-75]. In 
addition, the nicotinic acetylcholine neurotransmission system is 
also implicated in ADHD through genetic findings; a large study 
of copy number variants found duplications of the gene encoding 
the a7-nicotinic acetylcholine receptor (CHRNA7), located in the 
mutation-prone region on chromosome 15q13.3, to contribute to 
the risk for the disorder [76]. The nicotinic acetylcholine system 
may be one of the new targets for the development of alternative 
drugs for ADHD. Nicotine appears to exert its beneficial effect se- 
lectively on behavioural inhibition and delay aversion tasks, which 
are known to have good discriminant validity in distinguishing 
subjects with ADHD from controls [75, 77]. Stimulation of neur- 
onal nicotinic acetylcholine receptors by nicotine may be medi- 
ated directly via changes of cholinergic neurotransmission and/or 
by modulating the activity of other neurotransmitters, including 
dopamine (see Dopamine, p. 310). Trials of nicotinic drugs dem- 
onstrated beneficial effects in adults with ADHD, with evidence 
for positive effects on cognitive and emotional domains [78]. There 
are no approved medications yet for ADHD that target nicotinic 
acetylcholine receptor function. 


The genetic architecture of ADHD 


ADHD isa highly heritable disorder, both in children and in adults, 
with heritability estimates of around 60-80%, based on twin studies 
[79, 80]. As discussed in Chapter 34, the genetic architecture of 
ADHD is complex and multifactorial, such that most patients prob- 
ably carry many genetic risk factors, which are individually of small 
effect size. Importantly, twin studies as well as molecular genetic 
studies suggest that there is incomplete overlap between the genetic 
factors that are involved in ADHD onset and those that are relevant 
for the persistence of ADHD across adolescence into adulthood [39, 
81-83]. Genetic studies can be an ideal starting point to understand 
the molecular and cellular processes involved in a disease. As for 
other disorders, genetic approaches include hypothesis-based can- 
didate gene approaches and hypothesis-free genome-wide studies 
[84]. The first candidate gene-based studies for ADHD, dating backto 
the 1980s, mainly concentrated on genes related to monoaminergic 
neurotransmission, based on information on the drugs effective in 
treating ADHD symptoms described in previous paragraphs [1, 79]. 


Genome-wide association studies [50] and genome sequencing are 
now becoming available in increasingly large samples and will even- 
tually allow new hypotheses regarding the basic biological mechan- 
isms of the disorder [1, 84, 85]. 


The role of the environment 


The behaviour of any individual develops from an interplay of na- 
ture and nurture; neither genetic nor environmental factors are 
likely to act in isolation [86]. Different forms of gene-environment 
interplay (see review in [87]) and gene-environment correlation 
[88-91] are relevant to ADHD and are also described in Chapter 34. 
Environmental factors known to increase ADHD risk and/or dis- 
ease severity and persistence include pre-/perinatal as well as post- 
natal factors (for example, [1]). Exposures occurring in utero and in 
the early years of life, which are major windows of developmental 
vulnerability, have the strongest effects on ADHD. The brain is then 
growing at maximal speed, and the blood-brain barrier provides 
only partial protection against the entry of substances into the CNS 
[92]. Prenatal exposures to neurotoxicants, such as organic solvents, 
pesticides, and flame retardants [93-95], as well as the heavy metals 
zinc and lead [21], are linked to ADHD. Additional risk from ma- 
ternal smoking and alcohol use during pregnancy may be partly me- 
diated by genetic factors [96] (see Chapter 34). Postnatally, stressful 
events like early deprivation [97], harsh parenting practices [88], 
and composites of early adversity (low social class, marital discord, 
large family size, paternal criminality, maternal mental disorder, 
and foster care) [98] are of importance. Particularly informative is a 
follow-up study of Romanian adoptees, who had experienced severe 
early deprivation in orphanages during early life prior to adoption. 
This showed a dose-dependent relationship of the length of depriv- 
ation with the risk for development of ADHD-like symptoms [97]. 
Exposure to increased stress due to negative life events or daily has- 
sles has been shown to exacerbate ADHD core symptoms [56] and 
may contribute to ADHD persistence. Like genetic risk factors, the 
effects of any one environmental risk factor are small and, rather 
than being specific to ADHD, these environmental risks are associ- 
ated with several psychiatric disorders. 

The environmental contribution to ADHD is viewed as an in- 
creasingly important target of research, because environmental 
factors may, in principle, be modifiable, though not always easily in 
practice. Identification of protective factors is as important as scru- 
tinizing the risk factors, and this may contribute to therapy and pre- 
vention in ADHD. 


Mechanistic theories underlying ADHD 





Hypotheses about the mechanisms underlying ADHD have been 
defined at molecular, brain neurophysiologic, and cognitive levels. 
A number of models have been put forward, based on this work, 
which aim at vertical integration of the information across these do- 
mains of function. We mention the more prominent ones here, al- 
though there is insufficient evidence yet as to what the most accurate 
model is to describe the processes underlying ADHD and how the 
different models relate to each other. In all of them, dopamine and 
noradrenaline, in some cases in their relation to glutamate, play a 
central role. Taking into account the important regulatory activities 
of the monoaminergic neurotransmitters in glutamate signalling (as 
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Fig. 32.1 Dopamine and noradrenaline are important in regulating 
glutamatergic neurotransmission in cortical neurons. Noradrenaline 
amplifies signals coming in from other neurons via NMDA-type 
glutamatergic receptors, while dopamine reduces background noise. In 
this way, specificity of signalling of the pyramidal neurons is ensured. 


Adapted from J Am Acad Child Adolesc Psychiatry, 51(4), Arnsten AF, Rubia K, 
Neurobiological circuits regulating attention, cognitive control, motivation, and 
emotion: disruptions in neurodevelopmental psychiatric disorders, pp. 356-367, 
Copyright (2012), with permission from American Academy of Child and Adolescent 
Psychiatry. 


depicted in the schematic in Fig. 32.1 and described in [99]), this is 
not surprising. 

The dynamic developmental model of ADHD was largely based 
on research on an animal model for ADHD—the spontaneous 
hypertensive rat. Published by Sagvolden and coworkers in 2005, 
it supposes low tonic and low phasic dopamine in cortico-striatal 
loops, resulting in an ineffectiveness to regulate signalling of other 
neurotransmitter systems. Through the mesolimbic dopaminergic 
pathway, this is thought to produce altered reinforcement of behav- 
iour and deficient extinction of previously reinforced behaviour, re- 
sulting in the rise to delay aversion, development of hyperactivity 
in novel situations, impulsiveness, deficient sustained attention, 
increased behavioural variability, and failure to ‘inhibit’ responses 
(‘disinhibition’). Hypofunction of the mesocortical dopamine 
pathway is thought to cause attention problems and poor execu- 
tive functions. Altered nigrostriatal dopamine is linked to impaired 
modulation of motor functions and deficient non-declarative habit 
learning and memory [100]. 

A model for cognitive deficits in ADHD, in which the dopa- 
mine and noradrenaline systems and the basal ganglia are central, 
was proposed by Frank and coworkers, based on computational 
modelling [101]. They suggested that tonic and phasic dopa- 
mine in the striatum is low in people with ADHD and that tonic 
noradrenaline is high, causing a lowered phasic noradrenaline. 
According to the model, the lowered dopamine levels should 
lead to deficits in learning from positive reinforcement. The 
altered noradrenaline levels may underlie the intra-individual 
variability observed in people with ADHD. Alterations in the 
levels dopamine and noradrenaline are viewed as occurring 
independently [28]. 


The moderate brain arousal model poses the combination of low 
tonic with increased phasic dopamine to cause the symptoms of 
ADHD [102, 103]. This is in contrast to the dynamic developmental 
model and the basal ganglia model, which suppose both tonic and 
phasic dopamine to be low in ADHD. The low tonic dopamine re- 
sults in hypo-arousal when few stimuli are provided to the brain, 
but also in increased stimulus responsivity of cells in terms of phasic 
dopamine, resulting in exaggerated responses to environmental 
stimuli. Since dopamine shows an inverted-U dose-response rela- 
tionship with performance, both too little and too much dopamine 
will result in underperformance of the brain, as seen in ADHD [28]. 

The dopamine transfer theory—a purely theoretical model to ex- 
plain some of the symptoms of ADHD—supposes that the dopa- 
mine response to a reward at the cellular level is normal in people 
with ADHD, but that it cannot be transferred to a reward-predictive 
cue due to an insufficient (low) phasic dopamine level [104]. This 
was hypothesized to result in a delay in the dopamine signal at the 
level of the cell and the observed reduction of the anticipatory stri- 
atal brain activity in people with ADHD [105]. 

The neuroenergetics model is defined at the cellular level and pos- 
tulates that neurons function suboptimally in people with ADHD 
because they cannot adequately recruit lactate from astrocytes [106]. 
The putative cause is altered noradrenaline and glutamate neuro- 
transmission, which is needed to stimulate astrocytes to release the 
necessary lactate. Lactate is produced in astrocytes by glycolysis and, 
after uptake by neurons, is converted into pyruvate and glucose. The 
astrocyte-neuron lactate transfer shuttle system ensures a supply 
of substrates for brain metabolism. There is increasing evidence for 
lactate acting as a signalling molecule in the brain, to link metab- 
olism, substrate availability, blood flow, and neuronal activity [107]. 
The suboptimal use of the shuttle system is proposed to result in 
lack of energy, which causes attention drifts, mental fatigue, and re- 
sponse variability, as observed in ADHD. 


Future prospects for novel targets in ADHD 


In the coming years, the hypothesis-generating data coming from 
genome-wide analyses can be expected to identify the mechanisms 
underlying the disorder; they may also contribute to developing 
treatment that goes beyond the current approach, which can only 
dampen the symptoms of the disorder. The first papers implicating 
novel biological processes and mechanisms in ADHD have now been 
published. This work has implicated the neurodevelopmental pro- 
cess of neurite outgrowth in the aetiology of the disorder [85]. More 
recently, integration of genome-wide association studies of common 
and rare genetic variants found several additional biological pro- 
cesses to be involved, including ligand-gated ion channel activity 
and oxidative stress-related pathways [108, 109]. Such processes are 
known to be druggable and thus may represent the first candidates 
for the development of novel therapeutic approaches. Since the field 
of genetics research is currently upscaling strongly, more interesting 
findings can be expected in the coming years. Ultimately, we will 
need interdisciplinary approaches integrating research across dif- 
ferent levels of organismal complexity, including genetics, genomics, 
and cell- and/or animal-based model systems, as recently discussed 
in [50], to really understand the biology of ADHD and increase the 
chances of developing curative treatment options. 
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Since the publication of the third edition of the Diagnostic and 
Statistical Manual of Mental Disorders, third edition (DSM-III) 
in 1980, when the first reliable operational diagnostic criteria for 
attention-deficit/hyperactivity disorder (ADHD) appeared, the 
scientific literature on the various aspects of the disorder has been 
expanding significantly. Rigorous and informative epidemiological 
studies have been conducted to describe the distribution and nat- 
ural course of the disorder. The familial nature of ADHD and the 
importance of genetic risk have also been recognized, as described 
in Chapter 34. Nevertheless, as for any complex disorder for which 
no single risk factor is either necessary or sufficient to cause the 
disorder, longitudinal—mainly retrospective—and case-control 
studies have suggested that certain environmental factors increase 
the risk for ADHD. As is usual in medicine, epidemiological find- 
ings have contributed to shed light on aetiological mechanisms and 
temporal and cultural effects, to plan services, and to monitor de- 
livery of treatment, among other clinical and research advances. It 
has the potential to contribute to the next challenge of the field—to 
prevent the development of the disorder. 








Studies investigating the prevalence of ADHD emerged soon after 
DSM-III was published in 1980. As the studies accumulated, the 
overall result was of a significant variability on reported rates, ran- 
ging from as low as 1% to as high as nearly 20% among school-age 
children. This variability raised debates of whether broad cultural 
and societal modifications, mediated by changes in family structure 
and habits and also by increasing academic expectations and failing 
educational models, were producing the disorder. As described in 
Chapter 31, these debates fuelled concerns about the validity of 
ADHD. However, it was important, and initially underestimated, 
that surveys adopted a variety of methods (in regard of sampling 





strategy and study design), case definitions (for example, diag- 
nostic criteria), and strategies to collect information (for example, 
instrument, informant), and several studies were of low or very low 
quality. The conclusion that prevalence rates varied across settings 
was, in fact, premature, because methodologies influence the find- 
ings and direct comparisons between individual results were not 
possible. Therefore, the appreciation of study methods and the em- 
pirical assessment of their effect are key to interpret prevalence rates 
of ADHD. 


Study methods 


Sampling is the process of selecting study participants. This is a fun- 
damental process in an epidemiological study, because the sample 
must accurately represent the target population. If, for any reason, 
selected participants do not represent the specific population of 
interest, results cannot be generalized to that population and are 
valid only for the specific sample studied. A probabilistic sampling 
strategy, in which, by the principle of randomization, each eligible 
member of the population has a defined chance to be identified and 
included, is essential. Nevertheless, it is necessary to have an ob- 
jective list of all of the units in the target population, that is, a sam- 
pling frame. Examples of sampling frames are telephone directories, 
census data, and schools. For studies investigating the prevalence 
of childhood mental disorders, including ADHD, schools are fre- 
quently used as the sampling frame. However, if virtually all children 
from the population of interest are not enrolled in ordinary schools, 
or if individuals are enrolled in schools but are not attending classes 
(which may be closely linked to the presence of mental disorders), 
this strategy is likely to produce biased results. Different probabil- 
istic sampling strategies can be adopted to select the units within 
the frame such as simple random, stratified random, and system- 
atic random. As the sampling frame and eligible participants are 
identified, it is possible to identify and study the individuals who 
refuse to participate. Among eligible individuals, the proportion 
who give consent to participate and are actually assessed (that is, 
the response rate of a study) is an important indicator of the success 
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of the sampling procedure. If a substantial proportion of eligible 
participants do not take part in a survey, or if refusal to participate 
is non-random (that is, associated with specific characteristics of 
participants), results may be biased and inferences about the whole 
population of interest may not be possible. 

In one-stage design studies, diagnostic interviews are used to as- 
sess the whole sample. In two-stage design studies, a screening instru- 
ment is answered by the whole sample and individuals are selected 
for further assessment by diagnostic interviews based on its results. It 
is fundamental that not only individuals who are considered to have 
screened positive, but also a proportion of individuals who screen 
negative, are assessed with the diagnostic interview, so that rates of 
false and true positives (and negatives) at the screening phase can 
be determined and prevalence rates for the whole sample estimated. 

In terms of case definition, the inconsistencies between diagnostic 
criteria presented by DSM and the International Classification of 
Diseases (ICD) may also have reduced the homogeneity of preva- 
lence rates. The background to the overall approach to the diagnosis 
of ADHD has been described in detail in Chapter 31. Research diag- 
nosis of hyperkinetic disorder requires symptoms of both inattention 
and hyperactivity-impulsivity, whereas DSM-III-R, DSM-IV, and 
DSM-5 require symptoms of only one of the symptom dimensions. 
Therefore, prevalence rates according to ICD are consistently lower 
than those according to DSM. Differences between DSM-III-R and 
DSM-IV were subtle. DSM-5 introduced three important modi- 
fications to DSM-IV criteria: extended the upper limit of the age of 
onset of symptoms from 7 to 12 years of age; decreased the symptom 
threshold for adults (18 years of age and older) from six to five symp- 
toms; and indicated that ADHD could be diagnosed in comorbidity 
with autism spectrum disorder. The impact of the modification of the 
age of onset of symptoms on prevalence rates was investigated in a 
representative sample of the population prospectively followed during 
childhood. Results showed that extending the age of onset criterion to 
age 12 resulted in an increase in ADHD prevalence by age 12 of only 
0.1% [1]. As for adult ADHD, uncertainties regarding the most valid 
definition of the disorder (for example, reflected by modifications of 
symptom threshold, developmental manifestations of symptoms such 
as hyperactivity) and the most appropriate strategy to assess the dis- 
order (for example, reflected by the requirement, or not, of collateral 
informants) are very important for case definition in epidemiological 
studies. Reduction of the symptom threshold for diagnosing adults is 
expected to increase the expected prevalence rates by 27% [2]. 

Also in terms of case definition, diagnostic assessment, informant, 
and requirement for functional impairment for the diagnosis are all 
important. As for mental disorders generally, standardized assessment 
by using a diagnostic interview is a key component to guaranteeing 
consistency within and between studies (see Chapter 7). Interviews 
vary in the way questions are constructed, on the presence of gate/skip 
questions, and on the emphasis on the respondents understanding 
and interpretation of the questions (structured interviews, usually ad- 
ministered by lay interviewers) or on the interviewers’ interpretation 
of the responses (semi-structured interviews, usually administered by 
trained mental health professionals). Informants more frequently re- 
porting ADHD symptoms are parents, children (usually older than 
11 years), and teachers. The correlation between different informants 
is low to moderate, which suggests that symptoms are modulated by 
settings and/or different informants emphasize different aspects of be- 
haviour. Thus, it is indicated to use multiple informants and integrate 


information using different strategies such as the ‘best estimate pro- 
cedure; ‘and rule; and ‘or rule. The best estimate procedure implies 
that data from different sources and instruments are integrated by 
clinicians, who define the presence of a diagnosis, based on their 
best judgement of available information. The ‘and rule’ implies that 
a symptom should be considered positive if both informants endorse 
the symptom. The ‘or rule’ implies that a symptom should be con- 
sidered positive if one out of two informants endorses the symptom. 

Requirement of functional impairment for the diagnosis is an im- 
portant parameter for case definition, as individuals frequently meet 
symptomatic criteria but have minimal or no impairment criteria but 
have optimal functioning. This may be dependent upon the structure 
of the environment they are part of, the support they receive, the de- 
mands they are faced to, the expectations placed on their performance, 
and the very specific definition of impairment. Different definitions of 
functional impairment may be adopted such as measures of impair- 
ment specifically related to ADHD or global measures of impairment 
[such as the Child Global Assessment Scale [CGAS)]. The former has 
the advantage to be related specifically to the identified disorder, and 
the latter, although not specifying the source of impairment, is a good 
predictor of adaptative functioning and need of service. 


Prevalence rates in childhood and adolescence 


The first comprehensive literature review and meta-analysis on the 
prevalence rates of ADHD [3] was based on an extensive search 
strategy, and studies were included based on the following inclu- 
sion criteria: (1) original surveys on ADHD/hyperkinetic disorder 
prevalence (point prevalence); (2) diagnoses based on any DSM (II, 
III-R, or IV) or ICD (9 or 10) versions; (3) probabilistic sample from 
the general population or from schools; and (4) participants up to 
the age of 18 years. One hundred and two studies were included, 
which represented populations from North America (k = 32), 
Europe (k = 32), Asia (k = 15), South America (k = 9), Oceania 
(k= 6), Middle East (k = 4), and Africa (k = 4). Most studies had their 
samples ascertained from schools (k = 60), with a one-stage assess- 
ment (k = 61), adopting DSM-IV criteria (k = 44), with no impair- 
ment criterion (k = 68), and with parents as the most frequent source 
of information (k = 33). A meta-regression analysis was performed 
to empirically investigate the methodological approaches that im- 
pacted the detected rates. Multivariate meta-regression identified 
diagnostic criteria, source of information, and requirement of func- 
tional impairment for the diagnosis as methodological procedures 
significantly associated with variability of estimates. The analysis 
identified that study location was associated with heterogeneity only 
when estimates from Africa and the Middle East were compared to 
estimates from North America. Estimates from Europe, Oceania, 
South America, and Asia did not differ from estimates from North 
America. It is noteworthy that Africa and the Middle East were the 
continents with the fewest number of studies (k = 4 in each con- 
tinent). A meta-analysis resulted in a pooled prevalence rate of 
5.29% [95% confidence interval (CI) 5.01-5.56] (Fig. 33.1). 

A second comprehensive review of the literature on the preva- 
lence of ADHD and a meta-analysis including only studies that 
adopted DSM-IV diagnostic criteria were published in 2012 [4]. 
Eighty-six studies were included, and pooled estimates ranged from 
5.9% to 7.1%, depending on the source of information for the diag- 
nosis. Further results detected significant heterogeneity between es- 
timates, but once again, the country or region where the study was 
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Fig. 33.1 ADHD/HD pooled prevalence according to demographic characteristics and geographic location. 
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conducted did not explain the variability, supporting the previous 
findings. 

A third comprehensive review of the literature aggregated studies 
included in the first two meta-analyses [3, 4] and updated the search 
aiming to follow up the previous finding that estimates from Africa 
and the Middle East were different from those from North America 
[5]. As the influence oftime on prevalence of ADHD was still a matter 
of controversy, this study also aimed to investigate time-effects on 
prevalence rates. The third review detected 154 studies, and 135 
were included in the multivariate statistical models. The 25th, 50th, 
and 75th percentiles of distribution of studies, based on year of pub- 
lication, corresponded, respectively, to years 1997, 2001, and 2005. 
The studies represented populations from North America (k = 48), 
Europe (k = 42), Asia (k = 12), South America (k = 10), Oceania 
(k =7), the Middle East (k = 11), and Africa (k = 5). Univariate ana- 
lysis revealed that diagnostic criteria, impairment criterion, source 
of information, and geographical location of studies (but not year of 
study publication) were significantly associated with heterogeneity 
of prevalence estimates. Multivariate analysis indicated that source 
of information, impairment criterion, and diagnostic criteria re- 
mained significantly associated with heterogeneity of results. Rates 
based on teacher reports were an estimated 5.47% higher than those 
based on best estimate procedure, and those rates with no require- 
ment for impairment were an estimated 2.32% higher than when 
impairment was required. Regarding diagnostic criteria, rates based 
on DSM-III-R and ICD-10 were an estimated 2.42% and 4.09%, re- 
spectively, lower than rates based on DSM-IV. Study location was no 
longer significant when methodological variables were accounted 
for. Year of publication was not associated with heterogeneity, indi- 
cating the absence of time-effect on prevalence of ADHD (Fig. 33.2). 
No significant interaction was detected between year of publication 
and diagnostic criteria (Fig. 33.3), other methodological variables, 


and geographic location (Fig. 33.4), ruling out a possible increase in 
prevalence estimates over time in specific geographic locations or 
for specific study methods [5]. 

The lack of time-effect on prevalence rates detected is in accord- 
ance with studies that have investigated rates of inattentive and/or 
hyperactive symptoms, using equivalent rating scales and parents or 
teachers as informants across time. They found no evidence for a sys- 
tematic variation in prevalence of ADHD symptoms, with symptom 
levels and the proportion of children with clinical scores remaining 
stable or presenting small reductions over recent decades [6]. 

One important limitations of the existing epidemiological litera- 
ture is the scarcity of studies outside Europe and North America. 
In this regard, the Global Burden of Disease Study 2010 (GBD 
2010) and 2013 (GBD 2013) systematically reviewed the litera- 
ture and employed an imputation method to derive prevalence 
estimates by age and sex in three time periods (1990, 2005, and 
2010) for 21 world regions, including those with little or no data. 
For each of the time period, ADHD prevalence for boys was es- 
timated in 2.2% (95% CI 2.0-2.3) and for girls in 0.7% (95% CI 
0.6-0.7), indicating that prevalence remained stable over time [7]. 
Nevertheless, a very concerning scenario is the small proportion of 
the global population of children and adolescents represented in 
prevalence studies for any mental disorder. From the GBD data, it 
is estimated that only 5.47% of children and adolescents around the 
world are covered by ADHD surveys. This proportion is hugely dis- 
crepant between high-income and low-/middle-income regions— 
36.47% and 2%, respectively [8]. 

Other meta-analyses conducted on the prevalence of ADHD de- 
tected discrepant findings such as higher [9] or variable [10] rates 
across regions. Nevertheless, these analyses had different inclusion 
criteria that explained the contrasting findings, such as the inclu- 
sion of studies that relied on questionnaires for diagnosis and the 
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exclusion of studies using ICD criteria [9] and the inclusion of 
studies using regional diagnostic criteria for mental disorders [10]. 
More recently, a systematic review of surveys estimating the preva- 
lence of any mental disorder in the community across the world, 
including ADHD, was published [11]. The systematic search identi- 
fied 23,191 abstracts; 198 studies were selected for review, 48 studies 
met inclusion criteria, and 41 studies were included. The studies 
were published from 1985 to 2012 and represented populations from 
North America (k= 14), Europe (k = 14), Asia (k = 8), South America 
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Fig. 33.3 ADHD prevalence estimates over time as a function of 
diagnostic criteria. 

Reproduced from International Journal of Epidemiology, 43(2), Polanczyk G, 
Guilherme V, Willcutt, EG, ADHD prevalence estimates across three decades: an 
updated systematic review and meta-regression analysis, pp. 434-442, Copyright 
(2014) with permission from Oxford University Press. 


(k = 5), Oceania (k = 3), the Middle East (k = 2), and Africa (k = 2). 
A pooled rate of prevalence of any mental disorder was estimated 
in 13.4% (95% CI 11.3-15.9). The final multivariate meta-regression 
model identified sample representativeness, sample frame, and diag- 
nostic interview as significant moderators of prevalence estimates. 
Study geographic location and sample age range were not significant 
moderators. Secondary meta-analysis focusing on ADHD included 
33 studies, representing a sample of 77,297 individuals, and resulted 
in a pooled prevalence of 3.4% (95% CI 2.6-4.5) [11]. 


Prevalence rates in adulthood 


Prevalence studies of ADHD in adulthood emerged approximately a 
decade after studies in childhood, and significant fewer surveys have 
been conducted so far. A systematic review published in 2009 identi- 
fied seven samples eligible for a meta-analysis, which yielded a preva- 
lence rate of 2.5% (95% CI 2.1-3.1). Mean age of participants ranged 
from 19 to 45 years, but mostly up to 28 years, with only one sample 
with a mean age of 45 years [12]. Prevalence rates were detected to 
have a significant negative association with age, and this association 
was moderated by the gender composition of the sample. In a study 
investigating individuals from 60 to 94 years, the estimated preva- 
lence rates of ADHD weighted back to the general population was 
2.8% (95% CI 0.86-4.64) using a cut-off point of six symptoms, and 
4.2% (95% CI 2.05-6.39) using a cut-off point of four symptoms [13]. 

Recently, the WHO’s World Mental Health (WMH) Survey 
Initiative reported estimated prevalence rates for adults 18 to 
44 years of age in 20 WMH surveys. These surveys were conducted 
in 11 high-income, 5 upper middle-income, and 4 low- and lower 
middle-income countries. Estimated rates were based on retro- 
spective self-report of childhood ADHD, the presence of current 
symptoms and related impairment, and a multiple imputation 
method to assign clinical diagnoses of adult ADHD. Prevalence of 
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Fig. 33.4 (see Colour Plate section) ADHD prevalence estimates over 
time as a function of geographic location of the studies. 


Reproduced from International Journal of Epidemiology, 43(2), Polanczyk G, 
Guilherme V, Willcutt, EG, ADHD prevalence estimates across three decades: an 
updated systematic review and meta-regression analysis, pp. 434-442, Copyright 
(2014) with permission from Oxford University Press. 


DSM-IV ADHD in childhood averaged 2.2% across surveys (range 
0.1-8.1%). Estimates had a positive correlation with country in- 
come. Current prevalence of adult ADHD averaged 2.8% across sur- 
veys (range 0.6-7.3%), a higher rate than childhood ADHD. Higher 
estimates in adulthood were also detected in high-income (3.6%) 
than in low- and lower middle-income (1.4%) countries [14]. 


Developmental changes 


The first prospective longitudinal studies that investigated the nat- 
ural course of ADHD across development included clinically re- 
ferred children and clearly documented that symptoms declined 
over time. An early systematic review initially estimated an exponen- 
tial decline during development, suggesting that virtually all chil- 
dren with ADHD would improve over time [15]. Subsequent studies 
have confirmed that remission of full diagnostic status was indeed 
common but that a substantial number of children remained symp- 
tomatic and most of them remained impaired as early adults [16]. 
Longitudinal studies documented that decline is more pronounced 
for hyperactive and impulsive symptoms, and less pronounced for 
inattention [16]. Indeed, cross-sectional studies assessing adults 
with ADHD indicated that inattention and executive dysfunction 
are the most common symptoms [17]. 

A meta-analysis of longitudinal studies confirmed that rates of 
ADHD persistence across development depended on its conceptual 
definition. The pooled rate for persistence at a mean age of 25 years 
according to full criteria (syndromatic persistence) for DSM-III, 
DSM-III-R, or DSM-IV adult ADHD was approximately 15%, and 
according to partial remission (symptomatic persistence) approxi- 
mately 65% [18]. The study with the longest prospective follow-up 
conducted until now evaluated adults at a mean age of 41 years who 
were clinically referred and diagnosed with ADHD as children, at a 


mean age of 8 years [19]. Among them, 22% met adult ADHD cri- 
teria (vs 5% in the comparison group). Individuals with childhood 
ADHD had worse educational, occupational, economic, and social 
outcomes as adults, as well as increased rates of antisocial person- 
ality disorder and substance use disorders [19]. 

More recently, prospective longitudinal studies following repre- 
sentative community samples of children to adolescence and adult- 
hood challenged the notion that ADHD always begins early in life. 
Moffitt et al. [20], studying the Dunedin Multidisciplinary Health 
and Development Study in New Zealand, documented ADHD 
cases prospectively from childhood to 38 years of age. The authors 
reported a follow-forward analysis of ADHD cases diagnosed in 
childhood and a follow-back analysis of ADHD cases diagnosed in 
adulthood. Follow-forward analysis revealed that only 5% of chil- 
dren with ADHD still met diagnostic criteria at the age of 38 but 
continued to experience difficulties in life adjustment. Follow-back 
analysis showed that only 10% of adults with ADHD had the diag- 
nosis as children [20]. Surprisingly, these results were replicated in 
two other prospective longitudinal studies of community samples 
from the UK [21] and Brazil [22]. Analysis of the E-Risk Study (UK) 
demonstrated that 21.9% of children with ADHD met diagnostic 
criteria at the age of 18 years. At the age of 18 years, only 32.5% of 
those with ADHD met criteria for ADHD during childhood [21]. 
Analysis of the Pelotas Study (Brazil) demonstrated that 17.2% 
of children with ADHD met diagnostic criteria at the age of 18- 
19 years. At the age of 18 years, only 12.6% of those with ADHD 
met criteria for ADHD during childhood [22]. Adult ADHD cases 
without a history of childhood ADHD were not fully explained 
by comorbidities, subthreshold symptoms, and information bias. 
A possible explanation is that subthreshold cases in childhood 
emerge as cases in adulthood when demands exceed capacities [23]. 
Alternatively, child-onset and adult-onset ADHD may be distinct 
disorders, with possible distinct aetiologies. In this direction, poly- 
genic risk score for ADHD (see Chapter 34) predicts childhood-, 
but not adult-, onset forms of ADHD [20]. Also, the likelihood of 
adult-onset ADHD arising in a monozygotic twin is not increased 
when the co-twin has childhood-onset ADHD [21]. The questions 
of whether adult-onset ADHD represents the late expression of 
early-onset neurogenetic risk factors or if it constitutes a distinct 
syndrome are yet to be answered [24]. 


Sociodemographic risk 


ADHD affects predominantly males, with a male:female ratio 
of 3-4:1 in clinical samples [25] and 2.4:1 in community samples 
[26], suggesting a significant referral bias. Among adults, this sex 
difference is attenuated, probably also due to referral biases (in the 
opposite direction) or sex-specific developmental trajectories of 
ADHD. Cross-sectional studies and administrative data detected 
associations with ethnicity, which are not detected by community- 
based studies. Low socio-economic status in early childhood is as- 
sociated with increased likelihood of ADHD, but it is not clear if it 
is a marker or risk, since there is a strong genetic risk and the dis- 
order leads to low educational and occupational achievement, or if 
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there is a causal relationship [27], possibly interacting with other 
risk factors [28]. 


Genetics 


Evidence from different approaches support the strong genetic in- 
fluences on ADHD, which are mostly of small individual effect. 
Genetic variants underlie multiple molecular, neural, and cogni- 
tive trajectories, partially shared with other neurodevelopmental 
disorders, that lead to behavioural phenotypes currently defined as 
ADHD [29, 30]. A detailed description is given in Chapter 34. 


Environmental risk 


Twin studies indicate that shared environmental risks are not rele- 
vant to the aetiology of ADHD. Environmental risk factors seem 
to exert their effect in the non-shared familial environment, that 
is, parental influences that are specific to a child, peer, and school 
influences and exposure to individual risks. Exposure to environ- 
mental stressors may result in biological changes, such as brain 
structure and function, and also in modifications of gene expres- 
sion. Environmental risk can also interact with genetic variants 
and, depending on them, can have different effects, that is, gene- 
environment interaction. Also, environmental risks can arise due 
to behaviours that are genetically influenced and may have partial 
or no effect on the outcome once the genetic effects are taken into 
account, that is, gene-environment correlation. Environmental risks 
have been investigated by different epidemiological methods, some 
of them suited to infer causality, but most of them not. Exposures to 
risk factors can be influenced by confounders, some of which are not 
identified and controlled for (for example, genetic effects), and also 
by the very presence of ADHD that can lead to the risk exposure (re- 
verse causation). Selection bias is also an important methodological 
caveat that may lead to false positive associations. 


Prenatal and perinatal factors 


Low birthweight, prematurity, obstetric complications, in utero 
exposure to maternal stress, tobacco, and alcohol and substances 
(medications and illicit drugs) are risk factors associated with 
ADHD. There are no studies to date demonstrating their causal 
effect. Associations between maternal stress and smoking during 
pregnancy and ADHD have been demonstrated by genetically sen- 
sitive studies to be explained mainly by genetic or other shared risk 
factors [31]. 


Toxins 


The association with prenatal exposure to toxins, such as organo- 
phosphate pesticides, polychlorinated biphenyls, and lead, has been 
studied in longitudinal studies, and also in cross-sectional studies 
which demonstrated higher levels in children affected by ADHD. 
Their causal association is biologically plausible, and animal studies 
have supported their negative effect over several domains of cogni- 
tive function. Nevertheless, their causal effect in humans has yet to 
be demonstrated. 


Diet 


Extreme nutritional deficits impair neurodevelopmental processes 
globally, but the effects of specific nutrients and how mild deficits 
at specific moments of development affect cognitive functions are 


not yet clear [32]. The associations between zinc, magnesium, and 
polyunsaturated fatty acids and ADHD have been studied in cross- 
sectional studies, and there is no convincing evidence indicating 
that they are risk factors, and not correlates. Sugar and artificial food 
colourings have also been identified so far as correlates. 


Other factors 


Obesity [33], sleep disorders [34, 35], and television exposure [36] 
have been associated with attentional problems and ADHD by 
cross-sectional, prospective, and retrospective longitudinal studies, 
with insufficient data to characterize the temporal ordering; at the 
moment, they are also understood as correlates. 


Family adversity 


Family adversity and harsh or hostile parent-child interactions 
have been consistently associated with ADHD but may arise as con- 
sequences of the disorder, rather than increase the risk of it [37]. 
The English and Romanian Adoptees study has followed children 
exposed to early severe institutional deprivation in Romania who 
were subsequently adopted by English families and studied their 
long-term emotional and cognitive developmental. Deprivation had 
strong effects immediately after adoption, followed by substantial 
catch-up for some children at the age of 6 [38]. At the age of 15 years, 
children exposed to more than 6 months of deprivation were four 
times more likely to present with ADHD than those with less than 
6 months of deprivation, and those children exposed to more than 
6 months of deprivation had persistently higher rates of inattention 
and hyperactivity through to the ages of 23 to 25 [39]. These results 
point to a causal role of very severe early deprivation in the develop- 
ment of ADHD with particular characteristics. Deprivation-related 
ADHD was not as likely as non-deprivation-related ADHD to affect 
boys and to present with comorbid conduct problems; it presented 
with high levels of comorbid social disinhibition and autistic fea- 
tures and severe neuropsychological impairment [40]. 


Conceptual and methodological issues 


Preventive actions target causal risk factors to reduce incidence, that 
is, new cases of disorders. Therefore, by definition, preventive inter- 
ventions are those implemented before the onset of the disorder. 
Those implemented after the onset are referred to as treatment or re- 
habilitation. There are three levels of preventive interventions: uni- 
versal, selective, and indicated. Universal interventions target the 
general public or a whole population group that has not been iden- 
tified on the basis of increased risk. Selective interventions target 
individuals or subgroups of the population whose risk of developing 
a disorder is significantly higher than average, as evidenced by bio- 
logical, psychological, or social risk factors. Indicated interventions 
target high-risk people who are identified as having minimal, but de- 
tectable, signs or symptoms or biological markers, indicating a pre- 
disposition for a disorder, but who do not meet diagnostic criteria 
at that time. Also preceding the disease onset, measures to enhance 
well-being (by increasing protective factors such as supportive 
family, school, and community environments) may also function as 
preventive measures [41]. 
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In the past decades, significant investment has been directed to 
the prevention science of mental, emotional, and behavioural dis- 
orders affecting youth. Researchers have successfully developed and 
tested interventions targeting depression, suicide, schizophrenia, 
and conduct disorder. These studies have begun to define the meth- 
odological challenges and strategies for the success of preventative 
interventions for mental disorders. Firstly, the causal mechanisms or 
origins of the disorder should be clear enough to be manipulated by 
strategies that are effective in reducing their occurrence or buffering 
their effects. Secondly, the disorder itself must be well characterized, 
together with the boundaries between the disorder and normalcy 
and its trajectory. Thirdly, methods to screen and identify individ- 
uals who are not affected by the disorder but are likely to develop 
it within a specific period of time should be available. Fourthly, the 
risks and costs related to the screening and intervention processes 
should be well balanced and not be higher than their potential bene- 
fits. Once all these challenges have been overcome, randomized con- 
trolled studies with adequate statistical power must demonstrate the 
benefits of the intervention. Statistical power reflects the necessary 
sample size to demonstrate reductions in incidence of the disorder, 
which may vary from hundreds to tens of thousands. It will depend 
on the base rate of new cases for the particular population studied 
and the magnitude of the effect of the intervention [42]. Preventive 
interventions are not necessarily ready to be used once their effect- 
iveness has been demonstrated, but they must be implemented, sus- 
tained, and scaled up within clinical and community systems and 
offered to the target populations. At this stage, elements such as 
programme contents, delivery strategy, intensity and duration, per- 
sonnel training and supervision, compliance, fidelity, and funding 
need to be addressed, so that they can be translated into real-world 
programmes [43-45]. 


Challenges of preventive interventions for ADHD 


The development of preventive strategies for ADHD, such as for any 
neurodevelopmental disorder, is a challenging endeavour. ADHD 
arises as a consequence of multiple mechanistic causal processes, 
for which no component has been identified so far as sufficient or 
necessary. More importantly, as reviewed before, only a few envir- 
onmental exposures—components that can be manipulated—have 
been identified so far as likely causal. Early severe institutional de- 
privation is likely to be causal but is mercifully uncommon in de- 
veloped societies. Pre- and perinatal conditions and environmental 
toxins have been demonstrated to be risk factors with an unclear 
causal role. No other manipulable causal risk factor has been iden- 
tified so far. In addition, genetic influences seem to be substantial, 
explaining a large proportion of heritability. The effects of genetic 
variants possibly interact with environmental exposures occurring 
early in life (some of them in the prenatal period) and lead to early 
emerging psychopathological processes. In addition, there are no 
current technologies to identify molecular and neural psychopatho- 
logical processes at their early stages, before the manifestation of 
behavioural symptoms. Therefore, with obscure causal mechanisms 
and early arising psychopathological processes, at the present mo- 
ment, ADHD becomes identifiable at a stage when strategies can no 
longer prevent its development. Early diagnosis and interventions 
have an important potential to prevent cumulative impairments and 
comorbidities over development but are not conceptually defined as 
preventive interventions. 


Therefore, preventive interventions that might reduce the in- 
cidence of ADHD and that are readily available are universal or 
selective programmes to act on the initial stages of common multi- 
disease mechanisms. In fact, compartmentalized programmes that 
focus on specific aspects of behavioural, educational, or social prob- 
lems are unlikely to be adopted as public policies, which is an argu- 
ment in favour of universal and selective prevention programmes 
[46]. These interventions potentially produce multiple benefits, 
which increase their cost-benefit and consequently their implemen- 
tation and scale. Several intervention programmes in early child- 
hood have been developed and tested worldwide [47, 48]. Some of 
the most studied interventions focus on programmes for the de- 
velopment of parental competences [49], on the stimulation of in- 
fants during the first 3 years of life [50], or on school curricula for 
the development of social and emotional competencies of children 
from 4 years of age [51]. Important features of a successful early 
childhood intervention programme include integrated models that 
combine elements geared to physical health, cognitive, emotional, 
and social development, sufficient duration and intensity, model- 
ling learning, structured curriculum, adequate material support for 
learning, systematic supervision, and clear incorporation of theories 
of change [52]. 

A number of issues arise from the experience to date [52, 53]. 
Firstly, is there a distinction between promotion and prevention 
strategies, and can the incidence of mental disorders be reduced 
with promotion strategies? It is hypothesized that the promotion 
of skills is one of the possible mechanisms through which mental 
disorders can be prevented [54]. Thus, as social-emotional skills 
are developed generally, there would be a secondary reduction in 
the occurrence of mental disorders. This is an example of the Rose 
prevention paradox and refers to the hypothesis that an interven- 
tion benefiting a large number of people at moderate risk may be 
of more value than an intervention benefiting only a small number 
of people at high risk [46]. However, it is a consensus that positive 
mental health and mental disorders are not just opposing poles of 
the same dimension, but two overlapping and interrelated compo- 
nents of the concept of mental health. Other important issues to 
be addressed are implementation characteristics, even for those 
programmes that have evidence of their effectiveness [43]. Among 
the main barriers are the limited description of the interventions, 
the overlap between elements of prevention and promotion in the 
same programmes. the limited theoretical conceptualization of the 
desired changes and how they can lead to the desired outcomes, 
and the external validity of the evidence. In addition, poor under- 
standing of the mechanisms involved in the effect of interventions 
and the lack of evidence of sustained effects beyond their initial 
implementation constitute significant barriers to the success of pre- 
vention science. 


Conclusions 


ADHD is an impairing neurodevelopmental disorder that affects ap- 
proximately 5% of children and adolescents worldwide, irrespective 
of their country of origin. In childhood, it affects predominantly 
males, although there is significant referral bias in their favour. In 
adulthood, sex differences are attenuated. ADHD is a long-lasting 
disorder, with symptoms or diagnosis persisting in approximately 
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70% of children as early adults. Cross-sectional studies estimate the 
adult ADHD prevalence in 2.5%. 

Emerging evidence suggests that ADHD may arise in adult- 
hood, challenging the notion that the disorder has always a 
neurodevelopmental origin. Currently, there are no definitive ex- 
planations for the so-called (ADHD adult-onset cases, and hy- 
potheses suggest that clinical cases in adulthood are preceded by 
childhood subclinical cases that emerge because of increasing de- 
mands or decreasing support or that child-onset and adult-onset 
ADHD are distinct disorders. In any case, no sufficient or neces- 
sary causal components (in terms of genetic risk, but also environ- 
mental influences and their interaction) have yet been implicated. 
Indeed, there are no environmental risks for ADHD identified so 
far of important magnitude and population-attributable fraction. In 
addition, molecular, neural, and cognitive psychopathological pro- 
cesses underlying the emergence of ADHD cannot yet be precisely 
identified, although, as explained in Chapters 32 and 34, the field is 
evolving rapidly. 

Prevention is especially appealing for childhood mental dis- 
orders, which emerge temporally close to putative risk exposures. 
However, aetiological mechanisms probably operate during the 
perinatal period, are dominated by genetic risk, and have very few 
established environmental causal risk factors. Future elucidation of 
specific causal mechanisms will be required to inform the develop- 
ment of effective preventive interventions. Until then, the science of 
prevention of childhood mental disorders can be best further devel- 
oped by accommodation with the growing interdisciplinary field of 
early childhood development. 
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Attention-deficit/hyperactivity disorder (ADHD) is a phenotypic- 
ally heterogenous disorder and, like other psychiatric disorders, is 
multifactorial in origin; multiple genetic, as well as environmental, 
influences contribute risk. The multifactorial liability model is 
one useful approach to conceptualizing complex disorders such as 
ADHD. This considers disorders, including ADHD, as manifest- 
ations of an underlying continuously distributed liability, with the 
diagnosis lying above a certain threshold on this liability curve (Fig. 
34.1). Multiple risk factors contribute to this liability, and the addi- 
tive effects of genetic and environmental risk and protective factors 
shift individuals along this population liability. This chapter will look 
at the role of genetic factors in the aetiology of ADHD and those fac- 
tors that have already been identified as being relevant, highlighting 
our current understanding regarding its genetic architecture. It will 
also discuss the methodological and phenotypic challenges that have 
influenced our current understanding. The heterogeneity of the 
ADHD phenotype has been discussed in more detail in Chapter 31. 








Traditional genetic studies that infer, rather than directly assess, gen- 
etic contribution have overwhelmingly found ADHD to be a highly 
familial and heritable disorder. Family studies have shown that the 
risk of ADHD is increased in first-degree relatives of those with the 
disorder, with modest relative risks of between 4.0% and 5.4% [1]. 
Similarly, adoption studies of ADHD have suggested that adopted 
children have ADHD behaviour scores that are more similar to their 
biological than adoptive parents [2-5]. 

Numerous twin studies have also indicated the importance of gen- 
etic factors in the aetiology of ADHD, with meta-analyses reporting 
heritability estimates of between 70% and 80% [6]. The remainder of 
the variance is generally found to be due to unique environmental 
effects, with only a very small proportion attributable to shared en- 
vironmental factors [6]. These studies suggest ADHD is one of the 





more heritable of psychiatric disorders, on a par with autism spec- 
trum disorder (ASD), bipolar disorder, and schizophrenia [7, 8]. 

These estimates are similar for the ADHD symptom dimensions 
of inattention and hyperactivity/impulsivity, with both common 
and separate genetic factors relevant to each dimension [6, 9-11]. 
There is also evidence from twin studies that genetic factors influen- 
cing ADHD act on a continuum, whereby heritability estimates are 
similar between those with a clinical diagnosis of the disorder and the 
traits seen within the general population (for example, [12, 13-15]). 
The informant utilized to ascertain ADHD symptoms appears to be 
important here and is an important consideration for genetic studies 
of ADHD. Twin studies using self-reported ADHD symptoms in 
adulthood have reported significantly lower heritability estimates of 
between 30% and 40% [16] than studies of children or adolescents 
that have generally utilized parent or teacher reports. The low self- 
report heritability estimates may reflect findings that affected indi- 
viduals are less accurate than others at describing their own ADHD 
behaviours [17]. However, investigations have suggested that the 
drop-off in heritability estimates is likely due to the fact that (unlike 
with parent and, frequently, teacher reports) self-reported measures 
involve different individuals rating each twin. Studies where dif- 
ferent parents or teachers reported on twins in childhood (for ex- 
ample, [18, 19]) have seen this same drop-off in heritability rates, 
while studies utilizing composite measures of parent and teacher 
reports and self-report in adulthood have observed high heritability 
rates of 60-70% [17]. While the initial findings of adult ADHD twin 
studies are no longer considered a challenge to the view that ADHD 
is highly heritable throughout the lifespan, they do highlight that 
methodological issues, such as the choice of informant, can impact 
upon our conclusions and understanding regarding the genetics 
of ADHD. 





While traditional twin and family studies are unable to identify 
which specific genetic factors might be involved in the disorder, 
they do indicate that molecular genetic studies should aid our 
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Fig. 34.1 Threshold liability model. Multiple risk factors, both genetic 
and environmental, are continuously distributed across the population. 
Individuals with a burden of risk factors above a certain threshold are 
considered to have a clinical diagnosis of the disorder. 


understanding of the aetiology and pathogenesis of ADHD. To date, 
such studies have involved investigation of different classes of gene 
variants, both rare and common, across the whole genome and at 
specific candidate locations. It has been known for some time that 
some types of rare genetic syndromes are associated with ADHD 
phenotypes. For example, 22q11.2 deletion syndrome, as well as 
being a known risk factor for schizophrenia, is commonly associ- 
ated with ADHD in childhood [20]. Other rare genetic syndromes 
associated with ADHD include fragile X syndrome [21] and tu- 
berous sclerosis [22]. It is not inevitable that affected individuals 
show intellectual disability, but these syndromes are commonly 
characterized by additional phenotypic characteristics (for example, 
congenital heart defect, dysmorphic features, epilepsy). A number 
of studies have investigated the contribution of rare chromosomal 
deletions and duplications known as CNVs to ADHD risk. As ob- 
served for schizophrenia, ASD, and intellectual disability, a higher 
burden of large, rare CNVs has been identified in individuals with 
ADHD, compared to control individuals [23-26] (see Chapters 23, 
28, and 59). 

The burden of large, rare CNVs is especially elevated in those 
who have intellectual disability, as well as ADHD, but findings are 
not restricted to this IQ group [25]. Not all studies have individu- 
ally observed an excess of CNVs in those with ADHD (for example, 
[27-29]). However, a pooled analysis of multiple data sets suggests 
overall there is an increased burden of CNVs in children with ADHD 
and a significant excess of duplications at chromosomes 15q13.3 and 
16p13.11 (24, 25]. ADHD-associated CNVs show significant overlap 
with those implicated in ASD and schizophrenia [25, 27]. These 
findings highlight that even though these disorders are clinically 
very different, there are shared genetic risk factors. Genetic findings 
provide a window into biology that is otherwise difficult to directly 
access for psychiatric disorders, including ADHD. CNVs associ- 
ated with ADHD have started to provide some clues on pathogen- 
esis. These chromosomal duplications and deletions span genes that 
encode proteins involved in multiple biological pathways that also 
partially overlap with those implicated in autism and schizophrenia. 
Results to date from ADHD CNV studies [27, 30-32] suggest sig- 
nificant evidence of enrichment for genes involved in the brain (for 
example, neurite outgrowth, ion channel pathways). However, the 


genes and biological mechanisms implicated in ADHD are not iden- 
tical to those in other disorders. That is perhaps unsurprising, given 
their phenotypic manifestations are dissimilar and also medications 
that are effective in reducing ADHD symptoms do not benefit core 
symptoms of schizophrenia and autism. 

Rare genetic mutations, including CNVs, can either be inherited 
from a parent with the same mutation (transmitted) or arise as the 
result of a spontaneous mutation in the individual (de novo). There 
have been some suggestions that spontaneous de novo CNVs may 
provide especially useful clues on genetic risk loci for disorders 
[33] and are associated with a larger effect size than transmitted 
CNVs [34]. While not all studies of ADHD have parental DNA to 
identify the origin of CNVs (for example, [32]), it seems that both 
transmitted and de novo CNVs have been identified in those with 
ADHD [25, 27-29]. Interestingly, some inherited CNVs seem to ori- 
ginate from parents who are also affected by ADHD [27-29] and 
involve regions previously implicated as risk loci for ADHD and 
other neurodevelopmental disorders [27, 28]. While further work is 
needed to replicate these findings and further elucidate their poten- 
tial role in the aetiology of ADHD, they highlight the relevance of 
looking beyond the individual at the wider family to further under- 
stand the genetic aetiology of ADHD. 

As one would expect for a multifactorial disorder, these rare 
genetic syndromes and CNVs are not observed in all affected in- 
dividuals and CNV carriers do not necessarily show ADHD alone. 
CNVs are one class of rare genetic variant involving chromosomal 
alterations. To date, rare variations in DNA sequence have not been 
examined in ADHD. However, common gene variants have been in- 
vestigated, and this will be discussed next. 


Candidate gene studies 


Initial molecular genetic studies were restricted to genotyping 
genetic variants at specific locations, rather than across the whole 
genome. In psychiatry, such candidate gene-based approaches are 
now regarded with scepticism, given the high likelihood of false 
positives. Also, the pathogenesis of psychiatric disorders is un- 
known, meaning that the selection of candidate genes is not based 
ona firm hypothesis. Nevertheless, for ADHD, some candidate gene 
variant findings were replicated and withstood meta-analysis. Meta- 
analyses have shown associations for variants within DRD4, DAT1, 
DRD5, 5HTT, and SNAP25 [35]. As the candidate gene methods 
have been replaced by newer hypothesis-free genome-wide associ- 
ation studies (GWAS), the relevance of these putative risk variants 
needs to be reconfirmed. While none of these findings have been 
upheld by GWAS to date, this might be observed in the future as 
sample sizes for such studies increase. This has been seen for schizo- 
phrenia; variants within the DRD2 gene that had been associated 
with schizophrenia in candidate gene studies were not replicated in 
smaller GWAS, but DRD2 has now been implicated by the largest 
schizophrenia GWAS to date [36]. 


Findings from ADHD GWAS 


GWAS use case-control or similar types of design to investi- 
gate the association between a disorder and a large number of 
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single-nucleotide polymorphisms (SNPs), a type of common gen- 
etic variant, across the genome simultaneously. This method is ad- 
vantageous, as it looks at multiple variants in a hypothesis-free (and 
therefore unbiased) manner, but requires stringent corrections for 
multiple testing as so many variants are tested for association [37]. 
This means that such studies require extremely large samples for as- 
sociations to reach genome-wide significance [38]. 

GWAS of ADHD have generally been undertaken on small, 
underpowered samples. In the first nine reported studies [23, 39- 
45], sample sizes for ADHD cases ranged from 343 to 1013, with 
a meta-analysis of four separate studies [46], including just over 
3000 individuals with ADHD. None of these studies reported any 
genome-wide significant findings, and there was little overlap in 
the top hits across studies. Although the small sample sizes were ac- 
knowledged to be an issue (for example, [47, 48]), as for similarly 
sized samples investigating schizophrenia or bipolar disorder [46], 
the lack of findings initially led some to question the relevance of 
common gene variants in the aetiology of ADHD [49]. 

However, recently, the ADHD sample size assembled by the 
Psychiatric Genetics Consortium (PGC) ADHD subgroup has risen 
to over 20,000 ADHD cases and 35,000 controls. Initial reports 
suggest twelve genome-wide significant loci (PGC-ADHD sub- 
group report, International Society of Psychiatric Genetics meeting, 
Toronto, October 2015) [49a]. This is an exciting development and, 
given the increased sample size in comparison to previous studies 
of ADHD, demonstrates the frequently cited need for large samples 
to find genome-wide significant risk loci [38, 50]. It is too early to 
understand the implications of these findings. Each of these vari- 
ants will need to be looked at in more detail to identify the specific 
causal variant, as has been undertaken in other complex disorders 
(for example, [51]). It will also be important to ascertain whether 
the genes implicated by the associated SNPs highlight specific bio- 
logical functions or pathways [36]. Furthermore, it is clear that these 
putative risk loci are only the ‘tip of the iceberg’ in understanding 
the aetiology of ADHD, as a multifactorial disorder influenced by 
multiple genetic and non-genetic risk factors. Also, a much larger 
number of common and rare genetic loci remain to be identified. 
Although the latest ADHD GWAS includes a much larger sample 
size, it is still small in comparison to those utilized for studies of 
other medical and some psychiatric disorders. Indeed, in the study 
of schizophrenia, where the largest GWAS study has identified 128 
genome-wide significant hits [36], it is recognized that this is not 
the full extent of the common genetic variance accountable for the 
disorder [50]. 

However, because these studies were sufficiently powered to iden- 
tify genetic loci with an effect size of 1.3 [37], we can conclude that 
there are no ADHD common genetic risk variants with a large effect 
size. This suggests that, as with other disorders such as schizophrenia 
(for example, [36]), the genetic architecture of ADHD is explained 
by a large number of common gene variants, each of small effect size, 
as well as rare mutations that have larger effect size. Environmental 
risks may also make an important contribution. 





Composite measures of common genetic risk 


While the recent GWAS findings are promising, any single SNP has 
a small effect size on its own. This has led to methods that involve 


generating composite measures of multiple common genetic risk 
variants [37, 52]. One such method is genomic relationship-matrix 
restricted maximum likelihood (GRML), which is often referred to 
by the software programme used to implement it—GCTA. Using 
genome-wide SNP data, this method estimates the genetic similarity 
between unrelated individuals. This makes it possible to identify 
SNP heritability, which ranges between 17% and 45% [45, 53-55] 
(although not all are in agreement [56]). Schizophrenia polygenic 
risk scores derived from much larger studies account for only 7% 
of the variance [36]. Common gene variants appear to account for 
only a proportion of the heritability identified by twin studies, and 
this will, in part, be due to the role of additional risks, including rare 
variants such as CNVs. 

A further method used to obtain a composite measure of common 
genetic risk is polygenic risk score analysis, which involves sum- 
ming multiple risk alleles (SNPs) identified by a ‘discovery’ GWAS, 
including those at a very nominal level of significance. Putative 
risk alleles in individuals in an independent sample are then com- 
bined to give an individual polygenic risk score. ADHD polygenic 
risk scores have been shown to predict ADHD caseness [57], which 
highlighted, prior to the PGC genome-wide significant findings, 
that common gene variants contribute to risk. 

ADHD is a multifactorial liability disorder whereby genetic and 
other risks are on a continuum throughout the population, with 
clinical diagnosis as one extreme (for example, [12-15]). Polygenic 
risk scores derived from the general populations ADHD trait scores 
predict ADHD diagnosis in a case-control sample [58], while the 
opposite is also observed; ADHD polygenic risk scores derived from 
patients with the diagnosis (case-control samples) predict ADHD 
trait levels in the general population [30, 59]. 

Polygenic risk scores (across all disorders) currently account for a 
very small proportion of the phenotype variance—0.1-2.0% in the 
current studies of ADHD (for example, [49a, 57, 59]). Therefore, 
while polygenic risk score analyses can give some insight into the 
genetic architecture of ADHD, they only provide one indicator of 
genetic risk and alone cannot be used as diagnostic prediction tools. 
As sample sizes increase and our ability to identify loci more ac- 
curately from GWAS increases, measurement of genetic risk should 
improve, and when coupled with other measures of genetic risk (for 
example, family history), predictive ability should also improve [60]. 


Gene-environment interplay 





Environmental exposures often believed to be risk factors for ADHD 
are described in Chapter 34. It will be explained in more detail here 
why the epidemiological findings may be a further expression of 
genetic, rather than simply environmental, risk. Environmental 
exposure in ADHD is not independent of an individual's genes— 
a phenomenon known as gene-environment correlation. Gene- 
environment correlation arises for environmental exposures (for 
example, quality of relationships) that are shaped by individuals’ 
behaviours or dispositions (active and evocative gene-environment 
correlation) or through their parents’ attributes (passive gene- 
environmental correlation). Further explanations and examples of 
gene-environment correlation can be seen in Table 34.1. Even ifan 
environmental risk has genetic origins, the impact on ADHD could 
still be environmental, although designs that go beyond observation 
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Table 34.1 Types of gene-environment correlations: descriptions and examples 





Passive 


Active (niche picking) 


Evocative 


(for example, genetically informative twin studies) are required to 
test this. For example, cigarette smoking is an environmental risk 
that is strongly genetically influenced (that is, has genetic origins), 
but it is an exposure that still has environmentally ‘mediated’ causal 
risk effects on lung cancer. 

There is evidence to suggest that some of the environmental risks 
thought to contribute to ADHD (for example, maternal lifestyle 
during pregnancy) do have partly genetic origins. The next ques- 
tion is whether the genetic factors that impact on ADHD environ- 
mental exposures (for example, maternal lifestyle) overlap with the 
ones that contribute to offspring ADHD. If that is the case, it would 
mean these environmental exposures, rather than being true causal 
risk factors for ADHD, represent expressions of the same genetic 
liability. Maternal smoking in pregnancy is a risk factor that is ro- 
bustly associated with ADHD [61] but that is also under strong gen- 
etic influence. Multiple genetically informative studies, including 
discordant sibling pair designs (where the mother smokes in one 
pregnancy, and not in the other) [62], children of twins [63], and 
a design based on children born by assisted production [64], sug- 
gest that while the links between maternal smoking in pregnancy 
and offspring low birthweight are ‘environmentally mediated; as one 
would expect, for ADHD, the association appears to be entirely, or 
mainly, explained by shared genetic liability in mother and child 
(manifest as smoking in pregnancy and offspring ADHD) [64a]. 
Clearly, smoking cessation is advisable for offspring and maternal 
health, but these findings suggest that this policy is unlikely to re- 
duce the numbers affected by ADHD. 

Mother-child hostility is another apparent environmental risk 
factor for ADHD. However, a number of genetically informative 
[65, 66] and treatment designs [67] suggest that it again is a conse- 
quence of ADHD symptoms and child genetic liability. Nevertheless, 
parent-child hostility has environmentally mediated links with con- 
duct disorder [68] so might be important in terms of modifying 
ADHD outcomes. In contrast to these findings, very extreme early 
privation, as experienced by a group of children initially reared in 
Romanian orphanages and then adopted away, is an exposure that 
impacts on ADHD-like symptoms [69]. The links cannot be ex- 
plained by shared genetic liabilities because the environmental ex- 
posure was imposed, rather than selected or evoked. 

Gene-environment interaction provides another example of how 
genes and the environment work together. This has been demon- 
strated most clearly in animal models where the observable manifest- 
ation (phenotype) of a gene depends on the environmental context 
(for example, diet, exposure to a pathogen). Interaction extends to 
the molecular level; DNA codes for proteins, and there are multiple 
complex molecular mechanisms, including epigenetic ones, that 


Individuals seek environments that are compatible 
with, and reinforce, their genetic propensities 


An individual's genetic tendencies elicit reactions from 
others which reinforce these tendencies 


An individual inherits genotypes from their parents 
who also provide the rearing environment 


A disorganized parent may provide a more chaotic home 
environment, which reinforces the child's own disorganized 
tendencies 


An impulsive child may seek out more risky situations or 
choose friends who engage in risky behaviour 


An irritable child will elicit more negative interactions from 
those around them 


modify the biological function of the same DNA sequence. Gene- 
environment interaction in humans is much more challenging to 
assess robustly at a molecular level. Traditional behavioural genetic 
designs that involve separation of G and E and capture all genetic li- 
ability via design (for example, children adopted away at birth) have 
shown repeatedly that the clinical manifestations of genetic liability 
(for example, for criminal behaviour, schizophrenia) are modi- 
fied by the rearing environment and/or adoptive context (for ex- 
ample, [70, 71]). However, ADHD has not been studied in this way. 
Although there are published reports of interaction between specific 
candidate gene variants and environmental exposures (for example, 
[72, 73]), convincing findings on ADHD have yet to emerge. Similar 
methodological issues apply to the concept of ‘differential suscepti- 
bility’ whereby some individuals are more susceptible to the bene- 
fits of positive enriched environments, as well as more vulnerable to 
environmental risks. 

Another example of gene-environment interaction comes from 
pharmacogenetics when the effects of medication (the environ- 
mental exposure) vary according to genotype, for example, slow 
and ultra-rapid metabolizers of drugs metabolized by the enzyme 
CYP2D6 [74]. So far, there have been no consistent and robust 
pharmacogenenetic findings in relation to ADHD medications that 
would have clinical practice implications. Despite the research chal- 
lenges, it remains important at a conceptual level to appreciate the 
interplay of genes and the environment in understanding the aeti- 
ology of ADHD. 


Phenotype heterogeneity and ADHD 


As described in detail in Chapter 31, ADHD is a highly heter- 
ogenous clinical disorder, with individuals frequently presenting 
with comorbid ASD, conduct disorder, anxiety, or depression. 

Heterogeneity is very likely to be influenced by the underlying 
genetic architecture of the disorder. For example, the degree of 
genetic risk loading appears to vary for different subgroups of in- 
dividuals with ADHD. Family and twin studies of ADHD suggest 
that individuals with comorbid conduct disorder have an increased 
genetic load, in comparison to those with ADHD alone [75, 76]. 
The relative risk (RR) of ADHD in biological relatives of those with 
ADHD plus conduct disorder is almost twice as high (RR 9.5) as 
that among first-degree relatives of individuals with ADHD alone 
(RR 5.4) [75]. At a molecular level, ADHD polygenic risk scores 
have also been observed to be higher in those individuals who have 
ADHD with comorbid conduct disorder, when compared to those 
with ADHD alone [57]. 
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Previous candidate gene studies have suggested that there may 
be additional genetic variants which are specifically related to the 
subgroup of individuals with ADHD and conduct disorder, rather 
than all individuals with ADHD (for example, associations with the 
COMT val158met variant which have been identified in numerous 
replicated studies and a pooled analysis) (for example, [77, 78-80]). 
Time will tell whether or not this pattern of different genetic risks 
contributing to ADHD subtypes (and these specific associations) 
is confirmed, but such studies highlight the potential association of 
clinical heterogeneity to the genetic architecture of ADHD and the 
need to take this into account in the future. 

While initially considered to be a childhood disorder, it is now 
recognized that ADHD persists into later adolescence and adult- 
hood in a substantial number of individuals [81, 82]. Twin studies 
suggest that heritability estimates for ADHD are relatively stable 
throughout the lifespan (for example, [17-19]). Indeed, longitu- 
dinal studies following individuals throughout childhood to early 
adulthood suggest that heritability rates for ADHD remain stable, 
although the influence of specific genetic factors may change over 
time [83]. This change in the specific genetic factors potentially rele- 
vant at different ages may introduce a further element of heterogen- 
eity in studies attempting to identify specific ADHD loci and may 
need to be considered. This is especially because, as larger samples 
are sought for GWAS, samples of children and adults with ADHD 
may be pooled. However, family and twin studies suggest that per- 
sistence of ADHD into adult life is more strongly familial [84] and 
highly heritable than childhood ADHD alone. To date, genome- 
wide gene discovery studies of adult ADHD have not yielded unique 
findings. Recent reports have suggested a further source of hetero- 
geneity in adult ADHD; three population cohorts have shown that 
new cases of ADHD can present for the first time in late adolescence 
or adulthood [85-87]; future studies may consider the extent to 
which genetic factors are relevant to this group and how this may be 
the same or different to those for childhood-onset ADHD. ‘The first 
of these studies [86] found that ADHD polygenic risk score did not 
significantly predict this later-onset group, although the sample size 
was small and an older discovery GWAS was used to generate risk 
alleles. Replication will be required. 


Shared genetic risks across disorders 





Given that ADHD is known frequently to co-occur with other dis- 
orders, there has been interest in whether or not there is an overlap 
between the genetic risk for different psychiatric disorders. Twin 
studies suggest that there is considerable overlap of ADHD gen- 
etic risk factors with those that contribute to ASD [9], conduct dis- 
order [76], and other neurodevelopmental disorders [88], although 
the extent of this overlap may vary somewhat, depending upon 
whether age or hyperactive/impulsive or inattentive symptoms are 
considered [9]. Although there are some shared genetic effects, twin 
studies also demonstrate genetic risks that are unique to each dis- 
order. ADHD polygenic risk score studies have shown some overlap 
with other disorders (schizophrenia, bipolar disorder, and depres- 
sion [54, 57]) and with other neurodevelopmental traits, including 
cognitive and language ability in the general population [89]. 

The large PGC Cross Disorder Group investigates the overlap 
between eight psychiatric disorders: ADHD, ASD, schizophrenia, 


bipolar disorder, major depressive disorder, anorexia nervosa, ob- 
sessional compulsive disorder, and Tourette syndrome. Using GWAS 
data, this group has found four genome-wide significant genetic loci 
appearing to be in common across disorders [54]. Interestingly, 
ADHD genetic overlap was strongest between ADHD and major 
depressive disorder [90, 90a]. 





Summary and clinical implications 


While it has been known for some time that ADHD is highly herit- 
able, progress to date in identifying specific gene variants has been 
slow until recently. However, now is an exciting time. The findings 
of the first twelve genome-wide significant loci from the largest 
GWAS study to date give a glimpse of the specific common genetic 
risk variants that might be involved in the disorder and provide fu- 
ture insights into biology. Genetic findings confirm what clinicians 
have long supposed—that ADHD lies at the end ofa population con- 
tinuum. However, individual genetic loci have very small effect sizes, 
and at present, even composite measures cannot be used to predict 
who has or will get ADHD [90b]. 

CNV studies have demonstrated the relevance of rare vari- 
ants to ADHD that have much greater effect size than individual 
common variants. These CNV findings have further highlighted the 
neurodevelopmental origins of ADHD and overlap between different 
neurodevelopmental traits as well as disorders. Overall genetics re- 
search findings are strongly supportive of the DSM-5 approach 
whereby ADHD is classified under the group of neurodevelopmental 
disorders, along with autism. Major efforts will be required to move 
research on ADHD genetic discoveries into clinically meaningful 
findings. This includes investigating the biological impacts of gene 
mutations, especially rare, highly penetrant ones. This can be achieved 
using a variety of approaches, including animal studies and cellular 
models (see Chapter 32). Other research will involve assessing the im- 
pact of gene variants (composite measures of common gene variants 
or rare, highly penetrant mutations) on brain/cognitive function (for 
example, [91]) and population health (for example, [92]). 

Finally, genetic testing will become an issue. At present, although 
rare mutations are associated with ADHD, routine genetic testing 
for CNVs in those with ADHD alone is not indicated in any country, 
unless the affected individual also has intellectual disability. While 
genetic understanding cannot currently influence clinical practice, 
it is likely that as rare genetic subforms of ADHD are identified, 
policies for genetic investigation in practice might alter. Increased 
understanding will enable researchers to investigate the pathogen- 
esis of ADHD in more detail, with positive benefits for the treatment 
of ADHD. 
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sights from neuroanatomical imaging 


into attention-deficit/hyperactivity 
disorder throughout the lifespan’ 


Philip Shaw and Eszter Szekely 





Early models of attention-deficit/hyperactivity disorder (ADHD) 
sometimes viewed the disorder as a result of ‘minimal’ damage to 
brain structure and function [1]. This was partly based on a com- 
parison with the effects of lesions of certain brain structures. For ex- 
ample, damage to the prefrontal cortex (PFC) is linked to deficits in 
motor and impulse control [2, 3]. The advent of magnetic resonance 
imaging (MRI), a non- invasive imaging tool that is free of ionizing 
radiation, accelerated the study of brain-behaviour relationships. It 
provided direct access into the developing brain. Here we review the 
literature, exploring the insights into ADHD throughout the life- 
span provided by neuroanatomical imaging. 

To outline this chapter, firstly, models of ADHD are briefly re- 
viewed to aid in the interpretation of the neuroanatomical find- 
ings. Secondly, the literature on structural brain differences found 
in ADHD is summarized, adopting a developmental perspective, 
as childhood, adolescent, and adult ADHD are considered. Finally, 
looking to the future, we consider possible clinical applications of 
this work and likely research directions. 























There is a near consensus that there are multiple neuropsycho- 
logical pathways to ADHD [4-7]. These are discussed in greater 
depth elsewhere (see Chapters 32 and 33). Here, we sketch these 
models to aid with the interpretation of the major neuroanatomical 
findings. The models are illustrated in Fig. 35.1. Firstly, ADHD has 
long been associated with anomalies in the fronto-striato-thalamic 





1 This chapter is reproduced froma previously published book The Oxford Textbook 
of Attention Deficit Hyperactivity Disorder, edited by Tobias Banaschewski, David 
Coghill, and Alessandro Zuddas (May 2018), pp. 73-81, Copyright © 2018 
Oxford University Press. 





circuitry, that is, the rich, sometimes bidirectional, interconnec- 
tions between the prefrontal/parietal cortices, basal ganglia, and 
thalamic nuclei [8]. These circuits mediate a host of cognitive 
functions that are disrupted in ADHD such as working memory, 
response inhibition, and sustained attention. Secondly, children 
with ADHD also often show deficits in processing information 
about the temporal structure of the environment, making errors 
in time estimation, perception, and temporal foresight [9, 10]. In 
turn, this implicates the corticocerebellar circuitry spanning the 
cerebellum and its projections to the PFC, cingulate, and some 
parietal regions. 

Thirdly, an aversion to delayed reinforcement, expressed as 
atypical processing of rewards, has been found in ADHD [6, 11]. 
This feature, along with anomalies in brain activation during an- 
ticipation of rewards, has spurred interest in connections between 
the ventral striatum and the ventromedial cortex/orbitofrontal 
cortex (OFC) in ADHD [12]. Fourthly, emotion dysregulation 
is increasingly recognized as a common, if not core, problem in 
ADHD [13]. It implicates the paralimbic circuitry, particularly 
interactions between the amygdala, medial PFC, and OFC. Finally, 
it has been argued that children with ADHD are not so much con- 
sistently worse at certain cognitive functions as they are more 
variable in performance, perhaps due to anomalies in vigilance 
or motivation [14-16]. This model has been linked most closely 
with atypical neurophysiology, specifically anomalies in the pat- 
terns of synchronized brain activity that arise during periods free 
from task demands [17]. However, there are also some possible 
neuroanatomical correlates, with increased within-subject vari- 
ability associated with decreased white matter and prefrontal grey 
matter anomalies. 

Most cognitive models of ADHD derive from research in chil- 
dren. However, it is increasingly clear that we need to consider the 
neural and cognitive anomalies that underpin ADHD throughout 
the lifespan. This is because, while some children ‘grow out’ of 
ADHD, many do not. Recent prospective studies find that 20-45% 
of children with ADHD will have the full syndrome into adulthood 
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Fig. 35.1 A sketch of some current neurocognitive models of ADHD. m, medial; OFC, orbitofrontal cortex; PFC, prefrontal cortex; vm, ventromedial. 


and a further 25-48% have impairing symptoms [18, 19] (see also 
Chapter 34). 


A review of the neuroanatomy of ADHD 


throughout the lifespan 





This is a selective, but comprehensive, review of the structural 
neuroimaging literature of ADHD, highlighting some landmark 
studies and key concepts. Systematic reviews and meta-analyses are 
referenced throughout, where appropriate. We first review child- 
hood ADHD, summarizing findings for the three main compart- 
ments of the brain: deep brain structures, the cerebral cortex, and 
the cerebellum. We then review the literature on adolescent and 
adult ADHD. 


Childhood ADHD: the basal ganglia and other 
deep structures 


The basal ganglia are perhaps the most widely studied brain struc- 
tures in ADHD. This reflects their status as pivotal components in 
nearly all of the circuitry implicated in the disorder. Neuroimaging 
studies in the 1990s focused mainly on the caudate, often using 
manual tracing to chart volumetric alterations. The findings were 
mixed, with reports of increase [20], decrease [21, 22], and no 
difference [23]. The next wave of imaging studies has employed 
methods unconstrained by a priori regions of interest. These in- 
clude voxel- based morphometry, which allows change in grey 
matter volume and density to be measured at thousands of points 
(or voxels) throughout the entire brain. In a landmark quantitative 
meta-analysis, Nakao and colleagues found that 14 such studies con- 
verged to find reduction in the dimensions of the right putamen and 
globus pallidus extending to the head of the caudate [24]. This ef- 
fect was more pronounced in childhood and could not be attributed 


to psychostimulant medication (which attenuated diagnostic group 
differences). The findings lend support to the ‘classic’ models of 
ADHD as stemming from anomalies in the fronto-striatal circuitry 
which mediates many executive functions. 

Conventional MRI, at the currently used field strength of 1.5 and 
3 T, has great difficulty in resolving the individual subnuclei of deep 
brain structures such as the striatum and thalamus. Thus, localized 
compromise of subnuclei could be missed. One strategy to circum- 
vent this limitation is to examine the surface morphology of these 
deep structures. This is defined by the degree to which an individual’s 
basal ganglia (or thalamic) surface contours have to be ‘deformed’ 
(stretched or shrunk) to fit a template (25, 26]. Surface expansions 
are generally interpreted as reflecting volume increase in under- 
lying nuclei, and surface contractions as indicative of volume loss. 
The identity of the underlying nuclei is inferred through reference to 
surface maps of the structures, created by expert neuroanatomists. 

One of the first studies to use this approach found that smaller 
basal ganglia volumes in boys with ADHD were driven by compres- 
sion bilaterally in the head of the caudate and the anterior putamen, 
with relative sparing of more posterior regions [27]. A somewhat 
similar pattern was reported by a separate group [28]. 

The basal ganglia are richly interconnected with the thalamus, and 
here too localized surface anomalies have been delineated using this 
surface mapping technique [29]. Specifically, marked surface con- 
traction in the region of the pulvinar nuclei bilaterally was noted in a 
group of 105 children with ADHD and controls. The pulvinar nuclei 
link action and vision, and the pulvinar’s lateral portions, where the 
morphological anomalies were most prominent, support circuitry 
that detects salient somatosensory stimuli. Such morphological dis- 
ruption of the pulvinar could contribute to the inefficient allocation 
of attentional resources seen in ADHD [30]. Notably, these complex 
surface changes and associated underlying volumetric perturbations 
did not lead to a change in the overall volume of the thalamus in 
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youths with ADHD, relative to typically developing youths. Thus, 
a traditional region-of-interest study examining the volume of the 
thalamus would have missed these subtle, but important, diagnostic 
signals. 

Others have looked beyond striatothalamic regions and delin- 
eated similarly localized changes that can inform cognitive models 
of ADHD. One study of 114 children delineated components of the 
limbic system, specifically the amygdala and the interconnected 
OFC [31]. It detected anomalies in amygdala shape— but not 
volume—and a disruption of the typical correlation between the 
amygdala and orbitofrontal volumes, suggesting a breakdown in the 
connectivity between components of the limbic system. The limbic 
system underlies affective processes such as emotion perception and 
emotion regulation. Thus, these findings lend support to the emo- 
tion regulation difficulties often seen in the disorder. 

The basal ganglia develop throughout childhood [32]. This raises 
the question of whether the growth trajectories of the basal ganglia 
are altered in childhood ADHD. While cross-sectional data can be 
used to define trajectories, longitudinal data are better suited for 
capturing dynamic processes. In part, this is because longitudinal 
data focus on within-subject variation over time (which is of central 
interest) and remove much between-subject variance (which can 
confound). Indeed, theoretically, the only situation in which indi- 
vidual trajectories can be correctly inferred from cross-sectionally 
derived curves of mean values is when all entrances or exits from 
the study are random and the trajectories of the ADHD and con- 
trol groups are parallel [33]. Neither requirement is likely to be met 
in ADHD. 

One study used a mix of longitudinal and cross-sectional data to 
chart the development of the basal ganglia, comparing 270 children 
with ADHD against 270 well-matched typically developing controls 
[34]. Age-related change was mapped using surface-based morpho- 
logical methods. The study confirmed prior findings of a general 
increase in surface area of the basal ganglia throughout childhood 
into mid adolescence. Children with ADHD similarly showed an in- 
crease in surface area but started from a ‘lower’ point, as they had 
a decreased surface area at study entry in the head of the caudate 
and the body of the putamen. Only one region showed a diagnostic 
difference in trajectories—the ventral striatum. Here, the typically 
developing group showed surface area expansion with age, whereas 
the ADHD group showed progressive contraction. In cognitive 
terms, the atypical development of the ventral striatum supports 
cognitive models implicating anomalous reward processing as a key 
contributor to childhood ADHD. 


Childhood ADHD and the cerebral cortex 


Different areas of the cerebral cortex have also been implicated in 
most neurocognitive models of ADHD, partly through their rich 
interactions with striatal, thalamic, and cerebellar regions. An early 
landmark study of 152 children with ADHD and 139 well- matched 
typically developing controls established that there was a 4-6% re- 
duction in the volume of the cerebral cortex, affecting all the major 
lobes [35]. Advances in analytical techniques provide more precise 
localization of change, allowing a closer alignment of neuroana- 
tomical findings with cognitive models. Many studies have used 
methods that extract the cortical mantle and then measure the thick- 
ness, surface area, and curvature at thousands of points across the 
cerebrum [36, 37]. In childhood ADHD, ten studies have converged 


to find a predominantly thinner cortex with reduced surface area, 
albeit with less agreement on the precise location of change [38-47]. 
Thus, widespread reduction of cortical dimensions in children with 
ADHD have been reported in the PFC, superior parietal cortex, and 
medial and anterior temporal regions. One study has reported in- 
creased dimensions confined to the primary somatosensory cortex 
[48]. Direct links between this compromised cortical substrate and 
cognitive deficits have been found. Thus, one study reported that at a 
group (but not individual) level, deficits in inhibitory control found 
in ADHD were accompanied by a thinner inferior frontal cortex 
[44]. A separate study using voxel-based morphometry found that 
age-related deficits in response inhibition in children with ADHD 
improved in tandem with increasing volumes in a network com- 
prising the anterior cingulate cortex, striatum, and medial temporal 
lobes [49]. 

The cerebral cortex also develops during childhood, again 
prompting us to ask if trajectories differ in ADHD. Using data from 
over 500 children, many with repeated observations, the velocity 
of cortical development of both cortical thickness and surface area 
was defined [42, 50]. The key finding was a slower rate of cortical 
change in childhood ADHD. This altered velocity entailed a ‘delay’ 
in some prefrontal regions, as defined by the age at which each cor- 
tical vertex attained its peak dimensions. More recent work suggests 
that these differential trajectories may be moderated by intelligence, 
with slower rates of cortical development being more prominent in 
ADHD children with lower intelligence [51]. 


Childhood ADHD and the cerebellum 


Although the cerebellum has traditionally been considered a site 
of motor control, lesion studies and functional imaging studies 
in healthy subjects have demonstrated a wide range of cognitive 
and affective functions in cerebellar structures, particularly tem- 
poral information processing (but also attention shifting, verbal 
working memory, and emotional regulation [52, 53]). Dysfunction 
in each of these cognitive domains has, in turn, been implicated in 
the aetiology of ADHD. Anatomically, there are reciprocal loops 
that interconnect cerebral cortical areas, including the PFC, with 
the cerebellum by way of the pons, dentate nucleus, and thalamus. 
Structural anomalies of the cerebellum are among the most consist- 
ently reported features of ADHD [54-57]. A recent meta-analysis 
found that the regions most reduced in ADHD—lobule IX and 
VIIIB, Crus 1—were entirely distinct from those compromised 
among children with autism or developmental dyslexia [58]. These 
ADHD-related cerebellar regions interact richly with prefrontal 
and striatal regions to form networks involved not only in tem- 
poral information processing, but also in the effortful allocation of 
attention. 

Dividing a literature overview into sections on different brain re- 
gions should not be taken to diminish the importance of discerning 
the coherent patterns of change that occur across the entire brain 
[5]. Large-scale brain networks in ADHD have largely (and appro- 
priately) been mainly delineated using imaging modalities that more 
directly measure structural (for example, diffusion tensor imaging) 
and functional connections (for example, resting state fMRI) [59]. 
Nonetheless, the ways in which different brain structures co-vary in 
their dimensions can provide a more indirect index of anatomical 
networks [60]. One study adopting this approach found anomalous 
anatomical network structures in children with ADHD, centred on 
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the OFC and insula [61]. Future work will usefully integrate such 
approaches with other modalities of imaging. 


Adolescent and adult ADHD 


Most studies have focused on childhood or adult ADHD, with a rela- 
tive neglect of adolescence. This gap has been filled by a recent large 
study of 307 adolescents with ADHD, 169 of their unaffected sib- 
lings, and 196 typically developing controls [62]. Two major neuro- 
anatomical findings have emerged. Firstly, during adolescence, 
there is a fixed, non-progressive reduction in total brain volume in 
ADHD, echoing earlier childhood studies [63]. By contrast, there 
was progressive volume loss in adolescence in the right caudate and 
putamen (unlike childhood studies). Many of these changes were 
present in an attenuated form in the unaffected siblings, suggesting 
that they represent shared familial, perhaps genetic, risk factors. 
Cortical dimensions were also examined, and the ADHD group 
showed global thinning, compared to controls, with the medial 
temporal cortex being particularly compromised [45]. This effect 
was found throughout the adolescent to the adult age range and 
held even when psychostimulant medication was considered. This 
finding might suggest a developmental shift from the fronto-parietal 
anomalies that are prominent in early childhood ADHD towards 
more medial temporal/limbic anomalies in adolescents and adults. 

In addition, the medial temporal anomalies are consonant with 
the prominent emotional dysregulation seen in young adults with 
ADHD (although direct evidence of a link was not found). 

Studies into adults with ADHD have addressed two central issues. 
Firstly, are the neuroanatomical changes seen in childhood ADHD 
fixed and thus carried forward into adulthood, or do novel neuro- 
anatomical changes emerge in adult ADHD? Secondly, is the vari- 
able clinical course of ADHD reflected in neuroanatomical change? 

Most adult studies have focused on cortical structure and reported 
decreased volumes of predominantly prefrontal regions (dorso- 
lateral, anterior cingulate, and OFC) [64, 65]. Cortical thinning in 
adult ADHD has also been found in the right inferior prefrontal, 
cingulate, and medial frontal cortices, in keeping with childhood 
findings [66, 67]. This would suggest that many of the anomalies of 
childhood are carried forward to adulthood. However, a more com- 
plex picture emerges, particularly when we consider the highly vari- 
able clinical course of ADHD. 

One study clinically followed a large group of males with child- 
hood ADHD into late adulthood (mean age 41 years) [68]. By this 
stage, 17 individuals had ADHD persisting into adulthood and 
26 had remitted ADHD; contrasts were drawn against 57 never- 
affected controls. Neuroimaging was added at the adult assessment 
to allow an examination of the cerebral cortical structure and grey 
matter density. The study found a globally thinner cortex in both 
the persistent and remitted ADHD groups, compared to controls, 
particularly in the parietal, temporal, and frontopolar regions and 
the precuneus. The ADHD outcome groups mostly did not differ 
significantly from one another, compatible with fixed ‘trait’ deficits 
possibly carried forward from childhood. However, some changes 
did reflect outcome. The right middle frontal and limbic cortices 
were thinner, and the right cerebellum and thalamus had lower 
grey matter density in the persistent, compared to the remitted and 
never-affected, groups. 

A separate study of clinical course used a fully longitudinal de- 
sign and acquired neuroanatomical data in tandem with clinical 


assessments, both in childhood (mean age 10 years) and in early 
adulthood (mean age 24 years) [69]. Such longitudinal data pro- 
vided the ideal tool for delineating the neuroanatomical trajectories 
that are tied to the variable clinical course. By adulthood, 55 of the 
92 members of the ADHD group had remitted and the remainder 
still met diagnostic criteria. Contrasts were made against 184 never- 
affected controls. Regions were identified where change in cortical 
structure—specifically cortical thickness—was tied to this vari- 
able outcome. A link emerged in the cingulate cortex bilaterally, 
the right inferior parietal cortex and dorsolateral PFC, and the left 
sensorimotor region. Here, increasing severity of adult inattention, 
but not hyperactivity-impulsivity, was linked with higher rates of 
cortical thinning. The net effect of these differences was that those 
whose ADHD symptoms improved showed a significant conver- 
gence during adolescence towards typical cortical dimensions, rec- 
tifying early anomalies. By contrast, those with persisting symptoms 
had childhood cortical anomalies that persisted into adulthood. 
Importantly, these outcome-related differences in trajectories held 
when psychostimulant treatment history was entered as a covariate. 
These regions where the trajectory was tied to outcome span several 
of the brain networks most closely linked with ADHD. For example, 
outcome was linked with trajectories of the fronto-parietal cortical 
regions that guide much goal-directed behaviour through cortico- 
striato-thalamic circuits. 

Do the trajectories of other brain regions also mirror the clinical 
course? Such a link was found for the developmental trajectories of 
some, but not all, cerebellar regions [70]. Studying 72 participants 
from the study just discussed, the trajectories of cerebellar develop- 
ment were defined from at least three scans. Outcome was assessed in 
adolescence. Some cerebellar trajectories were sensitive to outcome. 
There was atypical progressive volume loss in the inferior—posterior 
cerebellar lobes in those with worse clinical outcome, and a conver- 
gence to typical dimensions in the anterior hemisphere in those who 
improved. Not all cerebellar change reflected outcome—a reduced 
volume of the superior vermis persisted, regardless of outcome. 

In summary, there is regional heterogeneity in the neuroanatom- 
ical trajectories of variable adult outcome. While different prefrontal 
cortical and cerebellar hemispheric development may be tied to 
different clinical outcome, more posterior cortical regions and the 
cerebellar vermis may show more fixed, trait-like deficits. 

To summarize this section, meta-analyses and some of the larger 
individual studies converge to find anomalies in the basal ganglia in 
ADHD, some of which appear developmentally fixed while others 
may be progressive. Compromise of the cerebral cortex is commonly 
found, albeit with less consensus on the exact regions affected. 
Again, some of these anomalies may vary with age in ADHD and 
may be tied to the clinical course of the disorder. Finally, while the 
morphology of the cerebellum is relatively understudied, it may also 
show a mix of fixed and dynamic anomalies, some of which show 
diagnostic specificity. 





Towards future clinical application 


Clinicians are often interested in two questions. Firstly, can 
neuroimaging aid with the diagnosis or prognosis of ADHD? 
Secondly, what are the effects, if any, of psychostimulants given as 
treatment for ADHD on brain structure? 
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At present, neuroimaging remains a research tool and the diag- 
nosis of ADHD remains a clinical skill. Neuroimaging can define 
robust differences at the group level and provides rich insights into 
neurobiological mechanisms. However, to realize clinical utility, the 
next wave of studies will need to be able to detect changes at the in- 
dividual level with high sensitivity and specificity. It seems unlikely 
that ‘one-shot, one-modality imaging will suffice for this purpose, 
and multimodal imaging may be required. Biomarkers are not only 
needed to inform diagnosis, but also prognosis. We currently cannot 
accurately predict an individual child’s likely outcome using clin- 
ical, sociodemographic, and neuropsychological data. For example, 
while baseline symptom severity and multiple comorbidities are sig- 
nificantly associated with long-term course, one study found these 
factors combined account for only 15-20% of the variance in out- 
come [71-73]. There are mixed findings on whether or not socio- 
economic status, general intelligence, family environment, and 
cognitive skills are associated with adult outcome [71, 73-79]. In this 
context, future studies will increasingly turn to objective measures 
of the brain and the genome to inform diagnosis and prognosis. 

Psychostimulants appear to normalize brain activity in ADHD 
during the performance of tasks and while the subjectis at rest [80,81]. 
What about neuroanatomy? Nearly all studies found that either 
there is no clear association between anatomy and psychostimulant 
treatment or treatment is associated with more normative dimen- 
sions (reviewed in [82]). One report found psychostimulant treat- 
ment to be associated with a smaller hippocampus, although this 
was not found in a larger study [63, 83]. However, it is critical to 
remember that all of the neuroanatomical studies discussed here are 
observational and causality cannot be inferred. For example, other 
factors, such as socio-economic status, access to care, or comorbid- 
ities, could influence the likelihood of receiving psychostimulant 
medication and brain structure. Statements about causality await 
imaging within the context of randomized medical trials. 


Future research directions 





What does the future hold for neuroanatomical imaging in ADHD? 
Four major shifts are already discernible: (1) a shift from diagnostic 
categories towards considering underlying dimensions; (2) a move 
to ‘big data’ partly attained through collaborative efforts; (3) the rise 
of novel imaging modalities and their integration; and (4) the use of 
brain structure as a phenotype in genetic studies into ADHD and 
related disorders. 


The shift from categorical approaches to dimensionality 


The National Institute of Mental Health has recently proposed a shift 
in research focus away from the use of DSM-5 diagnostic categories 
towards objectively defined constructs that are continuously distrib- 
uted throughout the population [84]. In line with this concept, epi- 
demiological and neuropsychological evidence suggests that ADHD 
can be considered dimensionally, lying at the extreme end of a con- 
tinuous distribution of symptoms and underlying cognitive processes 
[85, 86]. Thus, it becomes pressing to ask if such dimensionality is also 
present in the brain changes associated with the categorical disorders. 

To date, studies have only asked if the symptom dimensions of 
ADHD are tied to neuroanatomical change. Three studies have 
found such links. The first built on the earlier finding of a slower 


rate of cortical change in those with a diagnosis of ADHD. It found 
that this slower rate of cortical thinning during late childhood and 
adolescence was also associated with the severity of symptoms of 
hyperactivity and impulsivity in typically developing children [87]. 
As symptoms of hyperactivity-impulsivity increased in 193 typical 
children (with 389 neuroanatomical scans), the rate of cortical thin- 
ning decreased, approaching that seen in a group of 197 children 
with the clinical syndrome. Thus, similar, but attenuated, disrup- 
tions to trajectories are found in children who exhibit behavioural 
problems characteristic of ADHD, even when these do not pass the 
threshold for making a diagnosis. This finding was replicated in a 
separate cohort of 357 typically developing children among whom 
higher-attention problems were associated with a thinner cortex at 
baseline and slower cortical thinning with ageing in multiple areas 
involved in attention processes [88]. Finally, a population-based 
imaging study of 444 children found that greater problems with 
hyperactivity and impulsivity were associated with a thinner post- 
central cortex bilaterally [89]. The next stage is to define how cog- 
nitive domains pertinent to ADHD and related diagnostic entities 
might also impact on the brain and its development. 


Big data 


Several meta-analytic studies have already demonstrated the power 
of quantitative summaries of several smaller studies. Such meta- 
analyses are being facilitated by international collaborative efforts 
such as the Enhancing Neuroimaging in Genetics through Meta- 
Analyses initiative (http://enigma.ini.usc.edu). This collaboration 
has already provided initial reports on subcortical compromise in 
ADHD in over 3200 participants [90]. Such meta-analyses are not 
without their limitations. Foremost is the integration of data ac- 
quired using different scanners, with different sequences, and with 
often slight, but important, variations in image processing. This in- 
cludes consensus protocols on how to assess the quality of images 
for motion and other imaging artefacts. The requisite methods are 
being developed to allow the integration of diverse data collected on 
different platforms. 


Novel imaging modalities and their integration 


We are already seeing a shift towards imaging at field strengths (7 
T and above) that can distinguish, for the first time, the layers of 
the cortex, the subfields of the hippocampus, and the subnuclei of 
deep structures. These data will quickly surpass existing data, ex- 
tending and refining our current conclusions. Integration of these 
multimodal images is under way, often using multivariate methods 
that detect patterns or ‘motifs’ across multiple data sets that charac- 
terize ADHD [91]. 


Brain structure and genomics 


While ADHD is highly heritable (h, >0.7), there has been limited 
progress in understanding the mode of action of susceptibility genes 
[92, 93] (see Chapter 35). It has been argued that the phenotype 
for genetic studies should not always be the diagnostic category 
of ADHD per se but should also include quantitative brain-based 
traits that are pertinent to the disorder [94, 95]. Such quantitative 
traits have the advantage of containing more information about 
inter-individual trait variability than a dichotomous diagnostic cat- 
egory. The brain also lies closer to gene action and may be genetically 
less complex than the more distal clinical phenotypes of outcome. 
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Finally, progress in genetics might also be aided by parsing ADHD 
according to its highly variable clinical outcome. Developmental 
phenotypes that capture the dynamic nature of ADHD might better 
reflect its biology and provide better targets for genetic under- 
standing. This developmental perspective is particularly important, 
given that a longitudinal study of 8395 twin pairs found that over 
half of the genetic contributors to variance in clinical course were 
distinct to those genetic factors driving the onset of ADHD [96]. 

What is the evidence that the brain alterations associated with 
ADHD are genetically determined? Comparing monozygotic and 
dizygotic twins is the ideal design for parsing the relative contribu- 
tion to aetiology of genetics and unique and shared environmental 
factors. While no study has yet adopted this design in ADHD per se, 
one group used a variant of this design [97]. It found that volumes 
of the PFC were likely under genetic control (through a contrast of 
concordantly affected against concordantly unaffected monozygotic 
twins), whereas striatal volumes were more likely under environ- 
mental control (differing within discordant twin pairs). Several 
studies have also reported that some of the brain changes seen in 
children with ADHD, such as decreased prefrontal grey volume, 
are also found in their unaffected siblings in an attenuated form 
[63, 98]. Similarly, a study of 169 unaffected siblings found they had 
reduced volumes of total and cerebral grey, but not white, matter 
that lay between their siblings with ADHD and unaffected controls 
[63]. Interestingly, the finding of compromise in unaffected siblings 
does not extend to the cerebellum, one of the least heritable brain 
structures [99]. These unaffected sibling studies point to familial 
factors—either genetic or shared environment—as pivotal in deter- 
mining some, but not all, brain dimensions pertinent to ADHD. 

The next step is to identify the specific genes driving this herit- 
ability. To date, candidate genes implicated by a priori knowledge 
have been studied. Among structural studies, most have focused 
on common polymorphisms of the dopaminergic system. Risk- 
conferring variants of the dopamine transporter gene and the dopa- 
mine D4 receptor were associated, respectively, with decreased 
volume of the caudate nucleus and volumes and thickness of the 
prefrontal and parietal regions [100-102], with some studies finding 
this effect was confined to those with ADHD [102]. However, sev- 
eral studies do not report associations. A notable negative finding 
came from a population-based study of 1871 children, of whom 344 
had neuroimaging [103]. It tested for additive effects of genetic vari- 
ation in dopamine, serotonin, and neurite outgrowth pathways on 
both the severity of ADHD-related problems and the volume of the 
basal ganglia. While a modest association between ADHD symptom 
severity and putamen volume was found, none of the genetic path- 
ways were related to clinical or brain features. 

The future lies in studying the impact of common and rare genetic 
variants across the entire genome on the brain in ADHD. Given the 
large numbers that are needed for such studies (usually tens of thou- 
sands), it is likely that the primary phenotypes used for gene discovery 
will have to be relatively cheap and quick to ascertain, and entail min- 
imal inconvenience—such as online questionnaires and testing. The 
deep phenotyping provided by multimodal imaging may prove pro- 
hibitively expensive when focused on just one disorder. Thus, the 
neural phenotyping of ADHD might be most usefully employed to 
characterize the most promising signals to emerge from larger studies 
using more readily available clinical and behavioural phenotypes. 


Understanding the impact of DNA sequence variation on the 
brain in ADHD is only the first step. Interactions between the en- 
vironment and genes are critical to consider, not least as the high 
heritability estimates include not only the main effect of genes, but 
also gene-environment interactions. There are a handful of per- 
tinent studies. One of the most studied and controversial gene- 
environment interactions is between variants of the serotonin 
transporter gene (a variable number tandem repeat polymorphism 
in the promoter region) and early life adversity in producing vul- 
nerability to depression [104, 105]. A study of 701 youths found that 
this interaction between the serotonin transporter gene variants and 
chronic life stressors was also associated with the severity of ADHD 
symptoms [106]. Using mediation analyses, the study further found 
that decreased volume of the OFC and anterior cingulate cortex 
could be the mechanism underlying this interaction. 

The environment can also alter gene expression through epigen- 
etic modifications such as methylation of DNA sequences. A recent 
study found that high levels of methylation of the serotonin trans- 
porter gene in 102 children with ADHD were associated with more 
severe symptoms and decreased thickness of the right occipitotem- 
poral cortex [107]. Clearly, such preliminary findings require repli- 
cation, but the studies illustrate how we can take well-characterized 
gene-environment interactions into the brain in an effort to map out 
the pathways to ADHD. 


Conclusions 


Structural neuroimaging in ADHD is surprisingly young; large-scale 
research applications of MRI only started in the 1990s. Imaging has 
already moved beyond delineating morphometric change in entire 
lobes or deep structures, to consider how the disorder might be char- 
acterized by highly localized alterations in volume, shape, and surface 
complexity. The future holds challenges. Group-level differences can 
richly inform our models of ADHD, which could, in turn, impact on 
treatment approaches. However, direct clinical application of imaging 
to diagnosis or prognosis awaits the delineation of specific and sensi- 
tive alterations among individuals with ADHD. Chances of attaining 
this goal may be boosted by integrating neuroanatomical imaging 
with other imaging modalities and neuropsychological, clinical, and 
environmental data. Multimodal imaging will also prove vital if we 
are to define the neural mechanisms through which genetic variants 
confer risk for ADHD. While many challenges lie ahead, given the 
advances already made, it appears that the future of neuroanatomical 
imaging in ADHD is shaping up well. 
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As explained at length in Chapter 32, attention-deficit/hyperactivity 
disorder (ADHD) assessment involves the integration of informa- 
tion from different sources and an appropriate clinical decision- 
making process able to resolve conflicting observations and 
information. The full assessment includes information collected by 
validated instruments documenting present problems, health and 
developmental history, comorbid psychiatric conditions, and global 
functioning; intruments and procedures include questionnaires, 
interviews, and, when apropriate, direct observation and neuro- 
psychological testing. Assessment of cardiac risk factors needs to 
be recorded at assessment before prescribing medications and rou- 
tinely thereafter. At the end of the assessment process, the clinician 
will determine a principal and differential diagnosis and the pres- 
ence of any comorbidity, and will develop a formulation that places 
these diagnoses in context. 

ADHD treatment guidelines and algorithms have been developed 
in Europe [1-4] and North America [5-7], proposing evidence- 
based approaches for ADHD assessment and management. Model 
templates for planning services for ADHD assessment and interven- 
tion, such as the Dundee ADHD Clinical Care Pathway (DACCP) 
[8], have also been tested and are currently used in many countries; 
it includes specific stages for: (1) referral and pre-assessment; (2) as- 
sessment, diagnosis, and treatment planning; (3) initiating treat- 
ment; and (4) monitoring and continuing care. 


















































Treatment for ADHD is based on a multimodal approach com- 


bining behavioural and pharmacological treatment [2, 9]. 
European guidelines highly recommend a stepwise approach, with 
psychobehavioural interventions as first-line treatments, especially 
for pre-school children and subjects affected by a mild form of the 
disorder. American guidelines do not preclude pharmacotherapy as 
a first therapeutic approach, even in the youngest and mildest cases. 

If pharmacological treatment is prescribed first line, it should be 
combined with behavioural interventions in any case. However, it is 








largely accepted that pharmacological treatment for ADHD is highly 
effective, either alone or in combination with behavioural interven- 
tions, that it has greater benefits than behavioural intervention alone, 
and that outcomes are much improved when a structured approach 
to medication management is adopted [10]. There is not a unique 
strategy for management; individual circumstances, comorbidities, 
and medical history have to be considered as an integral part of the 
treatment plan [11]; after appropriate information, parents’ and pa- 
tients’ preference should always be taken into account. 

When concomitant medical or psychiatric conditions such as 
conduct disorders, specific learning disorder, and social difficulties 
are present, psychosocial, psychoeducational, and environmental 
interventions centred on the family and school are often required. 

The main objectives of any comprehensive treatment are: 


e To improve core ADHD symptoms. 

e To improve interpersonal relationships with parents, siblings, 
teachers, and peers. 

e To decrease disruptive behaviours. 

e To improve school learning abilities. 

e To increase personal autonomy and self-esteem. 

e To improve social acceptability of the disorders and the quality of 
life of children/adolescents suffering from the disorder. 














The most common non-pharmacological treatments are behav- 
ioural interventions both for children and carers [that is, parent 
training (PT), educator/teacher training, cognitive behavioural 
therapy (CBT), support groups, and social skills training], cognitive 
training, neurofeedback, and nutraceutics. The objective is to im- 
prove global functioning and reduce ADHD symptoms. 


Pa 





ent training 


PT programmes are psychoeducational interventions providing 
parents with information about ADHD and adequate strategies 
to target and monitor problematic behaviours, in order to better 
manage their children. The intervention is typically delivered to 


groups of parents and covers positive reinforcement skills, reward 
systems, the use of ‘time ouf, liaison with teachers, and planning 
ahead to anticipate problems. 

According to recent studies, PT has positive effects on the behav- 
iour of ADHD children by increasing parents’ behaviour manage- 
ment skills, as well as reducing stress and improving parents’ sense 
of efficacy [12]. 


Cognitive behavioural therapy 


CBT is a structured psychotherapeutic approach to help subjects in 
recognizing their dysfunctional patterns of thought and behaviour. 
It aims to help subjects to learn new skills and strategies to achieve 
their objectives, improve self-esteem, and deal with their emotions 
and social difficulties. 

Objective evidence for the efficacy of psychosocial interventions 
is problematic. A recent meta-analysis by the European ADHD 
Guideline Group (EAGG) showed good effect sizes when outcome 
measures were based on ADHD assessments by raters closest to the 
therapeutic setting. There was an overall standardized mean differ- 
ence (SMD) of 0.40 (95% CI 0.20, 0.60). However, when only ratings 
by assessors blind to treatment allocation were considered, treatment 
effects almost disappeared (SMD 0.02, 95% CI -0.30, 0.34) [13]. 

In contrast with the lack of sufficient evidence on ADHD core 
symptoms, a more recent meta-analysis showed a significant posi- 
tive effect of behavioural interventions on parenting skills and con- 
duct problems in ADHD children [14, 15]. 


Cognitive training 


In recent years, also cognitive training aimed at targeting specific 
deficits (for example, attention, working memory, inhibitory con- 
trol) has been investigated as potential ADHD treatment [16]. The 
effects of cognitive training on ADHD, as addressed in a recent 
meta-analysis performed by the same EAGG, were significant when 
calculated using unblinded ratings (SMD 0.64, 95% CI 0.33-0.95) 
but, as reported for psychoeducational interventions, became stat- 
istically non-significant (SMD 0.24, 95% CI 0.24-0.72) when meas- 
ures generated by probably blinded observers were used [17]. 

A more recent meta-analysis, including a larger number of trials, 
showed a more reliable estimate of the effects of cognitive training 
on specific functions (working memory, sustained attention, and 
inhibition); when measured by blinded raters, cognitive training 
mainly improved working memory, with limited effects on ADHD 
core symptoms [17]. 


Neurofeedback 


Neurofeedback is based on the training of self-regulation of brain 
activity; it has been considered a promising ADHD treatment by 
targeting aberrant patterns of brain activity putatively underpinning 
the disorder. The results of a recent meta-analysis on 13 controlled 
trials did not show neurofeedback to be an effective treatment for 
ADHD when evaluated by probably blinded raters [18]. A recent 
study in adults confirmed these results [19], although some more 
promising preliminary results have been reported with innovative 
techniques [20, 21]. 

Behavioural interventions have also been provided as useful 
treatments for managing comorbid disorders in ADHD subjects. 
Recently, a multi-centre RCT conducted in 159 adolescents, inves- 
tigating two new individual, short-term cognitive behavioural 


CHAPTER 36 Management and treatment of ADHD 


therapies (one including skills teaching, while the other was 
solution-focused), showed significant improvements in comorbid 
depression, anxiety, and disruptive disorder symptoms with both 
approaches [22]. Weaker evidence is available for the efficacy of 
behavioural sleep interventions in children with ADHD, although 
about 70% of ADHD subjects display mild to severe sleep prob- 
lems. Behavioural treatments for insomnia are clearly effective in 
children in general [23], and ADHD should be considered not only 
a day-time disorder, but also better as a 24-hour ongoing process 
including sleep difficulties. Given the well-known consequences of 
poor sleep on memory and cognition, behavioural approaches to 
improve sleep may represent an important resource needing more 
specific research [24]. Some clinicians already recommend initiating 
behavioural treatment for poor sleep early in management, since it 
is anon-contentious symptom, carrying none of the stigma of a psy- 
chiatric diagnosis. 


Nutraceutics 


There has been growing interest in the roles of the n-3 polyunsat- 
urated fatty acids (PUFAs) docosahexaenoic acid (DHA) and the 
precursor eicosapentaenoic acid (EPA) as potential treatments for 
ADHD symptoms. The longest-chain n-3 PUFA DHA is the most 
abundant PUFA in brain membrane phospholipids; it has an im- 
portant role in membrane fluidity and associated metabolic and 
neural activities. DHA appears to be particularly concentrated at 
synapses, influencing dopaminergic, serotonergic, noradrenergic, 
and gabaergic neurotransmission [25]. Recent meta-analyses [26- 
28] suggest that omega supplementation may improve ADHD 
symptoms to a modest degree, with an effect size of about a quarter 
as large as that seen for pharmacological treatment; whether sub- 
normal blood concentrations should be an indication for treatment 
is still not clearly established. 

In summary, evidence from meta-analyses investigating 
RCTs of non-pharmacological interventions indicate that non- 
pharmacological treatments should not be recommended as the 
only interventions for core ADHD symptoms. The wider adoption 
of such approaches requires better evidence reported using blinded 
assessments [17]. This does not mean that pharmacological therapy 
necessarily represents the first choice for all subjects; in some chil- 
dren, psychosocial interventions alone are associated with recovery 
[29]. However, consistent evidence across studies confirms the utility 
of adding appropriate pharmacological therapy to existing psycho- 
social therapy, while only small benefits emerge from the addition of 
current psychosocial therapy to an ongoing drug treatment. 


Pharmacological treatments 


Drug treatment for ADHD represents an intervention of spe- 
cific value and relevance to patients within multimodal treatment. 
Methylphenidate (MPH), dexamfetamine, or amphetamine (AMP), 
derivatives, and ‘noradrenergic’ medications atomoxetine (ATX) 
and guanfacine are the most effective psychopharmacological treat- 
ments for ADHD. As a class the first three are usually referred to as 
stimulants. As explained in Chapter 11, terms based on mechanism 
of action are much preferable. This is especially true in this case be- 
cause the objective of treatment in ADHD is almost the opposite of 
stimulation. 








SECTION 5 Attention-deficit/hyperactivity disorder 


Drugs for ADHD 


AMP and MPH have been proved to be the most effective drugs for 
ADHD; their use is well established and consistently recommended 
in evidence-based clinical guidelines across the world [1-9], with a 
response rate of around 70%, rising to 95% when non-responders 
are treated with a second drug [30]. It is a significant problem that 
amphetamine is a controlled substance as a result of the United 
Nations decree in 1971. In a number of countries, it is classed along 
with drugs like heroin, which has inevitably retarded its adoption for 
the treatment of children. 


Molecular mechanism of action of drugs for ADHD 


AMP and MPH enhance the efflux and function of noradrenaline 
(NA) and dopamine (DA) in the CNS, with a rapid onset of action; 
they act by blocking (or even reversing) reuptake in their respective 
monoamine transporters [31]. AMP also increases catecholamine 
release from synaptic vesicles [32]. The therapeutic effects on be- 
haviour and attention are presumed to be related to the enhanced 
neurotransmission of these catecholamines, especially in the pre- 
frontal cortex [33]. 

Racemic AMP (a-methylphenetylamine) contains equal amounts 
of d-(dextroamphetamine) and l-amphetamine isomers. In vitro, the 
affinity of AMP is higher for noradrenaline transporters (NETs) in 
the prefrontal cortex than for DA (in the striatum) [34]. 

MPH is significantly less potent than AMP at inhibiting ves- 
icular accumulation of DA or NA, but a similarly potent inhibitor 
of synaptic reuptake of DA and a slightly less potent inhibitor of NA 
reuptake. 

At therapeutic doses, DA and NA have a complementary effect on 
the firing rate of catecholamine neurons, which results in improved 
signalling, especially in the prefrontal cortex [35]. During cogni- 
tive tasks, MPH has been shown to increase cerebral blood flow in 
dorsolateral prefrontal and posterior parietal cortices in healthy 
controls [36] and in the prefrontal cortex in adults with ADHD 
[37], with a significant decrease in other regions. This suggests de- 
creased metabolic activation in task-irrelevant brain regions, with, 
in turn, more focused activation in task-relevant areas and improved 
performance [38]. 

More recently, drugs for ADHD have been shown to modulate 
functional connectivity; MPH normalizes activation and functional 
connectivity deficits in the brain networks supporting attention and 
motivation. This has been shown in medication-naive children with 
ADHD during a rewarded continuous performance task, together 
with normalization of fronto-cingulate under-activation during 
error processing [39]. In adolescents with ADHD, the drugs dem- 
onstrated effects on the functional connectivity of fronto-parietal 
networks, with beneficial effects on working memory performance 
[40]. During inhibitory tasks, children with ADHD exhibit a raised 
motivational threshold at which task-relevant stimuli become suffi- 
ciently salient to deactivate the default mode network (DMN); treat- 
ment with MPH normalizes this threshold [41]. 


Pharmacokinetics 


Absorption of AMP is rapid, with peak plasma levels about 3 hours 
after oral administration. AMP is metabolized through the liver by 
various P450 enzymes. Food does not affect total absorption but can 
delay it. 


Consistently with its pharmacokinetic profile, the onset of action 
of AMP is rapid, within 1 hour after administration. For immediate- 
release preparations, the duration of action is slightly longer than 
for MPH (around 405 hours), but still requiring at least twice-daily 
administration to ensure adequate coverage. Mixtures of different d- 
AMP and dl-AMP salt formulations (that is, Adderal’) are available 
in the United States, but not in Europe. 

Lisdexamfetamine (LDX) is dextroamphetamine (DEX) cova- 
lently attached to the essential amino acid L-lysine. LDX itself is 
not pharmacodynamically active nor does it result in high DEX 
levels when injected or snorted, thus having a lower abuse potential. 
Following oral administration, the amide linkage between the two 
molecules is enzymatically hydrolysed, releasing active DEX. Most of 
this hydrolysis takes place within red blood cells. Pharmacokinetics 
of d-AMP after single-dose oral administration is linear; in adults, 
there is no accumulation of DEX at steady state nor accumulation of 
LDX dimesylate after once-daily dosing for 7 consecutive days. The 
mean plasma half-life of d-AMP is about 11 hours. 

Oral MPH is rapidly absorbed from the gastrointestinal tract, 
with peak plasma concentrations occurring about 1.5-3 hours after 
administration. The elimination plasma steady half-life of d-threo- 
MPH is about 3-3.5 hours. Because of this short-half life, steady 
state for MPH is not achieved during regular treatment, although 
there is a theoretical possibility of steady state developing with high 
doses of extended-release preparations or in poor metabolizers. 
Classroom studies suggest a close relationship between the pharma- 
cokinetic profile and pharmacodynamic properties [42]. Optimal 
clinical effect appears to be associated with rapidly increasing levels 
in the morning, followed by a steadily increasing plasma level across 
the rest of the day. 

Concerta XL’, Matoride XL’, Equasym XL’, Medikinet Retard’, 
and Ritalin LA’ all provide a mixture of immediate- and extended- 
release MPH; they differ in the mechanics of the delayed-release 
system and in the proportion of immediate-release to delayed-release 
MPH. Concerta XL and Matoride XL effects last 10-12 hours, while 
Equasym XL’, Ritalin LA’, and Medikinet Retard can be considered 
mid-release formulations lasting between 6 and 8 hours. 

Transdermal patches (Daytrana’) allow about 12 hours of effect 
if worn for 9 hours and are available in the United States. This for- 
mulation exhibits minimal first-pass metabolism, resulting in high 
bioavailability of MPH. Quillivant XR’ (5 mg/mL) is a recent long- 
lasting liquid preparation of MPH. Peak plasma levels occur ap- 
proximately 5 hours after dosing, with effects marketed to last for 
up to 12 hours. 

These differing delivery profiles provide the clinician with in- 
creased options when choosing which preparation to use. They 
allow a more flexible and sensitive individualized adjustment, 
while retaining the benefits of an extended-release preparation. 
Pharmacokinetic profiles may, however, show considerable inter- 
individual variation, and caution should be observed when general- 
izing from aggregated profiles to individual cases. 


Interaction with other drugs 


Drugs for ADHD show little interference with the metabolism and 
pharmacokinetics of other medications. There is, however, a po- 
tential to inhibit the metabolism of anticonvulsant drugs, such as 
phenobarbital, phenytoin, and primidone, and of tricyclics. Drugs 


for ADHD may potentiate stimulating effects of other drugs on the 
cardiovascular or central nervous system and can increase pressor 
response to vasopressor agents. There are potentially dangerous 
interactions with substances of abuse like cocaine and other sym- 
pathomimetic agents, including ATX. 


Clinical efficacy 


Strong evidence supports the efficacy of drugs for ADHD in redu- 
cing ADHD core symptoms over treatment periods of up to a year, 
and numerous placebo-controlled randomized trials confirm their 
effectiveness in the short term, with effect sizes of between 0.8 and 
1.1 on hyperactivity symptoms [2]. 

Drugs for ADHD reduce restlessness, inattentiveness, and im- 
pulsiveness markedly and rapidly, but they also have an important 
role in improving the quality of social interactions and decreasing 
aggression. Beneficial effects also extend to a broad range of associ- 
ated functional impairments and comorbidities, including increased 
academic achievement [43], reduced risk of emergency admission to 
hospital for trauma [44], lower risk of depression [45] and suicidal 
events [46], and decreased rates of substance abuse [47] and crimin- 
ality [48], although effect sizes are generally somewhat smaller than 
for symptom reduction per se. 

Knowledge about the effectiveness of medication in ADHD chil- 
dren and adolescents has also been expanded in specific spopulations. 

Within the Preschoolers with ADHD Treatment Study (PATS) 
trial, 160 children younger than 6 years were randomized to placebo 
or immediate-release MPH. The magnitude of MPH effect (dose 
range 2.5-7.5 mg) was lower than typically observed in school-aged 
children, with an increased frequency and severity of adverse events 
(that is, greater mood lability or reduced growth rate and treatment 
discontinuation in about 11% of cases) [49]. 

Positive results, but with more frequent and severe adverse effects 
have also been observed in ADHD children with comorbid autism 
spectrum disorder (ASD). A meta-analysis showed MPH to be ef- 
fective (effect size 0.67) for treating ADHD symptoms in children 
with pervasive developmental disorders, but its relatively lower tol- 
erability must be taken into account [50]. 

Although early studies suggested that children with comorbid 
anxiety or internalizing symptoms displayed lower response rates 
on ADHD symptoms, more recent studies, including the MTA 
study, do not support this. Drugs for ADHD at a group level show a 
beneficial effect on anxiety [51], and several recent studies showed 
that they are quite effective in controlling ADHD in the context of 
Tourette's disorder. 

A recent meta-analysis also shows that MPH is clearly an im- 
portant potential therapeutic option for comorbid disruptive behav- 
ioural problems and aggression in patients with ADHD and conduct 
disorder. Studies of MPH efficacy on aggression in conduct disorder 
without ADHD are still lacking [52]. 

LDX has been shown to be more effective than placebo on ADHD 
symptoms (effect size 1.8). LDX was also effective in treating day- 
to-day problems associated with ADHD, significantly improving 
quality of life [53]. The randomized withdrawal of treatment after 
24 weeks of open-label treatment indicated the continued benefit of 
LDX treatment, with treatment-emerging adverse events generally 
consistent with those associated with the MPH preparation, used as 
the active comparator [54]. 
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Long-term efficacy of drugs for ADHD 


Although the short-term benefit of drugs for ADHD has been re- 
peatedly confirmed within several studies, its long-term effects 
have been less well investigated. The MTA study still represents the 
most valid source of information on the long-term effects of MPH. 
Within this large-scale, random-allocation, non-blind trial, a com- 
parison was made between careful medication management, inten- 
sive behaviourally oriented psychosocial therapy, a combination of 
the two, and a simple referral back to community (usually medica- 
tion) [10]. The main conclusions after 14 months were that careful 
medication was more effective than behavioural treatment. The 
combination of behavioural therapy and medication did have some 
benefits: better control of aggressive behaviour at home, improved 
overall sense of satisfaction of parents, and possibly reduced medi- 
cation dosage. 

The follow-up at 36 months, by which time parents and children 
were free to choose the actual treatment, showed that all four ori- 
ginal groups had a similar outcome; similar results were observed at 
up to 16 years’ follow-up [55]. Various explanations are possible— 
the effects of more intensive therapy disappear when intensive treat- 
ment is stopped; self-selection of patients to treatments at the end of 
the randomization phase may lead to similar outcome (many chil- 
dren assigned to behavioural intervention started medication, and 
a significant percentage of those on intensive medication manage- 
ment actually withdrew medication). The most favourable overall 
development was found in children initially randomized to the MTA 
medication regime, whether or not they were taking medication at 
36 months, thus suggesting some lasting benefit for some children 
with ADHD. 


Non-dopaminergic medications 


Currently, two classes of non-dopaminergic agents are approved 
for ADHD treatment: the selective NA reuptake inhibitor ATX and 
the a2 agonists guanfacine and clonidine. The main clinical differ- 
ences between these agents and dopaminergic drugs are their more 
favourable legal status, their potential for a long (up to 24 hours) 
duration of clinical efficacy, and a generally slower onset of action 
(at least several weeks). 


Mechanism of action and pharmacokinetics 


ATX is a selective inhibitor of NA synaptic reuptake. In vitro, it 
shows high affinity with NETs, and in vivo, it induces an increase 
in NA extracellular concentrations in the prefrontal cortex. In this 
region, despite its low affinity for its transporters, it also results in 
a strong increase in the extraneuronal concentration of DA, which 
remains stable in the nucleus accumbens and striatum [56], making 
abuse or triggering of tics by ATX unlikely. 

ATX is metabolized mainly through the hepatic cytochrome P450 
2D6 enzymatic system (CYP2D6), resulting in metabolites with 
clinically significant activity. Once-daily ATX is associated with a 
decrease in 3,4-dihydroxyphenylethylene glycol (DHPG), which is 
the main brain metabolite of NA and a biomarker of central NET 
inhibition, persisting for at least 24 hours. Interestingly, there also 
appears to be a dissociation between the pharmacokinetic and 
pharmacodynamic profiles for ATX, with evidence that, despite the 
relatively short half-life, even once-daily dosing can result in clinical 
effects lasting throughout the day. 








SECTION 5 Attention-deficit/hyperactivity disorder 


Guanfacine is a selective a2 noradrenergic agonist, with 15-20 
times higher affinity for a2A adrenergic receptors than for a2B or 
a2C receptors. Extended-release guanfacine (matrix-s tablets, GXR) 
is well absorbed; after oral administration, the time to peak plasma 
concentration is approximately 5 hours. GXR half-life is 16-17 hours, 
allowing once-daily administration. The pharmacokinetic profile of 
guanfacine is linear (first-order) and dose-proportional. The profile 
of GXR differs from that of immediate-release (IR) guanfacine, with 
GXR resulting in 60% lower Cmax and up to 43% lower area under the 
curve (AUC), compared to IR tablets [57]. 

Clonidine is an a2 receptor agonist, which modulates adrenergic 
transmission. Reducing sympathetic activity can give rise to hypo- 
tension, sedation, and irritability. Clonidine appears to be less ef- 
ficacious than conventional drugs in the treatment of ADHD. The 
effectivness in reducing tics in children with ADHD is comparable 
to dopaminergic drugs. In addition to the effects of sedation and 
symptomatic hypotension, clonidine can cause bradycardia and 
dry mouth. 


Clinical efficacy of non-dopaminergic medications 


The clinical efficacy of ATX has been well documented in short- and 
long-term studies, with an effect size of approximately 0.7 across 
studies [58, 59]. The onset of clinical effectiveness of ATX is slower 
than that of dopaminergic drugs, varying between 2 and 4 weeks. 
Full therapeutic effect can take up to 6-8 (or even 12) weeks, but 
responders typically show some degree of improvement by 4 weeks 
[60]. Some patients may continue to improve for 36 weeks [61]. 

Guanfacine showed significant improvement of clinician- and 
teacher-rated symptoms, functional impairment, and positive con- 
tinuous performance outcome measures in a series of controlled 
trials [62]. A recent randomized withdrawal trial showed long-term 
maintenance of efficacy [63]. In adults, guanfacine modulates the 
influence of emotion control on cortical activation for cognitive 
control [64]. 


Other drugs 


Bupropion acts as a weak inhibitor of presynaptic reuptake of NA, 
DA, and 5-HT; it has been licensed for depression and smoking ces- 
sation. It showed better efficacy than placebo in reducing ADHD 
symptoms in children, although lower than dopaminergic drugs. 
Bupropion can cause nausea, insomnia, and palpitations; it can 
also trigger tics and cause dermatological reactions, at times severe 
enough to lead to discontinuation of the drug. Three head-to-head 
trials found bupropion had efficacy comparable to that of MPH, al- 
though a large double-blind, placebo-controlled multi-centre study 
found smaller effect sizes for bupropion, compared to MPH. In 
terms of tolerability, a head-to-head trial found that headache was 
more frequent in the MPH group than in the bupropion-treated 
group, with no difference in other adverse events [65]. 

Tricyclic antidepressants: the mechanism of action of tricyclics 
involve inhibition of presynpatic NA reuptake, with poor select- 
ivity. None of them is approved by the FDA or European Medicines 
Agency (EMA) for the treatment of ADHD; they were prescribed 
off-label for children with ADHD, but after the introduction of ATX, 
they are rarely used because of concerns about their potential car- 
diovascular toxicity. 

Modafinil is a ‘wakefulness-promoting agent’ marketed for the 
treatment of narcolepsy; it has been occasionally used for the 


management of inattention in adults. Its mechanism is poorly de- 
fined as a non-dopaminergic activating action on the frontal cortex. 
A recent meta-analysis on five RCTs showed an effect size of 0.7 on 
ADHD symptoms and a significantly higher incidence of decreased 
appetite and insomnia, but non-significant cardiovascular adverse 
events, compared to placebo [66]. 


Medication safety and management of adverse effects 


Medications for ADHD are generally safe and well tolerated; adverse 
effects are mild, transitory, reversible, and easily manageable by the 
ADHD specialist [3, 4]. The most common side effects of dopa- 
minergic agents include sleep difficulties, irritability, appetite and 
weight loss, headache, tachycardia, and increased blood pressure 
(BP) and heart frequency. Side effects of non-dopaminergic drugs 
mainly include nausea, sedation, and appetite loss, dry mouth, in- 
somnia, constipation, and mood swings. Additionally, urinary 
retention and sexual dysfunction have been observed in adult pa- 
tients. Most of these adverse effects diminish over the first months of 
treatment, with no significant differences between normal and poor 
metabolizers [2]. 

More severe adverse reactions, like psychotic symptoms or al- 
lergic reactions, have rarely been observed in association with 
dopaminergic or non-dopaminergic medications [3, 4]. Despite 
this, the tolerability and safety of medications used to treat ADHD 
have recently been raised as a concern to some regulatory author- 
ities. In January 2009, the European Union (EU) Committee for 
Medicinal Products for Human Use (CHMP) concluded that the 
benefit of MPH outweighs the risk when prescribed to ADHD chil- 
dren over 6 years and recommended to standardize prescribing 
and to provide safety information across all EU members. They 
further concluded that research was needed on the long-term ef- 
fects of MPH [67]. As a result of these recommendations, the 
Attention Deficit Hyperactivity Disorder Drugs Use Chronic 
Effects (ADDUCE) consortium was established to confirm MPH 
safety; it included experts in the fields of ADHD, drug safety, 
neuropsychopharmacology, and cardiovascular research (http:// 
www.adhd-adduce.org). 


Cardiovascular effects 


MPH, AMP, and ATX are sympathomimetic agents that increase 
noradrenergic and dopaminergic transmission; an effect on heart 
rate (HR) and BP is therefore an intrinsic feature of their pharma- 
cological activity [68, 69]. MPH and ATX may be associated with 
generally small elevations of BP (<5 mmHg) and HR [<10 beats/ 
minute (bpm)] at a group level; a subset of children and adolescents 
(around 5-15%) may experience greater treatment-related increases 
in HR or BP, over the 95th centile, or may report a cardiovascular- 
type complaint during drug treatment [69]. 

The 10-year follow-up of the MTA study [70] found no effect of 
treatment on either systolic or diastolic BP; however, use of dopa- 
minergic drugs was associated with a higher HR at years 3 and 
8. A recent meta-analysis conducted in ADHD children and adoles- 
cents treated with MPH, AMP, or ATX, including 18 trials with data 
from 5837 participants (80.7% boys) and an average treatment dur- 
ation of 28.7 weeks, revealed that all three medications were associ- 
ated with a small, but statistically significant, pre-post-increase of 
systolic BP (SBP). Compared to AMP and ATX, MPH did not show 
a pre-post-effect on diastolic BP (DBP) and HR. Only 2% of patients 


discontinued their medication due to any cardiovascular effect. In 
the majority of patients, the cardiovascular effects resolved spon- 
taneously or with medication dose changes [71]. A meta-analysis in 
adult patients with ADHD reported that treatment is associated with 
small, but significant, increases in SBP (+2.0 mmHg) and HR (+5.7 
bpm), but no effect on DBP [72]. No changes in ECG parameters, 
including PR, QRS, and QT intervals, have been associated with the 
use of MPH and ATX [69]. However, it is a concern that even small, 
but persistent, BP and/or HR increases may increase the risk for ser- 
ious cardiovascular events, including sudden cardiac death, acute 
myocardial infarction, and stroke, in the long term. The magnitude 
of this risk remains to be established. 

The available epidemiological studies, summarized by 
Hammerness et al. [65], do not show a significant association be- 
tween ADHD drugs and serious cardiovascular events. A recent 
large study of 1,200,438 children and young adults between 2 and 
24 years found no evidence that ADHD drugs increased the risk of 
serious cardiovascular events [73]. Similarly, several large registry 
studies of adults [74] and children (3-17 years) [75] suggest that 
ADHD medication, when medically supervised, was not associated 
with an increased risk of severe cardiovascular events. 

Current guidelines [3] recommend that patients being considered 
for ADHD medication should have a clinical assessment, including 
identification of any known heart disease, any history of syncope 
with exercise, and any family history of sudden unexpected death 
under the age of 40 years. HR and BP should be taken at baseline and 
repeated every 3-6 months. It is important to make a referral for fur- 
ther assessment when indicated, and for patients with pre-existing 
cardiac conditions, dopamine or NA-releasing drugs should be used 
cautiously and only after consultation with a cardiologist. 


Growth and developmental effects 


One of the most common side effects of drugs for ADHD is appe- 
tite loss, with consequent reduced weight gain and possible growth 
reduction with prolonged use. The mechanism by which growth is 
affected has not yet been fully clarified. The decrease in appetite and 
the consequent reduction in caloric intake represent the obvious 
probable cause of the growth slowdown [76]. Other possible mech- 
anisms include medication effects on hepatic and/or CNS growth 
factors and direct effects on cartilage. 

Studies providing longitudinal data suggest the height deficit is 
approximately 1 cm/year during the first 3 years of treatment, while 
some other data suggest that these effects tend to attenuate over time 
and ultimate adult growth parameters are generally not affected 
(72, 77]. 

In the recently published MTA follow-up into adulthood [55], 
prolonged use of MPH in the ADHD group resulted in an average 
height of 1.29 + 0.55 cm shorter than the control group (p <0.01, 
d = 0.21). Within the treated sample, adherence to drug treatment 
was classed as consistent, inconsistent, or negligible: participants 
with the consistent or inconsistent pattern were 2.55 + 0.73 cm 
shorter than the subgroup with the negligible pattern (p <0.0005, 
d=0.42). 

Preliminary analysis of the large 2-year naturalistic 
pharmacovigilance ADDUCE study [67] shows a lower height vel- 
ocity standard deviation score (SDS) in medicated ADHD subjects 
than in unmedicated ones, roughly equivalent to a reduction in 
height velocity of about 0.35 cm/year for a 9.6-year-old boy (mean 
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age of subjects in the study). In post hoc analyses, the impact on a 
child’s height velocity SDS was apparently associated with the se- 
verity of illness, suggesting a possible dose-dependent effect of MPH. 

With regard to ATX, a meta-analysis of seven double-blind/ 
placebo-controlled and six open-label studies found that the mean 
actual weight and height at 24 months were, respectively, 2.5 kg and 
2.7 cm lower than the expected values [78]. The difference occurred 
mostly during the first 18 months of treatment. 

In order to prevent growth suppression and ensure an adequate 
growth pattern to subjects in the age of development, current guide- 
lines [3] recommend: 


e Monitoring appetite, weight, height, and BMI every 6 months. 

e Differentiating between pretreatment eating problems and 
medication-induced eating problems. 

e Giving medication after meals, rather than before. 

e Encouraging the use of high-calorific snacks and late evening 
meals. 

e Reducing the dose or switching to an alternative class or form- 
ulation. 

e Discontinuing medication on weekends to prevent weight loss or 
longer drug holidays to allow for catch-up growth. 

e Referring to a paediatric endocrinologist/growth specialist if 
height and weight values are below critical thresholds. 


Neurological effects 

Tics 

Drugs for ADHD might, from a theoretical point of view, exacerbate 
tic severity as they can increase DA activity in the basal ganglia [79]. 
A meta-analysis, including nine DBRPC trials, examining the effi- 
cacy of medications for ADHD in children with comorbid tic (a total 
of 477 subjects) concluded that MPH does not worsen tic severity in 
the short term, although it is possible that MPH may worsen tics in 
individual cases and that ATX can, on the other hand, significantly 
improve comorbid tics [80]. The recent Cochrane group systematic 
review [81] concluded the same. With regard to ATX, although it 
can have a positive effect on tic frequency, some case reports have 
described exacerbation of tics during ATX treatment [4]. Thus, tics 
are no longer a contraindication for the use of ADHD drugs in the 
EU, but caution is still recommended [82] (EMA, 2010). 

The management of tics should include: (1) observation of the in- 
tensity of tics over a 3-month period before any decision regarding 
ADHD treatment; (2) dose reduction; (3) substitution; and (4) if the 
previous measures are not effective, an antipsychotic can be added 
to control tics. 


Sleep problems 


Sleep problems are common in individuals with ADHD, and it is 
also important to rule out primary sleep disorders that may mimic 
or exacerbate ADHD. Both AMP and MPH may increase the latency 
to sleep onset, shorten sleep duration, and decrease sleep efficiency 
[83]. It is necessary to assess sleep at baseline and to continually 
monitor throughout treatment, as all ADHD medications can af- 
fect sleep. Dose timing adjustments, switching drug formulations, 
or adding an evening dose of melatonin are often helpful strategies 
for drug-exacerbated insomnia in children displaying a good medi- 
cation response [84]. 
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Seizures and epilepsy 


Although longer-term effects of MPH and its effects in children with 
frequent seizures need further study, current evidence supports 
the use of MPH for the treatment of ADHD in patients with well- 
controlled epilepsy and even in those with infrequent seizures [85]. 
When epilepsy is poorly controlled, the frequency of seizures should 
be carefully monitored; if their frequency increases, or seizures de- 
velop de novo, then ADHD medication should be stopped. 


Psychiatric effects 
Psychotic, mood, and other psychiatric symptoms 


Drugs for ADHD have been associated rarely with possible severe 
psychiatric effects, including psychotic symptoms (hallucinations). 
A review of the literature does not strongly indicate an association 
between MPH treatment for ADHD and psychosis. However, some 
open-label trial extension studies reported discontinuations due to 
psychotic symptoms. In contrast, some positive clinical experience 
(drugs for ADHD being helpful) have been reported for managing 
ADHD symptoms in the context of a psychotic disorder. A single 
study also found that medication with MPH in childhood could 
show a protective effect by reducing schizotypic features in adults. 

Suicide-related events have also been investigated during treat- 
ment with drugs for ADHD, with symptoms sometimes anecdotally 
improving following discontinuation of medication. A recent 
population-based electronic medical records study showed that, in 
fact, the incidence of suicide attempts was higher in the period im- 
mediately before the start of MPH treatment. The risk remained ele- 
vated immediately after the start of MPH treatment but returned to 
baseline levels during continuation of the treatment. The observed 
higher risk of suicide attempts before treatment may reflect emer- 
ging psychiatric symptoms that trigger medical consultations, re- 
sulting in MPH treatment [86]. 

Population-based studies indicate that medication for ADHD is 
associated with a reduced long-term risk (that is, 3 years later) for 
depression [45] and a potential protective effect on suicidal behav- 
iour [87]. However, individual patients being treated with medica- 
tions for ADHD should be observed for the emergence of psychotic 
symptoms, depression, irritability, and suicidal ideation, as part of 
routine monitoring. 

Mood lability, dysphoria, anxiety, hostility, and explosive out- 
bursts may be observed in 5-10% of children taking drugs for 
ADHD. Among children whose aggressive behaviour develops in 
the context of ADHD and oppositional defiant disorder or conduct 
disorder, systematic, well-monitored titration of monotherapy often 
reduces aggression considerably, thus averting the need for add- 
itional medications [88]. 


Substance misuse 


DA-releasing drugs have the potential for misuse and can be di- 
verted by patients or families to this end [89]. The extended-release 
formulations of drugs for ADHD are less prone to diversion, because 
they do rapidly increase blood drug levels and are also less easily 
crushed into powder for injection or snorting. Once-a-day admin- 
istration also makes parental supervision easier to enforce. Non- 
dopaminergic medications (that is, ATX) are another option with 
low abuse potential. 


Concern has been raised that therapeutic use of drugs for ADHD 
may result in ‘sensitization’ and possibly increase the risk for sub- 
stance use disorder (SUD) later in life. However, ADHD is associ- 
ated with impulsivity and conduct disturbances, and represents itself 
a risk factor for SUD. Naturalistic follow-up studies do not support 
the contention that drugs for ADHD increase the risk for SUD (for 
example, Swedish registry studies found drug treatment associated 
with decreased rates in patients with ADHD) [48]. 


Management and treatment of ADHD 
in adulthood 


ADHD is a chronic condition with symptoms frequently persisting 
into adulthood. About 60% of affected children have, as adults, sig- 
nificant ADHD-related impairments, including social dysfunction, 
educational and occupational underachievement, substance abuse, 
increased risk for accidents, and legal difficulties [90]. Although 
awareness, recognition, and diagnostic criteria of ADHD in adults 
have improved in recent years, there is still the need to increase 
physician support and specialized services for the clinical manage- 
ment of this disorder, as patients make the transition to the adult 
health care system [91, 92]. 

The NICE guidelines in the UK provide strong support for the 
provision of adult ADHD services, recommending that young 
people with ADHD and receiving treatment from child and adoles- 
cent mental health services (CAMHS) or paediatric services should 
be reassessed at school-leaving age to establish whether treatment 
should continue into adulthood. 

Adult ADHD symptomatology differs from the typical childhood 
presentations; hyperactivity/impulsivity may diminish over time, 
while inattention tends more often to persist, with a greater impact 
in adults. Optimal strategies for ADHD in adulthood are, as in child- 
hood, multimodal, with the main goals of improving symptoms and 
optimizing functional performance [93]. Adults respond well to the 
same classes of medication used in children, and MPH and AMP are 
confirmed as first-line pharmacologic interventions [94]. Adults with 
ADHD have, in fact, reported improved social relationships, aca- 
demic/work functionality, and driving, together with reduced criminal 
behaviour, as a consequence of ADHD medications [95]. Additionally, 
adults with ADHD treated with drugs prior to 18 years have been 
shown to have better outcomes across broad quality of life measure- 
ments, as compared with subjects not previously so treated [96]. 

Just as for children and adolescents, drug treatment of adult 
ADHD consistently yields positive short-term effects, but few trials 
have evaluated their long-term efficacy and safety. Poor compliance 
and comorbid psychiatric disorders may further complicate the de- 
termination of treatment benefits in adults with ADHD [97]. ADHD 
medications can cause side effects also during adulthood. The most 
worrying include increases in both BP and HR. However, as con- 
firmed by a recent large population cohort study [70], there is no 
increased risk for acute myocardial infarction, sudden cardiac death, 
or stroke for current ADHD medication users, compared to non- 
users. While the risks of serious side effects are thought to be low for 
healthy adults, physicians should, however, be cautious when pre- 
scribing for patients with cardiovascular disease, seizure disorders, 
and psychosis [98]. 


Even though pharmacologic interventions are considered first- 
line treatments for ADHD, some adults with ADHD continue to 
experience significant residual and impairing symptoms. CBT has 
been shown helpful in the treatment of adult ADHD, alone and in 
combination with psychopharmacology [99]. 

Another psycoeducational approach that has gained increasing 
popularity in the last year is coaching. It is a highly individualized 
intervention, in which a personally assigned coach guides the pa- 
tient in accomplishing tasks and goals. Coaching differs from trad- 
itional CBT, because it is more focused on solving specific problems 
or reaching specific goals and is more accessible to the patient on an 
as-needed basis [100]. 


ADHD is a chronic, heterogenous condition, with a high preva- 
lence and persistence into adult life and the potential for serious 
functional impairment extending beyond the affected individual. 
Established efficient assessment methods and good treatment evi- 
dence help clinicians in monitoring and managing this disorder in 
childhood and adulthood. ADHD drugs, available in different for- 
mulations, are, especially in the short term, currently among the 
more effective drugs in psychiatry and perhaps in general medicine 
[101], with a good safety profile. Long-term treatment efficacy and 
tolerability and safety data are still insufficient and need more re- 
search. On the basis of the current evidence, however, there does not 
seem to be any necessity to change current clinical guidelines for the 
monitoring of the main possible adverse events. 
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Neurodevelopmental motor disorders are a group of conditions 
characterized by developmental deficits in learning, control, and 
execution of motor skills. These deficits may manifest as difficulties 
in programming and executing fine and gross motor tasks that are 
crucial for routine activities of daily living and social and academic 
abilities, as well as an inability to control excessively repetitive or 
unwanted motor behaviours. The appropriate realization of a motor 
programme or a sequence of movements requires the convergence 
of numerous pathways [sensory (somatosensory, proprioceptive, 
visual, vestibular) pathways, motor pathways, and regulatory path- 
ways, for example cortico-basal ganglia, cerebello-thalamo-cortical 
and intracortical networks]. This neuroanatomical substrate in- 
volves cortical and subcortical structures regulating the correct 
integration of sensory inputs and motor outputs, which allows the 
successful execution and learning of advantageous goal-directed 
movements. The correct functional development of motricity is 
strongly supported by experience, with appropriate synaptic circuits 
being reinforced and established as a result of recurrent practice of 
efficient movement patterns. 

The most common neurodevelopmental motor disorders are 
tic disorders, stereotypic motor disorder, and developmental co- 
ordination disorder (DCD), which will be the object of this chapter. 
































Phenomenology 


Tic disorders are characterized by the recurrent presence of tics, 
which can be defined as unwanted, discrete, non-goal-directed, 
non-rhythmic movements (motor tics) or vocalizations (vocal 
tics). The basic features of tics which help distinguish them from 
other abnormal behaviours include: repetitive and patterned 
character; variability of the type of movement/vocalization and 
severity; association with premonitory urges; and partial or com- 
plete suppressibility on demand. Three tic disorders are included in 
DSM-5: Tourette’ disorder [also called Tourettes syndrome (TS)], 
persistent (also called chronic) motor or vocal tic disorder; and pro- 
visional tic disorder [1]. The core diagnostic criteria of tic disorders 
are listed in Box 37.1. 





The repetitive and patterned character of tics is an important fea- 
ture to differentiate them from other unwanted movements such 
as chorea or myoclonus. Regardless of their repetitive quality, tics 
are highly variable within and across subjects, as regards to muscle 
groups involved (and thus body location), complexity, and interfer- 
ence with normal voluntary behaviour. The overall severity of tics 
fluctuates over time, and each tic may wax and wane over the course 
of time. Their changing pattern is one of the main differences with 
stereotypies, discussed later in this chapter. 

Although virtually any striatal muscle group in the body can be 
involved in a tic, a cranio-caudal gradient of frequency is typically 
observed, with the face, head, and proximal upper limbs being the 
most commonly affected. At the same time, the ability to suppress 
tics voluntarily is maximal where tics are least frequent (trunk and 
lower limbs), and minimal in the most cranial body areas [2]. 

Tics are also classified as simple and complex, depending on 
whether they involve a single muscle group or consist of co- 
ordinated sequences or combinations of movements/vocalizations 
involving several muscle groups. Motor tics may therefore range 
from very rapid, simple motor twitches (for example, blinking, fa- 
cial grimacing, head flicks, shoulder shrugs, etc.) to orchestrated 
sequences of movements, sometimes resembling purposeful behav- 
iour. Likewise, vocal tics may range from monosyllabic, meaning- 
less vocal utterances to longer fragments of verbal language with a 
semantic content. Complex tic-like repetitive behaviours include 
also: echophenomena (echopraxia, echolalia) and paliphenomena, 
that is, non-voluntary repetition of movements or vocalizations 
made by another person or his/her own; and socially inappro- 
priate words or gestures, which can also have obscene content, as in 
coprophenomena (coprolalia, copropraxia). When severe enough, 
tics may cause impairment in social, academic, and professional 
functioning. Some tics may have a violent and potentially (although 
not deliberately) self-harming quality, for example, head banging, 
hitting, etc. 

Premonitory urges (PUs) are uncomfortable sensations that 
build up in intensity immediately before tics and are markedly al- 
leviated immediately after tic release. Often the linked PUs and tics 
occur in the same body region. In some patients, PUs are described 
more as a poorly defined feeling of pressure, tension, or restless- 
ness around the body part associated with a certain tic. Only about 
25% of pre-pubertal children are able to describe PUs, whereas this 
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Box 37.1 DSM-5 criteria for tic disorders 


Tourette's syndrome (TS) 
For a person to be diagnosed with TS, he or she must: 





e Have two or more motor tics (for example, blinking or shrugging the 
shoulders) and at least one vocal tic (for example, humming, clearing 
he throat, or yelling out a word or phrase), although they might not 
always happen at the same time. 

e Have had tics for at least a year; the tics can occur many times a day 
usually in bouts) nearly every day, or off and on. 

e Have tics that begin before he or she is 18 years of age. 

e Have symptoms that are not due to taking medicine or other drugs 

r due to having another medical condition (for example, seizures, 
untington disease, or post-viral encephalitis). 














Q 











Persistent (chronic) motor or vocal tic disorder 

For a person to be diagnosed with a persistent tic disorder, he or she 
must fulfil the same criteria of TS, apart from the first criterion, which is 
as follows: 


e The person must have one or more motor tics (for example, blinking 
or shrugging the shoulders) or vocal tics (for example, humming, 
clearing the throat, or yelling out a word or phrase), but not both. 








Provisional tic disorder 
For a person to be diagnosed with this disorder, he or she must fulfil the 
same criteria of persistent (chronic) tic disorder, with the only difference 
that tics have been present for no longer than 12 months in a row. 





Source: data from the Diagnostic and Statistical Manual of Mental Disorders, Fifth 
Edition, DSM-5, Copyright (2013), American Psychiatric Association. 











self-awareness increases dramatically to about 60% during late teen 
years [3]. Fig. 37.1 summarizes the differences in the main ante- 
cedents between tics, compulsions, and impulsive actions. 





Patients with tics may also report ‘just-righť phenomena, in which 
they reiterate actions that may or may not involve sensory experi- 
ences (for example, repeatedly touching an object or a body part) 
until these feel ‘just right. Just-right’ phenomena are more common 
in patients with comorbid obsessive-compulsive disorder [3]. 

Another relevant sensory feature associated with tics is the un- 
usual focus of attention on external stimuli which are repetitive, 
faint, and poorly salient. This phenomenon has been named som- 
atic hypersensitivity or site sensitization and is not related to ab- 
normalities in the ability to discriminate between sensory stimuli 
of different intensities [4]. Some patients may become easily and 
disproportionately irritated by excessive irritation by environmental 
noise (misophonia). The origin of somatic hypersensitivity in tic dis- 
orders is still not very well understood but may be related to abnor- 
malities of sensorimotor gating. 

The ability to suppress tics is higher in adults than in children and 
varies across patients. Although suppressing tics leads to a rise in 
PU intensity, the quality or severity of PUs is not associated with 
the ability to hold tics in [2]. Although a proportion of patients 
may experience a rebound worsening of tics, following prolonged 
tic suppression, this is not a constant feature. More rapid tics, for 
example eye movements or blinking, are often too quick to be sup- 
pressed. Active tic suppression demands attention and may distract 
from complex cognitive activities, including school-based learning 
activities. Voluntary suppression of tics is facilitated by reward, 
for example in social situations where inhibiting tics may lead to 
greater social acceptance. Relaxation, physical exercise, engagement 
in attention-demanding tasks, and reduced sympathetic tone may 
also reduce tic severity. At the same time, psychosocial stress is the 
most effective contextual factor that modulates tic severity. Raised 
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Fig. 37.1 Basic phenomenologic differences between tics, compulsions, and impulsive behaviours. 


anxiety levels and fatigue are also important precipitants of tics. 
Moreover, in some cases, tics and PUs may be precipitated by talking 
or thinking about them (metacognitive inducibility) [5]. 

Tic-like rapid movements presenting in adulthood may be la- 
belled as tics, but at least a proportion of them are considered to 
have a functional origin. Apart from age at onset, these abnormal 
movements differ from typical ‘organic’ tics for their female pre- 
dominance, sudden onset often precipitated by minor trauma or 
panic attacks, uncommon association with PUs or suppressibility, 
common suggestibility, lack of association with a family history of 
tic disorders or a personal history of ADHD or OCD, and no re- 
sponse to anti-tic medications such as dopamine antagonists [6]. 

Tics typically begin between 5 and 8 years of age, and in up to 
95% between ages 4 and 13. The first tics are more frequently simple 
and involve cranial muscles, usually facial (blinking, sniffing, etc.). 
Severity peaks around puberty (age 10-12) when complex tics usu- 
ally appear, and tics may spread beyond the cranio-cervical region. 
Tics usually decrease in the second decade. Persistence of clinically 
relevant tics in adulthood was found to be associated with higher tic 
severity in childhood, lower caudate volumes, and poorer perform- 
ance on tasks involving visuomotor skills during childhood [7]. 

Most prevalence studies on tic disorders have focused on paedi- 
atric populations. A meta-analysis of prevalence data from 1985 to 
2011 found that studies applying a school-based assessment yielded 
a combined prevalence of 0.77% for TS, 1.61% for chronic tic dis- 
orders, 2.99% for transient tic disorders, and 2.82% for the ‘all tic 
disorders’ category in the population of up to 16 years of age [8]. 
There is, however, large variability of estimates across studies, due 
to differences in design and conduct. A male predominance was de- 
tectable only in TS, but not in other chronic and transient (provi- 
sional) tic disorders. Cohort studies of youth with TS report that half 
to two-thirds of these patients experience a significant decline of tic 
symptoms that roughly coincides with late adolescence. In contrast 
with this, a study in which adults previously diagnosed with TS in 
childhood were evaluated again after age 20 found that 90% still had 
tics, suggesting that tic disorders are not rare in adulthood [9]. 

The lifetime prevalence of any psychiatric comorbidity among TS 
patients was estimated at 85-90%, with almost 60% suffering from 
two or more psychiatric disorders, and more than 70% meeting cri- 
teria for OCD or ADHD [2, 3]. Overall, other symptoms/behaviours 
are often a greater cause of distress than tics and have a negative im- 
pact on TS patients’ quality of life. 

ADHD is the most common comorbidity, affecting up to 60% 
of TS patients, particularly male children. ADHD typically begins 
before the onset of tics, and its presence is associated with earlier 
tic onset (5.8 vs 6.2 years in TS patients without ADHD). ADHD is 
associated with a higher likelihood of reduced social functioning, 
anxiety disorder, anger control and sleep problems, mood disorders, 
and conduct or oppositional defiant disorder. 

Obsessive-compulsive symptomatology in TS is most often as- 
sociated with symmetry behaviours, ‘just-right’ experiences, and 
repetitive touching, although other types of obsessive-compulsive 
behaviours may also be observed. In a large sample of more than 5000 
children and adolescents with TS, comorbid OCD was associated 
with ADHD, mood and anxiety disorders, conduct disorder/oppos- 
itional defiant disorder, and autism spectrum disorder (ASD) [10]. 

Self-injurious behaviour is encountered in up to 60% of TS pa- 
tients. TS patients are 1.4 times more likely to suffer from anxiety than 
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the general population, independent of comorbid OCD and ADHD. 
Up to 30% of patients may develop mood disorders in their lifetime, 
although this seems mostly accounted for by comorbid OCD. About 
10% of youth with chronic tic disorders experience suicidal thoughts 
and/or behaviours [11]. Prevalence estimates of ASD in TS may be 
as high as 13%. Of note, ASD were found to be associated with a 
higher risk of rage attacks/explosive outbursts. Finally, 64% of adult 
TS patients were found to have at least one comorbid personality 
disorder, with borderline, depressive, obsessive-compulsive, para- 
noid, passive aggressive, and avoidant being the most common [10]. 


Pathogenesis 


Structural and functional abnormalities were detected in TS patients 
at numerous levels of the cortico-striato-thalamo-cortical (CSTC) 
network, which is fundamental for the programming and execution 
of motor actions and habit formation [12]. On a microscopic level, 
one transcriptomics and two neuropathological studies of previously 
medicated TS adults showed reduced numbers and gene expression 
within inhibitory GABAergic and cholinergic interneurons in sen- 
sorimotor parts of the striatum and internal segment of the globus 
pallidus. A functional imbalance between inhibition/excitation in 
these structures was hypothesized. Animal model work on rodents 
and monkeys have shown that injection of the GABA, antagonist 
bicuculline in sensorimotor parts of the striatum leads to disinhib- 
ition of the CSTC loops and to generation of tic-like rapid move- 
ments, whereas injection in the nucleus accumbens (ventral or limbic 
part of the striatum) leads to vocal utterances resembling vocal tics. 
Tic-like generation in one of these models followed the somatotopical 
distribution of respective disinhibited striatal motor areas [13, 14]. 

Structural neuroimaging studies in TS patients have provided 
further evidence of abnormalities in CSTC circuits (in particular, in 
sensorimotor and limbic loops). Voxel-based morphometry studies 
reported volumetric changes of the caudate, putamen, midbrain, 
and hippocampus, whereas diffusion tensor imaging tractography 
showed reduced connectivity between the supplementary motor 
area and the basal ganglia, as well as in frontal cortico-cortical 
circuits [15]. Event-related fMRI revealed activation of the sen- 
sorimotor area and posterior parietal cortex at tic onset, whereas 
premotor areas, the anterior cingulate cortex, and the insula were 
activated in the 2 seconds prior to tic (hence, possibly corresponding 
to the urge to tic). It has thus been speculated that urge activity in 
the insula could influence the primary motor cortex to produce tics, 
bypassing the premotor cortex [16]. 

There is evidence for defective inhibitory mechanisms in TS 
brains, at the level of both intracortical inhibitory microcircuits 
and sensorimotor gating. The latter is classically explored, assessing 
prepulse inhibition, where an initial small sensory stimulus inhibits 
a reflex response to a second stronger stimulus. Prepulse inhibition 
defects have been reported in a proportion of patients with tic dis- 
orders and could be associated with the somatic hypersensitivity ex- 
hibited by these patients [17]. 

Another important theory at the basis of tic generation supports 
the involvement of overactive dopaminergic systems. This is sug- 
gested by the observation of excessive phasic dopamine release fol- 
lowing stimulation with amphetamine in nuclear imaging studies. 
It also gives support to the idea that tics represent a form of habit, 
arising from a dopaminergically mediated reward associated with the 
relief of the uncomfortable urge induced by the tic movement [12]. 
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A long list of conditions and psychotropic substances may give 
rise to secondary tics. A later age at onset (late adolescence or adult- 
hood), abrupt onset, and association with other neurological mani- 
festations should prompt to exclude these secondary causes. Box 
37.2 provides a comprehensive list of these causes. 


Management 


A comprehensive history, collected from patients and reliable in- 
formants, and assessment using validated rating instruments for 
tics and associated behavioural symptoms are crucial in guiding the 
management of tic disorders (Fig. 37.2). Appropriate education of 
patients and families is a necessary phase of management, which 
aims at optimizing the understanding and acceptance of symptoms 
and identifying the best coping strategies with the disorder. Both 
thorough assessment and education help the clinician in selecting 
patients who require more active interventions for their tics, when 
these are needed, and whether and which comorbidities should be 
treated with priority. 

Tics typically require active interventions when they are stigma- 
tizing, socially or academically impairing, influencing mood and 
self-esteem, and generating situational anxiety. Also, a smaller pro- 
portion of cases experience violent tics that are potentially harmful 
or physically exhausting, leading to pain, headaches, or fatigue. Fig. 
37.3 presents a therapeutic algorithm adapted from recently pub- 
lished guidelines [18, 19]. 

The first line of interventions for tics encompasses pharma- 
cological and behavioural approaches. The choice of a specific 
intervention is not only guided by evidence, but relies also on the 
predicted compliance of each individual to a given treatment, as well 
as on the need to obtain a rapid effect and the direct availability of 
certain interventions (for example, behavioural). 

Dopamine antagonist drugs have been in the therapeutic arsenal 
for tics for more than 40 years. Pimozide (0.5-6 mg) and haloperidol 
(0.5-10 mg) were shown to be superior to placebo by six and three 
RCTs of fair quality, respectively, although haloperidol showed a 
worse tolerability profile. The use of fluphenazine (0.5-20 mg) is 
supported by open-label studies, retrospective case series, and one 
single-blind, placebo-controlled crossover study. Benzamides like 
tiapride and sulpiride (50-200 mg) are also used, especially in some 
European countries. However, there is limited evidence of their ef- 
ficacy; moreover, sedation is frequently reported by patients on 
sulpiride (Table 37.1) [20]. 

Among second-generation antipsychotics, the use of risperidone 
is supported by five RCTs that confirmed its superiority to placebo 
in treating tics. A meta-analysis showed that risperidone (daily dose 
range 1-6 mg) does not differ in efficacy from haloperidol, pimozide, 
and ziprasidone. Initial evidence shows that risperidone could 
be helpful as an augmentation strategy for associated obsessive- 
compulsive symptoms and impulse-control disorder. The efficacy of 
aripiprazole (5-30 mg) is supported by one fair-quality RCT, five 
prospective open-label studies on relatively large clinical samples, 
and one meta-analysis. About two-thirds of treated patients across 
different ages show clinically significant improvement of their tics 
with this medication, with effects lasting over 1 year in up to 50% 
of responders. Cardiac safety may also be larger for aripiprazole, 


Box 37.2 Secondary causes of tics 


e Neurodevelopmental disorders 
— Mental retardation 
— Autistic spectrum disorders (including Asperger's syndrome) 
— Rett's syndrome 
— Genetic and chromosomal abnormalities 
— X-linked mental retardation (MRX23) 
— Albright hereditary osteodystrophy 
— Duchenne muscular dystrophy 
— Factor VIII haemophilia 
— Fragile X syndrome 
— Lesch-Nyhan syndrome 
— Triple X and 9p mosaicism 
— 47,.XXY karyotype 
— Partial trisomy 16 
— 9p monosomy 
— Beckwith-Wiedemann syndrome 
— Tuberous sclerosis 
— Congenital adrenal hyperplasia due to 21-hydroxylase deficiency 
— Phenylketonuria 
— Corpus callosum dysgenesis 
— Craniosynostosis 
— Klinefelter's syndrome 
— Neurofibromatosis 
— Developmental stuttering 
e Acute brain lesions 
— Post-traumatic 
— Vascular 
— Infectious 
— Varicella-zoster virus 
— Herpes simplex encephalitis 
— Mycoplasma pneumoniae 
— Lyme disease 
e Post-infectious 
— Sydenham’'s chorea 
— Paediatric autoimmune neuropsychiatric disorder associated with 
streptococcal infections (PANDAS) 
— Neurodegenerative diseases 
— Huntington's disease 
— Neuroacanthocytosis syndromes 
— Neurodegeneration with brain iron accumulation 
e Other systemic diseases 
Behcet's syndrome 
Antiphospholipid syndrome 
e Peripheral trauma 
e Amphetamines 
e Cocaine 
e Heroin 
e Methylphenidate 
e Pemoline 
e Antipsychotics (D2 blockers) 
— Fluphenazine 
— Perphenazine 
— Thiothixene 
e Antidepressants 
e Antiepileptics 
— Carbamazepine 
— Phenytoin 
— Phenobarbital 
— Lamotrigine 
e L-dopa 
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1. DEMOGRAPHICS 3. COMORBIDITIES 

(age, gender, ethnicity, e OCD: [C]Y- BOCS 
education level, SES, marital e ADHD: SNAP 

status [parental or pts]) e Anxiety/depression: 





SCARED — BDI/BAI 
e Disruptive behaviours: DBRS 
2. TIC EVALUATION e Autism: ASSQ 
e YALE GLOBAL TIC 
SEVERITY SCALE (checklist 








+ severity + overall 4. AREA OF FUNCTIONING 
impairment) — PUTS Engage and listen to parents/partners 
e Engage and listen to (academic and professional proficiency; 
parents/partners hobbies and recreational interests; 
e CONTEXTUAL FACTORS aspirations) 
(no standardized instrument GTS-QoL 
available) Sleep diary (if required) 














Fig. 37.2 Assessment plan in Tourette's syndrome. PUTS, Premonitory Urge for Tics Scale (Woods et al., 2005); OCD, obsessive-compulsive disorder; 
[C]Y-BOCS: [Children]Yale-Brown Obsessive- Compulsive Scale (Scahill et al., 1997); SNAP, Swanson, Nolan and Pelham questionnaire (Bussing et al., 
2008); SCARED, Screen for Child Anxiety and Related Emotional Disorders (Birmaher et al., 1997); BDI, Beck Depression Inventory (Joe et al., 2008); BAI, 
Beck Anxiety Inventory (Steer et al., 1986); DBRS, Disruptive Behavior Rating Scale (Silva et al., 2005); ASSQ, Autism Spectrum Screening Questionnaire 
(Ehlers et al., 1999).§ 
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Fig. 37.3 Decision tree for the treatment of tics (specifically focused on Tourette's syndrome and other primary tic disorders). 


Adapted from Eur Child Adolesc Psychiatry, 20(4), Roessner V, Plessen KJ, Rothenberger A, et al., European clinical guidelines for Tourette syndrome and other tic disorders. Part 
ll: pharmacological treatment, pp. 173-196, Copyright (2011), The Author(s). 
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Table 37.1 Medications for the treatment of tics 




















Daily dose (mg) | Adverse effects Level of evidence 


CG: moderate quality, strong 
recommendation 
ESSTS: level A evidence 


CG: moderate quality, strong 
recommendation 
ESSTS: level A evidence 


CG: high quality, weak recommendation 
ESSTS: level A evidence 


CG: high quality, weak recommendation 
ESSTS: level A evidence 


CG: high quality, weak recommendation 
ESSTS: level A evidence 


CG: moderate quality, weak recommendation 
ESSTS: level B evidence 


CG: low quality, weak recommendation 
ESSTS: level B evidence 


CG: low quality, weak recommendation 
ESSTS: level B evidence 











CG: low quality, weak recommendation 
ESSTS: not evaluated 


CG: not evaluated 
ESSTS: level B evidence 





CG: low quality, weak recommendation 
ESSTS: not evaluated 





CG: low quality, weak recommendation 





CG: low quality, weak recommendation 


CG: very low quality, weak recommendation 
ESSTS: level C evidence 


Clonidine 0.05-3 Sedation, dry mouth, headache, irritability, mid-sleep awakenings, 
rebound hypertension, tics and anxiety following abrupt discontinuation 

Guanfacine 0.5-4 Orthostatic hypotension, bradycardia, sedation, headache 

Haloperidol 0.5-10 Rigidity, parkinsonism, tardive involuntary movements and akathisia; 
appetite changes; weight gain; salivary changes; constipation; depression, 
anxiety; fatigue, sedation; hyperprolactinaemia (galactorrhoea, 
gynaecomastia, irregular menses, sexual dysfunction) 

Pimozide 05-6 Similar to haloperidol, but with less movement disorders; QTc interval 
prolongation 

Risperidone 0.5-16 Sedation, fatigue, depression and acute phobic reactions, weight gain 

Aripiprazole 5-30 Weight gain, increase in BMI and waist circumference, nausea, fatigue, 
sedation, akathisia, movement disorders, sleep problems; probably lower 
risk of QT prolongation than with other antipsychotics 

Ziprasidone 5-40 Sedation, anxiety, akathisia, movement disorders 

Olanzapine ZA) Sedation, weight gain and increased appetite, dry mouth, transient 
hypoglycaemia 

Fluphenazine 0.5-20 Similar to haloperidol, but less frequent 

Sulpiride 50-200 Sedation; less commonly, paradoxical depression, restlessness, sleep 

Tiapride problems, weight gain, hyperprolactinaemia 

Topiramate 50-150 Weight loss, paraesthesiae 

Botulinum toxin Focal weakness, hypophonia 

Delta-9-THC 10 Anxiety, dizziness, fatigue, dry mouth, mood changes, memory loss, 
psychosis, blurred vision 

Quetiapine 50-250 Sedation 

Tetrabenazine 2550 Sedation, fatigue, nausea, insomnia, akathisia, parkinsonism, depression 





CG: very low quality, weak recommendation 


Adapted from Martino, D. and Mink, J.W. (2013). Tic disorders. Continuum (Minneap Minn), 19(5 Movement Disorders), 1287-311. 


CG: Canadian guidelines (Pringsheim 1., 2012). 


ESSTS: European Society for the Study of Tourette Syndrome guidelines (Roessner et al., 2011). 


compared to other dopamine antagonist drugs, in patients with 
tics [20]. 

Alpha-2 agonists have been used to treat tics for more than three 
decades. Clonidine and guanfacine are considered first-line therapy 
in the United States and Canada, thanks for their more favourable 
tolerability profile. Six RCTs of heterogenous quality demonstrated 
superiority to placebo for clonidine in both oral and transdermal for- 
mulations (0.05-3 mg). Treatment with clonidine requires monitoring 
for reduced blood pressure, sedation, headache, irritability, dysphoria, 
and interrupted night sleep. It is also advisable to use a slow titration 
of clonidine in order to improve tolerability, as well as gradual discon- 
tinuation to prevent rebound hypertension. Guanfacine has a longer 
half-life than clonidine and can be administered only once daily (0.5- 
4 mg); it has a similar side effect profile to clonidine. Two RCTs and 
two open-label studies confirmed the efficacy of guanfacine, espe- 
cially in children with tic disorders. Importantly, one meta-analysis 
confirmed superiority to placebo for both alpha-2-agonists only for 
youngsters with comorbid ADHD [21]. 


Tetrabenazine is a presynaptic dopamine depletor that acts by 
blocking the vesicular VMAT2 protein and displays minor post- 
synaptic D2 blockage. Its relatively widespread use in tics is sup- 
ported by its greater tolerability, compared to antipsychotics; 
however, there is limited evidence of its efficacy, mainly supported 
by a single small open-label trial and by retrospective cohort studies 
reporting improvement of tic severity sustained for up to 2 years. 
Among GABA agonists, baclofen and topiramate were found to be 
moderately effective in two medium-sized double-blind RCTs on 
patients of different ages. 

Botulinum toxin type A injections represent a very useful re- 
source in older adolescents and adults for the treatment of per- 
sistent, simple motor tics located to the upper face or neck, as well as 
for a number of vocal tics, including coprolalic utterances. However, 
as with tetrabenazine, its large use is not supported by a strong evi- 
dence of efficacy, which consists only of case reports or small series, 
three retrospective cohort studies, and one RCT. Hypophonia is a 
very common side effect of botulinum toxin injections for vocal tics. 


Delta-9-tetrahydrocannabinol, the main active compound of can- 
nabis, has been tested in two small RCTs on adults with TS, showing 
a modest, but promising, benefit on some outcome measures and 
acceptable tolerability. Nicotine has been investigated as an add-on 
treatment to haloperidol in a number of studies, but its mild efficacy 
is outweighed by intolerable gastrointestinal side effects [20]. 

Complementary and alternative medicine approaches in TS have 
recently attracted growing interest. Among the most promising 
interventions, there is interesting evidence of safety and moderate 
efficacy on tics or tic-related impairment for omega-3 fatty acids and 
chinese traditional medicine approaches that include Ningdong and 
5-Ling granules. The efficacy on tic severity of 5-Ling granules has 
recently been reported by a large double-blind RCT [22]. 

Investigation and application of behavioural therapies specific- 
ally targeting tics have remarkably increased in the last decade. The 
most effective treatment approaches of this kind are habit reversal 
training (HRT), a related and expanded version of the HRT package 
called Comprehensive Behavioral Intervention for Tics (CBIT), 
and exposure and response prevention (ERP) for tics [23, 24]. HRT 
for tics is built on awareness training, during which patients learn 
to recognize in real time their urges and tics; competing response 
training, whereby non-pervasive motor behaviours physically in- 
compatible with tics are selected and implemented to diminish 
tic production; and social support, based on the engagement of a 
support person in promoting the deployment of competing motor 
responses outside of therapy. Six RCTs and other smaller observa- 
tional studies demonstrated the efficacy of HRT in diminishing tic 
severity. This was confirmed by meta-analyses, which detected a 
large treatment effect size, similar to that observed in other widely 
used behavioural interventions for conditions like OCD or anxiety 
disorders. A combination of HRT and acceptance and commitment 
therapy (ACT) has also been proposed but is still not supported 
by sufficient evidence of efficacy. ACT consists of exercises helping 
patients to gain distance from distressing cognitions, emotions, 
and urges. 

CBIT is the most popular and, to date, effective extension of HRT. 
CBIT complements HRT with: (1) function-based assessment, which 
targets contextual factors predictably worsening tics and facilitates 
support persons and patients to identify strategies to counteract tic- 
prone situations; (2) relaxation training, primarily acting on stress 
management; and (4) and psychoeducational interventions. CBIT 
consists of 11 1-hour long sessions, six of which weekly, two sched- 
uled every 2 weeks, followed by three booster monthly sessions, with 
a fixed protocol of homework assignments. This treatment protocol 
was three times superior to supportive therapy plus psychoeducation 
in two large multi-site RCTs, one with children and one with adults 
with TS, demonstrating very good tolerability and low attrition, sus- 
tained improvements for more than 6 months, and amelioration of 
familial distress, obsessive-compulsive symptoms, and anxiety [23]. 
Preliminary evidence demonstrated the non-inferiority of HRT ad- 
ministration via a tele-health approach, compared to face-to-face 
administration. 

ERP is largely used in OCD. Its adaptation for tic treatment relies 
on prolonged exposure to urges associated with sustained suppres- 
sion of tics, thus generating habituation to the urge and a progressive 
decrease in the motivation of the patient to tic. Only one RCT has 
shown non-inferiority of ERP to HRT [24]. 
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A cognitive psychophysiological model, aiming to change the 
background activity against which tics occurs and treat associ- 
ated metacognitive and perfectionist beliefs, has recently been 
tested in an open trial [25]. Other therapeutic approaches focusing 
on stress modulation that have recently been tested in open-label 
studies include mindfulness-based stress reduction programmes 
in small group classes, which yielded a promising percentage of re- 
sponders, or aerobic exercise sessions such as kickboxing routines. 
Neurofeedback modulation of EEG oscillatory activity of cortical 
areas involved in cognitive control has also been tested in prelim- 
inary open-label studies and found to attenuate ADHD and tic 
symptoms. Controlled trials for these approaches are lacking. More 
research is also needed to increase knowledge of predictors of good 
response to CBT and patient selection criteria. 

Within the realm of neuromodulation strategies for more severe 
cases of TS, deep brain stimulation (DBS) remains the only active 
intervention that is extensively being investigated for adult TS pa- 
tients who are deemed refractory to pharmacological and behav- 
ioural interventions. The main patient selection criteria for DBS 
in TS proposed by recommendation documents [26] are listed in 
Box 37.3. 

Consensus on the ideal anatomical target has not yet been 
reached to date, mostly due to the lack of adequately powered ac- 
tive comparator trials. The most frequently used targets, supported 
by crossover RCTs, are the centromedian/parafascicular-substantia 
periventricularis-nucleus ventralis oralis internus crosspoint of the 
thalamus and the globus pallidus pars interna (more frequently the 
anteromedial, or limbic, portion) [27, 28]. Although DBS of TS has 
not raised major safety concerns to date, its efficacy appears to be 
still quite variable across patients, and long-term observation is 
warranted. The application of adaptive approaches using looped 
paradigms that adapt stimulation parameters to real-time brain os- 
cillatory activities associated with tic generation is currently being 
investigated. 


Box 37.3 Recommendation on patient selection for deep brain 





stimulation in Tourette's syndrome 


e Diagnosis of TS or other chronic tic disorder. 

e Age 18 or above (according to some authors, 25 or above). 

e Tics as prominent feature of the clinical presentation. 

e High tic severity (proposed as 235 on the Yale Global Tic Severity 
Subscore for a stable period of time, suggested by some authors of 
at least 12 months) and/or presence of potentially highly disabling 
or harmful tics (for example, tics associated with a clear self-injurious 
component). 

e Tics are considered refractory to pharmacological and behavioural 
interventions. Failure of pharmacological treatment implies having 

tried a course with medications from at least three different pharma- 

cological classes (for example, antipsychotics, a2-agonists, benzo- 
diazepines), lasting at least 12 weeks (according to other authors, 

6 months) for each drug, at adequate dosage and with proven com- 

pliance. Failure of behavioural therapy implies having tried at least ten 

sessions of behavioural treatment (either HRT-based or ERP-based). 

e Stable and optimized treatment of medical, cognitive, and behav- 

ioural comorbidities. 

e Expected compliance during monitoring (stable psychosocial 

environment). 



































SECTION 6 Motor disorders 


Stereotypic motor disorder 


Phenomenology 


Stereotypies are repetitive, non-goal-directed actions predomin- 
antly seen in childhood. Unlike tics, they can be highly consistent 
over time (24 weeks), are generally not preceded by sensory or 
mental experiences, and may be distracted by surrounding stimuli 
(for example, by calling the child’s name) [29]. Common stereotypies 
are the most frequent, occurring in 20-50% of children. These 
comprise non-disabling habits which change pattern during devel- 
opment, including (in order of the most frequent time of appear- 
ance) thumb sucking, body rocking, nail biting, foot tapping, head 
banging, chewing, or hair twisting. Common stereotypies only 
seldom require medical intervention, and this is usually related to 
medical sequelae caused by their persistance and intensity such as 
dental malocclusion, digital deformities, paronychia, skin abrasions, 
or temporomandibular disorders. Complex motor stereotypies con- 
sist of different limb movements (flapping, waving, fisting, finger 
or wrist movements). These are subdivided into primary (stable or 
remitting, occurring in developmentally normal children) and sec- 
ondary (occurring in conjunction with a neurologic or behavioural 
disorder), which, in some cases, may develop into self-injurious be- 
haviours (for example, eye poking, biting, head banging, complex 
hand movements) [30, 31]. Other secondary stereotypies include 
atypical gazing, pacing, running, and jumping, all typically associ- 
ated with autism. Head nodding begins earlier, is a quasi-rhythmic 
stereotypy which may also involve the shoulders, and often regresses 
in later childhood. In general, intense emotional states (particularly 
excitement), being engrossed in demanding activities, fatigue, or 
boredom may trigger stereotypies. Movements related to intense 
imagery have also recently been identified as a subtype of primary 
complex motor stereotypies. 

The DSM-5 defines repetitive habit behaviours that cause impair- 
ment to the child as a stereotypic movement disorder (Box 37.4) 
[1]. Complex motor stereotypies and head nodding fulfil the DSM-5 
criteria for stereotypic movement disorder. 

Birth and developmental history, including defective communica- 
tion and social skills, may be very helpful to identify disorders under- 
lying secondary complex stereotypies. Family history of complex 
motor stereotypies may be present in as many as 40% of cases. Drug 
history is also relevant; for example, toxic effects of amphetamines 
include severe head banging and hand biting stereotypies [29]. 

Complex motor stereotypies have been estimated to occur in 
up to 3-4% of the general population of pre-school children, but 
in more than 60% of autistic pre-schoolers and 25% of non-autistic 
pre-schoolers with similar intellectual abilities [32]. The prevalence 
of complex motor stereotypies may be as high as 40-60% in insti- 
tutionalized children with severe learning disabilities, with self- 
harming forms estimated to be 5-6 times less common. More than 
80% of complex motor stereotypies begin before age 2; one-third of 
cases improve or resolve, in most cases during the first year, and 60% 
stabilize, although often persisting at least through the teenage years. 
If the behaviour persists through childhood, it is more likely that it 
will become a substantial problem with age. Overall, there is a male 
predominance in complex motor stereotypies, in part confounded 
by the different prevalence of underlying diagnoses between the two 
genders. 


Box 37.4 DSM-5 criteria for stereotypic motor disorder 


e Repetitive, seemingly driven, and apparently purposeless motor be- 
haviour (for example, handshaking or waving, body rocking, head 
banging, self-biting, or hitting one’s own body). 

e The repetitive motor behaviour interferes with social, academic, or 

other activities and may result in self-injury. 

e Onset in the early developmental period. 

e The repetitive motor behaviour is not attributable to the physiologic 

effects of a substance or neurologic condition and cannot be better 

explained by another neurodevelopmental or mental disorder (for 
example, trichotillomania or OCD). 











The following specifiers are used: 


self-injurious behaviour (or behaviour that would be self- 
njurious if not prevented). 
out self-injurious behaviour. 














Whether the stereotypy is associated with a known medical or genetic 
condition, developmental disorder, or environmental factor is also spe- 
cified, and an additional code is used to identify the associated element. 
Severity is specified as follows: 








e Mild—symptoms are easily suppressed by sensory stimulus or 

distraction. 

e Moderate—symptoms necessitate explicit protective measures and 

behavioural modification. 

e Severe—continuous monitoring and protective measures are re- 
quired to prevent serious injury. 














Source: data from the Diagnostic and Statistical Manual of Mental Disorders, Fifth 
Edition, DSM-5, Copyright (2013), American Psychiatric Association. 











The main causes ofsecondarystereotypies are neurodevelopmental 
disorders including severe learning disabilities and autistic spec- 
trum disorders (Fig. 37.4). Therefore, identification of intellectual 
disability or developmental delay is crucial in the differential diag- 
nosis of stereotypies. Some forms are associated with chromosomal 
aberrations (for example, Down's syndrome, Williams syndrome, 
fragile X syndrome, cri-du-chat syndrome) or with Smith-Magenis 
syndrome. Deprivation stereotypies are permanent and observed in 
patients with congenital sensory deprivation. 

Another important genetic cause of stereotypies is Rett’s syn- 
drome, an X-linked, neurodegenerative illness that typically pre- 
sents with stereotypies in the developmental period [33]. Rett’s 
syndrome is caused by mutations in the MECP2 gene, which codes 
for a protein that acts as a transcriptional repressor binding to 
methylated DNA. Whereas stereotypies in ASD are more intermit- 
tent, randomly distributed with preference for distal extremities, 
and associated frequently with visual inspection and flickering of 
the fingers, the stereotypies in Rett’s syndrome are more continuous, 
predominantly located in the hands, mouth, and axial districts, and 
commonly involve objects. 

Genetically inherited neurometabolic causes of stereotypies in- 
clude, among others, phenylketonuria and adenylosuccinate lyase 
deficiency. Among acquired causes of stereotypies, autoimmune 
encephalopathies such as anti- NMDA receptor antibody-mediated 
encephalitis have been reported and may be triggered by emotional 
stimuli. These stereotypies are typically located to the upper limbs 
and bucco-linguo-facial regions, and sometimes may be difficult to 
differentiate from chorea. When rarer forms of repetitive actions are 
present, epileptic automatisms should be considered. 
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Fig. 37.4 Aetiology of motor stereotypies. 


Adapted from Martino D, Espay AJ, Fasano A, et al., Unvoluntary Motor Behaviours. In: Chapter 3: Disorders of Movement, pp. 97-153, Copyright (2016), with permission 


from Springer-Verlag. 


Pathogenesis 


Abnormalities of dopaminergic transmission in the basal ganglia are 
involved in the generation of stereotypies. Systemic or intrastriatal 
infusion of amphetamines and dopamine agonists in rats may cause 
highly repetitive behaviours (sniffing, gnawing, licking, biting), 
which resemble stereotypies in humans, and can be antagonized 
by dopamine D1 receptor blockers. Amphetamines and other 
psychostimulants, particularly cocaine, potentiate dopaminergic re- 
lease and transmission and are known to be able to cause stereotypies 
in the developmental period. Tardive stereotypies are very rare in the 
developmental period. In general, specific movements involving the 
arms (rhythmical extensions of the elbow), trunk, and pelvis, such as 
‘copulatory dyskinesias, may be considered stereotypies [34]. 


Management 


Overall, a functional behavioural assessment by a psychologist may 
be helpful to determine the impact of stereotypies on the youngster’s 
routine daily life and can aid in identifying the types of activities that 
may occur jointly with, or exacerbate, stereotypies. Family members 
and significant others will urge to seek specialist attention when they 
recognize typical stereotyped behaviours. The aetiologic diagnosis 
relies primarily on a complete medical history and physical exam- 
ination, complemented by a series of instrumental tests, primarily 
biochemical or genetic. 

Isolated primary stereotypies can be regarded as developmental 
variants of physiological habits that often do not require specific 
treatment, apart from appropriate psychoeducation. The main aim 


of psychoeducational interventions in motor stereotypies is to di- 
minish parental anxiety and correct inaccurate beliefs and per- 
ceptions in order to avoid unnecessary distress or stigmatization 
related to these motor behaviours. The management of stereotypies, 
as for other developmental motor disorders, should be tailored to 
the characteristics of the patient, in particular his/her age, behav- 
ioural comorbidities, and the aetiology of the stereotypies. Similar to 
standard practice in tic disorders, motor stereotypies require an ac- 
tive intervention when, after accurate psychopathological screening, 
these cause social isolation or stigma, predispose to physical injury 
directly associated with the self-injurious character of the move- 
ments, and when these generate substantial familial distress. In 
these cases, referral to a child psychologist or psychiatrist or to a 
developmental-behavioural paediatrician is advisable. 

The mainstay of active treatment of common or primary com- 
plex motor stereotypies (Fig. 37.5) is behavioural therapy [35]. This 
includes two components: intervention on the context of environ- 
mental surroundings, and direct intervention on the patient. The 
principle of intervening on the environmental context is to adjust 
the surroundings in order to minimize the risks of injury and the 
vulnerability associated with more violent or erratic movements. 
Environmental interventions may target antecedents and conse- 
quences. Antecedent-based strategies have originally included 
providing visual or verbal cues to forewarn the patient of a change 
in the activity, engaging in calming (for example, taking a nap) or 
highly preferred activities prior to a difficult or less preferred ac- 
tivity. Physical exercise has also been used. Other examples of 
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“If behavioural approaches — 
unfeasible or ineffective 








Fig. 37.5 Decision tree for the treatment of stereotypies. 


Adapted from Martino D, Espay AJ, Fasano A, et al., Unvoluntary Motor Behaviours. 
In: Chapter 3: Disorders of Movement, pp. 97-153, Copyright (2016), with 
permission from Springer-Verlag. 


antecedent-based strategies include: the removal of stimuli trig- 
gering stereotypies, as, for example, using gloves or adhesive plasters 
to prevent thumb sucking; contingency manipulation, for example 
using bitter- or aversive-tasting substances to prevent nail biting or 
thumb sucking; and non-contingent stimulation or environmental 
enrichment, consisting of providing the patient access to appropriate, 
competing sources of reinforcement such as preferred objects. The 
safety of the environment can be increased also by protecting from 
physical consequences such as using a helmet in cases of severe head 
banging or dental occlusion splints for tooth clenching. 

Several behavioural approaches have been applied to primary 
complex motor stereotypies, whereas these usually show lower ef- 
ficacy and feasibility in secondary stereotypies. More recently, it 
was shown that behavioural treatments may retain their efficacy 
also when provided in a home-based, parent-administered fashion 
through an instructional DVD [36]. Habit reversal therapy using 
differential reinforcement has demonstrated efficacy lasting more 
than 12 months. Evidence is less strong for other approaches, 
including response interruption and redirection in secondary 
stereotypies associated with ASD, relaxation training, or response 
blocking [35]. 

Pharmacological interventions are taken into account only when 
behavioural strategies are either unfeasible or ineffective (Fig. 37.5) 
[34, 37]. Most of the data concerning drug therapies refer to sec- 
ondary stereotypies associated with learning disabilities and/or 
autism. Evidence of efficacy is overall limited. Drugs for depression, 
in particular fluoxetine and clomipramine, are the family of psycho- 
tropic agents probably the most investigated. A meta-analysis has 


documented a small, but significant, effect of SSRIs in decreasing 
stereotyped behaviours in autism. The effect of naltrexone, an opioid 
receptor competitive antagonist, in ASD was systematically re- 
viewed, showing that this drug may improve hyperactivity and irrit- 
ability, whereas its actual effect on stereotyped behaviours remains 
uncertain. Drugs for psychosis have also been explored. A meta- 
analysis of three RCTs demonstrated that risperidone may decrease 
stereotypies subscale scores in autism [38]. Aripiprazole showed 
effect in the treatment of stereotypies in autistic children, with 
better tolerability, compared to risperidone or haloperidol. Finally, 
a Cochrane review of five double-blind RCTs on self-injurious be- 
haviours in adults with learning disabilities concluded for a weak 
evidence for any of the active drugs investigated, clomipramine and 
naltrexone among others [34]. 





Developmental co-ordination disorders 


Phenomenology 


DCDs are characterized by motor skills deficits, with onset in the 
early developmental period, that are not explained by intellectual 
disability, visual impairment, or other neurologic motor problems 
such as cerebral palsy, muscular dystrophies, or degenerative dis- 
orders. The specific DSM-5 criteria for DCDs are listed in Box 37.5 
[1]. DCDs constitute a broad clinical spectrum, in which the severity 
of motor impairment and the burden of comorbidities vary consid- 
erably. Their prevalence also varies widely across different studies, 
falling within a conservative estimate range of 1.5-5% of children 
below age 10, with an additional 10% of children possibly affected 
by minor forms of DCDs. A 1.7:1 male-to-female ratio has been re- 
ported in a study on more than 7000 British children, but no well- 
documented interracial or inter-ethnic differences [39]. 

In line with the introductory considerations at the beginning of 
this chapter, the pathogenesis of DCDs can be related to abnormal 
maturation and/or integration of the different building blocks of 
motor functioning, for example muscular tone and strength, gross 


Box 37.5 DSM-5 criteria for developmental 


co-ordination disorder 





e Acquisition and execution of co-ordinated motor skills are below 
what would be expected at a given chronologic age and opportunity 
for skill learning and use; difficulties are manifested as clumsiness (for 
example, dropping or bumping into objects) and as slowness and 
inaccuracy of performance of motor skills (for example, catching an 
object, using scissors, handwriting, riding a bike, or participating in 
sports). 

e The motor skills deficit significantly or persistently interferes with ac- 
tivities of daily living appropriate to the chronologic age (for example, 
self-care and self-maintenance) and impacts academic/school prod- 
uctivity, prevocational and vocational activities, leisure, and play. 

e Onset in the early developmental period. 

e The motor skills deficits cannot be better explained by intellectual 
disability or visual impairment and are not attributable to a neuro- 
logic condition affecting movement (for example, cerebral palsy, 
muscular dystrophy, or a degenerative disorder). 











Source: data from the Diagnostic and Statistical Manual of Mental Disorders, Fifth 
Edition, DSM-5, Copyright (2013), American Psychiatric Association. 











and fine motor skills, planning and kinematic monitoring of move- 
ment, and sensory-motor integration. These functional blocks 
depend on the normal maturation and connectivity of integrated 
pathways, involving the motor cortex, cerebellum, and sensory sys- 
tems (proprioceptive, visual, vestibular). Different forms of DCDs 
may differ, according to which of the core processes of motor devel- 
opment is primarily dysfunctional, although subtyping of DCDs is 
still under investigation. Some disorders referring to specific core 
neuromotor developmental processes, such as ‘sensory integration 
disorder, might, in fact, represent subtypes of DCDs, rather than 
different conditions. 


Pathogenesis 


Like other developmental motor disorders, DCDs are also be- 
lieved to result from alterations in the global organization of brain 
networks, which is demonstrated by structural and functional dif- 
ferences in sensory-motor tract connectivity between DCDs and 
typically developing children [40]. DCDs have a multifactorial aeti- 
ology, involving genetic, pre-/perinatal, and environmental factors. 
The heritability of DCDs has been estimated to be between 0.47 and 
0.69, with a likely polygenic architecture of the genetic contribution 
to their pathogenesis. Intrauterine exposure to alcohol, and possibly 
to stimulants like cocaine or methamphetamine, is an important 
risk factor for the development of DCDs, as the overlap between 
DCDs and fetal alcohol spectrum disorder suggests [41]. Likewise, 
extremely pre-term infants (particularly those born at <37 weeks 
of gestational age) have a substantially higher risk for developing 
DCDs, compared to infants born at term [39]. Postnatal exposure, 
including exposure to lead, manganese, and iron deficiency, may 
also be contributing factors. 

Several comorbidities may be associated with DCDs. Common 
problems that aggravate motor co-ordination difficulties in chil- 
dren include problems with attention and concentration such as 
those in ADHD [39]. Children with ASD have also been reported 
to exhibit frequently ‘motor clumsiness, although there is little 
overlap in connectivity abnormalities between these two groups 
of disorders [42, 43]. Many children with motor difficulties have 
language difficulties, including stuttering and orofacial dyspraxia, 
and may have problems in written expression and other learning 
disabilities. 
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Assessment 


The diagnosis of DCDs is based on an accurate history and phys- 
ical examination of the fine and gross motricity of the child. 
Presentation varies with age, and therefore, information on spe- 
cific motor co-ordination problems should be sought across the 
different stages of the neurodevelopmental period. Table 37.2 
summarizes the main manifestations of motor co-ordination 
abnormalities. 

The physical examination of the child in the context of DCDs can 
be divided into two main parts: (1) the observation of motor co- 
ordination within daily activities, including playing; and (2) dedi- 
cated manoeuvres aiming at the assessment of specific motor tasks 
and abilities. The first part includes assessment of routines associ- 
ated with academic activities or activities of daily living, for example 
cutting with scissors, tying shoes, drawing or colouring, throwing 
or kicking a ball, putting on or taking off a coat or a pair of trou- 
sers. The second part of the assessment may involve different man- 
oeuvres that have not, to date, been operationalized in a universally 
accepted assessment protocol. Examples of these manoeuvres are 
performing sequential finger tapping in search of sequential errors 
and adventitious or mirror movements; finger-to-nose or finger- 
chase tasks; testing energy investment in a given sequential motor 
task (for example, drawing, writing, etc.) when sustained for 2-3 
minutes; searching for ‘motor clumsiness, indicating visuospatial 
deficits; testing the accuracy of fine motor tasks, for example activ- 
ities involving small or miniature toys (miniature domestic utensils, 
small blocks for building activities, etc.); and testing co-ordination 
in gross motricity typically associated with sports or physical play 
activities. 

Overall, children with DCDs typically underperform in many ac- 
tivities of daily living, when compared to typically developing chil- 
dren of the same age. Interestingly, limited physical activity levels 
resulting from DCDs may increase the risk for cardiovascular dis- 
ease and predispose to obesity [44], as well as predispose to reduced 
self-esteem and problems with peer relation and depression, in add- 
ition to the learning difficulties that may occur in comorbidity with 
DCDs [45-47]. In addition, children with DCDs may exhibit a range 
of gait abnormalities and an increased risk of falls, associated with 
altered control of the centre of mass and a diminished limit of sta- 
bility in the backward direction [48, 49]. 


Table 37.2 Main motor manifestations associated with developmental co-ordination disorders throughout the developmental period 


Stage of development Manifestations potentially associated with DCDs 


First year of life 


Hypertonic reactions to auditory or visual stimuli 

Persistence of primitive (Moro, plantar, rooting) reflexes 

Delayed ability to roll over or sit without head-lag; tendency to slip through the examiner's grasp 
Ability to self-correct posture and sit unassisted by 9 months of age 

Persisting Babinski sign at the end of the first year of life 

Persisting crossed-adductor reflexes and ankle clonus 


Second and third years of life e Difficulty in making a pincer grasp (that is, picking up a small object with the index and thumb) 
e Inability to furniture-walk by 18 months of age 
e Refusal of foods requiring greater chewing ability by the end of the third year of life 


Pre-school and school years 








Difficulty in pencil grasp 
nco-ordination in playing with a ball 





Obvious delay in ability to jump and hop 
Complete lack of, or premature, hand dominance (although true ambidexterity is possible) 


nco-ordination in other fine motor tasks such as tying shoes, drawing, colouring, etc. 
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A normative functional skills assessment may be used to evaluate 
the breadth and severity of the DCD. This relies on commonly used 
tests of motor impairment, which include: the Bruininks—Oseretsky 
Test of Motor Proficiency (BOTMP), suitable for youth aged 4- 
21years and frequently administered by therapists to monitor the 
efficacy of a treatment course; the Movement Assessment Battery 
for Children (MABC), which evaluates comprehensively manual 
dexterity, static and dynamic balance, and the functional inter- 
actions between the child and the surrounding environment; and 
the Test for Gross Motor Development, second edition (TGMD-2), 
used in children aged 3-10 years and focused on locomotor and 
object control abilities. Additional commonly used instruments 
for a broader neurodevelopmental evaluation are, among others, 
the Touwen Test for children with minor neurologic dysfunction 
and the Physical and Neurological Examination for Soft Signs 
(PANESS). 

The diagnostic workup of DCDs should also include investiga- 
tions aiming to rule out alternative causes for motor skills deficits. 
Among these, laboratory and neurophysiological tests to rule out 
muscle and peripheral nerve disorders and screening for metal 
toxicity and iron deficiency and metabolic abnormalities, such as 
thyroid dysfunction, may be warranted. Brain imaging and gen- 
etic screening for ataxias, myopathies, peripheral neuropathies, 
and metabolic syndromes could be necessary in specific cases, and 
neurological consultation is advised. 


Management 


The large inter-individual variability of adaptive functioning prob- 
lems generated by DCDs requires a treatment approach that is 
highly personalized. A recent meta-analysis has identified strong 
effects in improving motor performance for task-oriented interven- 
tions and physical or occupational therapy, compared to process- 
oriented therapies, with insufficient evidence in support of chemical 
supplementation [50]. 

Physiotherapists and occupational therapists administer task- 
oriented interventions, often in collaboration with supporting 
figures like teachers and the direct engagement of parents. Task- 
oriented interventions exploit a top-down approach, employing 
specific techniques that target difficulties with specific motor 
challenges, for example catching a ball, performing fine digital 
tasks, handwriting, etc. Cognitive motor intervention is an ex- 
ample of these task-oriented treatments. It is based on designing 
a set of exercises that children should practise, with the assistance 
and supervision of parents, until the motor skills are mastered. 
The possibility of measuring objectively the treatment goal repre- 
sents an important advantage of this approach. Cognitive motor 
intervention builds substantially also on motivational and cog- 
nitive reinforcers as crucial drivers of motor learning and aims 
at generating a schema that could be reused later also in other 
situations [51]. 

An alternative (bottom-up) approach that has been used by 
physiotherapists and occupational therapists is based on the as- 
sumption that general mechanisms like dysfunctional sensory in- 
tegration or abnormal kinaesthetic perception subdue the motor 
co-ordination difficulties. This approach modulates the exposure to 
external stimuli (visual, auditory, tactile, proprioceptive) that influ- 
ence motor performance in children with DCDs, who may be either 


hypersensitive or hyposensitive to endogenous and exogenous 
stimulation. It also exploits techniques that foster self-regulatory 
mechanisms of sensory-motor integration. The efficacy of sensory 
integration therapies could be higher in combination with occupa- 
tional therapy, although a recent meta-analysis has demonstrated a 
weak effect in improving motor performance, and doubts have been 
casted on their overall value [50, 52]. 

More recently, specific programmes aiming at improving balance 
strategies in DCDs have been tested in RCTs. Task-specific balance 
training approaches, such as functional movement training and 
a specific adaptation of the latter—functional movement-power 
training, showed efficacy in enhancing balance strategies and 
neuromuscular performance in children with DCDs [53]. The effi- 
cacy of other treatment approaches, including kinaesthetic or visual 
training and neurodevelopmental treatment, has a weak or insuff- 
cient evidence base. 

Finally, there are no pharmacologic approaches that showed 
consistent efficacy in improving motor performance in DCDs. 
Methylphenidate showed a moderate effect size in three studies 
and represents the most explored medication in this context [50]. 
Other pharmacologic strategies may help specific comorbidities 
that affect motor co-ordination; for example, propranolol or other 
beta-blockers may ameliorate severe hand tremor when this is ag- 
gravating co-ordination problems. 
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Dementia has become the most prevalent disease burden of our time. 
In the UK alone, there are currently over 850,000 sufferers, a figure 
that is predicted to rise to 1 million by 2025 and 2 million by 2050. 
This presents an enormous societal and economic burden (cur- 
rently £26 billion per annum). Globally, the figures are even more 
staggering with around 47 million sufferers and an associated eco- 
nomic cost of close to $1 trillion (https://www.alzheimers.org.uk/ 
download/downloads/id/2323/dementia_uk_update.pdf;  https:// 
www.alz.co.uk/research/WorldAlzheimerReport2015.pdf). 
Neurodegenerative diseases are progressive, age-related multifac- 
torial diseases with increasing prevalence and no cure or lasting 
symptomatic therapy. The need to understand the cellular and mo- 
lecular mechanisms that underpin dementia has never been more 
urgent. 

There are many causes of dementia, but most are due to 
neurodegenerative diseases (NDDs), principally Alzheimer’s dis- 
ease (AD), Parkinsons disease (PD), Huntington’s disease (HD), 
and fronto-temporal dementia (FTD). Historically, these diseases 
have been studied in isolation as separate entities, but recently a 
consensus has arisen that, despite arising from distinct aetiologies, 
NDDs share many common underlying mechanisms that can be ex- 
ploited to identify novel therapeutic targets. In broad strokes, these 
common cellular mechanisms include misfolded protein aggrega- 
tion, endoplasmic reticulum (ER) stress, mitochondrial dysfunc- 
tion, synaptic dysfunction, cellular transport, and inflammation. 
These cellular functions share overlapping molecular aetiologies, 
so, for example, cellular stressors, such as a-synuclein or B-amyloid 
(AB), may give rise to misfolded proteins that, in turn, generate 
ER stress and disrupt cellular transport, resulting in disruption 
to cellular trafficking and mitochondrial and synaptic function. 
Accordingly, discovering the underlying molecular players, their 
mechanisms, and how their interactions disrupt cellular function, 
leading to neuronal dysfunction and subsequent loss, has become a 
major goal in understanding the molecular and cellular aetiology of 
NDDs. One caveat to bear in mind is that even if the ultimate symp- 
toms of NDDs primarily reflect loss of synaptic function, neuronal 





connectivity, and neuronal death, nevertheless disease pathogenesis 
takes place in a complex cellular milieu and requires interactions 
among astrocytes, microglia, and the cerebral vasculature. Indeed 
the ‘neurovascular unit’ has been proposed as a term to encompass 
this complexity. Although this parsimony of common mechanism is 
an attractive concept, it is equally important to recognize that each 
NDD does have its own distinct signature, so we need to be cog- 
nizant of both disease-specific and disease-common mechanisms. 
Here, we focus on mechanisms common among NDDs, but the fol- 
lowing chapters in this book complement this approach and adopt a 
disease-specific focus. 


Pathology and genetics 


At first sight, the pathologies of NDDs such as AD and PD appear 
distinct. On the one hand, AD is associated with progressive deficits 
in memory, cognition, and behaviour, while, on the other hand, PD is 
primarily considered a dysfunction of motor control. However, 30% 
of AD sufferers develop PD, and a similar proportion of PD patients 
develop dementia. This overlap extends to the primary molecular 
pathology of both diseases; AD is characterized by extracellular de- 
posits of AB plaques and intracellular accumulations of tau neuro- 
fibrillary tangles distributed throughout the forebrain, whereas PD 
is characterized by accumulation of a-synuclein in Lewy bodies 
(LBs) within the dopaminergic neurons of the midbrain. However, 
LBs are not restricted to PD and are also found in the amygdala 
of over half of patients diagnosed with familial or sporadic AD, as 
well as those suffering from dementia with Lewy bodies (DLB) [1]. 
Similarly, neuronal inclusions of TDP43 (TAR-DNA binding pro- 
tein 43) are common to AD, FTD, and amyotrophic lateral sclerosis 
(ALS) [2-4], while tau (encoded by the MAPT gene) neurofibril- 
lary tangles are present in the brains of AD, DLB, and FTD [5]. 
Further evidence of common aetiology among NDDs is provided 
by the overlap of the underlying genetics of NDDs. An example is 
provided by the microtubule-associated protein (MAPT) H1 haplo- 
type that is associated with both AD and PD, with the suggestion 
that these MAPT variants increase the risk of PD by a crossover of 
the mechanisms governing aggregations in LBs and neurofibrillary 
tangles [6]. More recently, several loss-of-function variants in the 
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ATP-binding cassette transporter A7 (ABCA7) (required for protein 
aggregate clearance) have been associated with both AD and PD [7]. 
Common genetic association goes beyond AD and PD; a notable 
example is C9orf72 (chromosome 9 open reading frame 72) bearing 
hexanucleotide GGGGCC repeat expansions as a causative gene in 
both FTD and ALS [8, 9] and even more prevalently in FTD-ALS 






PICALM 


[10]. Further common genetic causality of multiple NDDs is evi- 
denced by mutations in Fused in Sarcoma (FUS), an RNA-binding 
protein, valosin-containing protein (VCP), and Ubiquitin 2 
(UBQLN2), all of which are associated with a minority of ALS and 
FTD cases [11-14]. The commonality of function of many risk genes 
associated with AD and other NDDs can be seen in Fig. 38.1. 
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Proteostasis 


The realization that a neurodegenerative phenotype, very similar 
to the corresponding sporadic disease, can result from single geno- 
types has led to the identification of the key effector proteins in 
these diseases and a common theme that may explain their role in 
the progression of diverse pathologies, that is, the idea that specific 
proteins accumulate within or outside neurons and misfold into 
toxic conformers, which may also spread to distant brain regions 
within interconnected neuronal networks. Several genes that cause 
monogenic AD, such as amyloid beta precursor protein (APP), 
presenilin1 (PSEN1), and presenilin2 (PSEN2), as well as variants in 
the genome, such as phosphatidylinositol binding clathrin assembly 
protein (PICALM), have mapped out the pathway of Af clearance 
and suggested that increased production and deposition of AB,_,, in 
the brain of patients with sporadic AD may be the initiating patho- 
genic event [15]. Similarly, tau mutations were identified in familial 
cases of FTD, and tau fibrils in neurofibrillary tangles is a cardinal 
feature of sporadic AD. a-synuclein mutations or multiplications are 
rare causes of PD that share the same pathology with sporadic PD, 
namely the misfolding of a-synuclein into LBs. 

These pathological changes are thought to predate the clinical 
presentation by several years. For example, studies of a familial AD 
cohort (the Dominantly Inherited Alzheimer Network [DIAN]]) sug- 
gest that AB,_,, levels in the CSF begin to decline as early as 25 years 
before the onset of symptoms [16]. This is followed by the appear- 
ance of fibrillar amyloid deposits in the brain detected by Pittsburgh 
compound B (PiB)-PET, increased levels of tau in the CSE, and pro- 
gressive brain atrophy roughly 15 years before clinical presentation 
[16]. Cerebral hypometabolism and subtle episodic verbal memory 
impairment seem to begin about 10 years before overt dementia 
[16]. This time course may be generally similar to that of sporadic 
AD, based on cross-sectional studies [17], suggesting that detectable 
biochemical and histopathological abnormalities occur at least two 
decades before clinical symptoms. 

Fibrillar assemblies of AB,_,,, tau, and a-synuclein also exert 
non-cell autonomous effects, which are reminiscent of ‘prion-like’ 


phenomena. The strongest evidence that this may occur in the human 
brain comes from the identification of LB pathology in embryonic 
neural grafts 12-16 years after transplantation into the brain of 
people with PD [18, 19]. Direct inoculation of human brain extracts 
or human brain-extracted fibrils from patients with LB pathology 
or multi-system atrophy led to progressive a-synuclein aggregation 
and neurodegeneration in connected areas of the brain in animal 
models, including non-human primates [20-22]. The propagation 
of a-synuclein in mouse brain has also been demonstrated with re- 
combinant fibrils, but cell loss in this model was not always seen [20, 
23, 24]. Similarly, tau pathology without neurodegeneration was ob- 
served in wild-type or transgenic mice expressing human tau, after 
seeding with fibrils extracted from human brain or mouse brain ex- 
pressing the pathogenic P301S tau mutation [25, 26]. Intracerebral 
infusion of dilute AB-rich brain extracts from AD patients or from 
aged APP-transgenic mice also stimulated the premature formation 
of plaques and amyloid angiopathy in these models [27]. Spreading 
of a-synuclein and tau aggregates from the periphery to the brain 
has also been demonstrated [25, 26, 28]. 

Progress in identifying pathogenic proteins and their mode of 
propagation opened up the possibility for targeted therapies aimed 
at preventing their spread or promoting the clearance of their 
misfolded conformers. However, the most widely tested therapeutic 
approach to date—the use of active or passive immunotherapy—has 
not shown adequate clinical efficacy, even though there was evidence 
in some trials that such approaches promote the clearance of AB,_,, 
[15]. A similarly poor response was detected in a larger trial that 
examined the effect of AB,_,. immunotherapy in a group of patients 
with mild cognitive impairment. Whether such therapies need to 
be given at the pre-symptomatic phase of the disease to be effective 
is currently being investigated in around 300 pre-symptomatic 
members of a large Colombian pedigree with the PSEN1 E280 > 
A280 missense mutation and a smaller number of pre-symptomatic 
American participants from the DIAN cohort who carry other 
presenilin mutations. In contrast to the disappointing results in hu- 
mans, immunotherapies in preclinical models effectively reduced 
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the burden of pathological aggregates. The reason for this discrep- 
ancy is currently unclear but suggests that the complex cellular states 
of the human diseased brain beyond the culprit protein need to be 
considered. 

One limitation of targeted therapies against misfolded proteins, 
as exemplified by immunotherapies, is the assumption that during 
‘degenerative’ cellular states in the sporadic forms of these diseases, 
the neuronal mechanisms that normally handle misfolded proteins 
will rapidly recover and respond adequately to further protein ag- 
gregation, which in vivo may actually occur over a period of hours 
[29]. Age-related deficiencies in protein homeostasis (proteostasis) 
could contribute to the accumulation of aggregating proteins [30]. 
For example in AD or PD brains, there is upregulation of chaperones 
and accumulation of proteasomal and autophagic components [31]. 

Accumulation of misfolded proteins typically triggers the activa- 
tion of chaperones, which either attempt to refold proteins or help re- 
direct non-native conformers towards degradation by proteasomes 
or lysosomes. Proteasomes are large multi-subunit complexes that 
consist of a 19S regulatory cap and a 20S proteolytic core. The 19S 
regulatory particle recognizes ubiquitinated substrates, removes 
ubiquitin chains, and unfolds the substrate to allow entry into the 
20S core where it is rapidly degraded into peptides [32]. Although 
the proteasome is the primary source of protein degradation in the 
cell, restricted entry into the proteolytic chamber of the 20S compo- 
nent does not permit the degradation of misfolded or large protein 
complexes. One way to bypass this limitation is to employ chaperone 
complexes, such as the BAG1-HSP70 or cytosolic VCP/p97, that re- 
trieve misfolded proteins from aggregates and then direct them to 
the proteasome. Alternatively, larger aggregates can be directed en 
masse to the lysosome via autophagy. Autophagy complements the 
proteasome in three forms: macroautophagy, chaperone-mediated 
autophagy (CMA), and microautophagy. Macroautophagy is the 
best-understood form and entails the sequestration of organelles 
or aggregates into a double-membrane structure known as the 
autophagosome. ‘The resulting autophagosome is then transported 
to, and fuses with, the lysosome, thereby delivering its cargo for deg- 
radation. In contrast, microautophagy occurs by direct engulfment 
of the cytosol at the lysosome membrane, and CMA occurs through 
HSC70-mediated delivery of proteins across the lysosomal mem- 
brane via the LAMP2A receptor [33]. 

Selective targeting of misfolded proteins to the proteasome or 
lysosome is initiated by the addition of polyubiquitin chains in a 
three-step enzymatic process involving E1 ubiquitin-activating en- 
zymes, E2 ubiquitin-conjugating enzymes (E2s), and E3 ubiquitin 
ligases (E3s). There are more than 650 E3s and 30 E2s that regu- 
late the degradation of protein substrates, making this system highly 
specialized. Ubiquitin chains are formed through conjugation of 
ubiquitin monomers to protein substrates via distinct lysine res- 
idues; K48- and K11-linked ubiquitin chains are the main linkages 
that direct substrates to the proteasome, whereas K63-linked chains 
mediate trafficking to the lysosome. This is achieved by the action of 
adaptor complexes, which recognize the ubiquitin chain and direct 
the ubiquitinated proteins to the relevant pathway. Deubiquitinating 
enzymes (DUBs) remove ubiquitin chains and may determine com- 
mitment to degradation or trim ubiquitin chains to fine-tune the 
degradation process. The relevance of ubiquitin-dependent quality 
control is well established in PD, especially in a subgroup of familial 
cases that are caused by mutations in the E3 ligases Parkin or FBXO7. 


Ubiquitin signalling also regulates a-synuclein degradation and 
counteracts its proteotoxicity. For example, the E3 ligase CHIP that 
normally eliminates cytosolic misfolded proteins was implicated in 
the clearance of oligomeric a-synuclein [34]. The E3 ligase NEDD4 
exerts a protective effect in diverse models of a-synuclein toxicity, 
at least in part, by directly promoting the ubiquitination and lyso- 
somal degradation of a-synuclein [35, 36]. Interestingly, a small 
molecule activator of NEDD4 protects induced pluripotent stem 
cell (iPSC)-derived dopaminergic neurons expressing the patho- 
genic A53T mutation [37]. Ubiquitination in LBs is significantly re- 
duced in the pigmented neurons of the substantia nigra, compared 
to other brain regions, and this pattern inversely correlates with the 
upregulation of the deubiquitinase USP8 [38]. USP8 deubiquitinates 
a-synuclein and opposes its lysosomal degradation in vitro, whereas 
USP8 knockdown in fly models prevents a-synuclein accumulation 
and toxicity [38]. The role of Parkin in mitophagy is discussed in 
Mitochondrial dysfunction, p. 377. 


Endoplasmic reticulum stress 


Defects in protein trafficking between organelles, especially the 
ER to Golgi and endosomes to lysosomes, are also critical in 
neurodegeneration. For example, mutations in VPS35, a com- 
ponent of the retromer complex which functions in endosomal 
protein sorting cause late-onset forms of familial PD and 
AD [39]. Heterozygous mutations in the lysosomal enzyme 
glucocerebrosidase (GBA), which, when biallelic, cause Gaucher's 
disease, are the most common risk factors for PD; and muta- 
tions in the kinase LRRK2, which is the most common form of 
autosomal dominant PD with LB pathology impair endosomal 
trafficking and autophagy [39]. The significance of these path- 
ways for the viability of vulnerable neuronal subpopulations in 
PD and AD is reinforced by genome-wide analysis of variants 
in common sporadic forms of these diseases (for example, GAK 
and RAB7L in PD and PICALM in AD) that function in these 
pathways. Misfolded proteins accumulating in the ER lumen are 
normally recognized by ER membrane-associated complexes and 
retrotranslocated to the cytoplasm by the AAA ATPase VCP/p97- 
NPL4-UFD-1 complex. The retrotranslocated protein is then 
ubiquitinated and delivered to the proteasome for degradation, 
thereby eliminating terminally misfolded proteins from the ER. 
In neurodegenerative diseases, ER stress due to accumulation of 
misfolded proteins in the ER lumen activates the unfolded pro- 
tein response (UPR), a protective cellular response that aims to 
reduce unfolded protein load and restore protein-folding homeo- 
stasis. The UPR has three arms, which initiate signalling cas- 
cades through protein kinase RNA (PKR)-like ER kinase (PERK), 
inositol-requiring enzyme 1 (IRE1), and activating transcription 
factor 6 (ATF6). Paradoxically, persistent activation of the PERK- 
eIF2a branch of the UPR is detrimental, as it causes uncom- 
pensated decline in global translation rates, leading to synaptic 
failure and neuronal death [40]. Restoring translation by targeting 
PERK or more selectively elF2a, using repurposed drugs such as 
trazodone, was neuroprotective and enhanced memory in animal 
models of prion disease and tauopathy despite the accumulation 
of misfolded proteins [41]. In this respect, specialized enzyme 
complexes that function in protein folding, trafficking, or degrad- 
ation are emerging at the frontier of experimental therapeutics in 
neurodegenerative diseases. 
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Mitochondrial dysfunction 


Mitochondria are responsible for ATP generation, intracellular cal- 
cium (Ca’*) homeostasis, reactive oxygen species (ROS) formation, 
and apoptosis. Neurons are exquisitely sensitive to mitochondrial 
dysfunction because of their high energy demands, and it is not sur- 
prising that many NDDs are linked to mitochondrial dysfunction. 
Numerous studies have shown mitochondrial structural changes that 
precede pathology in AD brain [42] and expression of B-amyloid leads 
to loss of activity of several key mitochondrial enzymes involved in 
the tricarboxylic acid (TCA) cycle or the electron transport chain, 
including the a-ketoglutarate dehydrogenase complex, the pyruvate 
dehydrogenase complex, and cytochrome oxidase [43]. 

Early evidence for the role of mitochondrial dysfunction in 
dopaminergic neurons came from observations in methyl-4- 
phenyl-1,2,3,6-tetrahydropyridine (MPTP) users who devel- 
oped parkinsonism. Subsequent studies showed that MPTP 
is rapidly taken up by astrocytes, metabolized to 1-methyl-4- 
phenylpyridinium (MPP+), which is taken by dopaminergic 
neurons and inhibits complex I in the electron transport chain, 
thereby inhibiting ATP production, decreasing intracellular Ca”*- 
buffering capacity, and increasing ROS production, all of which lead 
to neurotoxicity. Complex I activity is also lowered in PD brain, and 
several genes linked to PD are all linked to mitochondrial function, 
including SNCA (PARK1/4), Parkin (PARK2), PINK1 (PARK6), 
PARK7, LRRK2 (PARK8), and HTRA2 (PARK13) [44]. Dissipation 
of the mitochondrial membrane potential induces translocation of 
the kinase pink1 to mitochondria and activation of Parkin through 
phosphorylation of ubiquitin and the ubiquitin-like domain of 
Parkin. This cascade initiates the degradation of mitochondrial pro- 
teins by the proteasome in a p97-dependent fashion or mitophagy, a 
process opposed by the mitochondrial deubiquitinase USP30 [45]. 

In HD brain, the activities of Complex I, II, and II are all inhibited. 
HD is a monogenic disorder, so the primacy of mutant huntingtin 
(HTT) in disease aetiology is beyond dispute, but still the mechanism 
ofaction remains unclear. There is evidence of direct interaction of mu- 
tant HTT with mitochondria, resulting in depolarization [46]; again 
these mitochondrial changes preceded overt disease symptomology. 
In addition, it is likely that some of the mutant HTT interaction with 
mitochondria is mediated indirectly via dysregulation of key tran- 
scription factors such as p53 and CREB-binding protein [47]. These 
examples serve to illustrate the mitochondria as the recipient of cel- 
lular stressors, but mitochondrial dysfunction and mitophagy can ul- 
timately amplify inflammatory responses by releasing ROS into the 
cytoplasm and extracellular space—once again muddying the rela- 
tionship of cause and effect of NDD pathology. 


Common molecular pathways in NDD: REST 


In the preceding sections, we have illuminated common pathologies, 
genetics, and cellular processes that accompany all NDDs. Here, we 
explore the idea that common molecular mechanisms may also be 
at play in driving or protecting against neurodegeneration. Our ex- 
emplar is the transcriptional repressor RE1-silencing transcription 
factor (REST), a key regulatory factor that has been implicated in the 
pathogenesis of several NDDs, including HD, PD, and AD [48-51]. 
REST was discovered over 20 years ago [52], and its role was initially 
seen as that of a transcriptional silencer of differentiated neuronal 


genes in neural progenitors and non-neural tissue. Initially, it was 
thought that REST was silenced during neuronal differentiation, 
thereby allowing the expression of differentiated neuronal genes. 
However, REST’s role is now seen as much more multi-faceted, and 
we now know that REST is not silenced during neuronal differenti- 
ation but continues to be expressed at low levels in several areas of 
the adult brain, notably the cerebral cortex where it represses the 
expression of many neuronal genes [53-57]. However, the role of 
REST during neurodegeneration is controversial, since it has been 
shown to be both pro-apoptotic and neuroprotective, depending on 
circumstances. In HD, in the presence of mutant HTT, levels of nu- 
clear REST are increased, leading to repression of numerous REST 
target genes, including BDNF, an essential neurotrophic factor re- 
quired for survival of medium spiny GABAergic neurons in the 
striatum [50, 51]. REST has also been shown to be pro-apoptotic 
in hippocampal CA1 neurons subjected to ischaemic insults, re- 
sulting in increased levels of nuclear REST and enrichment at the 
miR-132 promoter, leading to silencing of miR-132 expression and 
subsequent neuronal death [58]. The role of REST in PD is less clear. 
Administration of the dopaminergic neurotoxin MPTP to REST- 
null mice led to enhanced loss of dopaminergic neurons, indicating 
that REST was normally neuroprotective [48], whereas earlier 
studies showed that MPTP increased migration of REST from the 
nucleus to the cytosol, accompanied by subsequent increase in REST 
target gene expression and increased cell death [59]. This apparent 
contradiction can be resolved by considering that, in both cases, 
REST levels at their target genes are decreased either due to ablation 
of the REST gene or migration of the REST protein from the nucleus, 
and both result in increased susceptibility to MPTP toxicity. Stated 
otherwise normal (nuclear) levels of REST are neuroprotective. This 
neuroprotective action of REST has been more directly interrogated 
in a recent study that indicates a neuroprotective role for REST in the 
ageing brain that is lost in the brain of patients with mild cognitive 
impairment and AD, FTD, and DLB [49]. Further, using SHSY5Y 
neuroblastoma cells, they showed that REST bound the promoters 
of many cell death genes, including p38 MAP kinase, FAS, FADD, 
and TRADD, as well as the presenilin 2 promoter. Moreover, REST 
was induced in both cultured mouse and human cortical neurons in 
response to challenges with AB,_,, or H,O,, and ablation of REST in 
mouse cortical neurons led to an increase in the expression of pro- 
apoptotic genes and increased neurodegeneration in response to AB 
challenge. So how can these studies be reconciled? 

WNT signalling has long been associated with AD pathogenesis, 
and numerous studies have shown that canonical wnt signalling pro- 
tects against B-amyloid toxicity [60-62]. Acute exposure of rat cortical 
neurons to Af activates non-canonical wnt signalling via the JNK/ 
PCP pathway and subsequent activation of several transcription fac- 
tors, including EGR1 and KLF10, as part of a pro-apoptotic pathway 
[63]. Furthermore, this pathway is also detectable in the brain of 
people and in animal models with amyloidopathy, but not tauopathy, 
adding weight to the evidence that this non-canonical pathway is ac- 
tivated by AB in AD. Since REST is known to be directly regulated 
by canonical wnt signalling [64, 65], then when taken together, these 
observations suggest a model whereby exposure to Af controls a 
bipartite signalling cascade, with canonical wnt signalling control- 
ling an anti-apoptotic arm, in which REST represses pro-apoptotic 
genes, and non-canonical wnt signalling controlling an antagonistic 
pro-apoptotic arm in which JNK/PCP activates pro-apoptotic genes. 
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This model supports the suggestion that high levels of REST in the 
ageing brain play a neuroprotective role and may be sufficient to pro- 
tect against dementia, even when the brain shows significant struc- 
tural pathology, including AB deposits [49]. This conditionality of 
Af-induced neurodegeneration is also reflected in the levels of AB in 
vivo, which fluctuate as a function of neuronal activity [66], diurnal 
cycle [67, 68], and ageing [67]. Importantly, diurnal variation of AB 
occurs in the brain of APPswe/PS1AE9 mice before the appearance 
of AB plaques. Furthermore, in the human brain, increases in Af also 
precede the appearance of AB plaques or any symptoms of AD. In 
other words, neurons are constantly exposed to fluctuating levels of 
AB, yet this does not immediately result in an increase in AB plaque 
formation or neuronal death. REST could thus act as a gatekeeper 
in this bipartite response to AB and could act as an essential compo- 
nent of a homeostatic response to neurodegenerative stressors such as 
Af, perhaps as an inhibitory node of a feed-forward inhibition loop, 
a widespread regulatory motif that homeostatically regulates many 
molecular pathways [69] by modulating the dynamic response of net- 
work output to a varying input, that is, they buffer a gene network 
against fluctuations in input, whether as signal or noise. This would 
allow a neuron to tolerate low Af exposure without committing to an 
irreversible apoptotic response, yet still permit apoptosis in the pres- 
ence of persistent AB exposure. Taken together, these observations 
indicate that REST occupies a unique position as a key regulator of 
neuroprotective pathways induced in the ageing brain, offering pro- 
tection against B-amyloid-induced neurodegeneration. Since REST is 
induced in response to several stressors, including Fe**, H,O,, and the 
glutathione synthesis inhibitor buthione sulfoxide [49], then this may 
provide a model whereby REST provides neuroprotection against a 
wide range of neurodegenerative stressors. More generally, this multi- 
faceted role of REST may reflect the perspective that although the mo- 
lecular function of a gene may be largely fixed, the biological process 
that this function subserves is highly context-dependent. This system's 
perspective is reflected in the different gene modules and networks in 
which a gene may be embedded, which will, in turn, depend on the 
cell type and cell state, that is, it is the network interactions and output 
that determine the ‘function’ of the gene. 


Novel cellular models of neurodegeneration 


The underlying cause of most NDDs is a complex cocktail of genetic 
and environmental risk factors, and this is nowhere clearer than in 
AD [70]. The predominant hypothesis to date has been that of the 
‘amyloid cascade; the initial evidence for which came from the obser- 
vations that autosomal dominant AD arises from mutations in either 
the amyloid $ precursor protein (APP) gene or in genes encoding 
APP secretases. In brief, the amyloid cascade hypothesis posits that 
faulty processing of APP generates neurotoxic AB fragments and 
consequent neurodegeneration and other hallmarks of AD path- 
ology, including aggregation of phosphorylated tau and synaptic loss 
[71]. However, as stated earlier, clinical trials of immunotherapies 
targeting A®,_, have failed. Although these failures may be most 
related to clinical trials being conducted too late or on participants 
without amyloid pathology, they nonetheless serve to emphasize that 
late-onset AD is likely to be complex and that a fuller understanding 
of the molecular mechanisms of the canonical amyloid cascade, as 
well as associated processes, including inflammation, for example, 


is an essential prerequisite to developing therapeutic interventions. 
This amyloid cascade hypothesis has led to the development of nu- 
merous animal models as investigative tools to interrogate disease 
mechanism and as screens to identify therapeutic targets. The ma- 
jority of these are murine and based on the introduction of cock- 
tails of mutations targeting APP, PSEN1, PSEN2, and MAPT [72]. 
However, rodent models have several severe limitations, especially 
in relation to AD. Firstly, there are numerous important intrinsic 
differences between human and rodent neurons, including marked 
differences in the transcriptome of the ageing brain [73, 74], human- 
specific transcriptional modules in AD brain [75], neuronal size 
and complexity of the dendritic tree [76], protein composition of 
the post-synaptic density [77], and speed of synaptic information 
transfer [78]. Secondly, overexpression of mouse APP does not ag- 
gregate in amyloid plaques, a striking difference between mice and 
humans. Subsequent generation of mouse transgenic lines, such as 
PDAPP and Tg2576, which bear mutant human APP genes, still fail 
to exhibit cardinal hallmarks of AD, including neurofibrillary tangle 
formation and neuronal death [79, 80]. Thirdly, more recent mouse 
models based on the 3xTg triple mutation (APP Swedish; MAPT 
P301L; PSEN1 M146V) offer some improvement, including syn- 
aptic dysfunction preceding the deposition of plaques and tangles 
[79], but only by forcing the overexpression of a set of mutations 
that do not have any pathophysiological counterpart in humans. 
Fourthly, animal amyloid modes show cognitive deficits in the ab- 
sence of discernible neuronal pathology; yet in humans, cognitive 
impairment is accompanied by neuronal pathology and death. This 
profound incongruity undermines the usefulness of animal models 
in a myriad of AD drug development programmes. Fifthly, there is 
a tacit assumption that the underlying regulatory molecular ma- 
chinery is conserved between mouse and human. Whereas this is 
true on a coarse-grained analysis, it is demonstrably untrue at a 
detailed level; comparison of the cistromes of numerous transcrip- 
tion factors shows only a small conservation of transcription factor 
binding sites (TFBS) between mouse and human [80], and further- 
more, genes associated with hominid-specific TFBS are specifically 
enriched in neurological pathways [81]. This latter issue is brought 
into sharp focus by a recent study on the transcriptional repressor 
REST, elevated levels of which are found in the normal ageing brain, 
but not in AD brain (see earlier), and which confer neuroprotection 
by repressing pro-apoptotic genes in the normal ageing brain [49]. 
However, close inspection of human and mouse REST cistromes [82] 
shows that many of the genes conferring protection in the human 
brain are REST targets only in humans, but not in mice. These dif- 
ferences in the underlying topology of mouse and human gene net- 
works may result in potentially critical species-specific gene-gene 
interactions that undermine the relevance of mouse systems as tools 
to investigate molecular mechanisms and the discovery of thera- 
peutic targets in human cells. This has led to searches for human 
cellular neuronal models that obviate these limitations and reduce 
our reliance on rodent models. Until recently, human cellular models 
were dominated by non-neuronal cell lines, such as HEK293, or 
neuroblastoma lines, such as SH-SY5Y, all of which suffer from many 
limitations, including their non-neuronal origin, aberrant cell sig- 
nalling, and chromosomal abnormalities, serious concerns that curb 
any enthusiasm for their usage. However, the twin developments of 
cellular reprogramming and gene editing have completely changed 
the experimental landscape of modelling disease using human cells. 
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Induced pluripotent stem cells 


Yamanaka’s groundbreaking discovery of reprogramming somatic 
cells [83] to become iPSCs has revolutionized much of biology. iPSCs 
have the cardinal properties of an embryonic stem cell: (1) they can 
self-renew and maintain their pluripotent state; and (2) they can dif- 
ferentiate to (almost) any lineage. Equally importantly, iPSCs obviate 
many ethical concerns over the use of primary human embryonic 
stem cells, since they do not require the destruction of human em- 
bryos. Unlike immortalized human cell lines, iPSCs are ‘normal, can 
be derived from any personal genetic background, and can be pro- 
duced by reprogramming a wide range of somatic cells, including 
readily available skin fibroblasts and hair follicle cells, making it pos- 
sible to readily make patient-specific iPSCs. These, in turn, can be 
differentiated towards multiple neuronal (and glial) types relevant 
to individual NDDs, associated with loss of specific neuronal popu- 
lations such as cortical neurons in AD, medium spiny GABAergic 
neurons of the basal ganglia in HD, and dopaminergic neurons of 
the substantia nigra in PD. iPSC neurons have many features that 
lend themselves to modelling neurodegeneration in vitro [84-86], 
including: (1) acquisition of electrical excitability, as evidenced by 
the expression of voltage-sensitive sodium (Na*) and potassium (K*) 
channels; (2) firing of action potentials; and (3) physical evidence 
of synapses on the basis of juxtaposition of pre- and post-synaptic 
markers [3]. Not only do these iPSC-derived neurons offer poten- 
tial to generate cells for replacement of damaged tissue, but they also 
offer unparalleled human cellular models of disease and discovery 
tools for therapeutic target identification and drug screening [84, 87]. 


Gene editing 


The advent of gene editing, particularly by use of CRISPR/CAS9 
[88, 89] offers the possibility of introducing, or reversing, disease- 
associated mutations. As with iPSCs, this development opens up pos- 
sibilities for repair and replacement and for generating new cellular 
models. The most widely used CRISPR/CAS9 system can be thought 
of as comprising a dual module made up of a unique genomic ad- 
dress delivery system [provided by single-guide RNAs (sgRNAs) and 
a cargo (CRISPR-associated protein 9 (CAS9)]. The sgRNA binds 
to its unique genomic address and then takes delivery of the CAS9 
cargo. CAS9, in its native form, then introduces double-stranded 
DNA breaks, which are subsequently repaired by endogenous DNA 
repair mechanisms and, as a consequence, introduces discrete mu- 
tations into the targeted locus. More recently, numerous engineered 
forms of CAS9 have been introduced that use CAS9 simply as a de- 
livery vehicle to introduce novel genome-modifying activities to 
greatly expand the repertoire of genomic and epigenomic changes, 
including: (1) reversion of specific disease-associated SNPs; (2) ex- 
erting local control of transcriptional activation or silencing; and 
(3) manipulation of specific epigenetic marks [90, 91]. The combined 
power of iPSCs and gene editing now lets us derive neurons from any 
patient-specific background, study their behaviour before and after 
making highly discrete changes to the genome or epigenome, and 
attribute any changes in behaviour to a specific genetic locus. An ex- 
ample of this power can be seen in studies of iPSCs derived from a fa- 
milial AD background. iPSC neurons derived from patients carrying 
a duplication of the APP gene have higher levels of AB,_,, increased 
tau phosphorylation, and raised levels of GSKB, relative to normal 
iPSCs, all of which are seen in AD brain [92]. Use of CRISPR/CAS9 
to introduce AD disease causing mutations in the APP gene (APP™°) 


and the PSEN1 gene (PSENI™"*°") of iPSC neurons also gave rise to 
elevations in Af, and in this latter case, use of gene editing allowed 
unambiguous attribution of the change in cellular phenotype to spe- 
cific disease mutations [93]. 


The primacy of any of the biological processes discussed here in the 
aetiology and pathogenesis of NDDs is an ongoing source of con- 
tention, and undoubtedly it varies within and among diseases. More 
likely is that pathogenesis is not a linear cascade, but any of these 
processes can act as a driver and recruit other processes into tipping 
neurons into a neurodegenerative state. Although the brain has only 
a limited capacity to replenish lost neurons, it does exhibit remark- 
able plasticity, which ensures that the myriad of its neuronal con- 
nections continues to function until a critical threshold is eventually 
reached. It is now considered likely that a prolonged period of neur- 
onal compensated dysfunction may precede cell loss. Identifying 
stages on this neurodegenerative trajectory that are reversible and 
irreversible will be key to developing future therapeutic targeting 
strategies. Similar to the approaches that have been so successful 
in cancer, a deeper understanding of the fundamental biology that 
drives neuronal degeneration, rather than a focus on a singular cel- 
lular process, is likely to lead to successful targeted therapies. 
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Delirium is a common term; it has been used to name creations as 
varied as a waltz, a book series, and an alcoholic beverage. Delirium 
is also a common condition; while no age is unaffected, elderly 
people are most often affected, especially those in long-term care 
(LTC) or admitted to hospitals. In such people, it is associated with 
very poor outcomes. While about half do not remember, those who 
do live with the anxiety evoked by their experience [1]. Delirium is 
associated with more in-hospital complications, longer stays in hos- 
pital, a greater likelihood of transfer to nursing homes, and higher 
mortality. Hospitals incur greater costs in managing delirium. 
Despite its common usage, frequency of occurrence, and impact on 
people and health care systems, delirium remains poorly diagnosed. 

Delirium has been recognized and described for two millennia 
[2]. The stability of description is remarkable. Delirium was thought 
to be an acute and transient mental disorder, associated mostly with 
febrile illnesses and alcohol. It is still considered to be a disorder of 
acute onset, but now it is recognized that its course is not always 
transient [3]. 

Research on delirium has progressed in the past couple of decades 
[4]. There is greater confidence in our ability to prevent delirium 
than in our ability to treat it [5]. The problem in translation of cur- 
rent research is held back by the consistent and widespread reports 
of persistently high rates of diagnostic error [6]. To this end, delirium 
needs to be better understood, so that we can provide better care to 
the most vulnerable older people [7], especially in the context of a 
rapidly ageing society and our increasingly refined understanding of 
frailty and dementia subtypes. 
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The key conceptual change of delirium reflects how populations 
are ageing. Something that was likely observed in adults with acute 
illnesses, intoxications, or withdrawals is now seen mainly in older 
people in hospitals and nursing homes or in adults with very se- 
vere or terminal illnesses; this age and specialty distribution is re- 
flected in a recent delirium prevalence study in a university hospital 
in Ireland [8]. 





In the first available descriptions, delirium was distinguished from 
insanity [2]. Transiency was an important contrasting feature be- 
tween delirium and insanity, with the latter seen to be a permanent 
condition [9]. Since the beginning of the nineteenth century, de- 
lirium began to be conceptualized as a disorder of consciousness, 
with ‘clouding’ coming to represent both an alteration in the level 
and fragmentation of consciousness [2]. The literature from this 
era is rich in clinically descriptive detail, but problematic in its ap- 
plication. Delirium has been seen either as not being a problem of 
consciousness or as a problem of content of consciousness [10]. The 
clinical features of delirium are such that it does not allow for it to 
be operationalized either on a binary measure, as in present or not 
present, or to be thought of as being along a continuum. The modern 
approach to the problem has been to work by trial and error (‘Table 
39.1), with the word ‘consciousness’ being finally removed in DSM 
5. The European Delirium Association and the American Delirium 
Association have jointly critiqued this change, asserting that ‘con- 
ceptualisation of delirium must extend beyond what can be assessed 
through cognitive testing (attention) and accept that altered arousal is 
fundamental [11] (2014). It might be that the essence of conscious- 
ness is not readily captured and is in need still of both a better de- 
scription and, with it, quantification. 

Attention is easier to define than consciousness, though seemingly 
hard to standardize for routine clinical examination. Impairments in 
attention have been as a reduced ability to direct, focus, shift, and 
sustain attention, which incorporates most aspects of what is cur- 
rently known about it [12]. However, there are problems in both 
addressing it in research and translating this into clinical practice, 
not the least because of overlap between the constructs of atten- 
tion, working memory, and executive control [13]. DSM-5 certainly 
achieves high inter-rater reliability [14], but when interpreted 
strictly, it may miss many cases [15], as might be expected when 
a key component is missed. In contrast, the ICD-10 rigid criteria 
achieve high specificity at the cost of low sensitivity, because of the 
requirement of a number of mandatory features [14]. 

Regardless, it is important not to limit our understanding of a 
phenomenon because of artificially imposed constraints by defin- 
itions and rating scales derived thereof. Recognizing this limitation, 
Paula Trzepacz, David Meagher, and José Franco have proposed 
their Trzepacz, Meagher, and Franco (TMF) research criteria for 
delirium, based on an analysis of data from the Delirium Rating 
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Table 39.1 Comparison of diagnostic systems 


| io DSM-III DSM-III-R DSM-IV DSM-5 


Clouding of 
consciousness 


Consciousness 


Attention 


Awareness 


Disturbances in 
cognition 


Disturbances in 
perception 


Disturbances in 


speech/thinking 


Sleep-wake cycle 
disturbances 


Psychomotor 
disturbances 


Onset 


Presence of 
fluctuation 


Requirement of 
cause 


Exclusion 


Reduced clarity of 
awareness of the 
environment 


Reduced clarity of 
awareness of the 
environment 







As above As above 


Reduced clarity 
of awareness of 
environment 


Reduced ability to focus, 
sustain, or shift attention 


Reduced ability to focus, 


Reduced ability to maintain 
sustain, or shift attention to attention and to shift 
attention to new external 
stimuli 


Reduced ability to 
focus, sustain, or shift 


environmental stimuli attention 



















As above As above As above 


Disorientation and 
memory impairment. 
Required feature if testable 


Both immediate recall 
and recent memory and 
disorientation in time, 
place, or person required 


One of three 
features: insomnia, 
nocturnal worsening of 


symptoms, and disturbing 
dreams 


Change in cognition 
(memory deficit, 
disorientation, 


ol re 


.. development of 


not accounted for by 
dementia 







Disorganized 
thinking: rambling, irrelevant 
or incoherent speech 





One of following: shifts 
from hypo- to hyperactivity; 
t reaction time; f or 4 flow 
of speech; enhanced startle 
reaction 


Developing rapidly; time 
course not defined 


Developing over a short 
period of time—hours to 
days 


Developing over a short 
period of time—hours to days period of time—hours 


to days 


Diurnal pattern Diurnal pattern Diurnal pattern Diurnal pattern 


Cerebral or systemic 
disease 


Evidence or presumption 
of organic factor judged 
to be aetiologically related 
allowed 


And presumption of 
aetiology if symptoms not 
accounted for by a ‘non- 
organic mental disorder’ 


Direct physiological 
consequences of 

a general medical 
condition 


Dementias 








[Essential C |Either/or (use ambiguous) ESS Additional 


language disturbance) 


perceptual disturbance 


Developing over a short 







Reduced ability to 
focus, sustain, or shift 
attention 


Reduced orientation 
to the environment 


Change in cognition 
(memory deficit, 
disorientation, 
language, and 
visuospatial ability) 
Ofe 


.. perception 
(included in ‘change in 
cognition’ above) 


Disturbance in 
attention and 
awareness defined 
developing over a 
short period of time— 
hours to days 


Diurnal pattern of 
fluctuating severity 
of inattention and | 
awareness 


Broader categories 


Other major 
neurocognitive 
disorders and coma 


MBS Notstated/not required 
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Source: data from World Health Organization, The ICD-10 Classification 


Manual of Mental Disorders, (Third Edition, DSM-III, Copyright (1980); 


of Mental and Behavioural Disorders, Copyright (1996), World Health Organization; Diagnostic and Statistical 





DSM-5, Copyright (2013)), American Psychiatric Association. 


Third Edition-Revised, DSM-III-R, Copyright (1987); Fourth Edition, DSM-IV, Copyright (1994); Fifth Edition, 
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Scale-Revised-98 (DRS-R-98) [7]. These criteria have three domains 
of impairment: attentional/cognitive, circadian, and higher-level 
thinking [7]. The TMF criteria had high sensitivity and specificity 
(87.4% and 89.2%, respectively) and were better balanced than 
DSM-III-R (100% and 31.6%, respectively), DSM-IV (97.7% and 
74.1%, respectively), DSM-5 (97.7% and 72.6%, respectively), and 
ICD-10 (66.2% and 100%, respectively). 


Epidemiology of delirium 


Elderly people will make up approximately 16% of the world popu- 
lation by 2050 [16]. While, at any given time, about 4-6% of older 
adults are in nursing homes, around 30-50% of them will use a 
nursing home before their deaths [17]. Older people are high users 
of hospitals; around 40% of acute hospital stays were for people 
aged 65 years and older. A recent study suggested that hospital- 
ization appears to be related to the proximity to death and nursing 
home use to ageing, which, if true, implies that nursing home use 
is likely to grow more rapidly than hospital use as populations age 
[18]. Another study found that multi-morbidity and neuropsychi- 
atric diagnoses were the highest risks for LTC dependency [19]. If 
correct, these findings imply two things. Firstly, that people in hos- 
pitals and nursing homes are more likely to be the oldest old, to have 
higher rates of cognitive impairment, and to be frail and closer to 
death—all factors associated with high rates of delirium. Secondly, 
an understanding of delirium, the problems of its diagnosis, and 
management must broaden to include delirium in nursing homes 
and other settings. 

In the general population, delirium is more common with advan- 
cing age and in the presence of dementia. In people 265 years, the 
point prevalence of delirium is estimated to be around 2% [20]. 
A population-based study examining the 30-day prevalence in the 
oldest old diagnosed delirium in 17% among the 85-year olds, 21% 
among the 90-year olds, and 39% among participants aged 95 years 
and older [21]. Overall, delirium was present in 52% of those with 
dementia, compared to only 5% of those without dementia. LTC is 
variably defined in studies [20]; when defined narrowly to include 
nursing homes, the prevalence estimates are still quite varied, ran- 
ging from just over 1% to 70% [20]. 

Eleven to 25% of hospitalized elder patients have delirium on 
admission, and a further 30% will develop it during the course of 
hospitalization [22]. The former is inaccurately termed ‘prevalent’ 
delirium, missing points in patient flow in and out of the hospital 
through emergency departments (EDs). Delirium occurs in 8-10% 
of older patients presenting to EDs, and 15% still have delirium at 
ED discharge [22]. Two-thirds of people with ‘ excited delirium; a 
poorly recognized state, die at the scene or during transport by para- 
medics or police [23]. Excited delirium is a term used most often 
in the forensic literature to refer to patients who, in other contexts, 
might have been referred to having hyperactive delirium. A key dif- 
ference seems to be the context, with ingestion of drugs or toxins 
by community-dwelling people, many of whom are not elderly 
and whose initial encounter is frequently with the police, being the 
characteristic case. The mortality rate is non-trivial, and the term is 
meant also to suggest extreme adrenergic activation, for a variety 
of reasons. Given our limited experience with, and understanding 
of, this special case, we will not be referring to it further (see [23]). 


Of the 11% of patients who had delirium in the ED on day 1, in 
nearly three-quarters, it persisted into day 2 and for about half of 
them for all 3 days [24]. Thus, the timing of delirium assessments 
can miss cases. 

Once admitted, one study found a point prevalence of delirium of 
approximately 20% of adults evaluated [8]. The highest rate (>50%) 
was unsurprisingly found in the geriatrics ward and the lowest in 
surgical wards, with intermediate rates in both medical and ortho- 
paedic wards (around 25%). When only older adults were examined 
in a nationwide study in Italy, the point prevalence of delirium was 
estimated to be approximately 23% [25]. The highest rates were de- 
tected in neurology and geriatrics wards. There are three caveats to 
consider from these studies. Firstly, a number of patients may be 
excluded because of untestability and lack of consent. Secondly, the 
relatively lower rate of delirium in surgical wards may be an artefact 
of the point prevalence estimation, especially if done on a weekend 
[8]. Surgery is a risk for delirium—the type of surgery, whether it 
was elective or emergency, and the duration of surgery appear to 
be important determinants of post-operative delirium [22]. Finally, 
both these studies excluded intensive care units (ICUs) [8, 25] and 
one excluded palliative care units [25]. The prevalence of delirium 
in ICU cohort studies can range from a low of 20-30% to as high as 
70-80% [26], and in palliative care units, the delirium prevalence 
increases closer to death: 13.3-42.3% on admission, 26-62% during 
admission, and 58.8-88% in the weeks or hours preceding death 
[27]. This dramatic rise in delirium closer to death raises a prag- 
matic question—how do we distinguish between delirium as a part 
of dying and delirium as a sign of preventable death [28]? Prevalence 
estimates may rise further by 8-13% if subsyndromal delirium, a 
prognostically important milder state that does not meet the full 
diagnostic criteria, is included [29]. 


Clinical features and diagnosis 


History taking and learning to interview and observe a patient who 
might not be very co-operative is critical to both the diagnosis of 
delirium and its differential diagnosis. Every effort must be made 
to obtain collateral information from the patients family members, 
caregivers, general practitioner, and past clinical records. 

Delirium is an acute-onset condition. About half of older patients 
with delirium in hospitals already have it at admission, so this is an 
important part of the history to ascertain. The key questions are 
whether the current presentation of the older person represents a 
change for them and, if so, the time course of that change. Responses 
from family members suchas“... this isn’t my mum, or“... my father 
is off with the fairies’ should be taken seriously. It is also useful to 
find out if the patient had had a cognitive assessment in the past year, 
since most high-income countries now require periodic cognitive 
assessments for older people. In hospital, baseline cognitive assess- 
ment and change indicate delirium [30]. 

Once onset, delirium characteristically fluctuates, often in a di- 
urnal pattern, within as little as 3 hours [31]. This can be missed 
if examinations are less frequent (for example, every 3 days) [32]. 
Experts rely on fluctuation of motor symptoms wherein changes to 
physical activity of the patient appear important [33]. This may be 
particularly helpful in diagnosing delirium superimposed on de- 
mentia, both in the hospital [34] and in nursing homes [35]. 


Delirium presents as a mental disorder; it typically shows gross 
disturbance of mental functions and consciousness. Schiff and Plum 
noted that the primary neuropsychological components of con- 
sciousness are arousal, attention, intention, memory, awareness, and 
mood/emotion [10]. Research on clinical features of delirium dem- 
onstrates impairments in all of these components of consciousness. 
In addition, there are often profound disturbances in both the form 
and content of thought and perception. Overall there is considerable 
fragmentation of the conscious experience, which was one of the 
meanings held by the phrase ‘clouding of consciousness. 

Generalized arousal is considered to be the most powerful and 
essential activity in any vertebrate nervous system [36]. Arousal is 
essential for the organism to be both aware and attentive [37]. In de- 
lirium, arousal is both diminished and heightened, as manifest when 
patients are hypoactive and hyperactive. Hypoactive and mixed sub- 
types are usually more common than the hyperactive subtype and 
more likely to be missed. Two-thirds of subtypes appear to maintain 
stability over time [38]. There is some evidence that arousal may pre- 
dict both delirium and inattention [39]. Testing attention and other 
cognitive functions is difficult when arousal is reduced, so it is useful 
to become clinically adept at observing changes to arousal. Rating 
scales such as the modified Richmond Agitation and Sedation Scale 
(mRASS) [33] and the Observational Scale of Level of Arousal [39] 
may assist. 

Attention is commonly impaired during clinical examination; 
global attention, when measured as the patient’s ability to participate 
in a conversation over a short period of time, is the most common 
impairment [40, 41]. Of the available tests of attention, months 
backwards, defined as the ability to name months backwards from 
December to July without error, appears to perform the best, both in 
hospitals [41] and in nursing homes [42]. These measures are easy 
to use by the bedside, but practice is essential in learning both to 
administer and to observe patients as they perform. An app is also 
available to assist in this process [43], although how it will be em- 
ployed in routine practice is unclear [44]. To a certain extent, the 
ability to direct attention is an act of intention; however, intention 
has, as such, not been studied in delirium. The ability to direct the 
mind towards an object and the capacity to act on the object are typ- 
ically impaired. 

Awareness of both self and the environment is grossly impaired. 
While the diagnostic systems do not guide us in how this is to be 
either understood or measured, research into the subjective experi- 
ence in delirium demonstrates this impairment to a startling degree. 
Patients talked of experiencing themselves as being in a vacuum or 
a torpor and/or slumbering and that the surroundings were experi- 
enced through a mist [1]. Time-orderedness of this awareness too 
was impaired; patients encountered a mix of events from past and 
present, and visualization interpreted as imagination or fantasizing 
in a borderline state. These experiences were typically frightening 
when experienced and, when remembered, evoked a variety of emo- 
tions, from feelings of suspiciousness to shame, and between 19% 
and 22% develop post-traumatic stress disorder [45]. Examining 
these experiences relies on developing nuanced interview and ana- 
lytic skills. 

Delirious people feel turbulent emotions, from anxiety and sad- 
ness to irritability and anger. Sadness and anxiety, along with di- 
minished activity of the hypoactive subtype of delirium, prompt a 
diagnosis of depression in such patients and lead to inappropriate 
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treatment [46]. Psychotic symptoms are fragmented; judgements 
about, or perceptual misinterpretations of, the immediate external 
world are common, and when hallucinations occur, they are short- 
lasting and frightening. Thus, delusions, unlike those seen in schizo- 
phrenia, are liable to change and rarely systematized. Delusions and 
psychomotor agitation appear to predict the severity of distress after 
delirium [45]. The persistence of memories and distress may remind 
the clinician of delusions, but its certainty relates to the experience at 
the time and, unlike those seen in schizophrenia, is not an ongoing 
experience of the world. Hallucinations are typically visual and 
may occur in up to 50% of patients; rates are higher (nearly three- 
fourths) in patients referred to consultation-liaison psychiatry ser- 
vices [47]. It is clinical lore that delirium must be considered in an 
elderly person with first-onset visual hallucinations; however, one 
must be cautious in passively accepting it because it is not informed 
by research, and visual hallucinations are common in Lewy body 
dementias (LBDs). 

Standard electroencephalography (EEG) is neither recommended 
for, nor feasible in, routine clinical practice. A recent study has 
shown that it may be possible to derive useful information from just 
two electrodes [48]. How this will influence routine practice has yet 
to be established. 





Differential diagnosis 


The differential diagnosis of delirium is influenced by the settings 
in which the patient is examined; in hospitals, dementias, depres- 
sion, and alcohol use disorders (AUDs) are important to consider, 
whereas in nursing homes, dementias, behavioural and psycho- 
logical symptoms of dementia (BPSD), and depression are likely 
more important. The task is made challenging because of the fre- 
quent co-occurrence of these disorders. 

Dementia and cognitive impairment are consistently identified as 
risk factors for delirium. In approximately 50% of patients, delirium 
is superimposed on dementia [34]. The problem then is not just to 
differentiate delirium from dementias, but also to diagnose it when 
dementia is present; there appears to be no consensus among ex- 
perts as to how to do this [49] (Table 39.2). Three issues must be con- 
sidered. Firstly, there is considerable overlap of symptoms between 
delirium and those of dementias such as LBD and some rare forms of 
dementias such as those caused by infections and prions. Two-thirds 
of experts found it challenging to diagnose delirium superimposed 
on dementia with Lewy bodies (DLB) [49]. While estimates of prior 
probability in a setting is a useful guide to differential diagnosis, the 
risk of missed diagnoses and potential harm from treatment makes 
itimportant to ensure accuracy of diagnosis. Secondly, BPSD are not 
included in the diagnostic criteria for the most common dementias 
[AD and vascular dementia (VaD)], but these are common and add 
to the challenge of a delirium diagnosis. Finally, in a fifth of cases, de- 
lirium persists at 6 months after diagnosis [50]; where does delirium 
end and dementia begin? 

Dementias can onset acutely [51], though the proportion of those 
that onset over hours to days might be small. Cerebrovascular dis- 
ease, including VaD, is an important differential diagnosis because 
of possible acuity of onset and common co-occurrence. Around 11% 
of dementia patients in acute hospitals may have VaD [52]; while the 
frequency of strategic infarct VaD in this population is not known, it 
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Table 39.2 Prevalence and comparison between delirium and dementias 


Dementias 

















Onset Acute (hours to ee Typically insidious Acute to subacute Insidious Acute to insidious nsidious 
onset 
Arousal Impaired Typically unimpaired May be impaired Typically unimpaired ay be impaired mpaired 
Attention 97% mpaired Impaired mpaired mpaired Grossly impaired 
Orientation 75% mpaired Impaired mpaired mpaired mpaired 
Memory 88% mpaired Impaired mpaired ay not be in early stages Impaired 
Sleep-wake cycle 97% ay be impaired 6-11% ay be impaired ay be impaired mpaired 
Hallucinations 50% ay be present 0-18% ay be present ay be present Present 
Delusions 31% ay be present 9-40% Present (later stages) Present (earlier stages) Present 
Fluctuation Present ay be present May be present ay be present (20%) May be present (35-50%) Present (90%) 
Course Persistent in 20% at Progressive Persistent Progressive Stable to progressive Progressive 





end of 6 months 


may account for 10% of all VaD diagnoses at autopsy [53]. Delirium 
can also be present in 10-30% of patients with stroke [54]. The pres- 
ence of focal neurological signs should trigger requests for brain 
imaging. When absent, repeat assessment focusing on arousal levels, 
attention, and focal higher cognitive signs can be helpful. Other 
dementias, typically those of infective causes or due to prions, can be 
acute in onset, but the prevalence is unknown. A high index of sus- 
picion generated by experience of a curious mind is possibly vital. 

LBD in general, but DLB in particular, has a clinical presentation 
that is most similar to delirium. Global cognitive impairment (in 
this case, dementia), fluctuation in cognition, with pronounced vari- 
ations in attention and alertness, and visual hallucinations are core 
criteria for DLB [55]. The prevalence of DLB in hospital settings and 
in nursing homes is not known; it may be present in approximately 
3% of inpatients [56], and 2% of patients with delirium during hos- 
pitalization were diagnosed with DLB at 3 months’ follow-up [57]. 
Parkinson's disease, which is associated with both hallucinations and 
delirium, is perhaps more common in hospitals and nursing homes 
[58]. The time course over which symptoms evolve and the presence 
of Parkinsonian signs help differentiate. Treatment of hyperactive 
delirium may involve the use of dopamine antagonist drugs, such as 
haloperidol, which may be fatal in patients with DLB. 

BPSD are common in dementias and especially so in nursing 
homes and perhaps also in hospitals; a recent study estimated that 
around 70% of hospitalized patients with dementia will show at least 
one BPSD [59]. The phenomenon of sundowning can mimic de- 
lirium, and delirium worsens BPSD [60]. BPSD tend to be persistent, 
so once again attempts to obtain a history from family members or 
staff at the nursing home are important in distinguishing delirium 
from BPSD. 

Depression is common in hospitalized elderly patients [46], and 
delirium, depression, and dementia commonly co-occur in nursing 
homes [61]. Patients with hypoactive delirium and those with sig- 
nificant disturbance in mood and emotion can appear depressed 
on cross-sectional examination, leading to misdiagnosis. Efforts 
to obtain a history and, in the fast-paced environment of an acute 
ward, avoid conflating a symptom with a syndrome are important to 


an accurate diagnosis. Depression onsets over days to weeks, rather 
than hours to days; severe melancholic depression can show fluctu- 
ation, but it is typically predictable, with characteristic worsening 
of symptoms in the morning; while sleep-offset insomnia is charac- 
teristic of severe depression, insomnia in depression can also be of 
sleep-onset and sleep-maintenance. However, it is rarely fragmented 
or reversed, as seen in delirium; when psychotic symptoms occur in 
depression, they tend to be delusions, which are typically delusions 
of poverty, guilt, and nihilism. 

The number of people with substance use disorders aged 50+ 
years are likely to double by 2020 [62]. Almost 50% of older adults 
(265 years) and almost 25% of subjects over 85 years drink alcohol; 
AUDs afflict 1-3% of elderly subjects [62]. In addition, up to 30% of 
older patients hospitalized in general medicine wards and up to 50% 
of those hospitalized in psychiatric wards have AUDs. Standardized 
rating scales are recommended for the identification and prediction 
of withdrawal symptoms [62]. 

Late-onset schizophrenia and late-onset schizophrenia like psych- 
osis are relatively rare. People with these disorders are less likely 
than young-onset schizophrenia to demonstrate cognitive symp- 
toms [63]; indeed cognitive impairment is felt sometimes to exclude 
a diagnosis [64]. Where it does not, impairments in attention and 
memory are common [65]. Base rates of diagnoses are a useful guide 
to diagnostic reasoning. 

In summary, careful history taking and interview, along with col- 
lateral information focused on premorbid cognition and behaviour 
and examination, are key, but no guarantee, to an accurate diagnosis. 


Given the rate of misdiagnosis of delirium, bedside use is as im- 
portant a feature of a tool as it is for research. Two systematic reviews 
have examined the role of delirium assessment instruments [66, 67]. 
Wong et al. [66] reported that positive findings on the Global 
Attentiveness Rating (GAR), Memorial Delirium Assessment Scale 
(MDAS), Confusion Assessment Method (CAM), Delirium Rating 


Scale Revised-98 (DRS-R-98), Clinical Assessment of Confusion 
(CAC), and Delirium Observation Screening Scale (DOSS) each had 
a likelihood ratio of >5 for diagnosing delirium. They determined 
that the Mini-Mental State Examination (MMSE) was the least 
useful for identifying patients with delirium [66]. The other brief 
cognitive screening instruments likely perform similarly. However, 
it is clinically relevant and important to conduct routine cognitive 
assessments, where possible, on all older patients admitted to hos- 
pital and those in post-acute care and nursing homes. Such instru- 
ments [for example, the Abbreviated Mental Test Score (AMTS)] 
might be helpful in the following ways in identifying delirium: (1) 
to detect dementia, possibly the most important risk factors for de- 
lirium, in hospitalized older patients [68]; (2) given its sensitivity, 
the MMSE may be useful in ruling out delirium [69]; (3) a change 
from baseline cognition may help predict delirium [30]; or (4) spe- 
cific items such as disorientation to time and place on the MMSE on 
admission could predict delirium. Impairment in these two items 
classified nearly 90% of patients, and these two measures, along with 
visuoconstructional impairment, were each associated with either 
hypoactive or mixed subtype [70]. 

A problem when instruments are used in isolation is that they have 
to fit into hospital clinicians’ available time. Even the CAM, which 
takes only 5 minutes to complete, may be too long for the busy clin- 
ician [66]. Besides, almost all instruments require training; in un- 
trained clinicians, CAM has a low sensitivity [71-73]. Instruments 
may have to be suitably matched to clinicians using them. For ex- 
ample, the DRS-R-98 might be more suitable for use by suitably 
trained clinicians such as old age psychiatrists and geriatricians. On 
the other hand, the Nursing Delirium Screening Scale (Nu-DESC), 
which has shown good psychometric properties in two systematic 
reviews, might be a good observational tool for routine use by nurses 
[66, 67]. Two new instruments have taken on the challenge of being 
both brief and capable of wider use: 4AT [74] and RADAR (Repérage 
Actif du Delirium Adapté dla Routine or Recognizing Acute Delirium 
As Part of Your Routine) [35]. The 4AT has four items: (1) alertness; 
(2) four questions from the AMTS on age, date of birth, place (name 
of the hospital or building), and current year; (3) attention as tested 
by months of the year backwards; and (4) acute change or fluctu- 
ating course (http://www.the4at.com/). The RADAR has just three 
questions: “When you gave the patient his/her medication: 1) Was 
the patient drowsy?; 2) Did the patient have trouble following your 
instructions?; 3) Were the patient’s movements slowed down?’ [35]. 
Of the two, the 4AT has so far shown a good balance between sen- 
sitivity and specificity when used by geriatricians [74] (Bellelli et al., 
2014) and nurses without training in its use [75]. The RADAR had 
lower levels of both sensitivity and specificity, but participating 
nursing staff took only 7 seconds, on average, to complete the tool 
and almost all received it very well [35]. Additionally, it is important 
to note that the RADAR was tested across five nursing homes where 
the misdiagnosis of delirium is typically higher than in hospitals [76]. 


Natural history of delirium 


Delirium is associated with an increased risk of post-discharge mor- 
tality, institutionalization, and dementia [77]. Hypoactive delirium, 
increased severity, and persistent delirium, pre-existing dementias, 
and depression may all be predictive of these negative outcomes [78]. 


CHAPTER 39 Delirium 


Delirium has been for long thought of as a transient disorder; 
most modern studies demonstrated that this is not the case. One 
review found the combined proportions with persistent delirium at 
discharge, 1, 3, and 6 months to be approximately 45%, 30%, 25%, 
and 20%, respectively [50]. Persistence was associated with de- 
mentia, increasing numbers of medical conditions, increasing se- 
verity of delirium, hypoactive symptoms, and hypoxic illnesses [3]. 
It is unclear what proportion of these outcomes are attributable to 
factors other than delirium. Mortality in delirium may be medi- 
ated by the severity of illness and frailty in the short term [79] and 
through cognitive impairment in the long term [80]. Other negative 
outcomes may be mediated through persistence, which, in turn, is 
likely due to pre-existing dementia [78] and the presence of other 
geriatric syndromes [81]. Delirium, especially hypoactive delirium, 
is likely to predispose to deconditioning, falls, dehydration, pressure 
ulcers, and urinary retention, which, in turn, send the patient into a 
vicious spiral [81]. 

The outcomes of delirium from ICU settings are no less grim. 
Patients with delirium had significantly higher mortality during 
admission, longer durations of mechanical ventilation, and longer 
lengths of stay in the ICU and in hospital [82]. 

In nursing homes, symptoms of delirium predict incident de- 
lirium [83]. Baseline delirium and subsyndromal symptoms of 
delirium, depression, and dementia interact to lead to increased 
mortality and functional and cognitive decline [61]. 

Examination of outcomes of delirium requires further work. 
Adamis et al. [84] noted that recovery is not well defined for de- 
lirium; they proposed sustained improvement in cognitive function 
to define therapeutic outcomes in delirium. Following generally ac- 
cepted nomenclature in psychiatry, they defined ‘response as when 
baseline cognitive function is re-established for at least one full day 
of assessment and ‘recovery’ when this baseline cognitive function is 
sustained for at least a week [84]. 





Aetiopathogenesis of delirium 


Causality in medicine is commonly conceptualized, such that they 
can be reduced to biology, with preference for bottom-up explan- 
ations that focus on generative physiological mechanisms [85]. 
Outside of geriatrics and psychiatry, causality is commonly re- 
quired to be parsimonious; multiple causes are eschewed. For the 
researcher, the attraction is the idea that a single cause can lead to 
a single treatment, and for the busy clinician, a clear direction for 
action. These ideas can be problematic when applied to the common 
and complex presentations of later life, including delirium. 
Considering the pathogenesis of delirium, many causal hypoth- 
eses/models have been articulated [86]. A closer look reveals two 
broad categories: (1) those that reduce the cause of delirium to a 
single final common pathway operating in and through the brain; 
and (2) those that address complexity by other means. Complexity 
is usually addressed by attempting to either integrate two or more 
final common pathways’ together or use a framework such as the 
vulnerability-stress model to either empirically derive or theorize a 
model. There is a varying degree of strength of assumption in these 
models as to whether the brain is the organ of dysfunction. A meta- 
phorical understanding of delirium and related geriatric giants 
would consider the human organism as a complex system, however. 
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When complex systems fail, their highest-order functions fail first. 
In humans, consciousness, one of the higher-order functions, and 
its failure are signalled by delirium. In this metaphor, the brain is 
simply a bystander to a set of processes occurring throughout the 
body [37]. 

To be valid, the single common pathway theories should be able 
to describe the mechanisms that lead to the delirium in particular, 
and not something else, either dementia or one of the other com- 
plex presentations in older adults. These should also explain the 
occurrence of delirium subtypes, especially the occurrence of the 
mixed subtype of delirium. The more complex theories should not 
be setting-specific; they should be able to account for delirium in 
an older adult, regardless of the setting. They should also be able to 
describe the occurrence of delirium in older adults of varying levels 
of fitness within the same setting. Finally, they should serve utility. 


Single final common pathway hypotheses 
Oxidative stress 


Engel and Romano [9], in their now classic paper, hypothesized 
that delirium could result from two processes: failure of function 
or failure of structure. In this schema, the former is reversible and 
the latter permanent, drawing to the distinction between transient 
delirium and permanent dementia. Presciently, they noted it was 
too early to ‘regard these states as any more than different degrees 
or stages of similar processes: Their hypothesis was that delirium 
resulted from low cerebral oxygen consumption. In its modern it- 
eration, delirium is thought to result from brain damage resulting 
from oxidative stress [87]. Neopterin produced by microglia in the 
brain is a marker for both cell-mediated immunity and oxidative 
stress and was found to be higher in both the preoperative samples 
of CSF and the serum of delirious patients than in those with no 
delirium. Those with both delirium and chronic cognitive impair- 
ment had the highest levels of neopterin, and those with neither 
had the lowest [87]. In those without cognitive impairment, there 
was a correlation between delirium severity and neopterin levels. 
The association between obstructive sleep apnoea and delirium is 
seen as further evidence of the role of oxidative stress [88]. The fact 
that neopterin also has a role in cell-mediated immunity (though, in 
their study, the correlations persisted, even when those with infec- 
tion and malignancies were removed from analysis) suggests more 
complex pathways. Importantly, elevated levels have been found in 
other neuropsychiatric disorders. 


Neuroendocrine hypothesis 


Kral hypothesized that delirium was the result of age-linked decline 
of stress resistance mediated through the hypothalamo-pituitary- 
adrenal (HPA) axis [89]. Some recent support for this hypothesis 
has come from a positive association between raised perioperative 
plasma cortisol concentrations post-coronary artery bypass graft 
surgery [90]. However, unlike other recent studies examining anti- 
cholinergic activity, this study did not adjust for confounders [91]. 
Moreover, intraoperative dexamethasone failed to reduce the inci- 
dence of post-operative delirium [92]. The important issue here is 
that impaired HPA activity has not only profound systemic effects, 
but also effects on the brain, and it is through their pro-inflammatory 
effects that they can be usefully integrated into a more complex 
hypothesis [93]. 


Cholinergic hypothesis 


Tune et al. proposed the cholinergic hypothesis, based on the finding 
of an association between the use of anticholinergic drugs and post- 
operative delirium [94]. The hypothesis is attractive because of both 
the basal forebrain degenerative changes in dementias and the role 
that acetylcholine plays in arousal and attention. Much work has 
been done since the 1980s to prove the hypothesis by attempting to 
demonstrate associations between anticholinergic drug burden and 
serum anticholinergic activity and delirium. However, both an early 
[95] and recent studies cast doubt on this hypothesis [5, 91, 96, 97]. 
In one cohort study, anticholinergic drugs were not associated with 
an increased risk of delirium [96], while in another cohort study, the 
associations disappeared after adjusting for confounders [91]; in the 
first study examining CSF (and serum) anticholinergic activity, there 
was no association between either of these measures and delirium 
[97], and finally cholinesterase inhibitors have not demonstrated a 
role in preventing delirium [5]. Acetylcholine is but one important 
component in the modulation of attention and arousal [12]; disturb- 
ances in other neurotransmitters, such as dopamine, noradrenaline, 
GABA, and glutamate, are posited as well [86]. It is not that cholin- 
ergic systems have no role in delirium; instead they are likely to be 
part of more complex dysfunction that leads to delirium [98]. 


Melatonin hypothesis 


Rooij and van Munster [99], noting the prominent disturbance of the 
circadian rhythm in the sundowning phenomenon seen in dementia 
and delirium, posited that melatonin deficiency triggers delirium 
and thus could be used to prevent it from occurring. The poten- 
tial efficacy of melatonin in reducing symptoms of sundowning to 
this idea [100]; unfortunately, a large randomized clinical trial by 
the same group found no evidence for melatonin in reducing post- 
operative delirium [101]. Melatonin is implicated in the modulation 
of inflammation in the brain, and there are associations between 
sleep dysregulation and decreased proportions of natural killer 
cells, reduced lymphokine-activated killer activity, and reduced IL-2 
production [86]. As with other pathways considered here, the pic- 
ture points to complex interactions between inflammation, neuro- 
transmitters, and neuroendocrine activity [86], which leads us to 
consider complex models that have attempted to integrate these 
findings in a systematic manner. Although additional investigations 
have been proposed, there is little evidence to recommend the use of 
melatonin at this time [102]. 


Complex models 
Neuroinflammatory hypothesis 


The most common hypotheses attempt to integrate theory and avail- 
able data in different ways [98, 103, 104]. 

Pro-inflammatory cytokines generated in the periphery, such as 
interleukin 1B (IL-16), tumour necrosis factor a (TNFa), and inter- 
leukin 6 (IL-6), communicate with the brain through several routes, 
including autonomic afferents, across the blood-brain barrier 
(BBB), and via the circumventricular region [103]. Increased TNFa 
is thought to activate microglia, which, in turn, release cytokines 
and prostaglandins. One postulate is that, with ageing and dementia, 
microglia are ‘primed’ due to loss of cholinergic inhibition resulting 
in their heightened response [98]. There is evidence that increasing 
severity of dementia may be associated with higher likelihood of 


delirium [105]. This microglial priming and resulting uncontrolled 
inflammation causes further neurodegeneration leading to a vi- 
cious cycle of brain change [98]. Even though cytokines are known 
to be increased in delirium [106], this hypothesis, while explaining 
why neuroinflammation could be associated with dementia, does 
not quite explain the occurrence of delirium [98]. Cerejeira et al. 
[103] and Cunningham and Maclullich [104] go further, using an 
explicitly stated vulnerability—stress framework to provide mechan- 
istically plausible accounts for the occurrence of delirium. In their 
comprehensive review, Cerejeira et al. [103] concluded delirium 
symptoms may be accounted for by the widespread dysregulation 
of brain homeostasis, neurotransmission, and neurophysiological 
functions, resulting ‘in a reduced capacity to interact with the en- 
vironment and to integrate stimuli within the cognitive experience’ 
In a more nuanced analysis, Cunningham and Maclullich [104] 
stated that their model accounts for the characteristic symptoms of 
hypoactive delirium, but not for the presence of psychotic symp- 
toms. Intriguingly, in a longitudinal study of community-dwelling 
older adults with AD, raised IL-6 was associated with hallucinations, 
but not with delirium [107]. 


Network disconnection hypothesis 


The network disconnection hypothesis, also using the vulnerability- 
stress framework, proposes that acute breakdown of network con- 
nectivity within the brain results in delirium [86]. Two recent 
studies have attempted to characterize this hypothesized discon- 
nection through fMRI and a within-patients design [108] and EEG 
using a between-patients design [109]. They both demonstrated dis- 
connection, but one showed an increase in functional connectivity 
between the prefrontal cortex and the posterior cingulate cortex 
[108], while the other found reduced corticocortical connectivity 
[109]. Manshour and Avidan urge caution in interpreting these 
findings as evidence of disconnection given the assumptions made 
[110]. Post-operative delirium may be predicted by increased white 
matter hyperintensities (WMHs), offering a structural basis for this 
apparent functional disconnection [111], though in another study, 
WMHs did not predict post-operative delirium in those without 
dementia [112]. This latter group of researchers did find an asso- 
ciation between diffusion tensor imaging abnormalities observed 
in the corpus callosum, cingulum, and temporal lobe and delirium 
[113], suggesting that disconnection may yet play an important role 
in delirium. 


Multifactorial model 


At the level of the human organism, Inouye’s multifactorial model is 
widely regarded. It too employs the vulnerability-stress framework, 
furnishing empirically derived factors to populate vulnerabilities 
and stressors [114]. It has clinical and pedagogic utility. The lack of 
temporal separation between, and lack of clarity of, vulnerabilities 
and stressors poses a problem, as does a failure to provide an ex- 
planation as to why delirium is caused. A recent study found pre- 
operative risk factors, such as impaired cognition, thus more clearly 
temporally separated vulnerabilities, clearly modify the response to 
post-operative factors, such as post-operative pain and opioid use, 
to result in delirium [115]. This model may not work well in nursing 
homes [116]. In addition, there is some evidence that deficit accu- 
mulation, and thus systems failure, may account for delirium [79], 
but this has not been tested more widely. 


CHAPTER 39 Delirium 


In summary, current models integrate available evidence and 
may explain at least one of the subtypes of delirium and provide a 
mechanism for accounting for the influence of age and pre-existing 
dementia in being a vulnerability to delirium. Most do not explain 
why the complex impairments lead to the characteristic symptoms 
of delirium, and not dementia, however. The multifactorial model 
and clinical and pedagogic utility may not be applicable outside hos- 
pital settings. 


Treatment and management of delirium 





The treatment and management of delirium are as much about pre- 
vention as it is about treating the condition once it has occurred. 
Management strategies that use single and multiple components, re- 
flecting the multifactorial causation of delirium, have been trialled. 
Multicomponent interventions typically include some combin- 
ation of the following: staff education, orientation protocols, avoid- 
ance of sensory deprivation, multi-disciplinary team approach to 
care, sleep protocols, early mobilization, hydration, nutrition, medi- 
cation reviews, oxygen delivery, and pain control [117]. The quality 
of evidence from the four randomised clinical trials reviewed in a re- 
cent systematic overview was considered to be very low [117]. There 
appears to be no evidence for efficacy of multicomponent interven- 
tions in the treatment of delirium. No new trials of multicomponent 
interventions in acute care appear to have been published after 2006 
[118]; a pragmatic trial in post-acute care too failed to show an effect 
[119]. Does this mean that we should give up on treating delirium? 
What one of us wrote more than a decade ago is still relevant: ‘... the 
care provided to the intervention group is how most of us would like 
our loved ones to be cared for, with enough attention paid to them 
that health care professionals would at least recognize alterations in 
their mental state. The unattractive but real alternative is that these 
patients are consigned to a poor quality of care for ‘confusion that 
demoralizes families and health care professionals alike. Let me 
press the point further: to fail to recognize delirium is to practise 
with an unsatisfying disengagement with one’s patients’ lives’ [120]. 
Dopamine antagonists and other psychotropics are commonly 
used to treat delirium and agitation in acute hospitals. The (at least) 
four systematic reviews/meta-analysis published in 2015-2016 vary 
in their recommendations in using dopamine antagonist medica- 
tions to treat delirium [121-124]. What is clear is that routine use 
of such medications cannot be recommended for the treatment of 
delirium. In routine use, the harms from this class of drugs outweigh 
any potential benefit. However, there will be times in a busy hos- 
pital ward when behavioural control of agitation is required for the 
safety of the patient and others. In each single case, a risk—benefit 
analysis must be done before using a dopamine antagonist. On the 
positive side, a majority of the studies show some benefit, in con- 
trast to use of pro-cholinergic drugs or benzodiazepines [125]. Most 
treatment guidelines recommend haloperidol, and some include 
olanzapine [125]. The problem with haloperidol is that it is reputed 
to cause extra-pyramidal side effects (EPSEs), and it is not quite 
known how it helps. EPSEs may well be caused by higher doses; for 
example, in one recent clinical trial, the haloperidol dose, which is 
roughly equivalent to risperidone, was four times higher than the 
risperidone dose [126]. When used in equivalent doses (chlorpro- 
mazine equivalent dose of approximately 60 mg/day), haloperidol 
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(approximately 1 mg/day) was equally efficacious as newer dopa- 
mine antagonists without EPSEs [127]. The problem discovered in 
this trial was that the efficacy of all the dopamine antagonists tested 
was poor in those aged 75 years and over [127]. Benzodiazepines are 
not recommended in the treatment of typical delirium in an older 
person. However, they will need to be considered in cases where 
alcohol withdrawal is the cause of the delirium. In such cases, in 
order to avoid risks of oversedation, short-acting benzodiazepines 
should be considered. Lorazepam (1-2 mg orally or intramuscularly 
or intravenously every 4 hours) or oxazepam (30-60 mg orally every 
4 hours for the first day, and then tapering the dose by 50% on days 
2 and 3) are recommended [128]. 

Ifa dopamine antagonist is deemed necessary to use, then it is best 
to first trial haloperidol. The typical dosing should be 0.25 mg 3-4 
times a day, resulting in stable drug levels from the outset. The im- 
portant act is to review the use of the medication daily and to taper 
and cease it as soon as possible. Ideally, this would be part of a hos- 
pital routine to match the use of dopamine antagonist medications 
to an indication for their use. 

In summary, multicomponent interventions continue to be im- 
portant in ensuring older adults receive good quality of care during 
their hospitalization. Routine use of dopamine antagonist medica- 
tions is not recommended for the treatment of delirium; they should 
be used when necessary to provide safe care in a hospital setting. 

There is better evidence of effect for multicomponent interven- 
tions in the prevention of delirium. A recent Cochrane review found 
moderate-quality evidence that multicomponent interventions re- 
duce the incidence of delirium, compared to usual care, with similar 
effect sizes across medical and surgical settings. However, the evi- 
dence was less for efficacy in patients with pre-existing dementia 
[5]. It is not clear which combination of components works best 
for patients and for patients in a given setting [117], but based on 
the very limited evidence from trials that used single components, 
it is possible that staff education, reorientation protocols, and geri- 
atric consultations may be important components to include [117]. 
A pragmatic clinical trial in 283 older patients admitted for hip frac- 
ture repair found good adherence to pre-printed post-operative 
orders, but more importantly, significantly less delirium in the 
intervention group (33% vs 51% in controls) and a stronger effect in 
people with pre-existing dementia [129]. 

Multicomponent interventions are not without problems; it is 
unclear if they actually work in reducing delirium in the most frail 
[130], and implementation of all elements in a hospital setting over 
time may be problematic. This requires change in the thinking of 
what hospitals do, from the coal-face to the board [131]; a frail older 
adult-friendly hospital is likely to be friendly for all patients. 

There is no evidence for the use of cholinesterase inhibitors, mela- 
tonin agonists, or dopamine antagonist medications in the preven- 
tion of delirium [5]. In the ICU setting, dexmedetomidine appears 
to reduce the incidence of delirium, agitation, and confusion in crit- 
ically ill patients [132]. Placement of earplugs in patients admitted 
to the ICU, either in isolation or as part of a bundle of sleep hygiene 
improvement, appears to reduce the risk of delirium [133]. However, 
these short-term gains have to be considered against the finding that 
interventions do not reduce short-term mortality [134]. 

There is little evidence of systematic interventions in nursing 
homes [135]. In their review, Clegg et al. [135] found one large cluster 
randomized trial of a computerized system to identify medications 


that may contribute to delirium risk and trigger a pharmacist-led 
medication review. This study showed a large reduction in delirium 
incidence, but not in other measures such as reduction in hospital 
admissions, mortality, or falls risk. Despite this, it is heartening to 
see the reduction in delirium incidence alone. While the system 
may not be either available or affordable in all countries, consider- 
ation for deprescribing is important. Canada has now established 
a deprescribing network, and information available on its website 
http://deprescribing.org/ would be a useful resource to consider. 





Closing remarks and future 


Interest in delirium, both in research and policy, has dramatically 
increased in the past decade. This bodes well. However, despite this 
heightened interest, diagnostic error rate for delirium remains high 
and outcomes poor. Josef Strauss wrote his waltz Delirien for a medical 
ball; as beautiful as it is, after the first minute and 40 seconds, it con- 
tains little in its music to suggest the turbulence of delirium. Perhaps 
it is time for us to dance to a different tune for, and with, our patients. 
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Worldwide, 47.5 million people have dementia. The total number of 
people with dementia is expected to reach 75.5 million by 2030 and 
135.5 million by 2050 [1, 2]. In the UK, dementia has become the 
leading cause of death in women and is second to heart disease for 
men [3]. The most common cause of dementia is Alzheimer’s disease 
(AD), being implicated in up to 70% of dementia cases [4]. The tragic 
combination of symptoms present in AD has a profound emotional 
and resource-intensive impact on patients, family, friends, and carers. 























Typical early symptoms include absent-mindedness, difficulty re- 
calling names and words, difficulty learning new information, 
disorientation in unfamiliar surroundings, and reduced social en- 
gagement. Atypical, non-amnestic forms also exist where the ini- 
tial presentation is dominated by visual agnosia (posterior cortical 
atrophy), word-finding difficulties (logopenic—primary progressive 
aphasia), or dysexecutive syndrome (presents as fronto-temporal 
dementia phenocopy). As the disease progresses, there is marked 
memory loss and loss of other cognitive skills, including a reduced 
vocabulary and less complex speech patterns. This may be accom- 
panied by mood swings, apathy, a decline in social skills, and the 
emergence of psychotic phenomena. Advanced AD is character- 
ized by monosyllabic speech, psychotic symptoms, behavioural dis- 
turbance, loss of bladder and bowel control, and reduced mobility. 
Nevertheless, due to the mixed nature of dementia pathologies, the 
differential diagnosis of AD is inexact. For example, AD frequently 
occurs in the presence of vascular changes, indicating a coexisting 
vascular dementia. Without imaging and neurocognitive test evi- 
dence, the early stages in atypical AD may not be distinguishable 
from other neuropathologies. The lack of certainty is reflected in the 
NINDS-ADRDRA diagnostic category of ‘probable AD’ [5]. Post- 
mortem examination reveals that AD brains are characterized by 
the aggregation and accumulations of inter-neuronal amyloid (de- 
scribed as amyloid plaques) and intra-neuronal tau (described as 
neurofibrillary tangles) proteins. 

There are two separate AD expressions, based on their age of 
onset and underlying genetics. In early-onset AD, also known as 








dominantly inherited or familial AD (fAD) due to its high inherit- 
ability, symptoms can begin in a person's late 20s through to their 50s. 
Late-onset AD, also known as sporadic AD (sAD), has more varied 
genetic determinants, but they individually and collectively have less 
impact. In sAD, there is a greater influence of environmental risk 
factors such as exercise, alcohol consumption, and obesity. In sAD, 
symptoms rarely occur before the age of 65. Although the range and 
complexity of causal factors may be greater in sAD, the disease pro- 
cesses are sufficiently similar for both to result in comparable symp- 
tomatology and neuropathology. This has allowed fAD to be used as 
a model for sAD [6]. 

Current convention set by the National Institute on Aging- 
Alzheimer’s Association Workgroup differentiates core clinical and 
research diagnostic criteria [7]. In clinical practice, the diagnosis of 
probable AD requires the presence of dementia (either amnestic or 
non-amnestic in terms of affected cognitive domain) that is of in- 
sidious onset, represents a deterioration from the individual's base- 
line, and is not more likely to be accounted for by another cause 
(for example, other types of dementia or other neurological or med- 
ical comorbidities). The term mild cognitive impairment (MCI) has 
additionally been introduced to recognize states of pre-dementia 
where cognitive impairment does not interfere to a significant extent 
in the patient’s function. The research AD criteria see the addition 
of two sets of biomarkers: pathophysiology and neuronal injury. The 
former is defined by the presence of AD pathology (low CSF AB-42 
levels or positive PET amyloid imaging). The major neuronal injury 
biomarkers are elevated levels of CSF tau, reduced temporo-parietal 
glucose uptake, as measured by ‘“fluorodeoxyglucose PET, and 
disproportionate atrophy in the temporal and parietal lobes using 
structural neuroimaging. The current recommendation is that the 
biomarker-enhanced AD diagnosis is not used in routine clinical 
practice, as the core clinical criteria have good diagnostic accuracy 
for the majority of patients, while the biomarker criteria require fur- 
ther validation and standardization across laboratories. 











fAD accounts for less than 5% of AD cases, and due to its herit- 
ability, most cases are easily identified [8]. Characterizing the preva- 
lence, incidence, and risk factor profile of sAD, however, is more 
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challenging due to the subtle and insidious onset of symptoms, the 
stigma associated with a dementia diagnosis, and the limited avail- 
ability of diagnostic tools. Accurate diagnosis, particularly in the 
early disease stages, is difficult and requires (preferably serial) struc- 
tural and molecular brain imaging, as well as CSF analysis. These 
highly specialized data are rarely available in population studies, 
being confined to clinical studies. 

In a systematic review of 45 studies of sAD, with 20 studies eli- 
gible for meta-analysis of participants aged 60+ years, the overall 
prevalence was 4%, ranging from 1.5% in India to 20% in Israel. In 
community settings, the prevalence was 3%, while in institutional 
settings, the prevalence ranged between 10% and 23%. Age-specific 
prevalence of sAD increases from around 1% at age 60 years to 2% 
at age 65, 5% at age 70, 10% at age 75, and 20% at age 80 to rates of 
around 30% at age 90 [9, 10]. 

In a systematic review of 30 studies, with 11 studies eligible for 
meta-analysis of participants aged 60+ years, the pooled annual 
incidence of sAD was 1.6%. All studies in this meta-analysis were 
in community settings, with estimates ranging between 0.7% in an 
Italian study to 3% in an American study. Age-specific incidence 
rates go from around 0.2% at 65 years to 6% at 90 years [9]. A more 
sophisticated analysis, adjusting for competing morbidities and at- 
trition, but using non-specified dementia as an outcome, confirms 
these systematic review estimates [11]. 

Until the age of 85, AD incidence rates for men and women are 
comparable [12]. Beyond the age of 85, incident AD is lower in men, 
although vascular dementia risk in men rises. The resulting balance 
of risk is for an overall increased risk of incident dementia in women 
at older age. The extent to which this is due to greater female survival 
is unknown. 

A comparison of dementia across 20 years (1989-2008) in the UK 
shows a recent and modest decline in annual incidence from 2% to 
1.7% [13]. This is more pronounced in men and is attributed to an 
improvement in lifestyle over this period. Although these numbers 
refer to dementia, given the mixed pathology of many dementias, 
similar changes are unlikely not to be reflected in sAD. This drop 
in incidence, although encouraging, will do little to ameliorate 
the population impact of sAD. Due to the increasing numbers of 
older people, the overall numbers of sAD cases will continue to rise. 
Similar trends are found elsewhere [14]. These findings are con- 
sistent with the fall in AD after the age of 98 found in post-mortem 
studies [15]. 


Neuropathology 


AD is a protein abnormality resulting in relentlessly progressive 
neurodegeneration. The molecular biology underlying AD is com- 
plex and incompletely understood. Interest has focused on the 
two main brain protein accumulations that are characteristic of 
AD: amyloid and tau. The -amyloid protein (AB) is prone to ag- 
gregation when it fails to fold into the three-dimensional structure 
that is essential to its function. The misfolded AB monomers pro- 
gressively aggregate, forming oligomers, then fibrils, and then into 
relatively disordered clumps of fibrils known as plaque. Tau pro- 
tein stabilizes extracellular clumps, which are essential to a number 
of cellular processes. It has a number of phosphorylation sites. 
Hyperphosphorylation at these sites inhibits microtubule assembly 


and sequesters normally phosphorylated tau, causing microtubule 
disassembly. Hyperphosphorylated tau self-assembles into paired 
helices and straight filaments, which aggregate as tangles. 

These accumulations of misfolded proteins are intracellular tan- 
gles of protein processing abnormalities. Although these endpoints 
are unlikely to represent the initial neuropathological triggers, with 
time (and increasing age), accumulation of misfolded proteins may 
result in further cellular changes to oxidative and inflammatory 
pathways, leading to energy failure, axonal transportation failure, 
synaptic dysfunction, and neuronal death. 


The genetic variants causing fAD are found in the B-amyloid pre- 
cursor protein (APP), presenilin-1 (PSEN1), and presenilin-2 
(PSEN2) genes [16]. These genes are located on chromosomes 21, 
14, and 1, respectively. Of the 30 known coding mutations in the 
APP gene, 25 result in over-production of Af, resulting in AD. Most 
of these variants are highly penetrant, that is, almost all individuals 
carrying the variant will develop AD. One variant (A673T) reduces 
Af production and is protective against AD [17]. PSEN1 and PSEN2 
contribute to the y-secretase complex, which processes APP. The 
PSEN1 and PSEN2 genes each contains ten coding exons, several 
variants of which increase the production of the longer forms of AB, 
that is, those which are more likely to misfold. 

The genetics of sAD are more complex, involving a greater var- 
iety of genes, each with lower penetrance. The apolipoprotein E 
(APOE) gene is the strongest genetic risk factor for sAD [18]. Of 
the three common alleles, £4 is associated with a higher risk of AD 
relative to £3, while £2 is associated with a lower risk relative to £3 
[19]. The three alleles are expressed unevenly in the general popu- 
lation, with 60-90% being €3 carriers, 10-20% carrying £4, and 0- 
20% £2 [20]. A single e4 allele confers a 3-fold greater risk, while 
two £4 alleles confer a 12-fold greater risk. APOE binds to AB and 
is involved on the clearance of AB from the brain. Genome-wide 
association studies (GWAS) have identified other genetic variants 
related to AD risk [21-23]. These genes affect a wide range of mech- 
anisms. Although these risk variants can be relatively common in 
the general population (3-50%), their individual contribution to AD 
risk is small. Although the specific AD-related functionality of the 
risk variants is largely unknown, likely areas of action can be iden- 
tified. Reduced expression of the clusterin gene (CLU) is associated 
with a higher risk of AD, clusterin being involved in AB clearance 
[16]. Variants in the ATP-binding cassette transporter A7 (ABCA7) 
gene that reduce gene expression are associated with an increased 
AD risk. ABCA7 deletion facilitated the processing of APP to AB by 
increasing the levels of the B-site APP cleaving enzyme 1 (BACE1) 
[16]. Several genes have been associated with the immune response 
(CR1, CD33, MS4A, TREM2). The relationship between the im- 
mune response and AD is complex, but in general, upregulation of 
the immune response is associated with a raised AD risk. A further 
constellation of AD risk genes is associated with endocytosis (BIN1, 
PICAML, CD2AP, SORL1). Endocytosis is critical to many aspects 
of neuronal health, including APP processing (BIN1), AB clearance 
(PICALM), synapse formation (CD2AP), and APP cleavage and 
APOE uptake (SORL1). Although each of these GWAS- identified 
genes contributes only a small degree of AD risk, in combination, 
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Fig. 40.1 (see Colour Plate section) Risk genes for Alzheimer's disease identifying the primary mechanism, risk conferred, and population allele 


frequency. 
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the risk increases. Although the patterns of association are com- 
plex, put simply, if APOE is included in the polygenic risk estimate, 
comparison of the extremely high and extremely low polygenic risk 
score groups can predict AD with >80% accuracy [24-26]. The range 
of risk genes is summarized in Fig. 40.1 (adapted from Karch and 
Goate [16]), giving the primary mechanism, risk conferred, and 
population allele frequency. 


That sAD is genetically heterogenous suggests it is unlikely that a 
single mechanism will fully explain the variety of protein-protein 
interactions underlying the disease. AD may best be considered 
as a multifactorial disorder—a common outcome from multiple 
neuropathologic pathways. A parsimonious and necessarily sim- 
plistic working model would be the expression of amyloid and 
tau protein folding abnormalities which not only exert direct 
neurotoxic effects, but also promote an innate immune response, 
leading to synaptic failure, neurodegeneration, and apoptosis. 
Tau and amyloid may have a synergistic impact on the immune 
response, serving to accelerate neurodegeneration. Injury to the 
neurovascular unit may increase susceptibility to AD through 
large-vessel atheroma and disruption to the blood-brain barrier. 
Finally, the systematic spread of amyloid and tau suggests that 
extracellular misfolded amyloid and tau protein may ‘seed’ protein 
misfolding in surrounding cells. 


Beta amyloid 


AD-specific initiating events include AB processing [27]. Af is an 
albumoid (albumin-like) peptide synthesized from APP. AB pep- 
tides are natural products of metabolism. AB is formed by the pro- 
cessing of APP by sequential enzymatic cleavage involving a, B, 
and y-secretase. There are six known isoforms of Af, which vary 
in length from 36 to 43 peptides [4]. The amyloid hypothesis is that 
the longer isoforms, that is, AB 242 peptides, which are more self- 
aggregating, are over-produced intracellularly throughout life in 
fAD and gradually rise extracellularly with age due to faulty clear- 
ance mechanisms in sAD. From this perspective, both fAD and sAD 
point to APP processing as being the trigger for AD. 


Transmembrane APP processing 


APP produced within the cell is cleaved, as it passes through the cell 
membrane (Fig. 40.2). Non-amyloidogenic cleavage of APP begins 
with a-secretase cleavage of APP, leading to the release of the frag- 
ment sAPPa into the intercellular space; sAPPa contributes to syn- 
apse formation and repair. sAPPa includes part of the AB sequence 
and so precludes the formation of AB. The remaining APP (in the 
form of the C83 fragment), still tethered to the cell membrane, is 
cleaved by y-secretase, releasing P3 fragment into the intercellular 
space and the APP intracellular domain (AICD) back into the cell. 
Amyloidogenic cleavage begins with B-secretase activity, leading 
to the release of a shortened fragment (sAPP) into the extracel- 
lular space, leaving the AB peptide intact. The remaining APP (in 
the form of the C99 fragment), still tethered to the cell membrane, is 
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Fig. 40.2 (see Colour Plate section) Schematic of amyloidogenic and non-amyloidogenic APP processing pathways. 


cleaved by y-secretase, releasing Af into the intercellular space and 
AICD back into the cell. y Mitochondrial mass changes occur in the 
AD brain -secretase cleavage proceeds in a number of steps, begin- 
ning with e-cleavage releasing either AB48 or AB49, which are then 
processed, leading to the release of extracellular AB [28]. 

Newly cleaved monomeric Af aggregates into oligomers, which, 
in turn, become fibrils and then plaques. Plaque formation is a dy- 
namic process, with individual oligomers aggregating and disag- 
gregating. Relatively diffuse plaques occur in normal ageing. In 
AD, plaques are A642 rich, more frequent, and more compact, con- 
sisting of a central core of amyloid fibrils. Apart from stimulating 
an inflammatory response, it is difficult to see how plaques affect 
cell function, and plaque formation may be thought of as a com- 
pensatory mechanism to isolate harmful species. That plaques are 
surrounded by dystrophic neurites and astrocytes suggests they are 
not entirely inert, and the dynamic nature of AB aggregation sug- 
gests that it is the presence of aggregating oligomers that is neuro- 
toxic and stimulates an immune response. Extracellular oligomeric 
Af has been shown to interfere with synaptic function, adversely 
affecting long-term potentiation (LTP). This, in turn, downregulates 
the translation of neurotransmitters at the synapse, leading to the 
withdrawal of dendritic spines. In the case of the hippocampus, this 
effectively inhibits new memories from forming and established 
memories being retained. 

For fAD, genetic variants that are related to AD risk increase the 
level of either AB42 or the AB42/AB40 ratio. For sAD, the key genetic 
risk factor is APOE. In the brain, most APOE is expressed by astro- 
cytes. Although its function and activity are not fully understood, 
APOE has an affinity for AB, binding to form a protein complex. 
APOE alleles vary in their Af binding ability, with E2 > E3 > E4. The 
production of AB does not appear to vary according to APOE geno- 
type. The lack of variation in production, combined with variation 
in binding, suggests that for sAD, it is the amyloid clearance mech- 
anism that is not operating efficiently [18]. This is consistent with 
evidence that in AD, CNS Af is lower, suggesting that AB is being 
retained in the brain. 


| AICD 


Intracellular APP processing 


APP in the cytosol is also processed within the cell, although this 
process is less well understood. Several studies have demonstrated 
that a- and B-secretases are present in the ER, enabling APP cleavage 
[29, 30]. As in the plasma membrane, a- and f-secretases com- 
pete with each other for intracellular APP cleavage; this competi- 
tion modulates the production of AB. It has also been shown that 
y-secretase activity is located predominantly in a specialized sub- 
compartment of the ER—the mitochondria-associated membrane 
(MAM). MAM is an intracellular lipid, raft-like structure intimately 
involved in cholesterol and phospholipid metabolism and is phys- 
ically and biochemically connected to mitochondria. The area of 
close apposition between mitochondria and MAM is substantially 
enlarged in AD. Release of AB from the ER into the cytosol and 
exposure of AB to mitochondria suggest intracellular neurotoxic 
mechanisms. 

Mitochondrial mass changes occur in the AD brain. Although 
the total amount of mitochondria may increase, in hippocampal 
neurons, there is a decline in the number of intact mitochondria and 
an increase in the number of degraded mitochondria. A case can 
be made that the brain of AD subjects show perturbed mitochon- 
drial function and a subsequent compensatory response. The bio- 
energetics evidence has led to the mitochondrial cascade hypothesis, 
where mitochondrial dysfunction initiates the preferential produc- 
tion of intracellular A642. However, the balance of evidence is in 
favour of APP processing as an initiating event and that an effect of 
mitochondrial function on intracellular Af is subsequent to the ini- 
tial production of Af oligomers. 


Tau 


Tau is more closely associated with cognitive performance than 
Af [31-33], and porcine evidence suggests tau is required for A- 
mediated toxicity [34]. Furthermore, extracellular and intracellular 
Af do not immediately explain the development of the tau-based 
neurofibrillary tangles that are characteristic of AD [35]. 


Tau is a soluble protein found abundantly in neurons and is one of 
three microtubule-associated proteins (MAPs) which promote the 
assembly and stability of microtubules in the neuron, microtubules 
being essential to axoplasmic flow which is critical for neuronal ac- 
tivity. The ability of tau to support microtubule structure is dependent 
on the level of phosphorylation [36]. Optimal phosphorylation 
occurs with the phosphorylation of around 30 of the 85 potential 
sites. Hyperphosphorylated tau has several cytotoxic effects. As the 
number of phosphorylated sites increase beyond 30, tau loses the 
ability to bind with microtubules. Hyperphosphorylated tau also 
sequesters normal tau and other MAPs from the microtubule lattice 
into the cytosol. This destabilizes the microtubule, leading to dis- 
assembly. Microtubule network dysfunction leads to loss of axonal 
transportation and neurodegeneration. Apart from affecting normal 
microtubule function, hyperphosphorylated tau is associated with 
conformational changes, leading to misfolding and increased aggre- 
gation. As with mature Af plaques, mature neurofibrillary tangles 
are relatively inert, suggesting that it is monomeric and oligomeric 
hyperphosphorylated tau that is cytotoxic, that is, it is the soluble 
species, rather than the neurofibrillary tangle itself [37]. 

A link between tau phosphorylation and APP processing in the 
cell membrane has been shown, although the full pathway is not 
clear [36]. In stem cell models, changes in tau production and tau 
phosphorylation are associated with APP processing through B- and 
y-secretase activity. Inhibition of B-secretase reduced tau levels and 
phosphorylation, while inhibition of y-secretase was associated with 
increased tau and tau phosphorylation [29]. Modulation (enhance- 
ment) of y-secretase was associated with reduced tau production. 
Limiting B-secretase function reduces the release of APP-C99, while 
limiting y-secretase function reduces the release of both APP-C99 
and APP-C83 (Fig. 40.2). Enhancing y-secretase function increases 
the release of APP-99 and APP-83, which accumulate in the cell. 
These findings suggest that tau production and tau phosphorylation 
occur as part of an APP processing pathway, which is independent 
of extracellular AB levels. How phosphorylated tau affects cell func- 
tion is largely unknown. One possible mechanism is a gradual loss 
of axonal function through the disassembly of microtubules, leading 
to an immune response and apoptosis. APOE4 is associated with 
increased tau phosphorylation [38]. Although the mechanisms are 
unclear, that APOE4 acts independently on both tau and Af opens 
the possibility that tau and AB may act synergistically to accelerate 
neurodegeneration. 


Immune response 


Several strands of evidence suggest a general upregulation of the im- 
mune response, associated with a raised AD risk. Genetic evidence 
for an immune response to plaque comes from the overexpression 
of genes (Trem2, CR1, CD33) that are associated with plaque phago- 
cytosis [39-42]. The co-location of plaque with astrocytes and other 
glial elements and the presence of inflammatory and complement 
cascade components implicate an innate and adaptive immune re- 
sponse to the presence of plaque. This response is likely to be sub- 
sequent to the triggering events surrounding APP processing but 
serves to accelerate AD pathology. Although the immune response 
may be targeted at plaque removal, by microglial phagocytosis, for 
example, it is unlikely not to affect the surrounding neurons. The 
associated increasing concentrations of reactive oxygen species and 
inflammatory cytokines, for example, will affect the function and 


CHAPTER 40 Alzheimer's disease 


integrity of the plasma membrane. As described, the increased sus- 
ceptibility to AD, associated with an upregulated immune response, 
may be largely a response to abnormal APP processsing. However, it 
may be that upregulation provides a cellular environment that also 
facilitates abnormal APP processing. 


Synaptic failure 


Although proteinopathy may initiate neurodegeneration, AD is 
functionally a reflection of synaptic failure leading to neuronal loss 
[43]. Synaptic loss is more prevalent in the vicinity of AB plaques, 
suggesting plaques are a source of synaptotoxic molecules [44]. 
Hippocampal synapses begin to decline in patients with mild cog- 
nitive impairment in whom the remaining synaptic profiles show 
compensatory increases in size. In mild AD, there is a reduction of 
about 25% in the presynaptic vesicle protein synaptophysin. With 
advancing disease, synapses are disproportionately lost, relative to 
neurons, and this loss is the best correlate with clinical symptoms of 
dementia. Experimental application and expression of AB, especially 
oligomers, impair synaptic plasticity by altering the balance between 
LTP and long-term depression (LTD) in favour of LTD. LTP refers 
to the increase in dendritic spine development, while LTD causes 
spine shrinkage and collapse, thus reducing the number of dendritic 
spines [45]. At high concentrations, oligomers may suppress basal 
synaptic transmission. From in vitro studies, AB is associated with 
disruption of the release of presynaptic neurotransmitters and post- 
synaptic glutamate receptor ion currents [46, 47]. For example, AB 
binds to a-7 nicotinic acetylcholine receptors, impairing the release 
of acetylcholine and the maintenance of LTP. Although a reduction 
in LTP is found in normal ageing, it may be that Af triggers synaptic 
deficits earlier. 


Circulatory and metabolic factors 


Vascular injury and the resulting parenchymal inflammation per- 
petuate the cycle of protein misfolding. Although cases of ‘pure’ AD 
are recognized, most AD cases are mixed, combining features of both 
AD and vascular disease [4]. This has led to an increased interest in 
the neurovascular unit—the close anatomical and functional rela- 
tionship between the cellular components of the brain [48]. AB is 
toxic to endothelial and smooth muscles cells and compromises the 
AP influx and efflux mechanisms of the blood-brain barrier [49]. 
A further factor is the reduced cerebral blood flow typically found in 
AD [50]. The extent to which this reflects reduced demand or inad- 
equate supply is moot. 

Metabolic factors focus on the insulin signalling pathway. Glucose 
intolerance and type 2 diabetes are risk factors for AD. Several 
models have been proposed to explain the increased insulin resist- 
ance underlying these associations [51]. These focus on deregulated 
insulin signalling affecting the energy available to the cell, increased 
inflammatory response, increased AB production, and comprom- 
ised synaptic transmission. However, the direction of causality in- 
volved in these associations is unclear. 


Propagation 


Mechanisms underlying the spread of AD pathology, typically 
beginning in the entorhinal cortex have focused on the prion hy- 
pothesis, which is that the process is dependent on the nucleating 
properties of misfolded protein and the transmission of misfolded 
protein between cells [52, 53]. In this model, misfolded protein acts 
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as a template for further misfolding, accelerating the misfolding pro- 
cess. If the misfolded protein is released into the extracellular space 
and endocytosed into a neighbouring cell, the misfolding process is 
transmitted independently of the receiving cell’s intrinsic ability to 
misfold the protein [54, 55]. In AD, there is evidence that both AB 
and tau can be transmitted between cells. This ‘prion’ model does not 
replace the basic trigger mechanism of APP processing. It is likely 
that populations of cells, subject to proteolytic ‘stresses’ (genetic or 
otherwise), independently produce misfolded protein and that the 
seeding of these proteins further distributes pathology. 


Significant advances in the understanding of AD pathophysiology 
and corresponding investment in the development of potential 
disease-modifying treatment have so far not led to effective aetio- 
logical therapy, with several notable failed agents. The only licensed 
treatment of AD is therefore symptomatic, with two drug classes 
available: acetylcholinesterase inhibitors (donepezil, rivastigmine, 
and galantamine) and the NMDA receptor antagonist galantamine. 
The former are licensed for mild to moderate AD, while memantine 
can be used in moderate to severe AD. The most recent guidelines 
recommend a combination between the two classes in moderate and 
severe AD cases [56]. The expected benefit is stabilization of cogni- 
tive decline for a period of 6-12 months, with, on average, nine pa- 
tients needed to be treated for one of them to experience significant 
benefit (that is, number needed to treat of 9). Despite the modest 
immediate benefits associated with these drugs, they appear to have 
some additional effects in those treated long term. In the Donepezil 
and Memantine in Moderate to Severe Alzheimer's Disease 
(DOMINO) study, Howard and co-authors examined the effects of 
continuing symptomatic treatment beyond the stage of moderate to 
severe dementia; a sample of patients on stable donepezil dose were 
randomized to: (1) continue the drug; (2) discontinue the drug; or 
(3) switch to, or add, memantine. In this study, the group that dis- 
continued symptomatic treatment had faster cognitive decline and 
worse functional outcomes [57] and were admitted to care more 
quickly [58]. These results led to the current consensus guideline 
that ‘cholinesterase inhibitors should not be stopped just because the 
point of severe dementia has been reached’ [59]. 

The transition of prodromal and subclinical AD is accom- 
panied by the emergence of a variety of behavioural and psycho- 
logical symptoms of dementia (BPSD). These include depression 
(comorbidity of AD and depression in up to 50%), psychosis, agita- 
tion, aggression, sleep disturbance, wandering, apathy, and a variety 
of socially inappropriate behaviours. Some of these symptoms af- 
fect practically all patients with dementia at some point of the illness 
and are a major cause of loss of independence, carer burden, and 
early placement into nursing care [60]. Current guidelines advocate 
the use of non-pharmacological strategies to prevent and address 
problem behaviours in dementia [61]. Pharmacological treatment 
of aggression through sedation is limited to acute settings to prevent 
injury with long-term use of dopamine antagonists, which is being 
actively discouraged due to a 3-fold elevation in risk for cerebrovas- 
cular events. Drugs for psychosis, such as risperidone, offer some 
efficacy in terms of improving recurrent agitation and psychosis 
symptoms, which warrants careful consideration of risk/benefits. 


SSRIs may be a better alternative, with evidence of citalopram and 
sertraline, in particular, for beneficial effects on agitation and psych- 
osis. Also, long-term treatment with a drug for depression should be 
a consideration where there is comorbid depression, although the 
evidence for efficacy is weak [60]. Careful risk/benefit consideration 
is also applicable here, given the higher likelihood of SSRIs causing 
hyponatraemia and gastrointestinal bleeding in the elderly. 


The care for individuals with dementia is a major societal challenge, 
with an estimation of care costs in the United States alone standing 
at $203 billion in 2013 and expected to rise to $1.2 trillion in 2050 
[63]. This is due to individuals with dementia requiring significantly 
higher levels of care than other chronic conditions. This level of 
need, as a rule, leads to care in nursing homes, with four-fifths of 
all residents suffering from dementia. Still, the majority of dementia 
patients are being looked after by their families in their homes. Such 
informal care is associated with significant carer strain, with higher 
levels of stress, less time for social interaction, and high prevalence 
of dropping out of work or otherwise missing job opportunities, 
relative to carers of other chronic conditions [63]. Carers are also 
at significantly higher risk for psychological problems (for example, 
depression risk is 3-39 times higher than age-matched controls [64]) 
and physical deterioration (carers experiencing stress have a 60% 
higher risk of mortality over 4 years, relative to those who do not re- 
port stress [65]). Informal care, however, is and will remain critical 
to the sustainability of looking after dementia. Long-term strategies 
therefore focus on reducing carer burden through complex inter- 
ventions, focusing on education, training, practical support, and 
respite. A typical intervention lasts 8-12 weeks and includes indi- 
vidual or family counselling, case management, education on strat- 
egies for managing behaviours, and environmental modifications. 
Such interventions result in improvement of carer mental health and 
increase the time to institutionalization by up to a year [66]. 


Prevention of sAD has been identified by the WHO and G8 asa key 
element in the strategy to counter the dementia epidemic. Economic 
analyses have shown that delaying the onset of the disease by even 
1 year would reduce its prevalence by up to 11%, while a delay of 
5 years would halve it [67]. 


Risk factors 


A number of risk factors for sAD have been identified. Non- 
modifiable risk factors can be used for targeting therapies in ap- 
propriate groups. The best established non-modifiable risk factors 
for sAD are age, genetics, and female sex. These are discussed in 
the relevant Epidemiology and Genetics sections of this chapter. 
Modifiable risk factors are the logical focus of prevention strategies. 
Attributing population risk estimates, that is, the proportion of the 
population for whom a risk factor has caused a disease, is particu- 
larly challenging for sAD. Apart from most risk factor studies fo- 
cusing on non-specific dementia as an outcome, rather than sAD, 
the major modifiable risk factors for sAD (lifestyle, comorbidities, 


education) are unlikely to impact sAD independently of each other. 
Nevertheless, an overall estimate is that between a third and a half of 
all sAD cases are attributable to modifiable risk factors [68]. 

Lifestyle risk factors, such as exercise, diet, smoking, excessive al- 
cohol use, and associated cardiovascular risk, are the main focus of 
prevention programmes in sAD. This focus predicates a life course 
approach to dementia prevention, as these risk factors frequently 
represent chronic exposures and exposure at some life stages may be 
more harmful than at others. 


Cardiovascular factors 


The major modifiable cardiovascular risk factors (hypertension, dia- 
betes, obesity) increase the likelihood of not only the vascular sub- 
type of dementia, but also sAD. This is due to the two conditions 
frequently sharing neurodegenerative pathologies—it is likely that 
cardiovascular disease potentiates sAD pathology through disrup- 
tion of the blood-brain barrier and atherosclerotic, inflammatory, 
and/or thrombotic mechanisms. Hypertension was identified as a 
major risk factor through cohorts such as the Goteborg Longitudinal 
Study [69] and the Honololu Asia Aging Study [70]. They demon- 
strated a strong relationship between mid-life hypertension and de- 
mentia in later life, but also that a decline in blood pressure levels 
frequently precedes the development of sAD. This had previously 
led to the distorted view that low blood pressure predisposes to sAD. 
Direct evidence of sAD-protective effects of long-term blood pres- 
sure control is currently lacking, despite significant evidence of the 
benefit of blood pressure therapies on overall mortality and mor- 
bidity [71]. Late-life diabetes has been consistently linked to both 
sAD and vascular dementia. Trials exploring the protective effect 
of strict diabetic control on cognition had conflicting results; this 
may have been due to tighter control being associated with cognitive 
benefits, but also increasing the risk of hypoglycaemic episodes, a 
known risk factor for dementia onset [72]. The evidence for up to 
50% increase in sporadic dementia risk by mid-life diabetes comes 
from health record studies, for example an American study of 8845 
patient records [73]. While obesity and associated hypercholester- 
olaemia in middle age have not been shown to directly increase the 
risk for sAD [72], it is nonetheless a rational target for prevention, 
given their established role as cardiovascular risk factors. In later life, 
perhaps counterintuitively, body mass index reduction precedes the 
development of sAD by 5-15 years [74]. Current research therefore 
demonstrates the importance of addressing cardiovascular risk fac- 
tors from a life course perspective, with emphasis on mid-life risk 
factor control. 


Alcohol and smoking 


Smoking is a major risk factor for sAD and the only one with a good 
level of evidence for the protective effect of intervention. Specifically, 
current smokers have a 50% increase in their sAD risk, relative to 
never-smokers. The effect appears to be mediated by a direct effect 
on amyloid metabolism (reduction in microglia-mediated amyloid 
clearance), as well as the well-described effect of increasing cardio- 
vascular risk. It is plausible that the true risk for sAD is underesti- 
mated due to the greatly reduced life expectancy associated with 
smoking—analyses aiming to account for the missing smokers 
support this. In addition, heavy smokers who survive to old age are 
likely to be biologically unusual individuals who have a variety of 
protective factors against cardiovascular risks that may also confer 
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protection against sAD. Importantly, smoking cessation leads to a 
reduction of sAD risk to the level of never-smokers [72], which is 
hugely encouraging for prevention programmes. 

The association with alcohol consumption is less clear. A recent 
meta-analysis demonstrated a modest protective effect for mod- 
erate drinkers vs abstainers, while heavy drinkers had comparable 
risk with abstainers [72]. This J-shaped curve, reminiscent of that 
found between alcohol and cardiovasular disease, likely reflects 
several competing mechanisms. These include the ‘sick-quitter’ ef- 
fect where higher-risk individuals stop drinking prior to the onset 
of symptoms, the underreporting of alcohol consumption at high 
levels of intake, and under-representation of high-risk individuals 
with high alcohol of intake. Although plausible protective mech- 
anisms for a protective effect of moderate alcohol on the cerebro- 
vascular system exist, population-based 30-year follow-up imaging 
evidence showed no benefits of light drinking, with hippocampal 
atrophy rising dose-dependently [75]. 


Physical exercise and cognitive stimulation 


Regular exercise is recommended as a means of reducing cardio- 
vascular, as well as broader health, risk and therefore has the poten- 
tial for indirectly reducing sAD risk. In addition, exercise improves 
cerebral perfusion, has anti-inflammatory properties, improves syn- 
aptic function, and stimulates neurogenesis [76]. It also has a social 
and cognitive element, which is protective in terms of mental health 
more broadly. A systematic review of the effects of physical effects on 
sAD risk in older adults showed a 50% sAD risk reduction in those 
exercising regularly [77]. Interpretation of these data, however, is 
limited by the possibility that lower physical activity may be part of 
the dementia prodrome, either through apathy or through broader 
functional impairment (reverse causality). 

Similarly, cognitively stimulating activities are of great interest as 
potential protective strategies. The available studies point to a con- 
sistent reduction of sAD risk in older adults who engage in intel- 
lectually challenging activities vs passive activities such as watching 
TV. These effects suffer from the same limitation as physical exercise 
where a gradual reduction in engagement with intellectual activities 
may be the first sign of incipient dementia. Some evidence, how- 
ever, exists linking intellectual activity in early and mid life and a 
reduced risk for sporadic dementia (Rush Memory and Aging study 
Project [78]). Whether the benefits of cognitive stimulation are due 
to a slowing of pathology or an improved efficiency of remaining 
function due to training is unclear. 


Psychological factors 


Psychiatric disorders are of particular interest in relation to preven- 
tion of sAD, given their high prevalence across the lifespan and the 
availability of effective psychological and pharmacological treat- 
ments. There is a robust link between depression in late life and 
the incidence of sporadic dementia, whereby having depression 
increases nearly 2-fold the risk of developing dementia [72, 79]. 
Having depression also increases the risk of transitioning from MCI 
to dementia [80]. The interpretation that late-life depression is a risk 
factor for depression was discounted recently by the 28-year-long 
follow-up Whitehall II cohort study [81]. It is instead more likely 
that low mood forms part of the dementia prodrome or that the two 
conditions share a common cause. The same study discounted a role 
for mid-life depression as a risk factor for sAD, even where it was of 
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a recurrent/chronic nature. Personality factors, schizophrenia, and 
other psychiatric factors have been investigated, but the evidence for 
a link with sAD is so far lacking [72]. While this shows that treating 
depression is unlikely to delay the onset of sAD, the well-established 
deleterious effects of depression on memory, sleep, and social func- 
tioning makes treating depression in the stage of established cogni- 
tive impairment an important clinical consideration. 


Education and occupational attainment 


Higher educational and occupational attainments have consist- 
ently been implicated as protective for developing dementia in later 
life. For education, a recent meta-analysis reported an approxi- 
mately 40% risk reduction for high vs low education attainers [72]. 
Similarly, there was an 80% increased risk for dementia in low-paid 
workers, although a significant part of the variance appeared to be 
accounted for by their level of education [72]. The protective effects 
of education may be due to prolonged period of learning, stimu- 
lating the development, or being the result, of larger brain volume or 
neural networks of greater complexity. These potential physical and 
functional factors could be related to a greater capacity to compen- 
sate the underlying sAD pathology and thus delay the onset of clin- 
ical symptoms (the ‘cognitive reserve’ theory). Equally, an inborn 
characteristic of the nervous system that is protective of sAD may 
also increase the capacity to learn at a young age and thus predispose 
to longer education. 


The importance of conducting effective primary and secondary pre- 
vention of sAD has led to the development of population-based pro- 
grammes specifically designed to translate sAD risk factor research 
into public health benefit. The clustering of lifestyle risk factors in 
sAD lends itself to complex interventions where several risk factors 
are addressed simultaneously. The Finnish Geriatric Intervention 
Study to Prevent Cognitive Impairment and Disability (FINGER) is 
one of the first such initiatives aimed at secondary prevention [82]. 
FINGER recruited non-demented individuals in the 60-77 years 
range who had an elevated dementia risk score (based on age, sex, 
education, blood pressure, BMI, cholesterol levels, and physical 
activity), as well as a minor degree of cognitive impairment. This 
therefore selected an ultra-high-risk population for dementia and 
allowed the trial of a multi-domain intervention involving changes 
to nutrition, physical activity, education, and cognitive training. 
The intervention group’s cognitive outcomes improved at 25-150%, 
compared to the control group (health advice only). Future studies 
would need to demonstrate the benefit of such interventions on 
the key public health outcome—time to dementia onset. A further 
key question to be addressed is the extent to which addressing risk 
factors in mid life results in effective primary prevention of sAD. 
However, indirect evidence of this effect is already available through 
the observed decline in the projected dementia incidence rate—a 
trend attributed to improved cardiovascular risk control [11]. 
Despite the huge societal gains that are likely to accrue from risk 
factor control, it is likely that disease-modifying treatments will be 
required to affect the global burden of dementia. The realization that 
the disease process starts decades before the first symptoms has led 
to a number of initiatives aimed at identifying the condition either 


at its preclinical (that is, a state with evidence of sAD pathology) 
or prodromal (that is, a state of minimal symptoms and sAD path- 
ology) stages. 

Development in these disease stages rests on the increased know- 
ledge of mechanisms for the development of new compounds, access 
to biomarkers for risk stratification for targeting treatments to those 
who will benefit the most, the development of more sensitive outcome 
measures allowing benefits to be more efficiently detected, and a trials- 
ready infrastructure enabling newcompounds to betested more rapidly. 
These challenges are increasingly being met through large-scale collab- 
orative platform science projects. In relation to mechanisms, these in- 
clude the UK Dementia Research Institute (https://ukdri.ac.uk/). For 
biomarker development, there is the Deep and Frequent Phenotyping 
Study [83], while for risk stratification and trials, there are the 
Dementias Platform UK (https://www.dementiasplatform.uk/), the 
European Prevention of Alzheimer’s Dementia Consortium (EPAD) 
(http://ep-ad.org/), and the American-based Global Alzheimer’s 
Platform Foundation (http://globalalzplatform.org/). Powerful digital 
technologies are also being harnessed for sAD research, including the 
Dementias Platform UK (https://www.dementiasplatform.uk/), the 
European Medical Informatics Framework (http://www.emif.eu/), 
and the Real world Outcomes across the Alzheimer’s Disease spec- 
trum for better care: Multi-modal data Access Platform (ROADMAP) 
project for the assessment of real-world evidence in sAD (http:// 
roadmap-alzheimer.org/). 

Arguably, a culture shift in the public perception of AD and de- 
mentia in general will be required before full benefit of preventative 
and treatment strategies will be realized. Moving from dementia to 
a framework of brain health would help de-stigmatize cognitive de- 
cline, empower the public to take greater responsibility for preven- 
tion, and encourage society at large to generate inclusive solutions 
for maintaining functional independence. To reimagine dementia 
care in terms of brain health centres, rather than memory clinics, 
would not be an idle reverie. 
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Arnold Pick first reported in 1892 a case of dementia with symp- 
toms of aphasia and atrophy of the frontal and temporal lobes [1]. 
By the 1920s, the clinicopathological entity named Pick’s disease had 
been established [2]. It was soon realized that many patients with 
progressive degeneration of the frontotemporal regions did not have 
classical Pick bodies. In 1994, the Lund-Manchester groups devel- 
oped a consensus to delineate the clinical and neuropathological 
criteria for what they called frontotemporal lobar degeneration 
(FTLD) [3]. The criteria encompassed each of the major clinical 
syndromes of FTLD: frontotemporal dementia (FTD) [currently 
called behavioural-variant FTD (bvFTD)], progressive non-fluent 
aphasia, and semantic dementia. These criteria have been replaced 
by the international criteria for FTD [4-14] and primary progressive 
aphasia (PPA) [5-27]. 


FTD is recognized as one of the leading causes of early-onset de- 
mentia and typically presents in people in their late fifties. The onset 
of symptoms is usually between the fourth and eighth decades, but 
cases that begin in the third and ninth decades are seen. About 10% 
of FTD cases present in patients aged less than 45 years, and about 
60% of FTD cases present in those aged between 45 and 64 years [6]. 
Approximately 30-50% of cases are familial, and 10-15% of FTD 
cases appear to be associated with an autosomal dominant pattern 
of inheritance. The prevalence of FTD in individuals between 45 
and 64 years of age is estimated to be 2-22 per 100,000 but varies 
widely, ranging from 4-31 per 100,000 in Europe [4] and 15-22 
per 100,000 in the United States [6] to 2-9.5 per 100,000 in Japan 
[5, 7]. The estimated incidence of FTD is between 2.7 and 4.1 per 
100,000 person-years, based on population studies from the United 
States and Europe [4]. Regarding the gender distribution of FTD, 
some studies have reported a significantly higher prevalence among 
men, whereas other studies have reported almost the same preva- 
lence in men and women [4]. The median survival after onset is 6.0- 
11.8 years [8-10]. In FTD subtypes, semantic dementia has longer 
median survival (12 years) [9], while FTD and motor neuron disease 
(FTD-MND) has the shortest median survival (3 years) [8, 11]. 











FTD is a neurodegenerative disorder that primarily affects the 
frontal and temporal lobes and leads to distinct clinical syndromes. 
FTD is classified into three clinical variants, according to the FTD 
consensus criteria: (1) bvFTD, which is characterized by early and 
significant changes in behaviour, emotion, personality, and execu- 
tive control; (2) non-fluent-variant PPA (nfvPPA), which is charac- 
terized by progressive deficits in speech, grammar, and word output; 
and (3) semantic-variant PPA (svPPA), which is a progressive dis- 
order of semantic knowledge and naming. Some patients present 
with features of both FTD and amyotrophic lateral sclerosis (ALS)/ 
MND (ALS/MND). In addition, behavioural and cognitive symp- 
toms of progressive supranuclear palsy (PSP) and cortico-basal syn- 
drome (CBS) often overlap with FTD. These clinical syndromes are 
closely linked to specific molecular pathologies that affect the frontal 
and temporal lobes [12] (Fig. 41.1). 

A new classification for cognitive disorders (including dementia) 
was established in the Diagnostic and Statistical Manual of Mental 
Disorders, fifth edition (DSM-5). FTD is referred to as a fronto- 
temporal neurocognitive disorder (NCD) in DSM-5. Fronto- 
temporal NCDs are subdivided into mild or major frontotemporal 
NCD, according to the severity of cognitive impairment. Fronto- 
temporal NCD is characterized by the progressive development of 
behavioural and personality changes and/or language impairments. 
According to the DSM-5 criteria, the prerequisite for a diagnosis of 
behavioural-variant frontotemporal NCD is a progressive decline 
in social cognition and/or executive abilities. In addition, three or 
more of the following behavioural features must be present: behav- 
ioural disinhibition, apathy/inertia, loss of empathy, perseverative/ 
ritualistic behaviour, and hyperorality/dietary changes. The diag- 
nosis is categorized as ‘possible’ or ‘probable’ frontotemporal 
NCD, according to the severity of cognitive and functional impair- 
ment. If there is supporting evidence from genetic mutations or 
neuroimaging, the diagnosis is described as probable; otherwise, it 
is designated as possible [13]. The behavioural variant and three lan- 
guage variants (agrammatic/non-fluent, semantic, and logopenic) 
exhibit distinct patterns of brain atrophy and some distinctive forms 
of neuropathology. The necessary criteria must be met to make a 
diagnosis of either the behavioural or the language variant, but many 
individuals present with features of both. 
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Fig. 41.1 Clinical and pathological spectrum of frontotemporal dementias. Frontotemporal dementia encompasses three canonical 
syndromes: behaviour syndrome, language syndrome, and motor syndrome. These clinical syndromes can overlap in the disease course. 


Despite recent advances in the characterization of bvFTD, 
the diagnosis of the syndrome remains challenging. Some pa- 
tients with bvFTD are misdiagnosed as having psychiatric dis- 
orders or Alzheimer’s disease (AD). In the absence of definitive 
biomarkers, clinical diagnostic criteria are more important in 
the diagnosis of bvFTD. The International Behavior-variant FTD 
Criteria Consortium (FTDC) developed revised guidelines for 
the diagnosis of bvFTD in 2011. A diagnosis of bvFTD is cat- 
egorized as ‘possible’ ‘probable, or ‘definite’ bvFTD, based on this 
hierarchical classification system, with support from radiological 
findings, genetic studies, and neuropathological data. The FTDC 
criteria have been changed by prescribing specific criteria for diag- 
nosis, while excluding patients with biomarkers for AD or other 
neurodegenerative disease [14]. 


Behavioural-variant frontotemporal dementia 


The most pronounced behavioural symptoms of bvFTD are loss 
of empathy, apathy, and disinhibition, which typically precede any 
obvious cognitive impairment. Therefore, bvFTD can be mistaken 
for depression or other psychiatric disorders in the early stages. 
In bvFTD, behavioural changes can lead to job loss and financial 
problems. The families of patients with bvFTD report high levels of 
distress from the very early stages, compared with families of pa- 
tients who have AD [15]. Most behavioural symptoms are caused by 
disruptions to the brain networks involved in emotions and social 
function. Patients with bvFTD show loss of empathy towards family 
and other people. They cannot interpret the emotional state of other 
people and show decreased responsiveness to emotions (emotional 
blunting). Family members of patients with bvFTD often describe 
their loss of empathy as selfish or self-centred. Loss of empathy is 
associated with human brain networks that include the medial 
orbitofrontal cortex (OFC), anterior temporal lobe, and anterior 
insula, and these regions match those affected by the pattern of 


atrophy characteristic of bvFTD [16, 17]. The loss of empathy seems 
to be lateralized to the right anterior temporal lobe, OFC, and an- 
terior insula [16, 18]. 

Apathy is defined as loss of interest or emotion and loss of mo- 
tivation [19], and in patients with bvFTD, this presents as reduced 
interest in work, hobbies, and hygiene that leads to a decline in per- 
sonal hygiene. Apathy is associated with atrophy of the anterior cin- 
gulate cortex (ACC) and ventromedial superior frontal cortex [20]. 
The ACC (especially the dorsal ACC) also plays an important role 
in both the motor and cognitive domains. The dorsal ACC is in- 
volved with the supplemental motor area in the planning of complex 
movements. ACC lesions can cause dramatic reductions in spontan- 
eous speech and movement (akinetic mutism) in extreme cases [21]. 
In terms of the cognitive domain, the dorsal ACC and dorsolateral 
prefrontal cortex are associated with executive functions such as set 
maintenance. 

Disinhibition is a distinctive feature of bvFTD, and the OFC 
plays an important role in modifying behaviour, depending on 
the principle of receiving rewards and avoiding punishment [22]. 
Avoiding inappropriate behaviour is a key component of successful 
social functioning. The OFC is one of the earliest regions affected in 
bvFTD, and its atrophy leads to disinhibition [22]. Dysfunction of 
the right OFC in bvFTD manifests as marked behavioural disinhib- 
ition [20], which is associated with inappropriate behaviours such 
as hugging or kissing strangers, urinating in public, or telling sexual 
jokes. Behavioural disinhibition may also manifest as impulsivity 
such as reckless driving, excessive purchasing of unnecessary items, 
or criminal behaviour such as shoplifting. Loss of insight is defined 
as a lack of awareness of mental symptoms due to frank denial or un- 
concern for the consequences [23]. Patients with bvFTD show even 
more severely diminished insight into their symptoms and overesti- 
mate their performance. Loss of insight is associated with ventro- 
medial and frontopolar prefrontal atrophy [23]. 


Alterations in eating behaviour occur in patients with bvFTD, 
including changes in appetite and food preferences. Patients with 
bvFTD tend to display sugar cravings and binge eating, often as- 
sociated with significant weight gain. Changes in food preferences 
are due to dysfunction in neural networks that include the hypo- 
thalamus, right insula, bilateral nucleus accumbens, and OFC [24]. 
Binge eating is associated with loss of function in the right ventral 
insula, striatum, OFC, thalamus, and anterior cingulate (24, 25]. In 
terms of neuropsychological testing, patients with bvFTD tend to 
show deficits in executive function, although visuo-spatial func- 
tion, and often episodic memory, is relatively spared, compared with 
AD [26]. 


Language-variant frontotemporal dementia 


PPA is a progressive language disorder that is often associated with 
disproportionate atrophy of the left frontal and temporal lobes, and 
is classified as language-variant frontotemporal NCD in DSM-5 
[13]. Language dysfunction is the main symptom for the first 2 years 
of the illness. Deficits in language production, object naming, 
syntax, or word comprehension are apparent during conversation 
or in speech and language assessments [27]. Patients with language- 
variant frontotemporal NCD present with PPA with gradual onset, 
with three subtypes being commonly described—semantic-variant, 
agrammatic/non-fluent-variant, and logopenic-variant—and each 
variant has distinctive features and corresponding neuropathology 
[27]. nfvPPA and svPPA are associated with FTD, and logopenic- 
variant PPA is associated with AD. ‘Probable’ is distinguished from 
‘possible’ frontotemporal NCD by the presence of causative gen- 
etic factors (such as mutations in the gene coding for microtubule- 
associated protein tau) or by the presence of a distinctive pattern of 
atrophy or reduced activity in frontotemporal regions on structural 
or functional imaging [13]. 


Non-fluent-variant primary progressive aphasia 


nfvPPA is characterized by non-fluent and effortful speech pro- 
duction, and agrammatism (omission or misuse of grammar) 
is usually, but not always, present. Agrammatism and effortful 
speech are core symptoms of nfvPPA [27]. Apraxia of speech 
(AOS) is also common in nfvPPA. AOS causes inconsistent speech 
sound errors such as distortions, insertions, deletions, substitu- 
tions, and transpositions of speech sounds. Word-finding difficul- 
ties are relatively mild, compared with svPPA. Single-word and 
object comprehension are relatively spared, although patients can 
have mild anomia that may be more pronounced for verbs than 
for nouns. Deficits in syntax comprehension are evidenced by im- 
pairments in sentence comprehension, initially only for the most 
difficult syntactic constructions [27]. nfvPPA is associated with 
atrophy predominantly in the left inferior frontal and insular re- 
gions, which is necessary for making a diagnosis of the imaging- 
supported non-fluent variant. Clinically, nfvPPA patients exhibit 
few behavioural changes. 


Semantic-variant primary progressive aphasia 


Patients with svPPA present with deficits in comprehension and 
word-finding abilities (anomia) due to impairments in semantic 
knowledge; however, these patients retain a fluent expressive 
speech pattern, with correct use of grammar. Anomia and single- 
word comprehension deficits are the core features [27]. These 
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deficits reflect a gradual loss of semantic knowledge, which is ob- 
served irrespective of modality. Anomia tends to be more pro- 
nounced for nouns than for verbs or pronouns. Semantic deficits 
are usually present for most categories (that is, tools, animals, and 
people), although rare cases have been described with greater, or 
even selective, deficits for people and animals. A common feature 
among svPPA patients is surface dyslexia and dysgraphia, impair- 
ments in which words with atypical spelling or pronunciation are 
regularized. Anatomically, svPPA is associated with atrophy in 
the ventral and lateral portions of the bilateral anterior temporal 
lobes, although damage is usually greater on the left side. The clin- 
ical presentation of predominantly right-sided semantic dementia 
cases is heterogenous and includes inappropriate behaviour, loss of 
knowledge about familiar faces, loss of empathy, and/or semantic 
deficits [28]. 


Motor symptoms 


In patients with FTD syndromes that overlap with MND, behav- 
ioural and cognitive symptoms can develop before, after, or simul- 
taneously with motor symptoms. Patients with FTD often present 
with parkinsonism, including bradykinesia, rigidity, tremor, and/ 
or postural instability. The prevalence of parkinsonism in FTD 
varies from 12.5% to 30% [29]. Early parkinsonism is observed 
in 16% of FTD patients [30], with bradykinesia being the most 
common manifestation in 84% of patients, followed by a parkin- 
sonian gait (71%), rigidity and postural instability (35.5%), and 
resting tremor (6.5%) [31]. PSP and CBS are two atypical par- 
kinsonian syndromes demonstrating cognitive and behavioural 
features that overlap with FTD. Patients with FTD often have con- 
comitant ALS/MND. In some instances, FTD precedes ALS by 
many years; in other instances, ALS occurs before FTD. The inci- 
dence of FTD in patients with bulbar-onset ALS is reported to be as 
high as 48% [32]. Although 10-15% of patients with FTD develop 
MND [32, 33], there is an even higher prevalence of ‘subclinical’ 
MND, with evidence from electromyography (EMG) of MND or 
mild motor signs such as fasciculations in 30-60% of patients with 
FTD [32, 33]. Among the FTD variants, MND arises frequently 
in patients with bvFTD, and less often in patients with svPPA or 
nfvPPA. 


Approximately 40% of FTD cases have a family history, and 10-15% 
of FTD cases show an autosomal dominant pattern. Mutations have 
currently been associated with bvFTD in genes encoding the fol- 
lowing proteins: chromosome 9 open reading frame 72 (C9ORF72), 
microtubule-associated protein tau (MAPT), granulin (GRN), 
transactive response DNA-binding protein 43 (TARDBP, otherwise 
knownas TDP-43), fused in sarcoma (FUS), valosin-containing pro- 
tein (VCP), and charged multivesicular body protein 2b (CHMP2B) 
[34] (Table 41.1). Mutations in COORF72, MAPT, and GRN account 
for the majority of familial FTD cases [35]. 


C9ORF72 


The C9ORF72 mutation results in a recently identified 
hexanucleotide repeat expansion on chromosome 9 [36]. The 
largest proportion of familial FTD (about 25%) is caused by 
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Table 41.1 Clinical and neuroimaging features of hereditary frontotemporal dementia 


a ee Mean age of onset Clinical presentation Atrophy pattern 



































C9ORF72 25% of familial FTD 45 years bvFTD Symmetrical frontal atrophy with temporal and 
ALS parietal involvement 
FTD-MND Thalamus and cerebellar atrophy 
Parkinsonism 
MAPT 2-11% of familial FTD 49 years bvFTD Symmetric anteromedial temporal lobe atrophy 
PPA 
CBS 
Psi 
GRN 20% of familial FTD 58 years bvFTD Asymmetric frontal, temporal, and parietal lobe 
PPA atrophy 
CBS 
PSP 
FUS 3% of familial FID-MND 46 years FTD-MND Frontal and temporal atrophy 
ALS 
VCP 1-2% of familial FTD-MND 57 years bvFTD Frontal, temporal, and parietal lobes, especially 
FTD-MND prefrontal and superior temporal cortices 
nclusion body myopathy 
Paget's disease of the bone 
TARDP 1.3% of familial FTID-MND 63 years FTD-MND Frontal and temporal atrophy 
ALS 
CHMP2B 1% of familial FTD-MND 58 years bvFTD Frontal and temporal atrophy 
FTD-MND 
TBK1 0.5-4% of ALS 67 years FTD-MND Asymmetric anterior temporal atrophy and frontal 
ALS involvement 
nfvPPA 
svPPA 
CBS 


pathogenic expansions of C9IORF72 [35]. Cases with C9ORF72 
mutations are characterized by underlying TDP-43 pathology [36]. 
The most common clinical phenotypes associated with C9ORF72 
mutations are bvFTD, ALS, or both. C9ORF72 mutations are the 
most common genetic cause of familial FTD and ALS [37]. Thirty 
to 50% of patients with C9ORF72 expansions present with hallucin- 
ations and delusions [36]. 


MAPT 


The microtubule-associated protein tau is encoded by MAPT and 
plays an important role in microtubule assembly and stabilization 
in neurons. MAPT mutations account for between 2% and 11% of 
familial FTD cases [34]. An MAPT mutation was identified as the 
first causative genetic mutation in several families with FTD and as- 
sociated parkinsonism in 1998 [38]. MAPT mutations are associated 
with an underlying tauopathy. Clinically, cases with MAPT muta- 
tions usually present with byFTD and may include parkinsonism. 
PPA syndromes, including nfvPPA and svPPA, can also be seen in 
patients with MAPT mutations [39]. Variability in the normal haplo- 
types of MAPT has also been associated with an increased risk for 
the development of the syndromes PSP and CBS that overlap with 
FTD [40]. 


GRN 


GRN encodes granulin, a secreted protein that may function in in- 
flammation, wound repair, and cell cycling [41]. The neuropathology 
of patients with GRN mutations is characterized by tau-negative, 
TDP-43-positive inclusions. GRN mutations account for up to 20% 
of familial FTD cases and are occasionally seen in sporadic cases 


[42]. GRN mutations are found in 24.7% of FTLD-TDP cases [41] 
and are associated with an earlier age of symptom onset and an 
earlier age of death, compared with FTLD-TDP patients without 
GRN mutations [41]. The clinical presentation of patients with GRN 
mutations is bvFTD, with nfvPPA or CBS seen less frequently [34]. 
Episodic memory deficits occur in 10-30% of those with GRN mu- 
tations and may lead to the clinical diagnosis of an amnestic variant 
of MCI, with parietal deficits similar to AD [42]. GRN mutations are 
seldom seen in association with MND [41]. 


Others 


Mutations in the genes encoding five other proteins, including 
VCP, CHMP2B, TARDP, FUS, and TBK-1, have been identified in 
a minority of cases [34]. VCP mutations result in a rare syndromic 
combination of bvFTD with inclusion body myopathy and Paget's 
disease of bone (known as IBMPFD). Pathologically, VCP muta- 
tions have TDP-43 type D pathology [43]. CHMP2B mutations are 
a genetic cause of FTD linked to chromosome 3 (FTD-3), which 
was originally identified in a large Danish family. The phenotype of 
CHMP2B mutations is usually a behavioural syndrome similar to 
bvFTD, although several cases with FTD-MND have been described 
[44]. TARDBP mutations on chromosome 1 have been reported in 
FTD and FTD-MND cases [45], and are found in 5% of those with 
familial ALS [42]. FUS mutations were originally described in fa- 
milial ALS, accounting for 3% of cases [46]; however, most FTD 
cases with FUS pathology are sporadic and do not have FUS mu- 
tations [47]. Finally, recent studies suggest that loss-of-function 
mutations in TBK1 (encoding TANK-binding kinase 1) can cause 
FTD-MND. In addition, svPPA and nfvPPA can be manifestations 


of TBK1 mutations and should be considered in patients with PPA 
associated with ALS. 

Semantic and non-fluent aphasic variants, secondarily associ- 
ated with ALS, are predominant FTLD phenotypes in TBK1 carriers 
[48, 49]. 


Pathology 


To distinguish the neuropathology from the clinical diagnosis, the 
term FTD is used for the clinical syndrome, whereas the term FTLD 
is used when referring to the neuropathological classification [50]. 
AD pathology can be responsible for bvFTD syndrome, although if 
the international research criteria for this disorder are applied care- 
fully, this is rare [51]. One pathological finding of FTLD is regional 
atrophy of the frontal and temporal cortices, with neuronal loss and 
microvacuolation predominantly involving the superficial cortical 
layers II and III [52]. FTLD is a neuropathologically heterogenous 
disorder, which can be divided into two major subtypes, according to 
the characteristic pattern of abnormal protein deposition: FTLD with 
tau-positive inclusions (FTLD-tau) subtype and FTLD with ubiquitin- 
positive and TDP-43-positive subtype. FTLD-FUS is seen in a small 
minority of tau-negative, ubiquitin-positive cases [12] (Table 41.2). 


FTLD-tau 


FTLD-tau accounts for 36-50% of all cases of FTLD, according to 
different pathological series, and varies widely across the different 
FTD syndromes [35]. The defining characteristic of FTLD-tau is 
the accumulation of filamentous aggregates of pathological tau in 
neurons, astrocytes, and oligodendrocytes, which are collectively 
known as tauopathies (Fig. 41.2a—c). Primary tauopathies include 
Pick’s disease, FTD with MAPT mutations (FTDP-17), PSP, CBD, ar- 
gyrophilic grain disease (AGD), and neurofibrillary tangle dementia 
(NTD). The most common primary tauopathy is CBD, accounting 
for 35% of cases, followed by PSP with 31%, Pick’s disease with 30%, 
and AGD accounting for the remaining 4% of all cases of FTLD- 
tau [12]. Pick’s disease is characterized by the presence of Pick 
bodies, cytoplasmic inclusions composed primarily of three-repeat 
isoforms of abnormal tau protein (Fig. 2a). PSP, CBD, and AGD are 
tauopathies with predominantly four-repeat tau isoforms. There are 
very good clinicopathological associations between FTLD-tau as a 
group, particularly for PSP. nfvPPA is usually a primary tauopathy, 
with CBD accounting for approximately 50% of cases with PSP 
and Pick’s disease also seen [12, 53]. At our institution, University 


Table 41.2 Correlations between clinical presentation, pathology, 
and genetics in frontotemporal dementia 


Common clinical presentation | Associated gene 
mutation 


FTLD-tau bvFTD, nfvPPA, svPPA, CBS, PSP. MAPT 
FTLD-TDP TypeA POD nfvPPA PGRN 
TypeB bvFTD, FID-MND C9ORF72 
TypeC _ svPPA, bvFTD 
TypeD IBMPFD VCP 
FTLD-FUS bvFTD, FID-MND FUS 
FTLD-UPS bvFTD, FID-MND CHMP2B 
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of California, San Francisco, approximately one-third of all byFTD 
cases are due to tau. 


FTLD-TDP 


TDP-43 is a ubiquitously expressed nuclear protein that functions 
as a repressor of transcription. It also regulates gene expression and 
splicing. FTLD-TDP is represented in about 50% of all cases of FTD 
[35]. TDP-43 pathology is seen in sporadic and familial cases of 
FTLD involving mutations in the genes for GRN and VCP, and in 
FTD with or without MND and the CO9ORF72 hexanucleotide re- 
peat, as well as familial and sporadic cases of ALS [52]. Four major 
subtypes of FTLD-TDP are recognized (types A, B, C, and D), based 
on the pattern of cytoplasmic or intranuclear pathology [54] (Fig. 
2d) (Table 41.2). 


FTLD-FUS 


FTLD-FUS accounts for about 10% of all bvFTD cases [35, 55]. The 
majority of tau-negative/TDP-43-negative, ubiquitin-positive FTLD 
cases have positive immunohistochemical staining for the FUS pro- 
tein, thus distinguishing a third category of FTLD neuropathology 
that includes the categories formerly called atypical FTLD with ubi- 
quitinated inclusions (aFTLD-U), basophilic inclusion body dis- 
ease (BIBD), and neuronal intermediate filament inclusion disease 
(NIFID) [55]. This last entity is characterized by sporadic, early- 
onset (<40 years) bvFTD or FTD-MND [35]. 


Others 


The FTLD-other category is reserved for diseases in which the major 
protein associated with the disease entity remains unknown [12]. 
FTLD without inclusions (FTLD-ni) is diagnosed when there is 
histological evidence of frontotemporal neuronal loss and gliosis, 
but no presence of any inclusions by immunostaining. FTLD 
with immunohistochemistry against proteins of the ubiquitin 
proteosomal system (FTLD-UPS) is diagnosed when there are ubi- 
quitin or p62 immunoreactive inclusions that are negative for tau, 
alpha-synuclein, alpha-internexin, TDP-43, and FUS [12]. The ma- 
jority of FTLD-UPS cases are associated with CHMP2B mutations 
[12, 44]. 


Investigations 


Neuroimaging 

Structural MRI shows distinct patterns of atrophy in FTD. Brain at- 
rophy in FTD predominantly affects the frontal or temporal lobes, 
and atrophy of the ACC and frontal insular cortex are particularly 
characteristic of FTD, while medial and posterior temporo-parietal 
and occipital lobe atrophy is indicative of AD, rather than FTD [56]. 
The genetic forms of FTD have their own typical patterns of atrophy. 
GRN mutations are more likely to show strongly asymmetrical at- 
rophy, predominantly affecting one hemisphere and involving the 
inferior frontal, temporal, and inferior parietal lobes [34]. MAPT 
mutations are associated with a more symmetrical pattern of at- 
rophy, localized predominantly in the anterior temporal lobes and 
also involving the OFC [34]. With C9ORF72 mutations, thalamic, 
and sometimes cerebellar, atrophy is a distinguishing feature, and a 
slightly more generalized pattern of atrophy is evident than with the 
other genetic forms of FTD. 
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Fig. 41.2 (see Colour Plate section) Histological features of FTLD-tau and FTLD-TDP (a) Pick’s disease. Pick bodies (lower inset) in the inferior 
temporal gyrus. (b) Cortico-basal degeneration. Astrocytic plaque (circle) and neuronal cytoplasmic inclusions (arrow and upper inset) in the precentral 
gyrus. (c) Progressive supranuclear palsy. Tufted astrocyte in the sensorimotor cortex. (d) FTLD-TDP type B. Neuronal cytoplasmic inclusions (arrow and 
lower inset) and neuropil threads (arrowheads and upper inset) in the middle frontal gyrus. Immunostains are phospho-tau (a-c) and TDP-43 (d). All 


scale bar represents 10 um. 


PET imaging is also useful for distinguishing AD and FTD. In FDG- 
PET studies, a pattern of hypometabolism in the frontal, anterior 
temporal, and anterior cingulate cortices characterizes FTD, while 
hypometabolism in the temporoparietal and posterior cingulate cor- 
tices characterizes AD [57]. Amyloid imaging may also be useful to 
distinguish FTD from AD. The novel PET tracer ''C-PiB selectively 
binds to amyloid plaques and identifies cortical lesions affected by AD 
pathology [58]. A variety of tau tracers are being tested in patients with 
AD [59]. Tau imaging could potentially differentiate between AD, 
FTLD-tau, and FTLD with ubiquitinated inclusions, although at least 
with the svPPA form of FTD where TDP-43 is usually the underlying 
pathology, tau imaging markers have proven disappointing. 

Functional connectivity network mapping using resting state 
fMRI is a novel technique for identifying specific brain networks. 
In patients with bvFTD, intrinsic connectivity within the salience 
network is decreased, while increased activity is seen in the default 
mode network. By contrast, the opposite pattern of intrinsic con- 
nectivity is seen in AD [60]. 


In vivo biomarkers 


To date, no serum or CSF biomarkers have been identified that dis- 
tinguish patients with FTD from normal controls. Similarly, there 


is no biomarker that enables the differentiation of FTD subtypes. 
By contrast, biomarkers that help to differentiate AD from FTD are 
well accepted. tau, phosphorylated tau (pTau181), and AB42 in com- 
bination help to differentiate FTD from AD [61]. AB42 in the CSF 
is reduced, and tau and pTau181 are increased in AD, compared 
with FTD. Combined CSF analysis of the ratios of AB42/AB40 [62], 
total tau/Ab42, or phosphorylated tau/AB42 [63] can be used to im- 
prove the diagnostic accuracy and enable AD to be differentiated 
from FTD. 


Differential diagnosis 


A careful history, which includes the progression of behavioural 
changes and family and social history, is one of the most effective 
ways to distinguish FTD from other dementias. Performance on 
neuropsychological testing, laboratory studies, and neuroimaging 
also help to differentiate FTD from other causes of dementia such as 
inflammation or infectious disease. Among the subtypes of FTD, the 
differential diagnosis of early bvFTD is the most challenging. The 
possibility of AD needs to be considered, especially the behavioural/ 
dysexecutive variant that also affects the frontal lobes. Some 10-40% 


of patients clinically diagnosed with bvFTD have subsequently been 
found to have AD pathology on amyloid PET or post-mortem evalu- 
ations [64], although our recent studies suggest that bvFTD diag- 
nosed in an expert centre is almost never due to AD pathology. 
Patients with behavioural variant AD showed worse memory deficits 
than those with bvFTD but did not differ from patients with typical 
AD, while their executive function was more impaired, compared to 
patients with bvFTD and typical AD [64]. 

Another important differential diagnosis is late-onset psychi- 
atric disease. In the early stage of bvFTD, behavioural changes, 
including disinhibition, apathy, and compulsive behaviour, can 
occur in the absence of neurological signs or cognitive impair- 
ment. These behavioural changes in bvFTD can overlap with spe- 
cific psychiatric disorders. The initial diagnosis in 6% of patients 
with bvFTD is schizophrenia, schizoaffective disorder, bipolar 
disorder, depression, or an unspecified psychotic state [65]. A few 
forms of genetic bvFTD have been associated with psychotic symp- 
toms. Patients with byFTD and C9ORF72 mutations often exhibit 
psychosis as a dominant presenting problem, resulting in initial 
diagnoses of obsessive-compulsive disorder, schizophrenia, or bi- 
polar disorder [36]. Patients with bvFTD and GRN mutations also 
present with psychiatric manifestations, including bulimia, per- 
sonality changes, sexual disinhibition, ritualistic behaviours, and 
paranoia [66]. 

There is a subset of bvFTD patients in whom disease progres- 
sion following clinical diagnosis does not appear to occur despite a 
clear history from caregivers of initial personality and behavioural 
changes. These patients are said to suffer from bvFTD phenocopy 
syndrome [67]. Patients with phenocopy syndrome should be distin- 
guishable because they do not show a functional decline or imaging 
changes [14]. As the long-term prognosis is good, it seems less 
likely that phenocopy syndrome is caused by a neurodegenerative 
disorder [14]. Recent work suggests that some phenocopy subjects 
carry a genetic form of bvFTD [68]. 


Treatment and care 


Currently, no disease-modifying treatments can alter the course of 
FTD. Therefore, pharmacological treatments have focused on the 
modulation of behavioural and cognitive symptoms of FTD [35, 69]. 
Recent randomized controlled trials have shown that cholinesterase 
inhibitors, galantamine, and memantine, which are effective for 
treating cognitive or behavioural symptoms in AD patients, have no 
efficacy in patients with FTD [69, 70]. Some drugs for depression, 
including SSRIs and trazodone, have shown efficacy in improving 
the behavioural symptoms of bvFTD such as disinhibition, depres- 
sion, compulsive behaviour, and carbohydrate cravings [35, 70]. 
A few studies have shown that olanzapine is effective for aggression, 
agitation, and psychosis, although the adverse events associated 
with dopamine antagonists include parkinsonism [69, 70]. 

To manage the behavioural symptoms of bvFTD, environ- 
mental modification and education for caregivers are also neces- 
sary. Educating caregivers about the best non-pharmacological 
behavioural strategies, identification of triggers for problematic be- 
haviours, and increasing their coping skills have shown success in re- 
ducing caregiver stress and improving caregiver tools for managing 
behavioural symptoms [70, 71]. Resources, such as The Association 
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for Frontotemporal Degeneration, and the development of strategies 
to maintain emotional and physical safety have been shown to min- 
imize caregiver burden [70]. 
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Prion disease 
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Human prion diseases, also known as subacute spongiform en- 
cephalopathies, have been traditionally classified into Creutzfeldt 
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matter), astrocytic proliferation, and neuronal loss, accompanied by 
PrP deposits which may form amyloid plaques. 





Jakob disease (CJD), Gerstmann-Straussler syndrome (GSS) (also 
known as Gerstmann-Straussler-Scheinker disease), and kuru, 
although this has largely been superseded by aetiological categor- 
ization. Although rare, with an annual incidence of 1-2 per million 
worldwide (accounting for 1 in 5000 deaths in the UK), remark- 
able attention has been focused on these diseases. This is because 
of the unique biology of the transmissible agent, or prion, and also 
because bovine spongiform encephalopathy (BSE), an epidemic 
bovine prion disease, was transmitted to humans as variant CJD 
(vCJD), leading to fears of a significant public health threat through 
dietary exposure. 

The transmissibility of the human diseases was first demonstrated 
with the transmission of kuru and then CJD in 1966 and 1968, re- 
spectively, [1, 2] by intracerebral inoculation of brain homogenates 
into chimpanzees. Transmission of GSS followed in 1981. The proto- 
typic prion disease is scrapie, a common naturally occurring disease 
of sheep and goats, recognized in Europe for over 200 years and pre- 
sent in sheep flocks of many countries. Scrapie was demonstrated 
to be transmissible by inoculation in 1936 [3], and the recognition 
that kuru, and then CJD, resembled scrapie in its histopathological 
appearances led to the suggestion that these diseases may also be 
transmissible [4]. Kuru reached epidemic proportions among the 
Fore linguistic group in the Eastern Highlands of Papua New Guinea 
and was transmitted by consumption of human tissues at mortuary 
feasts. Since the cessation of cannibalism in the 1950s, the disease 
has declined, but a few cases were still occurring into the 2010s as 
a result of long incubation periods, which may exceed 50 years [5]. 

The term Creutzfeldt-Jakob disease was introduced by Spielmeyer 
in 1922, bringing together the case reports published by Creutzfeldt 
and Jakob. Several of these cases would not meet modern diagnostic 
criteria for CJD; indeed, it was not until the demonstration of trans- 
missibility allowing diagnostic criteria to be refined that a clear diag- 
nostic entity developed. Prion diseases of both humans and animals 
are associated with the accumulation in the brain of disease-related 
isoforms of a host-encoded protein known as prion protein (PrP), 
and all share common histopathological features: the classical triad 
of spongiform vacuolation (affecting any part of the cerebral grey 








According to the widely accepted ‘protein-only’ hypothesis, prions are 
devoid of significant nucleic acid and consist of multichain assemblies 
of misfolded, host-encoded PrP, a cell surface glycoprotein expressed 
in most tissues, but at its highest levels in the CNS. Cellular PrP (PrP°) 
is present in all vertebrates and highly conserved in mammals. Prions 
are thought to propagate by acting as a template to recruit PrP mono- 
mers in an autocatalytic process of seeded protein polymerization and 
fission, mimicking propagation of a biological pathogen. Biophysical 
experiments established that PrP‘, rich in a-helical structure, needs to 
largely unfold in order to adopt the B-sheet-rich structure of the infec- 
tious amyloid form, arguing that prions propagate by recruitment of 
at least partially unfolded PrP. Disease-related PrP isoforms have been 
classically referred to as PrP“ (for the scrapie form of PrP), which was 
originally defined by its partial resistance to proteolytic digestion. 
However, it is now established that there are multiple disease-related 
forms of PrP, some of which are protease-sensitive, and the term PrP* 
should not be used synonymously with prion infectivity. 

Human prion diseases have three distinct aetiologies: (1) auto- 
somal dominant inheritance, whereby coding mutations in the PrP 
gene (PRNP) result in PrP© predisposed to form seeds; (2) spor- 
adic CJD, following the spontaneous production of seeds as a rare 
stochastic event and representing the large majority of CJD cases; 
and (3) acquired by environmental exposure to prions. While this 
aetiological triumvirate was thought unique, it is now being widely 
considered whether this is relevant to much more common diseases. 
Notably, the principal neurodegenerative diseases are all associated 
with accumulation of misfolded proteins; they also occur primarily as 
sporadic conditions, but with rare inherited forms generally associ- 
ated with mutations in the genes encoding or processing the relevant 
accumulating proteins or peptides. Recent data are challenging the 
assumption that these conditions do not have acquired forms, with 
a developing interest in understanding whether the so-called ‘prion- 
like’ mechanisms are involved, notably in Alzheimer’s disease [6]. 

The human PrP gene (PRNP) is a single-copy gene located on the 
short arm of chromosome 20. A turning point in understanding 


human prion diseases was the identification of mutations in PRNP 
in familial CJD and GSS in 1989. The first mutation to be identified 
in PRNP was ina family with CJD and constituted a 144-bp insertion 
into the coding sequence [7]. A second mutation was reported in 
two families with GSS, and genetic linkage was confirmed between 
this missense variant at codon 102 and GSS, confirming that GSS 
was an autosomal dominant Mendelian disorder [8]. Current evi- 
dence suggests that around 15% of prion diseases are inherited and 
over 40 coding mutations in PRNP are now recognized. 

A common PrP polymorphism at residue 129, where either me- 
thionine or valine can be encoded, is a key determinant of genetic 
susceptibility to acquired and sporadic prion diseases—the large 
majority of which occur in homozygous individuals [9, 10]. This 
protective effect of PRNP codon 129 heterozygosity is also seen in 
some of the inherited prion diseases. 


Prion strains and transmission characteristics 


Although devoid of a nucleic acid genome, prions exist as multiple 
strains which can be serially propagated in laboratory animals and 
produce distinct patterns of disease. Strains are associated with spe- 
cific types of PrP*S and it is thought they represent structurally distinct 
seeds that are able to recruit host PrP“ into their discrete polymeric 
forms. Understanding how a protein-only infectious agent could en- 
code such phenotypic information has been of considerable biological 
interest and represents a non-Mendelian form of transmission. 

Prion strain diversity appears to be encoded by differences in PrP 
conformation and pattern of glycosylation [11]. Molecular strain 
typing approaches, based on these characteristics, have allowed 
the identification of multiple types among CJD cases. Two classi- 
fications are in use; no internationally agreed classification has yet 
emerged, and it is likely that additional PrP* types or strains will 
be identified [12, 13]. According to the London classification, four 
main types of PrP* are recognized—sporadic and iatrogenic CJD 
being of PrP* types 1-3, while all variant CJD cases are associated 
with a distinctive type 4 PrP* [11, 12]. 

Transmission of prion diseases between different mammalian 
species is limited by a so-called ‘species barrier’. Early studies of the 
molecular basis of the species barrier argued that it principally res- 
ided in differences in PrP primary structure between the inoculating 
species and the host. Transgenic mice expressing hamster PrP were, 
unlike wild-type mice, highly susceptible to infection with hamster 
prions [14]. That most sporadic and acquired CJD occurred in in- 
dividuals homozygous at PRNP polymorphic codon 129 supported 
the view that prion propagation proceeded most efficiently when 
interacting PrP* and PrP were of identical primary structure [12]. 
However, it has been long recognized that prion strain type affects 
ease of transmission to another species. Interestingly, with BSE 
prions, the strain component to the barrier seems to predominate, 
with BSE not only transmitting efficiently to a range of species, but 
maintaining its transmission characteristics, even when passaged 
through an intermediate species with a distinct PrP gene [15]. The 
term ‘transmission barrier’ is therefore preferable [15]. Both the PrP 
amino acid sequence and strain type affect the three-dimensional 
structure of glycosylated PrP, which will presumably, in turn, affect 
the efficiency of the protein-protein interactions thought to deter- 
mine prion propagation. 

Mammalian PrP genes are highly conserved. Presumably only a 
restricted number of different PrP** conformations (that are highly 
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stable and can therefore be serially propagated) will be permissible 
thermodynamically and will constitute the range of prion strains 
seen in mammals. While a significant number of different PrP* con- 
formations may be possible among the range of mammalian PrPs, 
only a subset of these would be allowable for a given single mamma- 
lian PrP. Substantial overlap between the favoured conformations 
for PrP* derived from species A and species B might therefore result 
in relatively easy transmission of prion diseases between these two 
species, while two species with no preferred PrP* conformations 
in common would have a large barrier to transmission (and indeed 
transmission would necessitate a change of strain type). According 
to such a conformational selection model [16] of a prion transmission 
barrier, BSE may represent a thermodynamically highly favoured 
PrP* conformation that is permissive for PrP expressed in a wide 
range of different species, accounting for the remarkable promis- 
cuity of this strain in mammals. Contribution of other components 
to the transmission barrier is possible and may involve interacting 
co-factors which mediate the efficiency of prion propagation, al- 
though no such factors have yet been identified. 

Additional data have further challenged our understanding of 
transmission barriers [17], the assessment of which has relied on the 
development of a clinical disease in inoculated animals. However, 
it is now clear that subclinical prion infections are sometimes estab- 
lished on prion inoculation of a second species [18]. Such animals 
harbour high levels of prion infectivity but do not develop clinical 
disease during a normal lifespan. The existence of such subclinical 
carrier states of prion infection has important potential animal and 
public health implications and argues against direct neurotoxicity 
of prions. Indeed, recent studies have demonstrated uncoupling of 
prion propagation and neurotoxicity [19]. 

The transmission barrier between cattle BSE and humans cannot 
be directly measured but can be modelled in transgenic mice ex- 
pressing human PrP‘, which produce human PrP* when chal- 
lenged with human prions. While these transgenic mouse models 
have been able to faithfully propagate human prion strains [11, 
20, 21] and recapitulate the characteristic neuropathology of vCJD 
[22], there are important caveats in extrapolating from such animal 
models to human susceptibility. However, these studies have found 
a much higher infection rate in transgenic mice expressing human 
PrP M129 than mice expressing human PrP V129 when challenged 
with either BSE or vCJD prions, and demonstrated that BSE prion 
infection can produce disease phenotypes resembling sporadic CJD 
and also novel prion strain phenotypes. Most recently, these studies 
have argued that the vCJD phenotype may only be expressed in the 
presence of the M form of human PrP [23]. While implying that only 
those humans expressing PrP M129 may develop the vCJD pheno- 
type, this does not mean that VV individuals are completely resistant 
to BSE prion infection—but rather that if infected, they would show 
a different phenotype [23]. Modelling of susceptibility of the MV 
genotype suggests that several different phenotypes may be possible 
when infected with BSE or vCJD prions [24]. 





Clinical features and diagnosis 


Human prion diseases can be divided aetiologically into inherited, 
sporadic, and acquired forms, which largely share one or more 
core clinical features: progressive cognitive deterioration, ataxia, 
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myoclonus, and pyramidal or extra-pyramidal motor signs. Focal 
cognitive deficits (for example, executive or memory impairment) 
may present in isolation for weeks or months, as can isolated ataxia. 
There is inevitably progression to global cognitive dysfunction, and 
the common pre-terminal clinical state is akinetic mutism. 

Nevertheless, there is wide phenotypic variability both between and 
within aetiological groups. Indeed, kindreds with inherited prion dis- 
ease that segregate the same mutation can manifest remarkably di- 
verse clinical presentations. Approximately 85-90% of human prion 
disease cases in the UK are sporadic, and psychiatric symptoms in the 
form of agitation, anxiety, hallucinations, or depression are present 
in the majority early in disease course. A subset of patients develop a 
more discrete psychiatric syndrome, such as florid psychosis, and may 
first present to psychiatric services without common motor features. 

Broadly, prion disease should be considered in all cases of early- 
onset or rapidly progressive dementia or ataxia. Significant clinical 
overlap exists with other neurodegenerative syndromes, including 
Alzheimer’s disease, dementia with Lewy bodies, and fronto- 
temporal dementia. A definite diagnosis can only be made via 
neuropathological examination, commonly post-mortem (or with a 
confirmed PRNP mutation in inherited prion disease). However, the 
rapid advancement in diagnostic markers, in particular diffusion- 
weighted MRI and CSF prion seeding assays for sporadic prion dis- 
ease, coupled with the availability of PRNP genotyping, now permits 
accurate ante mortem diagnosis in the large majority of cases. Brain 
biopsy is now rarely required and is usually reserved for cases with 
highly atypical features or a strong suspicion of a treatable mimic. 
The key clinical features and investigations for the diagnosis of prion 
disease are given in Box 42.1. 


Sporadic CJD 


The onset is usually in the 45- to 90-years age group, peaking around 
65 years. Sporadic CJD (sCJD) classically presents with rapidly pro- 
gressive multifocal dementia, myoclonus, and gait ataxia. The con- 
dition typically progresses over weeks to akinetic mutism and death, 
often in 2-3 months; around 70% of cases die in under 6 months 
and 90% by 1 year. Prodromal features such as headache, fatigue, 
or weight loss are reported in about a third of cases; the other key 
features of sCJD include pyramidal and extra-pyramidal signs and 
visual signs. For case classification for epidemiological purposes, the 
appropriate clinical picture, in combination with at least one typical 
investigation finding (MRI, EEG, or CSF), would allow ante mortem 
classification as probable sCJD, without histological confirmation 
(Box 42.1). Probable CJD is almost always (>95%) confirmed as def- 
inite if autopsy is performed. 

Routine blood investigations are usually within the normal range 
and, even when abnormal, possess no particular value diagnostic- 
ally. MRI brain with diffusion-weighted sequences has emerged as 
the most sensitive non-invasive diagnostic tool for sCJD [25]; sCJD 
patients typically demonstrate restricted diffusion in the basal gan- 
glia and thalami and cortical ribboning, but the regional combin- 
ations vary from case to case (Fig. 42.1). Signal hyperintensity on 
T2-weighted and FLAIR sequences in the corresponding regions 
may also be present, but this is less striking. CSF examination is es- 
sential to exclude other causes of rapid cognitive decline, and both 
cell count and routine biochemistry remain normal in sCJD cases. 


Box 42.1 Diagnosis of prion disease 


Sporadic CJD (MRI-CJD diagnostic criteria [25]) 
. Clinical signs (rapidly progressive dementia* with additional features) 
— Myoclonus. 
— Cerebellar or visual signs. 
— Pyramidal or extra-pyramidal signs. 
— Akinetic mutism. 


Il. Tests 

— Periodic sharp waves complexes on EEG. 

— 14-3-3 protein detected in CSF. 

— High signal abnormalities in caudate nucleus/striatum, or at 
east two cortical regions (temporal/parietal/occipital), either in 
diffusion-weighted imaging (DWI) or fluid attenuation inversion 
recovery (FLAIR) MRI sequences. 

— Positive RT-QuIC in CSF or other tissues 
Probable sCJD 
e Rapidly progressive dementia and two of | and at least one of Il. 


Progressive neurological syndrome and positive RT-QuIC in CSF or 
other tissues 








Possible sCJD 
e Rapidly progressive dementia and two of | and duration of less than 
2 years. 


latrogenic CJD 


e Progressive predominantly cerebellar syndrome and behavioural 
disturbance in a recipient of human pituitary hormones (growth 
hormone or gonadotrophin). Limb dysaesthesiae is also a frequent 
feature. 

e Probable CJD with exposure to a recognized iatrogenic CJD risk 
factor: human pituitary hormones; human dura mater graft; corneal 
graft from a recipient with definite/probable human prion disease; 
exposure to neurosurgical instruments used in a case with definite/ 
probable human prion disease. 

e Incubation periods may be over 40 years from exposure. 

e MRI findings are similar to those seen in sporadic CJD; EEG periodic 
sharp waves are rarely seen. 

e CSF 14-3-3 protein can be detected in approximately half of cases; 
RTQuIC can also be positive, but sensitivity is much reduced, com- 
pared with sporadic CJD. 





Variant CJD 

e Progressive neuropsychiatric syndrome: depression, anxiety, with- 

drawal, delusions. 

e Peripheral sensory symptoms (pain or dysaesthesiae), cerebella 

ataxia, and myoclonus/chorea/dystonia often precede dementia. 

e Mostly young adults, with a longer disease course (>6 months). 

e EEG: non-specific slow waves, CSF 14-3-3 may be elevated or 

normal. 

e MRI: pulvinar sign present in approximately 90% (particularly FLAIR 
sequence) but may be a late feature and not pathognomonic. 

e Tonsil biopsy: characteristic PrP immunostaining and PrP* on 
western blot. 


a 








Inherited prion disease 

e Varied clinical syndromes between and within kindreds: consider in 
all pre-senile dementias and ataxias, irrespective of family history. 

e PRNP analysis: diagnostic, codon 129 genotype may predict age at 
onset in pre-symptomatic testing. 

"This is typically 6 months or less, but there is high variability; approximately 10% 


have a duration of >2 years. No other cause identified, including no pathogenic PRNP 
mutations. 


Incorporates data from Lloyd, S., Mead, S., & Collinge, J. (2011). Genetics of prion dis- 
ease. In Prion Proteins (pp. 1-22). Springer Berlin Heidelberg. 
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Fig. 42.1 Typical MRI features of sporadic CJD (sCJD), variant CJD (vCJD), and iatrogenic CJD. (a) Diffusion-weighted (DW) imaging showing the basal 
ganglia, thalamic, and hippocampal tail increased signal in sCJD, with corresponding restriction (b) in apparent diffusion coefficient (ADC) sequence. 
(c) and (d) DW and ADC images in sCJD, showing cortical ribboning and restricted diffusion. (e) FLAIR image in vCJD showing the pulvinar sign 
(thalamic high signal more prominent than in the basal ganglia). (f) DW image in iatrogenic CJD, showing high signal in the motor strip. 


The presence of protein 14-3-3, raised S100B levels, and highly ele- 
vated total tau levels in the CSF are useful adjunctive tests for sCJD. 
However, they have been superseded by the development of the real- 
time quaking-induced conversion (RT-QuIC) assay in recent years. 
RT-QuIC is an amyloid-seeding assay that uses repeated mechanical 
sample shaking and incubation to amplify prion seeds in patient CSE, 
using recombinant PrP as a substrate and a fluorescent thioflavin T 
readout. A positive CSF RT-QuIC possesses a comparable sensitivity 
to a positive protein 14-3-3 in the CSE, but more importantly it has 
a specificity for sCJD approaching 100% [26]. The RT-QuIC assays 
seeded by nasal brushings and, more recently, skin from sCJD pa- 
tients also yield high sensitivities and specificities approaching (and 
reaching) 100%, although neither of these have yet been incorpor- 
ated into routine clinical practice [27, 28]. EEG in sCJD may show 
characteristic pseudoperiodic sharp wave activity, but this is demon- 
strated in less than 50% of patients [25], the large majority of whom 
are PRNP codon 129 MM. To some extent, demonstration of a typ- 
ical EEG is dependent on the number of serial EEGs performed. 

Neuropathological confirmation of sCJD is by demonstration of 
the classical spongiform change, neuronal loss, astrocytosis, and 
positive PrP immunohistochemistry. PrP amyloid plaques are usu- 
ally not present in sCJD, although protease-resistant PrP“ types 1- 
3 can be demonstrated by immunoblotting of brain homogenates 


(molecular strain typing; see Aetiology, p. 414). PRNP analysis is im- 
portant to exclude pathogenic mutations. By definition, there is no 
family history in these cases, although rarely mutations are seen due 
to late-onset disease or non-paternity. It is possible some cases might 
arise from somatic mutation of PRNP or unidentified environmental 
exposure to human or animal prions. 


Atypical forms of sCJD with distinct clinical syndromes and dur- 
ations of survival are well recognized. Those with the so-called ataxic 
variant of sCJD present with slowly evolving gait ataxia in the initial 
months before the onset of cognitive impairment. These individ- 
uals are typically PRNP codon 129 MV and have a longer duration 
of illness, with a good proportion living beyond 2 years from the 
time of symptom onset. In the Heidenhain variant of sCJD, which 
affects individuals who are PRNP codon 129 MM, cortical visual 
symptoms and signs predominate in the early course of the illness, 
while memory remains intact; patients can progress rapidly to cor- 
tical blindness and death in as short as 4 weeks from symptom onset, 
often accompanied by severe agitation. 

Other clinical syndromes at presentation encountered in sCJD 
patients include pure cognitive syndromes, neuropsychiatric syn- 
dromes, cortico-basal syndrome, and stroke-like syndromes; 





SECTION 7 Delirium, dementia, and other cognitive disorders 


the former two can progress slowly with late emergence of typ- 
ical sCJD signs and are frequently mistaken for Alzheimer’s dis- 
ease and behavioural-variant fronto-temporal lobar degeneration, 
respectively. 


Acquired prion diseases 


While human prion diseases can be transmitted to experimental 
animals by inoculation, they are not contagious in humans. 
Documented case-to-case spread has only occurred during endo- 
cannibalistic practices (kuru) or following accidental inoculation 
with prions during medical or surgical procedures (iatrogenic CJD). 


Kuru 


Kuru reached epidemic proportions among the Fore linguistic 
group and their neighbours in the Eastern Highlands of Papua New 
Guinea [29]. It predominantly affected women and children, as they 
were the principal participants in the practice of consuming dead 
relatives at mortuary feasts, as a mark of respect and mourning. 

The clinical course typically lasts between 1 and 2 years and be- 
gins with a constitutional prodrome of headaches and limb and joint 
pain, followed by progressive cerebellar ataxia, often with other fea- 
tures such as tremor, diplopia, dysarthria, social withdrawal, and 
depression. Patients subsequently develop global dementia and fre- 
quently pyramidal and extra-pyramidal motor signs. 

Kuru incidence has been in decline since its peak in the 1950s, fol- 
lowing the cessation of endocannabalism in the late 1950s, and the 
last recorded case was 2012. However, the study of kuru continues 
to reveal valuable insights, the most recent being the identification 
of the protective G127V polymorphism in kindreds at the epicentre 
of the kuru epidemic. This polymorphism appeared to provide com- 
plete protection against kuru [30], and this was supported by trans- 
genic mouse modelling and inoculation experiments which showed 
that mice expressing human PrP 127V were completely resistant to 
all known human prion strains [31]. 


latrogenic CJD 


Iatrogenic transmission of CJD has occurred by accidental inocu- 
lation with human prions as a result of medical procedures. Such 
iatrogenic routes include the use of inadequately sterilized neuro- 
surgical instruments, dura mater and corneal grafting, and use of 
human cadaveric pituitary-derived growth hormone or gonado- 
trophin. It is of considerable interest that cases arising from 
intracerebral or optic inoculation manifest clinically as classical 
CJD, with rapidly progressive dementia, while those resulting from 
peripheral inoculation, most notably following pituitary-derived 
growth hormone exposure, typically present with a progressive cere- 
bellar syndrome and are, in that respect, somewhat reminiscent of 
kuru. Unsurprisingly, the incubation period in intracerebral cases is 
short (less than 2 years for depth electrodes and neurosurgery), as 
compared to peripheral cases (typically 15 years or more). There is 
evidence for genetic susceptibility to iatrogenic CJD with an excess 
of codon 129 homozygotes [9] (see Aetiology, p. 414). 


Variant CJD 


The appearance of an unusual form of CJD in the UK was heralded 
by a spate of cases in teenagers and young adults in 1995-1996, with 


a surprisingly consistent, but unique, histological pattern [32]. There 
was considerable concern over the possibility that they might suggest 
a link with BSE. These cases were named ‘new variant’ CJD, and even- 
tually ‘variant’ CJD (vCJD), although it was clear that they were also 
rather atypical in their clinical presentation; in fact, most cases did 
not meet the accepted clinical diagnostic criteria for probable CJD. 

Direct experimental evidence that vCJD is caused by BSE was 
provided by molecular analysis of human prion strains and trans- 
mission studies in transgenic and wild-type mice (see Aetiology, 
p. 414), with the most likely explanation being exposure to specified 
bovine offal (notably CNS tissues) prior to the ban on its inclusion in 
human foodstuffs in 1989. While it is now clear that vCJD is caused 
by infection with BSE prions, it is unclear why this particular age 
group should be predominantly affected. 

vCJD has an insidious clinical onset, and its early features are 
highly non-specific. The clinical presentation is often with behav- 
ioural and psychiatric disturbance, commonly depression, but also 
anxiety, social withdrawal, and behavioural change, complex unsus- 
tained delusions, and visual and auditory hallucinations. Initial re- 
ferral is consequently often to a psychiatrist. Persistent limb or face 
dysaesthesiae and pain are also prominent early features. In most 
cases, neurological features are not apparent until some months into 
the clinical course, whereby a progressive cerebellar syndrome de- 
velops with gait and limb ataxia. Overt dementia then occurs, with 
inevitable progression to akinetic mutism. Myoclonus is seen in 
most patients, and chorea is often present, which may be severe in 
some patients. The age at onset ranges from 12 to 74 years, with a 
mean of around 28 years. Clinical course is usually prolonged, com- 
pared to sCJD (9-35 months, median 14 months). 

The EEG is abnormal, most frequently showing generalized slow 
wave activity, but without the pseudoperiodic pattern seen in sCJD 
cases. CSF 14-3-3 protein may be elevated or normal. Neuroimaging 
by CT is either normal or shows only mild atrophy. The most useful 
non-invasive investigation in advanced cases is magnetic reson- 
ance neuroimaging, in which FLAIR hyperintensity has been dem- 
onstrated in the pulvinar nuclei of the thalamus, designated the 
‘pulvinar sign, in 91% of vCJD cases [33]. The pulvinar sign is not ex- 
clusive to vCJD but can also be seen in sCJD, paraneoplastic limbic 
encephalitis, idiopathic intracranial hypertension, status epilep- 
ticus associated with cat-scratch disease, Alper’s disease, and post- 
infectious encephalitis. The absence of the pulvinar sign does not 
exclude a diagnosis of vCJD, and in fact, the MRI in older patients 
with histologically confirmed vCJD seems less likely to possess the 
pulvinar sign [34]. Remarkably, up until early 2016, all definite cases 
of vCJD have been of the PRNP codon 129 MM genotype; a case 
of vCJD in an individual who is PRNP codon 129 MV was previ- 
ously described, but the first case of pathologically confirmed vCJD 
in whom the PRNP codon 129 genotype was MV was reported in 
2017 [35]. MRI features in this case were reminiscent of sCJD; the 
pulvinar sign was absent. Crucially, this patient did not fulfil diag- 
nostic criteria for probable or possible vCJD in life. 

Tonsillar biopsy is a sensitive and specific diagnostic procedure for 
vCJD and obviates the need for brain biopsy, based on the presence 
of prion replication in lymphoreticular tissue prior to neurological 
onset [36-39]. Tonsillar PrP* is uniformly present in clinically af- 
fected cases of vCJD, but not in other forms of prion disease. Prior 
to tonsil biopsy, PRNP analysis is essential to rule out pathogenic 
mutations, which may clinically mimic vCJD. 


Several biofluid-based techniques have been in development for 
diagnostic use in vCJD. The blood-based direct detection assay 
(DDA) exploits the steel binding affinity of prions with a capture 
matrix, coupled with immunodetection; this had a sensitivity of 
71% and a specificity of 100% and is available at the UK National 
Prion Clinic [40]. More recently, the protein misfolding cyclic amp- 
lification (PMCA) assay, which is analogous to the RT-QuIC tech- 
nique but uses repetitive cycles of sonication, instead of shaking and 
western blot as a readout, has been adapted with success for histor- 
ical vCJD blood and urine samples. The sensitivities and specificities 
achieved were between 90% and 100%, but neither has yet been ap- 
plied in routine clinical practice or for screening purposes [41-43]. 

The neuropathological appearances of vCJD are striking and rela- 
tively consistent. While there is widespread spongiform change and 
gliosis and neuronal loss, most severe in the basal ganglia and thal- 
amus, the most remarkable feature is abundant ‘florid’ PrP amyloid 
plaques (in which the plaques are surrounded by vacuoles) in the 
cerebral and cerebellar cortices. Western blot analysis of brain tissue 
demonstrates PrP“ type 4, which is pathognomonic of vCJD. 

Important lessons predicting the evolution of vCJD epidemiology 
can be drawn from kuru research and large-scale studies of archived 
appendix tissue in the UK. Firstly, kuru studies highlighted the pro- 
found effect of PRNP codon 129 genotype on incubation period 
with individuals who are 129 MV, having incubation periods which 
can exceed 50 years [5]; a similar effect of codon 129 genotype on 
incubation period is also seen in iatrogenic CJD cases secondary 
to human cadaveric growth hormone [44]. Secondly, screening of 
archived appendix studies led to estimates that approximately 1 in 
2000 of the UK population carry subclinical vCJD infection [45]. 
While numbers of vCJD cases have steadily fallen since their peak 
in 2000, it is possible that a second wave of cases may occur in 129 
MV individuals. Furthermore, given the atypical clinical features of 
the MV vCJD case and results from animal modelling studies (see 
Aetiology, p. 414), assiduous autopsy surveillance may be required 
to establish the true extent of BSE-related human prion disease, 
some of which might present as sCJD. 


Secondary (iatrogenic) vCJD 


The prominent lymphoreticular involvement raised early con- 
cerns that vCJD may be transmissible by blood transfusion. Indeed 
the tissue distribution is similar to that of ovine scrapie where 
prionaemia has been demonstrated experimentally. In 2004, two 
transfusion-associated cases of vCJD prion infection were reported 
among a small cohort of patients identified as having received blood 
from a donor who subsequently developed vCJD. One patient had 
a typical clinical course of vCJD, although the diagnosis was not 
made until autopsy, and had the PRNP codon 129 MM genotype. 
The second, who died of an unrelated condition, was found to have 
prion infection at autopsy. This patient had the PRNP codon 129 
MV genotype, which is associated with relative resistance to prion 
disease. Subsequently, two further patients have been diagnosed 
with vCJD during life from this group of 23 known surviving recipi- 
ents of implicated blood. That four of 23 patients have been infected, 
three dying of vCJD, in each case following transfusion with a single 
unit of implicated red cells, suggests the risk to recipients of blood 
from a silently infected donor is substantial. The incubation period 
in the clinical cases was 6-7 years. Since 2003, all known recipients 
of implicated blood have been notified of their status. Over 6000 
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individuals in the UK have been exposed to blood products pre- 
pared from large donor pools containing blood from a donor who 
went on to develop vCJD. None of these individuals, predominantly 
haemophiliacs, have yet developed vCJD. 


Inherited prion diseases 


Approximately 10-15% of human prion disease cases are inherited 
in an autosomal dominant fashion. Over 40 pathogenic mutations 
are reported in the human PrP gene (Fig. 42.2) and consist of: (1) 
point mutations within the coding sequence, resulting in amino acid 
substitutions in PrP or the production of a stop codon resulting in 
the expression of a truncated PrP; (2) insertions encoding additional 
integral copies of an octapeptide repeat present in a tandem array 
of five copies in the normal protein [octapeptide repeat insertion 
(OPRI)]; and (3) insertions or deletions resulting in a frameshift 
mutation. Some reportedly pathogenic mutations may, in fact, rep- 
resent benign or risk-conferring polymorphisms [46]. 

Phenotypes can be widely variable, even within kindreds; hence, 
PRNP analysis should be considered in all early-onset dementing or 
ataxic disorders and is available from the UK National Prion Clinic 
(see Further information, p. 421). Brief details of the more com- 
monly seen types are given in the following sections. For a more 
comprehensive review, see reference [47]. 


P102L (commonly presenting as 
Gerstmann-Straussler-Scheinker syndrome) 


This is the most common mutation seen in the UK, usually pre- 
senting as GSS, as originally described in 1936 in a family, prior to 
the discovery of PRNP mutations [48]. Progressive ataxia is the dom- 
inant clinical feature, with dementia, pyramidal features, and lower 
limb dysaesthesiae occurring later. The mean age of onset is 49 years 
(range 25-70), and the mean clinical duration 4 years. Rarely, the 
clinical course is shorter and can closely mimic sCJD. 


Octapeptide repeat insertion mutations 


The first PrP mutation reported was a 6-OPRI mutation, found ina 
small UK family with familial CJD [7], now known to form part of 
the largest known kindred with an inherited prion disease caused by 
an OPRI mutation. Kindreds segregating for pathogenic mutations 
encompassing 4-9 and 12-OPRI mutations are described (see, for 
example, [49-52]). Insertions of 1-3 OPRI repeats have been rarely 
reported, often with clinical and neuropathological features indis- 
tinguishable from sCJD [53]; it remains to be seen whether these 
reflect true pathogenic mutations or benign polymorphisms in the 
context of sporadic disease. 

The 6-OPRI mutation is the most commonly seen in the UK; 
the initial presentation is usually of cortical cognitive deficits such 
as apraxia, acalculia, language impairment, frontal dysexecutive 
features, and episodic memory loss. Physically, there is a varying 
combination of cerebellar ataxia and dysarthria, pyramidal signs, 
myoclonus, and occasionally extra-pyramidal signs, chorea, and 
seizures. A well-described behavioural prodrome can include per- 
sonality disorder, aggression, hypersexuality, and impulsivity, often 
manifesting in childhood or long before any overt neurodegenerative 
syndrome; patients are frequently in contact with the criminal justice 
system prior to diagnosis [54]. The mean age of onset is 34 years 
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Fig. 42.2 Pathogenic mutations (above) and polymorphic variants of the human prion protein gene. 


(range 20-53), with a mean clinical duration of 8-10 years. Codon 
129 heterozygotes have an age of onset of approximately 10 years 
later than homozygotes [51]. 


A117V 


This mutation has been described in families from France, the 
United States, and the UK [55]. The clinical features are of progres- 
sive cortical dementia that can often present as expressive language 
dysfunction. There is associated Parkinsonism, which may predom- 
inate in the early stages and mimic Parkinson’s disease. Pyramidal 
signs, pseudobulbar features, and cerebellar ataxia are also seen. 
The mean age of onset is 39 years, and the mean clinical duration of 
around 4 years. 


D178N (fatal familial insomnia) 


The D178N mutation has been documented widely in kindreds 
throughout Europe [56, 57], and the clinical phenotype is modified 
by the codon 129 status of the mutant allele. When the mutation is 
encoded on a methionine 129 allele, the clinical picture is usually of 
fatal familial insomnia (FFI); the cardinal feature is insomnia, often 
with a preceding period of inattentiveness. There may be initially 
compensatory daytime somnolence that masks early symptoms. 
Autonomic features invariably develop, including hypertension, ex- 
cessive salivation, hyperhidrosis, diurnal pyrexia. and impotence. 





Hallucinations are a frequent feature, and with progression, there is 
ataxia, dementia, and myoclonus. The mean age of onset is 50 years 
(range 20-72). When the codon 178 mutation occurs on a valine 
129 allele, the clinical picture closely mimics sCJD [58], as do MRI 
diffusion-weighted imaging sequence appearances, albeit with a 
more prolonged course of about 15 months. 


E200K 


This mutation is the most common cause of inherited prion disease 
worldwide, particularly in closely segregated communities, for ex- 
ample in Chile and Slovakia, and among Sephardic Jews in Libya and 
Israel [59, 60]. The clinical picture and imaging findings are indistin- 
guishable from sCJD, although there is often peripheral neuropathy, 
and seizures are more frequently seen. The mean age at onset is a little 
earlier than in sCJD, at around 61 years, but there is a wide range of 
31-78 years in the UK. The clinical duration is, on average, 5 months. 


Y163X (PrP systemic amyloidosis) 


First described in 2013 in a large family from the south of England, 
this truncation mutation usually manifests in the fourth decade 
with insidious diarrhoea, nausea, autonomic disturbance, neuro- 
genic bladder, and recurrent urinary infections, followed much 
later by length-dependent axonal neuropathy and cognitive decline. 
The clinical course is progressive over approximately 20 years, and 


pathology is remarkable for the extent of peripheral prion amyloid 
deposition, primarily in the gut, in addition to the CNS [61]. 


Pre-symptomatic and antenatal testing 


Direct gene testing allows unequivocal diagnosis in patients with 
inherited forms of the disease and pre-symptomatic testing of un- 
affected, but at-risk, family members, as well as antenatal testing 
after appropriate genetic counselling. Pre-implantation genetic 
diagnosis and transfer of unaffected embryos via in vitro fertilization 
(IVF) were successfully used to remove the risk of inherited prion 
disease in the offspring of a parent carrying the F198S mutation in 
the United States [62]. 

Because age of clinical onset in some mutations is affected by PRNP 
codon 129 genotype, it is often possible to determine within a family 
whether a carrier of a mutation will have an early or late onset of dis- 
ease. However, biomarkers indicating a more specific age of onset are 
sorely needed and are the focus of determined investigation. 

Most of the common mutations appear to be fully penetrant; how- 
ever, experience with some is extremely limited. In some families, 
for example with E200K or D178N (FFI), there are examples of eld- 
erly unaffected gene carriers who appear to have escaped the disease; 
the wide age of onset may indicate complete penetrance at a very ad- 
vanced age, which may not be practically relevant for many patients. 
Recent work comparing the expected incidence of clinical disease 
with the frequency of reportedly pathogenic mutations in large con- 
trol populations suggests the penetrance of several to be 10% or 
lower; at this level, they may be better classified as risk-conferring 
variants [46], with important implications for genetic counselling 
discussions. 

Genetic counselling is essential prior to pre-symptomatic testing. 
A positive PrP gene analysis has important consequences for other 
family members, and it is preferable to have discussed these issues 
with others in the immediate family before testing. Following the 
identification of a mutation, the wider family should be referred for 
genetic counselling. It is vital to counsel both those testing positive 
for mutations and those untested but at-risk that they should not be 
blood or organ donors and should inform surgeons, including den- 
tists, of their risk status prior to significant procedures, as precautions 
may be necessary to minimize the risk of iatrogenic transmission. 


All recognized prion diseases are invariably fatal, following a pro- 
gressive course. There have been few robust therapeutic trials in 
human prion disease. These are challenging due to the combination 
of difficulty recruiting adequate patient numbers, a paucity of val- 
idated outcome measures beyond mortality, and widely variable 
disease patterns in the absence of longitudinal natural history data. 
Many of these barriers to meaningful controlled clinical trials have 
now been largely overcome [63, 64]. 

A double-blind RCT of flupertine, a non-opioid analgesic, sug- 
gested some benefit in cognitive scores, but not mortality, albeit in 
a small sample size and with borderline statistical significance [65]. 
A patient preference trial (PRION-1) of quinacrine, an anti-malarial 
agent, in 107 patients with prion disease did not significantly alter 
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the disease course [66]. A more recent randomized, double-blind, 
placebo-controlled trial of doxycycline, an antibiotic with positive 
reports in previous observational studies, similarly did not show any 
effect on disease progression or mortality [67]. The effects of a var- 
iety of other agents have been reported anecdotally or in small case 
series, including pentosan polysulfate, a glycosaminoglycan admin- 
istered to a handful of patients with vCJD, with anecdotal reports of 
some longer disease courses [68]. 

More recently, many groups have turned their attention to 
targeting the PrP itself; interference with PrP© expression in the 
adult brain is without serious effect and blocks the onset of neuro- 
logical disease in animal models [69]. Anti-PrP monoclonal anti- 
bodies, or small molecules that interfere with the conversion of PrP“ 
to PrP‘, appear to slow or arrest the disease course in mouse models 
of CJD if given prophylactically, during the incubation period [70]. 
‘They represent a promising therapeutic avenue in humanized form. 
New methods for early diagnosis—and their timely use—will there- 
fore be vital, as arresting prion propagation will not reverse neuronal 
cell loss, which may be considerable by the time a clinical diagnosis 
is typically reached. 

Symptom management is crucial and, for specific indications, 
can be highly effective. Myoclonus is generally of cortical origin 
and responds well to levetiracetam, or a benzodiazepine (com- 
monly clonazepam), or sodium valproate at normal therapeutic 
doses. Visual hallucinations can be managed with a centrally acting 
anticholinesterase such as donepezil, and aggression or agitation 
with a dopamine antagonist, such as risperidone or quetiapine, or 
benzodiazepines; extra-pyramidal symptoms are common in prion 
disease, and the risk of these should be taken into account when pre- 
scribing (see Chapter 64 for details). 

Patients and their families require early and consistent support, 
given the likely rapid pace of clinical change and the variety of func- 
tional deficits in cognition, mobility, toileting, and behaviour. Good 
community or residential nursing care is essential, and the involve- 
ment of a palliative care team is often of great benefit. Supportive 
medical therapies, such as parenteral nutrition, may prolong sur- 
vival, with no appreciable improvement in quality of life. 


FURTHER INFORMATION 


CJD Support Network. http://www.cjdsupport.net/ 

Medical Research Council Prion Unit at University College London, 
Institute of Prion Diseases. http://www. prion.ucl.ac.uk/ 

The National CJD Research and Surveillance Unit, Western General 
Hospital, Edinburgh. http://www.cjd.ed.ac.uk/ 

Public Health England. Creuztfeldt-Jakob dis- 
ease (CJD): guidance, data and analysis. 2008. 
https://www.gov.uk/government/collections/ 
creutzfeldt-jakob-disease-cjd-guidance-data-and-analysis 

UK National Prion Clinic at the National Hospital for Neurology and 
Neurosurgery, London. http://www.nationalprionclinic.org 
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Dementia with Lewy bodies (DLB) is the second most common 
neurodegenerative dementia [1]. Lewy bodies are abnormal aggre- 
gates of several proteins that form eosinophilic spherical neuronal 
intracytoplasmic inclusions. Lewy bodies are found in the cerebral 
cortex, brainstem, various subcortical structures, and peripheral 
nervous system. Clinical manifestations of DLB include cognitive 
impairment, extra-pyramidal symptoms, neuropsychiatric symp- 
toms, and autonomic dysfunction. 

DLB has been known by other names, which include diffuse Lewy 
body disease, Lewy body dementia, dementia associated with cor- 
tical Lewy bodies, senile dementia of Lewy body type, and the Lewy 
body variant of Alzheimer’s disease [2]. The current consensus is 
that “Lewy body dementia’ term includes DLB and Parkinson's dis- 
ease dementia (PDD) [3]. DLB and PDD are clinically differenti- 
ated only by the timing of onset of dementia and Parkinson's disease 
(PD), using an arbitrary 1-year rule. PDD starts at least 1 year after 
well-established PD, but people with DLB develop dementia be- 
fore, concurrently, or within 1 year of onset of Parkinsonism. Some 
people with DLB never develop Parkinsonism. Once both dementia 
and Parkinsonism are well established, DLB and PDD seldom differ 
clinically. As increasing evidence suggests common pathophysio- 
logical processes underlying the two disorders [3], DLB and PDD 
can also be viewed together as a single nosological continuum [4]. 
However, this chapter focuses only on DLB, and Chapter 44 presents 
PDD in detail. 














Fritz Heinrich Lewy, later known as Frederic Henry Lewey, first 
reported eosinophilic intracytoplasmic inclusions in 1912, while 
he was studying the neuropathology of PD in the team of Alois 
Alzheimer at Munich University. He observed the inclusions in the 
dorsal vagal nuclei and substantia innominata of people with PD, 
but he concluded that the inclusion bodies are not pathognomonic 
of PD [5]. Konstantin Tretiakoff, a Russian neuropathologist, coined 
the term ‘Corps de Lewy (Lewy bodies) and presented it as one of 
the major hallmarks of PD pathology in 1919. Lewy also observed 
swellings of axon cylinders of the neurons with the inclusions, and 





they were called ‘Lewy neurites’ later. The interest in Lewy bodies 
was renewed in 1960 when Bethlem and Den Haltog Jager diligently 
documented the distribution of Lewy bodies in the central and auto- 
nomic nervous systems. Later, Haruo Okazaki reported in 1961 aut- 
opsies of two people with progressive dementia and rigidity, who 
had widely disseminated Lewy bodies in their cerebral cortex. 

Over the next two decades, Kenji Kosaka and other researchers 
from Japan reported autopsies of more than 20 people, who clinically 
presented with varying degrees of extra-pyramidal symptoms, cog- 
nitive impairment, and neuropsychiatric symptoms and had variable 
distribution of Lewy bodies in their brainstem and cerebral cortex. 
‘They proposed the term ‘Lewy body disease’ in 1980 and classified it 
into brainstem, transitional, and diffuse types. Yoshimura reported 
more autopsies of people with the diffuse type and proposed the term 
‘diffuse Lewy body disease’ in 1983 [6]. Hansen and Perry reported 
more autopsies from North America and Europe, respectively, in 
1990. Their studies have confirmed that diffuse Lewy body disease 
is not rare and that at least 15-20% of older people with dementia 
have Lewy bodies. Later, more autopsies that found Lewy bodies 
almost exclusively in the cerebral cortex were reported. The first 
international workshop on DLB was held in Newcastle in the UK in 
1995, and the consensus guidelines for the clinical and pathological 
diagnosis of DLB were published in 1996. Maria Spillantini identi- 
fied the major component of Lewy bodies as a-synuclein in 1997. 
The Third International Workshop on DLB was held in 2003, and 
the revised consensus diagnostic guidelines for DLB were published 
in 2005. The fourth consensus report of the DLB Consortium and 
the revised criteria for the clinical diagnosis of DLB was published 
in 2017. Studies investigating the epidemiology, pathophysiology, 
clinical diagnosis, and management of DLB have been exponentially 
increasing over the past two decades, and this chapter attempts to 
provide a brief overview of this expanding knowledge base. 




















Robust data on the prevalence and incidence of DLB remain sparse. 
However, DLB is no longer considered to be a rare disorder, and 
its prevalence is second only to Alzheimer’s dementia (AD) among 
older adults with neurodegenerative dementia. A recent systematic 
review, including 22 studies that investigated the prevalence and/or 


incidence of DLB, has reported that the incidence rates ranged from 
0.5 to 1.6 per 1000 person-years among the community-dwelling 
older adults aged 65 years and above [7]. 3.2-7.1% of all incident 
dementias are due to DLB. The incidence of DLB is nearly twice 
more common in men than in women. The prevalence of DLB 
has been reported from 0.02 to 0.06 per 1000 adults younger than 
65 years of age, and from 0.3 to 6.5 per 1000 older adults aged 65 and 
above. Studies that included only people older than 70 years have 
reported the prevalence of DLB as from 8.6 to 33.3 per 1000 people, 
and one study that included only people older than 81 years has 
reported the prevalence as 63.5 per 1000 people. The mean preva- 
lence of DLB is 4.2% in community studies, and it is 7.5% in clinical 
samples. 

DLB remains underdiagnosed in many clinical settings, so the 
prevalence of DLB could be higher than the reported estimates 
[8, 9]. Studies including detailed neurological examinations, 
neuroimaging, and screening for rapid eye movement (REM) sleep 
behaviour disorder (RBD) have reported the prevalence of DLB as 
high as 24%. Advancing age, hypertension, dyslipidaemia, and a past 
history of depression and anxiety have been reported to increase the 
risk of DLB. People with DLB reportedly have poorer quality of life 
than those with AD. Quality of life in DLB has been directly associ- 
ated with the independency in instrumental activities of daily living 
and inversely associated with neuropsychiatric symptoms, espe- 
cially apathy and delusions [10]. People with DLB may utilize more 
health resources, and their costs of care are reportedly higher than 
those of people with AD [11]. 


Pathology 


Neuropathology 


Subcortical and cortical Lewy bodies differ in their appearance and 
immunohistochemical staining. Subcortical Lewy bodies have the 
characteristic appearance of dense eosinophilic cores, surrounded 
by concentric lamellar bands and pallid halos. They can be visual- 
ized by immunohistochemical staining for a-synuclein and ubi- 
quitin. Cortical Lewy bodies do not have the surrounding halo. 
They can be visualized by immunohistochemical staining for ubi- 
quitin, but less than one-third of cortical Lewy bodies can be de- 
tected by a-synuclein immunostaining [12]. Lewy neurites are 
abnormal neuronal projections, including aggregates of a-synuclein 
and other proteins. They are more extensively distributed in various 
cortical and subcortical regions than the Lewy bodies, and they can 
be visualized by immunohistochemical staining for a-synuclein 
and ubiquitin. They are believed to represent earlier stages of 
neurodegeneration and to lead to synaptic dysfunction when they 
are present in the presynaptic terminals. 

It remains uncertain whether Lewy bodies play a neurotoxic or 
neuroprotective role [3]. Moreover, the relationship between cor- 
tical Lewy body burden and the clinical severity of DLB is not linear. 
However, a pathological classification system including three sub- 
types of DLB has been proposed on the basis of a semi-quantitative 
scoring system assessing the pattern of distribution of Lewy bodies 
and Lewy neurites. Brainstem-predominant DLB has Lewy path- 
ology principally within the substantia nigra, dorsal nucleus of the 
vagus, and the locus caeruleus. Limbic or transitional DLB includes 
Lewy pathology in the anterior cingulate and transtentorhinal 
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cortices. The third subtype—neocortical or diffuse DLB—has dis- 
seminated Lewy pathology in the cerebral cortex, with or without 
involvement of the brainstem. Braak has introduced a patho- 
logical staging system for PD, and he has suggested caudorostral 
progression of Lewy pathology that starts from the brainstem and 
then spreads to the neocortex. Some people with DLB may follow 
the caudorostral progression of Lewy pathology, but many do not 
follow this kind of spread. Hence, the importance of Braak staging 
is limited to DLB, and an additional ‘cerebral’ subtype of DLB has 
been proposed. 

Eighty to 90% of older adults with DLB have varying degrees 
of coexisting Alzheimer’s-like pathology. They often have diffuse 
B-amyloid plaques, but neocortical neurofibrillary tangles are less 
common in DLB. The neocortical type of DLB is more likely to have 
substantial coexisting Alzheimer’s-like pathology. Increasing evi- 
dence support that B-amyloid deposition also contributes to the 
pathology of DLB. Loss of cholinergic neurons in the basal nucleus 
of Meynert and decreased cortical choline acetyltransferase indicate 
that the deficits in cholinergic neurotransmission are common in 
DLB. Such deficits occur earlier and are more severe in DLB than in 
AD. Minor cerebrovascular lesions are found in around 30% of older 
adults with DLB, but their contribution to DLB pathology is uncer- 
tain. Depletion of dopaminergic neurons and deficits in GABA and 
serotonergic neurotransmission have been reported in older adults 
with DLB [1]. 


Neuroimaging 
Neuroimaging of people with DLB not only helps to enhance our 
understanding of the underlying pathology, but also has direct 
clinical implications [13]. Structural neuroimaging in DLB with 
MRI often reveals diffuse, but relatively mild global, grey matter at- 
rophy with relatively preserved medial temporal lobes, when com- 
pared to people with AD. However, if there is substantial coexisting 
Alzheimer’s-like pathology, medial temporal lobes may not be 
spared. The volume of the substantia innominata and putamen 
may be less in DLB than in AD. Atrophy in the temporal, occipital, 
and parietal lobes is often more pronounced in DLB than in PDD. 
Diffusion tensor imaging (DTI) studies have documented the loss of 
parieto-occipital white matter tracts in DLB, when compared with 
people without cognitive impairment [14]. Several studies have 
investigated DLB using magnetic resonance spectroscopy (MRS) 
[15]. They have documented reduced N-acetyl aspartate/creatine 
ratio in the centrum semiovale, occipital grey matter, and temporal 
lobes in DLB, when compared to healthy controls. Findings on cho- 
line/creatine ratios in DLB have been inconsistent. A recent study 
has reported that people with mild cognitive impairment (MCI) 
and lower N-acetyl aspartate/creatine ratio in their posterior cin- 
gulate cortex were significantly more likely to develop AD, but not 
DLB [16]. 
°T_28-carbomethoxy-3B-4-iodophenyl-N-3-fluoropropyl 
nortropane single-photon emission computed tomography (° FP- 
CIT SPECT) can demonstrate low dopamine transporter uptake 
in the basal ganglia in people with DLB. This can distinguish DLB 
from AD, before one develops clinically observable extra-pyramidal 
symptoms. This has been evaluated in a multi-centre phase III trial, 
and it has been incorporated in the DLB Consortium revised cri- 
teria for the clinical diagnosis of DLB [17]. SPECT studies investi- 
gating regional cerebral blood flow by using radionucleotides, such 
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as "Tc hexamethylpropyleneamineoxime, have reported global 
cortical hypoperfusion in DLB, when compared to healthy controls, 
and occipital hypoperfusion in DLB, when compared to people with 
AD. PET studies using '*F-fluorodeoxyglucose (FDG) have con- 
firmed occipital hypometabolism in DLB, when compared to AD. 
An association between the frequency of visual hallucinations and 
occipital hypometabolism in DLB has been reported. Besides, me- 
tabolism in the posterior cingulate cortex is relatively preserved in 
DLB, when compared to AD, and this ‘posterior cingulate island 
sign’ can distinguish DLB from AD. Amyloid PET studies have 
documented variable degrees of cortical -amyloid ligand binding 
in DLB. B-amyloid deposition is usually more in DLB, when com- 
pared to healthy controls and people with PDD, but it is relatively 
less than that in AD [18]. A recent tau PET study that employed '*F- 
labelled AV-1451 ligand has demonstrated variable tau pathology, 
especially in the inferior temporal gyrus and precuneus, in people 
with DLB [19]. Cholinergic PET studies have reported reduced 
cortical and thalamic acetylcholinesterase activity in DLB. I- 
metaiodobenzylguanidine (MIBG) scintigraphy can demonstrate 
the involvement of the autonomic nervous system in DLB. Reduced 
myocardial MIBG uptake due to the involvement of post-ganglionic 
cardiac sympathetic nerves can distinguish DLB from AD very ac- 
curately, and it has been recently included in the diagnostic criteria 
for DLB [20]. Fig. 43.1 presents representative coronal MRI, FDG 
PET, and '*FP-CIT SPECT images in DLB and AD. 


Molecular biology 


Lewy bodies are complex structures, and they are made of at 
least 90 distinct molecules [21]. They comprise a-synuclein, a- 
synuclein binding proteins, such as tau, agrin, synphilin-1, 14-3- 
3, and microtubule-associated proteins, and synphilin-1 binding 
proteins such as parkin, dorfin, and glycogen synthase kinase-36. 
They include proteins from the ubiquitin proteasome system, such 
as ubiquitin, ubiquitin-activating enzyme, ubiquitin-conjugating 
enzyme, ubiquitin C-terminal hydrolase, ubiquitin ligase, prote- 
asome, and proteasome activators, and from the autophagosome 
lysosome system such as glucocerebrosidase, LC3, GATE-16, 
and NBR1. Many aggresome-related proteins, such as y-tubulin, 
HDAC6, and pericentrin, several cellular stress response proteins 
including heat-shock proteins, clusterin, glutathione peroxidase, 
heme oxygenase-1, aB-crystallin, and superoxide dismutase 1 and 
2, and various protein phosphorylation and signal transduction 
proteins including calcium/calmodulin-dependent protein kinase 
II, cyclin-dependent kinase 5, G-protein coupled receptor kinase 5, 
leucine-rich repeat kinase 2 (LRRK2), and phospholipase C-6, are 
found in Lewy bodies. Lewy bodies include mitochondria-related 
proteins, such as cox IV, cytochrome C, and PTEN-induced putative 
kinase 1 (PINK1), cell cycle proteins, such as cyclin B and retino- 
blastoma protein, and cytoskeletal proteins, such as neurofilament, 
and tubulin. Several cytosolic proteins, including amyloid precursor 
protein, calbindin, choline acetyltransferase, chromogranin A, 
synaptophysin, synaptotagmin, tyrosine hydroxylase, and vesicular 
monoamine transporter 2, immunoglobulins, and lipids are present 
in Lewy bodies [21]. 

It is widely accepted that aggregation of a-synuclein is the key 
initial step in the formation of Lewy bodies [12]. DLB, along with 
PD and multiple system atrophy, is recognized as a primary a- 
synucleinopathy. Made of 140 amino acids, a-synuclein normally 


exists as an unfolded protein without a typical secondary struc- 
ture or as a stable folded tetramer. It is principally localized in 
the presynaptic terminals of neurons. Molecular functions of a- 
synuclein remain uncertain, but they may include trafficking of 
synaptic vesicles, formation of SNARE complexes that mediate 
the fusion of vesicles with their target membranes, regulation of 
synaptic vesicle recycling, and regulation of presynaptic release of 
several neurotransmitters, including dopamine [22]. Presynaptic 
a-synuclein aggregation leads to synaptic dysfunction that may ex- 
plain the neurodegeneration in DLB. Conformational changes and 
aggregatory properties of a-synuclein have been extensively inves- 
tigated over the last decade. a-synuclein is susceptible to undergo 
various post-translational modifications, such as phosphoryl- 
ation, ubiquitination, nitration, oxidation, and sumoylation [23]. 
Such post-translational modifications, binding with polyamines, 
and cross-linking by tissue transglutaminase promote oligomer- 
ization of a-synuclein; this leads to the formation of protofibrils 
and mature aggregates [24]. a-synuclein oligomers and protofibrils 
are neurotoxic. However, the formation of Lewy bodies with other 
proteins may protect the involved neurons and prolong their 
survival [12]. There are at least two more alternatively spliced 
isoforms of a-synuclein. a-synuclein that has 126 amino acids (a- 
synuclein 126) is less prone for aggregation, but a-synuclein that 
has 112 amino acids (a-synuclein 112) has increased propensity 
for aggregation [25]. 

a-synuclein oligomers can be released into the extracellular 
space either by exocytosis or by the formation of exosomes, which 
are 30-100 nm-sized extracellular vesicles [26]. Exosomes carry a 
complex cargo of proteins, lipids, and nucleic acids from the brain 
to the peripheral systems, and they lead to cell-to-cell transport of 
several biologically active molecules, including a-synuclein. An 
increasing number of cell culture, animal, and post-mortem studies 
have shown that exogenous a-synuclein can instigate Lewy path- 
ology in the recipient cells, and the possibility of prion-like propaga- 
tion of a-synuclein in DLB has been hypothesized [27]. Extracellular 
a-synuclein can activate neighbouring microglia and astrocytes. 
Possible mechanisms of microglial activation by a-synuclein, 
and consequent chronic neuroinflammation in DLB, have been 
studied over the past decade [28]. Activated microglia release 
pro-inflammatory cytokines, such as interleukin-1, interleukin- 
2, interleukin-6, and tumour necrosis factor-a, nitric oxide, and 
reactive oxygen molecules and trigger an apoptotic process of af- 
fected neurons [29]. Besides, the ubiquitin proteasome system and 
the autophagy lysosome pathway are the two most essential path- 
ways that repair or degrade several abnormal neuronal proteins, 
including a-synuclein. Accumulating evidence suggests that these 
two pathways are dysfunctional in DLB and that such deficits lead 
to an increased release of extracellular a-synuclein and an uncon- 
trolled propagation of associated pathology. B-glucocerebrosidase 1 
(GCase), a lysosomal hydrolase, has received special attention re- 
cently [30]. Lysosomal dysfunction, secondary to GCase deficiency, 
may lead to misprocessing and aggregation of a-synuclein [31]. 
GCase deficiency also contributes to mitochondrial dysfunction 
[32], and other mitochondria-related proteins, such as parkin and 
PINKI, are involved in the pathology of DLB. Mitochondrial en- 
ergy metabolism dysfunction increases the vulnerability of neurons 
to oxidative damage [33], and a-synuclein pathology can directly 
damage mitochondria [34]. 
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Fig. 43.1 (see Colour Plate section) Representative coronal MRI, FDG-PET, and FP-CIT SPECT images in DLB and AD. 
i from The Lancet 386(10004), Walker Z, Possin KL, Boeve BF, Aarsland D., Lewy body dementias, Pages 1683-97. Copyright 2015 with permission from Elsevier. 





Genetics 


Although DLB is mostly sporadic, there is evidence for familial ag- 
gregation of DLB [35]. A genome-wide linkage study in an auto- 
somal dominant family with autopsy confirmed DLB has mapped 
the chromosomal locus for DLB at 2q35-q36 [36]. SNCA codes 
for a-synuclein, and SNCA triplications and missense mutations 
have been associated with DLB [37]. GBA (glucosylceramidase 
beta) encodes GCase, and the genetic associations between DLB 
and polymorphisms in GBA have been replicated by several studies 
[38-40]. A large multi-centre study has reported that pathogenic 


GBA mutations increase the risk of DLB by more than eight times 
and that they are associated with an earlier age of onset of DLB and 
disease severity [39]. A recent study has found that the pathogenic 
GBA mutation carriers have significantly less GCase activity in their 
brain [41]. Another large study that investigated 788 people with 
DLB and 2624 controls has reported false-discovery adjusted stat- 
istically significant genetic associations of DLB with APOE, SNCA, 
and SCARB2 (scavenger receptor class B member 2) [42]. Along 
with GBA and SCARB2 mutations that lead to lysosomal dysfunc- 
tion, two more lysosome-related genes SMPD1 and MCOLN1 have 
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been associated with Lewy pathology [41]. Available evidence sug- 
gests that APOE e4 allele increases the risk of DLB and that APOE 
£2 allele reduces the risk and delays the onset of DLB by 4 years 
[43]. Evidence supporting the associations between DLB and other 
AD-related genes, such as APP and PSEN1, are weak [37]. The first 
genome-wide association study (GWAS) investigating DLB was 
published in January 2018. It estimated 36% heritability of DLB, 
and it confirmed the associations between DLB and variants in 
APOE, SNCA, and GBA [43a]. 

Gene expression studies in DLB have confirmed the import- 
ance of alternative splicing of a-synuclein and the differential 
expression of the isoforms. Increased expression of a-synuclein 
112 and decreased expression of a-synuclein 126 in post-mortem 
frontal cortical samples from people with DLB have been reported 
[25]. A study investigating the expression of all small ribonucleic 
acids (RNAs) in post-mortem temporal cortical samples of people 
with DLB did not find any differentially expressed small RNA. 
However, another study that investigated post-mortem anterior 
cingulate cortical samples from people with DLB using next- 
generation RNA sequencing technology has documented 490 dif- 
ferentially expressed genes, including many downregulated genes, 
which are implicated in neurogenesis, myelination, and regulation 
of nervous system development [44]. Another study that investi- 
gated the primary visual cortex post-mortem, using gene expres- 
sion microarrays, has reported that several genes, associated with 
GABAergic neurotransmission, were differentially expressed in 
people with DLB who experienced recurrent complex visual hal- 
lucinations. Lower levels of DNA methylation in the SCNA intron 
1 has been found in the peripheral leucocytes of people with DLB, 
but studies evaluating the epigenetics of DLB remain sparse [45]. 


Other biological correlates 


Theta and delta wave activity and frequency variation in EEG, es- 
pecially in the posterior leads, are more frequent in DLB than in 
AD and PDD [46]. Quantitative EEG may predict conversion 
of MCI to DLB accurately. RBD is a core clinical feature of DLB. 
Video polysomnographic investigation of people with DLB has re- 
vealed increased sleep latency, reduced sleep efficiency, and altered 
non-REM sleep architecture. Several studies have investigated po- 
tential biomarkers of DLB in the CSF for helping an early and ac- 
curate clinical diagnosis, as well as for monitoring prognosis and 
treatment effectiveness [47]. A meta-analysis including 13 studies 
and 2728 people has established that a-synuclein concentration 
in the CSF is significantly lower in DLB than in AD [48]. Akin to 
AD, CSF B-amyloid 1-42 concentration is often reduced in DLB 
and has been shown to predict a more rapid cognitive decline in 
DLB [49]. However, CSF tau and phosphorylated tau concentra- 
tions are less in DLB than in AD. The oxidized form of B-amyloid 
1-40 in the CSF is higher in DLB than in PDD and healthy con- 
trols. CSF levels of homovanillic acid, 5-hydroxyindoleacetic acid, 
3-methoxy-4-hydroxy phenylethyleneglycol levels, and neurosin, as 
well as cocaine and amphetamine-regulated transcript, have been 
reported to be less in DLB than in AD. Other CSF studies have re- 
ported elevated levels of neurofilaments, calcium, and magnesium, 
as well as reduced levels of soluble neuron glia 2 in DLB [47]. Studies 
evaluating potential biomarkers of DLB in other biological fluids 
remain sparse. Unlike AD, serum fatty acid-binding protein levels 
are elevated in DLB. Preliminary evidence suggests elevated serum 


magnesium levels in DLB. Although Lewy pathology often involves 
minor salivary glands, reliable salivary biomarkers for DLB have not 
been found so far. 


Although a definite diagnosis of DLB can be confirmed only by neuro- 
pathological verification, a probable or possible diagnosis of DLB can 
be made clinically. DLB is currently diagnosed by using either the DLB 
Consortium revised criteria for the clinical diagnosis of DLB [50] or the 
DSM.-5 diagnostic criteria for major neurocognitive disorder with Lewy 
bodies. ICD-10 does not include specific diagnostic criteria for DLB. 


The DLB consortium criteria 


DLB diagnoses are missed more often than not in clinical settings 
when standard diagnostic criteria are not used routinely. A large 
study that evaluated the accuracy of clinical diagnoses of DLB, in 
comparison with autopsy findings, has reported 95% specificity, 
but only 32% sensitivity [51]. The third DLB Consortium revised 
criteria for the clinical diagnosis of DLB have been reported to in- 
crease the clinical diagnoses of probable DLB by nearly 24%. Table 
43.1 presents the current revised criteria for the clinical diagnosis 
of probable and possible DLB [50]. Meeting the general criteria for 
dementia is essential for the diagnosis of possible or probable DLB. 
Two of the four core clinical features (fluctuating cognition, recur- 
rent visual hallucinations, RBD and spontaneous Parkinsonism) are 
sufficient for a diagnosis of probable DLB, and only one of them is 
sufficient for a diagnosis of possible DLB. If one or more of the in- 
dicative biomarkers is present in the presence of one or more core 
clinical features, probable DLB can be diagnosed. A diagnosis of 
probable DLB cannot be made on the basis of indicative biomarkers 
alone. In the absence of any core clinical features, one or more in- 
dicative biomarkers are sufficient for a diagnosis of possible DLB. 
Possible DLB is an unstable diagnosis, and some develop probable 
DLB, whereas others develop AD or other diseases. The revised cri- 
teria list supportive clinical features and supportive biomarkers that 
are commonly present in DLB, but their diagnostic specificity are 
uncertain. Hence, these supportive features do not have any diag- 
nostic weighting. There are ongoing endeavours to improve the sen- 
sitivity of the criteria without compromising their specificity. 


DSM-5 diagnostic criteria 


As the term ‘dementia is replaced by the term ‘major neurocognitive 
disorder’ in DSM-5, DLB is called ‘major neurocognitive dis- 
order with Lewy bodies’ (NCDLB). The DSM-5 diagnostic cri- 
teria for probable and possible major NCDLB mirrors the 2005 
version of third DLB Consortium revised criteria for the clin- 
ical diagnosis of DLB, but there are a few differences between 
the two. Three core features (fluctuating cognition, recurrent 
visual hallucinations, and spontaneous Parkinsonism) are in- 
cluded, and RBD and severe neuroleptic sensitivity are men- 
tioned as two suggestive features in DSM-5. The neuroimaging 
biomarker demonstrating low dopamine transporter uptake in 
the basal ganglia is not included in DSM-5, and this may com- 
promise the sensitivity of the criteria further. At least two core 
features or one core feature with one or two suggestive features 
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Table 43.1 The DLB Consortium revised criteria for the clinical diagnosis of probable and possible DLB 


Essential feature 


Dementia that is defined by progressive cognitive decline of sufficient magnitude to interfere with normal social or 


occupational function. Prominent or persistent memory impairment may not occur in the early stages but is usually evident 
with progression. Deficits on tests of attention, executive function, and visuo-spatial ability may be especially prominent 


Core clinical features 


1. Fluctuating cognition with pronounced variations in attention and alertness 


2. Recurrent visual hallucinations that are typically well formed and detailed 


3. Spontaneous features of Parkinsonism 
4. RE 


Supportive clinical features 


sleep behaviour disorder that may precede cognitive decline 


Severe sensitivity to antipsychotic agents, postural instability, repeated falls, syncope or other transient episodes of 


unresponsiveness, severe autonomic dysfunction, hypersomnia, hyposmia, hallucinations in other modalities, systematized 


delusions, apathy, anxiety, and depression 





Indicative biomarkers 1. Reduced dopamine transporter uptake in basal ganglia demonstrated by SPECT or PET 
2. Abnormal (low uptake) '*iodine-MIBG myocardial scintigraphy 
3. Polysomnographic confirmation of REM sleep without atonia 

Supportive biomarkers 1. Relative preservation of medial temporal lobe structures on CT/MRI 


2. Generalized low uptake on SPECT/PET perfusion/metabolism scan with reduced occipital activity with or without the 


cingulate island sign on FDG-PET imaging 


3. Prominent posterior slow-wave activity on EEG with periodic fluctuations in the pre-alpha/theta range 


A diagnosis of DLB is less likely i 
otal for the clinical picture 





n the presence of a physical illness or brain disorder including cerebrovascular disease sufficient to account in part or in 


2. If Parkinsonism only appears for the first time at a stage of severe dementia 


Temporal sequence of symptoms 


DLB should be diagnosed when dementia occurs before or concurrently with Parkinsonism. The term PDD should be used 


to describe dementia that occurs in the context of well-established Parkinsons disease. In clinical settings, the term that is 
most appropriate to the clinical situation should be used, and generic terms such as Lewy body disease are often helpful. In 
research studies in which distinction needs to be made between DLB and PDD, the existing 1-year rule between the onset of 
dementia and Parkinsonism for DLB continues to be recommended 





Adapted from Neurology, 89(1), McKeith IG, Bradley F, Boeve BF, et al., Diagnosis and management of dementia with Lewy bodies: fourth consensus report of the DLB Consortium, 


pp. 88-100. Copyright (2017), with permission from American Academy of Neurology 


are needed to diagnose probable major NCDLB. Only one core 
feature or one or two suggestive features are needed to diagnose 
possible major NCDLB. Besides, there is no consensus on the def- 
inition of prodromal or pre-dementia phase of DLB, but DSM-5 has 
introduced probable and possible mild neurocognitive disorder 
with Lewy bodies. Nosological validity of these two diagnostic 
categories remains uncertain. DSM-5 has also provided further 
coding for major NCDLB with or without behavioural disturbance. 
However, DSM-5 does not include a list of supportive features of 
NCDLB but mentions many supportive features as ‘diagnostic 
markers’ with uncertain diagnostic weighting. 


Clinical presentation 


As the initial clinical presentation of DLB varies widely, people with 
DLB may first present to primary care, psychiatric, neurology, in- 
ternal medicine, or emergency services. DLB may initially present 
with memory impairment, impairment of other cognitive func- 
tions, visual hallucinations, psychosis, depression, fall, syncope, 
loss of consciousness, acute confusion, sleep disorders, autonomic 
dysfunction, or extra-pyramidal symptoms. Such heterogenous 
clinical presentation contributes to the underdiagnosis of DLB in 
clinical settings. DLB is not clinically diagnosed, until people de- 
velop cognitive impairments that are severe enough to interfere 
with their functions. People with DLB have varying degrees of 
cortical and subcortical neuropsychological deficits. People with 
DLB often have greater impairment of their attention, visuo-spatial 
skills, and executive functions than people with AD [52]. DLB 
usually presents with recurrent episodes of confusional states on 


a background of progressive cognitive deterioration. Fluctuating 
cognition with pronounced variations in attention and alertness is 
a core feature of DLB. Fluctuating cognition is present in 50-75% 
of people with DLB but is difficult to elicit reliably in the clinical 
setting. Scales, such as clinician assessment of fluctuation, the one- 
day fluctuation assessment scale, the Mayo fluctuations composite 
scale, and the dementia cognitive fluctuation scale, may help, but 
they need further validation and they are seldom used routinely. 
Visuo-spatial impairment occurs more frequently and earlier in 
DLB than in AD. Executive function deficits in DLB include im- 
pairments in cognitive flexibility, planning, abstraction, conceptu- 
alization, judgement, self-monitoring, and reinforcement learning. 
Episodic memory and verbal memory impairments are often less 
severe in DLB than in AD. Recognition of information is better 
preserved than recall in DLB, and this indicates impaired retrieval 
mechanisms in DLB. 

Spontaneous Parkinsonism may occur in 60-92% of people with 
DLB [53], and 25-50% of people with DLB may have them at the 
time of their diagnoses. Missing the clinical diagnosis of DLB is more 
likely in those lacking clinically observable extra-pyramidal symp- 
toms. Detailed neurological examination and scales, such as the 
unified PD rating scale, facilitate the detection of extra-pyramidal 
symptoms in DLB. Extra-pyramidal symptoms in DLB often include 
bilaterally symmetrical limb rigidity, symmetrical postural tremor, 
bradykinesia, facial impassivity, axial rigidity, and shuffling gait. 
Prominent tremors, asymmetrical resting tremors, and asymmet- 
rical limb rigidity are less common in DLB [2]. '*FP-CIT SPECT 
and PET neuroimaging can demonstrate reduced dopamine trans- 
porter uptake in the basal ganglia, before the occurrence of clinically 
observable extra-pyramidal symptoms. A meta-analysis including 
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four studies and 419 people has reported that '“FP-CIT SPECT has 
86.5% sensitivity and 93.6% specificity to diagnose DLB [54]. 'FP- 
CIT SPECT is not necessary once Parkinsonism is well established 
clinically. Missing a clinical diagnosis of DLB and the presence of 
associated psychotic symptoms often set the stage for revealing se- 
vere sensitivity to dopamine antagonist drugs in DLB. Challenge 
with a dopamine antagonist must not be used for the diagnosis of 
DLB, because of the associated risks. 

Almost 85% of people with DLB experience recurrent complex 
visual hallucinations that usually feature people, children, and 
animals [53]. The images are often vivid, colourful, well formed, 
three-dimensional, and mute. Visuoperceptual deficits predis- 
pose people with DLB to experience visual hallucinations [55]. 
A combination of faulty perception of environmental stimuli, im- 
paired recollection of prior experience, and intact generation of 
images may lead to visual hallucinations. Studies that evaluated 
the neurocognitive processes underlying visual hallucinations in 
DLB have proposed a disconnection between the prefrontal cortex 
and the visual cortex, as well as improper activation of the inferior 
temporal cortex [56]. Hallucinations involving other modalities 
are less common. People with DLB may have other neuropsychi- 
atric symptoms, including apathy, paranoid delusions, depres- 
sion, anxiety, and aggression [57]. Besides, RBD is a parasomnia 
that presents with a lack of motor inhibition during REM sleep, 
leading to potentially harmful simple or complex dream-enacting 
vocalizations and motor behaviours. Although a formal diagnosis 
of RBD can be confirmed only by polysomnography, diligent his- 
tory taking and standardized questionnaires, such as the Mayo 
sleep questionnaire, can elicit RBD reliably in clinical settings. The 
prevalence of RBD among people with DLB has been suggested to 
be as high as 77%, and RBD often precedes the onset of DLB by sev- 
eral years. Additionally, autonomic dysfunction, including ortho- 
static hypotension, and carotid sinus hypersensitivity are common 
in DLB, and they may lead to constipation, urinary incontinence, 
syncope, and falls. 





Differential diagnosis 


Differentiating DLB from other conditions with overlapping clin- 
ical presentations is essential for accurate diagnosis and appropriate 
management, and for minimizing the use of dopamine antagon- 
ists. Differential diagnosis of DLB include AD, PDD, vascular de- 
mentia, fronto-temporal dementia, other dementias, delirium, PD, 
adverse effects of anti-parkinsonian medications, progressive supra- 
nuclear palsy, multiple system atrophy, cortico-basal degeneration, 
other neurodegenerative diseases, prion diseases, complex partial 
seizures, late-onset psychosis, and psychotic depression. Diligent 
history taking, neurological examination, neuropsychological as- 
sessments, neuroimaging, EEG, and CSF biomarkers aid to clarify 
diagnostic dilemmas [53]. Fluctuating cognition and RBD are likely 
to be missed, unless specifically asked for, and eliciting them is vital 
for an accurate clinical diagnosis of DLB. If fluctuating cognition is 
the principal clinical presentation, a detailed systemic examination 
and investigations for potential causes for delirium should be car- 
ried out. Polypharmacy and associated adverse effects in older adults 
can mimic many clinical features of DLB. Hence, a comprehensive 
medication review is essential. 


AD is the most common clinical misdiagnosis of people with 
DLB. The neuropsychological profiles of AD and DLB differ, and 
people with DLB may perform relatively worse on tests of attention, 
working memory, figure copying, spatial judgement, and set-shifting. 
SPECT and PET can help differentiating DLB from other dementias 
by revealing low dopamine transporter uptake in the basal ganglia 
and occipital hypoperfusion, as well as hypometabolism. '“FP-CIT 
SPECT may be positive in fronto-temporal dementia, but associated 
occipital hypometabolism favours the diagnosis of DLB. A meta- 
analysis including eight studies and 346 people has reported that 
MIBG scintigraphy can differentiate DLB from other dementias, 
with 98% sensitivity and 94% specificity [20]. CSF biomarkers may 
not help the differential diagnosis of DLB, but they may predict a 
more severe clinical course. Besides, DLB may initially present only 
with neuropsychiatric symptoms, so it should be considered as one 
of the differential diagnoses for all older adults presenting with late- 
onset psychosis, especially if they have associated cognitive deficits 
or extra-pyramidal symptoms. 


Clinical course 


Pathological processes underlying DLB may begin several years 
earlier than their clinical manifestations. A heterogenous, poten- 
tially long prodromal phase of DLB is widely recognized but has 
been poorly defined so far. RBD is likely to be the most common fea- 
ture of prodromal DLB. Many longitudinal studies have confirmed 
the association between idiopathic RBD and later DLB diagnosis 
[58]. Features of prodromal DLB may include MCI, subtle neuro- 
psychological deficits, olfactory dysfunction, dysautonomia, con- 
stipation, hypersalivation, mild extra-pyramidal symptoms, and 
neuropsychiatric symptoms. Among people presenting with MCI, 
non-amnestic deficits, fluctuating cognition, visual hallucinations, 
extra-pyramidal symptoms, EEG changes, low dopamine trans- 
porter uptake in the basal ganglia, reduced myocardial MIBG up- 
take, occipital hypoperfusion, and occipital hypometabolism have 
been associated with progression to DLB. The clinical onset of DLB is 
often insidious. Typical clinical course is gradually progressive, and 
the rate of cognitive decline is similar to that of AD. Initial studies 
have suggested that DLB and AD do not differ in their prognosis, but 
recent evidence indicates a worse prognosis for DLB. Longitudinal 
studies evaluating the courses of DLB and AD have associated DLB 
with an increased risk of mortality [59], shorter median survival 
time, and earlier admission to nursing homes by nearly 2 years [60]. 
DLB also places a high burden on the caregivers, even compared to 
AD [61]. APOE e4 allele has been associated with a rapid decline in 
DLB, and increased CSF tau levels have been associated with shorter 
survival in DLB. 


Good clinical management of DLB begins with an early accurate 
diagnosis. As treatments that can alter the progression of DLB path- 
ology are not currently available, contemporary management focuses 
on symptomatic treatment, treatment of comorbidity, and providing 
holistic supportive care. Patients, their families, and carers need to 
be educated about the nature of the illness and should be involved in 


Table 43.2 Summary of available pharmacological options for DLB 


Cognitive symptoms Donepezil’ 

Rivastigmine 
Galantamine 
emantine 


Hallucinations and delusions Donepezil 

Rivastigmine 

emantine 

Clozapine 

Quetiapine 

Pimavanserin 

Olanzapine (poorly tolerated) 











Depression Antidepressant medications 
Anxiety Antidepressant medications 
REM sleep behaviour disorder Rivastigmine 

emantine 

elatonin 

Clonazepam 

Restless legs syndrome Gabapentin 
Excessive daytime sedation odafinil/armodafinil 
Postural hypotension Fludrocortisone 


Urinary incontinence Trospium 


“Level | evidence is available. 


all clinical decisions. Individualized care plans should be formulated 
after comprehensive assessments of the needs, risks, disability, and 
wishes of people with DLB. Well-co-ordinated multi-disciplinary 
management involving general practitioners, psychiatrists, neurolo- 
gists, psychologists, nurses, physiotherapists, occupational therap- 
ists, and social workers is preferred. Good-quality RCTs specifically 
recruiting people with DLB are few, and people with DLB are very 
sensitive to drug-related adverse effects. Hence, treating psychiat- 
rists should adhere diligently to the general principles of geriatric 
psychopharmacology such as very low-dose initiation, slow titra- 
tion, periodic monitoring, dose reduction at the earliest possible 
opportunity, and minimizing polypharmacy. Therapeutic decision- 
making is not easy when distressing psychotic symptoms and neuro- 
leptic sensitivity coexist in DLB. It is tough for patients and their 
carers to make a choice between mobility and psychosis. Risks and 
benefits of all available treatment options should be discussed in 
such scenarios. Table 43.2 presents a summary of available pharma- 
cological management options for DLB. 


Cognitive symptoms 


Available evidence supports the use of acetylcholinesterase inhibi- 
tors in standard doses to treat cognitive symptoms in DLB. There 
have been two RCTs evaluating donepezil, and one RCT investi- 
gating the effects of rivastigmine in DLB [62]. Donepezil was signifi- 
cantly more effective in both RCTs, with a pooled mean difference of 
nearly 2 Mini-Mental State Examination (MMSE) points, and cog- 
nitive improvements were maintained for 52 weeks in both RCTs. 
Both RCTs have reported that people receiving donepezil were sig- 
nificantly more likely to have absence of deterioration on clinical 
global impression. People taking rivastigmine showed nearly 1- 
point improvement of their MMSE scores during the study period, 
but this effect was not statistically significant. An uncontrolled trial 
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investigating the efficacy of galantamine in DLB has reported im- 
provements in cognitive fluctuations. There have been two RCTs 
investigating the effects of memantine in mixed DLB and PDD sam- 
ples [63]. Subgroup analyses revealed small, but statistically signifi- 
cant, improvements among the participants with DLB. Moreover, 
meta-analyses have supported the use of anti-dementia drugs in 
DLB [64]. 


Neuropsychiatric symptoms 


Evidence supporting the use of donepezil, rivastigmine, and 
memantine to treat neuropsychiatric symptoms in DLB is not robust. 
Some people with DLB may be on anti-parkinsonian medications. 
If they develop distressing visual hallucinations or other neuro- 
psychiatric symptoms, the possibility of reducing anti-parkinsonian 
medications should be considered first. One RCT has provided 
evidence for the efficacy of donepezil to treat delusions, hallucin- 
ations, apathy, and depression in DLB, but a subsequent larger RCT 
has not confirmed this effect [62]. Another RCT has showed that 
rivastigmine could mitigate delusions, hallucinations, apathy, and 
depression in DLB, but the effects were not statistically significant. 
There have been two RCTs providing debatable evidence for and 
against the efficacy of memantine to treat neuropsychiatric symp- 
toms in DLB. After considering a trial of an acetylcholinesterase 
inhibitor, the option of a low-dose dopamine antagonist may be con- 
sidered. As already noted, dopamine antagonists are poorly toler- 
ated by people with DLB, and there is insufficient evidence justifying 
their use. A small RCT has shown that risperidone was poorly toler- 
ated, and it worsened cognition and neuropsychiatric symptoms in 
DLB [65]. People with DLB tolerate olanzapine poorly, but a small 
RCT has reported that olanzapine 5 mg/day could significantly re- 
duce the severity of delusions and hallucinations [66]. A case series 
supported the use of quetiapine, but the only RCT investigating 
the effects of quetiapine in DLB was negative. There have not been 
any studies investigating the efficacy of clozapine to treat psychotic 
symptoms in DLB. However, extrapolating the evidence for its ef- 
ficacy to treat psychotic symptoms in PDD suggests that low-dose 
clozapine (6.25-50 mg/day) may be the drug of choice in DLB also 
[57]. Similarly, pimavanserin, a selective serotonin 5-HT,, inverse 
agonist, has not been evaluated in DLB, but level I evidence exists for 
its efficacy to reduce psychosis in PD. Future studies investigating 
its efficacy and safety in people with DLB are awaited with interest. 

Drugs may be used to treat depression or anxiety symptoms in 
DLB, but the evidence supporting their use is weak. Citalopram is 
the only drug for depression that has been investigated by an RCT for 
its efficacy in DLB. The RCT did not show any benefit, and it showed 
that citalopram was poorly tolerated [65]. Drugs with anticholin- 
ergic adverse effects should be avoided. Other drugs for depression 
may be used in DLB after careful consideration of their adverse ef- 
fect profiles and potential drug interactions. Besides, a case series 
each for zonisamide and ramelteon, as well as a very small RCT for 
the herbal medicine yokukansan, has provided weak evidence for 
their efficacy to lessen neuropsychiatric symptoms in DLB. 


Other symptoms 


Anti-parkinsonian medications are used in DLB to manage the 
extra-pyramidal symptoms, but they may worsen visual hallucin- 
ations and other neuropsychiatric symptoms. Moreover, levodopa 
response is variable in DLB. Assessing and managing the risk of 
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falls is essential for all people with DLB and mobility problems. 
Rivastigmine and memantine may reduce the frequency of RBD in 
DLB. Melatonin and low-dose clonazepam can be used to manage 
RBD. Gabapentin may be helpful in people with DLB presenting 
with restless legs syndrome. There is weak evidence to support the 
use of modafinil and armodafinil for managing excessive daytime 
sedation in DLB. Obstructive sleep apnoea is not uncommon in 
DLB, and its treatment may improve excessive daytime sedation in 
DLB. Fludrocortisone can be used to manage postural hypotension 
in DLB. Review of antihypertensive medications, salt supplemen- 
tation, and compression stockings should be considered. Trospium 
may help with urinary incontinence. Constipation in people with 
DLB can be managed by dietary changes, exercise, stool softeners, 
psyllium, or polyethylene glycol. 


Non-pharmacological interventions 


There have not been many systematic trials evaluating the effi- 
cacy of non-pharmacological interventions for managing various 
symptoms in DLB. Electroconvulsive therapy may be considered 
for people with DLB presenting with severe depression and psych- 
osis, and a few case reports, as well as a small case series, support 
its efficacy and safety in DLB [67]. Repetitive transcranial mag- 
netic stimulation may be helpful in DLB. A case study employing a 
novel structured intervention model—‘skill building through task- 
oriented motor practice’ —has reported functional gains in a woman 
with DLB [68]. Physical exercise, social interactions, person-centred 
care, cognitive training, and cognitive behavioural therapy may help 
people with DLB, but further research is needed to investigate their 
efficacy. 


Future research 


A consensus should be reached on the definition of prodromal DLB. 
It will help future research for identifying novel biomarkers to help 
with early diagnosis and for developing potential neuroprotective 
therapies [69]. Large, robust RCTs are needed for addressing many 
unanswered questions regarding the efficacy of available pharmaco- 
logical and non-pharmacological options for the symptomatic treat- 
ment of DLB. Genetics, transcriptomics, and epigenetics of DLB 
should be investigated further. Further research on extracellular a- 
synuclein and circulating exosomes may enhance our understanding 
of neurobiology of DLB and may reveal novel therapeutic avenues. 
Research on extracellular a-synuclein has already provided the im- 
petus for developing immunotherapeutic approaches for DLB [70]. 
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Neuropathology 


The neurobiological basis for Parkinson’s disease (PD) is the de- 
generation of nigrostriatal dopamine neurons and the pathological 
depositon of the protein, a-synuclein in intra-neuronal Lewy inclu- 
sions within vulnerable populations of neurons in the brain. Lewy 
bodies (LBs) (aggregations of proteins, including a-synuclein) and 
neurites (degenerating neurites with a-synuclein) have been con- 
sidered to be pathological hallmarks of PD (see Chapter 43). Braak 
and colleagues have proposed a staging schema by which the dis- 
tribution of Lewy pathology spreads through the brainstem in a 
prion-like fashion into the midbrain and subsequently throughout 
the cortex [1] (Table 44.1; Fig. 44.1). These pathological changes 
occur gradually and progressively over many years, with a signifi- 
cant period of clinically silent cell dysfunction and cell death or 
autonomic/sleep/mood symptoms before the onset of Parkinson’s 
motor symptoms. 

In the elderly, pathologies often occur on a background of age- 
related pathologies, which are extremely common in the over-75 age 
group. The pathologic substrate for Parkinsons disease dementia 
(PDD) and Parkinson’s disease with mild cognitive impairment (PD- 
MCI) appears to be heterogenous and includes LBs, Alzheimer’s dis- 
ease (AD) pathology, cerebrovascular disease, and other findings, 
including cerebral amyloid angiopathy (Fig. 44.2) [2]. 

The most compelling evidence to date suggests that Lewy-related 
pathology is the most important factor in the development of 
cognitive impairment in PD. The distribution of synuclein path- 
ology has been shown, to some extent, to map the clinical symp- 
toms [3-5]. However, neuronal loss can be present in the absence 
of Lewy pathology [6], leading to the investigation of smaller a- 
synuclein aggregates [7]. Further uncertainty about the role of 
synuclein pathology is caused by the presence of LBs in the ab- 
sence of Parkinsonian symptoms, termed incidental Lewy body 
disease. There also appears to be a synergistic relationship between 
a-synuclein, amyloid peptide, and tau proteins [8]. The additional 
presence of Alzheimer’s pathology is associated with a shorter time 
to dementia and a higher burden of Lewy pathology in the cortex. 
There are also some data suggesting that chronological ageing may 
be an important driving factor in the onset of dementia in PD 





patients [9], a factor also related to the prevalence of AD pathology 
in patients with dementia. 

Neuropathologic studies have also linked tau deposition to 
Lewy body diseases, with the following observations. In both pa- 
tients with DLB and those with PDD, the presence of tau patho- 
logic changes, in combination with AB and a-synuclein, has been 
shown to potentiate dementia [10]. Furthermore, as in patients 
with AD [11], tau aggregates in those with PD have been found to 
correlate with the severity of cognitive impairment [12], and tau 
aggregates measured at autopsy late in the course of the disease 
are commonly observed in both patients with PDD and those with 
DLB. However, greater tau burden has been noted in the tissue of 
patients with DLB than in those with PDD [13], raising the pos- 
sibility of a difference during life. The contribution of brain tau 
aggregates during life to the clinical manifestations and course 
of these diseases has only recently been delineated due to new 
imaging tracers (see Imaging under Biomarkers for PDD, p. 443, 
for imaging correlates). 

Insummary, it is very likely that there is a synergistic effect between 
a-synuclein pathology, age, and other pathologies, including AD 
and tau pathology, that is the main driver of cognitive decline in PD. 


Neurotransmitter systems 


Multiple neurotransmitter deficits have long been emphasized as of 
major importance in PD, and particularly in underlying the cogni- 
tive deficits. An understanding of these deficits has led to current 
management options for PDD, namely cholinesterase inhibitors and 
changes to dopamine medication use. 

Dopamine and PDD 

Dopaminergic neurons in the brain are found in three main mid- 
brain dopamine regions, projecting to the basal ganglia (nigrostriatal 
system), limbic regions (mesolimbic system), and cortical regions 
(mesocortical system; Fig. 44.3). 

The largest group of dopaminergic neurons are found in the 
substantia nigra (SN), which project upwards to the striatum 
(largely the putamen) in an important feedback loop controlling ac- 
tions and thoughts. The ventrolateral portion (red-VLa SN) of this 
loop degenerates early in PD before LB formation and dementia, 
leading to progressive motor deficits of bradykinesia, rigidity, and 
tremor helped by dopamine replacement. In PDD, there is greater 
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Table 44.1 Stages in the evolution of PD-related pathology 

















Stage l: medulla oblongata Constipation, hyposmia 
Stage 2: medulla oblongata and pons Sleep, depression, anxiety 
Stage 3: midbrain Motor PD 
Stage 4: basal prosencephalon and PD-MCI 
mesocortex 
Stage 5: neocortex PD-MCI/PDD 
Stage 6: neocortex PDD 
Source: data from Neurology, 64(8), Braak H, Rüb U, 


(2005), American Academy of Neurology. 


degeneration of the medial SN dopamine neurons [orange—medial 
SN and ventral tegmental area (VTA)], which project through the 
mesolimbic pathway and are involved in behavioural selection and 
impulsivity. Dopamine agonist use may enhance this pathway, con- 
tributing to impulse control disorders (ICDs) in PD. Degeneration 
of dopaminergic neurons in the VTA (yellow) may lead to a reduc- 
tion in cortical dopamine through the mesocortical system, affecting 
cognitive function. 


Other monoamine systems and PDD 


Marked degeneration of noradrenergic neurons in the locus 
caeruleus projecting to the forebrain are well described in PDD. 
Degeneration of serotonergic neurons in the median raphe nucleus 
as the disease progresses, with reduced serotonin innervation, is 
described. 


Acetylcholine and PDD 


An approximate 40% loss of cholinergic pedunculopontine neurons, 
which project to the thalamus, is observed. However, degeneration 
is most marked in the nucleus basalis, leading to severe, widespread 
cortical reductions in choline acetyltransferase, which correlate with 
the extent of cognitive impairment and can be demonstrated with 
functional imaging studies. Anticholinergic medications accelerate 
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Fig. 44.1 How PD pathology spreads upwards in PD. 


Area of pathology Possible clinical correlate Subregions affected 


Lesions in dorsal IX/X motor nucleus 
Pathology of stage 1 plus lesions in raphe, reticular, and caeruleus-subcaeruleus nuclei 
Pathology of stage 2 plus midbrain lesions, especially substantia nigra 


Pathology of stage 3 plus prosencephalic lesions, temporal mesocortex 


Pathology of stage 4 plus higher-order sensory neocortical lesions 





Pathology of stage 5 plus first-order sensory association areas of neocortex 


ansen Steur ENH, et al., Cognitive status correlates with neuropathologic stage in Parkinson disease, pp.1404-10, Copyright 


PD cognitive impairment, while cholinesterase inhibitors, such as 
donepezil and rivastigmine, benefit PDD. 


Summary of neurotransmitter systems 


In summary, the emergence of PDD occurs on a background of se- 
vere dopamine deficits and correlates with a marked loss of limbic 
and cortically projecting dopaminergic, noradrenergic, serotonergic, 
and acetylcholinergic neurons. 


Epidemiology and risk factors 





The heterogeneity of cognitive deficits found in PDD, coupled with a 
variability in cognitive tests and diagnostic criteria, cause significant 
problems when comparing published studies. Recent attempts to 
address this have included the publication of diagnostic standards, 
which have included the introduction of the concept of PD-MCI. 
This classification system is drawn from the study of AD as a means 
of classifying cognitive problems which do not meet the criteria for 
dementia. 

Dementia is common in PD; while the point prevalence of PDD 
is estimated to be 31.3% (95% CI 29.2-33.6%) [14], the risk of 
developing dementia increases with time. While not all suffering 





Reproduced from Neurology, 64(8), Braak H, Rüb U, Jansen Steur ENH, et al., Cognitive status correlates with neuropathologic stage in Parkinson disease, pp. 1404-10, 


Copyright (2005), with permission from American Academy of Neurology. 
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Fig. 44.2 (see Colour Plate section) Tissue histopathology of PD and PDD. (a) Transverse section through the midbrain of a control (at left) showing 











Copyright (2014), with permission from John Wiley and Sons. 


dementia, even in the earliest stages of the disease, up to half will 
have a degree of cognitive impairment. The onset of dementia it- 
self is highly variable, with a mean onset of 6 years and up to half 
of patients developing dementia within 10 years from diagnosis in 
a community study followed up from diagnosis [15]. By 20 years, 
most (the long-term cumulative prevalence is up to 80%) will have 
developed dementia, although not all [16, 17]. 

While the cognitive decline is progressive, it is non-linear and 
heterogenous. The pattern of cognitive deficits (including visuo- 
spatial, executive, and memory domains early on) varies consid- 
erably between individuals, as does the rate of the evolution of the 
symptoms. Overall, however, allowing for this inter-individual vari- 
ability, the rate of global decline (as measured by global cognitive 
tasks) is slower initially and increases with disease duration and the 


he darkly pigmented SN in the ventral aspect of the midbrain, whereas the pigmented neurons in this structure are lost in patients with PD (at right). 
b) Higher magnification (box in (a)) of a haematoxylin and eosin-stained section through the SN, showing only a few pigmented neurons remaining 
with many smaller phagocytic microglia. (c) and (d) Higher magnification of a haematoxylin and eosin-stained (c) and an a-Syn-immunoreactive 

d) pigmented neuron in the SN of a PD patient containing an LB. (e) and (f) a-Syn-immunoreactive LBs and Lewy neurites in the amygdala (e) and 
anterior cingulate cortex (f) of a patient with PDD. (g) Silver-stained neurofibrillary tangle in the cortex of a patient with PDD. (h) Beta-amyloid- 
immunoreactive plaques in the cortex of a patient with PDD. (i) Vascular ischaemic tissue damage identified in a haematoxylin and eosin-stained 
section of the globus pallidus in a patient with PDD. (j) Beta~amyloid-immunoreactive congophilic angiopathy in the cortex of a patient with PDD. 
Reproduced from Mov Disord., 29(5), Halliday GM, Leverenz JB, Schneider JS, et al., The neurobiological basis of cognitive impairment in Parkinson's disease, pp. 634-50, 








onset of dementia [18]. This may be because the global cognition 
scores are not sensitive enough to pick up the subtle changes earlier 
in the disease course but has been a consistent finding. The decline is 
overall faster than reported in age-matched control groups. 

Studies consistently agree that older age increases the risk of more 
rapid cognitive impairment [15, 19]. Other features, such as vascular 
risk factors [20], the presence of REM sleep behaviour disorder [21], 
motor impairment, poor fluency, and visuo-spatial dysfunction [15] 
may also predict an earlier dementia. The impact of PDD is substan- 
tial, with major consequences for functioning, institutionalization, 
psychiatric comorbidity, caregiver burden, and mortality [22]. Once 
dementia has developed, progression of physical and cognitive de- 
cline to death is similar across all groups, lasting on average 3 years 
(range 1-9). 
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(cognitive function) 
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Fig. 44.3 (see Colour Plate section) Dopamine pathways affected in PD and PDD. Red outline: SN, which contains both dopamine neurons in 
the pars compacta that give rise to the nigrostriatal projections, and GABA neurons in the pars reticulate, which innervate the thalamus. Dotted 
red line: ventrolateral (VLa) SN, which is selectively damaged in patients with PD. Yellow outline: ventral tegmental area (VTA), which contains both 
dopamine and non-dopamine neurons that project to limbic and cortical regions. Dotted orange outline: medial SN and VTA, which give rise to 
mesolimbic projections affected in patients with PDD. cp, cerebral peduncle; N. acc, nucleus accumbens; R, red nucleus. 

Reproduced from Mov Disord., 29(5), Halliday GM, Leverenz JB, Schneider JS, et al., The neurobiological basis of cognitive impairment in Parkinson's disease, pp. 634-50, 


Copyright (2014), with permission from John Wiley and Sons. 


Genetic risk factors and PDD 
Familial PD 


Several mutations have been found to be associated with PD, and 
it is generally accepted that interplay of genetic and environmental 
factors in an ageing brain are influencing disease development and 
progression. However, despite clear evidence for monogenetic and 
susceptibility genes in PD, the role of genetic factors in the develop- 
ment of dementia in PD is less clear [23]. Dementia is more common 
in patients with PD with a strong family association of PD probably 
reflecting the influence of genetic factors in the development of de- 
mentia in PD [24]. Mutations in the a-synuclein gene (SNCA) can 
be found even in sporadic PD. Functional studies on brain tissue 
revealed that SNCA genomic copy number and gene expression are 
related, as an increasing number of SNCA copies leads to an increase 
in a-synuclein expression, as well as LB formation. Consistent with 
these findings, a relationship between SNCA dosage and clinical 
phenotype, including age at onset, progression, and development of 
dementia, has been reported. 

Mutations in the leucine-rich repeat kinase 2 gene (LRRK2) and 
parkin are the two most common genetic causes of Parkinsonism. 
Patients displaying various mutations in LRRK2 have a low rate of 
cognitive dysfunction and dementia. In parkin mutations, progres- 
sion of the disease tends to be rather slow, and dementia is not a 
common feature. Much less is known regarding the association with 
dementia of two other mutations associated with PD—Pink1 and 


DJ-1. It should be noted, however, that these monogenic forms of PD 
are generally rare and will occur with an overall frequency of <5% in 
community-ascertained, unselected Caucasian PD cohorts. 


Cognition and risk genes 


Mutations in the genes transcribing microtubule-associated protein 
tau (MAPT), apolipoprotein E (APOE), glucocerebrosidase (GBA), 
and SNCA have all been associated with increased dementia risk in 
PD. MAPT helps assemble and stabilize microtubules throughout 
the central nervous system, predominantly found in the axons of 
neurons. The gene encoding MAPT is located on chromosome 
17q21, part of a 900-kb fragment which is commonly inverted, 
causing two common haplotypes H1 and H2. Despite the fact that 
tau pathology is not a neuropathological feature of PD, the MAPT 
gene (H1/H1 MAPT haplotype) has been consistently identified as 
a risk gene for both PD and PDD [15, 25, 26] Interestingly, a gene- 
gene interaction between MAPT and SNCA was also reported [27]. 
Mutations in the GBA gene have been found to increase both PD 
riskand the risk of PDD, with affected patients presenting at a younger 
age and developing dementia earlier. Visuo-spatial deficits specific- 
ally have been associated with this haplotype, even prior to the onset 
of dementia, with regionally specific changes in cortical activation 
[28-30] The link between dementia risk and the APOE £4 genotype 
is strongly established, with the s4 allele associated with a higher risk 
and an earlier onset of AD. In PD, however, there is no association 
between the APOE ¢4 genotype and PD risk. However, this genotype 


is an independent predictor of PDD, even when controlling for AD 
pathology severity [31]. The enzyme catechol-O-methyltransferase 
(COMT) is an important regulator of synaptic dopamine, particu- 
larly in the frontal and prefrontal cortices. A common functional 
polymorphism causing lower enzymatic activity is linked to poor 
attention and executive performance in early PD subjects; however, 
this genotype is not predictive of future dementia [32]. Lastly, a re- 
cent study characterized the SNCA gene locus in PDD and DLB, and 
found a PDD risk haplotype that was distinct from DLB, with associ- 
ation profiles in single-nucleotide polymorphisms across the SNCA 
gene for Parkinsonism and dementia [33]. 





Classification, clinical criteria, and controversies 


Cognitive impairment in PD is heterogenous both in severity and 
pattern and is subject to influences both integral and external to the 
disease. Diagnostic criteria have been developed by the Movement 
Disorders Society that help to guide clinicians and researchers to 
an accurate diagnosis of PD-MCI [34] or PDD [35]. These cri- 
teria largely meet the Diagnostic and Statistical Manual of Mental 
Disorders, fourth edition criteria for dementia. In both PD-MCI and 
PDD, gradual cognitive decline within the context of established 
PD and objectively demonstrated cognitive deficits are core features 
(Table 44.2). 

In PDD, cognitive deficits in more than one domain are required, 
whereas in PD-MCI, a single domain may be affected. In PDD, the 
deficits must be severe enough to impair activities of daily living, 
whereas in PD-MCTI, they must not be sufficiently severe to inter- 
fere with functional independence (although minor difficulties may 
be present). To operationalize these criteria and to assess the pat- 
tern and severity of cognitive dysfunction, we need: (1) valid meas- 
ures of cognitive abilities covering the major domains of cognition; 
(2) a method to determine whether or not the performance repre- 
sents a decline from a persons previous level of functioning; and 
(3) an assessment of how the individual's cognitive abilities enable 
(or disable) function in day-to-day activities. These will be further 
discussed in Diagnosis and differential diagnosis, p. 441 later in this 
chapter. 


Table 44.2 Criteria for PDD 
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Although cognitive impairment in PD exists on a continuum of 
severity, it is often divided into two categories—mild cognitive im- 
pairment and PDD, based on the extent to which the impairment 
interferes with activities of daily living (Table 44.2). This classifica- 
tion system risks oversimplifying a very complex entity but has been 
fundamental to clinical trials of therapies for cognitive dysfunction 
in PD, which have been largely devoted to the treatment of PDD. 
Currently, therefore, evidence-based treatment of cognitive impair- 
ment in PD is restricted to PDD and depends on making a diagnosis 
of this entity. Identifying mild cognitive impairment is also useful, 
however, because it has prognostic significance. Several longitu- 
dinal studies have demonstrated that PD-MCI is a risk factor for 
developing PDD. There is also evidence that the pattern of cognitive 
deficit is prognostically important(see [36] for review). 


Controversies: the 1-year rule 


With advances in knowledge, disease boundaries may change and 
require redefinition of the disease. In the case of PD, as diagnostic 
criteria currently stand, dementia developing before the second 
year of Parkinsonism is an exclusion criterion for PD; the diagnosis 
is DLB. 

If dementia starts after 1 year, the diagnosis is PDD. Beyond the 
arbitrary nature of the 1-year rule, there is increasing controversy 
about whether the distinction itself is valid. PDD and DLB share 
many similarities in dementia presentation, neuropsychological 
findings, non-motor profile (olfactory loss, depression, sleep dis- 
orders, and autonomic dysfunction in both), imaging, genetics, and 
pathology (Fig. 44.4). However, broad differences do exist and are 
summarized in Fig. 44.3. Fig. 44.5 illustrates how the type of cortical 
pathology and clinical presentation of PDD and DLB might interact. 
A recent MDS task force proposed that the 1-year rule separating 
PDD and DLB be omitted [37]. The authors argued that rather, when 
a patient presents with motor signs and meets full clinical criteria 
for PD, the diagnosis of PD is applied, regardless of the presence or 
timing of dementia. In other words, dementia is no longer an exclu- 
sion criterion for PD. For those patients who already carry a DLB 
diagnosis (according to consensus criteria [38]), the authors argued 
that the diagnosis can optionally be qualified as PD (DLB subtype)’ 
Note that this proposal would not invalidate the diagnostic category 





Probable PDD 1. PD diagnosis 1. Typical cognitive deficits in two of four domains 1 Vascular disease on imaging or other abnormality that 
(attention, executive function, visuo-spatial may Cause cognitive impairment, but not dementia 
function, and free recall) 2. Unknown time interval between motor and cognitive 

symptoms 
2. Slowly progressive dementia 2. At least one behavioural symptom (apathy, 3. Acute confusion resulting from systemic diseases or 
syndrome depression/anxious mood, hallucinations, abnormalities or drug intoxication 
delusions, or excessive daytime sleepiness) 4. Features compatible with probable vascular dementia 

PossiblePDD 1. PD diagnosis 1. Atypical cognitive deficits in one or more 1. Acute confusion resulting from systemic diseases or 
domain (fluent aphasia or storage-failure abnormalities or drug intoxication 
amnesia) with preserved attention 

2. Slowly progressive dementia Vascular disease on imaging or other abnormality 2. Features compatible with probable vascular dementia 


syndrome that may cause cognitive impairment, but not 
dementia, and/or unknown time interval between 


motor and cognitive symptoms 





Reproduced from Mov Disord., 22(12), Emre M, Aarsland D, Brown R, et al., Clinical diagnostic criteria for dementia associated with Parkinsons disease, pp. 1689-1707, Copyright 
(2007) with permission from John Wiley and Sons. 
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Fig. 44.4 Similarities and differences between PDD and DLB. 


of DLB. In clinical communication with patients with the DLB sub- 
type, the diagnostic term DLB could continue to be used [37]. 
Contrary to this, others have argued that the 1-year rule 
distinguishing PDD from DLB is worth maintaining because it 
serves an important purpose in clinical practice, clinical and basic 
science research and when helping the lay community understand 
the complexity of these different clinical phenotypes. Furthermore, 
opponents believed that adding an additional diagnostic label “PD 
(dementia with Lewy bodies subtype)’ will confuse, rather than 
clarify, the distinction between DLB and PD or PDD and will not 
improve management or expedite therapeutic development [39]. 


Clinical features 


Cognitive syndrome 
PD-MCI 


The spectrum of impairment ranges broadly in phenotype, as well as 
in timing in the disease course. Cognitive deficits can occur in one 
or more domains, vary in severity, and present differently at various 





stages of the disease. Studies of incident PD indicate that cognitive 
impairment is not just a late-stage problem but occurs in up to 50% of 
early cases, including untreated individuals, who manifest PD-MCI 
[40-43]. Executive function represents the most common cognitive 
domain affected in PD early on, as well as later, in the disease. Deficits 
can be detected on tests that are sensitive to frontal dysfunction (for 
example, tests of planning, spatial working memory, and attentional 
set shifting). However, impairments in attention, explicit memory, and 
visuo-spatial function also are demonstrable in early PD. Executive 
dysfunction may occur individually as a single-domain impairment 
or in combination with other cognitive deficits as multiple-domain 
impairments. Some, but not all, studies investigating clinical features 
and frequencies of cognitive impairment in PD patients without de- 
mentia revealed that non-memory (non-amnestic) single-domain 
deficits are the most frequent cognitive subtype. 


PDD 


The cognitive profile of PDD remains variable, although it often af- 
fects cognitive domains similar to those affected in PD-MCI, but 
with more severe deficits and with the disruption of multiple areas. 
The onset of PDD is insidious. In one prospective study, the mean 
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Fig. 44.5 Cortical pathology and clinical presentation of PDD and DLB. Dementia in DLB/PD is associated with two major 
pathologies: synucleinopathy (a-Syn, i.e. Parkinson pathology) and neuritic amyloidopathy (i.e. Alzheimer pathology). In PD patients who develop 


dementia very late in thei 


r illness, or not at all (far left), neuritic amyloid deposition is minimal (or absent), and cortical pathology is mainly that of a-Syn 


deposition. At the other extreme, DLB patients with predominant neuritic amyloid deposition and very minimal a-Syn deposition would usually be 


diagnosed as AD during 
and DLB. 


ife, developing clinical DLB hallmarks late (if at all). Between these two extremes of the spectrum lie the most patients with PD 


Reproduced from Mov Disord., 29(4), Berg D, Postuma RB, Bloem B, et al., Time to redefine PD? Introductory statement of the MDS Task Force on the definition of Parkinson's 
disease, pp. 454-62, Copyright (2014), John Wiley and Sons. Reproduced under the Creative Commons Attribution License (CC BY). 





annual decline on the MMSE during 4 years was 1 point in the non- 
demented and 2.3 points in the PDD group, the latter figure being 
similar to the decline observed in patients with AD. A similar rate 
of decline was reported in another longitudinal study; the mean de- 
cline in MMSE over 2 years was 4.5 and was comparable to that seen 
in patients with DLB, with a mean decline of 3.9 points [35]. 

By definition, PDD includes impairment in at least two cognitive 
domains but, as per MDS criteria, does not require memory deficits. 
The predominant cognitive deficits in late PDD are similar to those 
in DLB, with marked visuo-spatial dysfunction and fluctuating at- 
tention. Impairments in executive function, working memory, and 
episodic memory are also common in PDD, although language, par- 
ticularly as measured by object naming, tends to be relatively pre- 
served (see [44] for review). 


Longitudinal relationship between PD-MCI and PDD 


The CamPalGN study, a longitudinal population-based cohort ofin- 
cident PD, recently reported 10-year follow-up data [15]. Analyses 
at multiple time points in this cohort (n = 142) indicated that, aside 
from age, the most significant baseline predictors of later dementia 
were impaired semantic fluency and pentagon copying (hazard 
ratios of 3.1 and 2.6, respectively, for dementia at 10 years from diag- 
nosis). There was no association between ‘fronto-striatal-based’ ex- 
ecutive dysfunction and later dementia and, in fact, there was no 
decline in executive function performance over this time. 


Neurological findings 


By definition, the PDD patient will have features of established 
motoric Parkinsonism, of which bradykinesia is key—characterized 
by a progressive decrement in the amplitude or velocity of move- 
ment (or both), rather than the generalized slowness seen with cere- 
bellar or pyramidal disorders. Rigidity and tremor are common, 
and in general, motor Parkinsonism will be more severe than that 
seen in DLB or the mild extra-pyramidal features associated with 
AD. Parkinsonism is typically levodopa-responsive, with levodopa- 
induced dyskinesias common at this advanced stage and other fea- 
tures such as wearing-off periods. As the disease progresses, severe 
Parkinsonian symptoms with gait freezing are common, so that the 
patient frequently falls and becomes wheelchair-bound. The quality 
of the dementia resembles a subcortical pattern, with pronounced 
psychomotor slowing, not usually accompanied by severe aphasia, 
agnosia, or apraxia. Patients perform disproportionately poorly on 
timed tests and visuo-spatial tasks. 


Psychiatric issues 


A variety of neuropsychiatric symptoms, including depression, 
anxiety, hallucinations, apathy, psychosis, and ICDs, occur com- 
monly in up to 90% of PDD patients [45]. Psychotic symptoms 
affect 60% of PD patients long term and include hallucinations 
and delusion. The occurrence of these symptoms are predictors 
of nursing home placement and mortality. Complex visual hal- 
lucinations, often of animals or people, are the most typical 
manifestation, occurring in 40-90% of PDD patients, but other 
visuoperceptual disturbances can occur, including illusionary ex- 
periences, sensations of movement in the periphery (passage hallu- 
cination), and a feeling of presence (extracampine hallucination). 
Auditory, tactile, and olfactory hallucinations are less common. 
Delusions manifest when insight is compromised, so they are 
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associated with a degree of cognitive impairment. Delusions in PD 
tend to be paranoid in nature, and other phenomena, including 
delusional misidentification (for example, Capgras and Fregoli 
syndromes), can occur in a minority. 

Depression is one of the most common non-motor symptoms 
in PD, with clinically relevant symptoms occurring in 35% of PD 
patients. Rates are higher in PD-MCT, and there is a known associ- 
ation between depression and cognitive impairment. Generalized 
anxiety disorder (GAD) is the most commonly diagnosed anxiety 
condition in PD, followed by panic attacks and phobias. Anxiety 
is also strongly linked to depressive symptoms but appears to be 
less common in PDD than PD, and is also related to motor symp- 
toms, particularly occurring in the off condition in patients with 
motor fluctuations. Apathy, defined as a decrease in goal-directed 
behaviour, verbalization, and mood, is common in a range of 
neurodegenerative diseases, including PD. It is usually accom- 
panied by reduced self-awareness, so changes are noticed and 
brought to the attention of clinicians by caregivers. A common 
assumption is that the patient is depressed, although a lack of 
endorsement of sad mood suggests apathy instead. Apathy can 
occur independently of cognitive impairment in PD, but overlap is 
common. Some studies estimate the prevalence of apathy in PDD 
to be up to 50%. 

ICDs (for example, compulsive gambling, buying, sexual behav- 
iour, and eating) are increasingly recognized as common and clin- 
ically significant disorders in PD. Given that research suggests that 
the strongest risk factors for ICD development in PD are dopamine 
agonist treatment and younger age, and given that cognitively im- 
paired patients are more likely to be older and less likely to be pre- 
scribed a dopamine agonist, it is not surprising that ICDs are not 
commonly reported in PDD patients. In one single-centre study 
of 805 PD patients, ICD symptoms were less common in PD pa- 
tients with dementia (3.8%), compared with non-demented patients 
(9.6%) [46]. 





Diagnosis and differential diagnosis 


As mentioned in Classification, clinical criteria, and controversies, 
p. 439, the clinical diagnosis of PDD requires: (1) valid measures 
of cognitive abilities covering the major domains of cognition; (2) a 
method to determine whether or not the performance represents 
a decline from a person’s previous level of functioning; and (3) an 
assessment of how the individual’s cognitive abilities enable (or dis- 
able) function in day-to-day activities. Core features are established 
PD (clinical diagnosis usually made by a physician) and objectively 
demonstrated cognitive deficit. 


Clinical evaluation 


The cornerstone in the evaluation of a patient with suspected PDD 
is detailed clinical and neurological history and examination, 
including interview with a close informant. Assessment of social 
functions and activities of daily living, as well as psychiatric and be- 
havioural symptoms, is part of the basic evaluation. 


Measuring cognitive function 


Cognitive function in PD is generally measured in one of three 
settings: (1) in the course of an evaluation by a physician or an 
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occupational or speech therapist; (2) by a neuropsychologist per- 
forming a dedicated clinical cognitive evaluation; or (3) in a research 
study. These assessments require different tools; a physician or an 
occupational therapist usually requires a relatively short global cog- 
nitive scale with adequate sensitivity to screen for the presence of 
cognitive impairment or to follow changes over time (Table 44.3). 
A neuropsychologist uses multiple tests, each emphasizing a specific 
cognitive domain, to provide a detailed assessment of the pattern 
of cognitive dysfunction. Research studies require variable detail, 
depending on the goals of the study, but generally necessitate an in- 
strument with good specificity for an accurate diagnosis and often 
responsiveness to change over time. Thus, methods for assessing 
cognitive function in PD need to be selected with careful thought 
to the goals of the assessment [36]. A useful summary of cognitive 
screening measures and their approximate administration time is 
given later in this chapter. 

In terms of selecting a brief bedside screening tool, the author 
would generally favour the MoCA over the MMSE for assessing PD 
cognition, as it has greater sensitivity across a range of cognitive sub- 
domains, shows less ceiling effects, and is more sensitive than the 
MMSE in detecting longitudinal change over time [41]. 


Establishing cognitive decline 


As described previously, to establish a diagnosis, it is critical to 
understand whether or not any impairment represents a decline 
from a premorbid level of cognitive functioning. Subjective reports 
of decline are often spontaneously reported during clinical and re- 
search contacts and can be elicited with general questions about 
concerns related to memory or thinking. A more formal method 
is to use a cognitive complaint interview. However, patients are not 
very insightful into their own difficulties, questioning the reliability 
of this approach. Work in AD has suggested that patients with earlier 
dementia, but not MCI, underreport cognitive difficulties, com- 
pared with caregivers, suggesting that incorporating caregiver re- 
port of cognitive complaints is important. 

To the author’s knowledge, this issue has not been evaluated in 
PD; however, the use of a caregiver-reported questionnaire—the 
short form of the Informant Questionnaire on Cognitive Decline 
in the Elderly (SHORT IQ-CODE) [47]—is currently being evalu- 
ated, to assess for cognitive decline in the Oxford Discovery PD 


cohort. Lastly, cognitive decline can be established on the basis 
of serial performance on cognitive testing by a clinician or on the 
basis of results from neuropsychological test performance that is 
poorer than expected based on an estimate of a person’s premorbid 
cognitive abilities. Estimating premorbid verbal IQ can be accom- 
plished using a reading test such as the Wechsler Test of Adult 
Reading or the National Adult Reading Test, which are relatively 
resistant to change in the face of common neurodegenerative 
conditions. 


Assessing function in relation to cognition 


Impairment on the patient’s ability to carry out instrumental activ- 
ities of daily living (IADLs) is an essential feature for establishing the 
severity of cognitive impairment. This is especially relevant for non- 
demented PD subjects in whom some aspects of altered functioning 
in IADLs may appear unnoticeable before the diagnosis of dementia 
without a formal examination. 

As described, the presence of significant functional deficiency due 
to cognitive impairment is embedded in the current criteria to sup- 
port the diagnosis of PDD. Moreover, the major formal differenti- 
ation of PD-MCI from PDD has typically required cognitive deficits 
not to interfere significantly with the patient's ability to implement 
IADLs. Presently, judgement as to the PD patient’s ability to adapt 
to the demands of the environment and execute IADLs is mostly 
derived from indirect methods (for example, cognitive testing), un- 
structured interviews with relatives or other caregivers, or the use of 
functional scales intended for other dementias. 

Scales not specific for PD do not take into account the motor im- 
pact of the disease and can overestimate the extent to which cog- 
nitive dysfunction is contributing to problems carrying out IADLs. 
The lack of a recommendable instrument, capable of measuring the 
specific impact of cognitive decline in PD, minimizing the motor 
symptoms of the disease, has made it challenging to set a standard 
for what is meant by ‘significantly’ (that is, dementia) or ‘subtly re- 
duced’ (that is, MCI) functional performance. Two new PD-specific 
instruments—the PD-Cognitive Function Rating Scale (PD-CFRS) 
[48] and the brief Penn Daily Activities Questionnaire (PDAQ) 
[49]—may help fill the need for assessments of IADLs that are sen- 
sitive to cognitive impairment in PD, while minimizing the motor 
aspects of the disease. 


Table 44.3 Generic and non-Parkinson’s disease specific cognitive screening measures 


Scale name Assessed cognitive domains Approximate 
administration time 





Mini-Mental State Examination (MMSE) Orientation, verbal registration and recall, attention,naming and 10 min 
repetition, verbal comprehension, praxis, visuo-spatial 

Montreal Cognitive Assessment (MoCA) Orientation, attention, memory, naming, fluency, verbal repetition, 10 min 
visuo-spatial/executive 

Addenbrooke Cognitive Examination (Revised)—ACE(R) Attention/orientation, memory, fluency, language, visuo-spatial 20 min 

Cambridge Cognitive Assessment (Revised)—CAMCOG(R) Orientation, language, memory, attention, praxis, calculations, abstract 25 min 
reasoning, perception 

Dementia Rating Scale (2nd edition)/Mattis Dementia Rating Attention, initiation/perseveration, construction, conceptualization, 30 min 

Scale—DRS (2) memory 

Repeatable Battery for the Assessment of Neuropsychological Attention, language, visuo-spatial/construction, immediate memory, 30 min 

Status (RBANS) delayed memory 

Alzheimer's Disease Assessment Scale—Cognition (ADAS-Cog) emory, language, praxis 30 min 





Differential diagnosis 


The differential diagnosis of PD is extensive; of particular note, 
those that are also degenerative are associated with dementia/ 
MCI, including progressive supranuclear palsy (PSP), multiple 
system atrophy (MSA), DLB, and cortico-basal degeneration 
(CBD). Occsaionally, frontal lobe disorders nay be mistaken for 
Parkinsonian disorders, such as Pick’s disease, and non-degenerative 
aetiologies such as frontal lobe meningiomas or drug-induced 
Parkinsonism. Other progressive, non-degenerative diseases that 
have some Parkinsonian features include multi-infarct dementia 
and normal pressure hydrocephalus, usually excluded on an MRI 
brain scan. Occasionally, Parkinsonian features occur in patients 
with AD. 


Biomarkers for PDD 


The point prevalence of dementia in patients with PD is 25%, and 
it rises to 80% in patients who live for >20 years with the disorder. 
Cognitive decline in PD worsens the patients prognosis more than 
any other non-motor symptoms. A prognostic marker delineating 
the probability for those patients at risk for cognitive decline would 
be of utmost importance. While PD-MCT is a risk factor for PDD, it 
is a heterogenous entity, and it is not known which types of PD-MCI 
confer a higher risk of progression to dementia. In this sense, useful 
biomarkers are needed that can predict future outcome or that are 
useful to longitudinally track the underlying disease pathology in 
an objective way. In this section, we summarize the current know- 
ledge on cerebrospinal fluid (CSF) and blood proteins, imaging, and 
the impairment of gait/postural instability as risk markers for cogni- 
tive decline in PD. Genetic predictors of PDD have been discussed 
elsewhere. 


Cerebrospinal fluid 


The presence of LBs, amyloid plaques, and neurofibrillary tangles 
in the neocortex and limbic system is associated with dementia 
and MCI in PD. Hence, the levels of amyloid-6 (AB), tau protein, 
and a-synuclein have been studied in the CSF of PD patients (see 
[50, 51] for review). In most studies, there was less AB in PDD than 
in healthy controls, and lower levels of AB were associated with 
progression to dementia in PD and cognitive measures. By con- 
trast, data for total (t-tau) and phosphorylated tau (p-tau) are less 
consistent, with increased or unchanged levels in PDD patients. 
Although total a-synuclein was similar in PDD and controls in ini- 
tial studies, technically more advanced analyses showed that PDD 
patients have more oligomeric forms of a-synuclein, and a higher 
total a-synuclein concentration was associated with a faster decline 
in cognitive performance in de novo patients. However, most studies 
have failed to find any association between total or oligomeric a- 
synuclein and cognition in PD patients. 

Proteins involved in inflammatory processes, oxidative stress, and 
neuronal viability have also been investigated in the CSF, with ele- 
vated C-reactive protein, IL-6, and IL-1b in PD-MCI compared to 
PD normal-cognition patients or controls. Uric acid (UA), a scav- 
enger of free radicals, and cystatin C, which has anti-amyloidogenic 
properties, were also reduced in PDD and DLB patients. Despite 
some variability, reduced AB in PDD patients and those who pro- 
gress to PDD is consistent, and this suggests that the AB protein 
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might represent a useful biomarker to identify specific types of PD- 
MCI that might be at higher risk of suffering dementia. 


Plasma/serum and urine 


Plasma or serum levels of proteins involved in inflammation (C- 
reactive protein), oxidative stress (UA), or neuroprotection (vitamin 
D, transthyretin) were not different in PDD and cognitively normal 
PD patients [50, 51]. However, in PD patients with normal cogni- 
tion, low UA concentrations were associated with a worse outcome 
in global cognition, attention, and memory; high vitamin D levels 
with better semantic fluency and memory; and high concentrations 
of IL-6, tumour necrosis factor-a, and interferon-y-induced protein 
10, with lower cognitive scores. Importantly, low levels of epidermal 
growth factor (EGF) and insulin-like growth factor (IGF) have cer- 
tain predictive values for the development of dementia and cognitive 
decline, and IGF positively correlates with global cognition and ex- 
ecutive function. 

There is no consistent relationship between plasma homocyst- 
eine and dementia or worse cognitive outcome. Lipids have also 
been evaluated because abnormal lipid peroxidation may play a role 
in the pathogenesis of PD and other neurodegenerative diseases. 
Whereas plasma levels of phospholipids were higher in PD-MCI 
than in normal cognition PD subjects, prostaglandin isomers de- 
rived from free radical peroxidation of polyunsaturated fatty did not 
differ. In keeping with findings in plasma, low UA levels are associ- 
ated with poor neuropsychological performance. These findings are 
consistent with recent data linking neurodegeneration and ageing 
with disturbances in lipid metabolism and neuroinflammation. 


Imaging 


(See [50 ] for a review of this topic.) 


MRI grey matter changes 


Although there are many studies in this field, the most valuable are 
those with larger cohorts and more advanced analytic approaches, 
especially the longitudinal studies. Accordingly, reduced cortical 
volume or thickness in several areas, and especially in the hippo- 
campus, appears to be associated with progression to dementia 
and MCI. This is a promising avenue to be followed, in which well- 
designed prospective studies using modern analytical models might 
help to validate these findings or identify new patterns that could 
serve as potential biomarkers. 


White matter microstructure 
Diffusion tensor imaging 


Reduced fractional anisotropy (FA) or increased mean diffusivity 
(MD) in diffusion tensor imaging studies can indicate alterations in 
the microstructure of white matter (WM) tracts. Both approaches 
show that dementia and MCI in PD are associated with extensive 
areas of modified WM microstructure, with reduced FA being wide- 
spread in PDD along the main tracts. 


Functional MRI 
Cerebral blood flow 


Functional MRI (fMRI) in resting state or during the execution of 
tasks indirectly measures neural activity and is used to study re- 
gional activation of the brain and the association or dependency 
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between two or more anatomic locations, termed functional con- 
nectivity. One interesting approach is to study the default network 
that reflects the predominant activity at rest, which is dampened 
when switching to a cognitive task. In PDD patients, this net- 
work has weaker connectivity in the right inferior frontal gyrus 
and is less intensely deactivated than in controls when confronted 
with a complex visual task. Considering the data available, it can 
be speculated that there are two main functional networks in the 
resting state: one more anterior that seems to be related to ex- 
ecutive dysfunction, and another more posterior one that might 
herald the evolution to dementia. This would also be consistent 
with observations derived from the longitudinal CamPAIGN co- 
hort (see Longitudinal relationship between PD-MCI and PDD, 
p. 441). 


PET and single-photon emission computed tomography 
(SPECT) imaging 


Cholinergic ligands 


Pathological studies and pharmacological trials with acetylcholin- 
esterase inhibitors indicate that cholinergic dysfunction is relevant 
in dementia in PD. Studies using different radiotracers show that the 
cholinergic activity in PDD patients was weaker in the whole cortex 
and in the occipital, precentral, parietal, temporal, and posterior 
cingulate cortices than in healthy controls. PET studies indicate 
that assessing the cholinergic state might be useful as a biomarker 
of dementia in PD, but current accessibility limits their clinical and 
research use. 

Af ligands 

Fibrils of AB can be assessed in vivo by Pittsburgh compound B 
(PiB) PET imaging [52]. Studies showed that cortical AB deposition 
is common in individuals with PD and PDD, that high levels of AB 
are observed in most cases of DLB [53], and that greater deposition 
of Af is a risk factor for cognitive impairment in patients with PD, 
accelerating cognitive decline once established [54]. These findings 
were consistent with prior neuropathologic reports. In AD, disease 
progression occurs in the context of high A$ levels and is associated 
with the spread of tau deposits from the medial temporal lobe to the 
basal temporal neocortex and then to other neocortical regions [55], 
in association with regional neuronal loss. From the few such studies 
undertaken in PD patients, the in vivo results of PiB-PET studies are 
rather variable, with low sensitivity and specificity in the diagnosis 
of dementia and MCI in PD patients. 


Regional blood flow/glucose uptake ligands 


Reduced regional cerebral blood flow and FDG-PET uptake in the 
posterior cortical areas seem to be useful biomarkers of dementia in 
PD, in line with fMRI data. 


Tau ligands 


In the last year, the radioligand fluorine 18-labelled AV-1451, also 
known as [18F]T807, has been used to image tau in patients with 
LB diseases because of its high affinity, selectivity, and favourable 
kinetics for imaging tau. Two recent studies have shown: (1) AD pa- 
tients can be distinguished from DLB patients on the basis of signifi- 
cantly higher AV-1451 uptake, representing cortical tau, particularly 
in the medial temporal lobes [56]; and (2) patients with DLB and 


PDD manifest a spectrum of tau pathology, with cortical tau aggre- 
gates associated with cognitive impairment [52]. 


Gait and cognition in PD 


Gait disturbance in PD shares neurochemical, pathological, struc- 
tural, and genetic relationships with cognitive risk factors [51, 57]. 
It is believed that gait disturbance associated with the postural in- 
stability gait disorder (PIGD) motor PD phenotype and dementia 
is underpinned by a common neurochemical deficit in cholinergic 
function. Brain imaging highlights shared structural correlates of 
gait and cognitive impairment. Combined with evidence in older 
adults that gait changes may precede cognitive decline, these find- 
ings add validity to the role of gait as a surrogate marker of cogni- 
tive impairment. Longitudinal follow-up is required to explore the 
temporal relationship between these risk factors and their sensitivity 
and specificity. 


In recent years, there have been important advances regarding clin- 
ical characterizations, definitions, associated biomarkers, and risk 
factors for both MCI in PD and PDD. However, there is a paucity 
of effective therapies for cognitive impairment in PD, whether for 
mild symptoms or for moderate to severe dementia [58]. At present, 
only rivastigmine is U.S. Food and Drug Administration-approved 
for PDD, an indication received nearly a decade ago. Given the fre- 
quency of PD cognitive impairment and its substantial impact on 
both patients and families, the lack of available and effective treat- 
ments represents a striking gap in the field, especially when compared 
to the large number of available therapies. Improved symptomatic 
therapies, as well as potential disease-modifying agents, for PD cog- 
nitive impairment are needed. 

Treatments can be broadly divided into pharmacological and 
non-pharmacological and are summarized here. However, special 
considerations apply to the management of PDD. Firstly, dopamin- 
ergic therapy is more likely to cause hallucinations and delusions in 
Parkinsonian patients with dementia than in those with uncompli- 
cated PD. Secondly, neuroleptic therapy is more likely to cause an 
exacerbation of Parkinsonian symptoms, hence should be avoided 
unless absolutely clinically necessary. Third, on-off motor fluctu- 
ations may commonly be accompanied by mild fluctuations in cog- 
nitive state as well. 


Pharmacological treatments 


Modest symptomatic effect of the cholinesterase inhibitor 
rivastigmine for PDD has been shown, in particular for patients 
with accompanying visual hallucinations [59]. Other cholinesterase 
inhibitors, such as donepezil, have been shown to produce similar 
modest improvements in cognition and behaviour in PDD and DLB 
[60]. A naturalistic study indicated that amantadine may increase 
the time from onset of PD to dementia [61]. Several studies have 
used memantine to show benefit in patients with both PDD and 
DLB combined [62]. PD psychosis can be difficult to treat; practic- 
ally, however, removal of anti-parkinsonian medications may help 
ameliorate psychotic symptoms, particularly in the earlier stages of 
the disease, and a specific order of withdrawal has been suggested, 


beginning with anticholinergic agents through to dopamine agon- 
ists/COMT inhibitors, and then finally, if required, levodopa [45]. 
However, reductions in dopamine therapies can be challenging, 
given the need to adequately treat motor symptoms. 

Specific pharmacological interventions include the use of drugs 
for psychosis; however, as mentioned, these are more likely to cause 
an exacerbation in motor symptoms in PD patients. Dopamine ant- 
agonists, such as haloperidol, can provoke severe neuroleptic sensi- 
tivity reactions and therefore are contraindicated in this population. 
Similar reactions or worsening of parkinsonism can also be ob- 
served with most of the newer drugs such as risperidone, olanzapine, 
and aripiprazole [63], which should be avoided because of the risk. 
While there have been numerous RCTs of antipsychotics for PD 
psychosis, there is a paucity of data looking at the benefits of these 
agents in PD patients with cognitive impairment specifically. The ex- 
ception is clozapine where one major study with a positive outcome 
included some PD patients with possible dementia (mean baseline 
MMSE: placebo group, 21.7; clozapine group, 23.8) [64]. However, 
subgroup data for PDD patients were not provided, and widespread 
use of clozapine is limited due to its potential to induce agranulo- 
cytosis and the necessity for regular blood monitoring. Quetiapine 
is used most frequently for PD psychosis, but there is no evidence 
from controlled studies for its efficacy in PDD [65) Additionally, 
prolonged use of dopamine antagonists may have deleterious effects 
on cognition, and they significantly increase cerebrovascular events 
and mortality in older people with dementia in general (see [45] for 
review), so these agents should be used cautiously in PD patients 
with cognitive impairment until demonstrated not to increase mor- 
bidity and mortality in this population. 

A promising new drug for PD psychosis is pimavanserin (se- 
lective 5-HT2A inverse agonist). A recent controlled trial found a 
significant benefit of pimavanserin on all endpoints, including the 
Scale for Assessment of Positive Symptoms in Parkinson's Disease 
(SAPS-PD) score, with additional improvement in night-time sleep 
and daytime somnolence [66]. Overall pimavanserin was well toler- 
ated and did not worsen motor symptoms. 

Cholinesterase inhibitors, in particular rivastigmine, may be an 
alternative first-line treatment option, as these agents can improve 
cognition, function, and neuropsychiatric symptoms. Other drugs, 
such as memantine (NMDA antagonist) and ondansetron (5-HT3 
antagonist), have also been considered for the treatment of PD 
psychosis. Overall the therapeutic benefit of memantine remains in- 
conclusive, and from a psychosis perspective, improvements in the 
neuropsychiatric inventory (NPI) in patients with PDD have not 
been observed, although there may be some benefit in DLB patients. 


Non-pharmacological treatments 


There are no systematic studies evaluating non-pharmacological 
interventions for PD psychosis, although a small study suggested 
benefit for ECT in patients refractory to dopamine antagonists 
[67]. As mentioned, removal of anti-parkinsonian medications can 
help ameliorate psychotic symptoms, but often at the cost of motor 
control. Phased reduction, followed by withdrawal, if necessary, of 
dopamine agonist medication, in particular, is critical to the man- 
agement of ICDs. Abrupt withdrawal of dopamine agonist medica- 
tion should be avoided, due to the risk of causing dopamine agonist 
withdrawal syndrome (DAWS), an unpleasant syndrome charac- 
terized by agitation, motor restlessness, worsening of symptoms, 
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dysphoria, and anxiety. Similarily, abrupt withdrawal of levodopa 
or any dopaminergic medication can cause neuroleptic malignant 
syndrome and should always be contemplated with the full support 
and involvement of a PD physician. 

Non-pharmacological interventions for PDD include several 
ongoing studies of deep brain stimulation (DBS) surgery in PD, 
targeting the bilateral nucleus basalis of Meynert, a cholinergic- 
innervated basal forebrain site involved in attention, learning, 
and memory processes, which is impaired in dementia. Non- 
pharmacological strategies for treating PD cognitive impairment 
represent an area of growing interest and include cognitive training, 
physical exercise and physical therapy, music and art therapy, and 
non-invasive brain stimulation techniques (see [58] for review). To 
date, many studies are open-label pilot studies; though there are sev- 
eral small RCTs, ‘double blinding’ of study personnel and patients 
in these types of interventions can be challenging. There is great 
heterogeneity in study methodologies [for example, different types 
of cognitive tasks and means of assessments (computerized inter- 
ventions, neuropsychological tests, and word games, along with 
the duration of study and practice), physical exercises and methods 
(aerobic, dance, strength, and so on), and cognitive targets (atten- 
tion, executive function, memory, and so on)]. 

Studies in PD have generally focused on cognitively intact or non- 
demented, but mildly cognitively impaired, PD patients, rather than 
those with PDD. Cognitive therapies include cognitive training ex- 
ercises, computerized brain training, and non-physical leisure ac- 
tivities which have the potential to improve cognitive outcomes and 
IADLs (see [58] for review). Physical exercise and activity have re- 
ported benefits on motor PD symptoms, while studies investigating 
their effects on cognition are growing. Beneficial effects of combined 
physical activity and cognitive training therapies may be potentially 
additive [68, 58 ]. 


Prognosis and future perspectives 


The cumulative incidence of PDD from longitudinal studies of in- 
cident PD cohorts is remarkably consistent, suggesting that around 
half will develop dementia within 10 years from diagnosis. The onset 
of PDD is insidious, with a similar reported mean annual decline on 
the MMSE to that seen in AD and DLB [35]. However, disease pro- 
gression varies considerably, with some patients declining rapidly, 
while others have a more benign course. The determinants under- 
pinning this variability are poorly understood. Post-mortem studies 
showed that concurrent AD neuropathology is associated with a 
more rapid cognitive decline in LBD patients, shorter time between 
parkinsonism and dementia onset, and a worse prognosis. There also 
appears to be a synergistic relationship between tau, a-synuclein, 
and amyloid pathology (see earlier). However, it is still unclear 
whether in vivo surrogate biomarkers (amyloid/tau imaging, CSF 
protein quantification) can predict LBD endophenotype in terms of 
disease progression. Response to current symptomatic treatments is 
also variable in PD, and a better understanding of the mechanisms of 
response and non-response to these agents will be key in informing 
who to target with a new range of emerging symptomatic drugs. 
There is major pharma investment into disease-modifying treat- 
ments in early AD targeting amyloid and tau, with recent data, ex- 
citingly, suggesting benefit. Aggregation of these brain proteins also 
occur in LBD and may have deleterious effects, as well as synergis- 
tically promote a-synuclein aggregation (the core aggregate in LB 
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disease). Developing relevant stratification of biomarkers could there- 
fore mean that any effective near-future anti-amyloid/tau treatments 
could also be offered to selected PDD and DLB patients. Furthermore, 
anti-a-synuclein therapies, protein degradation enhancers, and mito- 
chondrial stabilizers are all in development, and stratification in PDD 
would be highly apposite to de-risking trials of these agents. 

Lastly, the earlier any intervention, the more effective it is likely 
to be. Therefore, intervention at the PD-MCTI stage, to prevent the 
inexorable progression to PDD, must be a key focus of future treat- 
ments in PD if they are to be effective. Biomarkers, in particular, the 
emerging imaging biomarkers outlined, could be especially useful to 
identify those PD-MCI patients at high risk of developing dementia 
in the short to mid term. 
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Huntington’s disease (HD) was first described in 1872 by George 
Huntington, an American physician living on Long Island, New York. 
His father and grandfather practised medicine in the same commu- 
nity, so that he had access to case notes from several generations of 
families who lived there. This long period of record-keeping allowed 
him to document a hereditary form of chorea, similar to ‘common 
(Sydenham’s) chorea, but progressing over many years to death. Its suf- 
ferers had a tendency to insanity and suicide. Huntington’s brief essay, 
which also included a clear description of autosomal dominant inher- 
itance, remains one of the classic descriptions ofa medical disorder [1]. 














HD is an inherited neuropsychiatric disorder prominently affecting 
the striatum and its direct connections. It is characterized by a triad 
of clinical features that are common to diseases of this region: a 
non-aphasic dementia, depression and other disorders of affect, and 
a variety of dyskinesias, most typically chorea [2, 3]. Chorea, from 
the Greek word for ‘dance’, describes involuntary, non-stereotyped 
jerky movements. The illness, insidious in onset, may begin with all 
or any one of these three features. Patients who present initially to 
psychiatrists usually have dementia, personality changes (such as 
apathy, irritability, or loss of temper), or depression, often with sui- 
cidal thoughts or attempts. Symptoms may appear at any time from 
early childhood to old age, most frequently between 35 and 45 years 
of age. Once the illness begins, sufferers gradually deteriorate over 
many years in their cognitive and motor functioning and end in a 
persistent vegetative state, with almost complete loss of voluntary 
motor function. Death occurs after about 15-20 years and is usu- 
ally caused by inanition or aspiration pneumonia. Some patients die 
earlier from suicide or from injuries such as subdural haematomas 
caused by a fall. Patients with early onset seem to progress more rap- 
idly than those whose symptoms begin later in life. 











The earliest visible neuropathology is in the striosomes of the 
caudate/putamen [4], followed by a dorsal-to-ventral progressive 





loss of almost all striatal output neurons. The deep layers of mul- 
tiple cortical regions are also prominently affected, and there can 
also be milder neuronal loss in some brainstem nuclei. Protein ag- 
gregates, most easily detectable in neuronal nuclei, are prominent. 
Neuroimaging studies have shown that neuropathological changes 
typically begin before the onset of clinically detectable disease. In 
particular, the extent of striatal loss in pre-symptomatic individuals, 
as measured by MRI, correlates with the predicted time until dis- 
ease onset [5]. Cortical thinning [6] and white matter loss and dis- 
organization [7, 8] have also been detected in pre-symptomatic gene 
carriers. Subtle changes possibly related to abnormal brain develop- 
ment have also been reported. 

The prevalence of HD ranges from about 6 to 14 cases per 100,000 
population in North America, Western Europe, and Australia, with 
much lower prevalence in Asia [9]. HD is caused by the expansion 
of an unstable triplet repeat sequence (CAG) in the first exon of a 
gene near the telomere of chromosome 4p [10]. It is transmitted 
as an autosomal dominant trait; if one parent carries the mutation, 
each offspring (regardless of sex) has an independent 50% chance 
of inheriting the abnormal gene. Normal repeat lengths range from 
about 7 to 28 triplets. Individuals with 29-35 triplets will not de- 
velop HD (with possible rare exceptions) but may pass an expanded 
allele to an offspring, while individuals with 40 or more triplets 
will develop HD. Repeat lengths of 36-39 triplets may or may not 
cause disease. The rate of mutation from a normal-length allele 
to an expanded one is low, so that most patients have an affected 
parent. Family history, however, can be obscured by multiple factors, 
including misdiagnosis of the parent, death of the parent before dis- 
ease onset, adoption, and incorrect assignment of paternity. The re- 
peat length does not remain stable at meiosis. In HD, the number of 
CAG triplets is more likely to increase when the gene is transmitted 
by fathers. As the number of repeats increases, the age at onset is 
earlier. Thus, paternal transmission is often associated with ‘antici- 
pation, earlier onset in the subsequent generation; most individuals 
with childhood onset have affected fathers [11]. 

The pathogenesis of HD is not well understood but appears to be 
multi-faceted [12]. The gene huntingtin, with the expanded repeat, 
is expressed as the protein huntingtin. The CAG repeat expansion 
is translated as an expanded polyglutamine tract, which appears to 
have neurotoxic properties. The region of the huntingtin protein 
with the polyglutamine tract may be cleaved from the rest of the pro- 
tein and adopt an abnormal configuration, or it may be abnormally 


modified by post-translational processes such as phosphorylation. 
These changes, in turn, are thought to lead to disruption of cellular 
functions, including transcriptional machinery, protein degrad- 
ation processes, metabolism, and cellular transport. Other proposed 
pathogenic mechanisms include toxicity derived from huntingtin 
RNA transcripts containing the expanded CAG repeat, atypical 
translation of the huntingtin gene leading to proteins with long 
stretches of other amino acids, and loss of the normal function of 
the huntingtin protein. 


The clinical diagnosis of HD remains dependent on a thorough 
psychiatric history, including a detailed family history and history 
of changes in social adjustment, mental state examination, cogni- 
tive examination, and neurological examination. The features vary, 
depending on how long the patient has been ill [13]. Once the disease 
is suspected on clinical grounds, genetic testing, available through 
many commercial laboratories, provides the definitive diagnosis. 
A number of large completed and ongoing longitudinal studies of 
HD, including PREDICT, PHAROS, COHORT, REGISTRY, and 
Enroll-HD, have greatly enhanced knowledge of the signs, symp- 
toms, and course of HD. 


Diagnosis of patients with early symptoms 





Patients with HD who initially consult psychiatrists may present 
with a variety of psychiatric syndromes, including depression, bi- 
polar disorder, obsessive-compulsive disorder, schizophrenia, or 
excessive anxiety. Irritability or apathy may be a manifestation of one 
of these syndromes or may appear separately. The psychiatric syn- 
dromes seen in HD are clinically indistinguishable from idiopathic 
disorders, and in perhaps 20% of cases, neurological signs may not 
be present. Suicide is a risk in this prodromal phase, even if the pa- 
tient is unaware of their risk for HD [14]. Presenting symptoms and 
problems with functioning at work or at home must often be elicited 
from an informant; the patient may minimize them through em- 
barrassment or fear, or even be unaware of them. Common changes 
include decline in work speed or accuracy, which may result in de- 
motion or warnings from superiors, a tendency to become irritated 
or physically aggressive in response to annoying stimuli that would 
not have elicited such a response in the past, and a decreased interest 
in activities. Most of these symptoms and behaviours are common 
in psychiatric disorders, but cognitive inefficiency and irritability 
may seem disproportionately extreme relative to the patient's other 
symptoms. On cognitive examination, the patient may have diffi- 
culty recalling dates of important life events and more difficulty than 
expected with ‘serial sevens. Cognitive changes are often easier to 
notice after the psychiatric disorder is treated, which can usually be 
accomplished using standard medications. However, unlike the typ- 
ical response to treatment of idiopathic disorders, cognitive ineffi- 
ciency and difficulties at work, apathy (if present), and sometimes 
irritability remain, even after the HD patient’s mood, energy, and 
sleep patterns have improved. 

On neurological examination, motor restlessness is usually pre- 
sent but is easily misinterpreted as a manifestation of anxiety. Motor 
signs may be subtle: slightly slow saccadic eye movements [15], 
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writhing movements of the protruded tongue or of the fingertips 
when the arms are held at 90°, or mild dysdiadochokinesia. 

Diagnosis can be further complicated by the apparent lack of a 
family history of HD. The family may not have been informed about 
the affected parent’s diagnosis or may know only that a parent died 
in a psychiatric institution or committed suicide. In other cases, the 
paternity is uncertain. If the family history is actually negative (this 
is quite uncommon) or unobtainable (often the case for adopted in- 
dividuals who frequently present in childhood), the diagnosis may 
be confirmed by testing for the HD gene expansion. 

HD with onset in childhood or early adolescence [16] most often 
presents with cognitive and behavioural features, including speech 
and language problems, a decline in school performance, deterior- 
ation of handwriting, and loss of interest in school and social activ- 
ities [17]. These non-motoric disturbances may be the only clinical 
features for several years before motor impairment begins. The motor 
impairment in a majority of juvenile-onset patients will include prom- 
inent parkinsonism, bradykinesia, very slow saccades, lead pipe or 
cogwheel rigidity, and dystonia, with chorea less prominent or even 
absent. However, many affected juveniles have motor signs similar to 
typical HD. Myoclonus and epilepsy are observed in as many as 50% 
of early-onset HD, and some children develop a coarse tremor. 

Even though it can be difficult, it is important to make the diag- 
nosis of HD as early as possible, particularly in employed persons. 
Poor function at work (or in schoolwork or household duties) oc- 
curs early, and patients can lose their jobs or support of their family, 
often on suspicion of drug or alcohol abuse or of indifference to the 
work or home environment. This is usually avoided if the diagnosis 
is made known to the family and employer, allowing modification 
of the work environment or retirement on the basis of disability and 
family education about the illness. Prompt diagnosis does not al- 
ways mean that the patient needs to be immediately informed of the 
diagnosis. Occasionally, patients are too depressed to do this safely; 
others indicate that they do not wish to be told. Treatment can usu- 
ally proceed despite the patient’s reluctance to label the disorder. 


Diagnosis of patients with well-established signs 





and symptoms 


After a few years of illness, diagnosis is easier. The signs and symp- 
toms will have worsened, and usually the motor disorder is obvious. 
A typical patient who has been ill for about 5-7 years is unable to 
work or manage finances but lives at home and is able to manage 
personal needs. Some patients remain active and energetic, con- 
tinuing to participate as fully in life as their cognitive and motor 
disabilities allow; others are apathetic most of the time, but irrit- 
able when disturbed; still others may have a depressive syndrome, at 
times complicated by delusions, obsessions, or compulsions. Many 
patients are anxious and easily upset by changes of routine. An un- 
common, but very troublesome, feature of HD is sexual abnormality. 
While most patients become impotent or uninterested in sex, a few 
are hypersexual and may develop paraphilias [18]. It is important to 
inquire about these specifically because neither the patient nor his or 
her spouse will likely mention problems. 

Cognitively, patients complain of forgetfulness and distractability. 
Thinking is slow; patients have difficulty following a conversation 
and cannot complete a multistaged task. On cognitive examination, 
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Mini-Mental State Examination scores [19] may still be above the 
cut-off score of 23 for dementia, but serial sevens will be very poor, 
and one or two items will be missed on recalling words after a dis- 
traction. On neuropsychological testing, the IQ will be lower than 
expected for education, and there will be difficulty learning word 
lists and performing tests that require changing sets. 

Most patients will have obvious involuntary choreic movements, 
as well as difficulty with control of voluntary motor movements, as 
seen by clumsiness, slowness, dysarthria, and an unsteady gait. The 
involuntary movements will wax and wane with the level of arousal; 
it can be worsened by performing serial sevens or by fine motor 
tasks. Speech will have an irregular staccato, often laboured, quality. 
Saccadic eye movements will be slow or irregular, and the patient 
will be obviously clumsy on tests of dysdiadochokinesia and rapid 
movements such as finger-thumb tapping, although finger-to-nose 
testing is normal. Gait will be wide-based and irregular, with diffi- 
culty with tandem walking. Reflexes are usually brisk, and a history 
of falls can be elicited. 





Diagnosis of patients with advanced disease 


After 10 years of illness, dementia is more severe, with poor perform- 
ance on all aspects of the cognitive examination, except naming. 
Speech is dysfluent, with long lapses between the examiner's question 
and the patient’s reply, rather like Broca’s (expressive) aphasia. Some 
patients will be almost unable to speak, although language compre- 
hension is relatively preserved. Patients (ifthey are co-operative) can 
carry out simple commands and will recognize relatives and nursing 
staff. Patients may be irritable, particularly when their verbal re- 
quests cannot be understood or routines altered. Psychiatric syn- 
dromes are more difficult to discern, but most can be diagnosed by 
observing behaviour such as hoarding, sleeplessness, or diurnal vari- 
ation in mood. Physical disabilities are much worse. Patients often 
need to be fed, toileted, and helped with most daily needs. They have 
difficulty walking and may fall, causing further disability through 
broken limbs or subdural haematomas. Chorea often stabilizes or 
subsides [13], but the ability to carry out voluntary movements be- 
comes seriously disabling. If they survive long enough, patients be- 
come unable to initiate speech or to walk, swallow only with great 
difficulty, and have such severely rigid muscle tone that they may be 
nearly unable to move their bodies. Clonus and positive Babinski 
signs are present. Patients in this sort of ‘persistent vegetative state’ 
[20] are difficult to distinguish from individuals in the late stages of 
other movement disorders or dementias; as in early disease, diag- 
nosis will depend on eliciting a family history or genetic testing. 





Differential diagnosis 


The differential diagnosis of HD is extensive [3], but only a few of the 
disorders for which it can be mistaken are common [2]. These include 
other dementias, other movement disorders, and other psychiatric 
disorders. The most common subcortical dementia is Parkinson’ 
disease (PD), in which motor slowness resembles that of HD, but the 
characteristic pill-rolling tremor and festinating gait of PD are rare 
in HD. The dementia associated with late-life depression can look 
very similar to HD, including motor slowness. Alzheimer’s disease 


is easily distinguished by the lack of motor signs during the first 
several years of illness and more prominent difficulty with memory 
and language, as opposed to attention and calculation. Perhaps most 
difficult to distinguish clinically are the fronto-temporal dementias, 
which present with prominent behavioural disturbances and a posi- 
tive family history. The clinical presentation may be insufficient 
to distinguish these various dementias in patients with advanced 
disease, since they may all progress to a persistent vegetative state. 
The family history and duration of illness (which is longer for HD 
than for Alzheimer’s disease or fronto-temporal dementia) can be 
helpful. The C9ORF72 expansion mutation, a common cause of 
amyotrophic lateral sclerosis and fronto-temporal dementia, can 
also lead to a HD-like phenotype. 

Several less common diseases classified as movement disorders may 
closely resemble HD. They often have an autosomal dominant in- 
heritance pattern, and some, like HD, are caused by expansion of un- 
stable triplet repeat sequences. Examples include Fahr’s syndrome 
(calcification of the striatum), some forms of spinocerebellar degen- 
eration and benign familial chorea, neuroacanthocytosis, HD-like 2 
(HDL2), and dentatorubropallidoluysian atrophy (DRPLA). These 
disorders, while much rarer than HD (except for DRPLA in Japan), 
can so closely resemble HD that they can only be distinguished by 
genetic testing. 

The most common movement disorder that resembles HD is tar- 
dive dyskinesia. Patients with HD occasionally have several years of 
a schizophrenia syndrome before the movement disorder begins. If 
they have been treated with dopamine antagonist medicines, the sub- 
sequent onset of involuntary movements can be mistaken for tardive 
dyskinesia. On the other hand, the choreoathetotic involuntary move- 
ments of severe tardive dyskinesia, which may involve the trunk and 
extremities, as well as the face, may be mistaken for HD. Usually, it is 
possible to distinguish patients with tardive dyskinesia by their normal 
saccadic eye movements, normal tandem gait, and fluid and fluent 
speech [21]. However, genetic testing may be necessary in some cases. 
Wilson's disease also presents with the subcortical triad and should be 
considered when neither parent is affected. It is recessively inherited, 
so that the only affected relatives are siblings. Very late-onset HD may 
be diagnosed as ‘senile chorea’ because the family history appears to 
be negative. Family members will also present with symptoms only 
late in life and may have died before their manifestation. 

The differential diagnosis of nearly all psychiatric disorders in- 
cludes HD, as described. 





Treatment and management 


Currently, there is no treatment that influences the fundamental 
course of illness of HD, though much work is now directed in 
that direction (see Future of HD research and treatment, p. 452). 
However, it is possible to alleviate some of the symptoms of HD 
through judicious use of pharmacological interventions [22]. 
Chorea can be diminished with tetrabenazine, low doses of dopa- 
mine antagonist medicines (though probably with less effect in 
advanced disease), and occasionally benzodiazepines, though 
risks and benefits with all of these treatments must be carefully 
weighed. The recently available modified versions of tetrabenazine, 
deutetrabenazine, and valbenazine may prove more convenient 
to administer or less of a risk for depression than tetrabenazine 


itself [23]. Given the potential side effects of haloperidol, particu- 
larly at doses of more than 5 mg, newer dopamine antagonist medi- 
cines are now increasingly recommended for chorea suppression if 
tetrabenazine or its derivatives are contraindicated or ineffective. 
Dystonia may be treated with botulinum toxin injections or benzodi- 
azepines, and there is evidence that deep brain stimulation may prove 
helpful in some cases [24]. Muscle rigidity and consequent contrac- 
tions occur in late HD, causing pain and difficulty in positioning the 
patient to avoid pressure sores. Amantadine (which also has a positive 
effect on mood) can somewhat decrease the rigidity; chairs and beds 
must be padded and tailored to each patient's specific needs. In the 
setting of falls or unexpected worsening of movement or cognition, 
clinicians need to consider the possibility of a subdural haematoma. 
Psychiatric manifestations are present to some extent in a vast 
majority of HD patients and may become severe in more than one- 
third. Appropriate pharmacological and non-pharmacological 
treatment has the potential to significantly improve the quality of life 
for HD patients and their families, and may help decrease the high 
rate of suicide observed in HD. Clinical experience suggests that 
depression, anxiety, and obsessive-compulsive disorder associated 
with HD usually respond to pharmacological treatments used for 
the similar idiopathic disorders, though there have been few system- 
atic treatment trials of psychiatric syndromes in HD [25]. Because 
some patients are unaware of their depressed mood (just as they can 
be unaware of their involuntary movements), an informant is often 
needed to elicit the symptoms and monitor response to treatment. 
It is also important to distinguish depression (from which the pa- 
tient is miserable and sleepless) and apathy, which does not cause 
distress. Occasionally, mood and anxiety disorders are chronic and 
unresponsive to treatment. Severe, unresponsive depression can be 
treated successfully with electroconvulsive therapy [26]. Bipolar 
disorder in patients with HD may not respond to lithium but may 
improve with carbamazepine or valproic acid. In addition, lithium 
is difficult to administer because of the risk of lithium toxicity re- 
lated to insufficient fluid intake. Valproic acid, serotonin-specific 
reuptake inhibitors, and low-dose dopamine antagonist agents may 
also be helpful in the treatment of irritability. In one case report, 
high doses of sertraline were effective for intractable aggression [27]. 
Schizophrenic-like syndromes, while not particularly common, can 
be difficult to treat but may respond to a standard dopamine an- 
tagonist medicine, with a preference towards newer agents with less 
D, receptor affinity. Clozapine remains an option in patients not 
responding to other drugs. Apathy, when not part of a depression 
syndrome, does not respond well to pharmacological interventions. 
As with most dementias, psychopathology influences, and is in- 
fluenced by, the patient’s environment. Patients do best in a calm, 
highly predictable environment where cognitive expectations are 
not too complicated. When the environment is too taxing, patients 
become irritable, especially towards their family. HD seriously dam- 
ages family relationships, which, in turn, affects the patient. The well 
spouse becomes responsible for supporting the family, caring for the 
children and the patient, and making family and financial decisions. 
Spouses’ lives are further complicated by patients’ unwillingness to 
relinquish financial and family decision-making; patients usually 
make poor decisions that can damage family relationships and fi- 
nances. Some patients neglect their children or treat them badly. If 
the other parent cannot prevent this, it is wisest to remove the pa- 
tient from the home. There is no research on the treatment of sexual 
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aggression, which occasionally occurs in males, but a few men 
have been successfully treated with depot anti-androgen agents. 
Supportive psychotherapy for the patient should focus on minim- 
izing demoralization at lost abilities. Spouses can be helped with 
reorganizing family life to maximize predictability of the patient's 
environment, diplomatically decreasing the patient's domestic re- 
sponsibilities, and assuring that the spouse has time away from the 
patient. 


Helping persons at risk for HD 


People at risk for HD vary in their abilities to deal with the burden 
of uncertainty, depending on their personal attributes and their 
experience with the illness in a relative. A few consult physicians 
for reassurance, but most avoid doctors until they become ill, and 
even then many resist medical attention, claiming against all evi- 
dence that they are perfectly well. Currently, a minority of asymp- 
tomatic persons at risk for HD decide to have genetic testing, but 
these individuals, skewed towards those whose anxiety is lessened 
by planning for the future, have usually handled the test results well, 
regardless of whether positive or negative [28]. As more clinical 
trials are launched for individuals with the HD mutation who are 
without detectable symptoms, the incentive for pre-symptomatic 
testing will likely increase, with a concomitant change in the nature 
of individuals seeking testing. Fetal and pre-implantation genetic 
testing is now available in some centres, each with its own set of po- 
tentially complicated ethical and practical issues that must be sorted 
out prior to testing. 

Pre-symptomatic genetic testing [29] of any sort should always 
be preceded by genetic counselling, provided either by a genetic 
counsellor or by a clinician familiar with HD genetics and the po- 
tential practical and psychological consequences of both positive 
and negative test results. Counselling should include a discussion 
of the motivations for seeking testing, which may include decisions 
about childbearing, education, employment, finances, participation 
in clinical trials, or the potential at-risk status of offspring. Many 
individuals who come for testing have not seriously considered the 
possibility that they will test positive for the mutation, so that role- 
playing about various outcome scenarios is important. Occasionally, 
persons request testing who have learnt only recently that they are 
at risk for HD. Others apply who are depressed or under unusual 
stress for other reasons. Such persons should be encouraged to delay 
testing until their situation becomes more settled. Finally, some 
people who request testing already have symptoms of HD, yet do 
not wish to have a diagnosis. Considerable care is required to decide 
how best to support such individuals, and family members or close 
friends of the person should be consulted. 


Future of HD research and treatment 





While efforts to explore the course of HD and to improve symp- 
tomatic HD treatment continue, the focus of HD research is 
moving towards finding treatments that can stop, slow, or prevent 
the development and progression of the neuronal dysfunction 
and neurodegeneration that underlie the disease. New tools for 
modelling HD, including a variety of animal models and induced 
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pluripotent stem cells, should facilitate preclinical work [30]. HD 
investigators are also attempting to develop increasingly sensitive 
biomarkers, measurable correlates of disease progression that can 
facilitate the detection of efficacious therapies in individuals who do 
not have clinically manifest HD and that can lead to clinical study 
designs that require fewer subjects and a shorter duration of treat- 
ment [31, 32]. Recent studies have demonstrated that some agents 
thought to be of general neuroprotective value, such as co-enzyme 
Q10, likely lack significant clinical efficacy in HD [33], while other 
protective agents still under investigation, such as PBT2, may have 
benefit [34]. Attention and funding have increasingly emphasized 
the development of methods for suppressing the expression of the 
mutant HD allele [35], with antisense oligonucleotide strategies 
just reaching the stage of clinical trials. The potential power of such 
therapeutic approaches, or even more sophisticated methods such as 
gene editing to remove the mutation from the genome, has generated 
considerable optimism among HD investigators and HD families. 
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Vascular cognitive impairment (VCI), known as vascular dementia 
(VasD) in its more severe form, is cognitive impairment due to path- 
ology of blood vessels in the brain. This pathology encompasses 
both ischaemic and haemorrhagic disease; however, most literature 
focuses on ischaemic disease, as this is the far more prevalent form. 

Various sets of diagnostic criteria for VCI and VasD exist, but all 
share some factors in common—the patient must have evidence 
of cognitive impairment, evidence of cerebrovascular disease, and 
evidence of an association between the two [1]. The Diagnostic and 
Statistical Manual of Mental Disorders, fifth edition (DSM-5) out- 
lines criteria for major and mild vascular neurocognitive disorder, 
the term DSM-5 uses for VCI [2] (Box 46.1). 

Aggressive modification of vascular risk factors has a greater 
potential to modify the course of VCI than the progression of 
neurodegenerative disorders such as Alzheimer’s disease (AD). This 
makes an early and accurate diagnosis of VCI critically important. 











VasD is the second leading cause of dementia in the United States, 
affecting 1.2-4.2% of people over the age of 65 years and repre- 
senting 15-20% of dementia cases. The prevalence of VasD increases 
with age [1]. It is unclear whether the prevalence of VasD differs by 
sex [1]. Currently, there are no reliable statistics on the prevalence of 
VCI not rising to the level of dementia. 

Encouragingly, the rate of VasD appears to be falling. Recent data 
from the Framingham Heart Study found that the overall incidence 
of dementia has declined over the past 40 years, falling by 20% every 
10 years [3]. The rate of VasD has fallen faster than that of AD [3]. 
The risk of developing dementia after a stroke has also decreased, 
from nine times higher to less than twice higher than that in in- 
dividuals without a stroke [3]. While some evidence suggests that 
improved control of vascular risk factors may have contributed to 
the decline in dementia, this does not account for all the reduction 
seen [3]. 

Individuals with stroke are at higher risk for VCI than the general 
population. Following a stroke, 6-32% of patients develop dementia 
of some type [4]. Even small strokes substantially increase the risk of 


cular cognitive impairment 





cognitive impairment. One study of patients with lacunar ischaemic 
CVA, without large-vessel strokes, found that 47% had mild cogni- 
tive impairment (MCI), making MCI more common than physical 
disability [5]. 











Most classifications divide VCI into three subtypes: multi-infarct 
VCI, strategic infarct VCI, and small-vessel VCI [4]. Some au- 
thors refer to small-vessel VCI as subcortical VCI or Binswanger 
disease [4]. 

Patients with multi-infarct VCI may show the classic ‘stepwise de- 
cline’ pattern, in which each stroke adds a new cognitive burden, 
with periods of relative cognitive stability in between strokes. 
However, the other forms of VCI typically follow a different course. 

In strategic infarct VCI, a single stroke affects a region critical 
for cognitive functioning and produces VCI after the single event. 
Classic strategic infarct locations include the thalamus, mesial 
frontal lobe, caudate, mesial temporal lobe, left angular gyrus, and 
genu of the left internal capsule [1]. However, strokes in other loca- 
tions can also produce strategic infarct VCI, and due to individual 
variations in brain anatomy and connectivity, clinical findings may 
not always correlate with classic syndromes. 

Notably, patients can show progressive cognitive decline fol- 
lowing a single stroke; this may be due to disruption of brain net- 
works resulting in ongoing worsening, though this hypothesis 
still lacks definitive proof [4]. In any case, this finding indicates a 
complex relationship between stroke and cognitive decline, which 
goes beyond the impact of neuronal demise resulting from stroke. 
However, reducing rates of stroke likely constitutes a very effective 
way to reduce rates of VasD. 

In small-vessel VCI, patients typically show gradually progressive 
cognitive decline. They may never experience a ‘clinical’ stroke that 
produces an obvious and immediate change in neurologic function. 
However, physical examination may reveal abnormal neurologic 
findings such as hyperreflexia, Parkinsonism, and gait disturbance 
[6]. Small-vessel VCI can occur due to lacunar strokes or ischaemic 
demyelination [7]. 

As individuals with cerebrovascular disease commonly have 
involvement of both small and large vessels, patients can have 


Box 46.1 Diagnostic criteria for vascular 





neurocognitive disorder 


A. Cognitive impairment 
e Major vascular neurocognitive disorder: 
— Significant decline in one or more cognitive domains, based on 
clinical history and/or impaired performance on cognitive testing 
— Impaired functioning in everyday activities 
e Mild vascular neurocognitive disorder: 
— Modest decline in one or more cognitive domains, based on clin- 
ical history and/or impaired performance on cognitive testing 
— Independent functioning in everyday activities (though may require 
more effort than previously) 


B. Evidence of cerebrovascular disease 





e Supported by history, physical examination, and/or neuroimaging 


C. Clinical features consistent with a vascular aetiology 

e Onset of cognitive deficits is temporally related to a cerebrovascular 
event 
Or 

e Pattern of cognitive deficits is consistent with a vascular aetiology 
(that is, decline in complex attention or frontal executive functions) 

D. Symptoms not better explained by another disorder 

e Criteria A, B, C, and D must be met for diagnosis 


Source: data from the Diagnostic and Statistical Manual of Mental Disorders, Fifth 
Edition, DSM-5, Copyright (2013), American Psychiatric Association. 











combinations of multi-infarct, strategic infarct, and small-vessel 
VCI. One study found that lacunar infarcts represent the most 
common type of stroke seen in VasD [8]. 

Some studies have found that 70% of individuals with VasD also 
have comorbid AD-type pathology, a condition referred to as mixed 
dementia [9]. Most VCI is sporadic, but familial forms, such as VCI 
due to cerebral autosomal dominant arteriopathy with subcortical 
infarcts and leukoencephalopathy (CADASIL), also exist [4]. 


Overlap between VCI and Alzheimer’s disease 





Although VCI can occur in isolation, cerebrovascular disease and 
AD can co-occur and synergistically worsen each other. 

One large community-based study found that, among individ- 
uals with either VasD, AD, or both, 13% had only VasD, 54% had 
only AD, and 33% had both [9] (Fig. 46.1). Notably, 83% of these 
patients with VasD and/or mixed VasD-AD had no clinical history 
of stroke and likely would have received an incorrect diagnosis if 
MRI had not been performed, underscoring the critical import- 
ance of neuroimaging for the correct determination of the dementia 
type [9]. 

In post-mortem studies, many individuals with dementia show a 
mix of cerebrovascular and AD neuropathology. Amyloid plaques, 
neurofibrillary tangles, cerebral amyloid angiopathy (CAA), and 
white matter lesions are found in both disorders [8]. About 10% 
of individuals with pure VasD show evidence of CAA, and CAA 
could contribute to further damage of blood vessels and progres- 
sion of cerebrovascular disease, large intracerebral hemorrhages, 
and microbleeds [1, 8]. In AD, amyloid-B builds up around blood 
vessels in the brain, causing impairment of vascular functioning [1]. 
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Mixed VasD and AD: 





Fig. 46.1 Overlap between VasD and AD. 


Almost 30% of individuals with VasD have hippocampal sclerosis, 
possibly related to hippocampal neurons being especially vulnerable 
to ischaemia [8]. Both vascular disease and AD-type pathology are 
associated with hippocampal atrophy [1]. 

Recent imaging advances—namely, the development of Pittsburgh 
compound B (PIB), a radiotracer used with PET—allow for the non- 
invasive assessment of amyloid-6 burden. This technique permits a 
more accurate classification of pure VasD (VasD without evidence 
of amyloid-B burden) vs mixed VasD-AD (VasD with excessive 
amyloid-B burden) and has led to new insights into how these two 
subcategories of VasD resemble and diverge from each other. 

Some studies have observed different patterns of MRI findings 
in individuals with VasD with and without concomitant amyloid-f 
burden. For instance, among patients with small-vessel VasD, those 
who are amyloid-positive may have smaller hippocampal volumes 
and those who are amyloid-negative may have more lacunar infarcts 
[10, 11]. One study found that individuals with small-vessel VasD, 
without evidence of amyloid burden, showed damage to the white 
matter throughout the entire cerebrum, as opposed to the regional 
pattern seen in AD without vascular disease [12]. 

Vascular risk factors not only increase the risk of VasD, but also 
the risk of AD. Hypertension, diabetes mellitus (DM), metabolic 
syndrome, coronary artery disease (CAD), smoking, atrial fibrilla- 
tion, atherosclerosis, and obesity are all associated with an increased 
risk for AD, and having a stroke may double the risk for AD [4]. 

Vascular disease also worsens the severity of neurodegenerative 
dementia. One neuropathologic study found that, among indi- 
viduals with AD or Parkinsonian neurodegenerative diseases, 
those who also had cerebrovascular disease required less primary 
neurodegenerative pathology (neurofibrillary tangles or Lewy 
bodies) to reach the same severity of clinical dementia as individ- 
uals with neurodegeneration only [13]. As cerebrovascular disease 
is common in individuals with neurodegenerative dementia, these 
findings suggest that tight control of vascular risk factors could po- 
tentially help in these conditions as well [13]. 

Conversely, neurodegenerative pathology can worsen VCI. One 
cross-sectional study of MCI due to subcortical vascular disease 
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found that individuals with excessive amyloid- burden showed 
more cognitive impairment than those with only ischaemic vascular 
pathology [14]. Another prospective longitudinal study of small- 
vessel VasD found that patients positive for amyloid showed faster 
cognitive decline over a 3-year follow-up period [15]. 


Clinical features 


In general, the pattern of cognitive impairment varies with the loca- 
tion of the vascular lesion or lesions. Although VCI has been clas- 
sically associated with frontal executive dysfunction, other patterns 
of impairment also occur [1]. For example, while a patient with sub- 
cortical ischaemic white matter disease may show predominant im- 
pairment in frontal executive functions, a patient with a strategic 
infarct in the left middle cerebral artery (MCA) territory may show 
aphasia and/or apraxia. Memory impairment in VCI may occur due 
to stroke affecting the hippocampus or to the concomitant presence 
of AD-type pathology. Although lacunar strokes are often associated 
with impairments in frontal executive functions, one study found 
that episodic memory impairment was just as common in this stroke 
type [5]. 

Depending on the location of their strokes, individuals with 
VCI can also show deficits in elementary neurologic function such 
as hemiparesis, visual field cuts, urinary incontinence, and gait 
disturbance. They may also show other neurologic signs such as 
hyperreflexia and Parkinsonism. 

Individuals with VCI also frequently experience neuropsychiatric 
symptoms. Up to 95% of patients with VCI may have at least one 
psychiatric or behaviour symptom [16]. Depression and apathy rep- 
resent the most common symptoms, with depression present in 50- 
75% and apathy in one-third to two-thirds of individuals with VCI 
[16-18]. Apathy becomes more common with increasing severity of 
cognitive impairment [16, 17, 19]. Irritability, anxiety, and agitation 
each occur in about half of patients, disinhibition in about 10-30%, 
and psychotic symptoms in 20-30% [16, 17]. 

Individuals with VasD show poorer sleep than those with AD, and 
one cross-sectional study of subcortical VasD found that patients 
with more white matter hyperintensities (WMHs) experienced 
worse sleep [20]. 


The diagnosis of VCI is based on history, neurological examination, 
cognitive testing, and neuroimaging. Neuroimaging can reveal evi- 
dence of cerebrovascular disease and ‘asymptomatic ischaemic or 
haemorrhagic damage, even in the absence of a clinical history of 
stroke. 

Clinicians should be aware that patients who have never had a 
clinical stroke can still have VCI, as some forms of VCI (such as 
small-vessel VCI) do not necessarily cause any obvious abrupt 
neurologic changes. 


Neuroimaging 


In recognition of the variety of international practice settings, most 
criteria sets require either clinical (clinical history of stroke and/ 
or findings of focal neurological deficits consistent with stroke on 


neurological examination) or radiologic evidence of cerebrovascular 
disease. 

However, in a country with ready availability of neuroimaging, 
clinicians should certainly obtain it to confirm evidence of cerebro- 
vascular disease. Even if the history and examination are compatible 
with stroke, other aetiologies such as a tumour could, in some cir- 
cumstances, produce a similar clinical picture but need drastically 
different treatment. Additionally, imaging distinguishes between 
ischaemic and haemorrhagic stroke—an important determin- 
ation to make, as their secondary prevention strategies may differ 
significantly. 

To diagnose VCI, MRI is preferable to CT, unless there is a contra- 
indication, given its greater sensitivity. Specialized MRI sequences, 
such as susceptibility-weighted imaging (SWI) and gradient echo 
(GRE) sequences, detect the presence of blood with high sensitivity 
and specificity, including microbleeds related to amyloid angiopathy 
[21]. By evaluating the integrity of white matter tracts, diffusion 
tensor imaging (DTI) can identify white matter damage not seen on 
conventional MRI sequences such as T2 and fluid-attenuated inver- 
sion recovery (FLAIR) [12]. 

Common imaging findings in VCI include evidence of large- 
vessel ischaemic stroke(s), lacunar ischaemic stroke(s), extensive 
WMHs, white matter atrophy, and/or large or small haemorrhages. 
Microhaemorrhages are very common in ischaemic VasD— 
occurring in two-thirds to four-fifths of patients, a higher percentage 
than in other dementias such as AD [21]. Of note, individuals 
without VCI, cognitive impairment, cerebrovascular disease, or 
other structural brain disease can also show WMHs, for instance, 
patients with migraine or some individuals with healthy ageing [7]. 
When long tract signs, such as weakness or hyperreflexia, accom- 
pany WMHs, the WMHs more likely reflect genuine pathology [7]. 
However, there is no clear cut-off for the size or amount of vascular 
lesions required to cause VCI, as substantial inter-individual vari- 
ability exists. 

Dementia or cognitive impairment—of any type—cannot be 
diagnosed solely on an imaging basis. Some individuals have more 
cognitive reserve than others, so two people with the same imaging 
findings could have very different levels of cognitive functioning. 


Cognitive evaluation 


Cognitive evaluation should encompass both a thorough clinical 
history of cognitive and functional changes, and cognitive testing 
[20]. To diagnose VCI, there should be evidence that the patient 
has experienced a cognitive decline relative to his or her baseline 
functioning. 

The clinical history provides key information about the 
patient’s prior functional level, the time course and nature of the 
cognitive changes, associated symptoms, and performance of ac- 
tivities of daily living and instrumental activities of daily living 
[22]. The clinical history does have limitations, however. The pa- 
tient and/or other informants may dismiss worrisome events— 
such as getting lost while driving in a familiar area—as normal 
ageing, even though they are not. Sometimes, a similarly aged 
family member, such as a spouse, may also have cognitive impair- 
ment and thus not accurately perceive or remember the patient's 
recent functioning. 

In addition to a thorough history, appropriate cognitive testing— 
taking into account the patient’s age and educational, occupational, 


and cultural background—is essential for diagnosing VCI. However, 
the test results must be interpreted in light of the overall clinical con- 
text, as no test is 100% sensitive or specific for cognitive impairment. 

No cognitive test is entirely free of cultural and socio-economic 
bias. Typical cognitive batteries fail to capture important domains 
of cognition—such as social cognition, artistic creativity, and 
mechanical ability—and the clinical history may be more useful 
in uncovering decline in these areas. Patients functioning at a high 
occupational level may experience a decline in job performance as 
an early symptom of VCI; while this decline may have significant 
real-world importance, insufficiently challenging or occupationally 
specific cognitive tests may not detect any problems. For example, 
if a theoretical physicist comes to clinic complaining of recent dif- 
ficulty solving problems in her field, very few physicians or neuro- 
psychologists could determine if this was objectively true through 
office-based testing! Consultation with a professional colleague of 
the patient—with the patient’s consent—may help establish whether 
there is an actual decline in functioning, even when cognitive test 
scores are in the normal range. 

Some patients’ premorbid functioning may have been higher 
or lower than predicted by standard test norms or by estimates of 
premorbid intellectual functioning, and thus isolated test results 
may mislead as to whether a decline from baseline has occurred. 
For instance, a patient may have limited formal education due to 
socio-economic circumstances but be self-taught, well read, and 
functioning at a high intellectual level prior to his or her current 
problems. Conversely, due to variation in educational quality, some 
individuals may have graduated from high school without achieving 
functional literacy. Additionally, no tests to estimate premorbid in- 
tellectual functioning have complete immunity to decline from the 
effects of an acquired cognitive disorder; the data do not support the 
idea that some areas of cognitive functioning are ‘crystallized’ and 
do not decline in dementia. 

However, in most cases, the results of appropriately interpreted 
norm-based cognitive testing significantly help in clarifying the 
presence, severity, and nature of cognitive impairment. 

Given the heterogeneity of cognitive deficits in VCI, testing 
should assess both overall cognitive function—using a test of gen- 
eral cognitive function such as the Folstein Mini-Mental Status 
Exam (MMSE) or the Montreal Cognitive Assessment (MoCA)— 
as well as multiple specific cognitive domains, including tests of 
frontal executive function, memory, attention and concentration, 
orientation, visuo-spatial function, and language, and assessment 
for neglect. 

All tests have strengths and weaknesses. The MMSE depends 
heavily on the assessment of orientation, memory, and language 
and show less sensitivity for detecting impairment in patients 
whose primary problems may involve other domains; clinicians 
should supplement these tests with other instruments which better 
assess other cognitive functions. The MoCA may assess a broader 
variety of domains than the MMSE but may misclassify cognitively 
normal individuals with low educational attainment as having 
dementia. 

One useful approach is to assess overall cognitive functioning with 
a general test such as the MMSE or the MoCA, and then use tests of 
specific domains to interrogate further into areas of possible impair- 
ment. Box 46.2 lists several brief domain-specific assessments useful 
for administration at the bedside or in the clinic 
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Box 46.2 Tests of cognition for bedside and clinic 


ATTENTION/CONCENTRATION 

e Digit span forward and backward 
e Months backward 

TAL EXECUTIVE FUNCTIONS 
e Frontal assessment battery 

e Antisaccades 

LANGUAGE 

e Bedside Western Aphasia Battery-Revised 
e Boston Naming Test 

MEMORY 

e Hopkins Verbal Learning Test 
NEGLECT 

e Line bisection 

e Target cancellation 
VISUO-SPATIAL FUNCTION 

e Clock-drawing test 

e Navon figures 























Differential diagnosis 





The differential diagnosis for VCI includes neurodegenerative 
dementias such as AD and fronto-temporal dementia; multiple 
sclerosis and other autoimmune neurologic conditions; chronic 
infections such as HIV or neurosyphilis; vitamin deficiencies; 
endocrinopathies; leukodystrophies; and toxic exposures such as 
medications which impair cognition, alcohol, opioids, heavy metals, 
and volatile substances. Of note, some of these disorders can cause 
white matter changes which may be difficult to distinguish from 
those of VCI without further investigation such as cerebrospinal 
fluid (CSF) analysis or enzyme testing. 

When evaluating a patient for a cognitive disorder—whether sus- 
pected VCI or another condition—the most important task is to 
rule out potentially reversible or modifiable causes. If a patient with 
AD receives a misdiagnosis of VCI, that patient will not miss out 
on a disease-modifying treatment, as none currently exists for AD. 
However, if a patient with multiple sclerosis, severe B12 deficiency, 
or neurosyphilis is misdiagnosed with VCI, this could result in a 
lost opportunity to halt or reverse the progression of these treatable 
diseases. 

Serum laboratory testing for reversible causes of cognitive impair- 
ment should include assessment of B12, thyroid function tests, com- 
plete blood count (CBC), comprehensive metabolic panel (CMP), 
ammonia, vitamin D, folate, thiamine, HIV, treponemal syphilis 
testing, erythrocyte sedimentation rate (ESR), and antinuclear anti- 
body (ANA) [22, 23]. Depending on clinical circumstances (for 
example, a history of gastric bypass or other condition affecting 
nutrient absorption, or an occupational history of heavy metal ex- 
posure), additional testing may be needed to fully evaluate for po- 
tentially reversible causes. 

Unfortunately, many older individuals use medications known 
to impair cognition and increase risk for dementia, such as anti- 
cholinergic medications (for example, allergy medications, over- 
the-counter sleeping aids, antispasmodic bladder medications, and 
certain drugs for depression, anxiety, and psychosis) and benzodi- 
azepines. Alcohol and recreational drugs can also cause cognitive im- 
pairment. Fortunately, the cognitive harm done by these exogenous 
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factors is at least potentially reversible. Before diagnosing a cognitive 
disorder, clinicians should make all efforts to eliminate the use of 
these agents and then reassess cognition and functioning. Even if the 
patient does have VCI, these substances will likely worsen impair- 
ment and should be eliminated in any case. 

Individuals with a history of stroke or ischaemic white matter 
disease show increased rates of depression, which can contribute 
to functional impairment and may resemble a neurocognitive 
disorder but which is reversible with appropriate treatment [1]. 
Clinicians assessing a patient for cognitive impairment should al- 
ways evaluate for depression and treat it whenever present, regard- 
less of whether or not it caused the cognitive problems—comorbid 
untreated depression can substantially worsen cognitive function. 
In addition to causing emotional suffering, post-stroke depres- 
sion is associated with increased mortality, more disability, slower 
neurologic recovery, an increased risk of suicide, and worse quality 
of life [24]. The cognitive deficits most consistently associated with 
depression are impairment of attention/concentration, frontal ex- 
ecutive function, working memory, and processing speed; depres- 
sion would be very unlikely to cause other deficits such as aphasia 
or neglect [25]. 

Sometimes, it may not be possible to determine whether cere- 
brovascular disease or another condition such as AD is the primary 
determinant of cognitive impairment. However, in either case, un- 
controlled vascular risk factors can cause worsening of functioning 
and cognition, so vascular risk factor modification should be under- 
taken regardless. Any patient with cerebrovascular disease needs 
vascular risk factor modification, regardless of cognitive status or 
type of cognitive disorder. 


Mechanisms of VCI 


Although strokes can produce cognitive impairment by direct 
damage to brain regions involved in cognition, additional mechan- 
isms likely can also give rise to VCI. 

Small-vessel VCI may disrupt the neurovascular unit, that is, 
the vascular endothelium, perivascular cells, glia, and neurons, 
which work together and influence each other to maintain cerebral 
homeostasis—via inflammation and damage to the blood-brain 
barrier, as shown in Fig. 46.2 [1, 7]. Injury to the neurovascular unit 
may play key role in the progression of white matter injury in small- 
vessel VCI and occurs more frequently in VCI than when white 
matter changes occur in the context of AD [6]. Neuropathologic 
findings in small-vessel VCI include abnormalities in and around 
small vessels, including endothelial dysfunction, vessel fibrosis, and 
inflammation [6]. These changes may cause cerebrovascular resist- 
ance and decreased cerebral blood flow, thereby potentially contrib- 
uting to hypoperfusion and further vascular damage [4].The CSF 
of individuals with small-vessel VCI shows elevations in albumin 
and matrix metalloproteinases, consistent with inflammation and 
blood-brain barrier disruption; specialized MRI also reveals evi- 
dence of damage to the blood-brain barrier [6, 26]. Damage to the 
neurovascular unit may impair perfusion during cognitive activities 
and thus further exacerbate neuronal dysfunction in vascular cog- 
nitive impairment [4]. 

In addition to the classic findings of ischaemia or haemorrhage, 
VCI may cause other brain changes which could adversely affect 


cognition. VCI may cause changes in the corpus callosum, such 
as decreased white matter integrity and perhaps decreased size, 
potentially affecting interhemispheric transmission of informa- 
tion [27]. One study also found decreased white matter integ- 
rity in the corpus callosum in VasD [27]. One study found that 
strategic-infarct VCI from isolated left thalamic stroke was asso- 
ciated with decreased cerebral blood flow in both hemispheres, 
suggesting that even a single small stroke can have far-reaching 
consequences, perhaps by disrupting broader networks involved 
in cognition [28]. 

A cross-sectional study found that individuals with ‘asymptom- 
atic’ subcortical lacunar infarcts showed more cortical and sub- 
cortical atrophy and more cognitive impairment than healthy 
controls [29]. 

Though classically associated with AD, cholinergic deficits may 
occur in VCI as well [1]. Neuropathologic case-control studies 
found impairment of cholinergic pathways in small-vessel VasD and 
CADASIL [30]. However, another neuropathologic case-control 
study found that individuals with ‘pure’ VasD, without any evidence 
of AD-type pathology, did not show evidence of temporal lobe cho- 
linergic deficits, as compared to healthy controls; on the other hand, 
individuals with mixed VasD-AD did show cholinergic deficits com- 
parable to those seen in ‘pure’ AD [31]. 


Outcomes and consequences of VCI 


There is contradictory evidence on whether VasD progresses faster 
or more slowly than AD. Some studies have found that individuals 
with VasD survive for a mean of 5-6 years following diagnosis, a 
slightly shorter survival than in AD; however, these studies are over 
20 years old and may not reflect changes in diagnosis (that is, in- 
creased detection sensitivity with more widespread use of MRI) and 
advances in vascular risk factor control, which could potentially 
improve survival in the current era [32]. One large cohort study 
of memory clinic patients seen found that individuals with VasD 
show slower progression of functional decline than those with AD 
[33]. However, only 2% of study participants had VasD, a far lower 
number than their proportion among dementia cases in the gen- 
eral population, raising questions about the generalizability of this 
finding. 

VasD increases the risk of subsequent stroke and adversely affects 
stroke recovery. Individuals with VasD, but no prior history of clin- 
ical stroke or transient ischaemic attack (TIA), show a 2-fold higher 
risk for subsequent ischaemic stroke and TIA and a 4-fold higher 
risk for subsequent haemorrhagic stroke [34]. One study of patients 
hospitalized for acute stroke found that those with VasD have less 
functional recovery at 1 year follow-up than those with normal 
cognition; however, VCI not rising to the level of dementia did not 
worsen functional outcomes [35]. 

Individuals with VasD also show higher rates of other vas- 
cular diseases—namely, atherosclerosis, heart failure, and atrial 
fibrillation—as well as higher rates of non-vascular comorbidities 
such as sepsis, injuries, lung diseases including chronic obstructive 
pulmonary disease, and urinary diseases [36]. 

VasD increases the risk for delirium, which can worsen de- 
mentia outcomes [37]. One study of memory clinic outpatients 
found that one-third of patients with VasD had superimposed 
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Fig. 46.2 Schematic diagram of the pathophysiology of small-vessel VCI. 


delirium at the time of their clinic visit [37]. Among individuals 
with neurodegenerative dementias such as AD, those with imaging 
evidence of comorbid cerebrovascular disease had higher rates of 
delirium than those without cerebrovascular disease [37]. Clinicians 
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treating patients with VasD should carefully monitor for delirium 
and educate family members and other caregivers about the symp- 
toms and significance of delirium, so that they can seek treatment 
promptly if it occurs. 
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Stroke 


Stroke, whether clinically apparent or silent, whether haemorrhagic 
or ischaemic, is a major risk factor for VCI. Having a clinical stroke 
doubles the risk for dementia [1]. Among people with stroke, 7% de- 
velop new-onset dementia within the first year after the stroke and 
nearly 50% develop dementia over the next 2 years [1]. Forty per cent 
of patients with first ever ischaemic or haemorrhagic stroke, who do 
not have severe aphasia, have VCI at 3-months’ follow-up [38]. 

Individuals with silent ischaemic strokes, WMHs, and/or ‘asymp- 
tomatic cerebral microbleeds also show an increased risk for sub- 
sequently developing cognitive impairment [1, 8]. Among patients 
with MCI, those with more WMHs are more likely to convert to 
VasD [39]. 

All else being equal, patients with more brain territory affected 
by infarcts are more likely to have cognitive impairment than those 
with less affected territory. However, since other factors, such as 
stroke location, vary significantly between patients, overall there 
is no clear relationship between the number/volume of strokes and 
cognitive impairment [1]. For instance, even a small lacunar infarct 
in the thalamus could produce major cognitive consequences, while 
a larger infarct in the deep white matter could go largely unnoticed. 


Other vascular risk factors 


In addition to stroke itself, vascular risk factors associated with 
stroke also increase the risk for VCI. Even in the absence of stroke 
or dementia, individuals with more vascular risk factors show worse 
cognitive performance [4]. 

One large population-based prospective study of factors im- 
portant for good vascular health—not smoking, maintaining a 
healthy body weight, engaging in regular physical activity, eating a 
healthy diet, and having blood pressure, cholesterol, and fasting glu- 
cose at goal—found that the more of these factors an individual had, 
the lower his or her risk for subsequent stroke, VasD, or cognitive 
decline [40]. Importantly, all of these factors are potentially modifi- 
able, suggesting a strategy for preventing dementia. 

Multiple longitudinal studies support an association between 
mid-life hypertension and subsequent development of VasD [1]. 
Hypertension appears to be a major risk factor for small-vessel VasD, 
found in a significant majority of patients [6]. However, one study in 
South Korea found that 25% of individuals with small-vessel VasD 
did not have any current or previous hypertension [41]. These in- 
dividuals also had lower rates of other vascular risk factors such as 
DM, hyperlipidaemia, and obesity and lower rates of clinical stroke 
than other patients with subcortical VasD, and were not more likely 
to smoke [41]. Potentially, they have other novel risk factors predis- 
posing them to small-vessel VasD [41]. One cross-sectional case- 
control study found that individuals with a history of hypertension 
had decreased cortical blood flow in the temporal and occipital lobes; 
this finding suggests an additional pathway by which hypertension 
could contribute to VCI, but the study did not actually measure cog- 
nition, making it impossible to draw a firm conclusion [42]. 

Most, but not all, studies have found that atrial fibrillation in- 
creases the risk for VasD; in the studies that did not find an in- 
crease risk, it is possible that participants were more effectively 
anticoagulated, thus attenuating their risk [1]. 


Diseases of other blood vessels—including CAD, peripheral ar- 
terial disease, carotid atherosclerosis, and carotid thickening or 
stiffening—are associated with an increased risk for VasD [1]. Some 
studies have found that increased carotid artery thickening and stiff- 
ness are associated with worse cognitive performance; although one 
study found that carotid artery stiffness predicted subsequent cogni- 
tive worsening, it is not known whether carotid artery changes cause 
or contribute to cognitive impairment or whether they are both re- 
lated to some other factors [1]. 

Smoking is associated with a 1.4-times greater risk for subse- 
quently developing VasD; however, former smokers who have quit 
do not have an elevated risk [43]. Similar relationships with smoking 
are seen for AD and all-cause dementia as well [43]. This pattern 
suggests that smoking increases the risk for developing VasD—as 
well as dementia in general—but that quitting smoking can elim- 
inate this excess risk [43]. 

Type 2 DM (T2DM) increases the risk for subsequently developing 
VasD by 2.5 times; T2DM also increases the risk for AD, but not by 
as much [42]. Insulin resistance also increases the risk for VasD [8]. 

The role of hyperlipidaemia in VCI remains unclear [1]. 


Genetic risk factors 


CADASIL is a Mendelian genetic disorder caused by mutations in 
the NOTCH3 gene and markedly increases the risk for VCI [43]. 
CADASIL is the most common hereditary stroke syndrome, with 
a prevalence of up to 1 in 20,000 adults [43]. One epidemiologic 
study found that 48% of individuals with CADASIL had cognitive 
impairment [43]. 

Apolipoprotein E (APOE) is a protein involved in lipid metab- 
olism, including CNS cholesterol transport, among other functions; 
it is well known that APOE 4 polymorphism increases the risk for AD 
[8]. The role of APOE polymorphisms in VasD risk is still unclear; a 
majority of studies have found that the APOE 4 allele does increase 
the risk for VasD, though the magnitude of increased risk is less than 
that for AD [8]. However, a significant minority of studies have found 
no relationship [8]. As the negative studies were conducted in areas 
where people may be more likely to eat a Mediterranean diet or a diet 
high in fish, diet could potentially moderate the relationship between 
APOE genotype and VasD, such that individuals with APOE 4 alleles 
who consume a heart-healthy diet do not face an increased risk, while 
those with APOE 4 alleles who eat more saturated fat and cholesterol 
do; however, no study has yet investigated this question [8]. 


Other risk factors 


Individuals with depression late in life have a 2.5 times greater like- 
lihood of subsequently developing VasD [46]. 

Inflammation and oxidative stress may also increase the risk for 
VCI. Increased blood levels of inflammatory proteins, such as a- 
1-chymotrypsin, C-reactive protein, and interleukin-6, are associ- 
ated with an increased risk for subsequently developing VasD [1]. 
Chronic kidney disease—typically accompanied by inflammation, 
in addition to other adverse physiologic changes—is also associated 
with an increased risk for VasD [1]. One cross-sectional case-control 
study found that individuals with VasD showed a biomarker pattern 
consistent with increased oxidative stress, as compared to healthy 
controls, even after adjusting for other vascular risk factors [47]. 
Individuals with VasD may also show hyperhomocysteinaemia and 
hyperuricaemia [47]. 


One cross-sectional case-control study found lower serum levels 
of brain-derived neurotrophic factor (BDNF) in individuals with 
VasD than in healthy controls [46]. BDNF plays an important role 
in synaptic plasticity and neuronal survival, so loss of BDNF could 
plausibly contribute to dementia [46]. However, a prior study found 
that BDNF levels were not decreased in VasD; this may possibly re- 
flect the different levels of VasD severity in the two studies [46]. 





Prevention and disease-modifying treatment 


As VCI is caused by damage to the brain from vascular causes, that 
is, strokes and ischaemic demyelination—interventions to reduce 
the risk of stroke and improve vascular health can help prevent VCI 
and help arrest disease progression once VCI is present. Rates of 
VasD are falling, a hopeful finding which appears due, in part, to 
improvements in vascular risk factor modification [3]. 

Hypertension is the single most common modifiable stroke 
risk factor, and hypertension should be treated to reduce the risk 
for VasD [1]. One RCT in patients with a history of stroke found 
that reducing blood pressure with antihypertensives reduced 
the risk for developing dementia, and a prospective cohort study 
found an inverse relationship between VasD and antihypertensive 
drug use [1, 8]. White matter lesions increase when hypertension 
is not controlled, while treatment of hypertension reduces their 
progression [6]. 

Numerous observational studies of the Mediterranean diet, as 
well as one RCT of a Mediterranean diet supplemented with olive oil 
or nuts, have found that this diet reduces the risk of cognitive decline 
[49]. Although these studies did not specifically target individuals 
with VCI, other studies have found that a Mediterranean diet and 
olive oil can both reduce the risk of clinical stroke, WMHs, and silent 
stroke, making it very plausible that this diet could reduce the risk 
of VCI and even potentially slow disease progression in those who 
have it [49]. Theories for why a Mediterranean diet could benefit 
cognition and reduce vascular disease include its high levels of anti- 
oxidant and anti-inflammatory substances, including polyphenols 
and omega-3 fatty acids [49]. 

A meta-analysis of prospective longitudinal studies found that 
people who regularly exercise are less likely to get VasD [50]; how- 
ever, these non-RCT studies do not permit a determination of caus- 
ality. One cohort study of patients with white matter lesions found 
that increased physical activity was associated with lower rates of 
cognitive impairment [6]. 

RCTs of antioxidant and B vitamin supplementation—albeit not 
focusing on individuals with VCI—have, in general, found no effects 
on cognitive function, though some subgroup analyses suggest that 
some particular individuals (for example, those with poor dietary 
intake of vitamin B) may potentially experience some benefit [1]. 

Patients with VCI, a history of stroke or TIA, or imaging evidence 
of cerebrovascular disease should be worked up for modifiable risk 
factors—most importantly, hypertension, DM, and atrial fibrillation. 
Patients without these classic vascular risk factors—particularly 
young individuals—should undergo work-up for hypercoaguable 
states and other causes of cryptogenic stroke. All clinicians should 
encourage smoking cessation, as quitting smoking reduces the 
risk for subsequent VCI to the level of that seen in never-smokers 
[43]. Aggressive secondary stroke prevention measures—such 
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those delineated in the American Heart Association/American 
Stroke Association guidelines—should be undertaken to prevent 
worsening of cognitive impairment and disability and to lessen the 
risk for future stroke. 


Symptomatic treatment 


Although there are no FDA-approved treatments for VCI, there is 
reasonably good evidence to support the off-label use of medications 
approved for other conditions to ameliorate the symptoms of VCI. 

Cognitive-enhancing drugs approved for AD—cholinesterase 
inhibitors and memantine—appear to show benefit in VCI as well. 
Donepezil has the best evidence base for use in VasD without con- 
comitant AD [1]. Multiple double-blind, placebo-controlled RCTs 
have shown that donepezil improves cognition in VasD; however, 
different studies have differed on whether donepezil also improves 
functional outcomes [1]. In one study of donepezil, the 10 mg/day 
dose showed superiority to the 5 mg/day dose on the measure of de- 
mentia severity, but the doses did not differ on other measures [51]. 

Galantamine also has evidence supporting its use in VasD, with 
RCTs showing it improves cognition; however, the evidence is more 
robust for mixed AD-VasD than for pure VasD [1]. 

One large placebo-controlled trial found that oral rivastigmine 
improved cognitive outcomes, though not overall functional status, 
in individuals with VasD; however, a subgroup analysis found that 
the cognitive benefit was driven by participants aged 75 years old 
or above, who may have been more likely to have concomitant AD 
pathology [52]. Two small studies of rivastigmine in VCI did not 
find any benefits; however, it is possible they were underpowered, 
and in addition, one study used a smaller dose (6 mg/day) than the 
study which did find cognitive benefit (mean dose 9.4 mg/day) [53]. 

Overall, cholinesterase inhibitors appear to be safe and well tol- 
erated in VasD, with rates and severity of adverse effects similar to 
those observed in AD [1]. 

Two double-blind, placebo-controlled RCTs examined the un- 
competitive NDMA inhibitor memantine in VasD. Both found that 
memantine improved cognitive outcomes but did not affect func- 
tional status [54, 55]. Memantine was safe and well tolerated in these 
studies [54, 55]. 

In addition to their benefits in VasD, donepezil, galantamine, and 
memantine have RCT evidence that they are effective for post-stroke 
aphasia, again suggesting these agents can improve cognitive deficits 
due to cerebrovascular disease [56]. 

Some preliminary data suggest that SSRIs may improve cogni- 
tion in VCI. A small randomized, open-label trial of fluoxetine in 
non-depressed patients with VasD found that participants in the flu- 
oxetine group showed a small, but statistically significant, increase 
in cognitive function following 12 weeks of treatment [57]. Serum 
BDNF levels in the fluoxetine group also increased more and posi- 
tively correlated with the degree of cognitive improvement, sug- 
gesting an SSRI-induced BDNF increase as one potential mechanism 
for the cognitive benefit [57]. A double-blind study of escitalopram 
in patients with stroke, but not necessarily VasD, and a retrospective 
open-label case series examining sertraline in VCI found similar 
cognitive improvement [57, 58]. Further studies are needed to help 
elucidate whether SSRIs may play a role in the treatment of cognitive 
impairment due to cerebrovascular disease, but these results suggest 








SECTION 7 Delirium, dementia, and other cognitive disorders 


SSRIs may represent one additional potential avenue of therapy. 
Fluoxetine has randomized, double-blind, placebo-controlled trial 
evidence that it can improve motor function following stroke, which 
suggests another way in which SSRIs could benefit individuals with 
VCI, many of whom also have motoric deficits [59]. 

One small double-blind, randomized, placebo-controlled 
trial for the treatment of aphasia in acute stroke found that 
dextroamphetamine improved language function in the subacute 
period [60]. It is unknown if stimulants could improve other post- 
stroke cognitive deficits. 

Some individuals with dementia may develop severe behaviour 
problems and often receive dopamine antagonist drugs as treatment, 
though given their potential for adverse side effects, they should be 
reserved for situations in which non-pharmacologic means have 
failed. While dopamine antagonist drugs appear to improve be- 
havioural disturbance in the short term in AD, there are no high- 
quality data on whether they are effective in VasD [61]. However, 
a retrospective cohort study of individuals with VasD, followed 
up for a mean of 2 years, found that patients who were prescribed 
risperidone, quetiapine, or olanzapine did not show an increased 
mortality risk; of note, the study did not examine the use of other 
dopamine antagonist drugs and did not evaluate adverse outcomes 
other than death [61]. 

Standard rehabilitation therapies for post-stroke cognitive 
deficits, for example speech therapy for aphasia, prism glasses for 
neglect, etc., may also be useful in patients with VCI who have these 
deficits, though they have not been specifically studied in the VCI 
population. Nonetheless, given the favourable benefit/risk ratio, if a 
patient with VCI has a cognitive deficit with a known post-stroke re- 
habilitation strategy, the patient should be offered the intervention. 

Thus far, psychosocial interventions for VCI have received little 
study. One uncontrolled trial of recreation therapy—playing games, 
dancing, and playing music—improved cognitive function in pa- 
tients with VasD, and the benefit increased with increasing number 
of sessions [62]. Given the potential for benefit and the very low risk 
of harm, it is reasonable to try psychosocial strategies with evidence 
for efficacy in dementia in general such as environmental enrich- 
ment, animal therapy, baby doll therapy, physical activity, social en- 
gagement, and provision of enjoyable activities [63]. 


VCI represents a leading cause of cognitive impairment, but vas- 
cular risk factor modification may reduce VCI risk and decrease dis- 
ease progression. Individuals without a history of clinical stroke may 
still have VCI, so clinicians should consider this diagnosis in any 
patient with acquired cognitive impairment. Cholinesterase inhibi- 
tors and possibly memantine may improve cognitive functioning in 
individuals with VCI. 
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Traumatic brain injury 
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and Martha E. Shenton 


Traumatic brain injury (TBI) is the result of blunt trauma, 
acceleration-deceleration forces, rotational forces, or blast exposure 
to the head. The injury involves a heterogenous pattern of focal and/ 
or diffuse damage to the brain. The severity of the injury covers 
the spectrum from mild to moderate to severe, with severe injury 
leading to possible coma and even death. In addition, the range of 
symptoms, the variability in treatment options, and the prognosis 
of TBI, as well as the psychosocial implications, make it a complex 
injury that often calls upon the services of neurosurgeons, neurolo- 
gists, psychiatrists, psychologists, and rehabilitation specialists to 
help patients achieve the best outcome possible. 

The aim of this chapter is to provide an overview of TBI that 
includes the classification, epidemiology, aetiology, and patho- 
physiology of TBI. This is followed by an overview of clinical 
symptoms and long-term outcome. We then review diagnostic 
implications and treatment options. This is followed by a re- 
view of other neuropsychiatric disorders that evince overlapping 
symptoms such as post-traumatic stress disorder (PTSD) and 
depression. We end with a summary that emphasizes evidence- 
based diagnosis and the need for more research focused on re- 
covery over time. 








TBIs are commonly classified into mild, moderate, and severe, using 
the Glasgow Coma Scale (GCS) score [1]. The GCS measures patient 
responsiveness along three dimensions: eye opening, verbal per- 
formance, and motor response. GCS scores range from 3 to 15, with 
higher scores reflecting greater functioning. GCS scores indicate 
TBI severity as follows: GCS scores of 13-15 are considered mild, 9- 
12 moderate, and 8 and below severe (Table 47.1). Further, duration 
of loss of consciousness, along with time span of post-traumatic am- 
nesia (PTA), is also used to classify the degree of severity. The term 
‘concussion is often used as a synonym for mild TBI (mTBI), both in 
general and in the field of sports-related brain injury. For a review of 
TBI classification approaches, see [2]. 





TBI is a leading cause of disability and mortality [3] throughout 
the world. In the United States, an estimated 2% of the population 
lives with disability resulting from TBI. The annual incidence of 
TBI is approximately 1.7 million [4]. Fortunately, approximately 
80% of these injuries are considered to be ‘mild. The prevalence 
of mTBI is, however, likely greatly underestimated, as the extant 
statistics frequently exclude cases of TBI treated in non-hospital 
and military settings. Additionally, it is not known how many in- 
dividuals do not seek medical care or whose diagnoses are missed 
altogether. Thus, the true prevalence is likely much higher for those 
with mTBI. 

The annual direct costs associated with TBI are estimated to be 
$13 billion, while the indirect costs, which include missing days of 
work, are estimated to be $65 billion, making TBI one of the most 
costly health problems needing rehabilitation services [5]. Of fur- 
ther note, longterm disability is estimated to affect 43% of post-acute 
hospitalization cases [5]. Further, within 1 year of TBI, nearly a third 
of patients report difficulties with two or more activities of daily 
living, while about 40% require the assistance of another person 
due to cognitive and/or physical disability (mainly those with mod- 
erate or severe TBI). For example, a prospective study by Ponsford 
et al. [6] showed that, of patients employed prior to moderate or 
severe TBI, only 40% returned to work in some capacity (either to 
pre-injury or modified duties) when assessed at 2, 5, and 10 years 
post-injury. Thus, the outcome for those with more moderate and 
severe TBI is worse than for the majority of patients who are diag- 
nosed with mT BL 





Sources of injury and mortality 


The leading causes of TBI based on emergency department visits in 
the United States are falls (41%), a head strike by or against an ob- 
ject (19%), motor vehicle traffic accidents (15%), and assaults (11%) 
[7]. Falls associated with TBI are most common in children under 
4 years of age and in adults 75 years and older, while TBI resulting 


Table 47.1 Traumatic brain injury severity classification 
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Severity GCS score Alteration in consciousness Loss of consciousness Post-traumatic amnesia 


Mild 13-15 <24 hours 
Moderate 9-12 >24 hours 
Severe 3-8 >24 hours 


GCS. Glasgow Coma Scale. 


from motor vehicle accidents is most common between the ages of 
15 and 34 years, and TBI resulting from assault is most common 
between the ages of 20 and 34 years [4]. Additionally, head im- 
pacts by or against an object are most common from childbirth to 
19 years and peak again between the ages of 25 and 34 [4]. Finally, 
TBI-related mortality is most commonly caused by firearms (35%), 
motor vehicle accidents (31%), and falls (17%) [8]. 


Risk factors for TBI 


The incidence of TBI is affected by a number of variables, including 
sex, age, socio-economic status, race and ethnicity, alcohol and other 
substance use, and experiencing a previous TBI. More specifically, 
across the sexes and over the lifespan, TBI is most common in chil- 
dren age 0 to 4, adolescents age 15 to 19, and adults over the age 
of 75 years [4]. Males have a higher incidence of TBI than females 
across all age groups [4, 9], with the exception of those who are aged 
75 years or older where females have a higher incidence. The greater 
number of TBIs in males has been attributed to increased risk-taking 
behaviours and activities. 

The incidence of TBI also varies by socio-economic status, with a 
higher incidence of TBI in families with lower income. The incidence 
of TBI is also increased in non-white ethnic groups in the United 
States [4, 8]. Moreover, African Americans, Native Americans, and 
Alaska Natives are reported to have the highest incidence of assault 
TBI [10]. The use of alcohol and other substances is also often as- 
sociated with an increased risk for TBI [11]. It has also long been 
recognized that an initial TBI carries a greater risk than the gen- 
eral population for subsequent TBI(s) [12]. In fact, individuals who 
incur a first TBI are three times more likely to incur a second TBI, 
and those who incur a second TBI are 7.8 (males) to 9.3 (females) 
times at greater risk for a third TBI [13]. 


Pathophysiology 


TBI may lead to shear deformation of the brain with stretching of 
axons and the surrounding myelin sheath. This stretching may re- 
sult in changes in cell membrane permeability, which may, in turn, 
result in ionic shifts, including an influx of calcium. Calcium may 
then accumulate in the mitochondria where it leads to impaired 
oxidative metabolism, with subsequent energy failure of the cell, 
and eventually to the breakdown of microtubules. In addition, there 
may be other factors that contribute to brain injury such as sudden 
changes in cerebral blood flow and decreases in inhibitory neuro- 
transmitters such as GABA and/or activation of NMDA receptors. 
[14, 15]. 

Brain injury following TBI is characterized by focal and/or diffuse 
neuropathologies that, depending upon their localization and se- 
verity, can lead to dysfunction at physiologic, cellular, and subcellular 
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levels. The initial effects of a head injury include scalp and skull le- 
sions, cerebral contusions, haemorrhages, intracranial haematomas, 
and diffuse traumatic axonal injuries [16]. The primary injury may 
also initiate processes that lead to secondary damage, which can 
evolve over the course of recovery and may result in later effects of 
injury. Such secondary damage is more likely in moderate and se- 
vere TBI and may include ischaemia, elevated intracranial pressure, 
neuroinflammation, cell death, infection, and gliosis [16]. In mTBI, 
there are generally no visible brain alterations observed on CT scans 
or conventional MRI scans, although there is evidence of brain in- 
jury in post-mortem studies of those diagnosed with mTBI who died 
from other causes [17]. 


Clinical presentation 


Moderate and severe TBI 


TBI is associated with a range of neurologic, neuropsychiatric, 
cognitive, and behavioural disturbances where the degree of im- 
pairment typically reflects the severity of the injury. In the acute 
phase following injury, the symptoms observed often reflect the 
location and severity of damage to brain tissue and blood vessels, 
as well as the presence of concomitant injuries, e.g., skull fracture. 
Acute symptoms may include headache, nausea, dizziness, dis- 
orientation, and disturbances of consciousness or even coma [18]. 
In the subacute phase following injury, seizures are common and 
carry the added risk of a secondary injury to the brain [19]. In the 
chronic phase, moderate to severe TBI may result in alterations 
in consciousness, paresis, neuropathies, speech impairments, 
seizure disorders, attention, memory, and concentration prob- 
lems, sleep disturbances, changes in personality, depression, irrit- 
ability, impulsivity, fatigue, and other emotional and behavioural 
problems [20]. 


Mild TBI 


Acute symptoms following mTBI may include nausea, headache, 
neck pain, blurred vision, insomnia, dizziness, postural instability, 
fatigue, memory, attention, concentration problems, and slowed 
processing speed, among other symptoms such as light and sound 
sensitivity [18]. These symptoms are typically most prominent in the 
acute stage of post-injury and, in most cases, resolve over days and 
weeks (Fig. 47.1). It is important to note, however, that not all symp- 
toms are present in all patients who suffer from mTBI, and in the 
absence of radiological evidence from CT or MRI, the diagnosis is 
generally made based on the occurrence of a head trauma and self- 
reported symptoms. 

In the event that there are radiological findings in mTBI, for ex- 
ample cerebral contusion or haematoma, such an injury is often re- 
ferred to as ‘complicated mTBP [21]. Of note here, there appear to 
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Fig. 47.1 This is a schematic multistage disease model of possible short- and long-term sequelae, following single and repetitive brain trauma. There 
are three possible trajectories: an acute/subacute phase, a chronic/static phase, and a neurodegeneration phase. Subconcussive head impacts usually 
do not result in acute symptoms indicated by the horizontal dotted line in the acute/subacute stage. Some patients may experience chronic post- 
concussive symptoms that resolve over time (black line) or continue (light grey band). A small subgroup will develop a neurodegenerative disease 


(darker grey band). 


Adapted from Brain Pathol., 25(3), Koerte IK, Lin AP, Willems A, et al., A review of neuroimaging findings in repetitive brain trauma, pp. 318-49, Copyright (2015), with 


permission from John Wiley and Sons. 


be no differences in reported symptoms, neurocognitive measures, 
or long-term prognosis in those diagnosed with complicated mTBI 
vs mTBI. Notably however, as a group, those with complicated mTBI 
tend to take longer to return to work than those with mTBI without 
radiological findings [21]. Moreover, a recent study by Panenka and 
colleagues [22] reported that while there were no differences be- 
tween mTBI and complicated mTBI with respect to reported symp- 
toms, neurocognitive functioning, or clinical outcome, there were 
differences in DTI findings 6-8 weeks following injury. Thus, dif- 
fusion imaging measures may be a more sensitive measure of dif- 
fuse axonal injury, which is one of the most characteristic findings 
in mTBI. 


Long-term outcome 


Post-concussive syndrome 


While most individuals recover completely from mTBI, there 
are up to 15-30% who continue to experience cognitive and be- 
havioural symptoms that may persist for months or even years 
[23] (Fig. 47.1). These individuals have been referred to as the 
‘miserable minority’ [24]. The term ‘post-concussion syndrome’ 
(PCS), or persistent post-concussive symptoms, has been used 
to describe these patients where there is often a combination 
of non-specific symptoms that persist beyond 3 months post- 
injury. Symptoms may include headache, fatigue, dizziness, 
blurred vision, nausea, irritability, sleep disturbances, hypersen- 
sitivity to light and noise, depression, and anxiety, in addition to 
deficits in attention, memory, concentration, executive function, 
and speed of processing (for example, [25]). PCS is presumed to 
result from a dysfunctional cognitive feedback loop [26] where 
acute TBI-related symptoms disrupt cognition and cause anxiety, 
which, in turn, results in further cognitive disruption. The lit- 
erature suggests that there may also be premorbid risk factors 
that contribute to the development of PCS. These risk factors 
may include neurologic findings, psychological and personality 


factors, psychosocial issues, and/or being a participant in liti- 
gation (for a review, see [27]). Cognitive biases and misattri- 
bution of symptoms, as well as excessive cognitive and physical 
rest, may also contribute to prolonged recovery from mTBI. The 
problem is further compounded by the fact that the symptoms 
in mTBI in general, and in PCS in particular, also overlap with 
other disorders such as depression and PTSD. Nonetheless, taken 
together, the possible explanations for PCS are many. It is also 
clear that radiological evidence that goes beyond conventional 
MRI and CT is needed because these more conventional meas- 
ures lack sufficient sensitivity to detect diffuse axonal injuries, 
which, as noted previously, are the most common brain injury 
in mTBI [28]. 


Psychosocial issues 


TBl-related alterations in cognition, behaviour, and emotion can 
also adversely affect psychosocial adjustment across multiple do- 
mains, including life satisfaction, social functioning, employment 
and school, independent living, and leisure activities (for reviews, 
see [29, 30]). For example, moderate to severe TBI is often associ- 
ated with lower overall life satisfaction, both in the acute and long 
term post-injury phases. Changes in life roles (for example, work, 
hobbies, relationships) and depression are common following such 
TBI, and they are strong predictors of life satisfaction across the first 
5 years post-injury [31], making these important targets for early 
intervention. 

Several pre-injury variables are also noted to increase the risk 
for poor psychosocial outcome following TBI. Pre-injury psychi- 
atric burden and a history of substance abuse are, for example, 
both associated with decreased employment and independent 
living status following moderate to severe TBI. Furthermore, 
both pre-injury functioning (including education and employ- 
ment) and pre-injury condition (including psychiatric disorder, 
substance abuse, sensory dysfunction, and learning difficul- 
ties) are associated with less post-injury life satisfaction and 
function [32]. 


Families and caregivers are also an integral part of rehabilitation 
following TBI. Unfortunately rates of distress, including anxiety, 
depression, and poor social adjustment, are common in adult care- 
givers of individuals with TBI (for a review, see [33]). Such inter- 
personal challenges, initiated by TBI, may change family roles, and 
relationships may become strained. In mild to moderate TBI, better 
family functioning, including reduced caregiver distress, is associ- 
ated with greater home and social integration [34]. In severe TBI, 
increased social support of the caregiver is associated with improved 
productivity and social integration in the patient [34]. Thus, support 
for TBI patients, as well as for their families and caregivers, is a crit- 
ical part of any rehabilitation programme. 


Neurodegenerative disorders 


Neurodegenerative disorders have been associated with a history of 
TBI (Fig. 47.1). There is evidence to suggest that a history of TBI 
is associated with a greater risk for Parkinson's disease [35], amyo- 
trophic lateral sclerosis [36], and Alzheimer’s disease [37]. In gen- 
eral, a history of TBI is believed to increase the risk for later-life 
proteinopathies, which are the underlying pathophysiology of these 
three neurodegenerative disorders [36]. Another proteinopathy 
with a strong link to a history of repetitive TBI is chronic traumatic 
encephalopathy (CTE). 

In post-mortem studies, CTE is characterized by a pathogno- 
monic pattern of perivascular deposition of hyperphosphorylated 
tau (p-tau) in neurons and astrocytes. In early stages of the dis- 
ease, more focal perivascular accumulations can be detected at the 
depths of the cortical sulci. As the disease progresses, widespread 
distribution is detected throughout the brain [38]. In later stages 
of the disease, brain atrophy may be extant [38]. At present, CTE 
is diagnosed post-mortem. Moreover, a history of exposure to re- 
petitive head impacts is assumed to be a necessary, but not suffi- 
cient, cause for the development of CTE (for a review, see [39]). 

CTE has most often been reported in athletes participating in 
contact sports involving repetitive head impacts (for example, 
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boxing, ice hockey, American football, and soccer) and in mili- 
tary service members who are exposed to blast and/or direct head 
impacts [38, 40]. The clinical presentation of CTE, however, is 
not well understood [38, 41, 42]. Via next-of-kin interviews and 
medical record review of neuropathologically confirmed CTE, a 
constellation of prodromal symptoms appears to herald the dis- 
ease. These symptoms include disturbances in behaviour (e.g., in- 
creased impulsivity and aggression), mood (e.g., depression), and 
cognition (e.g., difficulties with memory and executive function, 
as well as eventual dementia) [42]. In addition, symptoms asso- 
ciated with CTE may begin years following exposure to repetitive 
head impacts. At this point, it is not known why some individuals 
experience symptoms earlier and in closer proximity to the re- 
petitive head impacts. At present, knowledge regarding the course 
of neurodegenerative brain alterations that result from repetitive 
blows to the head is sparse. 


Repetitive subconcussive head impacts 


Subconcussive head impacts generally do not result in acute symp- 
toms but may lead to structural and functional brain changes 
similar to those caused by mTBI [43]. Using diffusion MRI, a 
study by Koerte and colleagues detected microstructural brain al- 
terations in active soccer players without a history of concussion 
or mTBI. However, professional soccer players experience a large 
number of repetitive head impacts while heading the ball [44] 
(Fig. 47.2). 

Another study reports impaired brain function using task-based 
functional MRI in contact sports athletes without a history of con- 
cussion. Further studies also demonstrate cognitive, functional, and 
biochemical changes in athletes participating in contact sports, des- 
pite the absence of clinical symptoms associated with mTBI (for a 
review, see [45]). These findings may be due to cumulative effects 
of repetitive head impacts or they may reflect the beginning stages 
of a neurodegenerative disease, including CTE [46]. It is also not 
known if these observed brain changes are reversible, although one 
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Fig. 47.2 Diffusion tensor imaging was used to compare professionally trained young adult soccer players with age- and gender-matched controls 
engaged in a non-contact sport (swimming). Increased radial diffusivity was observed in the white matter in soccer players, compared to swimmers, 


suggesting structural changes similar to those changes observed after mTBI. 


Reproduced from JAMA, 308(18), Koerte IK, Ertl-Wagner B, Reiser M, et al., White matter integrity in the brains of professional soccer players without a symptomatic 
concussion, pp. 1859-61, Copyright (2012), with permission from American Medical Association. 
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recent study of college football players suggests that following a 
rest period of 6 months, subtle microstructural changes reverted to 
baseline [47]. 





Diagnostic implications 


Neuroimaging 


In the acute setting, neuroimaging modalities such as CT and 
non-contrast MRI are often used to rule out complications in 
TBI, including fracture of the skull, haemorrhage in the intracra- 
nial compartments, and oedema. However, only 10% of CT scans 
and 30% of magnetic resonance scans reveal abnormalities in 
mTBI [28]. Moreover, CT and conventional MRI are not sensi- 
tive enough to detect subtle brain alterations, including traumatic 
axonal injury, which is commonly observed following mTBI [28]. 
Most importantly, information based on conventional CT and 
MRI is not associated with long-term prognosis following mTBI. 
There is thus a critical need to follow these individuals over time 
in future studies. 

Indeed, radiological evidence for mTBI is needed using highly 
sensitive and objective measures to ensure early diagnosis 
and provide accurate prognosis. Currently there are advanced 
neuroimaging modalities being developed to understand better 
brain abnormalities following TBI, and especially mTBI and re- 
petitive brain trauma (for a review, see [28, 45]). These imaging 
modalities include techniques to investigate brain structure 





Fig. 47.3 (see Colour Plate section) Diffusion tensor imaging 

measures the preferred direction of diffusion of water molecules. High 
directionality of diffusion can be observed in well-organized regions of 
the brain such as in the corpus callosum, as depicted here (see arrows). 
Diffusion tensor imaging provides information about the microstructure 
of brain tissue and thus may provide useful information in patients with 
TBI where diffuse axonal injury that involves the white matter is common. 
Reproduced from Koerte IK, Hufschmidt J, Muehlmann M, et al., Advanced 
Neuroimaging of Mild Traumatic Brain Injury. In: Laskowitz D, Grant G [Eds.], 


Translational Research in Traumatic Brain Injury, pp. 277-299, Copyright (2016), with 
permission from Taylor & Francis Group LLC. 











(structural MRI) and tissue microstructure (DTI) (Fig. 47.3), as 
well as blood flow (arterial spin-labelling, single-photon emission 
tomography) and to detect micro-haemorrhages (susceptibility- 
weighted imaging). There are also techniques to measure indir- 
ectly brain function (fMRI) and brain metabolism (PET, MRS) 
(for reviews, see [28, 45]). 


Neuropsychological evaluation 


A neuropsychological assessment may also contribute to under- 
standing cognitive functioning in TBI, including psychiatric and 
neurobehavioural sequelae. Such an evaluation typically includes 
the assessment of cognitive, emotional, and psychosocial domains. 
In the acute stage of recovery from TBI, comprehensive assessments 
are not typically carried out because in cases of moderate to severe 
TBI, patients often exhibit various symptoms, such as variable or ab- 
sent responsiveness, disorientation and/or delirium, and disrupted 
comprehension, each of which can interfere with accurate neuro- 
psychological assessment [48]. Instead, a brief evaluation at the bed- 
side may help to determine the patient’s level of consciousness, the 
magnitude of PTA and confusion, general cognitive functioning, 
language functioning, and the emotional state. Several such bedside 
evaluations may reveal changes in the patient’s status and also inform 
early clinical care. These assessments may also help to provide feed- 
back to the patient, family, and caregivers, as well to monitor treat- 
ment interventions (for example, medication) and decision-making 
capacity (for an in-depth review, see [49]). 

The cognitive sequelae of TBI are also variable and are influ- 
enced by pre-, peri-, and post-injury factors [50, 51]. Thus, when 
indicated, comprehensive neuropsychological assessments may 
play an important role in characterizing specific cognitive deficits 
and strengths, as well as aid in educating both patients and families, 
establishing treatment goals, developing rehabilitation plans, and 
informing prognosis [52]. In moderate to severe TBI, such assess- 
ments are recommended once patients have recovered from states of 
confusion and/or at 3- and 6-month intervals [53]. In mTBI, cogni- 
tive disruption, if any, is generally resolved within days of post-injury 
[54, 55], which has led some to recommend that comprehensive 
neuropsychological assessments are indicated only after this time 
period [56]. In sports-related concussion, on the other hand, base- 
line neuropsychological data are increasingly collected, albeit from 
briefer assessment batteries such as the Immediate Post-Concussion 
Assessment and Cognitive Testing (ImPACT) [57]. Here, repeat 
testing informs concussion management, including return-to-play 
decisions. Concussed athletes are usually removed from both play 
and practice until they are asymptomatic and neuropsychological 
testing demonstrates baseline performances [52]. 


Electroencephalography 


EEG is an important assessment tool for the clinical management of 
moderate and severe TBI in the acute phase of injury. More specif- 
ically, EEG is used to determine the depth of coma and to diagnose 
cerebral death [58]. Continuous EEG monitoring is recommended, 
particularly in the acute period following moderate to severe injury, 
in order to detect (subclinical) seizures which are common and carry 
the risk for secondary injuries to the brain [59]. Alterations in EEG 
in these patients also provide prognostic information regarding out- 
come. In subacute to chronic phases following moderate to severe 


TBI, EEG is important for the diagnosis of post-traumatic epilepsy. 
The severity of TBI is, in fact, a strong risk factor for post-traumatic 
epilepsy [60]. Although the risk is highest within 6 months after in- 
jury, it is important to note that post-traumatic epilepsy can mani- 
fest even years after trauma [60, 61]. Finally, in subacute and chronic 
phases following moderate to severe injury, EEG biofeedback is 
sometimes used in the rehabilitation process [62]. 

In contrast to moderate and severe TBI, the application of EEG 
in mTBI has not been established for clinical use. Recent research 
suggests, however, that EEG may provide information on brain ab- 
normalities associated with mTBI [63]. Alterations in EEG are more 
likely seen with a history of longer periods of loss of consciousness 
(LOC) or PTA and usually manifest as focal or generalized slowing, 
accompanied by a decrease in frequency of the posterior alpha [64]. 
In the subacute phase after mTBI, weeks to months after injury, a 
frequency increase of 1-2 Hz in the posterior alpha can be observed 
[64, 65]. This may reflect a return to baseline after the aforemen- 
tioned slowing in frequency. Notably, most EEG abnormalities re- 
solve within weeks to months post-injury [63]. In chronic phases of 
mTBI, 6 months or more following injury, EEG abnormalities can 
still be observed in a subgroup of patients. Of note, left temporal 
slowing may be associated with PCS [66]. It is, however, important 
to note that such EEG abnormalities may also occur in other psy- 
chiatric disorders such as PTSD, depression, and anxiety disorders 
and are therefore not specific to mTBI [67-69]. These abnormalities 
may also reflect comorbid illnesses with mTBI. Furthermore, and as 
noted previously, standardization for the use of EEG for mTBI in the 
clinic has not been established. 


Psychopharmacotherapy 


TBI at all levels of severity can initiate a cascade of neurobiological 
processes responsible for neurotransmitter dysregulation that con- 
tributes to cognitive impairment and neuropsychiatric sequelae (for 
a review, see [70]). Accordingly, pharmacotherapy has long been 
part of the treatment and rehabilitation of TBI-related cognitive and 
neuropsychiatric dysfunction (for example, see [71]). Moreover, 
by alleviating neurocognitive symptoms, pharmacotherapy can 
increase the benefits from other non-pharmacological treatments 
(for example, cognitive rehabilitation). However, to date, there is 
a lack of RCTs that could form the basis for evidence-based clin- 
ical practice [72, 73]. Thus, although pharmacological interventions 
are common after TBI, there exists considerable variability among 
the specific agents administered between and within cognitive and 
neuropsychiatric domains [74, 75]. 

Despite a shortage of guidelines, comprehensive reviews of the 
literature offer the following recommendations [70, 72, 74, 76]. 
Specifically, methylphenidate and, to a lesser extent, cholinesterase 
inhibitors have shown efficacy in treating deficits in attention and 
information processing speed. Similarly, cholinesterase inhibi- 
tors and neurostimulants have demonstrated limited recovery of 
memory and executive function. With respect to neurobehavioural 
and psychiatric symptoms, B-blockers can improve agitation and 
aggression, while limited evidence suggests that sertraline can alle- 
viate depression, particularly in the later phases of recovery. Beyond 
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these, clinicians may prescribe medications based on common 
practice for similar dysfunctions. However, this is based on limited 
evidence and mainly on subjective clinical impressions and expert 
opinion (see [75]). As a result, some authors have offered clinical 
practice directives for the delivery of pharmacotherapy—initial 
dosing should be lower than in non-brain-injured populations, and 
increased doses should occur at a more conservative pace (that is, 
start low, go slow). Also, reassessment of both direct and side effects 
should be frequent. Further, a partial response can be met by the 
introduction of a second agent with a separate mechanism of ac- 
tion, and if targeted symptoms worsen upon the introduction of the 
pharmacological agent, doses should be lowered, or ceased, if the 
symptoms intensify [76, 77]. 


Cognitive rehabilitation 


As noted previously, TBI-related disturbances in neurocognitive and 
psychiatric functioning can interfere with psychosocial functioning, 
independent living, and vocational and educational pursuits. 
Cognitive rehabilitation (CR) includes interventions aimed at as- 
sisting patients, as well as their families, to cope with cognitive and 
behavioural deficits following TBI. The evidence of CR has bur- 
geoned over the last 30 years, leading to many options for clinicians 
seeking to improve the cognitive and psychosocial functioning of 
TBI patients (for reviews, see [78-81]). 

Clinician-directed CR interventions may also improve cognitive 
functioning across many domains, including memory, attention, 
and executive function [82]. Deficits in memory are best addressed 
using external memory aids, such as journals, notebooks, and plan- 
ners, as well as electronic aids like smartphones and electronic cal- 
endars [83, 84]. Although generally thought of as distinct cognitive 
abilities, within the context of CR, attention and executive func- 
tion have been thought of as a unitary construct [78]. Accordingly, 
cognitive abilities within the purview of attention and executive 
functions include goal setting, planning, organization, initiation, 
and maintenance of activities, as well as executive control of atten- 
tion (that is, selective attention, alternating attention, and working 
memory) [78]. Attention and executive function interventions 
demonstrate efficacy when they include direct attention (that is, 
repetition, hierarchical strategy development) and metacogni- 
tive training [85, 86]. Thus, these components are recommended 
practice standards for the rehabilitation of attention and executive 
function [82]. 


Interventions for mild TBI 


Although, as previously mentioned, mTBI is associated with a con- 
stellation of cognitive, affective, and neurologic symptoms, for most 
individuals, injury-related impairments tend to resolve within days 
to weeks, with a smaller number resolving in 3 months. Historically, 
rest has been considered the gold standard for clinical management. 
However, there is little consensus on the definition and duration of 
rest following mTBI, and the benefits of rest are not based on empir- 
ical evidence. Prolonged rest may, in fact, even increase the risk for 
post-concussion-like symptoms [87]. Silverberg and Iverson, for ex- 
ample, noted the possible deleterious effects of too much bed rest in 
their guidelines for the clinical management of mTBI. Their guide- 
lines include: (1) bed rest for more than 3 days is not recommended; 
(2) pre-injury activities should resume gradually; (3) for the first 2 
weeks following post-acute injury, there should be a reduction in 
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physical and cognitive demands; and (4) for patients symptomatic 
after 1 month, supervised exercise should be considered. Marshall 
and colleagues [88] provide similar guidelines for athletes who 
have suffered a sports-related concussion. These include: (1) players 
should be medically evaluated on site by a licensed health provider 
who will determine the appropriate disposition of the player; (2) in 
the absence of a health provider, the player should be removed from 
practice or play and a referral to a physician should be arranged; 
(3) after first aid, an objective assessment of concussion should be 
administered; (4) the player should be accompanied and monitored 
over the first few hours post-injury; and (5) diagnosed or suspected 
concussions should prohibit return to play or practice [88]. It is im- 
portant to note, however, that while elite athletes may experience 
significant cognitive and psychological sequelae from concussions, 
they are more likely to minimize the short- and long-term conse- 
quences of head impacts. A clear example is an anonymous NFL 
nation survey conducted by ESPN. In this survey, 320 NFL players 
were asked if they would play in the Super Bowl 2014 with a concus- 
sion; 85% responded ‘yes’ [89]. 

Beyond rest in the acute stage (for example, 3 days post-injury), 
there is accumulating evidence to suggest that education and sup- 
port, provided shortly after mTBI, can be helpful, especially for 
somatic and psychological complaints (for reviews, see [27, 90- 
92]). Education should be both verbal and in print, and should 
include information about the symptoms common to mTBI and 
the time course of convalescence. Moreover, the impact of edu- 
cational interventions are optimal when delivered to patients 
and their families, friends, employers, insurers, and/or signifi- 
cant others [90]. Support may take the form of validation of the 
patient’s complaints, normalization of the symptoms, and reassur- 
ance about expected positive recovery, along with techniques to 
manage stress [88]. 

For patients with significant and prolonged complaints (that is, 
PCS), education and reassurance have been efficacious in short- 
ening the duration of post-concussive symptoms (for a review, see 
[27]). Marshall and colleagues [88] further recommended that, for 
patients with chronic post-concussive symptoms, primary care pro- 
viders must consider the multi-factorial nature of these complaints 
and management strategies should be developed with all contrib- 
uting factors in mind. Notably, there is little evidence that cognitive 
rehabilitation and cognitive behavioural therapy are effective inter- 
ventions for patients with persistent post-concussive symptoms 
[92, 93]. Importantly, highly sensitive neuroimaging techniques 
have recently become available that might shed light on the nature 
of post-concussive symptoms. 


Differential diagnoses 


It is important to note that there is an overlap among the symptoms 
of mTBI and other disorders, including depression, anxiety, and 
PTSD. This overlap in symptoms creates diagnostic and treatment 
challenges for the clinician. Thus, an important first step towards dif- 
ferential diagnosis is to understand both the intersection and delin- 
eation of symptoms following TBI and common comorbidities. As 
mentioned previously, common causes of TBI include motor vehicle 
accidents, assaults, and combat. In addition to the increased risk for 
extracranial injury, such traumatic events can initiate pathological 


anxiety and stress reactions, some of which may develop into PTSD, 
independent of TBI [94-96]. In addition, there is the issue of a dif- 
ferential diagnosis of depression and anxiety vs depression and anx- 
iety being a consequence of TBI [97]. Here, we highlight important 
features of anxiety and depression, to aid in the differential diagnosis 
of mTBI (readers are referred to Chapters 74 to 92 for in-depth re- 
views of these topics). 


Summary and future directions 


The diagnosis of moderate and severe TBI is typically clear, as are 
the acute treatment interventions, e.g., neurosurgical intervention. 
Nonetheless, the prognosis is less clear, although there are guide- 
lines regarding what to expect at different stages of progression or 
recovery. With mTBI, however, the picture becomes more com- 
plex, as even the diagnosis is not always clear. More studies are thus 
needed to understand better the neurobiological underpinnings of 
mTBI and to develop more sensitive methods to detect brain alter- 
ations associated with brain dysfunction. The previously mentioned 
advances in neuroimaging, including diffusion MRI and MRS, are 
among the most promising imaging modalities for providing sensi- 
tive measures of brain alterations following mTBI, which may be- 
come biomarkers for diagnosis and prognosis and for predicting 
treatment efficacy as new pharmacological treatments become 
available. Further, it is critical to characterize subject-specific injury 
profiles to move towards a more personalized medicine approach 
tailored to each patient. Moreover, it is important to identify risk 
factors for poor long-term outcome and to develop individual- 
ized interventions and potential preventative strategies. The de- 
velopment of reliable biomarkers for the diagnosis, prognosis, and 
monitoring of treatment efficacy is an important area for future 
research that will transform what we know about the underlying 
pathomechanisms, as well as what we know about the short- and 
long-term sequelae of TBI, and these developments will lead to 
more informed personalized medicine approaches to treatment by 
health care providers. 
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Substance use disorders and the underlying problem of drug ad- 
diction have been affected by two major developments: firstly, the 
revolution in understanding provided by neuroscientific research, 
aided by arguably the most successful animal model of psychiatric 
disorder; and secondly, a recent significant change in the diagnostic 
criteria for ‘substance use disorder’ provided by DSM-5 (Box 48.1). 
Relevant to the present chapter, under DSM-IV, substance depend- 
ence was preferred to the term ‘addiction, emphasizing the psycho- 
logical and physiological concomitants of withdrawal phenomena. 
For opioid dependence, caused, for example, by abstinence in hu- 
mans or by treatment with an opioid receptor antagonist, such as na- 
loxone, in animal experimental studies, these withdrawal symptoms 
are classically manifest as a group of severe autonomic responses 
such as palpitations, sweating, and cramps, and in rats, by ‘wet dog 
shakes’ and piloerection, as well as by psychological dysphoria in 
humans and elevations of the reward threshold in rats [1]. This con- 
stellation of aversive symptoms can theoretically be considered as 
constituting negative reinforcement, an event increasing the prob- 
ability of behaviour reducing its future occurrence (see Glossary of 
key terms, p. 488)—hence the drive to obtain more drug to avoid or 
escape from these withdrawal symptoms. This view of drug addic- 
tion thus accounts for the maintenance of drug-seeking and drug- 
taking behaviour, complementing the common sense view that the 
initiation of drug-taking is supported by positive subjective effects, 
before drug addiction sets in. These latter effects are thus described 
as positive reinforcement increasing the probability of behaviour 
producing them (see Glossary of key terms, p. 488). Such positive 
and negative effects are characteristic of all drugs of abuse, including 
stimulants such as cocaine and amphetamine, alcohol, nicotine, can- 
nabis, ketamine, and benzodiazepines. However, the precise pattern 
of withdrawal effects varies considerably across compounds; the 
physical signs of withdrawal following cocaine are much less signifi- 
cant than the accompanying psychological dysphoria, for example. 
At this point, it is important to distinguish objective measures of 
behavioural and autonomic responses, from which much may be 
deduced about the underlying subjective motivational states, and 





the subjective symptoms themselves, which are more difficult to 
measure (Box 48.2). 

These positive and negative effects of drugs can be linked by op- 
ponent motivational theories, such as that by Solomon [2], as ap- 
plied by Koob and Le Moal [3], to explain addiction (Fig. 48.1). 
These authors postulate that positive effects of drugs are counter- 
acted by negative effects as the drugs wear off, and moreover that, 
with repetition, the positive effects become progressively smaller, 
perhaps, in part, caused by tolerance (that is, reduced efficacy of re- 
peated drug treatments), whereas the negative effects become pro- 
gressively greater, so that negative reinforcing events predominate. 
According to Koob and Le Moal [4], this process ultimately pro- 
gresses to a state of ‘allostasis, by which the drug abuser is unable 
to attain normal bodily and subjective ‘homeostasis’ by moderate 
drug-taking—leading to a vicious circle of drug bingeing and de- 
pendence, which is further exacerbated by its stressful sequelae. 

This theory appears to be a compelling account of addiction and 
yet does not apparently explain why DSM-5 criteria of addiction 
have so radically moved away from substance dependence as its 
defining element. This is partly because it is evident from the study 
of other drug effects, such as those of caffeine, that a withdrawal 
syndrome per se is not necessary for the serious sequelae of drug 
addiction. Additionally, although withdrawal and tolerance are im- 
portant potential symptoms of substance use disorder, they con- 
stitute only two of about 11 symptoms by which addiction is now 
defined (Box 48.1). From Box 48.1, it can be noted that the other 
symptoms are mainly those associated with ‘top-down’ cognitive 
control, especially exemplified by the compulsive drug-seeking 
and drug-taking that occurs, for example, during relapse. These re- 
sponses apparently occur almost reflexively to cues that have be- 
come associated with the drugs, including even people and places, 
and are exceedingly difficult to bring under conscious restraint [5]. 
The other quality of compulsive responses is the tendency for such 
behaviour to be performed despite evident aversive consequences. 
This pattern of behaviour has thus been associated with a rather 
different theory that emphasizes positive reinforcement and a pro- 
gressive loss of top-down control from higher brain centres in ad- 
diction [6, 7]. 
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Box 48.1 The new criteria for substance use disorders in DSM-5 Box 48.2 Subjective feelings contributing to drug addiction 


e Taking the substance in larger amounts or for longer than you What is the underlying nature of positive and negative reinforcement? 
meant to Positive reinforcers are defined according to Thorndike’s Law of Effect 
e Wanting to cut down or stop using the substance but not managing to which posits that they are events that increase the probability of re- 


sponses that produced them. Early textbook theories of motivation, 
suggested that reinforcers have several functions including: (1) a reduc- 
ion in drive or need reduction in relation to homeostatic regulation; 
2) consolidation of learning or memory of associative or contingent 
relationships between environmental stimuli and responses or actions 


e Spending a lot of time getting, using, or recovering from use of the 
substance 

e Not managing to do what you should at work, home or school, be- 

cause of substance use 

Continuing to use, even when it causes problems in relationships 


























et C k à ; san leading, for example, to relatively automatic stimulus-response habits); 
e Giving up important social, occupational or recreational activities 3) incentive-motivational effects leading to appropriate preparatory 
because of substance use appetitive) responses, such as approach behavior, or physiological 
e Using substances again and again, even when it puts you in danger adjustments in expectation of outcome or goal (such as eating food). 
e Continuing to use, even when you know you have a physical or psy- egative reinforcers are more difficult to characterize as they are defined 
chological problem that could have been caused or made worse by as aversive events that increase the likelihood of their postponement or 
he substance omission. Clearly both positive and negative reinforcement may play a 
e Cravings and urges to use the substance (craving) role in drug abuse and addiction. But how do these aspects of reinforce- 
e Needing more of the substance to get the effect you want (tolerance) ment theory relate to the subjective effects of drugs that presumably play 
e Development of withdrawal symptoms, which can be relieved by some part in addiction? 
aking more of the substance (withdrawal). Incentive-motivational theories emphasize the hedonic prop- 
>6 of these symptoms for a specific substance such as heroin or co- aries eii ine Srey Peed y nan uae Eie oes ei ae 
uae st ; er: ; need state: for example, reinforcers such as intracranial electrical self- 
caine is sufficient for diagnosis as ‘severe substance use disorder’. Note 








stimulation of the brain, cocaine, sex, sweet foods or novel objects. Thus 
reinforcers may have yet another function, to generate a subjective ap- 
praisal of their effects in terms of pleasure or aversion. This conceptu- 
alization has encouraged the use of terms such as ‘reward’ and ‘liking’ 
that connote hedonic subjective responses associated with positive rein- 
forcers. Hedonic reactions are certainly important in early responses to 
drugs—for example, the subjective euphoria or ‘high’ initially associated 
with early experiences with the drug, which however may show a gradual 
reduction with repeated experience (‘tolerance’). Moreover, it can be ar- 
gued that behaviour governed by reinforcement is not always neces- 
sarily accompanied by conscious pleasure, as in the case, for example, 
theory of addiction of habits. Negative reinforcement may also be associated with subjective 
responses such as ‘relief’ (e.g. on avoiding an electric shock or escaping 
the pangs of withdrawal). Withdrawal itself is often accompanied by sub- 
jective craving for drug; however the precise causal, as distinct from cor- 
relative, role of such craving (also often called ‘wanting’) in motivating 


that the first 8 of these signs reflect failures of top-down cognitive 
control, or compulsive behaviour that persists despite detrimental 
consequences. 


Source: data from the Diagnostic and Statistical Manual of Mental Disorders, Fifth 
Edition, DSM-5, Copyright (2013), American Psychiatric Association. 














The positive reinforcement or reward-based 











The idea that the positive subjective effects of drugs play a greater 
part in addiction gained much from the discovery of an apparent 





reward system in the brain. This was initially supported by the ob- drug seeking behaviour remains unclear. 
servation that rats would perform responses such as lever-pressing, These issues are difficult to address because experimental animals, of 
leading to intracranial stimulation (‘self-stimulation’ behaviour), an course, do not have verbal responses to express any subjective feelings 


hey may experience, whereas we can infer much about reinforcement 
processes from their overt behaviour. A causal analysis of the neural basis 
of subjective responses in humans is also only in a relatively early stage. 
Presumably, this must involve an attributional system which connects 


obvious manifestation of positive reinforcement which was, how- 
ever, limited to certain brain regions [8]. Eventually, it was realized 
that the regions coincided to a considerable degree, though not ex- 





clusively, with the location of dopamine (DA)-containing neurons houghts to actions and also may involve interactions with language 
in such areas as the ventral tegmental area (VTA) of the midbrain based processing. 
which innervate structures, including the nucleus accumbens (part Indiscriminate substitution of the term ‘reward’ for ‘reinforcement 


herefore may therefore sometimes lead to mistaken assumptions about 
he neural nature of the human ‘reward system’. This system undoubtedly 
includes those regions such as the nucleus accumbens, and its dopa- 


of the ventral striatum), amygdala, and medial prefrontal cortex 
(Fig. 48.2). Focusing on stimulant drugs, such as d-amphetamine 








and cocaine, which were already known to be indirect agonists of minergic innervation, as determined from animal studies, but may well 
the catecholamine neurotransmitter systems DA and noradrenaline extend to other interactive cortical areas within a greater neural network. 
(NA), Wise and others showed from pharmacological studies using The orbitofrontal cortex is linked to sensory and value-based represen- 


tations of reinforcers and the relative utility of different courses of action 
producing them, as well visual, auditory or olfactory cues which predict 
them through conditioning processes. Feelings of pleasure presumably 
arise from the visceral feedback provided from the autonomic nervous 


DA and NA receptor antagonists that rodents appeared to regulate 
the amount of drug that could be self-administered intravenously 
via implanted jugular vein catheters to some sort of optimal level 








[9]. This regulation was selectively affected by DA receptor antag- system which are integrated into emotional reactions by such structures 
onists (such as haloperidol) [10]. Thus, a small dose of haloperidol as the insular cortex, and labelled as pleasant (or otherwise) according 
would actually increase the rates of self-administration, presum- to the social and cognitive context in which they are experienced. This 





ably as the animal strives to overcome the effects of DA receptor is in accordance with many studies of how environmental context may 
y affect attributions of pleasure or aversion to what may be physiologically 


blockade. This self-administration paradigm has subsequently be- similar responses to drugs. Consequently, although it is important to take 
come the gold standard for research studies of the reinforcing effects subjective responses into account, it is also a viable strategy to measure 
of drugs. Evidence of its validity comes from the fact that virtually objectively how animal and human subjects actually behave in relation 
all drugs of abuse in humans are self-administered by experimental to repeated drug exposure. 
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Fig. 48.1 Opponent motivational processing. A theory of affective dynamics which suggests that the positive emotional State A arising from a 
stimulating (standard square wave) rewarding event undergoes several transitions, both after a few presentations and after many. The peak emotional 
impact of the event is experienced soon after its presentation and then exhibits sensory adaptation, declining over time while the stimulation is still 


ON. The hypothetical opponen 


negative or aversive experience (State B) comes after the event has been completed (that is, is OFF). This might relate 


to the effects of any reward (or, inversely speaking, of any punisher); in the case of a drug of abuse, it most obviously relates to the euphoric ‘high’, 
experienced soon after the drug takes effect (for example, almost immediately after intravenous self-administration), and the ‘low’ after the drug 





has worn o 





. Note that State B is always less marked than State A during the first few stimulations, so that the initial emotional state is positive. After 
many rewarding ‘stimulations’, the positive effects of State A have hypothetically markedly diminished, compared with 


heir peak during the first few 


stimulations. In the case of drugs of abuse, this relates to the well-known tolerance (progressive diminution) of pharmacological effects with repeated 
experience. However, the aversive after-effects in State B have increased, and so the net impact is negative. For drugs of abuse, this stage hypothetically 
corresponds to the state of ‘withdrawal’ (see text) and motivates avoidance behaviour controlled by negative reinforcement. 

Reproduced from Psychol Rev., 81(2), Solomon RL, Corbit JD, An Opponent- Process Theory of Motivation: |. Temporal Dynamics of Affect, pp. 119-145, Copyright (1974) with 








permission from American Psychological Association. 


animals [11]. (A notable exception is that of LSD; however, there are 
good grounds for believing that the psychomimetic use of this drug 
by humans is not typical of drug abuse.) More recently, drug self- 
administration has been incorporated into behavioural models of 
addiction that have some measure of face validity [1, 12, 13]. 

Further significant observations supporting a central dopamin- 
ergic mediation of the effects of stimulant drugs included findings 
that the depletion of DA in the so-called mesolimbic projections 
from the VTA to the nucleus accumbens reduced self-administration 
rates for cocaine [14]. Perhaps more dramatically, it was also shown 
that d-amphetamine was self-administered directly into the nucleus 
accumbens by rats through an implanted cannula [15], but not sig- 
nificantly into other brain regions (Fig. 48.2). Moreover, these effects 
were demonstrated to be DA-dependent, in that concurrent treat- 
ment with DA receptor antagonists again produced an upregulation 
of self-administration [16]. 

The implication of mesolimbic DA in the positive reinforcing ef- 
fects of drugs was consistent with evidence that natural rewards, such 
as food and sex, also exerted their reinforcing effects via this pathway 
[11]. However, even more significant was that the positive reinfor- 
cing effects of other drugs of abuse whose primary mechanisms of 
actions were not dopaminergic could also potentially be indirectly 
mediated by this system because of their modulatory effects on other 
receptors present in the VTA or nucleus accumbens—for example, 
nicotine, opioids (Fig. 48.2), cannabinoids, and, via GABA and glu- 
tamate receptors, alcohol and benzodiazepines [11]. This hypoth- 
esis was supported by evidence obtained using in vivo microdialysis 
which showed that many of these drugs upregulated DA levels in the 
nucleus accumbens. The hypothesis that the DA system was a ‘final 
common pathway’ in mediating the positive reinforcing effects of 
drugs of abuse has been an attractive hypothesis, not least because 





it provided a potential basis for understanding polydrug abuse, by 
which human drug abusers will readily switch between different 
combinations of drugs when the supply of one of them is cut off. 

The implication of a positive reinforcement system focused on 
the VTA that mediated effects of opioidergic agents, such as mor- 
phine and heroin, was exploited by Bozarth and Wise to test the 
opponent motivational process account of opioid addiction de- 
scribed here. They showed that intracranial self-administration of 
morphine could be maintained into the VTA region, supporting the 
positive reinforcement hypothesis, but that the self-administering 
rats did not exhibit the symptoms of physical withdrawal when 
challenged with naloxone [17]. Therefore, the maintenance of self- 
administration could not be attributed simply to overcoming nega- 
tive reinforcement. By contrast, administration of morphine to the 
periaqueductal grey in the hindbrain did lead to physical symptoms 
on challenge with naloxone [17]. Therefore, it appears that the posi- 
tive reinforcing effects of morphine and heroin and their actions on 
autonomic centres can be dissociated; the fact that opioid receptors 
are involved in these different responses is coincidental and not in- 
extricably related. Nevertheless, some form of the opponent theory 
can be supported, as a subsequent study showed that the antagonism 
of opioid receptors within the nucleus accumbens could produce 
conditioned aversion to the place where heroin was experienced, 
suggesting that the drug was producing aversive after-effects in this 
mesolimbic reward system [18]. The additional assumption that 
the positive effects were mediated by the DA reinforcement system 
was also contradicted by the fact that the self-administration of 
heroin was incompletely blocked by DA depletion from the nucleus 
accumbens, suggesting that at least some of its positive reinforcing 
action was DA-independent [19]. This picture is probably true for 
other drugs of abuse such as alcohol and nicotine; DA mechanisms 
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Fig. 48.2 (see Colour Plate section) (a) Intracerebral drug self-administration. By pressing a lever in an operant chamber, rats can deliver microlitre 
and microgram quantities of drugs, such as amphetamine, directly into the brain. A major neural locus that supports such drug self-administration 
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pathway. (See [90].) 











permission from Elsevier Ltd. 


may indeed play a role and be a part of a ‘final common pathway’ 
leading to rewarding effects, but these are not the only source of the 
positive reinforcing actions of these compounds. 


The importance of conditioning and 


associative learning 





One of the properties of reinforcers is that they support new 
learning, and there are many indications that drug addiction may 
represent aberrant associative learning. An implication of this hy- 
pothesis is that cues and contexts (places) associated with drugs may 
gain salience via association with drugs through Pavlovian condi- 
tioning and thus exert important control over behaviour. Such cues 
may be the sight of needles or drug-taking paraphernalia, which 
have been reported to produce euphoric ‘highs’ in their own right 
in some addicted individuals. Drugs, such as amphetamine and 
cocaine, can enhance the salience of stimuli paired with other re- 
wards, such as food, water, and brain stimulation, and such stimuli 
can even act as reinforcers themselves after conditioning, when 
they are termed conditioned reinforcers [20]. The potentiation of 
rewarding properties of environmental stimuli has been shown to 
depend on an interaction between two major factors: the integrity 
of the basolateral amygdala (BLA), which enables the association 
between the conditioned stimulus (CS) (for example, a predictive 
noise or light) and the unconditioned stimulus (US) (for example, 
food, sex, or addictive drug) and the dopaminergic innervation of 
the nucleus accumbens (which is responsible for the behavioural ac- 
tivation produced by such stimuli in responding under the influence 


s the nucleus accumbens. (b) A schematic sagittal section of the rat brain showing the mesolimbic dopamine system, comprising cell bodies in the 

run in the medial forebrain bundle in the lateral hypothalamus, and terminals in the nucleus accumbens 

NAc), medial prefrontal cortex (PFC), and amygdala (AMG), among other forebrain structures. Nicotinic and opioid receptors are shown on both 
dopamine neuron cell bodies and terminals in the nucleus accumbens. Some major cortical afferents to the nucleus accumbens are also shown and 
include the amygdala, hippocampal formation, and prefrontal cortex. Opioidergic (beta~endorphin-containing) projections are shown originating in 
he hypothalamic arcuate nucleus (ARC) and innervating the midbrain periaqueductal grey matter (PAG). Small enkephalin-containing interneurons 
are represented in green. There is widespread agreement that all drugs of abuse, in addition to actions on their specific molecular targets, can increase 
activity in the mesolimbic dopamine system and dopamine release in the nucleus accumbens and is often therefore referred to as the common reward 





Reproduced from Trends Pharmacol Sci., 13(5), Koob G, Drugs of abuse: anatomy, pharmacology and function of reward pathways, pp. 177-84, Copyright (1992) with 


of the stimulant drug) [21, 22]. These factors have been shown to 
influence drug-seeking behaviour to a considerable extent. Thus, 
in a schedule in which rats work to obtain intravenous infusions of 
drugs by their instrumental lever-pressing, drug-seeking behaviour 
can be maintained for long periods, even before drug delivery by 
the presentation of brief stimuli (for example, lights or noises) as- 
sociated with the ultimate delivery of the drug [23-25]. The same 
phenomenon has been demonstrated in humans [26]. The use of the 
conditioned reinforcers to maintain behaviour in this way can be 
termed a second-order schedule of reinforcement, its special utility 
being that it is then feasible to measure the motivational effects of 
the drug in terms of the responding made during the drug-seeking 
period prior to drug self-infusion [23]. Subsequent to that first infu- 
sion, the motivational salience of the drug-paired cues presumably 
becomes even greater as a consequence of the actions of the drug 
itself. This schedule has been used in several ways: (1) to define the 
neural pathways responsible for drug-seeking behaviour; and (2) as 
a method for screening possible remediative drug therapies that re- 
duce drug-seeking and may therefore be employed potentially to 
combat craving and relapse in human drug abusers [27]. 





Neural basis of drug-seeking behaviour 


(See Fig. 48.3.) 

Based on the previous findings implicating the BLA in CS-US 
association, it was perhaps not surprising to find that excitotoxic 
lesions of this structure significantly impaired the acquisition 
of cocaine-seeking dependent on drug CS to mediate delays to 
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Fig. 48.3 (see Colour Plate section) Representation of key components of limbic cortico-striatal circuitry in which psychological and physiological 
processes important in drug addiction are indicated. These include: (1) the processing of conditioned reinforcement (and Pavlovian associations 
between environmental stimuli and drugs in general) by the basolateral amygdala and of contextual information by the hippocampus; (2) goal-directed 


actions involve interactions between prefrontal cortical areas (orbitofrontal, 
associations) depend on interactions between 
prefrontal cortex and includes representation of contingencies and outcom 


with drugs; (5) in functional imaging studies, drug craving involves activation of the orbital, anterior cingulate, and insular cortices and 


ventromedial) and 
he prefrontal cortex (sensorimotor) and dorsolateral striatum; 
es and their value and subjective states (craving and feelings) associated 


he dorsomedial striatum; (3) habits (stimulus-response 
4) ‘executive control’ depends on the 


emporal lobe 














structures, including the amygdala; (6) connections between dopaminergic neurons and the striatum, linking the ventral with the dorsal striatum 

via interactions organized in a striato-midbrain-striatal spiralling cascade of neuronal interconnections; (7) reinforcing effects of drugs may engage 
stimulant Pavlovian influences on behaviour such as Paviovian-instrumental transfer, or conditioned motivation, and conditioned reinforcement 
processes in the nucleus accumbens shell and core and then engage stimulus-response habits that depend on the dorsal striatum; (8) the extended 
amygdala is composed of several basal forebrain structures, including the bed nucleus of the stria terminalis, the centromedial amygdala, and, more 
controversially, the medial portion (or shell) of the nucleus accumbens. A major transmitter in the extended amygdala is the corticotropin-releasing 
factor, which projects to the brainstem where noradrenergic neurons provide a major projection reciprocally to the extended amygdala. Activation of 
this system is closely associated with the negative affective state that occurs during withdrawal. Green/blue arrows, glutamatergic projections; orange 
arrows, dopaminergic projections; pink arrows, GABAergic projections. Acb, nucleus accumbens; BLA, basolateral amygdala; VTA, ventral tegmental 
area; SNc, substantia nigra pars compacta. VGP, ventral globus pallidus; DGP, dorsal globus pallidus; BNST, bed nucleus of the stria terminalis; CeA, 
central nucleus of the amygdala; NA, noradrenaline; CRF, corticotropin-releasing factor; PIT, Pavlovian-instrumental transfer. (See [91].) 












































Reproduced from Koob GE, Everitt BJ, Robbins TW, Chapter 43: Reward, Motivation, and Addiction. In: Squire L, Berg D, Bloom F et al. [Eds]., Fundamental Neuroscience, Third 





Edition, pp. 987-1016, Copyright (2008), with permission from Elsevier Inc. 


self-administration [28] (Fig. 48.2). Also thematic was the finding 
that similar lesions of the so-called core subregion of the nucleus 
accumbens produced similar effects, reflecting, in part, the innerv- 
ation by the BLA of this structure [29], as later confirmed by dis- 
connection of these two structures [30]. Intriguingly, lesions of the 
other major subregion of the nucleus accumbens—the shell—also 
affected performance, but mainly to reduce the rate of responding 
produced by the response-contingent conditioned reinforcers [29]. 
On the basis of these findings, it could be concluded that the acquisi- 
tion of cocaine-seeking was controlled by an amygdala—accumbens 


pathway. Whether this is also true for other drugs of abuse is not 
quite as clear, as second-order schedule performance maintained by 
heroin was shown to be less susceptible to BLA lesions [31]. There are 
several other afferent pathways to the nucleus accumbens, including 
from the hippocampus, insula, anterior cingulate, and prefrontal 
cortex, so it is possible that other pathways also contribute to the 
effects of conditioned stimuli in various contexts, including stress. 
There is indeed evidence from a specific model of relapse, termed 
reinstatement following extinction of drug-seeking, that these other 
structures participate in different forms of relapse [32]. 
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The possible role of DA itself in drug-seeking has been deter- 
mined by using in vivo microdialysis probes directed to various 
striatal sites, including the nucleus accumbens core and shell subre- 
gions. Of particular interest are not only the periods following self- 
administered infusions of cocaine, but also the initial period when 
only conditioned reinforcers are presented (Fig. 48.4). In the shell 
region (as well as other striatal domains), there were large increases 
following cocaine infusions, but no significant increases during the 
initial period. Presumably, this is one of the initial sites of the uncon- 
ditioned reinforcing effects of stimulant drugs, consistent with the 
earlier evidence of effects of lesions described previously. In the core 
region, there was a similar picture, although the levels of DA were 
lower than in the shell, and some effects of the CS alone could be 











discerned, but only if such stimuli were presented in a novel context 
(Fig. 48.4). Of greatest significance, however, was the finding that 
in the dorsal striatum (that is, the caudate-putamen of the rat), the 
release of DA in the periods producing the CS was approximately as 
great as for the drug itself (Fig. 48.4). This was one of the first pieces 
of evidence to suggest that the dorsal striatum also has a role to play 
in cocaine-seeking under the control of conditioned reinforcers, es- 
pecially after a protracted period of training and increasing experi- 
ence of the self-administered drug. However, other observations are 
consistent with this view. For example, autoradiographic evidence of 
changes in DA receptors in the striatum in rhesus monkeys chronic- 
ally self-administrating cocaine showed a gradual, all-encompassing 
involvement of the dorsal striatum [33]. 
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Fig. 48.4 In this study, rats were trained to respond for intravenous cocaine under a so-called second-order schedule of reinforcement, in which 


at the end of 1-hour seeking sessions): 
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ess engaged by habitual drug-seeking behaviour. (See [92, 93].) 











euroscience. 


Source: data from J Neurosci., 20(19), Ito R, Dalley JW, Howes SR, et al., Dissociation in Conditioned Dopamine Release in the 
o Cocaine cues and during Cocaine-Seeking Behavior in rats, pp. 7489-95, Copyright (2000), Society for Neuroscience; J Neurosci., 22(14), Ito R, Dalley JW, Robbins TW, 
et al., Dopamine release in the dorsal striatum during cocaine-seeking behavior under the control of a drug-associated cue, pp. 6247-53, Copyright (2002), Society for 


heir drug-seeking behaviour depended on the response-contingent presentation of cocaine-associated conditioned stimuli (acting as conditioned 
reinforcers) during delays to reward. After a prolonged period of training, dopamine extracellular levels in three areas of the striatum were measured 
he nucleus accumbens core, the shell, and the dorsolateral striatum. Following the infusion of cocaine, 
dopamine release was increased in all areas, indicating the effects of the drug on these richly dopamine-innervated striatal areas. However, there were 
no increases in dopamine in the nucleus accumbens core and shell following the prolonged bout of drug-seeking responding, but instead marked 
increases in the dorsolateral striatum. This area is strongly implicated in stimulus-response or habit behaviou 


,and this dopaminergic correlate of 


dence in favour of the development of the habitual nature of drug-seeking after prolonged experience. Also of 
note is the increase in dopamine in the nucleus accumbens core when the cocaine-associated conditioned stimulus was presented surprisingly (that 
is, in a different context to previously), indicating that this component of the dopamine system is responsive to cues associated with drug reward but is 
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Neuropsychological theories of drug addiction 


(See Fig. 48.5.) 

The evidence of dorsal striatal involvement can be interpreted to 
suggest a devolution of control over behaviour, from the ventral to 
the dorsal striatum, with chronic drug exposure. In psychological 
terms, this might be construed as a shift in balance between two 
learning systems—a goal-directed system in which behaviour is con- 
trolled by its consequences (that is, rewarding drug effects), and a 
relatively unconscious stimulus—-response habit-based learning 
system, which is more automatic and depends on stimuli that simply 
elicit responses without reference to the goal. Therefore, the hy- 
pothesis has been advanced [6, 7, 34] that drug-seeking behaviour 
eventually becomes less goal-directed and more habitual with drug 
experience—in other words, that habitual drug-seeking is at the basis 
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Fig. 48.5 Compulsive drug-seeking in an animal model of addictive 
behaviour. In this study, rats were trained on a task in which responding 
on one lever in an operant chamber gives access to a second lever, 
responding on which delivers an intravenous infusion of cocaine. These 
levers are therefore designated ‘seeking lever’ (on which responding 

is never reinforced) and ‘taking lever’ (on which responding is always 
reinforced), and the rats perform ‘cycles’ of seeking and taking. Then, 
either after a short or long drug-seeking-taking history, a cycle of seeking 
responses was unpredictably punished, rather than giving access to the 
taking lever and hence intravenous cocaine. Thus, in this procedure, rats 
must run the ‘risk’ of punishment in order to gain the opportunity to take 
cocaine; the analogy with humans foraging for cocaine is obvious. After a 
brief history of taking cocaine, all rats suppressed their seeking behaviour 
when punishment was introduced (they abstained). However, after a long 
history, the figure shows that although the majority of rats abstained, a 
subpopulation of about 20% of rats continued to seek cocaine in the face 
of punishment, that is, were compulsive (therefore meeting key DSM-5 
criteria for ‘addiction’. Rats with a behavioural trait of high impulsivity 
expressed before any cocaine experience are much more likely to 
become compulsive, that is, they are vulnerable to develop compulsive 
cocaine-seeking. (See [94].) 
Source: data from Psychopharmacology, 194(1), Pelloux Y, Everitt BJ, Dickinson A, 
Compulsive drug seeking by rats under punishment: effects of drug taking history, 
pp. 127-37, Copyright (2007), Springer-Verlag; Psychopharmacology, 232(1), Pelloux 
Y, Murray JE, Everitt BJ, Differential vulnerability to the punishment of cocaine related 


behaviours: effects of locus of punishment, cocaine taking history and alternative 
reinforcer availability, pp. 125-34, Copyright (2015), Springer-Verlag. 












































of compulsive drug-seeking symptoms, according to DSM-5 criteria 
(Box 48.1). However, such a theory requires some understanding 
of how habitual behaviour can become compulsive, despite adverse 
consequences, and this may require additional factors. One of these 
is the concept of loss of top-down control, especially by so-called ex- 
ecutive mechanisms of the prefrontal cortex. There is considerable 
evidence that chronic drug-taking compromises prefrontal func- 
tioning in experimental animals and, as will be seen from evidence 
to be considered later, also in humans, and therefore, a mechanistic 
basis for habits to become compulsive [35-38]. A parallel, competing 
hypothesis is that compulsive behaviour arises from a gradual sensi- 
tization of the mesolimbic reward system by repeated drug exposure, 
leading to excessive motivation for drugs [39]. Although there is no 
doubt that effects of drugs can gradually augment over time and can 
be manifest in responses, such as enhanced locomotor activity, it is 
less clear how such augmentation can occur to enhance compulsive 
instrumental drug-seeking, leading to drug-taking. A further sug- 
gestion has been that stress can exacerbate the transition from goal- 
directed responding to habits [40] and may cross-sensitize with the 
effects of stimulant drugs. It should be noted that both drug-induced 
sensitization and stress have been shown directly to influence dorsal 
striatal mechanisms of habitual responding [40, 41]. 

How does one begin to distinguish among these various accounts 
of addiction to stimulant drugs? And to what extent does it apply to 
other drugs of abuse? These two issues, as well as the opponent mo- 
tivational theory, are at the forefront of current research attempts to 
understand the theoretical basis of drug addiction. A possible rec- 
onciliation of habit with the opponent motivational theory would 
suggest that negatively reinforced behaviour is especially subject to 
habitual control [7]. 

Learning theory has advanced several methods for diagnosing 
the contribution of habits and goal-directed actions to behav- 
ioural output, with both playing a part (Fig. 48.5). Habits tend to 
emerge with extended training, and the nature of the reinforcement 
schedule is also important [42]. Habitual behaviour persists if: (1) 
goals are devalued, for example by satiety or counterconditioning 
with toxins, in the case of food reward; or (2) the contingent (that 
is, predictive) relationship between actions and goals is degraded in 
any way. Some of these tests have been used to probe the habitual 
nature of chronic drug self-administration for cocaine, alcohol, 
and nicotine in experimental animals, with, in every case, habitual 
control predominating (see [7, 43] for reviews). For example, it has 
been shown that overtraining of cocaine-seeking under a second- 
order schedule of reinforcement makes performance more vulner- 
able to suppression by DA receptor blockade of the dorsal striatum, 
compared with infusions into the ventral striatum [44]. This pat- 
tern contrasts markedly with that observed following the acquisi- 
tion of drug-seeking behaviour, which, as we have seen, implicates 
especially the nucleus accumbens core (which mediates Pavlovian 
influences on instrumental behaviour [45]) and the dorsomedial 
striatum (which mediates action—outcome learning [46]). The nu- 
cleus accumbens core circuitry likely continues to play a part in 
the control of this behaviour, which can be assumed to be at least 
partly habitual in nature. A study that disconnected the nucleus 
accumbens core and the dorsolateral striatum by combining unilat- 
eral manipulations of each structure on opposite sides of the brain 
showed that a circuit functionally linking these two structures was 
implicated [47]. An obvious candidate for their interaction is the 
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cascading circuitry that links the ventral striatum to the dorsal stri- 
atum in a unidirectional manner, via backward and forward projec- 
tions to and from successive sectors of the striatum, beginning in 
the accumbens shell and terminating in the putamen, a possible site 
of stimulus-response habit-based learning [48]. This circuitry pro- 
vides the basis for the BLA, via its direct projections to the nucleus 
accumbens core, to influence the dorsolateral striatum, and hence 
stimulus-response habits—circuitry that has been demonstrated 
electrophysiologically [49]. 

Another way of measuring compulsive behaviour is to make it 
‘risky’ by unpredictably punishing drug-seeking behaviour on one 
lever which, on other occasions, gives access to a second taking 
lever, responding on which results in drug self-administration. 
After a short cocaine history, all rats suppressed their drug-seeking. 
However, after a long history of cocaine exposure, a proportion 
of rats continued to respond (the majority suppressed their drug- 
seeking), despite these adverse consequences, and this therefore 
fulfils an operational definition of compulsive drug-seeking [12] 
(Fig. 48.5). Intriguingly, the proportion of rats showing such com- 
pulsive cocaine-seeking (about 15-20%) is rather similar to the pro- 
portion of human drug abusers thought to make the transition to 
cocaine addiction. Drug-seeking under punishment is associated 
with reduced forebrain levels of serotonin [(37] and depends on 
the dorsolateral striatum [50], consistent with the view that com- 
pulsive drug-seeking reflects loss of control over habitual behaviour. 
Moreover, optogenetic stimulation of the medial prefrontal cortex 
(which is hypoactive in compulsive rats) restores sensitivity to pun- 
ishment and reduces compulsive cocaine-seeking [38]. 


Impulsivity vs compulsivity 


The issue of individual differences in the propensity for addiction has 
led to the search for biobehavioural markers of possible vulnerability. 
An early suggestion for stimulant drug susceptibility was that those 
rats exhibiting behavioural hyperactivity in a novel test environment 
would subsequently respond for lower doses of d-amphetamine, 
perhaps as a consequence of their risky ‘sensation-seeking’ which 
hypothetically promoted drug-seeking [51]. Another finding has 
been that rats consistently responding prematurely in an attentional 
task (in other words, responding prior to the presentation of visual 
targets needing detection for food rewards) also had an increased 
tendency to self-administer cocaine, when allowed binge access to 
the drug, although there were no obvious differences in acquisition 
of this behaviour [52].They also exhibited compulsive drug-seeking 
in a procedure that measures addiction-like behavioural criteria 
(compulsivity, increased motivation, and persistent seeking in the 
signalled absence of the drug) [53]. Moreover, when behavioural 
hyperactivity was used to stratify the same population, there was no 
obvious effect in parallel with what had been shown earlier to affect 
the acquisition of drug-seeking; indeed ‘high responder’ rats were 
actually resistant to developing addiction-like behavioural criteria 
[53, 54]. Thus, it is possible that different phenotypes contribute to 
different aspects of drug addiction, for example to the tendencies 
to sample drug effects and for drug-seeking to become compulsive 
[7, 55]. Further analyses have been made of other predisposing fac- 
tors that contribute to drug abuse in experimental animals, and other 
influences have been suggested to include the temporal discounting 


of reward (that is, choosing small, immediate rewards in preference 
to larger, delayed ones), risky decision-making, individual differ- 
ences in Pavlovian conditioning, novelty seeking, and anxiety [55]. 
Different predisposing influences may exist for different drugs or for 
different components of abusing the same drug. For example, the 
propensity for stimulant self-administration in male rhesus mon- 
keys has been shown to depend on factors such as social domin- 
ance (submissive monkeys being more susceptible) [56]. Moreover, 
although high impulsivity has been shown to predict nicotine [57], 
as well as cocaine susceptibility, it does not predict vulnerability to 
self-administer heroin [58]. This is an area in which it is clearly im- 
portant to explore possible parallels with humans, and one feature 
of research into addiction using experimental animals has been how 
well the principles so gained from appropriate behavioural models 
sometimes appear to apply also to human drug addiction. 


Translation of experimental research on addiction 


in animals to humans 





Striking parallels of research in experimental animals and humans 
addicted to drugs exist in both the behavioural and neural domains. 
Although much human research has revolved around relapse and 
its subjective correlates, which include the ‘craving’ or urge to take 
drugs, this has not handicapped application of the basic research 
findings to any great degree, as long as operational principles have 
been applied. An important example of how behavioural analysis is 
at least as important in humans as in experimental animals has been 
the observation of consistent drug preferences in drug-experienced 
humans for low doses of heroin that appear to occur without 
detecting any subjective drug effect [59]. It would appear that pref- 
erence for certain drug effects may be occurring below the threshold 
for conscious detection of those effects, strongly implicating implicit 
factors at work, perhaps also involving brain regions not directly 
involved in conscious experience itself. Another intriguing con- 
nection is between the pharmacological studies described earlier, 
suggesting that drug-taking may begin as a form of self-medication 
by which rodents strive to attain an optimal level of functioning of 
the DA systems, and studies of the status of the central striatal DA 
function in healthy volunteers in relation to drug preference. ‘Thus, 
volunteers without a history of drug abuse, but exhibiting low stri- 
atal D2/3 receptor binding, tended to find the effects of the psycho- 
motor stimulant methylphenidate pleasurable, whereas those with 
high D2/3 striatal binding found them aversive [60, 61]. 

Research using experimental animals that has highlighted the role 
of the mesolimbic DA system and limbic-striatal mechanisms has 
stimulated parallel investigations in the human drug abuse litera- 
ture. One of the main examples has been the work on DA receptors 
in human drug addicts by Volkow and colleagues. The authors were 
able to find, using a radioligand for DA D2/3 receptors with positron 
emission tomography (PET), that stimulant and heroin abusers, as 
well as alcoholics, all exhibited reductions in D2/3 DA receptor avail- 
ability in the dorsal striatum [62], a striking endorsement of earlier 
animal work supporting a common role for DA in mediating the ef- 
fects of a number of drugs of abuse. Important as these observations 
are, they entail a number of interpretative difficulties. These include 
whether the prime change is in D2 or D3 receptors (the radioligand 
in question being non-selective), whether it is pre- or post-synaptic 


CHAPTER 48 Substance use disorders and the mechanisms of drug addiction 


(while acknowledging the much higher concentration of post- 
synaptic D2 receptors in the human striatum), and consequently the 
precise status of DA function, given the opposite effects that inhibi- 
tory presynaptic receptors and post-synaptic receptors have on DA 
function. These issues can all be addressed, and a current consensus 
is that the reduced D2/3 receptor function may be correlated with 
reduced DA activity (at least in chronic drug abusers) [63]. An add- 
itional finding of interest is that the reductions in D2/3 DA recep- 
tors are correlated with hypometabolism in the orbitofrontal cortex 
[64], a prefrontal region of great importance for reward function 
and decision-making cognition in humans. Therefore, it is evident 
that the changes in striatal DA status are associated with impaired 
cortical function—and may represent one of the deficits leading to 
impaired top-down control of behaviour, in this case impaired goal- 
directed behaviour, and suggestive of an alteration in the balance 
between goal-directed and habitual behaviour, favouring the latter, 
in drug addiction [7, 43]. 

However, a yet more difficult consideration to resolve is whether 
the changes in D2/3 DA receptors are effects of excessive drug ex- 
posure or actually predisposing causes of drug abuse, a question 
evidently difficult to address in humans without the aid of a pro- 
spective longitudinal study. A behavioural issue posing an analogous 
cause-and-effect quandary is whether impulsive decision-making 
and compulsive drug use arise from possible ‘neurotoxic effects of 
excessive drug-taking or again whether it is a predisposing factor. 
The presence of such predisposing factors in experimental animals 
described here suggests the latter [53]. Equally, studies of D2/3 DA 
receptors using microPET in high-impulsive rats showed that they 
exhibit reduced D2/3 receptors in the ventral striatum, suggesting 
that this change may, in part, reflect a pre-existing state in the rats, 
perhaps caused by some combination of genetic or environmental 


factors [52]. It is possible, of course, that both predisposing and 
drug-induced factors combine to drive down D2/3 receptor 
availability. 

The study of human drug abusers using other state-of-the-art 
neuroimaging methods, including structural magnetic resonance 
imaging (MRI), functional MRI, and diffusion tensor imaging, com- 
bined with sensitive behavioural indicators, is beginning to unravel 
the causes and effects of drug abuse. For example, it has been shown 
that measures of response inhibitory control (in which subjects have 
to cancel or restrain an already initiated response) are impaired 
in individuals with DSM-IV-defined stimulant drug dependence. 
However, of even greater significance was the fact that their siblings 
who did not abuse drugs were similarly impaired, as compared to 
age- and IQ-matched controls and to recreational drug abusers not 
attaining DSM-IV criteria of drug dependence [65]. This strongly 
implies an intermediate phenotype or endophenotype of impulsivity 
predicting vulnerability to stimulant drug addiction, in parallel with 
the animal model described previously. The hypothesis of an impul- 
sivity endophenotype has been further supported using question- 
naires to measure impulsivity and sensation-seeking; whereas the 
former scores were elevated in both the stimulant-dependent indi- 
viduals and their siblings, elevated sensation-seeking was confined 
to the stimulant-dependent and recreational groups only [66]. 

What factors then protect the siblings from becoming drug- 
addicted and thus confer resilience? A functional MRI study may 
provide some insight, as this shows reduced activity in a region of the 
prefrontal cortex implicated in inhibitory response control (in the 
region of the right inferior frontal cortex) in stimulant-dependent 
individuals, contrasting with enhanced activity in the same region 
in their siblings (Fig. 48.6) [67]. Hypothetically, the siblings may 
be attempting to exert enhanced restraint over predispositions to 


Endophenotypes for stimulant abuse: neuroimaging evidence 
for resilience and compensation? 
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Fig. 48.6 (see Colour Plate section) Evidence of hypothetical resilience factors operating in siblings of stimulant-dependent individuals (bottom row 
scans) in terms of BOLD response overactivation of the right inferior frontal gyrus during functional magnetic resonance imaging study of the stop- 
signal reaction time task, as compared with healthy control volunteers (middle row scans) and stimulant-dependent individuals (top row)—the latter 
showing significant reductions, compared with controls. Significant brain activation maps were associated with stopping in each group (P <0.05, family- 
wise error). Axial brain slices demonstrate main activation clusters per group at family-wise error, P <0.05. The z-co-ordinate slices are: -4, 8, 24, and 48. 


The right side of each slice corresponds to the right side of the brain. 


Reproduced from Neuropsychopharmacology, 38(10), Morein-Zamir S, Simon Jones P, Bullmore ET, et al., Prefrontal hypoactivity associated with impaired inhibition in 
stimulant-dependent individuals but evidence for hyperactivation in their unaffected siblings, pp. 1945-53, Copyright (2013) with permission from Springer Nature. 
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Fig. 48.7 (see Colour Plate section) Structural brain abnormalities associated with stimulant exposure and familial risk. Blue voxels indicate a decrease, 
and red voxels indicate an increase, in grey matter volume, compared with control volunteers. Both recreational and dependent stimulant users 
showed significant increases in the parahippocampal gyrus, compared with healthy control volunteers, but differed with regard to abnormalities in the 
orbitofrontal cortex. Recreational users did not show any of the changes in brain regions associated with familial risk such as increased volume of the 
amygdala and putamen and decreased volume in the posterior insula. Siblings of stimulant-dependent stimulant abusers showed larger grey matter 
volumes in the basal ganglia, particularly the putamen. Sections and horizontal numbers below each section of the image refer to its plane position 


(mm) relative to the origin in MNI stereotactic space. L, left; R, right. 


Reproduced from Biol Psychiatry, 74(2), Ersche KD, Jones PS, Williams GB, et al., Distinctive Personality Traits and Neural Correlates Associated with Stimulant Drug Use Versus 
Familial Risk of Stimulant Dependence, pp. 137-44, Copyright (2013), with permission from Elsevier Ltd. Reproduced under the Creative Commons Attribution License 


(CC BY). 


impulsive responding (although this may not always be successful). 
These attempts occur in the context of both the stimulant-dependent 
group as well as the siblings exhibiting reduced white matter innerv- 
ation of the inferior frontal cortex via the arcuate fasciculus that 
correlates significantly with their performance on the inhibitory 
control task. There is, in fact, a burgeoning literature of failures in 
inhibitory control in drug abusers associated with cortical impair- 
ment (reviewed in [68]). Such factors could contribute to compul- 
sive behaviour, in tandem with the hypothesized deficits in learning 
and conditioned drug responses observed in experimental animals. 

There has been, by now, extensive confirmation that Pavlovian 
conditioned cues elicit disruptive craving responses and concomi- 
tant neural changes in activity in drug abusers in brain regions, con- 
sistent with what has been found in experimental animals. Thus, for 
example, Grant and Childress were among the first to demonstrate 
excessive activation of limbic regions, including the amygdala, in re- 
sponse to drug-related cues in drug abusers, compared with healthy 
volunteers [69, 70]. Consistent with stimulus—response habit-based 
dorsal striatal mediation of such effects, Volkow and colleagues 
found in functional MRI studies that drug cues associated with co- 
caine activated the dorsal, rather than the ventral, striatum [71]. 
Another study revealed similar findings for alcoholics, although, 
revealingly, recreational drinkers showed activation of the ventral 
striatum [72], clearly consistent in this cross-sectional study with 
a transition to dorsal striatal mechanisms in the drug-dependent 
individual. 

Structural MRI has also shown that the amygdala and striatum 
tend to belarger in both stimulant-dependent individuals and their 
non-drug-abusing siblings, whereas the stimulant-dependent 











individuals (though not recreational users) have reduced cortical 
grey matter in regions such as the orbitofrontal and temporal 
cortices [65] (Fig. 48.7). The extent of reductions in grey matter 
in the ventromedial prefrontal cortex has been shown to be re- 
lated to the duration of stimulant drug use, suggesting that the 
drug has exerted a cumulative toxic effect, as well as to measures 
of compulsive drug use obtained using the Obsessive Compulsive 
Drug Use Scale (OCDUS). Indeed, a significant correlation be- 
tween grey matter volume in the inferior frontal and rectal gyri 
and measures of compulsive cocaine-taking (OCDUS) has been 
demonstrated in a group of chronic cocaine users [73] (Fig. 
48.8). These findings suggest that stimulant drug abuse does im- 
pair cortical functioning, although it is not known to what ex- 
tent these changes are permanent and irreversible, for example, 
following abstinence. Other evidence can be cited in support of 
the shift in balance between systems of goal-directed control vs 
habit learning in stimulant drug abusers [74]. Comparable ana- 
lyses exist of the sequelae of chronic alcoholism, as well as other 
drugs of abuse [72]. 

Behavioural studies of human stimulant drug addiction using the 
devaluation method and other computational paradigms are com- 
patible with the hypothesis of habit-based learning in stimulant- 
dependent individuals. Thus, for example, stimulant abusers were 
slower to learn goal-directed responses that earned points leading 
to monetary reward but exhibited more robust appetitive habits 
than controls. They were also impaired in learning to avoid elec- 
tric shocks, although their aversive habitual responding was normal 
[74]. These results suggest that there might be specific behavioural 
approaches to treatment. 
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Fig. 48.8 Significant association between grey matter volume and 
measures of compulsivity in a group of chronic cocaine users. Regions 
that correlated significantly with compulsive cocaine-taking (as assessed 
by OCDUS) are coloured in green. The scatter plot below the brain image 
shows the correlation between this measure and grey matter volume for 
each drug user in those regions. The probability threshold for significance 
was P ~0.002 for each analysis. The statistical results are overlaid on the 
FSL MNI152 standard T1 image, and the numbers above each section 

of the image refer to its plane position (mm) relative to the origin in MNI 
stereotactic space. L, left; R, right. 
Reproduced from Brain, 134(Pt 7), Ersche KD, Barnes A, Jones PS, et al., Abnormal 
structure of frontostriatal brain systems is associated with aspects of impulsivity 


and compulsivity in cocaine dependence, pp. 2013-24, Copyright (2011), with 
permission from Oxford University Press. 














Treatment of drug addiction via 





translational studies 


Despite the evident progress in understanding the behavioural and 
neural mechanisms underlying drug addiction, we are still some 
way from finding effective treatments for this heterogenous and dif- 
ficult patient group. This may arise, in part, from the habitual na- 
ture of addiction and its relapsing nature, which are antithetical to 
mainline cognitive behavioural therapy techniques, although mo- 
tivational enhancement programmes may prove useful. Substitution 


pharmacotherapy with the long-acting opioid methadone has long 
been a staple procedure for heroin abuse, although this can have sev- 
eral deleterious side effects, and there is certainly scope for prom- 
ising alternative opioid treatments such as buprenorphine. For 
nicotine addiction, there is a variety of substitution therapies based 
on the nicotine patch, gum, and lozenges, and now nicotine vaping 
devices, as well the partial nicotinic receptor agonist varenicline 
[75]. Nalmephene, an opioid receptor antagonist, has recently been 
introduced for the treatment of alcoholism through its ability to re- 
duce volumes of alcohol drunk in a binge [76]. No such effective 
substitution treatments exist for stimulant drugs, although the atyp- 
ical stimulant modafinil may offer some promise. 

These are important and effective harm reduction strategies that 
might, though not necessarily, open the door to abstinence and the 
prevention of relapse, which are major goals in the treatment of ad- 
diction. However, it should be possible to develop such treatments 
and there are different targets in this regard, for example to reduce 
the impact of stress and anxiety associated with relapse to alcohol use 
or to medicate dysphoric and anhedonic states that can persist long 
into withdrawal from stimulants, nicotine, and opiates [77], or treat- 
ments that reduce the impact of drug-associated stimuli on craving 
and relapse [78]. Many treatment leads have been identified in a 
wide range of animal studies in the context of drug discovery pro- 
grammes, well reviewed in the recent surveys by Koob and Mason 
[75] and Everitt [27]. Some have been licensed by the FDA, such as 
the opiate receptor antagonist naltrexone and acamprosate (a partial 
agonist at glutamate NMDA receptors/glutamate metabotropic re- 
ceptor antagonist), for alcohol abuse. Experimental agents based on 
combating the effects of stress (corticotrophin-releasing factor re- 
ceptor antagonist or dynorphin-kappa opioid receptor antagonists) 
or impulsivity (atomoxetine) or related anti-relapse actions by puta- 
tive actions on top-down circuitry (gabapentin, N-acetylcysteine) 
have so far been shown to have limited effects in human trials but re- 
main under experimental investigation. Despite a range of pharma- 
cological agents having the ability to reduce drug cue-elicited 
cocaine-seeking and relapse in animal models, including a D3 dopa- 
mine receptor antagonist, a u-opioid receptor antagonist, and drugs 
interfering with glutamate transmission, none has yet successfully 
made it to the clinic [27]. 

The evident overlap of neural circuitry controlling responses to 
emotional cues and memories with those implicated in addiction 
has led to the revisiting of extinction [79], as well as the applica- 
tion of memory reconsolidation methodologies [80] (Fig. 48.9), by 
which the associative links between drugs and associated cues that 
help to maintain drug-seeking behaviour can be disrupted in ex- 
perimental animals by a combination of behavioural procedure and 
pharmacological intervention. While it has been shown that drug 
cue extinction can be achieved in a clinical setting, with consequent 
reductions in cue-elicited craving, it is clear that this provides, at 
best, a mildly effective relapse prevention treatment because of the 
context dependence of extinction (therapy in the clinic does not ex- 
tend to real-life environments), such that spontaneous recovery, re- 
newal, and reinstatement of the extinguished response are common 
[81, 82]. However, the recent demonstration of ‘super-extinction’ 
[83], achieved by combining a brief drug cue memory retrieval with 
a conventional CS extinction protocol followed after a brief delay 
(30 minutes), has re-awoken interest in extinction therapies, since, 
in this case, a more complete and long-term extinction of the CS is 
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Fig. 48.9 Illustration of the concept of memory reconsolidation. 

The consolidated drug memory, established by repeated Pavlovian 
association with an environmental stimulus (CS) and self-administered 
drug effect, is stored in a stable state. Brief presentations of the drug CS 
called ‘reactivation’ can result in destabilization of the memory in the 
brain (in the case of a CS-drug memory, in the basolateral amygdala). 
The memory can persist in the brain if it is restabilized through de novo 
protein synthesis. The expression of the protein ZIF268 is a requirement 

of cued drug memory reconsolidation in the basolateral amygdala and is 
regulated by activation of NMDA receptors. Memory reconsolidation can 
be prevented by inhibiting protein synthesis in the amygdala or knocking 
down ZIF268 by infusing zif268 antisense oligonucleotides or by blocking 
MDA or B-adrenoceptors. Systemic NMDA or B-adrenoceptor 
blockade also prevents drug memory reconsolidation. The result is drug 
memory ‘erasure’, with the consequence that the drug-associated CS can 
no longer support drug-seeking, and this thereby reduces the propensity 
o relapse [27]. 
Reproduced from Eur J Neurosci., 40(1), Everitt BJ, Neural and psychological 
mechanisms underlying compulsive drug seeking habits and drug memories-- 
indications for novel treatments of addiction, pp. 2163-82, Copyright (2014), with 


permission from John Wiley and Sons. Reproduced under the Creative Commons 
Attribution License (CC BY). 























achieved. Although most often shown in conditioned fear studies, 
an impressive demonstration of the super-extinction of a CS—drug 
memory in animals self-administering cocaine or heroin, as well as, 
most remarkably, in a heroin-addicted inpatient population, with 
evidence of a decrement in craving and physiological responses to 
heroin cues 6 months later [84], suggests further detailed investiga- 
tion is warranted. The neural basis of these effects remains unknown. 

Targeting memory reconsolidation (Fig. 48.9) also holds trans- 
lational promise [80, 85]. It is now widely accepted that under 
specific circumstances, memory retrieval (or, more appropri- 
ately, ‘reactivation’) that is too brief to engage extinction learning 
causes the memory trace to become labile in the brain, from which 
it must undergo protein synthesis-dependent restablization if it 
is to persist [86]. Reconsolidation can be prevented by treatment 
with an amnestic agent, such as an NMDA receptor antagonist or 
a -adrenergic receptor antagonist, given just once in conjunction 
with the brief memory reactivation [27, 87]. This results in amnesia 
and apparent erasure of the memory trace, or a great diminution 
of the strength of the memory, such that presenting the CS subse- 
quently no longer elicits conditioned fear (in the case of aversive 
conditioning) or instrumental drug-seeking (in the case of animals 
seeking cocaine or alcohol) [27, 87, 88]. While the effects to reduce 
conditioned fear in animals has been successfully translated to the 
clinic in the successful treatment of phobias [89], to date, attempts 
to do so in the treatment of addiction have met with only limited, or 
sometimes no, success. It seems that a major obstacle to deploying 
reconsolidation strategies in the treatment of addiction is defining 


precisely the conditions under which retrieval results in the destabil- 
ization of the memory. If the behavioural parameters can be defined 
precisely—a challenge when the conditioning history is so variable 
and of such long duration—then this approach to relapse prevention 
holds great promise. 


GLOSSARY OF KEY TERMS 


Positive reinforcer. An event which increases the probability of a re- 
sponse upon which it is contingent, for example, intravenous drug 
infusions maintaining lever pressing, alcohol ingestion maintaining 
licking or drinking. 

Negative reinforcer. An event, the omission or termination of which 
increases the probability of the response upon which it is contin- 
gent, for example, withdrawal symptoms precipitated by sched- 
uled administration of naloxone in morphine-dependent animals 
avoided by lever pressing which postpones the naloxone. 

Incentive. A stimulus that elicits an approach behaviour (positive 
incentive) or a withdrawal behaviour (negative incentive). A con- 
ditioned incentive acquires such properties via Pavlovian condi- 
tioning. Incentives and conditioned incentives may also function as 
reinforcers and conditioned reinforcers, respectively, depending on 
environmental contingencies. 

Pavlovian (or classical) conditioning. The process by which a con- 
ditioned stimulus (CS) elicits conditioned responses (CRs) that 
are normally elicited by an unconditioned stimulus (US) after a 
number of pairings. Such CRs are normally considered to be invol- 
untary reflexes. The pairings require the onset of the CS to precede 
that of the US (temporal contiguity) and for there to be a positive 
temporal correlation (that is, predictive contingency) between the 
two events, for example, tolerance to a drug effect conditioned to a 
particular environmental CS. 

Conditioned reinforcer. A stimulus which acquires its reinforcing 
properties (positive or negative) by pairings with other, generally 
primary, reinforcers such as food, drugs, sex, or electric shock. 

A stimulus can function as a conditioned reinforcer or as a discrim- 
inative stimulus in the same situation. 

Contingency. A consistent temporal relationship between two 
(or more) events that reduces the uncertainty of the subsequent 
event, for example, between particular stimuli and particular 
responses. 

Action-outcome learning. When instrumental actions are goal- 
directed, the actions (for example, lever-pressing) are made with 
the intention of obtaining the goal. The actions are sensitive to the 
devaluation of the goal; for example, an animal that has learnt to 
lever-press for food will respond much less or not at all for that food 
if it is devalued either by making it ill after ingesting the food or by 
pre-feeding to satiety with the same food. This is called reinforcer 
devaluation. It is very easy to devalue ingestive reinforcers, but not 
intravenously self-administered drugs such as cocaine. 

Stimulus-response or ‘habit’ learning. In habit learning, instru- 
mental performance is acquired through the association of re- 
sponses with stimuli present during training. It therefore reflects 
the formation of stimulus-response associations, and reinforcers 
primarily serve the function of strengthening the stimulus- 
response association, but they do not become encoded as a goal. 
Therefore, devaluing the reinforcer does not affect instrumental 
responding. 

Drug-taking. A term used to describe drug self-administration 
when the drug is readily available, for example, following each in- 
strumental response on a lever or the simple drinking of alcohol 
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(so-called continuous reinforcement). The subject does not need to 
forage or to work for the drug nor mediate delays in acquiring, that 
is, does not actively need to ‘seek’ the drug. 


Drug-seeking. This is instrumental or foraging behaviour performed 


to give access to the opportunity to take a drug, as measured under 
second-order schedules of reinforcement, in seeking-taking, and 
extinction-reinstatement tasks in animal models. 


Compulsive drug-seeking. This is instrumental behaviour that per- 


sists in the face of adverse consequences such as punishment or the 
risk of punishment. 
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The currently accepted claim that ‘addiction is a brain disease with 
a genetic component’ would sound familiar to a nineteenth-century 
alienist. While Esquirol had already mentioned the importance 
of heredity in the physiopathology of mental disorders, it was in 
1860 that Morel gave heredity a central place within nosography by 
coining the term ‘degeneration theory. Morel separated psychiatric 
disorders into two categories, depending on whether they are her- 
editary or not. Magnan and other French psychiatrists retained this 
nomenclature. If the scientific rationale is sometimes surprising, or 
even fanciful, some questions remain relevant in the twenty-first 
century. Thus, Bouchereau discusses the impact of placental transfer 
of alcohol as a possible confounding factor in the heritability of al- 
cohol use disorder (AUD), and Morel evokes the possible inherit- 
ance of a diathesis, that is, a predisposition to the disease close to 
the modern concept of endophenotype. Finally, the ‘law of double 
fertilization of the same author attributes an identical importance to 
the ‘moral elements and organic conditions’ in the development of a 
mental disorder such as AUD, which could be compared to the gene 
x environment interactions at the forefront of our current thinking. 
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It has been over 50 years since Kaij et al’s first twin study on AUD, 
which suggested substance use disorders to be, at least in part, due 
to genetic variance. Subsequent twin and family studies have con- 
tinued to support a genetic component to substance use disorders 
(SUD). Having a first-degree relative with SUD confers a greater risk 
than any individual environmental factor. 

Heritability is defined as the proportion of the phenotypic vari- 
ance attributed to additive genetic factors. Several methods have 
been used to calculate heritability. The data from twin studies are 
most often presented as pair and proband concordance in monozy- 
gotic (MZ) and dizygotic (DZ) twins. Pair concordance represents 
the percentage of all twin pairs within the sample who are con- 
cordant for disease. 

Twin studies remain key to estimates of heritability in SUD, and 
their basis can be simply stated. Falconer’s equation of heritability 





states that h? = 2(rMZ - rDZ), with h? = heritability, rMZ = intraclass 
correlation of MZ twins, and rDZ = intraclass correlation of 
DZ twins. 

This equation assumes that: 


1. The presence or absence of the disease is determined by genetic 
and normally distributed environmental factors. 

2. All genetic influences are additive. 

3. MZ twins share 100% of segregating genes. 

4. DZ twins share, on average, 50% of segregating genes. 

5. MZ and DZ twin pairs have a similar shared environment. 

6. Genetic and environmental variations act independently of 
each other. 


These assumptions are likely to be an oversimplification of the 
real genetic and environmental actions, and the estimations of herit- 
ability using this model are generally considered to be too narrowed, 
given that they include neither gene x gene nor gene x environment 
interactions. 

Twin studies also assume random mating. However, a predispos- 
ition for SUD may be more frequently observed than expected in the 
parents of twins, and therefore, the genetic correlation between DZ 
twins is increased as a function of such assortive mating, biasing the 
estimation of heritability. 

The estimations of heritability for alcohol, nicotine, cannabis, and 
illicit SUD across selected samples of twins are reported in Table 
49.1 [1-3]. 

The recent twin literature suggests that this heritability of the com- 
pleted stage of addiction (that is, the genetic influence on the diag- 
nosis of SUD) lumps together genetic factors involved at different 
stages of an addiction. Firstly, some genetic factors increase the risk 
that individuals are exposed to a substance and experiment with it. 
Secondly, other genetic factors are involved in the reinforcing effect 
of the substance. Thirdly, genetic variants might also modify the risk 
for developing an SUD after regular use. Fourthly, individual vari- 
ation in the metabolism or pharmacological action of the substance 
may modulate the risk for addiction. Heritability will be a complex 
composite of the genetic factors operating early and late in the ex- 
posure to the substance of interest. 

Heritability of addictions is specific to the population and varies 
by gender, is influenced by the cultural environment, and will also 
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Table 49.1 Heritability of alcohol use disorder, nicotine dependence, cannabis use disorder, and illicit substance use disorder 


among twin cohorts 


samples Alcohol use disorder Nicotine dependence Cannabis use disorder Illicit substance use disorder 


VATSPUD 52% 52% 
Vietnam era twins 0-65% 60% 
National Swedish register 22-5/% 48-62% 
Meta-analysis of twin studies 43-53% [1] 37% [2] 


be modified by exposure to the substance. Ifa trait or behaviour has 
no variability, even if it is strongly supported by genetic factors, its 
heritability cannot be estimated. For example, in the western adult 
population, it is not meaningful to examine the heritability of life- 
time alcohol use, since most adults are likely to report at least one 
drink in their lifetime. Similarly, the heritability of nicotine depend- 
ence has ‘increased’ in women from zero in those born at the begin- 
ning of the twentieth century to levels comparable to those seen in 
men for those born after 1940, because of the greater opportunity for 
tobacco use in women in the second part of the twentieth century. 
The expression of the involved susceptibility genes could not be esti- 
mated when women had very limited access to tobacco [4]. 


Endophenotypes 


SUD constitute a broad category of heterogenous phenotypes, 
with patients exhibiting a large range of biomarkers and symptom 
severity, which has further complicated efforts to identify genetic 
variants. Employing more specific definitions of phenotypes may 
be more important than using larger sample sizes for detecting 
true genetic associations. The use of endophenotypes or objective 
measures of specific neurobiological functions may be particularly 
helpful in reducing clinical heterogeneity. 

Neurobehavioural phenotypes associated with SUD include im- 
pulsivity and novelty seeking, stress reactivity, behavioural disin- 
hibition, trait anxiety, and attention allocation, among many others. 
The expression of these traits varies through the lifespan and during 
the addiction’s stages. For example, impulsivity is higher during ado- 
lescence, a period associated with enhanced experimentation with 
psychoactive substances. Moreover, drug use increases impulsivity, 
which may, in turn, promote continuous use of the substance. Delay 
discounting (DD), a decline in the subjective value of reward with 
increasing delay until its receipt, is an established model to deter- 
mine one’s location on the continuum from impulsive decision- 
making to self-control, and constitutes a promising endophenotype 
in the study of SUD [5]. Patients with SUD exhibit a greater DD (a 
tendency to choose lower, but quicker, rewards rather than higher, 
but delayed, ones) than controls. A positive relationship between 
DD and subsequent initiation or increased use of tobacco has been 
described. DD is also improved in substance users in response to 
therapeutic interventions. Given that the heritability of DD varies 
between 46% and 56% [6], it provides an avenue of future research 
as a candidate behavioural marker of SUD. 

Subjective responses to initial substance use have also been con- 
sidered as a plausible endophenotype. Individuals vary widely in 
their subjective experience of a psychoactive substance, and this 


60-80% 
33% 34% 
70% 

51-59% [3] 


may play a role in the development of an SUD. Such subjective re- 
sponse represents a heritable endophenotype. Among the most im- 
portant findings, the literature consistently shows that individuals 
who demonstrated low response to alcohol in an alcohol challenge 
test are more likely to develop an AUD at 8-year follow-up [7]. Low 
sensitivity might be unique to this drug, because it leads to higher 
consumption. Other studies have confirmed that positive experi- 
ences in early use of other substances are important risk factors for 
later SUD [8, 9]. 


Candidate genes 


The candidate gene approach was the first strategy to investigate the 
genetic factors involved in SUD. This approach means searching for 
mutations, or screening for polymorphic markers, in one or more 
genes selected for their potential role in SUD. Such strategy requires 
recruiting patients with and without SUD, as: (1) a case-control 
study (to see if the studied allele is more frequent in the affected 
population, compared to the healthy control group); or (2) a trio 
approach (including affected patients and their fathers and mothers, 
in order to check if the vulnerability allele is more transmitted from 
the heterogenous parent(s) to the affected proband); or (3) siblings 
(where affected sibs should have the vulnerability allele more fre- 
quently in common than expected by chances only); or (4) multi- 
plex families (calculating if the vulnerability allele is more frequently 
transmitted to affected cases, and not transmitted to healthy rela- 
tives). Several national and international cohorts have been imple- 
mented, following at least one of these strategies. Some examples are 
the Collaborative Studies on Genetics of Alcoholism (COGA) from 
the National Institute on Alcohol Abuse and Alcoholism (NIAAA), 
the Family Study of Cocaine Dependence (FSCD), and the 
Collaborative Genetic Study of Nicotine Dependence (COGEND). 
Current research suggests that a major gene effect on SUD vulner- 
ability is unlikely [9], but some plausible positive findings have been 
obtained. The candidate genes involved in the reward pathway, es- 
pecially dopamine receptors, have been widely investigated in SUD. 
Several studies reported that the TaqlA variant (corresponding to 
rs1800497) of the gene encoding the dopamine D2 receptor (the 
DRD2 gene) is associated with alcohol dependence. Interestingly, 
this genetic variant changes an amino acid in a novel gene ANKK1, 
which encodes an X-kinase that regulates the presence of the dopa- 
mine D2 receptor at the cellular membrane [10]. Some genetic vari- 
ants of the enzymes involved in the metabolism of ethanol, namely 
the alcohol dehydrogenase (ADH) and acetaldehyde dehydrogenase 
(ALDH) genes, have also been associated with a lower risk for 
developing AUD, mainly in South Eastern Asian populations [11]. 
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Genome-wide association studies 





The first genome-wide association studies (GWAS) were published 
in 2006. SNPs are used as markers of a genomic region, with the ma- 
jority of them having minimal or no impact on the biological system. 
A minority can have functional consequences, by causing amino 
acid changes, altering mRNA transcript stability, or improving tran- 
scription factor binding affinity. SNPs are the most frequent form of 
genetic variation in the genome. The paradigm underlying GWAS 
is that complex disorders, including SUD, are polygenic, driven 
by multiple common genetic polymorphisms, and follow a non- 
Mendelian pattern of inheritance (Fig. 49.1). 

In the GWAS approach, it is assumed that common disorders are 
likely to be influenced by genetic variation that is also common in the 
general population. The discovery of several susceptibility variants 
for common diseases with high minor allele frequency (including 
the APOE gene for Alzheimer’s disease [12, 13] or the PPARg gene 
for type 2 diabetes) led to the development of what is now called the 
‘common disease/common variant hypothesis’ [14]. 

Contrasting with the candidate gene approach, GWAS do not re- 
quire any assumption on the genomic location of the causal vari- 
ants and use the strengths of association studies, even in the absence 
of convincing literature on the location of involved genes. The first 
GWAS strategy was published in 2014 [15]. 

The first GWAS of AUD was initially published in 2006 [16] in 
a sample of 1024 cases and 996 controls, using more than 500,000 
SNPs (Fig. 49.2). Complete analysis reported that no associations 
were genome-wide significant [17]. The largest independent GWAS 
published to date involved more than 16,000 participants, providing 
information on about a million autosomal SNPs [18]. This study 
added to the evidence for the association of variants in the ADH 
gene coding for an alcohol-metabolizing enzyme and provided in- 
sight on the role of new loci mapping to the ADH gene cluster [18]. 


<—____}_> 


As for other similar analyses in psychiatry, a recent review of 12 pub- 
lished AUD GWAS [19] concluded that larger samples will be re- 
quired to detect loci, in addition to those encoding the genes already 
mentioned. 

A first GWAS on heroin published in 2019 and performed on 325 
methadone stabilized, former severe heroin addicts and 250 con- 
trols showed an association with the the cytosolic dual-specificity 
phosphatase 27 encoded by the DUSP27 gene [20]. Three GWAS 
reported genome-wide significant SNP association with opioid 
use disorder. These results included SNPs that mapped to KCNG2, 
which encodes a potassium voltage-gated ion channel [21], and 
CNIH3, which encodes a glutamate receptor-associated regulatory 
protein [21]. A recent genome-wide study on cannabis dependence 
has been performed on three large independent cohorts including 
15,000 subjects and detected interesting associations in novel genes, 
including a novel antisense transcript RP11-206M11.7, the solute 
carrier family 35 member G1, the SLC35G1 gene, and the CUB and 
Sushi multiple domains 1 gene CSMD1 [21]. The first GWAS on co- 
caine use disorder detected a significant role of the FAM53B gene, 
the product of which is involved in the regulation of cell prolifer- 
ation, but with an unclear biological role in the development of co- 
caine dependence [21]. 

The most compelling evidence has come from GWAS and GWAS 
meta-analyses on nicotine use disorder, showing an association be- 
tween the nicotinic acetylcholine receptor gene cluster CHRNA5- 
A3-B4 on chromosome 15 (15q25) and both nicotine dependence 
and smoking cumulative quantity [22]. This is highly relevant, given 
that while there are thousands of compounds in cigarette smoke, 
nicotine is the principal component responsible for nicotine de- 
pendence and exerts its action in the brain through neuronal nico- 
tinic acetylcholine receptors, which are widely distributed in both 
the central and the peripheral nervous systems [22]. Two associ- 
ations between nicotine dependence and genetic variants was pub- 
lished in 2007, one in a candidate gene study [23] and one GWAS 
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Fig. 49.1 Schematic representation of the hypotheses of genes and variants involved in disorder inheritance according to their Mendelian and non- 
Mendelian status. The paradigm is that Mendelian-inherited disorders should involve genetic variants with a high penetrance and a low frequency 

in one gene, while non-Mendelian-inherited disorders should involve variants with a low penetrance but a high frequency in different genes. Thus, 
Mendelian disorders could be studied with family-based cohorts, while non-Mendelian disorders could be studied with population-based cohorts. 
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Fig. 49.2 Historical GWAS discoveries on substance use disorders. 


on nicotine [24]. In this preliminary study, examining 3713 SNPs in 
more than 300 candidate genes and involving 879 light smokers and 
1050 heavy smokers, multiple SNPs within the CHRNA5-A3-B4 
were associated with nicotine dependence [23]. In the GWAS study, 
1050 dependent smokers and 879 non-dependent smokers as con- 
trols allowed to associated CHRNB3 gene [24]. In 2008, two GWAS, 
considering smoking quantity, but also smoking-related disease 
(lung cancer and peripheral arterial disease), confirmed the associ- 
ation of variants within this locus and smoking-related phenotypes 
[25]. Further studies demonstrated the impact ofa non-synonymous 
SNP in the a5-nicotinic acetylcholine receptor gene (CHRNA5) on 
behaviours associated with the risk of developing tobacco depend- 
ence, including a decrease in the aversive effect of nicotine [26]. This 
genetic assessment for nicotine abstinence could be helpful to use as 
a biomarker of pharmacogenetics in smoking cessation [27]. 


Pharmacogenetics of substance use disorders 


Recent interventions emphasize the importance of early and inten- 
sive treatment for SUD. These studies also demonstrate additional 
factors that influence outcome such as psychiatric comorbidity 
(including post-traumatic stress disorder or schizophrenia, among 
many others) or medical conditions associated with substance use. 
This has called for treatment elements, such as the choice of psycho- 
therapy and medication, and the goal (abstinence vs controlled use, 
among many others) to be individualized, so that optimal patient 
outcomes can be met. It is also tempting to hope that recent genetic 
findings can help to develop an individualized approach in the clin- 
ical management of patients with SUD. 

Pharmacogenetic studies may provide the first examples for in- 
novation. In AUD, there appears to be important moderating effects 
of variation in OPRM1 on the response to the opioid antagonist 
naltrexone [28]. The Asp40 allele of Asn40Asp, encoded by the 
A118G SNP, predicts a significantly lower rate of relapse in four 
RCTs, while the others were not significant. A meta-analysis con- 
firmed that naltrexone-treated patients carrying the ASP40 allele of 
the A118G SNP had lower rates of relapse [29]. Human laboratory 
studies also suggested that the A118G SNP moderates the effects of 
alcohol, including a differential reduction of the alcohol euphoric ef- 
fect or craving. Other dopaminergic, serotonergic, GABAergic, and 


glutamatergic genes have been studied, but no other specific candi- 
date polymorphism has to date yielded replicable findings [28]. 

A number of studies have examined the association of genetic 
variants on features of opioid use disorder and treatment outcome. 
Methadone and buprenorphine are two common and effective 
maintenance medications for opioid use disorder. The DRD2 gene, 
coding for the D2 dopaminergic receptor and located on chromo- 
some 11 (region 11q23), displays a polymorphism known as TaqlA 
and has been the subject of over 300 studies [30, 31]. Meta-analyses 
confirmed a modest, but significant, higher prevalence of the A1 al- 
lele in patients with opiate use disorder [30]. Taq1A has also been 
associated with greater heroin craving. In predicting the outcome 
of an opioid maintenance treatment, the results have been less con- 
clusive so far. Indeed, in four studies including 404 participants with 
a maintenance medication, three failed to find an association of 
this variant with a significant difference in outcome. Similarly, sev- 
eral variants within the OPRM1 gene (including the A118G SNP) 
and the OPRD1 gene have provided inconclusive results so far in 
pharmacogenetic trials. 


Conclusions and perspectives 


National and international consortia, recruiting large samples of 
controls and affected patients, have been successfully used in the 
genetics of SUD. GWAS have demonstrated the role of variants of 
nicotinic receptor genes in tobacco dependence and confirmed 
the involvement of genes encoding enzymes in charge of the me- 
tabolism of ethanol in AUD. Furthermore, genes involved in the 
dopamine pathway have been found associated with several SUDs, 
although how they increase the vulnerability to SUDs is not entirely 
deciphered. 

The development of new tools to investigate the genome, such 
as next-generation sequencing (NGS), now make it possible to 
sequence all exons of the 25,000 genes of the human genome. This 
technique will identify genetic variations within exons, including 
rare de novo mutations (detected in patients while absent in their 
parents), which was almost impossible with previous approaches. 
Analysis of the exome, or genome-wide exome sequencing (GWES), 
has indeed been a diagnostic tool in common pathologies, but not 
yet in SUDs. However, GWES is only sequencing about 1.5% of the 
entire human genome, and other variants, for example those located 
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in the regulatory regions of genes, may impact the risk of the dis- 
order. Furthermore, genetic studies can now be combined with other 
analyses to investigate the epigenetics of SUD. Epigenetics relate to 
the modification of DNA chemistry (but not the DNA sequence), 
which leads to modifications of gene expression in response to dif- 
ferent environmental factors. Early life trauma has been found, for 
example to increase the level of methylation in specific parts (called 
CpG islands) of the GR1 gene, in patients who committed suicide 
and were abused during childhood [32]. 

Epigenetics could be the missing link between high heritability 
and the important role of purely environmental factors, that is, ex- 
plaining, in part, the ‘missing heritability, a concept focusing on the 
gap between high heritability and the weak role of any individual 
associated gene. 

Future directions should therefore combine the study of devel- 
opmental and environmental factors with genetic and epigenetic 
factors. Since the substratum of methylation sites (epigenetic regu- 
lation) is purely genetic, the impact of different trauma may vary 
according to individual genetic polymorphisms. This approach also 
places clinical evidence at the forefront when studying the genetic 
vulnerability of SUD. 
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Alcohol is the most frequently used addictive substance in the 
world, with large regional variations in the amount of alcohol that 
is consumed and related variations in damage due to excessive use 
and alcohol use disorders. About half of the global adult popula- 
tion (48%) has never consumed alcohol, whereas 14% was drinking 
in the past but has ceased alcohol consumption and ‘only’ 38% is 
currently drinking [1]. There are, however, large regional differ- 
ences, with 80-90% of lifetime abstainers in Islamic countries and 
only 20% of lifetime abstainers and 60-70% of current drinkers in 
North America and Europe. Worldwide, mean alcohol consump- 
tion among people aged 15 years and older is about 6.2 L of pure 
alcohol per person per year, which translates into about 13.5 g of 
pure alcohol per person per day, which is clearly within the WHO 
range for low-risk alcohol drinking (1-20 g/day for women and 
1-40 g/day for men). However, there are large inter-personal and 
regional differences in the level of alcohol consumption. People in 
the Eastern Mediterranean region (for example, Jordan: mean 0.7 
L) and the South East Asian region (for example, Pakistan: mean 
0.1 L) generally drink much less, whereas people in North America 
(mean 9.5 L) and Europe (mean 10.9 L) drink much more than the 
worldwide average, with the highest drinking levels in Central and 
Eastern European countries (for example, Russian Federation: mean 
16.1 L).[1] 

There is no officially defined safe drinking level, and the amount 
and type of damage due to alcohol are mainly dependent on the 
volume of alcohol consumed, the drinking pattern, and the quality 
of the alcohol that is used. For example, alcohol consumption has 
been identified as a component cause for more than 200 diseases, 
injuries, and other health conditions, and for most of these diseases 
and injuries, there is a clear dose-response relationship [2]. Also the 
pattern of drinking affects the risks [3]; for example, drinking while 
eating is associated with less harm from chronic diseases than the 
same pattern of drinking at other times [4], whereas heavy episodic 
or binge drinking is linked to unintentional injuries (accidents), in- 
tentional injuries (suicides) [5], and ischaemic heart disease and 
ischaemic stroke [6]. Finally, the quality of alcoholic beverages 
may impact on health and mortality; for example, home-made or 
illegally produced alcoholic beverages can be contaminated with 
methanol or other toxic substances. However, there is no evidence 





that consumption of illegally produced alcohol is markedly linked 
at a population level to morbidity or mortality over and above the 
effects of the level of ethanol use [7]. 

Worldwide, there are 3.3 million deaths every year resulting from 
the harmful use of alcohol representing 5.9% of all deaths, including 
33% of cardiovascular diseases and diabetes, 17% of unintentional 
injuries, 16% of gastrointestinal diseases, 13% of cancers, and 21% 
of other diseases. These deaths occur early in life, and in the age 
group of 20-39 years, approximately 25% of total deaths are alcohol- 
attributable. Therefore, it comes as no surprise that 5.1% of the 
global burden of disease and injury in terms of disability-adjusted 
life years (DALYs) is attributable to alcohol [1]. Finally, the harmful 
use of alcohol brings significant social and economic losses to indi- 
viduals and society at large. 

It is important to realize that about two-thirds of all alcohol- 
related deaths are caused by 4-7% of drinkers, that is, drinkers with 
an alcohol use disorder and often with chronic, excessive alcohol 
consumption [8]. Like the level of alcohol consumption in the popu- 
lation, the 12-month prevalence of alcohol use disorders (harmful 
use/abuse and dependence) shows considerable regional variation, 
with low rates in the Eastern Mediterranean region (0.3%) and 
the South East Asian region (2.2%) and much higher rates in the 
Americas (6.0%) and Europe (7.5%) [1]. In this chapter, we focus on 
these people with an alcohol use disorder. 














The concept of addiction, later called dependence and now substance 
use disorder, is long, complex, and full of controversies. Alcohol 
use and alcohol use problems are already known from ancient bib- 
lical, Egyptian, and Babylonian sources. In the seventeenth and 
early eighteenth centuries, regular alcohol use was very common in 
Europe since alcohol represented a cheap and useful combination of 
clean water and calories. With the change from a mainly agricultural 
society to an industrial society in the middle of the eighteenth cen- 
tury, regular intoxication became problematic and ‘habitual drunk- 
ards’ were seen as morally weak and in need of re-education in a 
correctional centre that generally took the shape of a prison (moral 
model). This view changed in the beginning of the nineteenth century 


with the start of the temperance movement. Habitual drunkards 
were no longer seen as morally weak, but as victims of the highly 
addictive substance alcohol, and regulation, or even prohibition, of 
the production, trade, and consumption of alcohol was seen as the 
best solution for the problem of excessive alcohol use (pharmaco- 
logical model). The term ‘alcoholism’ was first used in 1849 by the 
Swedish physician Magnus Huss to describe the detrimental effects 
of alcohol. In the early twentieth century, psychoanalytic theorists 
proposed that alcoholism was not a separate problem, but just a 
symptom of a character neurosis/personality disorder (symptom- 
atic model). Successful treatment of the underlying personality dis- 
order (in a therapeutic community) would also resolve the alcohol 
problem. Between 1946 and 1960, E. Morton Jellinek performed a 
series of studies among AA members, and based on the results of 
these studies, he coined the term ‘disease of alcoholism’ with five dif- 
ferent subtypes (biological or disease model). It was only in the 1970s 
that psychologists and sociologists developed theoretical and treat- 
ment models for problematic alcohol use, with psychologists pro- 
posing that alcoholism was a form of maladapted learnt behaviour 
that could be redressed during psychological interventions (psycho- 
logical or learning model) and sociologists stating that alcoholism is a 
normal response to abnormal circumstances and that normalization 
of these circumstances would solve the problem (social model). In 
1975, a WHO steering group discussed the latest developments, and 
based on this discussion, Edwards and Gross proposed a new model 
integrating biological, psychological, and social aspects [9]—the 
alcohol dependence syndrome with the following symptoms: nar- 
rowing of drinking repertoire; salience of drink-seeking behaviour; 
increased tolerance; repeated withdrawal; repeated relief/avoidance 
of withdrawal by drinking; subjective awareness of compulsion to 
drink (craving); and reinstatement of the syndrome after abstinence 
(biopsychosocial model). In 1997, Alan Leshner, then the director of 
the National Institute of Drug Abuse (NIDA), made a final contri- 
bution to the history of the concept of addiction when he wrote a 
paper with the challenging title ‘Addiction is a brain disease, and it 
matters (brain disease model) [10]. The brain disease model states 
that addiction is a chronic, relapsing disease that results from bio- 
logical vulnerabilities and the prolonged effects of drugs on the 
brain. However, as with many other brain diseases, addiction has 
embedded behavioural and social context aspects that are important 
parts of the disorder itself. Therefore, effective treatments should in- 
clude biological, behavioural, and social context components. 

The history of the classification of alcohol use problems runs 
partly parallel with the history of the concept of alcoholism. In the 
first two editions of the Diagnostic and Statistical Manual of Mental 
Disorders of the American Psychiatric Association (DSM-I, 1952; 
DSM-II, 1968), alcoholism was classified as a subcategory of the 
sociopathic personality disorder, and thus reminiscent of the moral 
or symptomatic model [11, 12]. In DSM-III (1980), the term ‘alco- 
holism’ was dropped in favour of two distinct, but loosely defined, 
categories labelled ‘alcohol abuse’ and ‘alcohol dependence’ and 
these new disorders were placed in a separate category ‘Substance 
use disorders, rather than as subsets of personality disorders [13]. 
DSM was revised again in 1987 (DSM-III-R) [14]. This time, the 
alcohol dependence syndrome [9] was used as a starting point and 
explicit diagnostic criteria were provided. As a consequence, the 
category of dependence (23 out of nine criteria) was expanded to 
include some criteria that, in DSM-III, were considered symptoms 
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of abuse, and the category ‘Abuse’ (21 out of 2 criteria) became a 
residual category for diagnosing those who never met criteria for 
dependence but who drank despite alcohol-related physical, social, 
psychological, or occupational problems or who repeatedly drank in 
dangerous situations such as in conjunction with driving. In 1994, 
the fourth edition of DSM (DSM-IV) was published, with some 
minor changes in the criteria sets for dependence (23 out of seven 
criteria) and abuse (22 out of four criteria) [15]. The latest revision 
of DSM (DSM 5) was introduced in 2013 and contained important 
changes with regard to the classification of alcohol-related problems 
[16]. In DSM-5, the dependence criterion ‘legal problems’ was elim- 
inated and replaced by ‘craving’ as a new criterion. In addition, al- 
cohol abuse and alcohol dependence were integrated into a single 
disorder called ‘alcohol use disorder’ (AUD) with 11 criteria (ten 
old criteria and one new criterion—craving) and three levels of dis- 
order severity, based on the number of criteria present in the last 
12 months: mild AUD (2-3 out of 11 criteria), moderate AUD (4-5 
out of 11 criteria), and severe AUD (26 out of 11 criteria). Table 50.1 
presents a summary of the criteria for the DSM-5 classification of an 
AUD [16] (also discussed in relation to mechanisms of addiction in 
Chapter 48). 

Similar to DSM, there were also substantial and largely parallel 
changes in the definition of alcohol use problems in the International 
Classification of Diseases (ICD) of the WHO. Early definitions of 
alcoholism in ICD-7 (1958) and ICD-8 (1968) stressed the socio- 
logical, rather than the physical, aspects of dependence [17, 18]. 
However, in ICD-9 (1978), the term alcoholism was dropped in 
favour of alcohol dependence syndrome [19]. Similar to DSM-IV 
(1994), ICD-10 (1992) defined two mutually exclusive alcohol diag- 
noses: harmful use and dependence [20]. However, in contrast to 
DSM-IV where abuse was defined by alcohol-induced social impair- 
ments, inter-personal problems, legal problems, and/or hazardous 
use, in ICD-10, harmful use was determined by alcohol-induced 
psychological or physical harm. ICD-10 dependence (23 out of 
six criteria) was very similar to DSM-IV dependence but ICD-10 
already included craving as one of the criteria. In the draft version 
of the latest version of ICD (ICD-11), the two mutually exclusive 
categories of harmful use and dependence are retained, but harmful 
use will now include the criterion ‘family harmed by substance use, 
whereas the six dependence criteria of ICD-10 will be reduced to 
only three criteria: physical dependence (tolerance or withdrawal), 
priority of use (much time or reduced activities, or psychological/ 
physical harm), and impaired control (quit/cut or larger/longer or 
craving) [21]. Symptoms from at least two of the three proposed 
ICD-11 dependence criteria are needed for a diagnosis of alcohol 
dependence (‘Table 50.1). 

There are some early findings comparing DSM-IV to DSM-5 and 
ICD-11 (draft) to DSM-5. These findings show that the prevalence 
of AUD in the general population according to DSM-5 is somewhat 
higher than according to DSM-IV (abuse and dependence), but that 
very similar prevalence rates are obtained in clinical settings [22]. 
Furthermore, the threshold for DSM-IV alcohol dependence (23 
out of seven criteria) seems to be very similar to the threshold for 
DSM-5 moderate alcohol use disorder (24 out of 11 criteria) [23]. In 
a general population study from Australia, the prevalence of alcohol 
dependence according to ICD-10, ICD-11, and DSM-IV was 
very similar, but the prevalence of DSM-5 moderate/severe AUD 
was substantially higher [24]. Finally, a study among American 
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Table 50.1 Classification of alcohol use disorders in DSM-IV, DSM-5, ICD-10, and ICD-11 







7a 


7b 


DSM-IV 
Alcohol dependence (23/7) 


o reduce or stop alcohol use 


Alcohol use more or longer than 
intended 


Reduced social, occupational, and 





Lots of time spent to obtain, use, or 
recover from the effects of alcohol 


Continued alcohol use despite recurrent 


or persistent physical or psychological 
problems due to alcohol use 





Tolerance indicated by increased use to 
obtain same effect or reduced effect with 


the same amount 


Withdrawal manifested by specific 


withdrawal syndrome or alcohol use to 


relieve or avoid withdrawal symptoms 


Persistent desire or unsuccessful attempts 


recreational activities due to alcohol use 


DSM-5 


Alcohol use disorder (22/11) 
(mild 2-3/11; moderate 4-5/11; 


severe >6/11) 


Craving 


Persistent desire or unsuccessful 
attempts to reduce or stop 


Alcohol use more or longer than 
intended 


Failure to fulfil role obligations due 
to alcohol use 





Lots of time spent to obtain, use, or 
recover from the effects of alcohol 


Continued alcohol use despite 
recurrent or persistent physical or 
psychological problems due to 
alcohol use 


Tolerance indicated by increased 
use to obtain same effect or 
reduced effect with the same 
amount 


Withdrawal manifested by specific 
withdrawal syndrome or alcohol 
use to relieve or avoid withdrawal 
symptoms 


ICD-10 
Alcohol dependence (23/6) 


Craving 


Alcohol use more or longer than 
intended 


Neglect of other pleasures and 
responsibilities due to time needed to 
obtain and recover from alcohol use 


(Subsumed in above criterion) 


Continued alcohol use despite 
harmful consequences 


Tolerance indicated by increased use 
to obtain same effect 


Withdrawal manifested by specific 
withdrawal syndrome or alcohol 
use to relieve or avoid withdrawal 
symptoms 


ICD-11 
Alcohol dependence (22/3) 


1. Impaired control over 
substance use often 
accompanied by craving 


2. Substance becomes 


increasing priority in life 
and relegates other areas of 
life to periphery; continued 
alcohol use despite 
problems 


3. Physiological features, 
including tolerance, 
withdrawal, and/or alcohol 
use to prevent or alleviate 
withdrawal symptoms (not 
just hangover) 


8 


Continued alcohol use despite recurrent 


or persistent social or inter-personal 
problems due to alcohol use 


Hazardous drinking (for example, while 


driving) 


Failure to fulfil major role obligations due 


to drinking 


Recurrent alcohol-related legal problems 


Continued alcohol use despite 
recurrent or persistent social or 
inter-personal problems due to 
alcohol use 


Hazardous drinking (for example, 
while driving) 


Failure to fulfil major role 
obligations due to drinking 


Continued alcohol use despite 
recurrent or persistent social or inter- 
personal problems due to alcohol 
use 


Continued alcohol use despite 
recurrent or persistent physical or 
psychological problems due to 
alcohol use (see also criterion 6) 


(Partly in criterion 2) 


(Partly in criterion 2) 


Source: data from: Diagnostic and Statistical Manual of Mental Disorders, (Fourth Edition, DSM-IV, Copyright (1994); Fifth Edition, DSM-5, Copyright (2013)), American Psychiatric 
Association; World Health Organization, The ICD-10 Classification of Mental and Behavioural Disorders, Copyright (1996), World Health Organization; Poznyak V, An Update on ICD- 
11 Taxonomy of Disorders Due to Psychoactive Substance Use and Related Health Conditions, International Society of Research on Alcoholism (ISBRA), Copyright (2016), World Health 


Organization. 


adolescents admitted for intensive outpatient addiction treatment 
showed that ICD-11 produced a lower prevalence of AUD, but a 
much higher prevalence of alcohol dependence than DSM-IV al- 
cohol dependence and DSM-5 moderate/severe AUD, with very low 
levels of intersystem diagnostic agreement [25]. These findings show 
that the final word about the classification of alcohol use-related 
problems is still not spoken and that further studies are needed. 


Mechanism of action of alcohol and neurobiology 


of alcohol use disorders 





Alcohol modulates neural activity directly via ethanol-binding sites 
on several membrane receptors, including the N-methyl-D-aspartate 


(NMDA) receptor, the gamma-aminobutyric acid A (GABA-A) re- 
ceptor, the 5-hydroxytryptamine-3 (5-HT;) receptor, and ion chan- 
nels [26]. In addition to these primary targets, indirect effects on 
neurochemical and neuroendocrine systems trigger reinforcing 
and stress-related effects. The main indirect targets are the dopa- 
mine system, the opioid system, and the hypothalamus-pituitary- 
adrenocortical (HPA) axis [27-30]. 

Following chronic administration of alcohol, several neurobio- 
logical adaptations (for example, changes in gene expression, mo- 
lecular alterations, synaptic and cellular changes) take place that may 
result eventually in long-lasting alterations in neuronal network ac- 
tivity [31]. These alterations persist during abstinence. Corresponding 
behavioural effects become evident when the recently abstinent 
subject is exposed to alcohol-related stimuli or stress. Thus, even after 


successful detoxification and abstinence treatment, somebody with 
an AUD remains at risk for relapse. Animal models have assisted in 
identifying the molecular mechanisms that are primarily involved in 
alcohol reward and compulsive alcohol intake behaviour [32]. For 
example, a sensitization process within the dopaminergic reward 
system, especially in response to alcohol-related cues, drives relapse 
behaviour, even after protracted abstinence [33]. It is suggested that 
in the non-addicted, but vulnerable, brain, a phasic dopamine signal 
within the ventral striatum/nucleus accumbens (VS/NcAcc) leads to 
incentive salience and acts as a form of stimulus—reward learning, in 
which incentive salience is assigned to alcohol-related reward cues 
[34]. In the transition to addictive behaviour, VS/NcAcc-driven in- 
centive salience to reward transforms into repetitive, automatized 
stimulus-reaction schemata that are mainly mediated by the re- 
cruitment of the dopaminergic dorsal striatum (DS), a change that is 
here referred to as the ‘ventral—dorsal striatal shift’ [10-37]. In add- 
ition, preclinical studies suggest a critical involvement of prefrontal 
cortical-striatal connectivity with glutamate-dopamine interactions 
in the transition from recreational to compulsive alcohol-seeking be- 
haviour [38, 39]. The loss of type 2 metabotropic glutamate receptors 
(mGluR2) in the cortico-accumbal neurocircuitry due to chronic al- 
cohol use is important for the increased propensity to relapse [40]. 
Loss of these auto-receptors is accompanied by a loss of neuronal 
plasticity, that is, the mGluR2-mediated long-term depression (LTD) 
is abolished in the addicted brain, resulting in reduced prefrontal 
cognitive control over the hyperactive reward system [41]. Due to 
this reduced prefrontal control and the progressive recruitment of 
a striatal dopaminergic mechanism (“VS-DS shift’), automatized 
behaviour and repetitive and compulsive uncontrolled alcohol use 
are facilitated [36, 42]. Human neuroimaging studies are supporting 
the importance of prefrontal regions in the alcohol-addicted brain 
[43, 44]. Further detail on animal models and human experimental 
studies underlying this formulation is given in Chapter 48. 

At the psychological level, relapse can be understood using the 
negative reinforcement theory. Thus, avoidance of an aversive state 
during withdrawal—which is characterized by depressed mood and 
elevated anxiety—triggers a relapse to alcohol use [45]. In most pa- 
tients, these symptoms abate over 3-6 weeks of abstinence, while 
relapse risk persists long beyond this period. However, more subtle 
changes, such as increased behavioural sensitivity to stress, support 
the clinical relevance of negative emotionality for protracted abstin- 
ence and relapse [46]. Modelling this process of alcohol-induced 
negative emotionality in laboratory animals has led to the discovery 
of augmented corticotropin-releasing hormone (CRH) signalling 
in the amygdala as a pathological mechanism that drives negative 
emotionality, and thereby relief drinking. It has been postulated 
that CRH signalling via its CRH1 receptor is a key element of the 
neuroadaptive changes driving relapse behaviour [47]. Pathological 
engagement of CRH signalling in the amygdala leads to negative 
emotionality during protracted abstinence and subsequently to re- 
lief drinking. Another mechanism that leads to relief drinking is an 
upregulation of the dynorphin/kappa opioid receptor (KOR) system 
in the brain [48-50]. It is important to note that activation of the 
dynorphin/KOR system, especially in the NcAcc [51], induces a dys- 
phoric state [52], which may also drive relief drinking. 

These findings are important to better understand the nature of 
substance use disorders and the problems that patients with these dis- 
orders experience and to guide the development of new treatments. 
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Psychological treatments can be directed at increased motivation 
to reduce or stop the use of alcohol (for example, motivational 
interviewing), while pharmacological treatments can be directed at 
reducing the rewarding effects of alcohol (for example, opioid re- 
ceptor antagonists). Psychological and pharmacological treatments 
can also be used to reduce negative emotions related to substance 
use (CBT, a-adrenergic antagonists), to reduce craving (for example, 
cue-exposure treatment, glutamate antagonists), to improve cogni- 
tive control (for example, CBT, stimulants), and finally to replace al- 
cohol with less toxic substitutes (for example, high-dose baclofen). 
In the following paragraphs, these approaches will be discussed in 
greater detail. 


Psychological treatments for patients with an 


alcohol use disorder 





A crucial step in the treatment of patients with an alcohol use dis- 
order is motivation for treatment. Many people with an alcohol use 
disorder are not aware of the fact that their alcohol use is causing 
problems (precontemplation stage), or they do recognize that al- 
cohol is causing problems but they feel that there is still a reasonable 
balance between the short-term gains and the long-term problems 
(contemplation stage). In both cases, there is no perceived need to 
change one’s behaviour. At a certain time or after appropriate mo- 
tivational stimulation, people with an alcohol problem may realize 
that the balance between pros and cons begins to tip in the direction 
of change (preparation stage), which may actually result in concrete 
changes in behaviour and in changes of the environment (action 
stage). However, even after a period of reduced drinking or abstin- 
ence, relapse is still very likely to occur if no measures are taken to 
prevent this (maintenance stage). 

Until the 1980s, the common way to deal with ‘unmotivated’ pa- 
tients was a harsh and confrontational strategy to overcome ‘patho- 
logical denial. Since then, a more effective approach has been 
developed—motivational interviewing. It combines a supportive 
and empathic counselling style with a directive method for re- 
solving the perceived ambivalence in the direction of change, that 
is, reduced drinking or total abstinence [53]. In the latest version of 
motivational enhancement therapy (MET), the intervention mainly 
tries to evoke ‘change talk’ and to prevent ‘counterchange talk’ [54]. 
Ina recent fMRI study, it was shown that in patients with an alcohol 
use disorder, listening to their own change talk significantly reduced 
activation of the cingulate cortex and the insula during a priming 
dose of alcohol and that this reduction in brain activation was asso- 
ciated with a reduction in subjective craving [55]. MET is currently 
accepted as an evidence-based treatment for patients with an al- 
cohol use disorder, with similar effects in patients in different stages 
of motivation. 

In addition to MET, there are several other evidence-based psy- 
chological treatments for patients with an alcohol use disorder, 
including cognitive behavioural therapy (CBT), often with a strong 
emphasis on social skills training, although without clear support 
that improved coping skills mediate the benefit [56]; cue-exposure 
treatment (CET; often as part of CBT); 12-step facilitation (TSF), a 
professional intervention intended to make patients join a self-help 
group, for example Alcohol Anonymous (AA) meetings; contin- 
gency management (CM) with rewards for negative alcohol breath 








SECTION 8 Substance use disorders 


tests or attendance of therapy sessions (often as part of a commu- 
nity reinforcement approach); community reinforcement approach 
(CRA), generally including an amalgam of different techniques, 
for example social skills training, medication compliance support, 
medication, vocational training, marital therapy, and/or CM; and 
behavioural couple therapy [57]. It should be noted, however, that 
behavioural couple therapy is probably not more effective than 
individual CBT [58]. 

There are also several interventions with no proven effectiveness 
or with proven non-effectiveness, including psychodynamic psycho- 
therapy, experientially oriented therapies, and system-theoretically 
based approaches [57]. 

Finally, there are some new treatments with some empirical sup- 
port, but without conclusive evidence of their effectiveness. The 
most promising of these are: mindfulness-based relapse prevention 
[59]; acceptance and commitment therapy [60]; approach bias re- 
training [61]; and counterconditioning [62]. 

According to most authors, MET and CBT should be available to 
all patients with an alcohol use disorder as the standard treatment, 
with other treatments being available for special patients or as an 
additional treatment for those who do not or insufficiently respond 
to MET or CBT. However, other authors claim that CRA, as a more 
comprehensive treatment, should be the standard with a different 
set of treatment modules for different patients. Unfortunately, there 
are very few studies directly comparing these different approaches. 

With regard to MET/CBT, it is important to notice that group 
and individual CBT are equally effective [64] and that MET/CBT 
offered as an online self-help intervention is also effective, although 
an online self-help intervention blended with chat sessions with a 
professional is more effective and more cost-effective than online 
self-help-only intervention [65]. These findings show that MET and 
CBT can be effectively and cost-effectively offered in different treat- 
ment settings and even to people who are not (yet) ready to seek 
contact with an official addiction treatment service. 

An unresolved question is the effectiveness and cost-effectiveness 
of treatment according to the time-honoured Minnesota model, 
that is, a comprehensive inpatient 12-step treatment followed by 
long-term AA attendance. In a review for the Netherlands’ National 
Health Care Institute, it was concluded that the evidence for the ef- 
ficacy of inpatient and day care treatments according to this model 
was weak at best (only two RCTs), that these treatments are probably 
not (much) more effective than standard outpatient treatments (for 
example, MET, CBT), and that these treatments are almost certainly 
less cost-effective than standard outpatient treatments [63]. 


Pharmacological treatments for patients with an 


alcohol use disorder 





The first medication for the treatment of alcohol use disorder was 
disulfiram. Disulfiram and similar compounds were originally used 
for the industrial vulcanization of rubber. Some workers noticed 
that exposure to disulfiram, in combination with alcohol, made 
them feel sick. In 1948, Danish researchers made the connection and 
proposed disulfiram for the treatment of patients with an alcohol use 
disorder. Normally, alcohol is broken down in the liver by the en- 
zyme alcohol dehydrogenase (ADH) to acetaldehyde, which is then 
converted by the enzyme acetaldehyde dehydrogenase (ALDH) to 


harmless acetate. Disulfiram (125-250 mg/day) blocks ALDH, and 
after alcohol intake, the concentration of acetaldehyde in the blood 
may become 5-10 times higher than during metabolism of the same 
amount of alcohol alone. As a consequence, patients using alcohol 
while under the influence of disulfiram will experience a set of very 
unpleasant symptoms, including flushing of the skin, accelerated 
heart rate, shortness of breath, nausea, vomiting, throbbing head- 
ache, and sometimes even visual disturbances, mental confusion, 
postural syncope, and circulatory collapse. 

In a recent meta-analysis, disulfiram was more effective than 
placebo in open-label, non-blinded studies (effect size g = 0.70), 
but not in blinded studies (g = 0.01), suggesting that the effect of 
disulfiram depends directly on the patient’s anxious anticipation of 
its adverse effects [66]. Therefore, disulfiram seems to be a form of 
assisted psychotherapy, rather than pharmacotherapy per se. In the 
same meta-analysis, supervised disulfiram was more effective than 
acamprosate and naltrexone in establishing abstinence and to pre- 
vent relapse. However, it does not reduce alcohol craving, and (thus) 
there is poor compliance and the medication seems to be effective 
only when taken under supervision [67]. Moreover, disulfiram has 
many contraindications, including brain damage, heart failure, liver 
cirrhosis, and psychosis—conditions that are relatively frequent in 
patients with an alcohol use disorder. 

From the introduction of disulfiram, it took about 40 years be- 
fore new medications for the treatment of alcohol dependence en- 
tered the market in the early 1990s: acamprosate first in Europe 
and—much later—in America, and naltrexone first in America 
and—somewhat later—in Europe. Acamprosate changes the balance 
between the GABA and glutamate systems probably by its action on 
the NMDA receptor. It reduces withdrawal and craving, and it pro- 
motes long-term abstinence in compliant patients. It has to be taken 
at 333 mg three times a day, and the main contraindication is severe 
kidney impairment. Naltrexone is a non-selective opioid receptor 
antagonist that reduces alcohol reward and craving and results in 
reduced drinking with less heavy drinking days, and in some pa- 
tients, it also promotes abstinence. Naltrexone is taken once a day 
(50 mg), and the main contraindications are the use of (prescribed 
or illicit) opioids, acute hepatitis, and liver failure. However, with 
the dose that is used, interference with liver function is rare, and if 
alcohol intake is reduced, liver function may actually improve. In a 
recent meta-analysis [67], acamprosate and naltrexone were equally 
effective, with no significant differences in drinking outcomes in 
studies where the two medications were directly compared. In 
placebo-controlled trials, acamprosate was mainly effective in the 
prevention of any drinking [number needed to treat (NNT) = 12), 
whereas naltrexone was mainly effective in the prevention of heavy 
drinking (NNT = 12) and much less so for the prevention of any 
drinking (NNT = 20). 

Recently, a fourth medication—nalmefene—was registered in 
Europe for the treatment of alcohol dependence. Nalmefene is a mu 
(MOR) and delta opioid receptor (DOR) antagonist and a kappa 
opioid receptor (KOR) partial agonist. Since in alcohol-dependent 
patients, the KOR is already overstimulated, nalmefene is function- 
ally a KOR antagonist. Nalmefene reduces alcohol reward (MOR 
antagonist), and it may reduce dysphoria (KOR partial agonist) 
in chronic alcohol-dependent patients [68]. In the recommended 
dose, nalmefene is quickly absorbed, with a maximum plasma con- 
centration reached within 1 hour, occupation of almost all MORs 


within less than 3 hours, and full receptor occupancy remaining for 
at least 26 hours [69]. In contrast to the other registered medica- 
tions, nalmefene is used ‘as needed’ (that is, only when the patient 
expriences the risk of relapse); the goal is reduction of (heavy) al- 
cohol use, rather than total abstinence, and the medication should 
only be prescribed to heavy-drinking patients. The strategy to take 
the medication ‘as needed’ puts the responsibility for the treatment 
clearly on the patient. As such, it synergizes with strategies that try 
to empower patients. Moreover, the goal of reduced drinking allows 
patients who are not ready or not willing to stop drinking completely 
to enter treatment. In a meta-analysis [68], nalmefene was shown to 
be effective in the reduction of the number of heavy drinking days, 
with a small effect in alcohol-dependent patients (g = 0.20) and a 
somewhat larger effect in alcohol-dependent patients with (very) 
heavy drinking risk levels (g = 0.33). 

There are also other medications, often with different molecular 
targets, that have been tested for the treatment of alcohol depend- 
ence. Since none of these compounds have relevant marketing au- 
thorization, their prescription for alcohol dependence is off-label. 
The most promising so far is topiramate (150-300 mg/day), an atyp- 
ical anti-epileptic drug that probably blocks voltage-gated sodium 
channels and high-voltage-activated calcium channels, strongly 
stimulates specific GABA-A receptor isoforms, and somewhat in- 
hibits glutamatergic AMPA/kainate receptors. In a meta-analysis of 
seven RCTs [70], topiramate was effective in both the prevention of 
any drinking (g = 0.47) and the reduction of the number of heavy 
drinking days (g = 0.41). Topiramate has many frequently occurring 
side effects (for example, weight loss, paraesthesia, fatigue, dizzi- 
ness, sleepiness, depression), and titration can take up to 8 weeks, 
making it less suitable for weakly motivated and/or impulsive 
patients. 

Other promising medications include [71, 72] gabapentin 
(GABAergic medication registered for epilepsy, neuropathic pain, 
and restless legs), sodium oxybate (GABAergic medication regis- 
tered for narcolepsy), varenicline [nicotinic acetylcholine receptor 
(partial) agonist registered for nicotine dependence], doxasozine 
(al-adrenergic antagonist registered for hypertension and urinary 
retention due to benign prostate hyperplasia), modafinil (a 
wakefulness-promoting dopamine reuptake inhibitor registered for 
narcolepsy), and baclofen (a GABA-B agonist registered for spasti- 
city). In France, more than 200,000 patients with an alcohol use dis- 
order have already been treated with baclofen at low (up to 60 mg/ 
day) or (very) high doses (up to 330 mg/day), despite a paucity of 
evidence for efficacy at low doses and for efficacy and safety at (very) 
high doses. This is a very promising medication, but further studies 
are needed to learn more about the most adequate dose, the most 
suited patient profile, and the long-term safety [73]. 


Improving treatment outcomes 


In the previous paragraphs, it has been shown that there many 
proven effective treatments available for patients with an alcohol use 
disorder. However, effect sizes are small to moderate (g = 0.20-0.50; 
NNT 10-20), and many patients do not respond to these treatments. 

One of the main determinants of outcome is the stage of the illness 
when patients enter treatment. Most patients are in a relatively late 
stage, and thus a rather advanced state of the disease with more 
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complex neural consequences of their chronic and excessive alcohol 
use (for example, VS-DS shift) and with serious physical, psycho- 
logical, and social consequences, including liver impairment, de- 
pression, homelessness, etc. Their response to treatment is generally 
less good than in patients with a short illness history. This situation 
can only be improved by interventions directed at destigmatization 
[74] and increased availability of (anonymous) online self-help and 
online blended interventions [75]. 

Other strategies to improve the effect of treatments for alcohol use 
disorders are: improving treatment compliance, combining psycho- 
therapy and pharmacotherapy, combining medications with a different 
mechanism of action (polypharmacy), and personalized medicine, 
that is, matching specific patients to specific treatments [71]. 

For improved compliance, attempts have been made to produce 
long-acting medications, including subcutaneous disulfiram im- 
plants and intramuscular naltrexone injections. However, so far no 
positive results have been reported. 

In the combination of psychotherapy and pharmacotherapy, it is 
clear that pharmacotherapy adds to the effect of psychotherapy (all 
pharmacotherapy registration trials used the medication/placebo as 
an add-on to some form of psychotherapy or psychosocial support), 
whereas psychotherapy does not seem to add substantially to the 
effect of pharmacotherapy, neither in terms of compliance nor in 
terms of alcohol use outcomes [76, 77]. 

Combined use of medications may hold more promise. 
Acamprosate plus naltrexone has been shown to be more effective 
than either medication alone [78, 79], and the same was true for 
acamprosate plus disulfiram, compared to acamprosate alone [80]. 
The main question that remains here is when combined use of dif- 
ferent medications should be considered: (1) right at the start of 
treatment to prevent treatment dropout due to non-response, but 
with the risk of more side effects; or (2) only after treatment with a 
single medication failed, but with the risk of further damage during 
the unsuccessful single medication treatment. 

In our view, the most promising way to improve the (cost-)effect- 
iveness of alcohol use disorder treatments is the use of personalized 
medicine, that is, tailoring treatments to specific patients based on 
direct observable clinical characteristics (phenotype), indirectly ob- 
servable brain functions (endophenotype), and/or genetic indica- 
tors (genotype). With regard to the intensity of treatment, it is well 
known that inpatient treatment is generally more expensive, but not 
more effective than (intensive) outpatient treatment [81], probably 
with the exception of some very severely alcohol-dependent patients 
and those with serious social problems, especially those with un- 
stable housing or being homeless [82]. 

With regard to psychotherapy, there have been some (very) large 
studies looking at patient-treatment matching, but none of these 
studies have found that patients with certain patient and/or en- 
vironment characteristics (level of alcohol involvement, cognitive 
impairment, conceptual level, gender, meaning seeking, motiv- 
ational stage, psychiatric comorbidity, sociopathy, social support 
for drinking, alcoholism typology) do clearly better with specific 
types of psychotherapy [83, 84]. However, in a re-analysis of the 
study MATCH, there were some indications that MET was per- 
forming better than CBT in patients with lower baseline motivation 
for change [85]. 

With regard to the pharmacotherapy of alcohol dependence, there 
are some very promising matching findings (Table 50.2). 
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Table 50.2 Personalized pharmacotherapy model for alcohol dependence 





Abstinence Acamprosate (anxiety, withdrawal, GATA4) 


Naltrexone’ (FH +, ASPD, SL, OPRM1) 


Naltrexone (FH +, ASPD, SL, OPRM1) 
Nalmefene (dysphoria?) 


Reduced drinking 


Topiramate (GRIK1) 


Disulfiram (partner) 


Baclofen HD (anxiety, withdrawal) 
Gabapentin (sleep problems) 
GHB* 


Modafinil (high impulsivity) 


Varenicline (smoking) 
Doxazosin (FH +) 


Personal characteristics for treatment choice within brackets in italics; bold medications are registered for the treatment of alcohol dependence. 

GATA4, gene encoding a member of the GATA family of zinc finger transcription factors; FH +, positive family history of alcohol use disorder; ASPD, antisocial personality disorder; SL, 
sweet-liking; OPRM1, mu opioid receptor gene; GRIKT, gene encoding the glutamate ionotropic receptor kainate type subunit 1; GHB, gamma-hydroxybutyric acid/sodium oxybate. 
“Naltrexone is registered for indication ‘abstinence’, but seems to work better for indication ‘reduced drinking’. 


* Only used for treatment of alcohol dependence in Austria and Italy. 


For example, in a review of studies on the use of naltrexone in the 
treatment of alcohol dependence, it was shown that patients with 
one or more of the following characteristics tended to do better with 
naltrexone: positive family history of alcohol problems, Asn40Asp 
polymorphism of the MOR (OPRM1) gene, pretreatment abstin- 
ence, and sweet-liking [86]. Genetic variations in the GATA4 gene 
seem to have an effect on the treatment response to acamprosate in 
alcohol-dependent patients via modulation of atrial natriuretic pep- 
tide (ANP) plasma levels [87]. Probably the most robust pharmaco- 
genetic finding so far is the moderating effect of the glutamatergic 
GRIK1 gene on the effect of topiramate in treating patients with al- 
cohol dependence [88]; in a prospective placebo-controlled RCT, 
topiramate was only effective in rs2832407 C allele homozygotic pa- 
tients (42%), and not in A allele carriers (58%). Finally, modafinil 
has been shown in one study to have a favourable effect in alcohol- 
dependent patients with high motor impulsivity, whereas it had a 
detrimental effect in alcohol-dependent patients with low motor 
impulsivity, indicating that individual patient characteristics cannot 
only be predictive of benefits, but also of potential harms [89]. 

These and similar findings may explain why trials find no or only 
very small treatment effects in unstratified, heterogenous patient 
populations. On the other hand, such findings often arise in sec- 
ondary analysis of trial data and thus require replication. For ex- 
ample, a prospective RCT failed to confirm the matching effect of 
the Asn40Asp polymorphism of the OPRM1 gene [90]. All future 
studies must clearly be informed by our growing understanding of 
individual or subgroup differences in complex disorders. 


Conclusions 


Alcohol use disorders are frequently occurring problems that in- 
volve severe personal suffering and create substantial societal costs. 
The definition of these disorders is still under debate, but the existing 
working definitions have made it possible to investigate their neuro- 
biology and to study the effectiveness and cost-effectiveness of psy- 
chotherapeutic and pharmacological interventions. These studies 
have shown that there are many proven effective interventions, 
albeit with limited effect sizes. Research has also shown that early 
detection and treatment (for example, with online interventions), 
combinations of treatments (polypharmacy), and a better system 
of patient-treatment matching can lead to a better outcome and a 
better future for patients with an alcohol use disorder. 
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The term opioid is a general term applied to drugs derived from 
the opium poppy (Papaver somniferum) and the range of naturally 
occurring, synthetic, and semi-synthetic compounds derived from 
them [1]. These drugs relieve pain (analgesic effect) and create a 
sense of euphoria (narcotic effect). Opioids produce most of their 
effects by binding to three main opioid receptor subtypes in the 
brain: mu (1), delta (5), and kappa (x) [1]. There are also endogenous 
opioids, such as endorphins, that help the body regulate the sensa- 
tion of pain, although mostly, the term opioid is used to describe a 
class of exogenous drugs (such as heroin and morphine). Examples 
of different opioids and their mode of action (that is, agonists, partial 
agonists, antagonists, and agonist-antagonists) [1] are provided in 
Table 51.1. Opioid agonists include heroin, morphine, and metha- 
done; a partial agonist is buprenorphine, and opioid antagonists in- 
clude naltrexone and naloxone. 


The role of opioid medications 


Pharmaceutical opioids have an important, legitimate role in med- 
ical practice and contribute to the health and well-being of many 
patients. There are two broad clinical indications for their use: (1) 
the management of pain that is either acute or chronic; and (2) the 
management of opioid dependence through opioid substitution 
therapy (OST). The World Health Organization (WHO) states that 
opioid medications are ‘absolutely necessary for the management of 
severe cancer pain [2, p. 7] and “essential medicines for the treatment 
of opioid dependence [3]. Although the optimal use of opioids in 
the management of chronic non-cancer pain is increasingly debated 
and a subject of significant controversy in North America [4], there 
are clear statements from pain organizations supporting their con- 
tinuing use [5, 6]. 


Common opioids 


Opium is extracted from the opium poppy, and heroin (diacetyl- 
morphine) is synthesized from the extraction; both are short-acting 
opioid agonists [1,7]. Heroin has medical applications in some coun- 
tries, with a long tradition of medical use for analgesia in England 





(for example, [8, 9]). In other countries, including Australia, both 
heroin and opium are illegal and their use is heavily stigmatized be- 
cause of the health and social harms associated with illicit use [10]. 
A large number of pharmaceutical opioids have been developed for 
medical purposes. 

Many of the opioid medications used most commonly in the 
management of pain (both acute and chronic) are longer acting 
than heroin and include morphine, oxycodone, hydromorphone, 
propoxyphene, fentanyl, pethidine, codeine, and less commonly 
methadone and buprenorphine (Table 51.1) [11]. Methadone and 
buprenorphine are the most commonly used opioids for the man- 
agement of opioid dependence. 
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The desirable effects of opioid agonists and partial agonists include 
euphoria, analgesia, and sedation [10]. The more common adverse 
effects associated with opioid use include nausea, vomiting, respira- 
tory depression, constipation, drowsiness, and confusion [14, 15]. 
These side effects often limit the dosing and effectiveness of opioids, 
leading to early discontinuation, under-dosing, and adequate anal- 
gesia [14]. Prolonged use of opioids may result in additional adverse 
consequences such as tolerance hyperalgesia, decreases in testos- 
terone, oestrogen, and other hormones, and immunosuppression 
[14]. Higher doses, particularly deliberate overdoses, can produce 
severe respiratory depression, circulatory failure, coma, and death 
[10, 16, 17]. 

When opioid medications are used outside the guidelines for safe 
and effective use, adverse effects are more likely, particularly those 
associated with higher doses and injection. Additional risks are as- 
sociated with concurrent use of sedative drugs (for example, [18, 
19-21]) and the presence of pre-existing conditions like impair- 
ment of liver function in which medicinal opioid use may be contra- 
indicated [1, 17]. The injection of opioids also carries risks such as 
the potential transmission of blood-borne virus (BBV) infection if 
injecting equipment is shared, as well as harms related to injection 
of a non-sterile medication that is intended for administration by 
another route. The risk of developing opioid dependence may also 
be greater if used outside of, or without, medical supervision. 
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Table 51.1 Common pharmaceutical opioids 


Type of Common drug names Main medical Half-life’ 
opioid indication 











Agonist Codeine Pain 3 hours 
Fentanyl Pain 3 hours’ 
Hydromorphone Pain 2.5 hours 

ethadone Pain/opioid 15-60 hours 
dependence 
orphine Pain 3 hours 
Oxycodone Pain 2.5 hours 
Pethidine Pain 3 hours 
Dextropropoxyphene Pain 6-12 hours 

Partial Buprenorphine Pain/opioid 35 hours? 

agonist dependence 

Antagonist altrexone Opioid 4-13 hours 

dependence 
aloxone Emergency 1-1.5 hours 
treatment of 


opioid overdose 


“Half-life for immediate-release formulation. 

1 Following an intravenous dose. 

? Following a sublingual dose. 

Source: data from Brunton L, Chabner B, Knollman B, Goodman & Gilman’ The 
Pharmacological Basis of Therapeutics, 11th ed., Copyright (2006), McGraw-Hill; Pain 
Physician, 11(2 Suppl), Trescot AM, Datta S, Lee M, et al., Opioid pharmacology, $133-53, 
Copyright (2008), American Society of Interventional Pain Physicians. 


Regardless of whether opioids are used with or without medical 
supervision, tolerance will develop following a period of regular use 
[22]. Opioid tolerance is a predictable pharmacological adaptation, 
and patients may require increasing amounts of the drug to maintain 
the same pharmacological effects [22, 23]. Tolerance develops to the 
analgesic, euphoric, sedative, respiratory depressant, and nauseating 
effects of opioids, but not to their peripheral effects on miosis (con- 
striction of the pupils) and constipation [22]. 

If use ceases after tolerance has developed, withdrawal symp- 
toms usually occur. Common withdrawal symptoms include body 
aches, diarrhoea, gooseflesh, loss of appetite, nervousness, restless- 
ness, runny nose, sneezing, tremors or shivering, stomach cramps, 
nausea, insomnia, increased sweating, lethargy, tachycardia, and 
fever [16, 22]. With appropriate medical supervision and gradual 
withdrawal, these symptoms are usually mild. When opioids are 
stopped abruptly, however, these symptoms are more severe. 


Opioid dependence 


The clinical features of opioid dependence 


The repeated, ongoing use of heroin, opium, or pharmaceutical opi- 
oids can result in people developing the chronic, often relapsing, 
condition known as opioid dependence. Opioid dependence causes 
significant burden to the user, their family, and the broader com- 
munity [10]. It can result in significant costs to society through 
unemployment, homelessness, family disruption, loss of economic 
productivity, social instability, criminal activity, and/or ill health 
[24, 25]. It is typically characterized by the symptoms of tolerance, 
withdrawal upon cessation, compulsive use, loss of control over use, 
escalation in dose or length of use, unsuccessful quit attempts, pro- 
gressive neglect of usual interests and responsibilities, continued use 
despite harm, and/or increased time spent obtaining, using, or re- 
covering from use (‘Table 51.2) [26, 27]. 


Table 51.2 Criteria for past year ICD-10/DSM-IV opioid 
dependence 


have been present together during the previous 12 months: 
Tico osm | 
Compulsive use aie 

Loss of control (onset, termination, or levels of use) v 

Escalation in dose or length of use 
Unsuccessful attempts to cease or control use 
Withdrawal 

Tolerance 


Progressive neglect of usual interests/responsibilities 


SIS ITS |S 


Continued use despite harm 


SIS PSI SIS ISIS 


Increased time spent obtaining, using, or recovering 
from use 


Source: data from: World Health Organization, The ICD-10 Classification of Mental 
and Behavioural Disorders, Copyright (1996), World Health Organization; Diagnostic 
and Statistical Manual of Mental Disorders, Fourth Edition, DSM-IV, Copyright (1994), 
American Psychiatric Association. 


The major health consequences of illicit opioid dependence in- 
clude an increased risk of blood-borne infections such as HIV, HCV, 
and HBV through injection [24], overdose, and highly elevated risks 
of premature mortality [28-30]. In the 2000 WHO Global Burden of 
Disease estimates, it was thought that illicit drug use caused around 
200,000 deaths worldwide, of which opioid overdose was estimated 
to account for around 70,000 (around 35%), and AIDS around 
105,000 (around 53%) [28]. 

People affected by opioid dependence frequently fluctuate be- 
tween the physical states of intoxication and withdrawal [31]. This 
pharmacological instability can disrupt the neurobiological sys- 
tems that regulate, among other things, emotion, mood, and sleep, 
and the individual can experience dysphoria and persistent drug 
cravings [32, 33]. Comorbid psychiatric disorders are common, 
including elevated levels of depression, anxiety, suicidality, person- 
ality disorders, and post-traumatic stress disorder [34, 35]. Among 
people who are opioid-dependent, the daily challenge of obtaining 
opioids tends to dominate their experience and activities. The cha- 
otic lifestyle of opioid dependence has profound social effects on the 
person and the people around them—managing dependence and 
the associated harms is often a long-term issue [10]. 

Currently, there are two main systems of classification for the diag- 
nosis of opioid use disorders internationally: (1) the International 
Classification of Diseases, tenth revision (ICD-10) produced by the 
WHO and typically preferred in health epidemiology; and (2) the 
Diagnostic and Statistical Manual of Mental Disorders (DSM-IV-TR, 
DSM-5) preferred by the mental health sector and the United States 
in particular [26, 27]. Both ICD-10 and DSM-IV-TR historically 
used the term opioid dependence to describe a pattern of ‘maladap- 
tive behaviours, such as loss of control over use, craving, preoccupa- 
tion with use, and continued use despite causing harm [26, 27], but 
both classification systems have recently undergone revisions, with 
ICD-11 still in draft form. 

The newest edition of the Diagnostic and Statistical Manual of 
Mental Disorders (DSM-5) made substantial changes to the way 
in which opioid use disorders were conceptualized. DSM-5 no 
longer uses the terms ‘abuse’ and ‘dependence; in favour of a single 


continuum of opioid use disorders classified as mild, moderate, or 
severe, depending on the number of diagnostic criteria met [36]. In 
addition, tolerance and withdrawal were previously included as fea- 
tures of the drug use disorder opioid dependence but, on their own, 
were not sufficient for the opioid dependence diagnosis. In DSM-5, 
tolerance and withdrawal are excluded as features of an opioid use 
disorder where an individual is prescribed an opioid and taking it as 
directed [36]. 


Epidemiology and natural history of problematic 

opioid use 

Illicit opioid use, typically involving heroin, but increasingly 
involving the extra-medical use of other pharmaceutical opioids, 
is a significant health problem internationally. In many developed 
countries, dependent heroin users are typically daily, or near daily, 
injectors of heroin, and other opioid and sedative drugs in com- 
bination with, or as an alternative to, heroin. They continue to use 
heroin despite the significant social and health problems arising 
from use, such as being arrested for drug or property crimes, with 
subsequent imprisonment, exposure to BBVs and increased poten- 
tial for contracting infectious diseases, and fatal or non-fatal opioid 
overdose. Increasingly, in countries such as the United States, there 
are new populations of people who developed opioid dependence 
through the extra-medical use of pharmaceutical opioids [37] and 
there is evidence of people transitioning from problematic use of 
pharmaceutical opioids to heroin injection [38]. 

Research with prospective cohort studies of dependent opioid users 
in the United Kingdom, Australia, and the United States indicates that 
many dependent heroin users who seek treatment or come to atten- 
tion through the legal system continue to use heroin for decades [39- 
42]. In this population, daily heroin use is punctuated by periods of 
abstinence, drug treatment, and imprisonment [10]. When periods 
of voluntary and involuntary abstinence during treatment or impris- 
onment are included, it has been estimated that dependent heroin 
users use heroin daily for 40-60% of their 20-year use careers [43, 44]. 
‘Remission rates for heroin dependence are half those for other illicit 
drugs [45]. It is this long-term pattern of problematic opioid use that 
presents challenges to clinicians and researchers in determining the 
best management approaches and in ensuring that the extent of the 
adverse social and health consequences are minimized. 

The illegality of opioids, such as heroin, makes the accurate assess- 
ment of how many people use these drugs difficult; however, recent 
estimates suggest there are 15.5 million problem opioid users world- 
wide [46, 47], and although the prevalence of opioid dependence is 
low (0.7-0.9% of the global population [48]), it is the largest direct 
contributor to the burden of illicit drug use [46, 48]. The harms to 
the individual and the community are significant, with large burden 
associated with infectious diseases (for example, HIV, HBV, and 
HBC) and overdose deaths [30, 46]. 


Risk factors for opioid use and dependence 


Very few studies have examined the risk factors and life pathways 
that lead young people to use, and become dependent on, heroin. 
This makes strong statements about the precursors of illicit opioid 
use difficult. There is, however, a large literature on risk factors for 
early use of alcohol and cannabis which indicates that young people 
who are the earliest initiators and heaviest users of alcohol and can- 
nabis are those who are most likely to use heroin [49-51]. 
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Two aspects of the family environment are associated with in- 
creased rates of licit and illicit drug use among children and ado- 
lescents. The first is the extent to which the child is exposed to a 
disadvantaged home environment, with parental conflict and poor 
discipline and supervision [52]. The second is the extent to which 
the child’s parents and siblings use alcohol and other drugs [52]. 
These family environment factors can give rise to chronic childhood 
adversity but specifically predispose to drug use. 

Children who perform poorly in school, those with impulsive or 
problem behaviour or conduct disorder in childhood, and those 
who are early users of alcohol and other drugs are more likely to use 
drugs like heroin [52, 53]. The nature of the relationship between 
peer affiliations and adolescent substance use remains controversial, 
but affiliation with drug-using peers is an important risk factor for 
drug use, which operates independently of individual and family 
risk factors [49]. 

Exposures to these family and social risk factors are highly cor- 
related. Young people who initiate substance use at an early age have 
often been exposed to multiple social and family disadvantages. 
They also tend to be impulsive, to have performed poorly at school, 
to come from families with problems and a history of parental sub- 
stance use, and to affiliate with delinquent peers [49, 51, 52]. These 
psychosocial factors are confounded by genetic factors predisposing 
to substance use. The genetics of addiction are described in more 
detail in Chapter 49. 

North America is currently experiencing escalating pharmaceut- 
ical opioid use and harm [37], recently labelled a ‘public health emer- 
gency’ by the Trump administration, with substantial increases in 
opioid-related deaths [54] and strain upon the health and drug treat- 
ment system [55]. The problems observed across North America 
appear to have been driven by widespread prescribing of controlled- 
release and ‘strong opioids’ such as oxycodone and fentanyl [56]. In 
this context, non-medical use of pharmaceutical opioids in child- 
hood and early adolescence is a strong predictor of heroin use onset 
among adolescents and young adults [57]. Other countries, such as 
Australia, have also documented increasing opioid exposure across 
the general population, paralleled by increasing harms, including 
opioid-related hospital admissions, overdose, and opioid depend- 
ence and treatment-seeking [58]. 


Treatment of opioid dependence 


Worldwide, of all the illicit drugs, the demand for treatment is 
highest for opioids. The chronic, relapsing nature of opioid depend- 
ence often results in repeated treatment episodes and prolonged 
treatment need. The aims of any opioid treatment programme 
should be guided by the principles of reducing or ceasing opioid use, 
preventing future harms associated with opioid use, minimizing 
mortality risk, and improving quality of life and well-being [59]. 


Treatment ideologies 


Like alcohol, the dependence or addiction to illicit opioids has some- 
times been viewed as a moral fault; however, increasingly, heroin 
dependence is being managed as a medical disorder. While early at- 
tempts to achieve abstinence were prominent, the recognition that 
opioid dependence is a chronic, relapsing disorder led to a focus on 
methods that would address the issue of relapse. Pharmacological 
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treatments using methadone were viewed as a means of achieving 
neurobiological stability, thus maintaining the person in treatment, 
minimizing their relapse to illicit opioid use, and reducing the risk 
of fatal opioid overdose. 

Other than methadone maintenance clinics, the treatment ap- 
proaches for opioid dependence in many countries (for example, the 
United States) were largely non-medically orientated, predominantly 
focusing on psychosocial approaches to achieving and maintaining 
abstinence. There is a long tradition of using residential rehabilita- 
tion through a range of models and philosophies including Concept 
House and 12-Step models derived from Alcoholics Anonymous 
(usually referred to as Narcotics Anonymous). More recently, the 
introduction of buprenorphine for use in general practice was 
viewed in many countries (including the United States and France) 
as an opportunity to move away from more specialist clinics into pri- 
mary care and to broaden access to opioid maintenance treatment 
[60]. Pharmacotherapies, such as methadone and buprenorphine, 
have been and remain the mainstays of effective management of 
opioid dependence. 


Opioid agonist (maintenance) treatments 


Opioid agonist (maintenance) treatments, typically oral methadone 
or sublingual buprenorphine (+ naloxone), used at the optimal dose 
range, are effective in treating opioid dependence. Maintenance 
treatment is designed to be an ongoing treatment. Maintenance 
use of pharmaceutical-grade opioids with a long duration of ac- 
tion, in known doses and purity, provides an opportunity to sta- 
bilize the person by eliminating withdrawal, craving, participation 
in obtaining illegal opioids, and the need for frequent (and risky) 
drug injection. When provided in the context of high-quality treat- 
ment services, these medications interrupt the cycle of intoxication 
and withdrawal, greatly reducing heroin and other illicit opioid use, 
crime, and the risk of death through overdose [59]. 


Methadone 


The real breakthrough in the treatment of opioid dependence was 
the development of opioid agonists for maintenance treatment. 
Methadone was the first of these agonist drugs, introduced by Dole 
and Nyswander in New York in the early 1960s. They defined opioid 
dependence as a ‘physiological disease characterised by a permanent 
metabolic deficiency, which was best managed by administering the 
opioid-dependent person ‘a sufficient amount of drug to stabilise 
the metabolic deficiency’ [61]. They introduced orally adminis- 
tered high or ‘blockade’ oral methadone doses at stabilization levels 
of between 50 mg to 150 mg/day as maintenance treatment [62]. 
Methadone treatment was administered as a component of a long- 
term supporting programme with maintenance, rather than opioid 
abstinence, as the original treatment goal [63]. 

This original maintenance model has sometimes been altered by 
lowering methadone doses, time-limiting treatment programmes, 
and responding to ‘treatment failure’ (for example, ongoing il- 
licit opioid use) in a punitive fashion such as expelling the indi- 
vidual from treatment [63]. A systematic review found methadone 
doses of 60-100 mg/day are more effective than lower dosages in 
retaining patients and in reducing the use of heroin and cocaine 
during treatment. The optimal dose will vary for individuals and 
requires clinical judgement, but clinical guidelines should reflect 
the evidence [64]. 


The most effective methadone programmes are those that follow 
the original Dole and Nyswander dose range [63]. There have been 
a small number of RCTs on the efficacy of methadone treatment in 
comparison to control conditions, and they have shown that metha- 
done maintenance was more effective than detoxification, no treat- 
ment, or placebo at retaining subjects in treatment and in reducing 
opioid use [65, 66]. To confirm these findings, similar efficacy has 
been noted in observational studies, with the additional finding that 
methadone-maintained individuals report reduced participation in 
criminal activity and reduced mortality [30, 67, 68]. 

With the spread of HIV/AIDS in the 1980s, methadone treat- 
ment received additional support and funding in countries around 
the world in the effort to slow the spread of the virus among the 
population of injecting drug users. There is reasonable evidence that 
methadone treatment was effective in this goal, if given at adequate 
dose levels and if patients remained in treatment [69, 70]. Its efficacy 
in helping reduce the prevalence of other established blood-borne 
epidemics such as HCV and HBV was and remains less clear, al- 
though there is evidence that good coverage of opioid maintenance 
treatment, combined with needle-syringe programmes, results in 
lower HCV incidence [71, 72]. Partly in recognition of its role in 
reducing the spread of HIV, methadone was included in the WHO's 
Model List of Essential Medicines in 2005 [73]. 


Buprenorphine 


The partial opioid agonist buprenorphine was developed in the 
1970s as an analgesic and was soon after investigated for the treat- 
ment of opioid dependence. Early work using buprenorphine 
administered by the subcutaneous route characterized it as an 
opioid with low physical dependence liability, with minimal with- 
drawal syndrome [74]. Subsequently, evidence has suggested that 
buprenorphine does produce mild to moderate withdrawal syn- 
drome [75]. Substantial international research effort has established 
the efficacy of buprenorphine maintenance therapy in RCTs [76]. 
France went from having no opioid maintenance treatments to 
registering both methadone and buprenorphine in 1995. By the 
next year, all registered doctors in France were able to prescribe 
buprenorphine without any additional training or licensing (the 
number of places in methadone treatment remained limited) [77]. 
This has resulted in approximately 65,000 patients per year being 
treated with buprenorphine, ten times the number treated there 
with methadone [78]. While other countries have not adopted 
Frances more liberal buprenorphine-prescribing policies, it has 
been registered for use in the maintenance treatment of opioid de- 
pendence in Australia since 2001 and the United States since 2002, 
and many European countries. In Australia, medical practitioners 
have to be registered as buprenorphine prescribers in a similar ac- 
creditation process to methadone prescribing. In the United States, 
buprenorphine is less strictly regulated than methadone [79]. 
Buprenorphine is poorly absorbed when taken orally, so the usual 
route of administration is sublingual. Because the opioid effects 
plateau at increasing doses, buprenorphine carries a lower overdose 
risk than methadone, even if taken with other opioids [80]. It may 
still cause respiratory depression, however [81], particularly iftaken 
with other sedative drugs such as alcohol or benzodiazepines [20,21]. 
Buprenorphine dissociates from opioid receptor binding sites very 
slowly, possibly explaining the limited withdrawal syndrome ob- 
served with its use [82, 83]. Like methadone, buprenorphine has 


applications in both the management of pain and the treatment of 
opioid dependence. High-dose formulations have been used in- 
creasingly as maintenance treatment, and buprenorphine joined 
methadone in 2005 on the WHO's Model List of Medicines [84] as 
maintenance treatment for opioid dependence. 


Buprenorphine-naloxone 


A newer formulation, also developed as a sublingual tablet and, 
more recently, as a sublingual film, contains both buprenorphine 
and naloxone (a ų , K, and 6-receptor antagonist with no agonist ef- 
fects) in a 4:1 ratio [85]. Buprenorphine-naloxone was specifically 
designed to minimize the risk of intravenous misuse among people 
who are opioid-dependent [85]. 

Buprenorphine-naloxone has clinical actions that are similar to 
those of buprenorphine. Addition of naloxone reduces the abuse 
liability of buprenorphine by attenuating the effects of its opioid 
agonist action and impacting withdrawal in certain circumstances 
[85]. When injected by opioid-tolerant individuals (an unintended 
route of administration), the formulation can precipitate un- 
pleasant withdrawal symptoms [86]. This is because a sublingual 
dose of naloxone has a potent antagonist effect if injected, con- 
sistent with its use in emergency reversal of acute adverse opioid 
effects [1]. 


Long-acting (depot and implant) formulations 
of buprenorphine 


There is considerable interest in the development of long-acting 
(depot and implant) preparations of buprenorphine that would 
potentially improve adherence and pharmacological stability and 
allow the patient to undertake activities without having to attend 
their local pharmacy/clinic on a daily basis. Studies are under way 
to evaluate a number of different long-acting formulations. This is 
likely to become a significant treatment option in many countries 
over the next few years. 


Other opioid agonist treatments 


The use of diacetylmorphine (heroin) as maintenance treat- 
ment was investigated decades ago, and again more recently [87]. 
Unfortunately, the stigma associated with heroin use means 
reasoned debate about its utility as a maintenance agent has been 
limited. Its use as a therapeutic agent is limited by its illicit status in 
many countries, but some countries have adopted the use of heroin 
as a maintenance agent for heroin dependence [88, 89]. 

Other opioid agonist treatments include dihydrocodeine 
and long-acting morphine [90]. Some countries have opted for 
introducing injectable opioid treatment for those people who do not 
respond well in first-line treatment with oral methadone or sublin- 
gual buprenorphine, for example the UK [87], Switzerland [91], and 
the Netherlands [88]. 


Benefits and risks of agonist maintenance treatments 


Opioid agonist treatments result in significant reductions in the 
negative health consequences and adverse effects of heroin depend- 
ence on public order [92]. There is strong evidence that methadone 
and buprenorphine are effective in reducing drug use [65, 93] and 
the spread of HIV [94]. Opioid agonist treatments have been dem- 
onstrated to improve physical and mental health and social func- 
tioning [65, 93] and to substantially reduce criminality [65, 95]. 
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Demand for treatment, however, typically far exceeds the number of 
treatment places that are needed [96]. 

Despite its advantages, methadone is a full opioid agonist, and 
there is no ceiling to the level of respiratory depression or sedation 
which methadone can induce; thus, an overdose can be fatal [97]. In 
some countries and settings, the inconvenience of daily dosing and 
clinic visits may be unattractive to clients, and restrictions imposed 
by the daily dosing schedule on clients’ general lifestyle and on op- 
portunities to sustain employment may also limit its acceptance by 
people who are opioid-dependent. 

The provision of unsupervised doses of methadone or 
buprenorphine results in diversion of the drug for illicit use by 
those not in treatment. The extent of this problem varies within 
and between countries, reflecting different policy and regulatory 
approaches. Under supervised dosing conditions, buprenorphine 
may be easier to divert than methadone, given its tablet formula- 
tion. It may also be easier to prepare for injection (an unintended 
route of administration), hence the development of buprenorphine- 
naloxone. Since buprenorphine is a partial agonist, it is safer in over- 
dose [98]. Other benefits of buprenorphine may include an easier 
withdrawal phase and, because of a longer duration of action, the 
option of alternate-day dosing. 


Management of opioid withdrawal 


In opioid-dependent patients, management of opioid withdrawal 
alone does not represent a treatment in and of itself, since relapse 
to opioid use is common following medication cessation [99]. 
Symptomatic treatment of the physical effects of opioid withdrawal 
with clonidine or lofexidine is sometimes used in settings where it 
is not possible to prescribe effective opioid substitution. However, 
buprenorphine is more effective than clonidine or lofexidine for 
the management of opioid withdrawal and may offer some ad- 
vantages over methadone in terms of quicker resolution of with- 
drawal symptoms and possibly slightly higher rates of completion 
of withdrawal [100]. 


Naltrexone for relapse prevention 


Naltrexone is an opioid receptor antagonist and so can prevent an 
individual who was formerly using opioids from experiencing the 
desired effects. Oral naltrexone is not widely used; its effects are 
not attractive to the target population, achieves limited adherence, 
and does not provide any additional benefits to other standards of 
care in terms of relapse prevention [101]. Individuals can only com- 
mence naltrexone following a withdrawal programme; otherwise, it 
can result in severe and prolonged withdrawal symptoms. It appears 
to be suited only to a small and highly motivated formerly opioid- 
dependent people seeking to maintain abstinence [102]. Depot and 
implant formulations of naltrexone may have greater utility, but cur- 
rent data are only preliminary. 


Psychosocial interventions 


In the absence of evidence for the effectiveness of psychosocial 
treatments alone, they are usually adjunctive to opioid maintenance 
treatment [95, 103-104]. 


Psychological interventions 


Psychological interventions can range from unstructured supportive 
counselling and motivational interviewing to highly structured 
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techniques. The choice will depend on the available resources, ap- 
propriateness to the individual’s situation, patient acceptability, staff 
training, and cultural appropriateness. 

The two main evidence-based therapy approaches include cogni- 
tive behavioural therapy (CBT) [105] and contingency management 
[106]. Applied to opioid dependence, CBT is based on the principle 
that addictions are learnt behaviours that can be modified most 
effectively by changing the faulty cognitions that maintain behav- 
iour and by promoting positive cognitions and behaviour change. 
Variants of this approach include ‘cognitive therapy’ and ‘motiv- 
ational enhancement therapy. CBT is typically delivered by trained 
clinicians [107]. 

Somewhat differently, contingency management assumes that 
behaviours are underpinned by conditioned learning, specifically 
classical and operant conditioning [106]. This approach aims to 
extinguish the classically conditioned responses using techniques 
like cue exposure, and to reinforce positive behaviour using instru- 
mental conditioning where behaviours, such as abstinence, are re- 
warded using strategies such as contingency management. Typically, 
contingency management programmes clearly define the desired 
behaviour (for example, opioid-free days), conduct regular moni- 
toring (for example, urine drug screening), specify rewards such 
as money or vouchers, and provide personalized, positive feedback 
from the treating staff. 


Social interventions 


Social interventions provide practical assistance in addressing basic 
needs, such as food or housing, as well as education and employ- 
ment support, and access to broader community resources (such 
as government benefits, recreational/leisure opportunities, etc.) 
[107, 108]. Typically, the patients needs are assessed and met using 
a key worker or a case management system outside the treatment 
service) [107, 108]. 


Self-help groups 


Typically, these groups are abstinence-based and provide emotional 
support. Many also promote a particular ideology or values through 
which members may develop a strong sense of identity. The most 
common framework for self-help groups is the 12-Step programme 
(for example, Narcotics Anonymous). Little research has been con- 
ducted on this approach, but observational studies indicate positive 
treatment effects. Self-help groups are inexpensive and are a valued 
source of psychosocial support among participants. An important 
component is voluntary participation. 





Clinical management and considerations 


Assessment and diagnosis of opioid dependence 
Substance use 


A detailed substance use history takes into account the use of all psy- 
choactive substances, including illicit and pharmaceutical opioids, 
alcohol, cannabis, stimulants, and benzodiazepines. The assessment 
should establish past and current substance use, duration of use, 
quantity and frequency of recent use, route of administration, and 
time of last use [107-109]. In addition to current substance use, it is 
important to assess the patient’s experiences of previous treatment 


episodes, including their perspectives on what worked previously, 
the factors leading to relapse, and the types of treatment they are 
prepared to consider [107]. 


Physical examination 


Physical examination should include assessment of intoxication and 
withdrawal (as observable indicators of neuroadaptation), relative 
to the patient’s report of last drug use. Withdrawal severity can as- 
sist in indicating the severity of dependence, as well as the timing 
and amount of the first dose of methadone or buprenorphine [109]. 
Recent use of opioids increases the likelihood of withdrawal being 
precipitated by buprenorphine or naloxone. Intoxication with cen- 
tral nervous system depressants, such as alcohol or benzodiazepines, 
in combination with methadone or buprenorphine, increases the 
risk of overdose [109]; this is a critical consideration during induc- 
tion and needs ongoing assessment and review. 

Among people who inject opioids, physical examination of in- 
jection sites can also provide useful information on the timing and 
duration of injecting drug use, as well as any related complications 
such as site infections [107-109]. A combination of recent and old 
injection sites will normally be seen in an opioid-dependent patient 
with current neuroadaptation [107]. 


Urine drug screening 


Urine drug screening is useful to corroborate patient history 
and establish recent opioid and other substance use [107-109]. 
Negative urine drug screening, combined with the absence of 
withdrawal symptoms, indicates caution is needed in the initiation 
of opioid maintenance therapy or other sedative medication [109] 
due to increased risk of overdose. Urine testing, in combination 
with patient history, is also useful in identifying other substances 
that have recently been consumed. International guidelines rec- 
ommend against using a naloxone challenge to confirm current 
neuroadaptation, as this can induce significant withdrawal effects 
unnecessarily [107]. Delays in obtaining urine drug screening re- 
sults should not delay treatment initiation where the diagnosis can 
be clearly established [107-109]. Urine drug screening can be per- 
ceived as punitive by patients, so it is important to explain the pur- 
pose clearly [109]. 


Other investigations 


Investigations for other conditions, either related to the presenting 
condition or related to the patient’s drug use (for example, BBVs, 
liver disease) should also be undertaken to inform future care plan- 
ning. A high proportion of people who inject drugs may also be 
infected with HBV or HCV. This is unlikely to pose problems for 
induction of methadone or buprenorphine, unless advanced liver 
disease is detectable at clinical examination [109], but it is an im- 
portant clinical consideration. Voluntary testing for HIV, hepatitis 
C, and common infectious diseases should be offered as part of an 
individual assessment, accompanied by counselling before and after 
the test [107]. Serology testing and vaccination for hepatitis B are 
also recommended [107]. Other infectious diseases that should be 
taken into consideration include tuberculosis and sexually trans- 
mitted infections. 

Pregnancy testing should be offered to all women, particularly 
those contemplating opioid withdrawal, because it may influence 
the choice of treatment [107]. 


Diagnosis of opioid dependence 


Typically, opioid dependence is established according to standard 
definitions in the ICD or DSM [107-109]. It can be more challen- 
ging to assess dependence in a patient using pharmaceutical opi- 
oids for chronic pain. Tolerance and withdrawal can arise due to 
neuroadaptation associated with regular use of opioids for pain, and 
on their own, these features are not sufficient to diagnose depend- 
ence in this context. Features consistent with diminished control 
over opioid use (for example, multiple dose escalations, unsanc- 
tioned routes of administration, use for reasons other than pain, 
etc.) should also be examined. The diagnosis of opioid dependence 
is typically based on the patient history, corroborated by physical 
examination and/or other investigations [107]. 


Other biopsychosocial considerations 
General health 


Beyond establishing substance use and opioid dependence, it is good 
practice to assess the general health of patients being considered for 
methadone or buprenorphine maintenance treatment. The aim is to 
identify unmet health care needs and any health problems that could 
interact with methadone or buprenorphine [108]. Such assessments 
might include: past medical issues, operations, injuries, and periods 
in hospital; current prescription medications; psychiatric history; 
drug-related complications such as injection-related injuries or 
diseases; family planning and contraception; sexual health; dental 
health; and allergies or sensitivities [108]. Additional investigations 
may include liver, thyroid, and renal function and haematological 
indices [108]. 

Patients with prolongation of the QTc interval (that is, >500 ms 
between the onset of ventricular depolarization and completion of 
ventricular repolarization) who are treated with methadone and, less 
commonly, buprenorphine are at increased risk of adverse cardiac- 
related events [109]. 


Psychiatric comorbidity 


Depression, anxiety, and post-traumatic stress disorder are common 
among people who are opioid-dependent [110], and psychiatric as- 
sessment and referral should be incorporated into care planning. 


Social issues 


Social exclusion and issues, such as housing, financial, and relation- 
ship problems, are also important clinical considerations and may 
impact on the choice of treatment and setting. 


Risk assessment 


Assessing risks that may impact on the safety or efficacy of treatment 
is essential, including risk behaviours such as overdose, polydrug 
use, problematic alcohol use, unsafe injecting practices, and unsafe 
sex. Wider risk may include self-harm or risk of harm to others. 
Risks to dependent children should also be assessed and responded 
to, according to local child protection requirements [108]. Risk as- 
sessments should be ongoing throughout treatment and involve the 
relevant health and welfare providers. 


Care planning 


The challenge in the management of opioid dependence does not 
simply come from the need for knowledge and expertise around 
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diagnosing and managing opioid dependence. A structured and 
compassionate approach to some of the more complex aspects of 
drug dependence and related lifestyle complications is required to 
assess and plan sensible care. In the first instance, current injectors 
will need advice about harm reduction, safer injection practices, and 
overdose prevention and management. Discussion about the choice 
of agonist maintenance treatment should take into account patient 
preferences, but local policies and guidelines will also be important 
factors. 

Current evidence suggests that key treatment outcomes for metha- 
done and buprenorphine are comparable under optimal treatment 
conditions. Other factors that may impact upon treatment choice 
include patient or clinician preferences and previous responses to 
treatment. There may be considerable individual variation in absorp- 
tion, metabolism, and clearance of methadone or buprenorphine. 
Patients who continue to use opioids, while receiving high doses of 
buprenorphine, may be better suited to higher doses of a full agonist 
like methadone. On the other hand, the less sedating effects of 
buprenorphine may be preferred by some patients. 


Specific groups and issues 


There are some patient groups with additional or complex needs, 
and specific treatment settings that may raise important consider- 
ations in planning treatment of opioids dependence. 


Adolescents 


Some adolescents are brought to the clinic by families concerned 
about their drug use. Others come from socially disadvantaged 
backgrounds, may be homeless, and may have more severe presen- 
tations. Assessment of adolescents should include medical, psycho- 
logical, education, family, and other aspects of the adolescent's life. 

Treatment approaches should take into account the higher levels 
of risk-taking, novelty-seeking, and peer pressure among adoles- 
cents, compared to older people. Treatment needs to be individu- 
alized and comprehensive, taking into consideration an adolescent's 
strengths, psychosocial supports, education, legal status, health, and 
pattern of illicit drug use. Agonist maintenance treatments in ado- 
lescent populations need further study, but early findings are prom- 
ising [111, 112]. The clinical profile of opioid-dependent adolescents 
can be complex, with a high prevalence of comorbid depression, 
post-traumatic stress disorder, conduct disorder, and attention- 
deficit/hyperactivity disorder. Where there is supportive family or 
carers, family counselling may be helpful. 


Women 


Women may be more vulnerable to the adverse medical and social 
consequences of opioid dependence. Compared to men, women 
with opioid dependence tend to have lower levels of education, fewer 
financial resources, and elevated histories of childhood trauma and 
physical and sexual abuse [113, 114]. 

Women typically use less quantity of opioids than men but transi- 
tion more rapidly to dependence and treatment-seeking, with more 
benzodiazepine use [109]. They are more likely to have childcare re- 
sponsibilities that could impede their access to treatments, and they 
may be reluctant to participate in group support activities with men 
[113]. Some evidence indicates that women are less likely to drop out 
of women-only programmes, which is likely to lead to better out- 
comes [113]. However, data are lacking on the relative efficacy of 
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gender-specific services for women. To be responsive to the needs 
of women, it has been suggested services need to provide individual 
or female-only group counselling and childcare facilities and take 
measures to prevent sexual harassment of female patients by male 
staff [109]. 


Pregnancy and breastfeeding 


Maintaining or initiating opioid agonist maintenance treatment is 
the preferred approach to the management of opioid dependence in 
women who are pregnant or breastfeeding [108, 115]. Methadone 
is frequently preferred to buprenorphine because it has been in use 
longer in this context [107, 109]. Among opioid-dependent women 
who are pregnant, methadone is associated with improved maternal 
and neonatal outcomes, compared to the pregnancies of women not 
receiving opioid agonist treatment. Although both methadone and 
buprenorphine are associated with neonatal abstinence syndrome, 
this can be resolved by administering morphine with supportive 
care [108]. 

Detoxification during pregnancy is not recommended. In the first 
trimester, detoxification carries significant risk of miscarriage. In the 
third trimester, it can cause significant fetal distress and premature 
labour [109]. Typically, pregnant women are encouraged to remain 
on agonist maintenance therapy during pregnancy and the postnatal 
period [107]. Agonist maintenance treatment is thought to have 
minimal long-term developmental impacts on children, when com- 
pared to the risk of maternal heroin use and resulting harms [109]. 

Both methadone and buprenorphine are detectable in breast 
milk at low levels, which are not thought to affect infant outcomes 
[107]. Thus, opioid-dependent mothers are normally encouraged to 
breastfeed; however, where there is polydrug dependence, caution 
may be necessary. 


Blood-borne virus infections and tuberculosis 


Early in the HIV epidemic of the 1980s, it was established that provi- 
sion of sterile injecting equipment and increased coverage of opioid 
dependence treatment could reduce the spread of BBVs. This ap- 
proach combines a public health strategy with a high standard of 
individual treatment. 

England and Australia, for example, have achieved major success, 
with <2% of people who inject drugs contracting HIV over the past 
few decades [116]. Unfortunately there are many countries, particu- 
larly in the Asia Pacific region, Eastern Europe, and Russia, that still 
report major rates of HIV infection among injecting drug users [116]. 

For those who are infected, combined treatment of both HIV 
with available antiretroviral medications and agonist maintenance 
treatment to manage the opioid dependence can achieve reasonable 
levels of adherence [116]. 

The new treatments for hepatitis C are also acceptable to patients 
and are associated with very high levels of clearance [117]. The chal- 
lenge for treatment systems is to develop integrated approaches that 
enable all current and former injectors to benefit. 

Tuberculosis is a new and increasing concern because of multi- 
resistant cases in vulnerable populations of prisoners [118], the 
homeless, and migrant populations. 


Forensic and prison populations 


Between a third and a half of most prison populations have a history 
of opioid dependence. 


This poses risks for individuals both at the point of entry and at 
the point of departure from prison. On entry, there is a need for 
a proper assessment of both physical and mental health status and 
plans for appropriate care and management of their drug depend- 
ence. Ideally, the provision of ongoing maintenance should be made 
available [119, 120]. This will reduce the risk of both suicide and un- 
natural death during the period of imprisonment [119]. It will also 
reduce very significantly the risk of overdose death in the first month 
after release [121]. In addition, provision of this treatment should 
help to engage individuals in help and support and care for other 
aspects of their mental and physical well-being, including possible 
treatment for hepatitis C [122]. 

Most individuals are in prison for short periods of time. A system 
works best when it aims to create continuity between treatment in 
the prison and treatment in the community that works to promote 
health and well-being and to assist in the reintegration into the com- 
munity after prison release. 


Psychiatric comorbidity and polysubstance use 


There are well-recognized overlaps between substance depend- 
ence and psychiatric comorbidity, so all assessments need to take 
into consideration the risks for self-harm and suicide and manage 
appropriately. However, polysubstance use poses a significant chal- 
lenge for treatment planning and ensuring appropriate prescribing. 
Such individuals require careful assessment, ideally by a specialist 
practitioner who has both inpatient facilities and community ar- 
rangements where daily supervision of prescribed medication can 
be undertaken. Careful and cautious approaches can stabilize such 
individuals and reduce the immediate risk of overdose death. 


Conclusions 


Over the past few decades, the treatment of opioid dependence has 
been well developed and the benefits and outcomes of treatment have 
been well documented. It has, however, remained a controversial 
treatment. The advent of prescription opioid dependence has further 
highlighted some of the complexities for doctors prescribing opi- 
oids in the longer term. The lifesaving benefits of good management 
of opioid dependence are beyond question. The opportunity to use 
such treatment to engage individuals in a broader range of medical 
and social interventions that will assist them to get their lives back on 
track is substantial. The issues of addiction in modern medicine are 
such that they require all competent and trained clinicians to have a 
full knowledge of basic assessment and management of drug depend- 
ence as core clinical skills. We look forward to the day when all doctors 
grasp this opportunity and extend it to their most vulnerable patients. 
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Cannabis sativa has long been of interest to psychiatrists, mostly, it is 
reasonable to say, for its perceived negative effects on mental health 
in certain key domains, including mood, psychosis, and cognition. 
The last decade, in particular, has seen a flurry of activities aimed at 
trying to understand the associations between psychotic disorders, 
such as schizophrenia, and exposure to cannabis. There has also 
been increasing concern about negative effects on cognitive func- 
tioning in people who use large quantities of high-potency cannabis. 

This occurs in the context of major changes to the potency of the 
plant product, as new strains and hydroponically grown cannabis 
have increasingly higher concentrations of the main psychotomim- 
etic constituent delta-9-tetrahydrocannabinol (THC) and less of the 
putatively antipsychotic constituent cannabidiol (CBD). Also, the 
evolution of numerous synthetic cannabinoids makes tracking and 
policing of THC-like chemicals very difficult. 

On the other side is a broader global debate about decriminaliza- 
tion, and even legalization, of cannabis. In part, this is driven by the 
clear failure of the ‘war on drugs’ to curb cannabis consumption and 
the understanding that criminal legislation and policing approaches 
to curtailing use are too simplistic and take up massive policing 
and legal resources. Also, many people believe that for most people, 
modest consumption of cannabis is not particularly harmful; one 
could argue that alcohol is a far more destructive drug at a popu- 
lation level and is associated with a myriad of physical and mental 
health problems. 

Also, there is a resurgence of interest in cannabis as medicine. 
There is a long history of cannabis being used for various medical 
indications, including certain severe epilepsies, chronic pain syn- 
dromes, nausea associated with chemotherapy, painful spasm as- 
sociated with multiple sclerosis, and appetite stimulation in people 
with cancer and HIV-AIDS. Thus, numerous jurisdictions around 
the globe are making cannabis available as a medication, and some 
are going further than this and essentially legalizing it. All these 
aspects are important in appreciating the place of cannabis in society 
and in informing debate about its availability. For a thoughtful dis- 
cussion of these matters, including policy implications, the reader is 
referred to Hall and Degenhardt [1]. 

Henceforth, this chapter focuses on the psychiatric aspects of 
cannabis. A brief overview is provided of the cannabis plant and its 





constituents, followed by a discussion about the human cannabinoid 
system. The impact of cannabis on anxiety and mood symptoms is 
then covered, followed by a consideration of cannabis as a psych- 
otomimetic agent and its impact on psychotic disorders, including 
whether it can be considered a cumulative causal factor for schizo- 
phrenia. Finally, the impact of cannabis on cognitive functioning, 
both short- and long-term, is discussed. 

















Cannabis sativa has been used for centuries by humans for its effects 
on the mind. Mostly it is smoked, sometimes mixed with tobacco. 
As it burns very hot, the smoke is often bubbled through water to 
cool it for ease of inhalation. It can also be eaten, usually cooked in 
a cookie or cake. It can also be made into an oily formulation that 
can be imbibed as an oil or in capsule form. Inhaled cannabis enters 
the brain within seconds and has a very abrupt onset of CNS effects, 
while eating the product results in a much slower onset of action. 

The plant is constituted of numerous chemicals, the most im- 
portant of which, for psychiatry at least, is THC, the major psych- 
otomimetic agent in the plant. There is also growing psychiatric 
interest in another cannabinoid, namely CBD, which appears to 
have some antipsychotic properties [2], as well as anti-inflammatory 
and neuroprotective effects [3]. Thus, the ratio of CBD to THC in the 
plant can be construed as an indicator of its potency in terms of its 
ability to induce psychotic symptoms. 

There is a wide variation in the THC content of Cannabis sativa, 
dependent upon the precise strain, the manner in which it is grown, 
the part of the plant imbibed, and the manner in which it is prepared. 
Generally speaking, the leaves are less potent than the flowering 
parts [4]. The age of the plant is also of relevance, as the chemical 
constituents change with age; for example, THC is degraded to CBD. 
There is some confusion about the terms used to describe various 
preparations of cannabis. Here we follow the suggestion of Slade 
et al. [5]—thus, marijuana (also known as ‘herbal’) is a combination 
of buds and leaves of pollinated female plants, usually cultivated 
outdoors; sinsemilla (also known as ‘skunk’) comprises unfertilized 
buds of the female plant, usually cultivated indoors; and cannabis 
oil (also known as ‘hash oil’) is an oily product produced by solvent 
extraction or distillation of the plant product. 





SECTION 8 Substance use disorders 


There are good longitudinal data to suggest that street cannabis is 
becoming more potent in terms of THC content, expressly over the 
last two decades. Slade and colleagues [5] provide a comprehensive 
overview of longitudinal drug seizure data from the United States 
covering the period 1975-2009. Two main trends emerge, namely 
that an increasingly higher proportion of products is sinsemilla, ra- 
ther than herbal marijuana and that the THC content of cannabis 
of all forms (that is, marijuana, sinsemilla, and hash oil) has in- 
creased, from under 1% in 1975 to around 10% in 2009. In terms 
of the CBD:THC ratio, for sinsemilla, this was around 9.5% over 
the period 1981-1996, dropping to 5.6% in 1997-2009 [5]. Thus, 
in terms of both THC content and DBD:THC ratio, street cannabis 
is becoming more potent. Data from jurisdictions other than the 
United States point in the same direction. This has important impli- 
cation for psychiatry, as described in the next section. 


The cannabinoid system 


The human cannabinoid system is increasingly better understood. 
This has been spurred inter alia by the synthesis of THC and the 
discovery of the cannabinoid CB1 and CB2 receptors and the nat- 
urally occurring substrate of the CB1 receptor, anadamide. The CB2 
receptor is expressed mainly in the spleen and testis and seems to be 
associated with immune function and, to a lesser extent, in the CNS 
where it has a role in inflammatory and degenerative processes [6]. 
We shall not address the CB2 system further here. 

The CB1 receptor is expressed diffusely throughout the CNS, 
including the frontal cortex, basal ganglia, caudate, anterior cingulate 
cortex, cerebellum, hippocampus, and hypothalamus. The receptor 
is predominantly presynaptic and plays a role in the mediation of 
release of a number of neurotransmitters, including dopamine, nor- 
adrenaline, acetylcholine, glutamate, serotonin, and GABA. It is the 
receptor subtype relevant to the drug’s psychotropic effects. 

The advent of antagonists at the CB1 receptor allows a dissection 
of cause and effect in terms of what is mediated by the CB1 system. In 
particular, the CB1 receptor antagonist rimonabant (also referred to 
as SR141716) was employed in a number of such studies and showed 
great promise in its effects on weight reduction and reduction in 
hypercholesterolaemia and diabetes in animal and human experi- 
ments. Unfortunately, it carried a risk of depression and suicidality, 
so it was withdrawn from the market [7]. 

The cannabinoid system has a wide range of functions, including 
the modulation of pain (there are close interactions with the opiate 
system), memory, psychomotor control, and sleep and appetitive 
functions. Iversen [6] has summarized these main effects and out- 
lined the major neural networks involved; this is summarized here, 
and the reader is referred to Iversen [6] for a full list of references. 
THC has acute effects on motor control, with a tendency to impaired 
fine motor movements and a wide-based gait, suggestive of cere- 
bellar pathology. Impaired short-term memory is well described in 
the setting of cannabis intoxication, with particular impairment of 
attention, presumably mediated by hippocampal pathways. There 
has been a suggestion that this might indicate a role for THC in 
the extinction of aversive memories in post-traumatic stress dis- 
order. There is a well-established association of THC exposure with 
a craving for high-fat, high-sugar foods (the so-called ‘munchies’), 
operating via hypothalamic mechanisms. This has led to the use 


of THC and synthetic cannabinoids in the stimulation of appetite 
in people with HIV-AIDS and some cancers. The appetitive effect 
and weight gain are blocked by rimonabant, which had therapeutic 
use in weight reduction, interestingly initially through a reduction 
in craving such foodstuffs, but after a number of weeks via direct 
metabolic effects. THC also has established anti-nausea proper- 
ties and has been used in amelioration of nausea associated with 
chemotherapy. 

Cascio and Pertwee [7] provided an overview of the effects of 
THC on acute and chronic pain syndromes, pointing out the distri- 
bution of CB1 (and CB2) receptors on pain pathways in the brain 
and spinal cord, as well as peripheral sensory nerves. There is an 
intricate interaction between the cannabinoid and opioid systems 
which might explain some of the analgesic effects. The synthetic can- 
nabinoid receptor agonist nabilone has been shown to ameliorate 
chronic neuropathic pain, headache, and fibromyalgic pain, while 
Sativex (which contains both THC and CBD) has been employed 
in the management of pain associated with multiple sclerosis and 
advanced malignancies (for references, see [7]). 

A matter which is still debated is whether THC is an addictive 
substance. Certainly, animal models show that tolerance develops 
to many of the behavioural and other effects of THC, and there is a 
withdrawal syndrome [8]. A dependence syndrome has also been 
shown in humans, albeit tolerance does not necessarily occur for 
all its effects and many users do not increase their use once they 
have reached some sort of plateau. Withdrawal is not as rapid and 
dramatic as with substances such as nicotine or alcohol—the highly 
lipophilic nature of THC means that it remains for weeks in the 
brain and is excreted only slowly upon cessation in regular heavy 
users. This said, withdrawal effects which have been reliably docu- 
mented include sleep difficulties, loss of appetite and weight loss, 
and, in some individuals, restlessness, irritability, and anger [9]. This 
syndrome can be precipitated by rimonabant in animal models [10]. 


Effects on anxiety and mood 


Many users of cannabis report that it reduces anxiety, enhances 
mood, and results in a relaxed sense of well-being. Its sedative prop- 
erties are also reportedly helpful in individuals with chronic sleep 
disorders, albeit this has not been an area of rigorous scientific study. 

In terms of anxiety, animal work has found that CB1 receptor 
knockout mice show increased anxiety-related behaviours, while 
CB1 receptor antagonists can reduce such behaviours in animal 
models of anxiety [7]. Also, rimonabant caused substantial anxiety 
symptoms in some individuals. This suggests that the cannabinoid 
system might have a potential therapeutic role in anxiety disorders; 
however, this has not been a major focus of scientific endeavour to 
date. Also, paradoxically, some individuals suffer extreme anxiety, 
and even panic attacks, upon exposure to THC; presumably this is 
mediated, in part at least, by tachycardia induced by THC, which 
could precipitate panic in vulnerable individuals. 

The mood effects of cannabis are complex. It is very well known 
that acute intoxication is associated with a child-like mirthful 
laughter which is very intoxicating, but short-lived. Some early re- 
ports of cannabis having more enduring mood effects, and even 
being effective for some people with treatment-resistant depres- 
sion, have not been supported by rigorous studies. Indeed, most 


work on mood effects of cannabis has reported an association with 
depression. However, albeit there is an established increased risk 
of depressive disorders in habitual cannabis users, the association 
is confounded by numerous psychosocial factors, including socio- 
economic status, as well as by the use of other substances such as 
alcohol. The most informative studies in this area are cohort studies, 
which follow large groups of people prospectively and attempt to 
control for confounders, as well as establish the temporal sequence 
between exposure (that is, cannabis) and the putative outcome (de- 
pression). Degenhardt et al. [11] reviewed such studies and con- 
cluded that there is a lack of consistent evidence supporting a causal 
relationship between cannabis exposure and depression or suicide. 
They point out the inconsistency in the data and variable method- 
ology. Most such studies have been performed in young people, and 
cannabis/depression associations that do appear to survive control- 
ling for confounding factors are strongest for young females. 

There has been surprisingly little scientific scrutiny of cannabis 
in relation to bipolar disorder, despite its widespread use by people 
with this malady, and reports of benefits in both mood and sleep 
modulation. Again, large prospective studies are required to inform 
our understanding of cannabis use in the context of a bipolar di- 
athesis. Silberberg et al. [12] selectively reviewed some such studies 
and found fairly mixed results, albeit the larger and more methodo- 
logically robust studies suggested worse outcomes for people with 
bipolar disorder who use cannabis. For example, van Rossum et al. 
[13] studied 3459 bipolar patients and found that those who were 
cannabis users had more manic relapses and worse psychosocial 
outcomes, but they were also less likely to adhere to medications, 
and one cannot exclude the possibility that people with more severe 
illness are inherently more likely to use cannabis. 

There is more debate about whether cannabis use can independ- 
ently increase the risk of bipolar disorder and/or bring forward 
the onset. In a general population study of 4815 people, Henquet 
et al. [14] reported an association between cannabis consumption 
and later manic symptoms, which was robust after controlling for a 
number of sociodemographic confounders, as well as for the evolu- 
tion of psychotic symptoms. Similarly, van Laar et al. [15] reported 
a 5-fold increased risk of bipolar disorder associated with cannabis 
use in a sample of 3881 individuals from the general population. 
Some evidence suggests that cannabis use is associated with an 
earlier onset of bipolar disorder. Overall, it seems sage to suggest 
that cannabis might increase the risk of bipolar disorder in vulner- 
able individuals and also result in an earlier first manifestation. Also, 
cannabis use in people with an established illness should be strongly 
discouraged. That said, attention to other substances of abuse, as 
well as broader psychosocial and illness-related parameters and ad- 
herence to medication, is very important. 


Cannabis and psychosis 


Arguably, consumption of cannabis can produce psychotic phenomena 
in anybody, but individual predisposition depends upon experience 
with the drug, the context in which it is consumed, the THC content 
(and the CBD:THC ratio), and crucially the individuals psychosis 
proneness. The latter is a measure of how vulnerable the individual is to 
the manifestation of psychotic symptoms. It is well known that anything 
up to 15% of the population have experienced psychotic phenomena at 


CHAPTER 52 52 Cannabis and mental illness 


some time, and elegant work from Verdoux and colleagues [16] has 
shown that increased psychosis proneness is associated with a higher 
likelihood of psychotic symptoms on exposure to cannabis. 

People with established psychotic illnesses, such as schizophrenia, 
have a very high psychotic predisposition and thus are likely to 
manifest psychotic symptoms when imbibing cannabis. This has 
been confirmed in people with schizophrenia given intravenous 
TCH [17]. This is certainly seen clinically, and the negative impact of 
cannabis on the longitudinal course of schizophrenia is well recog- 
nized in longitudinal studies, albeit there is still some debate about 
the direction of causality and the contribution of confounding fac- 
tors such as other illicit substances [18]. 

What is also of interest is that people with psychotic disorders 
still remain liable to psychotic relapse upon using cannabis, even if 
they are taking medications which effectively block dopamine D2 
receptors. Indeed, D’Souza and colleagues [17] found that people 
with schizophrenia were more likely than healthy controls to experi- 
ence psychotic symptoms when exposed to intravenous THC, even 
when they were on dopamine antagonist drugs. Thus, the induction 
of psychotic symptoms by THC is more complex than simply a re- 
lease of dopamine. What is clear is that the effects are mediated by 
the CB1 receptor. Thus, Huestis and colleagues [19] administered 
rimonabant to volunteers, then gave them a cannabis cigarette to 
smoke, and found that rimonabant effectively blocked the psychoto- 
mimetic effects of cannabis. 

An important area of investigation is to understand why people 
with established psychotic disorders smoke cannabis to the extent 
that they do. A number of studies have explored this and have gen- 
erally concluded that the motivation lies largely in the area of alle- 
viation of negative affect (for example, boredom, insomnia, anxiety, 
depression) [14, 20]. Importantly, very little support has been found 
for the notion that people with schizophrenia ‘self-medicate’ positive 
symptoms or try to alleviate medication side effects with cannabis; if 
there is a ‘self-medication component, it is the negative, rather than 
the positive, symptoms of psychosis that are targeted by users. This 
has clinical applicability in that understanding the reasons for use 
can help engage individuals in effective programmes to assist them 
in reducing their cannabis use [21]. 

There are also fairly consistent data to support the notion that can- 
nabis use in youth can ‘bring forward’ the onset of schizophrenia. 
For example, using a large Australian data set, Stefanis and col- 
leagues [22] estimated this effect to be of the order of 7-8 years, 
which obviously has a major impact on the individual, as they stand 
to lose a substantial period of early adulthood to the illness. Not all 
studies [23] have reported this effect, and controlling for all potential 
confounders is difficult. 

A rather more contentious aspect of the cannabis/schizophrenia 
debate is whether cannabis consumption can actually contribute 
causally to schizophrenia—that is, do some people who have used 
cannabis and who later develop schizophrenia only manifest the 
disorder because they used cannabis? Clearly the vast majority of 
people who do use cannabis do not go on to develop schizophrenia, 
and the rate of schizophrenia across the globe does not vary in 
alignment with the rates of use of cannabis. Any studies in this area 
face the fact that some of the genes involved in the predisposition 
to schizophrenia also predispose to cannabis consumption [24]; 
there are many potential confounding factors, including socio- 
economic factors and the use of other illicit drugs—and even the 
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licit drugs alcohol and tobacco—and it is hard to establish the true 
temporal sequence of exposure (that is, cannabis) and outcome (that 
is, schizophrenia). Many people with schizophrenia have very pro- 
tracted prodromes (which may last for years) and are characterized 
largely by negative affect, which could (as outlined previously) dir- 
ectly lead to the consumption of cannabis and spurious assumptions 
about the direction of causality [25, 26] 

Nevertheless, high-quality longitudinal cohort studies are now 
available that have fairly consistently reported an association be- 
tween the use of cannabis in youth and later psychotic symptoms, 
and so support a causal connection [27]. There are fewer such studies 
with a specific schizophrenia outcome, but Zammit and colleagues 
[27] have provided a thorough overview of this area and suggest 
there is sufficient evidence to support public health campaigns re- 
garding the potential risk of cannabis use and later schizophrenia, 
albeit conceding that the ‘number needed to prevent? (NNP) one 
case of schizophrenia is very high; Hickman et al. [28] suggested that 
for every prevented case, some 2800 males (95% confidence limits 
2018, 4530) aged 20-24 would need to become entirely abstinent (the 
estimates are even higher for older people and females). These esti- 
mates do not account for the fact that higher-potency cannabis (that 
is, sinsemilla) appears to carry a particularly strong association with 
schizophrenia [29], and this might be a specific focus of public health 
education and regulation where cannabis is effectively legalized. 

Health campaigns could also target those individuals who seem 
particularly liable to develop schizophrenia upon exposure to can- 
nabis. It seems obvious to warn people with psychosis proneness, 
those who have experienced a particularly strong psychotic reaction 
to acute exposure to cannabis, and those with a family history of 
schizophrenia. Certain genetic profiles may also signal vulnerability. 
For example, Caspi et al. [30] reported that, in the Dunedin cohort 
study, individuals homozygous for the valine allele at Val-158-Met 
within the catechol-O-methyltransferase (COMT) gene were more 
vulnerable than those with other variants to develop a psychotic 
disorder upon exposure to cannabis before the age of 18 years. 
This finding has not been convincingly replicated, but more recent 
interest in the AKT1 gene does seem to be bearing fruit in terms of 
our understanding of interaction effects between certain genotypes 
and the use of cannabis [31]. 


Cannabis and cognition 


As outlined in previous sections, acute intoxication with cannabis 
carries with it associated impairment in cognitive functions, not- 
ably registration and short-term memory. What is more contentious 
and more worrying from a general and public health perspective is 
whether cannabis can cause lasting cognitive dysfunction. 

Solowij and Pesa [32] reviewed animal and human investigations 
of the effects of cannabis on cognition in both short- and long-term 
studies. They confirm that in the short term, there are effects on at- 
tention, memory, executive, and inhibitory processes. They also cite 
evidence of some longer-lasting effects in heavy users where early 
use (that is, adolescence) of high-potency cannabis seems most con- 
cerning. This reflects increasing concern about high-THC cannabis 
use at sensitive stages of brain development [33]. 

There is a well-described ‘amotivational’ syndrome associated 
with chronic heavy use of cannabis. Characteristics include elements 


of dependence, such as craving, salience, and prioritization of drug 
acquisition above other pursuits, as well as some cognitive impair- 
ment in the domains outlined previously [34]. A debated issue is 
whether these cognitive problems revert if the individual stops using 
cannabis. The weight of evidence is that most of the effects do not en- 
dure, as long as the cannabis is given sufficient time to wash out of the 
brain (and this might take up to a month) [35, 36]. However, some 
research suggests not all cognitive functioning returns to baseline. 

Neuroimaging studies have been highly inconsistent regarding 
structural brain abnormalities in association with prolonged heavy 
cannabis consumption. It is very difficult to control for the amount 
of exposure to cannabis, as well as to all potential confounding fac- 
tors, not least the use of other drugs, such as alcohol, which are 
known brain toxins. Yucel et al. [37] have reported a unique group 
of individuals who were heavy long-term cannabis users but who 
apparently had no history of the use of other drugs; there were subtle 
volume reduction in the hippocampus and amygdala. Other studies 
have failed to replicate these findings, and particularly when alcohol 
exposure has been controlled for [33]. What is rather more con- 
sistent has been the finding of abnormalities in the white matter and 
impaired neuronal connectivity, including hippocampal connec- 
tions and axons in the corpus callosum [38]. 


Conclusions 


Cannabis sativa remains the most widely used illicit drug worldwide. 
Its relevance to psychiatry lies in balancing its potentially harmful 
effects vs its potentially therapeutic benefits. Certainly, it is a drug 
of potential dependence but is less habit-forming than, for example, 
tobacco, and the withdrawal syndrome less dangerous than for, say, 
alcohol. It has effects on anxiety (these effects are somewhat dose- 
dependent), and it may have a therapeutic place in the treatment of 
some aspects of post-traumatic stress disorder. The association with 
depression has probably been overstated, and cohort studies suggest 
the association is more reverse causality than direct causality, and the 
population-attributable fraction (PAF) is very modest at around 2%. 
Effects on people with bipolar disorder have been under-researched, 
but the weight of evidence is that it is associated with an increased 
risk of manic relapse. THC, acting via CB1 receptors, can produce 
psychotic phenomena, notably in individuals with psychosis prone- 
ness and particularly in people with schizophrenia. It may also ‘bring 
forward’ the onset of schizophrenia by some years and be a cumu- 
lative causal factor for schizophrenia, albeit again the PAF is low 
(the highest estimates are 15%, but many researchers would put it 
around the 8% mark). Acute cognitive effects of cannabis are well 
described, as is an amotivational syndrome which appears largely re- 
versible with cessation of the drug. Some evidence suggests enduring 
neurocognitive effects in heavy users, but the extent of the effect is 
much lower than, for example, sustained heavy alcohol use. 

All this needs to be balanced by a consideration of the established 
and potential medical benefits of the cannabinoids. The role of CBD 
in the management of psychotic disorders is particularly alluring, 
as is the potential role of other non-THC cannabinoids in various 
disease states. It is hoped that moves to reignite the role of cannabis 
as medicine will have therapeutic benefits for many, and that the 
ongoing exploration of the cannabinoid systems will enhance our 
understanding and treatment of a number of psychiatric disorders. 
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Stimulant drugs, such as cocaine, amphetamine, and 3,4- 
methylenedioxymethamphetamine (MDMA), remain the most 
commonly used group of illicit psychoactive drugs after cannabis. 
Their use crosses cultures, age groups, functions, and socio-economic 
profiles. For most users, the risks related to acute intoxication will 
be the most commonly experienced harm, with dependence being 
relatively rare. While variations in precise neuropharmacological 
action, potency, preparations, route of synthesis, and administration 
lead to discrete populations of users adopting diverse patterns of use 
with associated risks and harms, stimulant drugs share a common 
neurobiology, with similar effect profile and treatment options. 
Dose-related intoxication with these drugs can result in acute psy- 
chopathology that may require assessment, treatment, and follow- 
up by the general psychiatrist. It is important to note that few people 
will use these drugs in isolation and that polydrug use contributes 
significantly to the harm profiles associated with these drugs. In re- 
cent years, the appearance of a wide range of novel psychoactive sub- 
stances (NPS) that mimic the effects of more traditional drugs has 
posed a challenge to policy-makers and added a level of unpredict- 
ability for both users and treatment providers [1]. The appearance 
of a more robust ‘chem-sex’ (this refers to the use of drugs to chem- 
ically enhance sexual experience) scene in the UK has led to drugs 
such as crystal methamphetamine, gamma hydroxybutyrate (GHB), 
and mephedrone being added to the list of those drugs psychiatrists 
need to be able to assess and treat most effectively within a multi- 
disciplinary team [2]. 











The ICD-10 definition for a dependence syndrome to any substance 
revolves around having at least three of the following symptoms pre- 
sent together for at least 1 month, or iflasting less than 1 month have 
been present together repeatedly in the last 12 months: a strong de- 
sire to take the drug, difficulties in controlling its use, persisting in 
its use despite harmful consequences, a higher priority given to drug 
use than to other activities and obligations, increased tolerance, and 
sometimes a physical withdrawal state. The ICD codes then vary ac- 
cording to substances, and relevant to this chapter are: F14—mental 





and behavioural disorder due to the use of cocaine; and F15— 
mental and behavioural disorder due to the use of other stimulants, 
including caffeine. There are further specifiers such as .0 acute in- 
toxication, .1 harmful use, .2 dependence syndrome, .3 withdrawal 
state, .4 withdrawal state with delirium, .5 psychotic disorder (and 
many more; see ICD 10 for further details). 

DSM-5 classifies substance use disorders according to symp- 
toms of dependence similar in nature to ICD, but there are 11 of 
them. DSM-5 describes a continuum of substance use disorders 
and divides this into mild (2-3), moderate (4-5), and severe 
(>6) categories, based on how many symptoms are present at the 
same time. The codes relevant to this chapter are: mild 305.70 
(amphetamine-type substance) 305.60 (cocaine); moderate 304.40 
(amphetamine-type substance) 304.20 (cocaine); and severe 
304.40 (amphetamine-type substance) 304.20 (cocaine). There are 
other specifiers in DSM-5 such as acute intoxication, withdrawal, 
and other substance-induced disorders (for example, psychotic 
disorder, anxiety disorder, etc.) (see DSM-5 for further details). 























Assessing a person who reports the use of stimulant or other party 
drugs follows the same principles as for other drug use assess- 
ments. A focus on the type of drug/s, pattern, and frequency of use, 
including the amount used on the day of use (usually in grams or 
fractions thereof, or monetary spend), the preparation used, poly- 
drug use, including route of use, is mandatory. Asking about key 
psychosocial and life event determinants that accompanied the pro- 
gress from initial use to current use patterns, combined with assess- 
ment of perceived function (for example, work, managing stress/ 
anxiety, enhanced sexual function, partying) and the level of control 
over use, are useful. Any history of overdose, treatment-seeking, and 
involvement in associated high-risk behaviours, such as sex, crimes, 
and violence, should be noted. Specific assessment to determine the 
presence of dependence and withdrawal can usually be done by en- 
quiring about a typical day of use, from waking to sleep, with assess- 
ment of risk behaviours, drug use prioritization, and craving. Asking 
what happens if the person goes without the drug for more than a 
day or two can help you identify the levels of control over use and the 
presence of withdrawal. Withdrawal needs to be differentiated from 
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a shorter-lived comedown that many users of stimulants experience 
in the days following use more akin to a hangover. Blood-borne 
virus (BBV) screening should be offered routinely to injectors and 
those involved in high-risk sexual activities. Urinary drug screens 
(UDS) may be useful to corroborate which substances were used or 
confirm abstinence, though drugs such as GHB and many NPS may 
not be detected on routine UDS. Most stimulant drugs will be de- 
tectable in the urine for 2-3 days following use. 


Amphetamine (speed) and methamphetamine— 


‘crystalline methamphetamine hydrochloride’ 
(ice, crystal, shabu, yaba, meth, tina) 





Background 


Amphetamine-type stimulants are the most commonly used syn- 
thetic stimulant drugs in the world. Originally developed as medi- 
cines (initially as nasal decongestants, appetite suppressants/weight 
loss agents, and mood enhancers), their abuse potential and ability 
to induce a range of psychopathologies, most notably transient 
psychotic episodes, are well recognized. Still used in the treatment 
of narcolepsy and ADHD, dexamphetamine differs from metham- 
phetamine only by the addition of a methyl group, which lowers its 
melting point (allowing it to be smoked) and increases its potency 
and ability to cross the blood-brain barrier. Despite observed dif- 
ferences in the way the drugs are used, the problems they cause, and 
their clinical presentations, in laboratory double-blind conditions, 
the subjective experience of the two drugs is remarkably similar [3]. 
Until recently, the dominant form of amphetamine seen in the UK 
was low-purity amphetamine sulfate (typically 4-10% purity, com- 
prising both D- and L-isomers). While methamphetamine use has 
historically been problematic in the United States, Australia, and 
South East Asian countries, such as Thailand, Japan, and Korea, for 
many years, its appearance in the UK is actually a recent phenom- 
enon. Its arrival coincides with the rise of the gay ‘chem-sex’ scene 
in major cities in the UK, and methamphetamine is now starting 
to make its presence felt within both emergency departments and 
sexual health and addiction services. In the UK, methamphetamine 
was reclassified as a Class A drug in 2007, and amphetamine is a Class 
B drug. In the UK, a gram of amphetamine sulfate costs about £10, 
and a gram of methamphetamine sulfate between £100 and £150. 


Preparation, purity, and routes of use 


Amphetamine sulfate (speed) is usually sold as a white/off white/ 
beige crystalline powder and is most commonly snorted or taken 
orally, dissolved in a drink, or wrapped in cigarette paper and swal- 
lowed (known as bombing or parachuting). Amphetamine sulfate is 
water-soluble and can be injected, though because of its high melting 
point, it cannot be effectively smoked. Purities are typically very low 
in the UK, with common bulking agents being caffeine and lactose. 
Purities of over 30% are not uncommon across the European Union. 
Diversion of prescribed dexamphetamine tablets (and the closely re- 
lated methylphenidate) occurs and is a major problem in countries 
such as the United States where rates of prescription for conditions 
such as ADHD are much higher than in the UK. 

Unlike illicit amphetamine sulfate powder, methamphetamine is 
often of very high purity. Its physical form varies, depending on its 


route of synthesis and purity. Crystalline methamphetamine hydro- 
chloride (known as ice because it can resemble shards of glass) can 
be up to 80% pure. Base amphetamine (sometimes known as paste), 
is an oily, waxy intermediate product on the way to the manufacture 
of the crystalline hydrochloride salt of methamphetamine and has a 
lower purity of about 40-50%. Methamphetamine is a versatile drug 
and can be smoked, snorted, injected, and taken orally. 


Mechanism of action and metabolism 


Methamphetamine closely resembles amphetamine sulfate (com- 
monly referred to as speed) in structure and mechanism of action but is 
more potent in its sympathomimetic effects and has a longer duration 
of action (half-life of about 12 hours, compared to 2-4 hours for am- 
phetamine sulfate). Amphetamine has both direct sympathomimetic 
effects secondary to disruption of vesicular storage of monoamines 
and inhibition of their breakdown by MAOIs and indirect actions 
through inhibition of central presynaptic reuptake of catecholamines. 


Prevalence and patterns of use 


The use of amphetamine in the UK is less common than that of either 
cocaine or MDMA. A total of 0.6% of 16- to 59-year olds report using 
amphetamine in the last year, and 1.3% of 16- to 24-year olds—0.1% 
and 0.2%, respectively, of those age groups for methamphetamine. 
Lifetime ever use in 16- to 59-year olds is 10.1% for amphetamine 
and 0.9% for methamphetamine [4]. As noted in Background, 
p. 526, in the UK, methamphetamine is becoming more prevalent 
among certain segments of the population, particularly in associ- 
ation with the gay and dance music scene, but compared to the use 
of MDMA and cocaine in all its forms, the prevalence of metham- 
phetamine use at present is still low [5]. 


Doses and typical patterns of use 


Most consumers will use infrequently, with daily use, even among 
dependent users, being uncommon. Typical consumption amounts 
vary widely, but doses of somewhere between 0.25 g and 1 gin a ses- 
sion would not be uncommon. Heavier users develop tolerance and 
can consume considerably higher doses. Because these drugs ener- 
gize people and reduce the need for sleep, use sessions can extend 
over several days, with prolonged insomnia being associated with 
the development of paranoia and hostility. 


Physical and psychological effects and complications 


The effects of all amphetamines are dose-related. They induce eu- 
phoria, elevation of mood and alertness, increased speed of thought 
and speech, and heightened perceptual awareness and lead to re- 
duction in appetite and need for sleep. The physical manifestations 
of amphetamine and methamphetamine are broadly similar, char- 
acterized by sympathetic arousal with tachycardia, hypertension, 
hyperthermia, and increased respiration, accompanied by sweating, 
pupillary dilatation, increased motor activity, dry mouth, tremor, and 
blurred vision. Occasionally, serious medical complications arise, 
including malignant hyperthermia, coronary artery syndrome, seiz- 
ures, and cerebral bleeds (Table 53.1). Route-related harms, such as 
local trauma, septal ulceration, abscesses, and BBVs from snorting 
and injecting, can combine with more chronic physiological tolls, 
including malnutrition from prolonged anorexia and hyperactivity, 
poor dentition, skin picking, and gastrointestinal tract ulceration, 
leaving users in very poor physical health. 


Table 53.1 Clinical signs of intoxication with amphetamine 


Physical Psychological 


Elevated pulse, blood Euphoria/energized Motor hyperactivity 
pressure, temperature 


Increased respiratory rate Anxious/irritable Restless/twitching 





Sweating/dehydrated Rapid thoughts Talkative, pressured 
speech 

Dilated pupils Paranoia Aggressive 

Tremor/shakiness/bruxism Perceptual disturbance Stereotyped 
movements 


The acute sought-after effects of increased energy and euphoria, 
enhanced stamina, confidence, disinhibition, and heightened 
awareness can easily give way to a range of dose-related unwanted 
psychological effects. These include anxiety, panic, extreme agita- 
tion, paranoia, hostility, aggression, hallucinations, confusion, dis- 
orientation, and psychosis. 


Other consequences of use 
Dependence and withdrawal 


Dependence may occur and is more common among heavy 
users who are male and in those who smoke or inject the drug. 
Methamphetamine is considered to have a higher abuse liability and 
potential for harm than amphetamine. Although dependent users 
may use every day to avoid withdrawal, more typically users tend to 
consume a large amount of the drug (often several grams) over sev- 
eral days, going without sleep (a binge) before ceasing use through 
physical exhaustion or an exhaustion of funds. ‘Crashing’ refers 
to the period following a binge, which is characterized by fatigue, 
hypersomnia, hyperphagia, and low mood due to acute monoamine 
depletion. The crash and subsequent comedown period may last 2- 
7 days. If abstinence persists, a longer-term withdrawal period may 
be seen, characterized by craving, low mood, anergia, irritability, 
sleep, and appetite disturbance. Similar neurobiological mechan- 
isms involving alterations in the function and activity of mono- 
amine neurotransmitters are responsible for the overlap between the 
symptoms of depression and those of stimulant withdrawal [6, 7]. 
Typically, the withdrawal gradually diminishes over 2-4 weeks, 
though dysphoric symptoms may persist for up to 10 weeks. 


Withdrawal management and relapse prevention 


Management of withdrawal is largely supportive and preferably 
within a safe, well-supported home environment. Inpatient admis- 
sion is rarely required, other than in those with polyuse depend- 
ence and where use is accompanied by severe mental or physical 
illness. Despite decades of research, there are currently no widely 
accepted evidence-based pharmacotherapy regimes for the treat- 
ment of psychostimulant withdrawal [8]. The patient should be 
placed in quiet surroundings for several days and allowed to sleep 
and eat as much as is needed. Because a significant component of 
the withdrawal syndrome is probably related to neurotransmitter 
depletion, recovery may be delayed because of anorexia associated 
with amphetamine use. It may be useful in some to provide nu- 
tritional supplements or a well-balanced diet rich in monoamine 
precursors—phenylalanine, tyrosine, L-tryptophan, which are 
found in, for example, pumpkin seeds, chocolate, Marmite , and 
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bananas. Benzodiazepines may be prescribed on a short-term basis 
for agitation, though these may delay the return to normal sleep 
architecture. Some patients may become markedly despondent 
during withdrawal, and a suicide assessment may be necessary. 
Relapse prevention, based on cognitive behavioural therapeutic 
(CBT) principles and mutual aid groups, has shown to be effective in 
reducing relapse and should be part of any treatment intervention. 
Given the high rates of ADHD comorbidity among stimulant users, 
it is of note that there is growing interest and evidence to support 
the use of substitute prescribing in those with concurrent stimulant 
dependence and ADHD. Some of the most promising results have 
come from correctional facilities where methylphenidate has been 
used successfully to reduce both ADHD symptoms and relapse to 
stimulant drug use [9] (see Cocaine/crack cocaine (charlie, coke, 
chang, snow, white, crack), p. 528). 


Low mood and depression: assessment and management 


The mainstay of treatment for those with dependence on amphet- 
amines is psychological, with medication-based treatment having a 
limited role in the management of withdrawal and relapse prevention, 
other than in addressing underlying comorbidities such as depres- 
sion and ADHD. While amphetamine use can exacerbate depres- 
sion in those with a primary underlying disorder, antidepressants 
have no specific anti-craving effects, and the efficacy of antidepres- 
sants in reducing depression is confined to those stimulant users 
who are depressed. Clinicians should avoid diagnosing and initiating 
medication-based treatment for a primary depressive disorder in 
those presenting during active periods of use or withdrawal. Patients 
should have the difficulties of making an accurate diagnosis explained 
to them and should be supported to achieve a period of 2-4 weeks’ 
abstinence and have their mood reassessed before a decision is made 
to commence antidepressants. The advantages of waiting before 
deciding whether or not to prescribe antidepressants are improved 
diagnostic accuracy, avoidance of potentially unnecessary medication 
and the inaccurate attribution of low mood to drug use and not to 
an intrinsic disorder, and almost certainly an improvement in com- 
pliance and efficacy. Persistence of depressive symptoms beyond 2-4 
weeks after stopping amphetamine use may suggest that there is an 
underlying depressive illness, and this should be treated [10]; left un- 
managed, its presence represents a high risk for relapse. Psychosocial 
treatments, including CBT and motivational enhancement therapy 
(MET), for stimulant abuse and dependence have been found to be 
effective in reducing levels of use [11], with their efficacy enhanced 
through combining them with contingency management approaches 
offering non-drug reinforcers for engagement and abstinence. 


Stimulant-induced psychosis 


The use of high doses of methamphetamine may lead to the induc- 
tion of a temporary psychotic state that may be clinically indistin- 
guishable from paranoid schizophrenia. First recognized in 1938 in 
association with Benzedrine nasal inhalers, it was not until Connell’s 
classic 1958 study that the syndrome was well described. Acute tran- 
sient psychotic episodes (typically characterized by suspiciousness, 
hostility, unusual thought content, and hallucinations) occur in 
about 10-15% of users. Psychotic episodes are more common in de- 
pendent users, users of methamphetamine, men, injectors, polydrug 
users, those with a past history of psychosis, and following a binge in 
association with prolonged insomnia [12]. 
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Amphetamine-induced psychosis is characterized by persecu- 
tory delusions and hallucinations which are typically auditory but 
may be visual or tactile (typically formication, or the sensation of 
insects crawling under the skin; this can rarely lead to secondary 
delusions such as Ekbom’s syndrome of delusional parasitosis). Little 
has changed in the way of management since Connell’s time—he re- 
commended ‘removal of the drug and appropriate sedation. Often, 
patients present with high levels of hostility and violence secondary 
to persecutory delusions or hallucinations, and safe containment 
and management of the disturbed individual can require enormous 
levels of both physical and chemical restraint. Benzodiazepines 
(often required in very high doses) should be the first-line medica- 
tion, with antipsychotics used only where additional tranquillization 
is required. A diagnosis of a possible underlying or persistent psych- 
otic disorder must be deferred until a reassessment can be made in 
a drug-free state. These often florid psychoses usually remit within 
a few days, and the user returns to normal functioning, although 
some retain a vulnerability to such episodes [13]. Only a minority 
(1-15%) persist beyond 1 month, and many of these patients will 
have underlying psychiatric disorders [14]. 

The prognosis is variable, with those who have experienced 
stimulant-induced psychotic episodes being more vulnerable to 
future episodes (possibly through behavioural sensitization) on 
re-exposure to the drug, often at lower levels. Recent PET imaging 
studies in chronic methamphetamine users have demonstrated a 
reduction in dopamine transporter concentration, and this reduc- 
tion was significantly associated with the duration of methampheta- 
mine use and closely related to the severity of persistent psychiatric 
symptoms. Moreover, the severity of psychiatric symptoms was sig- 
nificantly correlated with the duration of methamphetamine use. 
Cessation is still potentially important since there does appear to be 
some recovery of dopamine transporter function with abstinence. 

From a neurotoxicity perspective, preclinical studies have shown 
amphetamines to be toxic to dopaminergic axon terminals in the 
striatum, in particular the nigrostriatal pathway. Other neurotrans- 
mitter systems (GABA, glutamate, serotonin, and ACh) may also 
be affected, and the underlying mechanisms causing such damage 
are thought to be multiple, including oxidative stress, excitotoxicity, 
neuroinflammation, and others [15]. Amphetamine use in hu- 
mans (in particular, methamphetamine) has been associated with 
abnormal morphology of the substantia nigra and possible vulner- 
ability to Parkinsons disease (PD) [16]. Epidemiological studies 
have suggested an increased rate (2- to 3-fold) of PD in amphet- 
amine users [17-19]. Similarities in methamphetamine-associated 
dopaminergic neural damage to that seen in PD have resulted in 
some animal models used in PD research being based on metham- 
phetamine exposure [20]. 


Stimulant overdose 


Dose-related toxicity results from excessive levels of extracellular dopa- 
mine, noradrenaline, and serotonin. Acute sympathomimetic toxicity is 
primarily manifested through neurological and cardiovascular effects, 
though pathology can extend to the gastrointestinal, renal, musculo- 
skeletal, and pulmonary systems. Clinically, stimulant overdose is rec- 
ognized by a constellation of psychological (anxiety, panic, paranoia, 
hallucinations, confusion, and hostility), behavioural (aggression, 
agitation, and hyperactivity), and physiological (mydriasis, hyper- 
thermia, dehydration, sweating, tremor, hyperreflexia, hypertension, 


tachycardia, movement disorders, and seizures) symptoms. The man- 
agement of sympathomimetic overdose is largely supportive, with 
cessation of stimulant consumption typically leading to resolution 
over a few hours or days. Where there is risk to self or others, either 
as a result of physiological overstimulation or aggressive behaviours, 
benzodiazepines (often in high doses) and, in some cases, medica- 
tions to stabilize cardiovascular overstimulation are required. In cases 
where agitation, delirium, and movement disorders are unresponsive 
to benzodiazepines, second-line therapies, such as antipsychotics, 
intravenous sedation, and muscle relaxants, can be considered. Rapid 
cooling may be required in cases of malignant hyperthermia. 


Harm reduction for stimulant users 





The risks of acute harm from stimulant-type drugs are related to the 
dose, length of current use episode, route of use, and polydrug use 
and are greater among heavy frequent users. As a result, the key safer 
use strategies include avoiding injecting and smoking, not mixing 
with other drugs or alcohol, limiting the amount used during a ses- 
sion, avoiding the development of tolerance through taking regular 
breaks from use (giving both the body and brain time to recover), 
trying to limit periods of use to 1-2 days, avoiding prolonged in- 
somnia which markedly increases the risks of developing psychosis, 
and eating and sleeping well between episodes of use. 


Cocaine/crack cocaine (charlie, coke, 





chang, snow, white, crack) 


Background 


Cocaine is a major alkaloid constituent of the Andean bush 
Erythroxylum coca. Originally used for millennia to increase stamina, 
suppress appetite, and manage altitude sickness by indigenous 
groups in South America, cocaine hydrochloride (powder) produc- 
tion and distribution is a multi-billion dollar industry, thought to 
underpin much of the violence and corruption that ravages countries 
such as Mexico. Used recreationally to induce euphoria, energy, and 
confidence, cocaine is frequently combined with depressants such 
as alcohol and opioids. It is sold either as a water-soluble crystalline 
hydrochloride powder or as a smokable free alkaloid base form (no 
longer the salt—therefore, no longer water-soluble) known as crack. 
In both forms, the mechanisms of action and potential for cocaine 
to cause harm are remarkably similar. In fact, it is the differences 
in routes of use (smoking/injecting vs snorting), socio-economic 
status (rich vs poor), and ethnicity (white vs black) and involve- 
ment in criminality and heroin use that are mostly responsible for 
the observed differences in the presentation and harms seen among 
users of these two different preparations. In the UK, cocaine in all its 
forms is a Class A drug. 


Mechanism of action 


Cocaine is an indirect sympathomimetic agent that causes an acute, 
but transient, blockade of the dopamine transporter (DAT), causing 
an acute elevation of dopamine levels, resulting in the experienced 
pleasure burst. Chronic cocaine administration leads to a compen- 
satory increase in DAT levels, such as that with abstinence, and these 


increased transporter levels lead to a relative reduction in levels of 
dopamine outside nerve cells. Neuroimaging studies suggest that 
there is persistent reorganization of reward pathways, which may 
underlie the persistence of craving, even after many months of ab- 
stinence. Its half-life is about 50 minutes. 


Route of administration 


Intranasal use of cocaine powder is the major route of administration 
in the general population. Purity in the UK has increased in recent 
years, with street deals varying between 10 and 70%, giving rise to a 
two-tier market, with ‘economy’ cocaine selling for £50-60/gram and 
‘premium cocaine for over £100 in some places. Intranasally, cocaine 
lasts about 1-3 hours and is re-dosed every 1-2 hours in small amounts 
(known as lines) over the course ofa session. Typical amounts used on 
the day of use would be 0.5-1 g/day; much larger amounts can be seen 
in those dependent upon the drug. Intravenous use is associated with 
a shorter duration of action, more intense effects, and greater mor- 
bidity. Cocaine injectors sometimes administer 20 times or more in a 
day. In combination with heroin (known as speedballing), it has high 
associated mortality rates. Smoked as crack cocaine, the high lasts a 
matter of minutes. Intense, short-lived euphoria, followed by a rapid 
return to baseline or an overshoot to feeling lousy, is a recipe for enor- 
mous reinforcement and is often a gateway into heroin use, which is 
marketed by dealers as a way of managing the comedown from crack 
cocaine. Crack cocaine users can often consume many grams of the 
drug over a session, with the rate-limiting factor being money. 

Many people may use powdered cocaine in a recreational fashion 
for many years and without significant problems. Others may in- 
sidiously develop problematic use, characterized by increasingly 
regular dosing, development of tolerance, craving, and loss of con- 
trol. Concurrent consumption of alcohol is common and can lead to 
increased rates of acute harm and dependence. At population level 
in England and Wales, in 18- to 59-year olds, the reported lifetime 
ever use is 9.7% for cocaine powder and 1.2% for crack. Last year use 
in 16- to 24-year olds was 4.8% for powder and 0.1% for crack, and 
in the 18- to 59-year olds, 2.3% for powder and 0.1% for crack [4]. 
Frequently seen as a way of enhancing confidence and performance, 
attempts at reducing use are often accompanied by a rebound fall 
in self-esteem and energy and the development of depression. It is 
of note that crack is sold in small unit cost deals (5-10), compared 
to the costlier dealing unit of cocaine. Within UK drug treatment 
services, coexisting crack cocaine and heroin use are common, with 
typical users reporting smoking 3-4 ‘bags’ per day, equating to ap- 
proximately half a gram. Although dealers usually provide the user 
with the preparation of their choice, some users will ‘wash their own 
cocaine powder with a base (ammonia or bicarbonate of soda) to 
make their own crack, while cocaine injectors are sometimes left 
with having to inject crack if powder cocaine is unavailable. It may 
be that there are groups of ‘functional’ occasional users of crack, but 
in those with substance use disorders, the pattern of use is often un- 
controlled and leads to significant harm. 


Dependence and withdrawal 


Dependence to cocaine can occur with both powder and crack but 
is potentially more likely if smoked as crack. This is due to the more 
rapid onset of action, intense highs, and early withdrawals, which are 
very reinforcing. Features of escalating use over time, loss of control, 
tolerance, withdrawal symptoms, cravings, salience, and continued 
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use despite knowledge of harm can all manifest. Dependent users 
may not necessarily have to use every day—typically crack users 
tend to use in binges over a few days, then rest—but there are rarely 
many consecutive days off, and even then, dependent users often use 
other substances to combat the withdrawal symptoms. 

The withdrawal syndrome is similar to that seen in amphetamine 
withdrawal, with common symptoms including dysphoric mood, 
fatigue, irritability, increased appetite, hypersomnia/insomnia (re- 
flecting a homeostatic resetting response to acute intoxication- 
related insomnia and anorexia), irritability, craving, lethargy, or 
agitation. Secondary to a hypo-dopaminergic state, the predom- 
inant psychological symptoms peak on days 2-7, reducing gradually 
over the following 3-6 weeks. 


Effects and toxicity 


These are similar to those seen with amphetamines and include signs 
of sympathetic arousal (tachycardia, hypertension, severe agitation, 
hyperthermia, seizures) and severe cardiac problems such as coronary 
artery spasm, congestive cardiac failure, arrhythmias, ischaemia, and 
myocarditis. Cocaine, when consumed with alcohol, results in the for- 
mation of the less anxiogenic, but more cardiotoxic, reinforcing, and 
longer-lasting, cocaethylene. Concurrent consumption of alcohol, 
and then cocaine, leads to a 30% increase in peak plasma cocaine 
levels, which may increase the risk of sudden death in a vulnerable 
minority. Chronic use has more recently been associated with acceler- 
ated cerebrovascular ischaemia and strokes in young users. 

Psychiatric comorbidity with cocaine dependence is high. About 
25% have coexisting ADHD; 50% experience alcohol use disorders, 
and rates of depression are much higher than that of the general 
population. Depression predicts poorer outcome in cocaine de- 
pendence and, untreated, can lead to an increased risk of relapse. 
Although adverse psychological experiences are less common than 
among users of methamphetamine, because of cocaine’s shorter half- 
life, transient episodes related to sympathomimetic hyperarousal, 
such as anxiety, panic, agitation, paranoia, and aggression (espe- 
cially when consumed with alcohol), are not uncommon. Acute ad- 
verse effects are seen with crack, because of its greater intensity of 
effect and the dose of drug consumed. 


Route-related harms 


Persistent intranasal use can result in nosebleeds, with the risk of 
viral BBV transmission. Smoking the free alkaloid base as ‘crack’ is 
associated with a shorter, more intense high and can lead to acute 
lung injury in the form of ‘crack lung’ a progressive fibrotic condi- 
tion of the lung. Users present with haemoptysis, hypoxaemia, fever, 
and cough, with pulmonary interstitial inflammation and periph- 
eral eosinophilia. Injecting cocaine is associated with higher rates 
of sharing of intravenous equipment and HIV transmission and can 
also cause significant local trauma secondary to local anaesthesia 
that precludes pain-associated restriction in damaging injecting 
behaviour. Cocaine use, especially via the smoking route, is asso- 
ciated with marked increase in risk of stroke (once controlled for 
tobacco and alcohol use, odds ratio of >5) in the 24 hours following 
use among young adults [21]. 


Treatment 


Hospital admission for withdrawal may be necessary in some cli- 
ents, such as those with unstable living conditions, comorbid mental 
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health or severe physical illness, or polydrug dependence, or those 
in whom severe withdrawal has been experienced and attempts at 
outpatient detoxification have proved ineffective. Optimized nutri- 
tion, rest, and symptomatic relief with short-term, low-dose benzo- 
diazepines [for example, diazepam 5 mg three times daily (tds)] may 
be helpful. 

However, as with other stimulant drugs, there is little evidence to 
support the routine use of any specific pharmacotherapy either in 
the management of cocaine withdrawal or in maintaining abstin- 
ence from it, apart from situations where a comorbid psychiatric 
condition has been identified (see Medication-based treatments for 
stimulant use disorders/dependence, p. 530). The evidence suggests 
the most effective interventions are based on CBT (see below). 


Medication-based treatments for stimulant use 
disorders/dependence 


Despite decades of research on a wide variety of different therapeutic 
agents, there is still no evidence-based pharmacotherapy available 
for the general treatment of cocaine dependence. In the absence of 
primary depressive disorders, antidepressants are not useful as a 
treatment for cocaine dependence. In those with a primary depres- 
sive disorder, the efficacy of antidepressant medication will be very 
limited if the person continues to use cocaine. Untreated depression, 
however, is a risk factor for relapse to cocaine use, and careful assess- 
ment is required, even after several weeks of abstinence, to ensure it 
does not emerge late and be left untreated. There is also no evidence 
for the use of antipsychotics, anti-epileptics, or opioid antagonists in 
the treatment of cocaine dependence. There is no widely utilized or 
evidence-based substitute medication, as is the case for opioid de- 
pendence. However, there is growing interest in the possible role of 
stimulant replacement therapy in the treatment of carefully selected 
groups, combined with psychosocial interventions. Promising re- 
sults for such approaches have been seen among those with comorbid 
ADHD where the provision of an oral alternative stimulant (for ex- 
ample, dexamphetamine or methylphenidate) has been initiated as 
a harm reduction approach in persistent high-risk injectors. In such 
groups, with appropriate assessment and monitoring (for unwanted 
cardiovascular and psychological effects), stimulant medications 
can be safe and effective. Concerns over prescribing a drug with 
abuse liability, risk of diversion, and difficulty with ensuring compli- 
ance mean the use of substitute prescribing is not straightforward. 
It is likely, however, that in combination with psychological support, 
including contingency management, such approaches may be cur- 
rently among the most effective available [22]. Extended-release 
preparations may be the most effective, being less reinforcing and 
requiring lower levels of supervision. 

Although rates of stimulant dependence are high among those 
with adult ADHD, there is no evidence to suggest they are signifi- 
cantly higher than in those with other substance use disorders, with 
rates among those with substance use dependence being between 
20% and 25% [23]. There is growing interest in the role of substitute 
prescribing in those with concurrent stimulant dependence and 
ADHD [24]. Some of the most promising results have come from 
correctional facilities where methylphenidate has been used suc- 
cessfully to reduce both ADHD symptoms and relapse to stimulant 
drug use [25]. It may therefore be the case that selective treatment 
of adults with coexisting stimulant dependence and ADHD may 
have specific therapeutic benefits. Screening for ADHD in those 


with stimulant disorders is therefore clinically useful. Some adults 
may give clues to their primary diagnosis by reporting a paradox- 
ical response to stimulant drugs as adults (for example, feeling 
calm and ‘normal’), as opposed to hyper and aroused which would 
be the normal subjective response to drugs like cocaine in those 
without ADHD. There is also interest in the use of non-stimulant 
medications in the treatment of stimulant use disorders, including 
modafinil, a wakefulness-promoting agent approved for the treat- 
ment of narcolepsy and shift-work sleep disorders. With both dopa- 
minergic and glutamatergic activity, early work suggests it might 
be useful in the treatment of cocaine dependence, with studies 
showing a reduction in subjective effects, use, and craving [26]. 

A more recent development in the treatment of those with drug 
dependence has been the provision of non-drug rewards (for ex- 
ample, a voucher or monetary incentive) for engagement in treat- 
ment and confirmed abstinence from drugs. Such approaches are 
referred to as contingency management (CM). In trials where CM 
has been used in the treatment of stimulant use disorders, combined 
with other psychological approaches, such as CBT or MET, CM has 
been shown to increase retention in treatment, increase abstinence, 
and possibly enhance the efficacy of pharmacological interventions 
[27]. Although still uncommon in routine practice, the evidence 
suggests that CM should be offered as part of routine clinical inter- 
ventions, in conjunction with other therapies. 

Interest and clinical trial research into a vaccine for cocaine de- 
pendence continue, with the development of antibodies to bind to 
cocaine and prevent it from reaching receptor sites. To date, clinical 
trials suggest such interventions are some way off and, once devel- 
oped, will no doubt face a series of ethical and biological hurdles be- 
fore immunization of vulnerable groups, such as the young or those 
with addiction histories, becomes part of treatment anytime soon. 


Ecstasy (3,4-methylenedioxymethamphetamine) 


(MDMA, E, X, XTC, pills, beans, Molly, Mandy) 





Background 


MDMA isa synthetic stimulant drug characterized by pro-social ef- 
fects, euphoria, energy, and empathy. First synthesized in 1912 by 
Merck Pharmaceuticals, MDMA remained largely ignored until 
the 1950s when the United States army explored its potential as a 
military agent (though, unlike methamphetamine, it lacked an 
aggressive edge and was more likely to make someone hug their 
enemies than kill them!). In the late 1960s and early 1970s, recre- 
ational drug users on the west coast of America began to recognize 
the desirable effect profile of MDMA and the closely related drug 
MDA (methylenedioxyamphetamine). In the mid to late 1970s, 
some clinicians—influenced by the biochemist and psychopharma- 
cologist Alexander Shulgin who had resynthesized MDMA in his 
laboratory in 1976—started exploring its therapeutic potential to 
facilitate marriage counselling and individual therapy. Such usage 
was dismissed by the United States government, and what followed 
were three decades of research focusing on MDMA-related harms, 
with MDMA being branded a potent neurotoxin with no place in 
medicine. The last 10 years, however, have seen a gradual resetting 
and revision of both its acute harms and longer-term neurotoxicity, 
as well as its therapeutic potential. 


Although MDMA may claim its place as the pre-eminent dance 
drug and possessor of the best branding in terms of name (‘Ecstasy’), 
MDMA is only one of a large number of synthetic amphetamine- 
type drugs possessing varying degrees of stimulant, hallucinogenic, 
and empathogenic effects that are used within the dance scene. As 
with most stimulant drugs, desirable effects and the risk of harm 
are dose-related. As seen in other illicit drugs, inconsistent prepar- 
ations and variations in quality (for example, the amount of MDMA 
in a pill) and the presence of more dangerous contaminants [such 
as paramethoxymethamphetamine (PMMA)] mean that informed 
dosing and safer use strategies have their limitations. Overall, 
MDMA can be considered a relatively safe drug; acutely, it has a low 
risk of death, and although tolerance and loss of control are reported 
by users [28], few, if any, users will ever seek help for problems as- 
sociated with its use in isolation from other drugs. Understanding 
its effects and risk profile is most useful to the clinician in ensuring 
an accurate assessment of mood and other psychological symptoms 
that can occur in the presence of MDMA use or that can be wors- 
ened by its use. In the UK, MDMA is classified as a Class A drug. 


Preparations, purity, and routes of use 


MDMA is most commonly sold as tablets (pills) or powder (crys- 
tals) and taken orally, though it may be snorted, taken rectally, or 
injected. MDMA tablets vary in size, shape, and colour and are 
typically branded with an imprinted logo (for example, of a car- 
toon character, a car manufacturer, or a luxury brand). The size and 
shape of the tablet give no clue to its content, though, since 2015, in 
the UK and across much of Europe, tablets sold as ecstasy do con- 
tain MDMA, though the dose and precise composition vary. In re- 
cent years, it appears that the average dose of MDMA found in a 
pill has increased, with the average ecstasy pill containing 80-150 
mg, though pills containing over 300 mg have been identified. These 
higher-dose pills have almost certainly contributed to both an escal- 
ation in the number of users seeking emergency medical treatment 
following the use of MDMA and an escalation in MDMA-related 
fatalities (from eight in 2010 to 63 in 2016, and over 56 in 2017 [29]). 
It is of note that while MDMA deaths are not clearly dose-related 
(the relationship between consumed dose and the risk of death is 
inconsistent and unpredictable on an individual level), higher doses 
of MDMA make you more vulnerable to environmental factors 
such as high temperature and overcrowding. The average dose of 
MDMA to give the user an optimal balance of positive vs negative 
effects is about 80-150 mg in divided doses (1.5 mg/kg). Higher- 
dose tablets can make it easy to take too much and expose people 
to dose-related unwanted effects, including agitation, panic, chest 
pain, overheating, hallucinations, and confusion. MDMA crystals 
are white, beige, or brownish in colour and can look very similar to 
other white crystalline drugs such as crystal meth. Typically swal- 
lowed either by wrapping in cigarette paper (known as parachuting 
or bombing) or taken by licking a finger and dipping it in MDMA 
powder (known as dabbing), MDMA powder can also be snorted or, 
rarely, injected. In the UK, an ecstasy tablet costs £5-10, with a gram 
of MDMA crystals costing around £40. 


Pill testing 


Because of the variations in quality, some users utilize drug testing 
methods (such as the Marquis test which gives colorimetric re- 
sults by mixing the substance with a reagent) and websites (http:// 
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www.EcstasyData.org) which provide the contents of different pills 
following more elaborate analytical methods. Although these ap- 
proaches have some role, not least in getting users to consider the 
risks and engage in conversations about harm reduction practices, 
these approaches are limited by accuracy of data, especially re- 
garding the amount of each active drug present. More sophisticated 
analytical techniques can help in identification of more psychoactive 
contents such as paramethoxyamphetamine (PMA) [30]. There is a 
good case for an early warning system to ensure that information 
about high-dose MDMA pills or pills found to contain dangerous 
contaminants is made available to the drug-using community, 
though exactly what approach is the best, given resource limitations, 
is uncertain. However, in recent years, across many countries in 
Europe, the availability of on-site sophisticated analytical processes 
[for example, gas chromatography-mass spectrometry (GCMS), 
high-performance liquid chromatography (HPLC)] are offering 
the possibility of integrated harm reduction and chemical compos- 
ition analysis, including the amount of active drug which can help 
people change their drug using behaviour. However, the majority of 
drug-related deaths remain unpredictable, and individual behaviour 
the biggest modifiable risk behaviour. Knowing the content of your 
tablet never guarantees the user a positive experience and can never 
reduce the risk associated with use to zero [31]. 


Prevalence and patterns of MDMA use as ecstasy 


British population studies show that 54% of 20- to 22-year olds have 
been offered ecstasy at some time and 15% have tried it at least once 
[32]. The current prevalence in 16- to 24-year olds has increased to 
5.4% in terms of last year use [4]. Use appears higher in those asso- 
ciated with the dance music drug scene, with over 90% reporting 
lifetime ever use. Similar findings have been reported from Europe 
[33], Australia [34], and the United States. 

The typical pattern of use in the UK and Europe is 1-2 tablets a 
night (equivalent to 100-250 mg of MDMA powder). Most people 
use less than ten times per year. Tolerance develops with regular use 
and is associated with higher-dose consumption and higher levels of 
polydrug use. Ecstasy, especially within the context of dance clubs, 
is rarely taken in isolation, and polydrug use is the norm, with dif- 
ferent adjunctive substances taken at different times over the course 
of a night [35]. For example, alcohol is taken with ecstasy at the be- 
ginning of the night to get a stronger/better high [36]. Cocaine, am- 
phetamines, and/or additional ecstasy tablets are taken to maintain 
arousal and a state of alertness (the MDMA entactogenic effects fade 
away in 2-4 hours). Finally, depressants such as cannabis, alcohol, 
benzodiazepines, and more rarely opiates may be taken in the last 
part of the night to calm down before going home, since the un- 
toward after-effects of ecstasy (namely irritability, insomnia, and 
restlessness) may persist well beyond its ‘pleasurable’ effects. With 
chronic high dosage, ecstasy users develop tolerance and experience 
a decrease in the desired effects over time, which could lead to use of 
other stimulants and hallucinogens [35]. 


Physical effects and complications 


Physiologically sympathomimetic properties similar to amphet- 
amine predominate, including tachycardia, anorexia, increased re- 
spiratory rate and blood pressure, increased motor activity, tremor, 
mydriasis, increased temperature, and sweating. Jaw tightening 
(bruxism), xerostomia, and teeth grinding with molar erosion may 
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also be seen (Table 53.2). Sleep architecture modification [37] and 
sexual activity alteration [38, 39] have also been described. 

After MDMA intake, a number of untoward effects (often part 
of the ‘coming up’ phase when the effects of MDMA first mani- 
fest) may commonly occur, including nausea, vomiting, diarrhoea, 
tachycardia, and palpitations. Pathologies less commonly seen 
include arrhythmias, hypertension, potentially life-threatening 
metabolic acidosis, cerebral haemorrhage, convulsions, coma, 
rhabdomyolysis, thrombocytopenia, disseminated intravascular 
coagulation, syndrome of inappropriate antidiuretic hormone 
(SIADH), acute kidney failure, acute liver failure, dehydration, and 
malignant hyperthermia [40-44]. 

Hyperthermia, although enhanced by exertional activity and 
poorly ventilated environments, may occur somewhat independ- 
ently from the setting in which the drug is taken, because MDMA 
has thermal dysregulation effects in its own right. However, in 
clubs, dehydration is common and thirst naturally prompts in- 
dividuals to replace body fluids lost during sweating sensibly 
with fruit juices, other isotonic fluids, or water, or less sensibly, 
but quite commonly, with alcohol. Very rarely, excessive intake of 
hypotonic fluids, coupled with an increase in vasopressin levels, 
has led to the occurrence of lethal hyponatraemia [45]. Deaths 
as a result of SIADH are very rare but may be most associated 
with excessive hypotonic fluid consumption, with MDMA poten- 
tially leading to urinary retention and impairing awareness of the 
need to micturate. In normal subjects who take MDMA and do 
not develop SIADH, there does appear to be an increase in both 
antidiuretic hormone (ADH) (greater in females than males) and 
oxytocin levels [46], the latter perhaps responsible for the drug’s 
pro-social effects. Ninety per cent of cases of SIADH are seen 
in women. 

MDMA is a potentially damaging cardiac stimulant [47], with 
reports suggesting long-term MDMA use may possibly lead to a 
fenfluramine-like valvular heart disease condition. Between 0.5% 
and 1% of last year users of MDMA report seeking emergency med- 
ical treatment in the previous 12 months, with the rate among young 
women being 2-3 times greater than among men. Presentations are 
more common in heavy regular users and those combining MDMA 
with alcohol and/or other drugs. 


MDMA’ psychological effects and problems 


(See Table 53.2 [48].) Being structurally related to both amphet- 
amine and mescaline, ‘empathogens’ or ‘entactogens’ [49] like 
MDMA possess both stimulant and hallucinogenic properties, 
which allow them to be discriminated from other related substances. 
After MDMA ingestion, enhanced mood, increased energy, open- 
ness, heightened sensory perception, and mild perceptual alter- 
ations are reported [40, 42, 48]. 

MDMA is described as evoking ‘an easily controlled altered state 
of consciousness with emotional and sensual overtones’ [50], with 
the substance’s appeal resting in its ‘dramatic and consistent ability to 
induce in the user a profound feeling of attachment and connection. 
With this in mind, it is perhaps not surprising that the Los Angeles 
dealer who coined the street name ‘ecstasy’ for MDMA would have 
preferred the name ‘empathy, but he did not feel that his typical cus- 
tomer would know what it meant. These qualities were also what has 
led to recent research into its use within a clinical psychotherapeutic 
setting in the treatment of PTSD [51-54]. 


Table 53.2 Psychological and physical effects of MDMA 


Physical Psychological 


ncrease in physical and emotional Relaxation/euphoria 


energy 
Dilated pupils, dry mouth Feelings of well-being 


Tachycardia, hypertension, increased 
respiratory rate 


Enhanced closeness and sociability 


ncreased sweating, dehydration Heightened perceptual awareness 


Disinhibition 





ncreased motor activity, tremor 





Blurred/double vision ncreased response to touch/ 


empathy 
Anorexia, nausea, weight loss Anxiety/panic/paranoia 


Teeth grinding, jaw clenching (bruxism) Agitation and restlessness 


Adapted from J Nerv Ment Dis., 180(6), Liester MB, Grob CS, Bravo GL, et al., 
Phenomenology and sequelae of 3,4-methylenedioxymethamphetamine use, 
pp. 345-54, Copyright (1992), with permission from Wolters Kluwer Health, Inc. 


Acute psychological problems associated 
with MDMA use 


There have been reports of acute episodes of anxiety, panic, para- 
noia, excessive agitation, confusion, hallucinations, and rarely brief 
psychotic episodes following the consumption of MDMA. Many 
users of MDMA report ‘mid-week blues, with some individuals 
reporting clinically borderline levels of depression in the days fol- 
lowing MDMA [55], which could reflect the depletion of serotonin 
following the acute elevation that follows ingestion of MDMA. This 
could be seen as a parallel to the ‘crash’ reported after abstinence of 
cocaine use or as a hangover effect from all-night dancing, exces- 
sive alcohol, and minimal sleep. Although depression, anxiety, and 
mood fluctuations attributed to ecstasy are reported to be strongly 
related to the number of occasions of MDMA use [56, 57], Morgan 
et al. [58] found that higher depression scores among current heavy 
ecstasy users, in comparison to drug-naive and polydrug controls, 
were no longer significant after treating cannabis use as a covariate. 


Other consequences of use 


Neurotoxicity, neuropsychological impairments, and 
psychiatric presentations 


Significant research into MDMA has been ongoing for around 
30 years. Much of the evidence for MDMA causing 5-HT neurotox- 
icity comes from preclinical studies in rodents and non-human pri- 
mates where findings of depleted 5-HT brain tissue levels, reduced 
5-hydroxyindoleacetic acid (5-HIAA) levels, reduced tyrosine 
hydroxylase activity, and reduction in serotonin transporter (SERT) 
binding have been consistently reported [59]. 

With regard to MDMA having 5-HT neurotoxic effects in hu- 
mans, there is ongoing debate. Given the preclinical data, it is likely 
that similar effects would occur in humans; however, there are issues 
around inter-species scaling and dosing regimes (animal studies 
often dose multiple times per day on consecutive days, which is quite 
different from human recreational MDMA use patterns, and also 
often in higher milligrams per kilogram than humans would typ- 
ically consume), which mean extrapolation from animal studies is 
complicated [60]. 

Ina recent meta-analysis of both preclinical and human molecular 
imaging studies on MDMA, the main finding was a reduction in 


SERT binding across the cerebral cortex and parts of the forebrain, 
especially in heavy users, but there is evidence of some recovery 
after cessation of MDMA use [61]. This reduction in SERT binding 
is thought to be indicative of serotonergic neurotoxicity; however, 
downregulation of SERT cannot be excluded. In a systematic review 
of imaging studies in moderate users (<100 tablets lifetime, or <50 
lifetime episodes of MDMA use), there was no convincing evidence 
that moderate MDMA use was associated with structural or func- 
tional brain alterations in neuroimaging measures [62]. 

From a functional perspective, MDMA use in humans has been 
associated with a number of neuropsychological deficits which are 
thought to be underpinned by 5-HT system dysfunction; however, 
the evidence for specific MDMA-attributable deficits remains a little 
controversial due to inherent confounds such as purity of tablets 
taken, use mainly of cross-sectional studies, and comorbid poly- 
drug use often seen in MDMA users. The latter issue has benefited 
from use of non-MDMA-using polydrug control groups, rather than 
drug-naive controls, in more recent studies. A recent meta-analysis 
has shown that MDMA users show impaired executive function, 
relative to non-users, in particular in its sub-domains: accessing 
long-term/semantic memory, ‘switching, and ‘updating’ [63]. Other 
functional deficits associated with MDMA use have been described 
in prospective memory, higher cognitive processing, such as logical 
reasoning, problem-solving, organizational control, and social in- 
telligence. Other psychobiological domains affected by MDMA are 
sleep apnoea, visual information processing, immunocompetence, 
pain perception, psychomotor skill, and damage to emergent chil- 
dren of pregnant women [64]. 

MDMA intake may put users at significant risk for developing 
psychiatric problems [65], although some have suggested that this 
may occur only in vulnerable individuals [66]. Studies suggest that 
ecstasy users may report both childhood emotional/physical abuse 
[67] and a history of familial depression, anxiety, and panic attacks 
[68] more frequently than ecstasy-naive controls. Many studies also 
report increases in psychopathology in MDMA users, relative to 
non-MDMA-using polydrug controls, in self-reported measures of 
anxiety, depression, memory, and impulsivity [69]. 


Depression and its management in MDMA users 


In a patient presenting with psychological problems who has a his- 
tory of MDMA use, the crucial assessment issues are the identifica- 
tion of any premorbid disorders, their position in the cycle of use/ 
post-use, and the persistence of any symptoms beyond a 2- to 4- 
week period following cessation of use. As with amphetamine use, in 
the days after taking MDMA, there may be symptoms attributable to 
monoamine depletion and subsequent repletion. A period of acute 
5-HT depletion due to vesicular monoamine depletion (mid-week 
blues) is likely to be the most potent cause for the relative reduction 
in monoamine neurotransmitters. Repeated use of MDMA over 
several days will be associated with markedly diminished effects. 
Recovery is delayed further by inhibition of the rate-limiting en- 
zyme (tyrosine hydroxylase in the case of MDMA) and the relative 
absence, especially in chronic users, of a good source of monoamine 
precursors following stimulant-induced anorexia and malnutrition. 
It is likely that, as with other stimulant drugs, including cocaine, a 
period of extended, but less intense, withdrawal symptoms (mood, 
sleep) may be seen with persistent abstinence, which may take 
weeks or months to recede and are associated with the more gradual 
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reversal of neuroadaptive changes in dopaminergic receptor sensi- 
tivity/expression [70]. 

As with amphetamine and cocaine users, antidepressant treat- 
ments should usually not be commenced until 2-4 weeks after 
cessation of MDMA use, in order to allow for reassessment and con- 
firmation of any disorder. There are also at least theoretical causes 
for concern over potentially fatal interactions between MDMA and 
SSRIs that have been reported very rarely, possibly because some 
SSRIs (that is, citalopram) can inhibit the CYP2D6 enzyme [71]. The 
precise effects of combining SSRIs and MDMA appear to be related 
to whether use of the SSRI was before or after MDMA and whether 
SSRI dosing is acute or chronic. For example, SSRIs given acutely 
after MDMA (taken by users to intensify the ecstasy effects) may the- 
oretically increase the risk of precipitating a serotonergic syndrome. 
It is probable that SSRIs and other classes of antidepressants can be 
used effectively in this group if a diagnosis of responsive affective/ 
anxiety disorder is confirmed and abstinence is maintained. CBT 
may be useful in this group, both to address their underlying drug 
use as well as to address any coexisting anxiety/depressive disorders. 

Dependence with the development of heavy regular use patterns 
is possible, though there are unlikely to be any specific signs or 
symptoms that differentiate diagnosis or management significantly 
from other forms of stimulant dependence. 


Assisted psychotherapy and the potential 





therapeutic role of MDMA 


While this chapter has concentrated on the recreational use and 
dangers of MDMA, interest in MDMA as an empathogen predates 
its incarnation as a party drug. There is now serious interest in the 
medical use of MDMA to treat PTSD. The initiative has evolved over 
many years as the Multidisciplinary Association for Psychedelic 
Studies (MAPS) project (http://www.maps.org/research/mdma). 
Agreement has been reached with the FDA to designate MDMA as 
a ‘breakthrough medicine’ A definitive RCT is planned, supported 
by philanthropic donations, and the objective is an approval by 2021. 


Novel psychoactive drugs 


Since the mid 2000s, a dizzying array of diverse novel psychoactive 
drugs have appeared in the UK and across the world. Unhelpfully 
labelled ‘legal high’ by the media, until recently, many were openly 
sold on the UK high streets as smokable herbal preparations, pills, 
and powders. They were carefully branded and labelled with ‘not 
for human consumption to ensure they did not contravene the 
law. Often attempting to mimic the effects of traditional drugs— 
most commonly, cannabis, stimulants, and hallucinogens—they 
vary widely in their desirability, potency, effect, and risk profile. 
These new compounds are often undetected by UDS, and statutory 
services are often unaware of their presence in the community until 
they cause clinical problems in emergency departments or are found 
in police seizures. Such triangulation of data sources is required to 
keep abreast of this rapidly developing area. 

The Psychoactive Substances Bill passed in the UK in May 2016 
is an attempt to reduce their widespread availability and has led to 
closure of over 300 ‘headshops’ across the UK. While reducing easy 
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access on high streets will lead to a reduction in use, it is probable 
that distribution and supply will continue either through illicit street 
dealing networks or dark-net drugs markets. Where the effect pro- 
file, demand, and profit are attractive enough, it is likely that some 
classes of these drugs will remain in circulation. For the psychiatrist, 
the important issues are that these drugs are not picked up on rou- 
tine drug screening; hence, clinicians should remain alert to the 
possibility that young people who present with disturbed behav- 
iour or psychopathology may have used them. Offering a relatively 
cheap, but generally less desirable, high, many people who use these 
drugs could be considered vulnerable in other ways, through their 
young age, coexisting mental illness, or homelessness. At the time 
of writing, synthetic cannabinoid products, which are a particular 
problem within prisons, pose some of the greatest risks. These po- 
tent CB1 receptor agonists not only carry the risk of dependence 
and withdrawal, but their acute use is also at least 30 times more 
likely to lead to emergency medical treatment than high-potency 
herbal cannabis [72]. Symptoms include extreme agitation and a 
transient psychotic state, which will often resolve after cessation of 
use but may persist in some cases for weeks and may require anti- 
psychotics. Acute psychotic mental states need appropriate moni- 
toring and follow-up. Although deaths are still relatively rare, some 
of these NPS can cause significant physical harms such as seizures 
and arrhythmias. Collaborative management with general medical 
colleagues and good clinical documentation, combined with appro- 
priate investigations such as ECGs and biochemical screens, should 
be undertaken. 





Mephedrone (meow meow, M-CAT, 4MMC) 


Background 


Mephedrone (4-methylmethcathinone) is one of a number of 
cathinone compounds (methylone, methcathinone, MDPV, and 
others), which derive from the natural herbal stimulant khat (Catha 
edulis) and are B-keto analogues of amphetamine. Mephedrone 
came to prominence in the UK in 2009 as a cheap and easily ac- 
cessible alternative to MDMA and cocaine (it was legally sold and 
marketed in the UK until April 2010) [73], both of which had been 
of poor quality in years prior to its appearance. After it was made 
illegal, its use among the general population diminished signifi- 
cantly, though access through street dealers and online markets has 
continued. It is most commonly associated with the gay music and 
‘chem-sex’ scenes where it is often taken in conjunction with other 
drugs, including GHB and crystal methamphetamine. 


Preparation and routes of use 


Mephedrone hydrochloride is typically sold as a white crystal- 
line powder. It is water-soluble and is most commonly snorted 
intranasally, with single doses being between 50 and 150 mg. Taken 
intranasally, effects come on swiftly (within 10-20 minutes) and last 
approximately 1-2 hours, with users re-dosing every 30-120 min- 
utes over the course of a session; some heavy users are capable of 
consuming over 5 g. It can also be taken orally, often wrapped in 
cigarette paper, a practice known as bombing, for a slower onset 
of action and less intense onset of effect. It can also be used rec- 
tally and by intravenous injection—the latter being associated with 


significant tissue damage. Injection among naive injecting groups 
can be a particular risk factor for HIV where the sexualized injec- 
tion of drugs such as mephedrone and crystal methamphetamine 
(known as ‘slamming’) can be seen in association with other risky 
sexual practices. 


Mechanism of action and effect profile 


Mephedrone consumption results in typical stimulant-related 
effects—euphoria, increased concentration, talkativeness, urge to 
move, empathy, jaw clenching, reduced appetite, and insomnia are 
most commonly reported. At higher doses, mephedrone has the po- 
tential to cause a toxic sympathetic syndrome, with agitation, panic, 
dehydration, overheating, seizures, cardiovascular dysregulation, 
and paranoid episodes, leading to emergency medical presenta- 
tions [74]. Its mechanism of action is similar to other stimulant 
drugs: extracellular release of monoamine neurotransmitters (dopa- 
mine and serotonin) and inhibition of their reuptake (dopamine, 
serotonin, and noradrenaline). Overall, mephedrone has an effect 
profile similar to MDMA, but with a duration of action and abuse 
liability similar to that of cocaine. Animal models confirm its po- 
tential to cause dependence with mephedrone (high levels of self- 
administration in rat models [75]). With a short duration of action 
and rapid development of tolerance, dependence will be seen among 
a proportion of regular users. The comedown after mephedrone ap- 
pears to be worse than with either cocaine or MDMA. Lowering of 
mood following use (presumably due to acute monoamine deple- 
tion) may be marked and can be associated with elevated suicide 
risk, especially in those with underlying psychiatric conditions [76]. 


Treatment 


Treatment interventions are broadly similar to those for other 
stimulant drugs, with emphasis on harm reduction, self-regulation, 
management of acute toxicity, differentiation between drug intoxi- 
cation/withdrawal, and primary underlying psychiatric condition 
and relapse prevention [77]. 


Gamma hydroxylbutyrate (GHB, GBH, G, fantasy, 


liquid ecstasy, and GBL) 





Background 


GHB is an endogenous short-chain fatty acid found in the central 
nervous system and elsewhere in the body. It is a putative neuro- 
transmitter, and specific binding sites have been identified in the 
hippocampus (linked to dopamine neurons), but its precise role 
is yet to be identified. Trace amounts may also be found in certain 
fruits such as guava. In the UK, GHB is a Class C drug. Along with 
its precursors, including the psychoactive pro-compounds GBL 
(gamma butyrolactone) and 1,4-butanediol, GHB is a colourless, 
odourless, slightly acidic-tasting liquid that, when consumed orally, 
is used for pro-social, sexual, and sedating effects. 

GHB was originally developed as an anaesthetic in the 1960s. In 
the 1980s, it had found clinical utility as a sedative, as a treatment 
for narcolepsy, and as a detoxification agent (it is effective in the 
management of alcohol withdrawal). It is also used (with little evi- 
dence of effect) to promote muscle growth in bodybuilders (through 
its effect on increasing slow-wave sleep). Since the 1990s, however, 


it has become best known for its place among the smorgasbord of 
drugs commonly used by those involved in the dance/rave scene, 
especially those from the LGBT community where its pro-social 
and aphrodisiac qualities have made it popular, particularly in con- 
junction with crystal methamphetamine and mephedrone. Almost 
unique among liquid drugs of abuse, GHB is typically consumed in 
doses of only 1-2 mL; combined with its narrow therapeutic index, 
this implies a high risk of overdose, which is increased in combin- 
ation with alcohol [78]. 


Preparation (pro-drugs), purity, and routes of use 


Until the late 1990s, GHB and its precursors were widely available in 
shops and the Internet as a clear liquid. As an industrial solvent anda 
cleaning product, it was also easy to come across these drugs in other 
guises. For example, GBL was sold in high-street chemists as nail 
polish remover and websites offered litre bottles as alloy cleaners. 
Because of its relative ease of manufacture, attempts at home pro- 
duction also occur but can result in preparations of widely varying 
concentrations; high concentrations can be quite caustic, resulting 
in gastrointestinal discomfort and nausea, with a risk of vomiting 
and aspiration. 


Doses and typical patterns of use 


Most users of GHB are not daily dependent users and will use 
for 1-2 days in association with other drugs, most commonly al- 
cohol, methamphetamine, poppers, and sexual enhancers such as 
sildenafil. Typical doses are 1-2 mL dispensed via an eye dropper or 
a syringe, often mixed with juice or alcohol to disguise the taste, and 
taken every 2-3 hours. Heavy users may dose 3-4 mL every hour, 
with bigger doses at night to aid sleep. While most users will limit 
their use to situations which exploit its pro-social, disinhibitory, and 
aphrodisiac qualities, a minority of users (less than 5%) will extend 
their use over periods of weeks or months. Daily use for more than 
1-2 weeks can lead users to developing tolerance and withdrawal 
symptoms upon cessation, which drives further use. 


Mechanism of action 


GHB readily crosses the blood-brain barrier and acutely leads to a 
transient decrease, followed by an increase, in dopamine levels (and 
an increase in endogenous opioid release). Increases in other neuro- 
transmitters, such as GABA, ACh, and 5-HT, are also seen. At higher 
doses, it exhibits some partial action at GABA-B receptors (epilepto- 
genic). GHB is usually taken orally, often mixed in fruit juice or an 
alcoholic beverage. It has a very rapid onset of action, with notice- 
able effects occurring within 15 minutes of administration; it has a 
relatively short duration of action (half-life 27 minutes), with effects 
peaking at 30-60 minutes and being over within 2-4 hours, being 
eliminated through its breakdown to carbon dioxide and water. 


Physical effects and complications 


GHB exhibits a very narrow therapeutic index, and as a result of wide 
inter-personal variation in tolerance and significantly enhanced 
toxicity (depressant effects) when combined with alcohol, overdose 
with GHB has been reported more frequently than for any other 
dance drug [79, 80]. Overdose should be suspected in someone who 
presents with nystagmus, ataxia, nausea, vomiting, sedation, weak- 
ness, bradycardia, hypotension, and rapid onset of unconsciousness 
(quite similar to severe alcohol intoxication, but without alcohol on 
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the breath). Management should include placing the person in the 
recovery position, airway management, and pulse oximetry. GCS 
scores may be very low (<6). If oxygen saturation drops or the pa- 
tient is so unconscious that they can tolerate a Guedel airway, then 
ventilation should be considered. Overdoses are short-lived, and 
most awake somewhat aroused and disorientated after a few hours. 
Other clinical presentations include agitation, anxiety, coma, am- 
nesia, seizures, and collapse. These patients, when in a coma, may 
require ventilation and typically suddenly emerge from their coma 
with high levels of agitation, arousal, and violence. 


Longer-term use, behavioural risks, and dependence 


GHB can cause marked disinhibition and amnesia. This can result 
in people engaging in high-risk sexual practices or falling victim to 
sexual assault. Shame and uncertainty over what may or may not 
have happened can cause significant emotional distress to individ- 
uals. Until recently, dependence on GHB was considered rare among 
users, but with the apparent escalation in use across many countries, 
increasing numbers of people are coming forward, seeking treat- 
ment for dependence and the management of withdrawal. Tolerance 
and withdrawal can be seen among users who persist with daily use 
for more than a few weeks. Continued use outside party environ- 
ments may be motivated by GHB’s amelioration of the comedown 
from other drugs, facilitation of sleep, or improved anxiety and 
confidence in social situations. Dependence is characterized by the 
development of tolerance (bigger doses taken at shorter intervals), 
intense craving, continued use despite evident harms, and a well- 
defined withdrawal syndrome. Withdrawal from GHB, like alcohol, 
sits on a spectrum and is dominated by autonomic and psychological 
arousal. Among severely dependent users, withdrawal commences 
within 2-4 hours and can escalate rapidly, from early symptoms of 
craving, anxiety, sweating, and tremor to confusion, disorientation, 
hallucinations, and seizures. Untreated/unplanned withdrawal can 
be life-threatening, and unplanned withdrawal can require huge 
doses of benzodiazepines to manage the high levels of autonomic 
instability, delirium, and seizure risk. Planned withdrawal can be 
safely done on an outpatient basis by experienced clinical teams. The 
basic principles of treatment are an initial planning and assessment 
phase, with encouragement to stop any concurrent stimulant use (as 
it tends to lead to larger GHB usage). A tapered reduction in dose 
over the following week is sometimes supported by preloading with 
baclofen, which seems to reduce craving for GHB and subsequently 
reduce the need for high-dose benzodiazepines during the active de- 
toxification phase. The aim is to lower total daily doses to below 25 
mL/day before cessation. At cessation of GHB use, the acute with- 
drawal syndrome is managed by diazepam (typically in the region 
of 30-60 mg/day), given in 3-4 divided doses, tapering down over 
7-10 days. Baclofen may be continued throughout the detoxifica- 
tion period and can be useful to continue for 2-3 weeks following 
detoxification completion, to aid sleep and manage residual craving 
and withdrawal. 


Harm reduction strategies for GHB users 


Given the small difference between a dose that induces euphoria 
and stimulation and a toxic dose, measuring doses with a syringe 
(1.0-1.5 mL would be a typical dose) or using pre-filled capsules 
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can reduce the risk of accidental overdose. Variations in preparation 
purity suggest test dosing from new batches is advisable. Avoiding 
mixing with alcohol, which increases the risk of overdose through 
synergistic respiratory depressant effects, is advisable, as is add- 
itional stimulant use, which is often associated with increased GHB 
consumption. Finally, avoiding use on more than 2-3 consecutive 
days can avoid the development of tolerance and of dependence and 
withdrawal in the first place. 


MDMA, cocaine, methamphetamine, and GHB are all capable of 
producing acute adverse psychological experiences in normal users 
and exacerbating symptoms in those with underlying psychological 
disorders. They vary widely in their ability to induce dependence and 
in their long-term neuropsychiatric consequences. For the psych- 
iatrist, the most common problems will be assisting in managing 
acute time-limited adverse psychological reactions, including with- 
drawal, and ensuring adequate follow-up assessment is provided. 
This is required to review the need for longer-term medication and 
to ensure that the differentiation between an underlying primary/ 
precipitated disorder or a de novo condition that is time-limited and 
resolved with maintained abstinence is made. Because of the signifi- 
cant potential of the drug to cause physical harm, attention should 
also be paid to clinical examination, biochemical investigation, 
BBV screening, and liaison with medical specialists. The provision 
of harm reduction advice and strategies to support self-regulation, 
combined with the treatment of underlying conditions that may be 
driving continued use, will also form important aspects of clinical 
practice. In those who present with acute drug-related psychological 
symptoms, there should be an emphasis on follow-up since, in some 
cases, the symptoms will represent the onset of a persistent inde- 
pendent disorder which requires treatment. Users who have experi- 
enced acute psychological problems should be encouraged to make 
the attribution that there may be something inherent in them that 
makes them susceptible to experiencing unpleasant reactions with 
a drug; they are likely to remain vulnerable to those adverse ex- 
periences. This may be difficult to accept for potentially vulnerable 
young people who may prefer to think that the experience was not 
enjoyable because the drugs were not good—‘it was a bad pill’ 


FURTHER INFORMATION 


Erowid. http://www.erowid.org 

European Monitoring Centre for Drugs and Drug Addition. http:// 
www.emcdda.europa.eu 

Global Drug Survey. http://www.globaldrugsurvey.com 

NEPTUNE. http://www.neptune-clinical-guidance.co.uk 
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The psychedelics are a distinct group of drugs, the primary effects of 
which are alterations in perception, mood, and thought processing 
in clear consciousness and usually with retained insight. They do not 
cause dependence, are physiologically non-toxic, and harms to the 
end user and society are notably low in comparison to other illicit 
drugs [1]. 

The term ‘psychedelic’ literally translates from the Greek as 
‘mind-manifesting, a term chosen by the psychiatrist Humphrey 
Osmond in 1957 to denote a curious and putatively therapeutic 
tendency to ‘unmask repressed elements of the psyche. The two 
most commonly used psychedelics in the UK are LSD (d-lysergic 
acid diethylamide), a semi-synthetic compound discovered 
in 1938 by Albert Hofman, and psilocybin (N,N-dimethyl-4- 
phosphoryloxytryptamine), which is found in, among others, the 
psilocybe group of ‘magic’ mushrooms that grow throughout the UK 
and worldwide. Less commonly encountered are mescaline (3,4,5- 
trimethoxyphenethylamine), found in the North American peyote 
cactus, and DMT (N,N-dimethyltryptamine), the principal compo- 
nent of the South American ceremonial sacrament ayahuasca. Other 
examples include the ‘2C’ group of psychedelics (for example, 2C-B, 
2C-B-FLY, 2C-I, and 2C-E) and a wide range of recently developed 
synthetic psychedelics, including the NBOMes. 

Classical psychedelics are divided into members of three chem- 
ical classes (Fig. 54.1): (1) tryptamines (for example, psilocybin); 
(2) ergolines (for example, LSD); and (3) phenethylamines (for ex- 
ample, mescaline and 2C-B). There are scores, if not hundreds, of 
compounds subsumed within these classes that have psychedelic 
effects. All classical psychedelics are active at the 2a subtype of 
the serotonin receptor (5-HT2a), with effects being strongly dose- 
related. Users of psychedelics typically report changes under five dis- 
tinct phenomenological categories: 


1. Perceptual distortions, most commonly visual, including syn- 
aesthesia (crossing of sensory modalities, for example, hearing 
colours). 


N 


. Changes in somatic experience—visceral, tactile, and inter- 
oceptive. 

Changes in affect and mood, including, but not limited to, ela- 
tion, euphoria, anxiety, and paranoia. 


w 





A 


. Distortions in thought processing, from changes in semantic 
attribution and belief structures to blurring and dissolution of 
conceptual boundaries (for example, between ‘self’ and ‘other, 
also known as ‘ego dissolution’). Increasing confusion occurs at 
higher doses or in cognitively destabilizing environments. 

5. Entheogenic experiences: spiritual, transcendent, often ineffable 

experiences. 


Hictary 
history 


Psychedelics have a long relationship with the human species, 
which contextualizes current and evolving attitudes towards them. 
Evidence for the use of psychedelics (particularly psilocybin and 
mescaline) dates back further than for any other psychoactive sub- 
stance, particularly in ceremonial, healing, and spiritual rituals. 
Depictions of mushrooms and temples built to mushroom ‘de- 
ities and use of the word teonandcatl, or ‘flesh of God; to denote 
psilocybin-containing mushrooms in indigenous cultures in Mexico 
and Guatemala date back to around 7000 Bc. Records of the Greek 
‘Eleusian Mysteries’ suggest that the ceremony surrounding them 
was based on a psychedelic compound. The modern history of psy- 
chedelic use began when Arthur Heffter, a German chemist, isolated 
mescaline from the peyote cactus in 1897. However, it was not until 
the development of LSD that the medical use of psychedelics began 
to be investigated in earnest. 

LSD was first synthesized by the Swiss chemist Albert Hofman 
in 1938, as part of a systematic investigation of compounds related 
to the 5-HT1 receptor agonist ergotamine [2]. Ergotamine, one of 
the ergot alkaloids, is produced by the parasitic rye fungus Claviceps 
purpurea and was responsible for the outbreaks of mass poisoning 
from consumption of spoilt stocks of rye in the Middle Ages, a con- 
dition known as St. Anthonys fire. LSD was found to be unremark- 
able in animal testing; however, Hofman resynthesized it on a hunch 
in 1943. After accidentally contaminating himself with a small 
amount, he experienced what was probably the world’s first ‘acid 
trip. Further investigation of LSD found it to be extremely potent 
(the threshold dose for subjective effects is approximately 50 ug), but 
notably physiologically safe, with animals surviving overdoses that 
were relatively much larger than other psychoactive drugs. 

Reassured by its physiological safety and recognizing that it might 
be of much interest to psychiatrists, LSD was marketed under the 
tradename Delysid’. Between 1947 and 1967, LSD and, to a lesser 
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Classical psychedelic chemical classes 
Class Ergolines Phenethylamines Tryptamines 
NH 

Basic 
structure 

/ NH, 

HN 
Common LSD (d-lysergic nae hoa a Psilocybin (N,N-dimethyl-4- 
example acid diethylamide) phenethylamine) phosphoryloxytryptamine) 
Fig. 54.1 Three chemically distinct classes of classical psychedelic compounds. 





extent, mescaline and psilocybin were widely used as therapeutic 
agents, particularly as ‘catalysts’ in psychotherapy [3]. LSD also 
came to the attention of the American secret service where it was 
investigated as an agent of ‘mind control’ under the notorious MK 
Ultra programme, which was finally shut down in 1973 [4]. LSD was 
found to be ineffective for such purposes, instead rendering civil- 
ians and troops liable to disorganized behaviour and indifference 
to authority. The proselytization of LSD by the disgraced Harvard 
professor Timothy Leary led to its increasing recreational use, and 
it became a counter-cultural symbol against the status quo. Medical 
concern was raised for psychological toxicity in certain vulnerable 
groups [5]. 

Partly because of medical concern, and partly because of the 
counter-cultural opposition to the Vietnam War and conservative 
values in general, the Nixon administration of the time heavily crim- 
inalized psychedelics, along with many other psychoactive drugs. 
This dramatic escalation of the so-called ‘war on drugs’ stopped all 
medical use and scientific research, while pushing recreational use 
underground. A systematic media-led campaign of demonization in 
the 1960s and 1970s led psychedelics, particularly LSD, to be among 
the most heavily stigmatized of all psychoactive drugs, a stigma that 
is still present today. 


Legal status in the UK 


Almost all classical psychedelics fall under the remit of the UK 
Misuse of Drugs Act, 1971, where they are generally Schedule 1, 
Class A compounds. As such, they are legally defined as having the 
maximum potential for harm and dependence and no identified 
medical use, a definition that does not survive objective scientific 
scrutiny [1, 6]. Until the passing of the Psychoactive Substances Act, 
2016, a number of chemical analogue compounds (for example, 2C- 
B-FLY and 1P-LSD) with similar effect profiles were legally avail- 
able for purchase online and from so-called ‘head shops. While head 
shops have now closed, the simple act of possession of these sub- 
stances for personal use, unlike those classified under the 1971 Act, 
is not considered an offence under the 2016 Act. 


Preparation, purity, and routes of use 


Psilocybin occurs in a wide variety of mushrooms, including (but 
not limited to) the genus Psilocybe, which grow throughout the UK. 


They are most profligate in wet periods during early autumn. Liberty 
cap mushrooms are the species most commonly encountered in the 
UK, with doses of 10-20 mushrooms being a typical starting dose. 
The greatest risk of harm comes from picking the wrong type of 
mushroom, as some are poisonous. Psilocybin-containing mush- 
rooms can be eaten raw, cooked, or dried and are often infused in hot 
water or added to other foodstuffs. Psilocybin is heat-sensitive, and 
prolonged cooking or boiling results in its degradation. Possession 
of psilocybin-containing mushrooms is an offence under the Misuse 
of Drugs Act, 1971. 

LSD can be chemically derived from the ergot alkaloids that occur 
in the parasitic grain fungus Claviceps purpurea. However, cultiva- 
tion of the fungus is unusually difficult, and the sequelae of acci- 
dental ingestion potentially fatal. It is usually chemically synthesized 
from lysergic acid or 3-indolepropionic acid. LSD is extremely po- 
tent, with 1 g making 10,000 moderate (100 ug) oral doses. It is sol- 
uble in either water or alcohol and is colourless and tasteless; it is 
therefore difficult to detect without sophisticated testing apparatus 
and is easily concealed and transported. Drops of liquid LSD can 
be sold for immediate consumption from dropper bottles or, more 
commonly, drops are absorbed onto squares (‘tabs’) of blotting 
paper, gelatin, sugar cubes, or small volumes of a binding material to 
form what is known as a ‘microdot’ In recent years, novel potent psy- 
chedelics with a riskier effect profile have been sold as LSD tabs, such 
as 2-(4-iodo-2,5-dimethoxyphenyl)-N-[(2-methoxyphenyl)methy]] 
ethanamine (25I-NBOMe). 


Mechanism of action and metabolism 


Classical psychedelics are broad serotonergic agonists. However, the 
subjective effect appears to be derived from partial agonism at 5-HT2a 
receptors that are predominantly expressed on pyramidal neurons 
within layer V of the human neocortex [7]. Thus, cross-tolerance oc- 
curs between different psychedelics, the degree of subjective potency 
is robustly associated with affinity at the 5-HT2a receptor, and antag- 
onists at the 5-HT2a receptor (for example, risperidone) abolish the 
subjective effects of psychedelics. However, some 5-HT2a agonists 
(for example, lisuride) fail to produce psychedelic effects, suggesting 
that 5-HT2a agonism, while necessary, is not sufficient [8]. The psy- 
chedelic effect may also be dependent on the type 2 metabotropic 
glutamate receptor, as mice deficient in the corresponding gene are 


unresponsive to psychedelics [9]. A summation of the current evi- 
dence suggests that psychedelics may act via a heteroreceptor complex 
of these two receptor types, modulating the balance of downstream 
second messenger systems and changing the phase of firing of layer 
V pyramidal neurons. Within cortical neural networks, there is 
widening of the measured local field potentials and human EEG re- 
cordings shows characteristic suppression of alpha waves; how this 
leads to the observed neuroimaging changes, subjective effects, and 
putative therapeutic effects is far from clear. While other subtypes of 
the 5-HT receptor are also stimulated by psychedelics, their contribu- 
tion to the subjective effect is likely to be only minor. 

Approximately 50% of ingested psilocybin is absorbed through 
the digestive tract. Psilocybin is biologically inactive but is rap- 
idly dephosphorylated to the active molecule psilocin. Psilocin is 
hepatically metabolized by monoamine oxidase; however, some 
undergoes glucuronidation and is excreted renally. The elimination 
half life of psilocybin is 50 minutes and for psilocin is 1.25 - 2.5 
hours, depending on whether psilocybin is administered intraven- 
ously or orally, respectively [10]. 

LSD is completely absorbed in the digestive tract, with a maximal 
plasma concentration achieved after about 90 minutes. LSD has a 
half-life of approximately 3.5 hours in both men and women and 
shows first-order kinetics. It is hepatically metabolized to a variety 
of different compounds, and only about 1% of the original drug is 
excreted in its original form in the urine. The principal metabolite is 
probably 2-oxo-3-hydroxy-LSD [11]. 


Recent brain scanning findings 


Modern neuroimaging research suggests that psychedelics are nei- 
ther stimulants nor depressants of brain activity. Instead, fMRI 
measures of ‘functional connectivity’ have suggested that brain net- 
work integrity is decreased under psychedelics and cross-talk be- 
tween normally well-differentiated brain regions and networks is 
increased. There is some evidence that these effects correlate with 
the subjective effects of psychedelics; decreased default-mode net- 
work integrity and increased global connectivity have been found to 
correlate with ratings of ego dissolution under LSD, and increased 
communication between the primary visual cortex and the rest of 
the brain correlated with ratings of eyes-closed visual imagery [12]. 


Prevalence 


Of the classical psychedelics, the lifetime prevalence of LSD and 
psilocybin ingestion is largely similar, while more users report past 
year usage of psilocybin than LSD. Use is more common among 
those in the dance music and rave scenes. The most recent British 
Crime Survey (2017/2018) suggested that, while lifetime use has re- 
mained static over the last 20 years at about 9% among 16- to 59- 
year olds, past year reported use of psychedelics by this group has 
fallen from 1.3% in 1996 to 0.7% in 2017/2018 (average 0.7%). The 
fall in use appears most marked in younger groups, with 6.6% of 
16- to 24-year olds reporting lifetime use of psychedelics in 2017/ 
2018, compared to 16.1% reporting lifetime usage in 1996. Past year 
use among younger age groups has also fallen from 5.3% in 1996 to 
2.3% in 2017/2018. In the UK, psychedelic use is notably rare among 
Asian, Black, and Chinese ethnic groups. Psychedelic usage does not 
tend to cluster in particular social classes. The most common pat- 
tern of use is one of episodic experimentation fading into abstin- 
ence. Chronic use is very uncommon, but of those who have used 
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psychedelics for decades, it is usual that they claim no lasting harms 
and claim considerable benefits. 


Harms 
LSD, psilocybin, and mescaline 


Despite politically led and media-driven rhetoric and the general lay 
perception that the classical psychedelics are dangerous, they are ob- 
jectively one of the safest known classes of CNS drugs, and death due 
to direct toxicity of LSD, psilocybin, or mescaline is almost unknown 
in the literature despite well over 50 years of recreational use [7, 13]. 
This is likely attributable to their lack of activity on neurotransmitter 
systems that control basic physiological functions and their partial 
agonist property at serotonin receptors [14]. Overdose generally re- 
quires cardiac monitoring and supportive management in a com- 
fortable, low-stimulus, reassuring environment. Benzodiazepines, 
such as lorazepam or diazepam, are the first-line treatment for symp- 
tomatic relief from emotional distress or behavioural disturbance. 
Significant behavioural disturbance after benzodiazepine treatment 
should raise the suspicion that other drugs have been taken. 


Other psychedelics 
The classical ‘2C’ phenethylamine compounds 


There are a large number of synthetically derived psychedelics. 
The classical ‘2C’ phenethylamine compounds, such as 2C-B, 2C- 
I, and 2C-E, as well as various tryptamine-derived psychedelics, 
were synthesized and self-administered by the chemist Alexander 
Shulgin in the 1960s and 1970s. His experiences and the chemical 
synthesis were described in the books TiHKAL and PiHKAL and in 
many online reports of use. Of these, 2C-B is the most popular and 
is generally well tolerated. Treatment of overdose is as per the other 
classical psychedelics. 


251-NBOMe 


A number of substituted 2C compounds have been developed since 
the turn of the century, and concern for physical toxicity exists for 
25I-NBOMe (N-Bomb), which is the N-benzyl derivative of 2C-L. It 
is exceptionally potent, to the extent that it can be used as a substi- 
tute for LSD on blotter papers [15]. Unlike classical psychedelics, it 
is a full agonist at the 5-HT2a receptor. It appears to promote both 
vascular constriction and platelet aggregation in overdose, placing 
the user at risk of acute cardiac, CNS, and limb ischaemia, as well as 
serotonin syndrome. Reports of deaths directly attributable to 251- 
NBOMe have accumulated since its emergence as a recreationally 
used substance in around 2010, and it remains a concern at the time 
of writing. LSD is tasteless, and therefore users should be advised 
to be suspicious of LSD blotters tasting of anything other than the 
blotter paper itself. 


Dimethyltryptamine 


Dimethyltryptamine (DMT), the active component of ayahuasca, 
has recently become increasingly popular with the rise of smoked 
DMT and the commercialization of the ceremonial aspects of the 
drugs [16]. When smoked, it leads to a short-lived (less than 2 
hours), but intense, psychedelic experience. Ayahuasca is an oral 
preparation usually derived from the Banisteriopsis caapi vine. DMT 
when taken orally needs to be consumed with harmine or another 
compound with monoamine oxidase-inhibiting (MAOI) activity. 
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Psychological effects and complications 


The psychological effects of psychedelics are dependent on the psy- 
chological mind ‘set’ of the user and the environmental ‘setting’ of the 
experience. Experienced recreational drug users rank psilocybin and 
LSD as having low harm and high benefit potential in comparison to 
other drugs [17]. However, recreational use is not without its risks, 
particularly in psychologically sensitive individuals and in psycho- 
logically destabilizing environments. Those with a personal or family 
history of severe and enduring psychotic disorders should be par- 
ticularly discouraged from experimentation with psychedelics. 

In a recent web-based survey of difficult or negative experiences 
during psilocybin use, 62% of 1993 respondents rated the experience 
as being within the top ten most challenging experiences of their 
lives; yet 84% reported that they benefited in some way from this, 
and 76% reported increases in current well-being or life satisfaction. 
Thirty-four per cent and 31% reported the experience to be in the top 
five most personally meaningful and spiritually significant experi- 
ences in their lives, respectively. In contrast, 8% reported a decrease 
in well-being or life satisfaction as a result of the difficult experience, 
and three respondents reported that they had attempted suicide or 
suffered prolonged psychotic experiences [18]. This result is con- 
sistent with the observation that psychedelics unmask repressed (and 
hence somewhat unpredictable) elements of the psyche, but that the 
successful integration of such elements within conscious experience 
can be beneficial to the overall psychological well-being. This under- 
lines the importance of set and setting and speaks to the possible 
therapeutic potential of psychedelics in medically controlled settings 
where both set and setting can be optimized and the risks minimized. 


Hallucinogen persisting perceptual disorder 


Hallucinogen persisting perceptual disorder (HPPD) was included 
in DSM-IV as an attempt to capture a syndrome whereby users of 
psychedelics report distressing residual effects long after the drug 
has cleared from the body. A summation of the evidence for HPPD 
in 2002 concluded that while the disorder probably existed, it was 
poorly defined, rare, and aetiologically opaque [19]. A variety of the- 
ories have been put forward, ranging from a direct neurotoxic effect 
of the drug to a negative psychological reaction to the effect of the 
drug. Direct neurotoxicity appears unlikely, particularly since ex- 
perience with recreational use is extensive and massive overdoses of 
psychedelics have been survived without apparent clinical sequelae. 
An idiosyncratic premorbid biological sensitivity in some individ- 
uals cannot be ruled out, but it appears more likely that some users 
are psychologically sensitive to the effects of the psychedelic experi- 
ence, perhaps becoming more ‘aware’ of the fluctating functioning of 
various aspects of experiential processing. Consequently, the symp- 
tomatology reported within the scope of HPPD is widely variable 
and no particular medical treatment has become prominent. In most 
cases, HPPD appears to resolve over time with psychological support. 


Other consequences of use 
Dependence and withdrawal 


There is no evidence that psychedelics cause physiological or psy- 
chological dependence. Indeed they may be a treatment for phys- 
ical dependence, with the evidence most strong for alcoholism [20]. 
Complete tolerance to the classical psychedelics develops, usually 
within 72 hours, and full resensitization takes 1 week or longer. No 


defined withdrawal syndrome has been identified, although ten- 
sion headaches responsive to simple analgesia are common in the 
24 hours after use. 


Psychedelic-induced psychosis 


Brief, self-limiting psychotic episodes can occur while intoxicated 
with psychedelics and are more common in naive users, those with 
a past history of psychiatric illness, and users of longer-acting, more 
potent substances such as LSD. Such episodes, subsumed along with 
anxiety and panic reactions under the term ‘bad trip, can be more 
likely if the drug is consumed in unfavourable environments or at a 
time when the person feels uncertain or unsafe. 

Many people report negative or difficult-to-deal-with experi- 
ences when taking psychedelics. For many, working through the 
experience with the support of a caring, empathic other can itself 
be a useful and enriching experience. Medical management is sup- 
portive, with cessation of drug consumption, psychological support, 
and sedation with benzodiazepines, if necessary. 

In the pre-prohibition era, psychedelic drugs were used on a wide 
variety of psychiatric patients and disorders. While some observed 
that they appeared to ‘unmask’ psychotic disorders in high-risk in- 
dividuals, this was tempered by evidence from observational studies 
suggesting that the rate of psychedelic-induced schizophrenia was 
probably no higher than chance. Even so, the current clinical con- 
sensus is that psychedelic drugs should be avoided by those at high 
risk of developing psychotic disorders. 


Use in therapy 


Prior to prohibition in the late 1960s, thousands of patients, reported 
in many hundreds of academic papers, were treated with psychedelics 
(usually LSD) in Europe and the United States, with six international 
conferences on psychedelic-catalysed psychotherapy organized during 
this period [21]. Their use was much wider than is usually appreciated 
today [13]. This use swiftly declined after the 1967 United Nations dec- 
laration on drugs, which legally defined them, paradoxically, as having 
no medical use and placing stringent restrictions on their manufacture, 
supply, and prescription. While pre-prohibition studies of the thera- 
peutic efficacy of psychedelics, in common with many clinical studies 
at the time, were of suboptimal standard, good-quality evidence for 
efficacy was found in alcoholism. A modern meta-analysis of six con- 
trolled trials found a substantial therapeutic effect of LSD over placebo 
in objective measures of alcohol use [20]. A recent systematic review 
of pre-prohibition evidence in unipolar mood disorders was also 
broadly encouraging [22]. Since 2006, small pilot studies have shown 
promising results in anxiety associated with advanced cancer [23], 
obsessive-compulsive disorder [24], tobacco [25] and alcohol [26] 
addiction, cluster headaches [27], and treatment-resistant unipolar 
depression (TRD) [28]. Two RCTs of psilocybin in existential distress 
associated with life threatening cancer were published in 2016 [29, 30]. 
RCTs with psilocybin for TRD are now under way [13]. 


Introduction 


Ketamine is a dissociative anaesthetic that exerts its action primarily 
through non-competitive antagonism at the NMDA receptor. It is 


almost unique as an anaesthetic in its ability to produce a ‘dissocia- 
tive’ state, which results in higher brain structures being prevented 
from perceiving auditory, visual, or painful stimuli, leading to ‘a 
lack of responsive awareness’ [31]. Over the last 20 years, its use as a 
recreational drug has spread, and although there are concerns that 
longer-term use may be associated with serious physical (urinary 
tract) and psychological (cognitive impairment) harms, ketamine 
does benefit from having a wide margin of safety in overdose, with 
the greatest risk for most users being that of trauma or accidental 
harm while intoxicated. 


Background and legal status 


Ketamine is a versatile medication and is in widespread use in both 
general, paediatric, and emergency medicine (in anaesthesia and 
pain management). It is also a remarkably safe drug in clinical prac- 
tice where the pronounced analgesia and amnesia occur without 
any significant depression of cardiac or respiratory function and 
maintenance of the gag reflex. The WHO has listed it as an essential 
medicine. Emergence phenomena (experienced as people recover 
from anaesthetic doses of the drug) encompass a range of unusual 
psychological experiences, including hallucinations, near-death 
and out-of-body experiences, delirium, and confusion; these have 
limited its use in wider clinical settings [32]. Ketamine has been part 
of the illicit drug scene in the UK since the 1990s. Currently, it is 
most commonly seen among those involved in the clubbing scene, 
especially trance music events. In June 2014, ketamine changed 
from a class C to a class B controlled drug in the UK. 


Preparation, purity, and routes of use 


Ketamine is typically sold as a crystalline powder for intranasal use 
(usual doses 50-250 mg), though it can also be found in liquid, tablet, 
or capsular form for ingestion. When obtained from diverted licit 
sources, the formulation of the drug is a solution prepared for intra- 
venous use, which is then dried and taken intranasally. Ketamine 
may be adequately absorbed via the intranasal, intravenous, subcuta- 
neous, intramuscular, rectal, and intrathecal routes, with snorting and 
injecting being the most common recreational routes of use. 


Mechanism of action and metabolism 


The most significant pharmacological action of ketamine is non- 
competitive antagonist binding at the cation channel of the NMDA 
receptor. Ketamine has a range of other actions; it is a weak u re- 
ceptor agonist, induces the release of dopamine, blocks muscarinic 
acetylcholine receptors, and may potentiate the effects of GABA 
synaptic inhibition. Ketamine undergoes marked first-pass metab- 
olism with the production of its less potent metabolite norketamine. 
Orally ketamine results in a more sedative and less psychedelic ex- 
perience. Most of the parent drug will be eliminated from the body 
within 24 hours. 


Prevalence 


Findings from the 2017/2018 Crime Survey of England and Wales 
suggest that the lifetime use of ketamine among 16- to 59-year olds 
(2.8%). Past year rates of use are on par with these more traditional 
hallucinogenic drugs, with 0.8% of those aged 16-59 years reporting 
use in the last 12 months. Rates are higher among younger users and 
those involved in the clubbing scene where last year rates of over 
25% have been reported [33]. 
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Patterns of use 


Compared to commonly used stimulant drugs, the abuse liability 
of ketamine is low, but compared to LSD and psilocybin, ketamine 
is far more likely to be associated with compulsive patterns of use, 
the development of tolerance, and acute problems [34]. Ketamine 
may also lend itself to binge use due to its shorter duration of ac- 
tion, compared to LSD and psilocybin. Typically, users will consume 
less than 0.5 g/day, though because tolerance can develop quickly, 
chronic regular users may use in excess of 5 g/day. Ketamine is 
taken in combination with other club drugs, with polyuse typically 
increasing the risk of acute harm. 


Psychological effects and complications 


Effects are highly sensitive to age, dose, route, set, and setting. 
While low doses can be stimulating with elevation in mood, dis- 
tortion of time and space, and enhanced sensory perception, feel- 
ings of ‘melting into one’s surroundings’ are common. At higher 
doses, ketamine induces the full range of psychedelic effects with 
emotional, cognitive, and perceptual distortion, including out-of- 
body experiences and visual hallucinations [35] (Table 54.1). The 
Harvard academic Timothy Leary described it as ‘the ultimate psy- 
chedelic journey. Users describe entering the K hole’ where they 
experience visits to god, aliens, their birth, past lives, and the ‘experi- 
ences of evolution: Some users report taking issues of set and setting 
into careful consideration prior to using ketamine. Such preparation 
cannot be performed if the drug is consumed unwittingly when it 
has been marketed under the guise of another drug such as ecstasy. 
Being an amnestic, it may become difficult to remember the total 
doses consumed. 

The relationship between long-term ketamine use and elevated 
depression scores has been investigated among both current and ex- 
users. Although heavier users were more likely to score more highly, 
these were not at clinically significant levels and there was no rela- 
tionship with changes in ketamine use [36]. 

Ketamine can also produce a psychotic picture that superficially 
mimics schizophrenia [37]. Both positive and negative symptoms of 
schizophrenia can be transiently seen in normal users, and its use can 
exacerbate symptoms in those with pre-existing psychotic disorders. 
Other adverse effects can include frightening hallucinations/out-of- 
body experiences, thought disorder, confusion, and dissociation. 
Such episodes tend to be short-lived, resolving in a few hours or at 
the most a few days. In many respects, these are like those adverse 
effects seen with LSD, though with ketamine, they come on after a 
shorter period following use and recede more quickly. (Table 54.1). 


Acute complications and harms 


Ketamine has a wide margin of safety within medical settings, and 
although relatively safe in overdose, in combination with ethanol 
or other CNS depressants, the use of ketamine can result in death. 
In recreational settings, the greatest risk of harm is from accidental 
injury, for example trauma, drowning and hypothermia secondary 
to immobility, and vulnerability to assault [38]. Rarely, more severe 
complications are reported, including severe agitation and rhabdo- 
myolysis. Within accident and emergency departments, the most 
common presentations are impaired consciousness, abdominal pain 
(known as K cramps), lower urinary tract symptoms, and dizzi- 
ness. Ketamine’s cardiovascular risks are enhanced when it is taken 
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Table 54.1 Ketamine: psychological and physical effects 


Psychological Physical 


Rapid onset, short duration of action (1 hour), wide 
safety margin 





Dissociative anaesthesia, ‘somatosensory blockade’ Dilated pupils 
analgesia 
Perceptual distortion/hallucinations/near death Tachycardia 


Out-of-body experience Hypertension 


Thought disorder/synaesthesia Ataxia 
Emergence phenomena Paralysis 
Cognitive impairment Sweating 


Amnesia Hypersalivation 


Derealization/depersonalization Little effect on cough 


reflex 


with stimulant drugs. Because of its fast urinary excretion (within 
2 hours), the ability to identify ketamine in urine screens is almost 
impossible, and thus a level of clinical suspicion is required, espe- 
cially ifa history of its use is not forthcoming. Detection by clinical 
examination relies on identifying elevated blood pressure, moderate 
tachycardia, abdominal tenderness, and white powder rings on the 
nose [39]. 


Ketamine bladder 


Heavy regular use of ketamine can result in progressive inflamma- 
tion, ulceration, and scarring of the bladder. Known as ketamine- 
induced cystitis or ketamine bladder, ketamine, or one of its 
metabolites, is thought to induce apoptosis (cell death). Early detec- 
tion and harm reduction should be part of any clinical consultation 
with ketamine users. Any clinical assessment of ketamine users must 
include enquiry as to the presence of symptoms suggestive of bladder 
damage, including frequency, dysuria, lower abdominal pain, and 
haematuria [33]. Early referral to urologists and addiction special- 
ists for co-ordinated interventions is recommended. Complete ces- 
sation of use in cases of ketamine bladder must be encouraged, since 
in the worst cases, the bladder may be lost, which is an exceedingly 
unpleasant outcome. 


Harm reduction 


Like other harms, ‘ketamine bladder’ appears to be dose-related and 
is more likely to occur in heavy, dependent users. It may be more 
likely to be seen in those who drink alcohol when they use ketamine, 
and it is wise to remain well hydrated when using the drug. Purity 
between batches can vary, and given the marked dose-response ef- 
fect of the drug, careful measurement of doses using milligram scales 
and test dosing new batches are advisable [40]. Intravenous use is to 
be avoided, being a risk for overdose and prolonged immobility, with 
associated vulnerability to environmental harms. Rapid referral to 
specialist services for those reporting bladder symptoms is advised. 


Dependence and withdrawal 


Animals models suggest that ketamine has the potential to cause de- 
pendence, with rats showing re-administration, conditioned place 
preference, and locomotor sensitization following repeated admin- 
istration. However, in man, dependence on ketamine is rare, with 


most of the literature coming from small case reports [41]. Regular 
users of ketamine report the rapid development of tolerance and 
compulsive patterns of use with loss of control and continued use 
despite harms. Evidence for a specific withdrawal syndrome is 
lacking, though upon cessation use, some dependent users report 
craving and sympathomimetic lability with shaking, sweaty, irrit- 
ability, and insomnia [42]. Among those with pain associated with 
bladder damage, persistent use may be driven by a desire for anal- 
gesia. While cessation can be associated with mood and behavioural 
disturbances, a well-defined physical withdrawal syndrome is not 
seen. Symptomatic relief of agitation and insomnia, along with an- 
algesia, if required, combined with good psychological support, re- 
mains the mainstay of managing withdrawal. There are no specific 
psychological therapies, but relapse prevention can be supported by 
the adaptation of CBT and MET, while addressing comorbid condi- 
tions and any other individual psychological needs. 


Cognitive impairment 


There has been considerable research into the neurocognitive con- 
sequences of ketamine. Among infrequent users, ketamine does not 
appear to be associated with any significant impairment. However, 
frequent, heavy use is associated with marked deficits in both short- 
and long-term memory [43], most notably deficits in visual recog- 
nition and spatial working memory. Cognitive impairments may 
not only interfere with a person's ability to perform in their normal 
working or social roles but can also interfere with their treatment. 


Ketamine in treatment-resistant depression and 
alcohol dependence 


Recent work has highlighted the potential of ketamine as a thera- 
peutic agent in those with TRD, typically using intravenous infu- 
sions 2-3 times per week, usually over several weeks. Initial results 
have been promising [44] and suggest the drug is well tolerated and 
effective (see Chapter 72). An intranasal formulation of an isomer of 
ketamine (esketamine) has recently been licensed for the treatment 
of depression in the United States. Ketamine is also being explored 
as a treatment for those with dependence, in particular alcohol [42]. 
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Tobacco use is the main preventable cause of premature death world- 
wide. Currently, about 5 million people die from smoking-related 
disease each year, and this number continues to grow. In most high- 
income countries, smoking rates have declined dramatically in the 
last 20 years, but they are increasing in many low- and middle- 
income countries [1]. The tobacco epidemic within high-income 
countries has been described in four stages [2] (Fig. 55.1). The first is 
marked by the widespread uptake of smoking in a population, espe- 
cially by men, and almost no smoking-related deaths. In the second, 
uptake from men continues to increase and women start taking up 
smoking too, while smoking-related deaths begin to occur. In the 
third, smoking rates in men start declining, as do rates in women 
towards the end of this phase, and smoking-related deaths rise to 
about 25-30% of all deaths in men. In the fourth, smoking-related 
deaths in men peak early in this period, and a couple of decades 
later the same happens in women. Smoking rates keep declining in a 
similar fashion in men and women, although the rate of the decline 
begins to slow down. This model was developed based on obser- 
vations in high-income countries, but it has been suggested that it 
is still largely applicable to low- and middle-income countries, al- 
though it may not be as applicable to women; the gender gap seems 
to remain large in many of these countries. 


alence 





The UK, along with most high-income countries, is in the fourth 
stage of the tobacco epidemic. Smoking rates in the UK have de- 
clined considerably—from the early 1970s when 45% of all adults 
were smokers to below 20% today. In particular, the UK has seen a 
marked narrowing of the gap between the prevalence of male and 
female smoking. The prevalence of smoking is highest in young 
adults (25- to 34-year olds), and lowest in adults over 60 years (al- 
though this is, in part, due to differential mortality between smokers 
and non-smokers as they age). The uptake of smoking usually oc- 
curs in mid adolescence, with two-thirds of smokers starting before 
the age of 18. Whereas smoking is nearly absent in 11-year olds, 
regular smoking (that is, at least once per week) begins to be seen 
among 15-years olds. Smoking prevalence is strongly patterned by 





socio-economic position, with the highest proportion of smokers 
being from less affluent socio-economic backgrounds. Smoking is 
therefore a strong driver of health inequalities between the rich and 
the poor today [3]. Smoking rates in the UK are comparable to most 
other high-income countries, but there are examples of high-income 
countries where smoking rates are higher (for example, Russia and 
Greece). Many low- and middle-income countries are at the second 
stage of the tobacco epidemic model and have some of the highest 
smoking rates in males today (for example, Indonesia, China, and 
Jordan), whereas smoking rates in females are much lower in these 
countries [1]. 


ituents 





Tobacco is made from the cured leaves of the tobacco plant. It is 
predominantly consumed in the form of manufactured cigarettes 
(making up the vast majority of worldwide sales), but it can also be 
chewed, inhaled, or smoked in other forms (for example, pipes or 
cigars). Burning tobacco produces further chemicals as a result of the 
combustion process; over 7000 different chemicals and compounds 
have been identified in tobacco smoke from cigarettes. Many are 
toxic, and over 70 are carcinogenic [4]. The primary psychoactive 
addictive substance in tobacco is nicotine, a nicotinic acetylcho- 
line receptor (nAChR) agonist. There is some evidence that other 
components of tobacco, or their metabolites, such as acetaldehyde, 
nornicotine, myosmine, cotinine, anabasine, and anatabine, have 
synergistic effects with nicotine or have reinforcing effects on their 
own. The investigation of the effects of these substances presents 
several methodological challenges such as disentangling the direct 
effect of the substances from the effects of nicotine and their inter- 
actions with nicotine and one another. Moreover, these substances 
usually occur in tobacco smoke in very low doses and are therefore 
difficult to model experimentally. 
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Diseases caused by tobacco smoking include various cancers, car- 
diovascular disease, and pulmonary disease, and it also has marked 
effects on prenatal development. Smoking is believed to be respon- 
sible for the majority of lung cancers and has also been linked to 
cancer of the mouth, bladder, kidney, cervix, stomach, and other 
organs. Smoking is responsible for much ischaemic heart disease 
and is a risk factor for heart attack and stroke. Two types of lung 
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Fig. 55.1 The different stages of the tobacco epidemic in men and women. 


Reproduced from Tobacco Control, 3(3), Lopez AD, Collishaw NE, Piha T, A descriptive model of the cigarette epidemic in developed countries, pp. 242-247, Copyright (1994), 


with permission from British Medical Journal. 


disease—chronic bronchitis and emphysema—are also caused 
mainly by smoking. Inhalation of tobacco smoke damages the 
airway and lung tissue, which also leaves the lungs more vulnerable 
to environmental pollutants and infections. Smoking during preg- 
nancy doubles the risk of pre-term delivery and stillbirth, and is 
linked to low birthweight, increased birth defects, and birth compli- 
cations, as well as negative long-term health outcomes of the infant 
such as asthma, breathing problems, and learning difficulties. It is 
still unclear whether tobacco exposure in the prenatal environment 
is a direct cause of these long-term outcomes. For example, exposure 
to smoking during pregnancy increases the risk of the offspring be- 
coming a smoker, but genetic predisposition or the postnatal envir- 
onment, such as sharing a household with smoking parents, could be 
responsible for these associations. Conclusive evidence on whether 
the effect of prenatal tobacco exposure on offspring smoking out- 
comes is causal or associative, or a mixture of both, requires genetic- 
ally informed investigation. 


Neurobiology and pharmacokinetics 


Routes of administration 


Tobacco can be consumed in different ways, and the route of ad- 
ministration affects its addictiveness, as well as its health risk profile. 
Most tobacco is smoked in the form of cigarettes, but it can also be 
smoked in the form of water pipes, traditional pipes, and cigars, or 
consumed as smokeless tobacco by chewing or inhaling it through 
the nose or mouth. Cigarettes are manufactured from flue-cured to- 
bacco; this lowers the pH of the saliva in the mouth, and therefore, 
nicotine cannot be absorbed through the oral mucosa, in contrast to 
tobacco used in pipes and cigars or chewing tobacco. It necessitates 


the inhalation of cigarette smoke into the lungs, in order to absorb 
nicotine at these pH levels. Nicotine levels then rise sharply in blood 
returning to the heart and quickly reach the brain. The delivery of 
nicotine via cigarette smoking has the most immediate, strong, and 
addictive effect of all tobacco delivery systems. It is also related to the 
most negative health outcomes, due to the production of carcino- 
gens through the burning of tobacco and the release of carbon mon- 
oxide, as well the wide distribution of these substances into the body 
via the lungs. Other forms of administration of nicotine, via the oral 
mucosa or nasal cavities (in the case of dry snuff), do not result in 
such sharp increases in nicotine blood levels (Fig. 55.2). However, 
forms of tobacco use other than cigarette smoking also carry health 
risks. Pipe and cigar smoking and smokeless tobacco are related to 
oral and pancreatic cancers, as well as lung and heart disease. 

Nicotine-containing products can be considered to lie on a con- 
tinuum of harm to the user where combustible cigarettes are the 
most harmful and pharmaceutical nicotine replacement prod- 
ucts the least harmful. Other products such as smokeless tobacco 
products and unregulated nicotine products such as nicotine lolli- 
pops occupy intermediate positions. Although the subject of some 
controversy, e-cigarettes are generally considered to be much less 
harmful than cigarettes, and closer in levels of harm to nicotine re- 
placement products [5]. 


Tobacco in the brain 


Nicotine binds to nAChRs, which are expressed throughout the 
brain. They are ligand-gated ion channels consisting of a and B sub- 
units, which have differential profiles in their response to nicotine. 
Their stimulation indirectly releases a variety of neurotransmitters 
[6]. The primary reinforcing effect of nicotine is ascribed to the ac- 
tivation of nAChRs in the mesolimbic pathway, specifically those 
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Fig. 55.2 Venous blood concentrations of nicotine over time for various 
nicotine delivery systems. 


Reproduced from Clinics in Office Practice, 26(3), Fant RV, Owen LL, Henningfield JE, 
Nicotine Replacement Therapy, pp. 633-652, Copyright (1999), with permission from 
W. B. Saunders Company. 


projecting from the ventral tegmental area (VTA) into the nucleus 
accumbens (NAcc) where they release dopamine (DA). nAChR sub- 
types thought to be involved in this process include a4, a6, and £2. 
The release of DA into the NAcc plays an important role in the pro- 
cessing of natural rewards and reinforcers, such as food and sex, and 
is a locus for the action of drugs of abuse like cocaine. Several studies 
using animal models have shown that nAChRs located within the 
VTA are involved in reinforcement behaviour to nicotine. Increases 
in DA after nicotine exposure have also been shown in brain imaging 
studies using human participants, although results are equivocal 
regarding the magnitude of the DA release [7]. Nicotine also pro- 
motes the release of serotonin linked to mood regulation and appe- 
tite, noradrenaline linked to arousal and appetite, and vasopressin 
linked to memory enhancement [8]. Other constituents of tobacco 
smoke also appear to play an important role in tobacco addiction. 
Much research has been conducted on the chronic effect of cigarette 
smoking on the inhibition of monoamine oxidase (MAO) A and B, 
which seems to enhance the direct effect of nicotine by leading to an 
increase of DA, noradrenaline, and serotonin in the brain [9, 10]. 


Tobacco reward and reinforcement 


Many smokers report that smoking is pleasurable and reduces nega- 
tive affect and stress, although there is conflicting evidence against 
such effects [11, 12]. Once tobacco dependence is established, the 
discontinuation of smoking causes withdrawal symptoms. Smoking 
is therefore associated with both positive reinforcement (that is, 
obtaining pleasurable consequences) and negative reinforcement 
(that is, avoiding aversive consequences). The primary reinforcing 


constituent of tobacco is nicotine; animals will preferentially self- 
administer nicotine intravenously relative to placebo [13], as will 
humans [14, 15]. However, other factors contribute to the reinfor- 
cing effects of tobacco use. In classic nicotine self-administration 
studies in animals, the delivery of nicotine is often paired with 
a neutral stimulus such as light or noise. These studies show that 
after repeated pairing, this neutral stimulus becomes as important 
as nicotine in the maintenance, extinction, and reacquisition of 
self-administration of nicotine. Similarly, in humans, smokers 
tend to associate specific cues, such as certain moods, situations, 
and environmental factors, with the positive effect of smoking (see 
Environmental drivers of relapse, p. 549). Through conditioning, 
these smoking-related cues acquire motivational properties and 
therefore increase craving for nicotine and trigger relapse. In cue re- 
activity studies, nicotine-dependent individuals exhibit increases in 
affect, heart rate, skin conductivity, and subjective craving, following 
the presentation of smoking-related stimuli [16]. Other substances 
present in tobacco smoke may also contribute to its reinforcing ac- 
tions. Much research has focused on monoamine oxidase A inhibi- 
tors, present in tobacco and tobacco smoke, which may enhance 
the rewarding effect of nicotine [9, 10]. Acetaldehydes, present in 
tobacco, also have reinforcing properties and may enhance the re- 
inforcing properties of nicotine [17]. 


Genetics of nicotine addiction 


Like other addictive habits, smoking behaviours are under a sub- 
stantial degree of genetic influence (see Chapter 49). Twin studies 
suggest most smoking-related traits, such as smoking initiation [18], 
the development of tobacco dependence [19], and cessation [20], are 
genetically influenced. However, the identification of specific gen- 
etic variants that influence smoking behaviour has been challenging. 
Candidate gene studies have met with limited success, despite genes 
encoding nicotine receptor subtypes, DA receptors, and GABA re- 
ceptors being plausible candidates [21]. Genome-wide association 
studies have identified several promising genetic variants associ- 
ated with smoking behaviour, most prominently the CHRNA5- 
CHRNA3-CHRNB4 gene cluster on chromosome 15 [22, 23]. These 
genetic variants influence consumption and titration (adjustment 
of nicotine intake according to the strength of the cigarette). ‘Thus, 
a5 knockout mice (a model for humans who carry the rs16969968 
risk allele) with delivery of self-administered nicotine were less able 
than normal mice to titrate the delivery of nicotine to a consistent, 
desired level [24]. Similarly, humans carrying the rs16969968 risk al- 
lele show a reduced aversive response to high doses of nicotine [25]. 
Therefore, deficient a5 signalling may attenuate the negative effects 
of nicotine, leading to heavier use. Nevertheless, the proportion of 
variance explained by such variants is much lower than the estimates 
of heritability indicated by twin studies. Larger sample sizes and re- 
finement of the methodology of genotyping may enable the identifi- 
cation of proper pathways in the future. 


Dependence and withdrawal 


Tobacco withdrawal syndrome 


After several hours of smoking abstinence, dependent smokers will 
experience tobacco withdrawal. The primary tobacco withdrawal 
symptoms reported by the fifth edition of Diagnostic and Statistical 


Manual of Mental Disorders (DSM-5) are irritability/anger/frustra- 
tion, anxiety, depressed mood, difficulty concentrating, increased ap- 
petite, insomnia, and restlessness [26]. Other symptoms can include 
constipation, dizziness, nausea, and disrupted cognitive function 
such as impairment of working memory [27]. Tobacco craving is also 
usually observed after smoking abstinence. As with most drugs of 
abuse, withdrawal symptoms are one of the main drivers for relapse 
and reinstatement of smoking behaviour following abstinence. They 
can persist for several months after smoking cessation, but the time 
course of withdrawal symptoms varies considerably between individ- 
uals. Increases in craving and negative affect are the most common 
symptoms after smoking cessation, and the intensity of these symp- 
toms is predictive of relapse. Furthermore, smokers who experience 
more variable symptoms have been found to be more likely to relapse. 


Environmental drivers of relapse 


Environmental stimuli associated with smoking can promote 
smoking behaviour in several ways. Firstly, a stimulus that has re- 
peatedly been paired with smoking (for example, the look, feel, 
and taste of a cigarette, a lighter, an ashtray, the effects of alcohol or 
coffee, etc.) can elicit a conditioned response and tobacco craving. 
The effect of smoking-related stimuli has been investigated experi- 
mentally with cue-exposure paradigms. Here smoking stimuli, or 
photographs or videos of these, are presented to the participant. 
Differences in outcomes, while the participant is subjected to a 
smoking stimulus (as compared to a neutral stimulus), can indicate 
which domains are influenced by smoking cues. The most reliable 
responses to smoking cues are increases in craving levels and heart 
rate [16]. Secondly, environmental stimuli related to smoking can 
become primary reinforcers in their own right. In animals, a for- 
merly extinguished nicotine self-administration behaviour can be 
reinstated when the animal is presented with a nicotine-related cue 
(in this case, a neutral light or sound that was paired with nicotine 
administration) [28]. In humans, denicotinized cigarettes lead to a 
greater reduction in the consumption of nicotine-containing cigar- 
ettes than nicotine gum [29], supporting the conclusion that nico- 
tine withdrawal is driven not only by the absence of nicotine, but 
also by the absence of smoking-related stimuli. 

‘These findings have important implications for the development of 
smoking cessation treatments—dissociating smoking cues from the 
reinforcing effect of nicotine might extinguish the learnt connection 
between nicotine administration and the smoking cue, reducing the 
risk of relapse. One approach is to substitute regular cigarettes with 
denicotinized cigarettes; once smoking does not provide any more 
nicotine reinforcement, the sensorimotor cues of smoking should 
lose their hedonic value. The reduction of withdrawal symptoms 
has been found to be even more pronounced when the adminis- 
tration of denicotinized cigarettes is combined with nicotine re- 
placement therapy (NRT). Similarly, varenicline, a nicotinic partial 
agonist, exerts its effect partly by blocking receptors that facilitate 
the rewarding experience of smoking. It is prescribed to be taken by 
smokers for a short period before a quit attempt, which may help to 
extinguish these associations. The number of cigarettes smoked per 
day decreases during the pre-cessation period, and cessation rates 
are higher than for other smoking cessation pharmacotherapies 
[30]. Smoking cues seem to vary in their impact on different groups 
of smokers; for example, females respond more strongly to smoking 
cues than males. 
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Psychological drivers of relapse 


Psychological factors, such as stress and negative affect, are often 
cited by smokers as reasons for smoking and for relapse during a 
cessation attempt. Smokers often attribute the stress- and anxiety- 
reducing properties of cigarettes as one of the main drivers for 
smoking and often attribute relapse to acute stress and negative af- 
fect. An alternative explanation is that this perceived relief of stress 
and negative affect through smoking is actually a misattribution of 
withdrawal relief [31]. In recent years, ecological momentary assess- 
ment (EMA) studies have advanced our understanding of the role of 
negative affect and stress on acute smoking behaviour. Participants 
are provided with a device (for example, a smartphone-app) that 
allows them to provide information about craving, mood, or stress 
levels in real time in their everyday environment. These studies have 
not found a systematic relationship between negative states and 
acute ‘regular’ smoking behaviour. In smokers attempting to quit, 
on the other hand, acute negative affect, as well as stress, has been 
found to play an important role in smoking lapses [32]. Rapid in- 
creases in negative affect precede lapse episodes, and lapses that are 
triggered by stress progress more quickly to another lapse. However, 
studies investigating stress levels and negative affect among suc- 
cessful quitters have found that, even though stress initially in- 
creases after quitting, in the long run, stress levels decrease even 
below pre-cessation levels [33]. This could mean that smoking, in 
fact, increases stress-levels, which are then reduced back to normal 
after quitting, although a recent Mendelian randomization meta- 
analysis did not find any evidence that smoking heaviness causally 
influenced anxiety or depression [33]. The results may be driven by 
other factors, for example the adaptation of a healthier lifestyle in 
successful quitters. The presence of clinically significant negative af- 
fect has also been related to poor cessation success [34], and several 
studies found that a history of depression increases the risk for re- 
lapse [35]. Smokers diagnosed with multiple episodes of depression 
(rather than one) may be at higher risk for relapse [36]. The causal 
relationship between smoking and depression (and other mental 
health issues) remains unclear (see Smoking in psychiatric popula- 
tions, pp.550-1). 


Cognition and relapse 


Many smokers report perceived cognitive benefits from smoking, 
and cognitive impairment during abstinence. However, it is unclear 
whether smoking results in genuine cognitive benefits, rather than 
simply relieving withdrawal symptoms that include impaired cog- 
nition. Laboratory research on tobacco addiction and cognition has 
focused on two issues: (1) the potentially positive effect of tobacco 
smoking (and specifically nicotine) on cognitive performance; and 
(2) the negative effect of tobacco withdrawal on cognitive perform- 
ance. A meta-analysis on the effect of cigarette smoking and/or nico- 
tine administration in non-smokers and non-deprived smokers on 
cognitive performance found an improvement on fine motor abil- 
ities and certain measures of attention and working memory [37]. 
Much of the interest in the cognition-enhancing effect of nicotine 
has been fuelled by its potential as a treatment for cognitive dysfunc- 
tion; early research suggested benefits from nAChR agonist treat- 
ment for Alzheimer’s disease, schizophrenia, and attention-deficit/ 
hyperactivity disorder (ADHD) but have not translated into clin- 
ical practice. The effect of withdrawal on cognitive performance is 
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usually investigated using an acute abstinence paradigm, in which 
cognitive task performance is compared between smokers who are 
subjected to a period of acute abstinence (of 8 hours or more) and 
smokers who smoke as per usual. A recent meta-analysis of these 
tasks found that abstinent smokers showed higher delay discounting 
of monetary rewards, lower response inhibition, and impaired 
arithmetic and recognition memory performance, compared with 
non-abstinent smokers [38]. Cognitive performance tasks that are 
sensitive to withdrawal in smokers might be useful in assessing the 
efficacy of novel treatments to ameliorate withdrawal symptoms. 
The advantage of objective cognitive performance task outcomes, 
such as reaction times or error rates, over self-report is that they 
are less biased. Nevertheless, the results of cognitive performance 
task studies, both investigating cognitive-enhancing effects of nico- 
tine, as well as the effect of nicotine abstinence on task performance, 
should be interpreted with caution. The tasks used to assess cogni- 
tive performance are usually quite heterogenous, even if indicated 
for the same cognitive domain. This is even true for the same tasks, 
as standardized guidelines on presentation times, stimuli, selection 
of participants, etc., are currently lacking. 


Smoking in psychiatric populations 





Schizophrenia 


Even though smoking rates in the general population are declining, 
they remain high in people with mental illness. In particular, 
among people with schizophrenia, the prevalence of smoking is 
substantially higher than in the general population [39]. Smokers 
with schizophrenia are also more dependent, smoke more heavily, 
and have greater difficulties quitting than smokers from the general 
population [40, 41]. There are several hypotheses for why smoking is 
so common in patients with schizophrenia. The self-medication hy- 
pothesis suggests that smoking might ameliorate some of the nega- 
tive symptoms of schizophrenia [42] and cognitive deficits common 
in schizophrenia such as reduced visuo-spatial working memory. 
Smoking could also counteract the effects of dopamine antagonist 
medications; for example, smoking rates increase after smokers with 
schizophrenia begin treatment with haloperidol. However, there is 
also emerging evidence that smoking could be an aetiological risk 
factor for schizophrenia [43]. Smoking usually predates the onset 
of psychotic symptoms, and cohort studies have found that there 
is a dose-response relationship between smoking and the onset of 
schizophrenia, so that people who smoke more heavily have a higher 
risk of developing schizophrenia [44, 45]. A recent genome-wide 
association study identified a variant in the nAChR gene cluster 
CHRNA5-A3-B4 (associated with smoking intensity) to be also 
associated with schizophrenia [46]. This association could be in- 
terpreted in two ways—smoking and schizophrenia might be influ- 
enced by this genetic variant independently, or this genetic variant 
may reflect a causal link between smoking and schizophrenia. 


Depression and anxiety 


Depression and anxiety are both associated with higher levels of 
smoking, and smokers are also more likely to receive a diagnosis 
of clinical depression or anxiety. Smoking rates among adults diag- 
nosed with major depressive disorder (MDD) are higher than in the 
general population, while greater nicotine dependence is associated 


with higher rates of a diagnosis of MDD [47]. Furthermore, at- 
tempts to quit smoking are more likely to fail in smokers with MDD 
than without [36]. In anxiety disorders, comorbidity estimates vary 
strongly with regard to the specific disorder investigated, the sample 
under investigation, and the definition of smoking behaviour. One 
study found a high prevalence of smoking in people with social 
phobia and panic disorders, whereas it was markedly lower than in 
the general population among people with obsessive-compulsive 
disorder [48]. Several hypotheses have been proposed regarding the 
nature of the relationship between smoking and depression/anxiety. 
The self-medication hypothesis suggests that depressed or anxious 
people smoke in order to manage their symptoms, so that symp- 
toms lead to smoking, including smoking initiation and subsequent 
heaviness of use [49]. However, the misattribution hypothesis sug- 
gests that the relief of stress and negative affect attributed to smoking 
is actually due to relief of withdrawal symptoms, rather than the re- 
lief of the underlying emotional psychopathology [31]. 

Smoking could be a risk factor for the onset of depression, for ex- 
ample by influencing a person's response to environmental stressors 
via its effect on the pituitary system, as has been suggested in animal 
models. Evidence against this hypothesis was cited previously [33]. 
It is also possible that a bi-directional relationship between smoking 
and anxiety or depression may exist, whereby smoking initiation 
might be motivated by the desire to manage symptoms, but chronic 
smoking might negatively influence outcomes. This is supported by 
two meta-analyses, one of which found that mental health symptoms 
generally improve after smoking cessation [50], and one that con- 
cluded that compelling evidence for either hypothesis is currently 
lacking [51]. Finally, shared genetic influences on both smoking 
levels and anxiety and depression could also be responsible for the 
high comorbidity of smoking and depression [52]. 


Other mental health problems 


Higher rates of smoking than in the general population have been 
recorded in people with several other mental health problems such 
as ADHD, conduct disorder, bipolar disorder (BP), and eating dis- 
orders [53]. In ADHD, the self-medication hypothesis has been sug- 
gested as a potential reason for smoking uptake. As dopaminergic 
function is altered in ADHD, this might be related to higher ini- 
tial rewarding effects of nicotine, which could lead to higher rates 
of smoking uptake. Nicotine has also been shown to improve some 
of the typical deficits in executive function seen in ADHD [54]. 
Epidemiological research has focused on the consistent association 
between maternal smoking during pregnancy and ADHD, as well as 
conduct disorder (which shows comorbidity with ADHD). Whether 
smoking during pregnancy is a risk factor for later development of 
these disorders in offspring or whether a common environment 
or common genetic variants drive this relationship is unclear [55]. 
A high rate of maternal smoking in people diagnosed with ADHD 
and conduct disorder could explain the high rates of smoking in 
later life in these populations. High rates of smoking have been 
found in people diagnosed with BP. It has been proposed that the 
mechanisms are similar to those underlying depression, such as 
mood dysregulation and heavy dependence, which lead to difficul- 
ties in quitting, but comparatively little research has been done in 
BP patients yet [56]. The relationship between smoking and eating 
disorders has typically been investigated in the context of smoking 
as an appetite and weight control method. Smoking rates are higher 


in people with eating disorders [57] and concerns about weight gain 
following cessation are cited by many smokers as a major reason for 
continued smoking. 


Smoking in mental health service settings 


As smoking is the greatest single contributor to the 10 to 20 year 
reduced life expectancy among people with severe mental illness, 
special attention should be given to smoking cessation in patients 
in mental health settings [58]. In the UK, mental health units have 
been required by law to ban smoking in all enclosed spaces since July 
2008. A survey in 2014 in medium- and low-secure mental health 
facilities showed that 83% prohibited smoking within buildings but 
allowed it within gardens and outside of the secure perimeter such as 
hospital grounds. A complete ban on smoking in all hospital build- 
ings and grounds has been recommended but is currently enforced 
by only 9% of facilities [59]. However, implementing smoking bans 
in mental health service settings brings specific challenges. There is 
often a long-standing culture of smoking among both patients and 
staff; cigarettes may be used as rewards for patients for meeting 
therapeutic goals, and smoking together may help staff and patients 
to create therapeutic relationships. Other potential barriers to the 
introduction of smoking bans may include increased aggression and 
stress in patients, weight gain in patients, unpredictable interactions 
of smoking cessation with psychotic medication, security issues (for 
example, due to patients smoking in their bedroom), and increased 
costs for implementation of smoking-free facilities. Several of these 
concerns have been reviewed in recent years and could either be al- 
leviated or are unfounded. For example, disruptive behaviour and 
verbal aggression decrease after the implementation of smoking 
bans in many cases, while stress and anxiety levels have been found 
to decrease after smoking cessation in many studies and may only 
be a concern in the short term (see Dependence and withdrawal, 
pp. 548-9). Smoking cessation often leads to improved efficacy of 
psychotropic medication, as smoking increases the metabolism of 
many drugs [60]. The dose should be carefully monitored during a 
smoking cessation attempt, but a reduction of antipsychotic medica- 
tion is typically seen as a positive outcome, as it may lead to a reduc- 
tion of negative side effects. 





Interventions: treatment and prevention 


Pharmacological treatments 


Several pharmacological treatments to support smoking cessation 
are available today. The most widely used are NRT, varenicline, and 
bupropion, which are available as first-line treatments in many coun- 
tries, including the UK. In the UK, NRT is available over the counter 
and in several formulations. The most commonly used method is the 
transdermal patch, which is available in several doses and strengths 
and allows nicotine to be absorbed slowly through the skin. Other 
forms of NRT include chewing gum, lozenges, sublingual tablets, 
sprays, and inhalers. With these, nicotine is absorbed via the nasal 
or oral mucosa; the rate of nicotine absorption is more rapid than 
with transdermal patches, but not as rapid as when smoking a cig- 
arette. The aim of NRT is to provide nicotine during smoking cessa- 
tion attempts in order to ameliorate physiological and psychological 
withdrawal symptoms and the motivation to smoke. The combin- 
ation of different methods of NRT, such as patch and nasal spray 
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or patch and gum, may be more effective than using any of these 
methods alone [61]. Varenicline is a selective nicotinic receptor par- 
tial agonist, licensed as a prescription-only treatment. It binds to 
the a482-nAChR, which is responsible for mediating the reinfor- 
cing properties of nicotine. This alleviates craving and withdrawal 
during smoking cessation. During nicotine exposure, the receptor 
occupancy of varenicline blocks reinforcing effects of smoking [62]. 
Since varenicline is prescribed to be taken for 2 weeks prior to a 
quit attempt, during which the smoker continues to smoke as usual, 
this may lead to extinction of the conditioned cues associated with 
smoking and may contribute, in part, to its superior efficacy relative 
to NRT. Bupropion was originally developed as a drug for depres- 
sion and is now licensed in sustained-release formulation for use in 
smoking cessation. Its antidepressant properties may make bupro- 
pion helpful for smoking cessation if nicotine withdrawal produces 
depressive symptoms. Nicotine itself may also have antidepressant 
properties, which are substituted for by bupropion [63]. The efficacy 
of varenicline for smoking cessation is somewhat greater than that of 
NRT or bupropion, while NRT and bupropion have similar efficacy 
[61]. However, the absolute quit rates achieved remain low—only 
about 20% of smokers using any of these methods are able to quit 
successfully and remain abstinent [64]. There are many reasons for 
this such as the importance of smoking cues and the sensorimotor 
stimulation in maintaining smoking behaviour (see Environmental 
drivers of relapse, p. 549), which are both lacking in pharmaco- 
logical treatment. 


E-cigarettes 


Electronic cigarettes, or ‘e-cigarettes, are handheld electronic de- 
vices that heat a liquid (‘e-liquid’) to create an aerosol, which the user 
inhales (‘vaping’). The e-liquid usually comprises propylene glycol 
and flavourings, with or without nicotine; there is wide variation in 
the exact composition of the fluid and in the different models of e- 
cigarettes available. The common features include a cartridge, which 
holds the e-liquid, a heating device, and a power source. Some e- 
cigarettes are manufactured for one time-use only, while others are 
refillable. 

Currently, there are considerable differences between countries 
in the regulation of the sale of e-cigarettes; their sale is completely 
banned in some countries. In Europe, new regulations concerning 
e-cigarettes have been introduced into the European Tobacco 
Products Directive (TPD). These new regulations came into force 
in May 2016, and all products now have to be fully compliant 
with the TPD since May 2017. The major changes in the regula- 
tion of e-cigarette products were as follows: (1) new products and 
changes to existing products must be notified to the Medicines and 
Healthcare products Regulatory Agency (MHRA); (2) informa- 
tion on the content and safety of the products must be provided to 
the MHRA; (3) products will have to be child-safe, so that they do 
not leak or break easily; (4) newspapers, radio, and TV advertise- 
ment for e-cigarette products are now banned (but not billboard 
and poster advertisement); and (5) e-liquids containing more than 
20 mg of nicotine per millilitre need a medical licence authorized 
by the MHRA. 

Since their widespread introduction in 2006, the use of e-cigarettes 
has grown rapidly and many smokers report successfully using 
e-cigarettes as an alternative form of nicotine delivery. There are 
some reasons to believe that they will be superior to other smoking 
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cessation methods, as they give the smoker a sensory and behavioural 
experience, with rapid delivery of nicotine to the brain, which re- 
sembles that achieved from a conventional cigarette. Experienced 
users of second- and third-generation e-cigarettes have been found to 
achieve nicotine levels very similar to those achieved through regular 
cigarette use. A recent Cochrane review concluded that e-cigarettes 
containing nicotine are effective for smoking cessation [65], anda re- 
cent randomized controlled trial found that e-cigarettes were more 
effective for smoking cessation than nicotine-replacement therapy, 
when both products were accompanied by behavioural support [65a]. 

At the same time, there has been controversy surrounding e- 
cigarettes. One concern has been the potential uptake of e-cigarettes 
by non-smokers, particularly children, and their potential as a 
gateway to smoking conventional cigarettes. Current evidence sug- 
gests that while children and young people are experimenting with 
e-cigarettes, this is rare in never-smokers. Thus, there is little evi- 
dence that vaping will readily progress to smoking. The long-term 
health consequences of using e-cigarettes are also unknown, so de- 
finitive statements about the toxicity of e-cigarettes are currently im- 
possible. The range of products and liquids available and the length 
of time most tobacco-related diseases take to become apparent are 
problematic. However, the models of e-cigarettes tested to date have 
been found to provide the users with substantially lower levels of 
potentially harmful chemicals than conventional cigarettes [5, 66]. 
A recent report of the Royal College of Physicians concluded that the 
using risk of e-cigarettes ‘is unlikely to exceed 5% of the harm from 
smoking tobacco’ [67]. The risk of second-hand exposure is likely to 
be correspondingly low as well. 


Behavioural treatments 


Behavioural treatments also play a role in smoking cessation; the 
combination of pharmacological and behavioural treatments is 
more effective than any one treatment method alone [68, 69]. There 
is evidence for the efficacy of low-level interventions such as pro- 
viding smokers with print-based self-help materials about cessation 
[70] and telephone support [71]. However, more intensive counsel- 
ling and behavioural therapies are generally superior to these low- 
level interventions [72-74]. Motivational interviewing (MI) is a 
client-centred counselling style, with the aim of eliciting behaviour 
change by exploring and resolving ambivalence. The MI approach 
was originally developed for use in substance abuse and has since 
been applied specifically to smoking cessation. A recent review on 
the current evidence found that MI is superior to brief advice and 
usual care when provided by a general practitioner. Interestingly, 
shorter sessions were found to be more effective than longer ones 
[75]. Other methods, such as mindfulness training and medita- 
tion, have claimed positive results, but the evidence is currently 
preliminary. A positive effect of exercise interventions on smoking 
cessation has also been reported in several studies, but more sub- 
stantial evidence of positive long-term effects is still needed [76]. 
Determining the ideal intervention in terms of content and intensity 
for each smoking population is challenging due to the difficulty of 
identifying the specific aspects that make an intervention effective 
and the specific needs of different smoking populations. 


Prevention and policy 


Tobacco control policies have contributed to the fall in smoking 
prevalence since the post-war period in many high-income 


countries. Within Europe, the strength of tobacco control pol- 
icies varies across countries and is considered strongest in the UK. 
Specific policies that have been found to be effective in decreasing 
smoking rates include taxation, bans on smoking in public places, 
bans on the advertising of tobacco products, health warnings on 
tobacco products, and plain (or standardized) packaging. The re- 
tail price of tobacco products can be increased by governments by 
raising taxes on them. The amount to which consumers will con- 
tinue or cease to buy a product depending on its price is referred 
to as ‘price elasticity. Compared to many other consumer products, 
the demand for tobacco products is only moderately elastic. A meta- 
analysis estimated the average price elasticity of demand for tobacco 
products at about -0.48, which is consistent with other findings 
[77, 78]. This means that a 10% increase in price will be followed by 
a decrease in consumption of about 4.8%. Nevertheless, as increases 
in tobacco taxes result in higher prices of tobacco for a whole popu- 
lation, even small reductions have a big impact. Increasing tobacco 
taxes has, in fact, been found to be the most effective way to decrease 
consumption in a population [79, 80]. Currently, 77% of the price 
of a packet of cigarettes account for taxes in the UK, and taxes have 
increased by 30% over the last 10 years. 

Banning smoking in public enclosed places has several obvious 
benefits. Apart from the fact that non-smokers are not subjected to 
second-hand smoke, it also decreases opportunities for smokers to 
smoke, thus supporting smoking cessation. It also makes smoking less 
socially acceptable, which, in turn, reduces smoking and exposure to 
tobacco smoke. The banning of advertisement of tobacco products is 
another important step in order to decrease smoking rates, specifically 
in teenagers and young people. In order to be effective, an advertise- 
ment ban has to be comprehensive and include all types of media and 
outdoor advertisement. In the UK, all forms of tobacco advertising and 
promotion are banned since the Tobacco Advertising and Promotion 
Act 2002. This now includes the display of tobacco products at the 
point of sale since 2015 and the implementation of standardized plain 
packaging for all tobacco products since May 2017, which requires the 
removal of all branding on any tobacco product. Finally, health warn- 
ings on tobacco packaging, as well as in media campaigns, have been 
found to increase health awareness in the population and to change 
the image of smoking. The last point is crucial in order to prevent 
young people from taking up smoking. In the UK, picture warnings 
on tobacco products have been implemented in 2008. 


FURTHER READING 


Caponnetto P, Polosa R, Russo C, Leotta C, Campagna D. Successful 
smoking cessation with electronic cigarettes in smokers with a 
documented history of recurring relapses: a case series. Journal of 
Medical Case Reports. 2011;5:585. 

Fant RV, Owen LL, Jack E. Henningfield JE. Nicotine replacement 
therapy. Clinics in Office Practice. 1999;26:633-52. 


REFERENCES 


1. World Health Organization. WHO global report on trends in 
prevalence of tobacco smoking 2018: World Health Organization, 
Geneva; 2018. 

2. Lopez AD, Collishaw NE, Piha T. A descriptive model of the cigar- 
ette epidemic in developed countries. Tobacco Control. 1994;3:242. 


10. 


Uk 


IP, 


13: 


14. 


15. 


16. 


Ws 


18. 


19. 


20. 


21 


227 


237 


. Office for National Statistics. General lifestyle survey. The 


Stationary Office, London; 2013. 


. US Food and Drug Administration. Harmful and potentially 


harmful constituents in tobacco products and tobacco smoke; 
established list. Federal Register. 2012;77:20034-7. 


. McNeill A, Brose L, Calder R, Hitchman S, Hajek P, McRobbie H. 


E-cigarettes: an evidence update: a report commissioned by Public 
Health England. Public Health England, London; 2015. 


. Court JA, Martin-Ruiz C, Graham A, Perry E. Nicotinic receptors 


in human brain: topography and pathology. Journal of Chemical 
Neuroanatomy. 2000;20:281-98. 


. Brody AL. Functional brain imaging of tobacco use and depend- 


ence. Journal of Psychiatric Research. 2006;40:404-18. 


. Benowitz NL. Nicotine addiction. New England Journal of 


Medicine. 2010;362:2295-303. 


. Berlin I, Anthenelli RM. Monoamine oxidases and tobacco 


smoking. International Journal of Neuropsychopharmacology. 
2001;4:33-42. 

Fowler JS, Logan J, Wang GJ, Volkow ND. Monoamine oxidase 
and cigarette smoking. Neurotoxicology. 2003;24:75-82. 
Shiffman S, Paty JA, Gwaltney CJ, Dang Q. Immediate ante- 
cedents of cigarette smoking: an analysis of unrestricted 
smoking patterns. Journal of Abnormal Psychology. 
2004;113:166-71. 

Shiffman S, Gwaltney CJ, Balabanis MH, et al. Immediate 
antecedents of cigarette smoking: an analysis from ecological 
momentary assessment. Journal of Abnormal Psychology. 
2002;111:531. 

Donny EC, Caggiula AR, Knopf S, Brown C. Nicotine 
self-administration in rats. Psychopharmacology. 
1995;122:390-4. 

Harvey DM, Yasar S, Heishman SJ, Panlilio LV, Henningfield 

JE, Goldberg SR. Nicotine serves as an effective reinforcer of 
intravenous drug-taking behavior in human cigarette smokers. 
Psychopharmacology. 2004;175:134-42. 

Henningfield JE, Goldberg SR. Nicotine as a reinforcer in human 
subjects and laboratory animals. Pharmacology Biochemistry and 
Behavior. 1983;19:989-92. 

Carter BL, Tiffany ST. Meta-analysis of cue-reactivity in addic- 
tion research. Addiction. 1999;94:327-40. 

Talhout R, Opperhuizen A, van Amsterdam JG. Role 

of acetaldehyde in tobacco smoke addiction. European 
Neuropsychopharmacology. 2007;17:627-36. 

Heath AC, Cates R, Martin NG, et al. Genetic contribution to 
risk of smoking initiation: comparisons across birth cohorts and 
across cultures. Journal of Substance Abuse. 1993;5:221-46. 
Maes HH, Sullivan PE, Bulik CM, et al. A twin study of genetic 
and environmental influences on tobacco initiation, regular 
tobacco use and nicotine dependence. Psychological Medicine. 
2004;34:1251-61. 

Carmelli D, Swan GE, Robinette D, Fabsitz R. Genetic influence 
on smoking—a study of male twins. New England Journal of 
Medicine. 1992;327:829-33. 

Ho MK, Tyndale RF. Overview of the pharmacogenomics of cig- 
arette smoking. Pharmacogenomics Journal. 2007;7:81-98. 
Saccone SF, Hinrichs AL, Saccone NL, et al. Cholinergic nicotinic 
receptor genes implicated in a nicotine dependence association 
study targeting 348 candidate genes with 3713 SNPs. Human 
Molecular Genetics. 2007;16:36-49. 

Thorgeirsson TE, Geller F, Sulem P, et al. A variant associated 
with nicotine dependence, lung cancer and peripheral arterial 
disease. Nature. 2008;452:638-42. 


24. 


25: 


26. 


2 


28. 


29; 


30. 


Sie 


92: 


B3 


34. 


35; 


36. 


Mio 


38. 


39) 


40. 


41. 


42. 


CHAPTER 55 Tobacco addiction 


Fowler CD, Lu Q, Johnson PM, Marks MJ, Kenny PJ. Habenular 
[agr] 5 nicotinic receptor subunit signalling controls nicotine 
intake. Nature. 2011;471:597-601. 

Jensen KP, Devito EE, Herman AI, Valentine GW, Gelernter J, 
Sofuoglu M. A CHRNA5 smoking risk variant decreases the 
aversive effects of nicotine in humans. Neuropsychopharmacology. 
2015;40:2813-21. 

American Psychiatric Association. Diagnostic and Statistical 
Manual of Mental Disorders, fifth edition. American Psychiatric 
Association, Washington, DC; 2013. 

Heishman SJ, Kleykamp BA, Singleton EG. Meta-analysis of the 
acute effects of nicotine and smoking on human performance. 
Psychopharmacology. 2010;210:453-69. 

Le Foll B, Goldberg SR. Control of the reinforcing effects of nico- 
tine by associated environmental stimuli in animals and humans. 
Trends in Pharmacological Sciences. 2005;26:287-93. 

Johnson MW, Bickel WK, Kirshenbaum AP. Substitutes for to- 
bacco smoking: a behavioral economic analysis of nicotine gum, 
denicotinized cigarettes, and nicotine-containing cigarettes. Drug 
and Alcohol Dependence. 2004;74:253-64. 

Hajek P, McRobbie HJ, Myers KE, Stapleton J, Dhanji A-R. Use of 
varenicline for 4 weeks before quitting smoking: decrease in ad 
lib smoking and increase in smoking cessation rates. Archives of 
Internal Medicine. 2011;171:770-7. 

DiFranza JR, Wellman RJ. A sensitization-homeostasis model of 
nicotine craving, withdrawal, and tolerance: integrating the clin- 
ical and basic science literature. Nicotine and Tobacco Research. 
2005;7:9-26. 

Shiffman S, Paty JA, Gnys M, Kassel JA, Hickcox M. First lapses 
to smoking: within-subjects analysis of real-time reports. Journal 
of Consulting and Clinical Psychology. 1996;64:366-79. 

Taylor AE, Fluharty ME, Bjorngaard JH, et al. Investigating the 
possible causal association of smoking with depression and anx- 
iety using Mendelian randomisation meta-analysis: the CARTA 
consortium. BMJ Open. 2014;4:e006141. 

Glassman AH, Helzer JE, Covey LS, et al. Smoking, smoking ces- 
sation, and major depression. JAMA. 1990;264:1546-9. 

Hitsman B, Papandonatos GD, McChargue DE, et al. Past major 
depression and smoking cessation outcome: a systematic review 
and meta-analysis update. Addiction. 2013;108:294-306. 
Hitsman B, Borrelli B, McChargue D, Spring B, Niaura R. History 
of depression and smoking cessation outcome: a meta-analysis. 
Journal of Consulting and Clinical Psychology. 2003;71:657. 
Heishman SJ, Kleykamp BA, Singleton EG. Meta-analysis of the 
acute effects of nicotine and smoking on human performance. 
Psychopharmacology. 2010;210:453-69. 

Grabski M, Curran HV, Nutt DJ, et al. Behavioral tasks sensi- 
tive to acute abstinence and predictive of smoking cessation 
success: a systematic review and meta-analysis. Addiction. 
2016;111:2134-44. 

de Leon J, Diaz FJ. A meta-analysis of worldwide studies dem- 
onstrates an association between schizophrenia and tobacco 
smoking behaviors. Schizophrenia Research. 2005;76:135-57. 
Olincy A, Young DA, Freedman R. Increased levels of the nico- 
tine metabolite cotinine in schizophrenic smokers compared to 
other smokers. Biological Psychiatry. 1997;42:1-5. 

Salokangas RK, Honkonen T, Stengard E, Koivisto A-M, Hietala 
J. Cigarette smoking in long-term schizophrenia. European 
Psychiatry. 2006;21:219-23. 

Kumari V, Postma P. Nicotine use in schizophrenia: the self 
medication hypotheses. Neuroscience and Biobehavioral Reviews. 
2005;29:1021-34. 





554 





SECTION 8 Substance use disorders 


43. 


44. 


45. 


46. 


47. 


48. 


49. 


50. 


pie 


52 


53. 


54. 


55. 


56. 


DI 


58. 


Bye), 


60. 


61. 


62. 


Gage SH, Munafo MR. Rethinking the association between 
smoking and schizophrenia. The Lancet Psychiatry. 2015;2:118. 
Weiser M, Reichenberg A, Grotto I, et al. Higher rates of cig- 
arette smoking in male adolescents before the onset of schizo- 
phrenia: a historical-prospective cohort study. American Journal 
of Psychiatry. 2003;161:1219-23. 

Kendler KS, Lönn SL, Sundquist J, Sundquist K. Smoking and 
schizophrenia in population cohorts of Swedish women and 
men: a prospective co-relative control study. American Journal of 
Psychiatry. 2015;172:1092-100. 

Ripke S, Neale BM, Corvin A, et al. Biological insights from 108 
schizophrenia-associated genetic loci. Nature. 2014;511:421. 
Dierker LC, Avenevoli S, Merikangas KR, Flaherty BP, Stolar M. 
Association between psychiatric disorders and the progression of 
tobacco use behaviors. Journal of the American Academy of Child 
and Adolescent Psychiatry. 2001;40:1159-67. 

Himle J, Thyer BA, Fischer DJ. Prevalence of smoking among anx- 
ious 

outpatients. Phobia Practice and Research Journal. 1988;1:25-31. 
Chaiton MO, Cohen JE, O'Loughlin J, Rehm J. A systematic 
review of longitudinal studies on the association between 
depression and smoking in adolescents. BMC Public Health. 
2009;9:1. 

Taylor G, McNeill A, Girling A, Farley A, Lindson-Hawley N, 
Aveyard P. Change in mental health after smoking cessation: sys- 
tematic review and meta-analysis. BMJ. 2014;348:g1151. 
Fluharty M, Taylor AE, Grabski M, Munafo MR. The association 
of cigarette smoking with depression and anxiety: a systematic 
review. Nicotine and Tobacco Research. 2016:ntw140. 

Kendler KS, Neale MC, MacLean CJ, Heath AC, Eaves LJ, Kessler 
RC. Smoking and major depression: a causal analysis. Archives of 
General Psychiatry. 1993;50:36-43. 

Lasser K, Boyd JW, Woolhandler S, Himmelstein DU, 
McCormick D, Bor DH. Smoking and mental illness: a 
population-based prevalence study. JAMA. 2000;284:2606-10. 
McClernon FJ, Kollins SH. ADHD and smoking: from genes to 
brain to behavior. Annals of the New York Academy of Sciences. 
2008;1141:131-47. 

Thapar A, Fowler T, Rice F, et al. Maternal smoking during preg- 
nancy and attention deficit hyperactivity disorder symptoms in 
offspring. American Journal of Psychiatry. 2003;160:1985-9. 
Heffner JL, Strawn JR, DelBello MP, Strakowski SM, Anthenelli 
RM. The co-occurrence of cigarette smoking and bipolar dis- 
order: phenomenology and treatment considerations. Bipolar 
Disorders. 2011;13:439-53. 

Anzengruber D, Klump KL, Thornton L, et al. Smoking in eating 
disorders. Eating Behaviors. 2006;7:291-9. 

Brown S, Kim M, Mitchell C, Inskip H. Twenty-five year mor- 
tality of a community cohort with schizophrenia. British Journal 
of Psychiatry. 2010;196:116-21. 

Public Health England. Smoking cessation in secure mental health 
settings: guidance for commissioners. Public Health England, 
London; 2015. 

Schein JR. Cigarette smoking and clinically significant drug 
interactions. Annals of Pharmacotherapy. 1995;29:1139-48. 
Cahill K, Stevens S, Perera R, Lancaster T. Pharmacological 
interventions for smoking cessation: an overview and net- 
work meta-analysis. Cochrane Database of Systematic Reviews. 
2013;CD009329. 

Jorenby DE, Hays JT, Rigotti NA, et al. Efficacy of varenicline, an 
a4ß2 nicotinic acetylcholine receptor partial agonist, vs placebo 


63. 


64. 


65a. 


66. 


67. 


68. 


69. 


70. 


mle 


72 


Wa 


74. 


WS 


76. 


Vide 


78. 


19: 


80. 


or sustained-release bupropion for smoking cessation: a random- 
ized controlled trial. JAMA. 2006;296:56-63. 

Hurt RD, Sachs DP, Glover ED, et al. A comparison of sustained- 
release bupropion and placebo for smoking cessation. New 
England Journal of Medicine. 1997;337:1195-202. 

Schnoll RA, Lerman C. Current and emerging 
pharmacotherapies for treating tobacco dependence. Expert 
Opinion on Emerging Drugs. 2006;11:429-44. 


. Hartmann-Boyce J, McRobbie H, Bullen C, Begh R, Stead LF, 


Hajek P. Electronic cigarettes for smoking cessation. Cochrane 
Database of Systematic Reviews. 2016;CD010216. 

Hajek P, Phillips-Waller A, Przulj D, etal. A randomized trial of 
e-cigarettes versus nicotine-replacement therapy. N Engl J Med. 
2019;380(7):629-37. 

Hajek P, Etter JF, Benowitz N, Eissenberg T, McRobbie H. 
Electronic cigarettes: review of use, content, safety, effects 

on smokers and potential for harm and benefit. Addiction. 
2014;109:1801-10. 

Royal College of Physicians Tobacco Advisory Group. Nicotine 
without smoke-tobacco harm reduction. Royal College of 
Physicians, London; 2016. 

Stead LF, Koilpillai P, Fanshawe TR, Lancaster T. Combined 
pharmacotherapy and behavioural interventions for 

smoking cessation. Cochrane Database of Systematic Reviews. 
2016;3:CD008286. 

Stead LF, Koilpillai P, Lancaster T. Additional behavioural sup- 
port as an adjunct to pharmacotherapy for smoking cessation. 
Cochrane Database of Systematic Reviews. 2015;10:CD009670. 
Hartmann-Boyce J, Lancaster T, Stead LF. Print-based self-help 
interventions for smoking cessation. Cochrane Database of 
Systematic Reviews. 2014;6:CD001118. 

Whittaker R, McRobbie H, Bullen C, Borland R, Rodgers A, Gu 
Y. Mobile phone-based interventions for smoking cessation. 
Cochrane Database of Systematic Reviews. 2016;4:CD006611. 
Fiore M, Jaen CR, Baker T, et al. Treating tobacco use and de- 
pendence: 2008 update. US Department of Health and Human 
Services, Rockville, MD; 2008. 

Fiore MC, Bailey WC, Cohen SJ, et al. Treating tobacco use and 
dependence: clinical practice guideline. US Department of Health 
and Human Services, Rockville, MD; 2000. 

Lancaster T, Stead LF. Individual behavioural counselling for 
smoking cessation. Cochrane Database of Systematic Reviews. 
2005;2:CD001292. 

Lindson-Hawley N, Thompson TP, Begh R. Motivational 
interviewing for smoking cessation. Cochrane Database of 
Systematic Reviews. 2015;3:CD006936. 

Ussher MH, Taylor AH, Faulkner GEJ. Exercise interventions 
for smoking cessation. Cochrane Database of Systematic Reviews. 
2014;8:CD002295. 

Forster M, Jones AM. The role of tobacco taxes in starting and 
quitting smoking: duration analysis of British data. Journal 

of the Royal Statistical Society: Series A (Statistics in Society). 
2001;164:517-47. 

Gallet CA, List JA. Cigarette demand: a meta-analysis of elastici- 
ties. Health Economics. 2003;12:821-35. 

Jha P, Peto R. Global effects of smoking, of quitting, and of taxing 
tobacco. New England Journal of Medicine. 2014;370:60-8. 
Chaloupka FJ, Hu T-W, Warner KE, Jacobs R, Yurekli A. The tax- 
ation of tobacco products. In: Jha P, Chaloupka F (eds). Tobacco 
Control in Developing Countries. Oxford University Press, Oxford; 
2000. pp. 237-72. 


aorbidity of substance use 


d psychiatric disorders 


M.A. Sinclair and Anne Lingford-Hughes 
































For any individual, it is likely that a number of factors contribute to 
the comorbidity of psychiatric and substance use disorder (SUD). 
These may include genetics, family history, environmental factors 
(for example, stress, childhood experiences, current life events), 
and personality traits (for example, impulsivity). Such vulnerability 
factors may be common to both the psychiatric disorder and the 
SUD, rather than one being a consequence of the other (Fig. 56.1). 
For instance, higher levels of impulsivity are seen in alcohol use dis- 
order (AUD) and bipolar disorder and likely contribute to their high 
comorbidity [1]. The same symptom may also be either a cause or 
a consequence; for instance, depression or anxiety may increase the 
likelihood of substance use, which then is likely to exacerbate the 
mental state, resulting in greater anxiety and depression symptoms 
and fuelling further alcohol or other substance use which patients 
may regard as treating their anxiety or depression. Epidemiological 
studies in major depressive disorder (as opposed to depressive symp- 
toms) and AUD have shown that either may predate the other. It ap- 
pears that such independent mood disorder may be common in those 
with SUD who present for treatment [2]. Consequently, it should be 
determined how any substance use may be contributing to a patient's 
psychiatric symptoms and discussed with them in order to optimize a 
treatment approach. It may be unhelpful to refer to either the psychi- 
atric disorder or SUD as primary or secondary, since this may imply 
that treating one will improve the other; this is not borne out in many 
studies and so has limited clinical utility. Both disorders will need to 
be treated concurrently for any improvement to be robust. 



































The assessment of patients who have AUD or SUD comorbid with 
one or more other psychiatric disorder follows the same principles 
as those highlighted in other sections, including the importance 
of a collateral history from family members or carers. The pri- 
ority is to ensure that a full history is taken to identify the range 
of comorbidities, and an accurate assessment of the frequency and 





levels of use of any substances. The evidence is that this is, in general, 
poorly done [3], and patient outcomes are likely to be substantially 
worse where an AUD or SUD is not identified accurately [4]. 

Screening for nicotine, alcohol, and drug use should be a routine 
part of any psychiatric assessment and integrated into the history of 
the presenting complaint, given the impact these substances have 
on the mental state. For patients not presenting primarily with an 
SUD, it is especially important that these questions are asked in an 
objective, non-judgemental way, explaining to the patient that it is 
important that you fully understand the factors that may be con- 
tributing to their current distress in order to most effectively help 
them. Patients may (understandably) be hesitant to disclose their 
alcohol or substance use if they believe that it may result in sanctions 
or being denied treatment. 

Clinicians need to be competent in understanding how to calcu- 
late units of alcohol in order to accurately assess levels of use, under- 
stand the likely contribution that alcohol is making to the presenting 
complaint (both acutely and chronically), and make a competent 
risk assessment of the likelihood of symptoms on alcohol with- 
drawal [3, 5]. Full assessment of the need for alcohol detoxification 
is covered elsewhere (see Chapter 52) and is no different for patients 
with comorbid conditions. 














This is potentially more challenging given both the range and routes 
of consumption of other drugs. The terminology for substances may 
be unfamiliar to clinicians and frequently changes. Illicitly obtained 
drugs also vary in strength and purity, frequently containing fillers 
or adulterants, which may themselves have psychotropic properties. 
Patients can be reluctant to disclose their level of use or over-report 
it through fear of withdrawal and being undertreated, or as a form of 
active drug-seeking behaviour. There is no practical and reliable way 
to reconcile these issues. Record what is reported, ask for a collat- 
eral history, and regularly reassess the patient if they are presenting 
in an acutely disturbed state. Urine drug screens (where available) 
may be helpful to detect opioids, some stimulants, and cannabis, but 
the availability of novel psychoactive substances (NPS) means that 
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Fig. 56.1 Comorbid addiction vulnerability. 


increasingly clinicians will need to rely on the history and empir- 
ical observation of the effect of the psychopharmacological prop- 
erties of any substance use on the patient’s presentation. These can 
be divided broadly into three main categories: sedatives, stimulants, 
and hallucinogens—the properties and management of which are 
covered elsewhere (see Chapters 53, 54, 55, and 56). 


Diagnosis and goals of treatment 


One major challenge in the management of patients with comorbidity 
is that they may present requesting treatment for one condition (for 
example, depression) and not be aware, or willing to consider, the 
impact that comorbid alcohol or other substance use has on their 
presentation or outcome. 

A patient-centred approach to treatment needs to begin where the 
patient is, in terms of their own explanation for their illness, and to 
agree on some preliminary, achievable goals that build trust in the 
therapeutic relationship and increase the patient's self-efficacy. 

For patients who are not physically dependent on alcohol or 
other substances, psychosocial approaches may be sufficient; brief 
interventions, based on motivational interviewing principles for 
behaviour change, are as appropriate for patients with a comorbid 
psychiatric disorder and can be delivered within general psychi- 
atric settings as part of a holistic management plan. Essentially, what 
works in other settings should be tried but may be less effective [6]. 

Patients presenting with comorbid substance dependence (particu- 
larly with alcohol) require a clear management plan, integrating a com- 
bination of psychosocial and pharmacological treatments targeted at 
BOTH conditions, which is likely to be more effective for patients who 
have often tried and failed with a less co-ordinated approach. 

Establishing patients who are dependent on nicotine or opioids 
on some form of substitution therapy is beneficial, but there is no 
evidence for any pharmacological approaches in the management of 
comorbid cannabis use, stimulant use, or any of the NPS. For these, 
psychosocial approaches are the mainstay of treatment. 


The individual substances 


Nicotine dependence 


Individuals with severe mental illness (SMI) have among the 
highest prevalence rates of current nicotine smoking, at 50-85% 
[7]. Tobacco smoking is a substantial contributor to the reduction in 


life expectancy for such individuals. The introduction of a smoking 
ban in hospitals presented some challenges to inpatient psychiatric 
units, particularly where individuals may be residing involuntarily 
for some time. Individuals with schizophrenia are three times more 
likely to initiate smoking, and although they are equally as inter- 
ested in quitting smoking, including taking nicotine replacement 
therapy (NRT), they are five times less likely to be successful at so 
doing than the general population [7]. It is therefore important that 
support to stop or change smoking behaviour should be offered. 
The role of e-cigarettes in helping such individuals change is not 
currently clear. 

In a meta-analysis of efficacy and tolerability of adjunctive 
pharmacotherapy for smoking cessation in adults with SMI, trials 
included schizophrenia, schizoaffective disorder, and bipolar dis- 
order, and both bupropion and varenicline were superior to placebo, 
with no differences in tolerability [8]. However, the quality of the 
trials was noted as low. Systematic reviews of varenicline concluded 
that it is superior to placebo in smoking cessation in individuals 
with SMI [8, 9]. Varenicline increased the chances of successful 
long-term smoking cessation by between 2-3 times, compared with 
pharmacologically unassisted quit attempts [9]. The most common 
side effect reported was nausea. However, it is the risk of neuro- 
psychiatric side effects from varenicline that has been debated. One 
systematic review and meta-analysis did not find a significant dif- 
ference in risk of suicide, attempted suicide or suicidal ideation, de- 
pression, or death between varenicline and placebo, and no evidence 
that the presence of psychiatric illness impacted on depression or 
suicidal ideation [10]. The Evaluating Adverse Events in a Global 
Smoking Cessation Study (EAGLES) investigated neuropsychiatric 
complications from varenicline in over 8000 motivated-to-quit indi- 
viduals with and without a psychiatric disorder [11]. There was no 
significant increase in neuropsychiatric adverse events attributable 
to varenicline or bupropion relative to nicotine patch or placebo. 
Varenicline was more effective than placebo, nicotine patch, and bu- 
propion in helping smokers with a psychiatric disorder (including 
psychosis and mood and anxiety disorders) achieve abstinence, and 
bupropion and nicotine patch were more effective than placebo. It is 
therefore suggested that all patients quitting smoking should be told 
about the small chance they may experience severe mood changes 
and all clinicians should be alert for these changes. It is notable that 
the majority of trials only included those who were motivated to quit 
and whose psychiatric disorder was stable and so may not be gener- 
alizable to all individuals or psychiatric patients. 


Alcohol dependence 


Usual medication regimens for alcohol detoxification can be used in 
patients with comorbid psychiatric illness, with due consideration 
for doses required to avoid complications, for example seizures, and 
any liver or renal impairment altering pharmacokinetics. Mentally 
ill patients are more likely to require inpatient admission for detoxi- 
fication, and so relapse prevention plans should be established and 
implemented at this time. 

There are three main relapse prevention medications to be con- 
sidered for patients with AUD: acamprosate, naltrexone, and 
disulfiram [12]. The limited evidence for their use in different 
comorbid conditions will be considered later, but the principle of 
active treatment of alcohol dependence as part of overall patient 
management and treatment plan is key [13]. Where there is no spe- 
cific evidence for use in an individual comorbid condition, data on 
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the drug’s pharmacokinetics and other safety data can be used to 
extrapolate from the treatment of AUD alone to help determine a 
risk—-benefit profile of treatment with a specific medication in indi- 
vidual patients. 


Opioid dependence 


The most robust evidence for the pharmacological treatment of 
opioid dependence in patients with a range of psychiatric condtions 
is to engage in opiate substitution therapy (OST) programmes such 
as methadone or buprenorphine [13]. 


Psychiatric disorders 


Schizophrenia 


Managing patients with schizophrenia who also use substances is a 
common and substantial challenge. While the focus is often on can- 
nabinoids, comorbidity with nicotine and alcohol is more common 
and, as such, has a significant impact on life expectancy. Stimulant 
use disorder, which can be associated with psychotic phenomena, 
has been reported to be higher in the schizophrenia spectrum than in 
the general population at 10.4% but varies considerably, depending 
on the setting, geographical location, etc.; notably, cannabis use dis- 
order was strongly linked with stimulant use disorder [14]. While 
there is some evidence that residential ‘dual diagnosis’ treatment 
programmes may lead to improvements, their availability is likely to 
be very limited, at least in the UK. Most individuals will be treated in 
the community, with occasional admissions to a general psychiatric 
ward and varying input from specialist addiction services. 

There are a range of clinical guidelines providing approaches to 
treating individuals with SUD and schizophrenia [15, 16]. Negative 
symptoms and cognitive impairment can make it particularly chal- 
lenging to engage an individual with schizophrenia in psychological 
approaches to reduce their drug use or prevent relapse, as well as 
contributing to poor adherence to medication. The UK NICE guide- 
lines for such comorbidity recommend that the severity of both 
disorders, the individual’s social and treatment context, and their 
readiness to change should be considered [16]. In addition, clear ad- 
vice about reducing the risk of their substance abuse (for example, by 
not injecting, avoiding to use multiple drugs concurrently, etc.) and 
reminders of adverse consequences of their substance abuse, as well 
as not taking their drugs for psychosis, should not be forgotten [15]. 


Psychosocial approaches 


Cochrane reviews have concluded that there was no good evidence for 
effectiveness of any one particular psychosocial treatment in psych- 
otic spectrum disorders comorbid with substance abuse or for those 
with serious mental illness, of which the majority had schizophrenia 
or psychotic illness [17]. However, this should be seen in the context 
of a very limited evidence base. Motivational interviewing is the most 
common approach studied, often in addition to CBT, and other models 
investigated include contingency management and skills training. 

In the UK, a trial in schizophrenia, schizophreniform disorder, 
or schizoaffective disorder and substance (drug and/or alcohol) 
use disorder did not find that integrated motivational interviewing 
and CBT plus standard care were superior to standard care alone 
in improving outcomes, for example hospitalization, symptom 
outcomes, and functioning [18]. Alcohol was the most commonly 


misused substance, followed by cannabis. Integrated therapy resulted 
in a reduction in the amount of substance used for at least 1 year after 
completion of therapy. It is interesting to note that this reduction in 
substance use was not associated with improvements in outcomes 
for their psychotic illness and is in keeping with other studies. To 
test the theory that more prolonged treatment was needed, a further 
trial compared treatment as usual with shorter-term intervention 
(12 sessions over 4.5 months) and with a longer-term delivery (24 
sessions over 9 months) [19]. This again showed no differences be- 
tween groups in any of the outcomes regarding psychosis or cannabis 
use. Though no association was found between sessions attended and 
outcome, the median number of sessions attended was 12 of the 24 
available, despite good therapeutic relationships and meeting in the 
patients home. The authors note that this result is in keeping with 
trials treating cannabis use disorder alone; focus on associated prob- 
lems, rather than cannabis use itself, may be more helpful. 


Pharmacological approaches to treating schizophrenia 


There are few randomized trials comparing different medications to 
guide clinical practice. Published studies mostly describe uncontrolled 
designs with small numbers of participants, comparison of the impact 
of switching the type of medicines, or case series. Dopamine-blocking 
activity in the dopaminergic mesolimbic pathway was originally seen as 
an appropriate therapeutic strategy for substance misuse, since it would 
reduce their rewarding effects; it is now recognized that dopaminergic 
activity may be blunted in patients with SUD. Thus, further blockade is 
likely to be counterproductive, with the potential to increase substance 
use as a way to boost the hypodopaminergic mesolimbic system. It was 
hypothesized that newer drugs with weaker DRD2 antagonism (and 
less impact on the dopaminergic system) would result in less substance 
misuse when used to treat patients with schizophrenia. The Clinical 
Antipsychotic Trial of Intervention Effectiveness (CATIE) compared 
olanzapine, risperidone, ziprasidone, quetiapine, and perphenazine; 
the CATIE study participants included 37% with a current SUD, and of 
these, 87% drank alcohol, 44% used cannabis, and 36% used cocaine. 
While the CATIE study showed that olanzapine was superior to other 
drugs on the primary outcome of ‘time to all-cause discontinuation’ 
over 18 months, illicit drug use attenuated this superiority [20]. There 
was little between the different drugs for psychosis on measures of 
smoking, alcohol, and drug use in multiple subgroups with reductions 
evident across all groups. Nevertheless, there is no large randomized 
trial a priori comparing different dopamine antagonists on substance 
use outcomes. 

The role of clozapine in treating schizophrenia comorbid with 
SUD is of considerable interest, given its efficacy in treatment- 
resistant populations. In routine clinical practice, these patients 
would include those with comorbidity and there are case reports and 
case series describing beneficial effects of clozapine [13]. However, 
there has been no randomized prospective trial of clozapine in this 
clinical population. In summary, clinical guidance is that the choice 
of drugs for psychosis can be based on the side effect profile and tol- 
erability for each patient without regard to their SUD. 


Treating SUD in the context of schizophrenia 


Alcohol dependence 


There are a number of alcohol relapse prevention medications that 
should be considered to improve drinking behaviour, but there is 
limited evidence to guide practice [13]. For both acamprosate and 
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naltrexone, no adverse effects are apparent of additional particular 
relevance, compared with a non-comorbid population. A placebo- 
controlled RCT reported that the opiate antagonist naltrexone was 
associated with fewer drinking days, fewer heavy drinking days, and 
less craving, though psychosis did not concurrently improve [21]. 

Use of disulfiram in patients with psychosis is relatively contra- 
indicated because its use in high doses will result in higher dopa- 
mine levels and potentially acutely increase psychosis [22]. However, 
more recent studies in patients with psychosis—not all with schizo- 
phrenia [13, 21]—have reported disulfiram reduced alcohol con- 
sumption, with limited adverse events. 


Cannabis use disorders 


A systematic review and meta-analysis reported that motivational 
interviewing, with or without CBT, reduced the quantity of can- 
nabis use (but not the frequency) and positive symptoms of schizo- 
phrenia; however, there was limited impact on negative symptoms 
[23]. Delivery of MI + CBT to a group, rather than an individual, did 
not improve cannabis or psychotic outcomes, though the quality of 
life did improve [24]. One trial compared ‘treatment as usual’ with 
combined MI and CBT in patients with schizophrenia spectrum 
psychosis and cannabis use disorder [25]. The combined interven- 
tion showed a possible reduction in the number of monthly joints, 
but not days of cannabis use, and did not reduce psychotic symp- 
toms. However, further analysis revealed that those who received the 
combined intervention had more visits to a psychiatric emergency 
department and hospitalizations but spent less time in hospital. 

If focusing on the patient alone may not be particularly beneficial, 
does involving the family improve outcomes? A recent trial in recent- 
onset schizophrenia with cannabis use found that family-based MI 
and interaction skills were superior to routine family support for 
some outcomes. Thus, training parents helped reduce cannabis use 
but did not reduce the parent’s stress and sense of burden or their 
general level of functioning. Notably, this benefit in reducing can- 
nabis use was maintained at 15-month follow-up. While statistically 
there was no difference between groups or during trial of follow-up 
in other substance use, it is interesting to note that alcohol use did 
increase in the intervention group where a reduction in cannabis use 
was seen [26]. 

No single medicine has superiority over another; for example, 
olanzapine and risperidone are similar in impact on substance use 
and psychosis [13]. A pilot study comparing clozapine (mean: 225 
mg/day) with ziprasidone (mean: 200 mg/day) in patients with 
schizophrenia and cannabis use disorder found that cannabis use 
similarly decreased in both groups over 12 months [27]. While clo- 
zapine was associated with fewer positive symptoms, it also incurred 
more side effects and poorer compliance. Thus, the available data 
support clinical guidance that the choice of drugs for psychosis can 
be based on the side effect profile and tolerability for each patient 
without consideration of their SUD. 


Stimulant use disorder 


Psychological approaches are the mainstay of treatments for stimu- 
lant use disorders, with contingency management showing effective- 
ness [13]. An RCT compared 3 months of contingency management 
for stimulant abstinence plus treatment as usual, and treatment as 
usual with reinforcement for study participation only in patient 
with serious mental illness [28]. Of the participants, about 33% had 


schizoaffective spectrum disorder, 34% had bipolar disorder, and 
27% had major depression. Though they were less likely to remain 
in the 3-month trial, those who received contingency management 
were 2.4 times more likely to have a stimulant-negative urine. In 
addition, they had less use of stimulants in the following 3 months 
and significantly lower levels of alcohol use, injection drug use, and 
psychiatric symptoms, and were less likely to be admitted for psychi- 
atric hospitalization. 


Bipolar disorder 


Substance misuse in bipolar disorder provides clinical, diagnostic, 
and management challenges [15, 16]. AUDs are particularly highly 
prevalent in bipolar disorder and may complicate both manic and 
depressive presentations. There is no evidence that a particular mood 
stabilizer is superior to another in improving either substance misuse 
or bipolar disorder [13]. The combination of sodium valproate and 
lithium may be better than lithium alone in improving alcohol out- 
comes, though not necessarily manic and depressive symptoms. Due 
to the scant evidence base, clinical guidance generally recommends 
following usual practice in managing their bipolar disorder. 


Treatment of substance use disorder 


Again, there is limited evidence and it mainly applies to alcohol de- 
pendence. There is evidence to suggest that naltrexone or disulfiram 
are superior to placebo in improving drinking behaviours in those 
with alcohol dependence and a psychotic disorder, of whom the ma- 
jority (73%) had bipolar disorder [21]. However, they did complain 
more about side effects. Acamprosate has been used safely in pa- 
tients with bipolar disorder and alcohol dependence, in addition to 
their mood-stabilizing medication, but without benefit in a prelim- 
inary report [29]. Despite such limited evidence, medication should 
be considered for management of comorbid alcohol dependence. 
In addition and also for other substances, psychosocial approaches 
should be used. As with other psychiatric disorders, the key is as- 
sessing and diagnosing any substance misuse and how it may con- 
tribute to their bipolar disorder in order to manage it optimally. 


Depression 


The co-occurrence of mood and SUD is common, yet studies spe- 
cifically investigating effective management of both conditions 
are limited. There have been three systematic reviews [30-32], all 
examining slightly different, but related, questions. These used the 
same 11 RCTs (published between 1990 and 2010), evaluating the 
efficacy of different medicines in depression with comorbid alcohol 
dependence, as the basis of their analysis. Nunes and Levin (2004) 
[30] analysed 14 studies, eight with depression and comorbid al- 
cohol dependence, two with depression and cocaine use, and four 
with depression and comorbid opioid dependence (on methadone 
maintenance treatment). They concluded that overall, drugs for de- 
pression had a modest beneficial effect, but that ‘concurrent therapy 
directly targeting the addiciton was also required’. The meta-analysis 
by Torrens et al (2005) [31] investigated whether sertraline, tricyclic 
reuptake inhibitors, and nefazadone were an effective treatment for 
patients with SUD in a wider range of studies (nine with depression 
and comorbid alcohol dependence, five with cocaine, and seven with 
opioid dependence comorbid with depression), combined with an- 
other 40 studies examining the efficacy of drugs for depression in 
nicotine, cocaine, and alcohol dependence alone. They concluded 
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there was no significant effect for sertraline on depressive symp- 
toms in the comorbid alcohol group. There was some benefit for 
medicines with broader pharmacology (imipramine, desipramine, 
and nefazodone) on depressive symtoms in the comorbid group, al- 
though safety concerns in overdose limit their usefulness. In terms 
of reducing alcohol consumption, neither SSRIs nor the other drugs 
for depressiom on their own had any significant impact. The most 
recent meta-analysis [32] compared 11 RCTs of drug efficacy in al- 
cohol and comorbid depression (involving 891 patients), with the 
clinical trial data of over 46,000 patients with depression alone. 
Response rates to the active treatment was 50-60% in both the 
depression-only group and the depression comorbid with alcohol 
use group. However, placebo response differed significantly across 
the groups, with 37.7-39.3% in the non-comorbid depression group 
and 47.1-53.1% in the comorbid alcohol group. A limitation with 
all three meta-analyses is that they do not include any data on SSRIs 
(other than sertraline and fluoxetine), mirtazapine, or venlafaxine. 
A recent small (N = 14) double-blind, placebo-controlled pilot trial 
of mirtazapine in depressed patients and comorbid AUD suggests 
that it is safe and tolerated, and may have an effect on depressive 
symptoms, but not alcohol consumption, although it was extremely 
underpowered for confidence in its conclusions [33]. 


Treatment of substance use disorder 


While drugs for depression have a modest effect on levels of de- 
pression in the comorbid group, they have not been shown to cause 
a direct or consequent reduction in alcohol consumption [30], and 
similarly use of relapse prevention medication alone (without 
treating the underlying psychiatric disorder) had minimal ef- 
fect on outcomes [21]. Improved outcomes have been shown by 
combined treatment for depression (with sertraline 200 mg daily) 
and alcohol dependence (naltrexone 100 mg daily) in a placebo- 
controlled four-arm study. Only the active combination of both 
drugs was significantly better in improving drinking behaviour 
and also had a marginally better effect on depression, relative to 
placebo [34]. Overall the safety profile for use of acamprosate and 
naltrexone in patients with comorbid depression is good; a thor- 
ough risk assessment needs to be undertaken prior to the use of 
disulfiram, due to potential dangers of interaction with alcohol in 
suicidal patients [35], but active treatment of both conditions is 
recommended [13]. 

It is less common for patients dependent on opioids to present 
with depression, and beyond establishing them on OST, there 
is limited benefit of treating patients with a drug for depression 
[31, 36]. For patients with depression and use of dopamine-releasing 
drugs (‘stimulants’) or cannabis, there is no evidence for the benefit 
of pharmacological treatment for their substance use, and manage- 
ment is primarily psychosocial. 


PTSD and other anxiety disorders 


Anxiety symptoms are common in patients with SUD and may be 
present as part of intoxication, withdrawal, or early abstinence from 
substances, depending on the substance used, and a thorough his- 
tory is required to ascertain the role that they play in the presenting 
mental state. For example, patients with severe alcohol dependence 
frequently develop panic symptoms, which resolve on detoxification 
and abstinence. Patients with anxiety disorders also frequently have 
SUD, and the temporal relationship between the two conditions are 


frequently difficult to untangle, as PTSD has often been present for 
many years before patients present. 

In PTSD, SUD is 2-3 times more likely than in the general popu- 
lation, and outcomes in the comorbid population are worse that out- 
comes for either disorder alone. A recent review of the evidence base 
for PTSD co-occurring with AUD [37] showed that pharmacological 
interventions for AUD (naltrexone, disulfiram, and topiramate) or 
PTSD (sertraline, desipramine, prazosin) were mainly effective in 
reducing drinking outcomes (including one study with adjuvant 
psychotherapy), but only one study with sertaline demonstrated an 
effect over placebo in the reduction of PTSD symptoms. The review 
concludes that the safety profile for topiramate is likely to limit its 
usefulness in clinicial practice, but that naltrexone and disulfiram 
are relatively safe to use, although further studies are required to test 
the efficacy of pharmacological and psychotherapeutic interven- 
tions in combination for both disorders [13, 37]. 

Patients with social anxiety disorder (SAD) have often had symp- 
toms from childhood and start using substances (especially al- 
cohol) as a form of self-management during adolescence, with SAD 
being the primary condition in 80% of cases. Despite the significant 
comorbidity between the two conditions, the only studies conducted 
have been with paroxetine, and a recent Cochrane review found the 
quality of studies to be low [38]. The same review found a single 
RCT for the treatment of generalized anxiety disorder comorbid 
with AUD using buspirone but reported the quality of study design 
and reporting to be low, such that limited conclusions can be drawn. 

Overall, anxiety disorders, despite substantial comorbidity with 
SUD, especially alcohol, have a very limited evidence base for treat- 
ment, and extrapolation from what works in the non-comorbid con- 
dition is needed in clinical settings, in the absence of clear research 
evidence. 

There is some evidence that the GABA agonist baclofen may 
have a role to play in relapse prevention from alcohol in patients 
with anxiety symptoms, and at time of writing, it has a ‘temporary 
recommendation for use’ in France for the maintenance of alcohol 
abstinence or for the reduction of consumption to low levels in pa- 
tients with alcohol dependence who have not responded to other 
treatments. However, evidence for its effectivenss has yet to be fully 
demonstrated, and there are concerns about its safety profile in 
overdose [35]. 


Attention-deficit/hyperactivity disorder 


Substance use is six times more common in adolescents with ADHD 
than the general population. Atomoxetine is recommended as a 
first-line treatment in adults with ADHD comorbid with SUD 
due to the lack of abuse potential, although evidence of efficacy is 
limited. A single study of ADHD adults with comorbid AUD found 
significant effects on ADHD symptoms, but inconsistent effects 
on drinking behaviour; more research is required to investigate ef- 
fective treatments [39]. 


While professionals working in psychiatric services are highly likely 
to have had sufficient training and experience in managing mood 
and psychotic disorders, as well as other less common psychiatric 
disorders, the same is not the case for substance use and addiction. 
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For example, in the UK, the reduced psychiatric expertise in spe- 
cialist addiction services also means that general psychiatric services 
are more likely to have to be involved in managing individuals with 
comorbidity. 

While there is a small evidence base for the effectiveness of 
treatments targeting comorbid SUD and psychiatric disorder, the 
frequency of their occurrence and the negative impact on out- 
comes mean that psychiatrists need to develop a degree of clinical 
expertise in managing these complex conditions. Being aware of 
the evidence available, extrapolating from what is known to be 
effective in the single condtion, and making a balanced risk as- 
sessment prior to a trial of treatment will be necessary to optimize 
treatments. 
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Schizophrenia is probably the most devastating illness that psychi- 
atrists treat. An estimated 1% of the population has schizophrenia, 
which claims its victims at a youthful age and often prevents their 
full participation in society. Schizophrenia also creates an enormous 
economic burden, calculated at $65 billion annually in direct and 
indirect costs [1]. Despite its emotional and economic costs, schizo- 
phrenia has yet to receive sufficient recognition as a major public 
health concern or adequate research support to investigate its causes, 
treatments, and prevention. 





Schizophrenia and other psychotic disorders have been recognized 
in almost all cultures and described and/or portrayed throughout 
much of recorded time. 

Although schizophrenia was only demarcated as a specific diagnostic 
entity at the end of the nineteenth century, earlier accounts appear in the 
literature. Many characters ‘become mad in classical tragedy, although 
the patterns of illness do not map precisely on the modern conceptions 
of schizophrenia—a fact probably due to both literary licence and to the 
fact that the nosology of mental illness was not highly refined at that 
time. By the era of Elizabethan drama, however, we have portrayals of 
schizophrenia that closely resemble the modern concept. The madness 
of Ophelia in Hamlet is quite similar to modern schizophrenia, and the 
portrayal of ‘Poor Tom, son of Gloucester pretending to be a ‘Bedlam 
beggar’ escaped from the large Bethlehem Hospital where mental pa- 
tients were housed, is a near-perfect portrayal of both hallucinations 
and disorganized speech: 


Who gives anything to Poor Tom? Whom the foul fiend 
hath led through fire and through flame ... ? Tom’s 
a-cold,— O, do de, do de, do de... Do poor Tom some 
charity, whom the foul fiend vexes. There I could have 
him now,—and there, and there again, and there. 
King Lear IIL-iv.51 


Its modern history dates to Emil Kraepelin, who is credited with 
identifying schizophrenia [2]. His original term for schizophrenia 





dementia praecox was based on his observations that these pa- 
tients developed their illness at a relatively early age (praecox) and 
were likely to have a chronic and deteriorating course (dementia). 
Kraepelin was also instrumental in separating dementia praecox 
from manic-depressive illness, which had its onset distributed 
throughout life and a more episodic course. 

Dementia praecox was eventually renamed schizophrenia, a term 
coined by Eugen Bleuler to emphasize the cognitive impairment 
that occurs, which he conceptualized as a splitting of the psychic 
processes [3]. Bleuler believed that certain symptoms were funda- 
mental to the illness, including affective blunting, disturbances of 
association (that is, fragmented thinking), autism, and ambivalence 
(for example, fragmented emotional responses). These eventually 
enjoyed widespread acceptance as ‘Bleuler’s four As: Other symp- 
toms, such as delusions and hallucinations, were considered by him 
to be ‘accessory symptoms, which occurred in other disorders such 
as manic-depressive illness. 

Bleuler’s ideas enjoyed widespread acceptance, and generations 
of psychiatrists were taught the importance of his fundamental 
symptoms (the four As). Unlike hallucinations and delusions, these 
symptoms are on a continuum with normality and can be present 
in relatively mild forms, even in psychiatrically healthy persons. 
Consequently, the conceptualization of schizophrenia in the United 
States became increasingly broad. Later, the ideas of the German 
psychiatrist Kurt Schneider, who emphasized first-rank or spe- 
cific psychotic symptoms, were introduced, helping to reshape the 
concept of schizophrenia into one of a relatively severe psychotic 
disorder, bringing it back to the original ideas of Kraepelin [4]. 
DSM-5 represents a convergence of various points of view, with its 
Kraepelinian emphasis on course, specific delusions, and hallucin- 
ations thought important by Kurt Schneider, and acknowledgement 
of the importance of Bleuler’s fundamental symptoms. 








In DSM-5, schizophrenia is defined by a group of characteristic 
positive or negative symptoms; deterioration in social, work, or 
interpersonal relationships; and continuous signs of the disturbance 
for at least 6 months [5]. In addition, schizoaffective disorder and 
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mood disorder with psychotic features have been ruled out, and the 
disturbance is not due to the direct physiological effects of a sub- 
stance or a general medical condition. When an illness otherwise 
meets the criteria but has a duration of less than 6 months, it is called 
schizophreniform disorder. When the duration is less than 4 weeks, it 
may be classified as either a brief psychotic disorder or a psychotic dis- 
order not otherwise specified, which is a residual category for psych- 
otic disturbances that cannot be better classified. 





Differential diagnosis 


Schizophrenia should be thought of as a diagnosis of exclusion be- 
cause the consequences of the diagnosis are severe and limit thera- 
peutic options. There are no definitive tests for schizophrenia, so the 
diagnosis rests on historical and clinical information. 

A thorough physical examination and history are required 
to rule out other medical causes for schizophrenic symptoms. 
Psychotic symptoms are found in many other illnesses, including 
substance abuse (for example, hallucinogens, phencyclidine, am- 
phetamines, cocaine, alcohol), intoxication due to commonly 
prescribed medications (for example, corticosteroids, anti- 
cholinergics, levodopa), infections, metabolic and endocrine 
disorders, tumours and mass lesions, and temporal lobe epilepsy. 
Laboratory tests may be helpful in ruling out these aetiologies, 
and magnetic resonance imaging may be useful in selected pa- 
tients to rule out alternative diagnoses or during the initial 
workup for new-onset cases. 

The major differential diagnosis involves separating schizophrenia 
from mood disorder, delusional disorder, or personality disorder. 
The chief distinction from mood disorders is that with schizo- 
phrenia, a full depressive or manic syndrome is either absent or de- 
velops after the psychotic symptoms, or is brief in duration relative 
to the duration of psychotic symptoms. Unlike delusional disorder, 
schizophrenia is characterized by bizarre delusions and hallucin- 
ations are common. Patients with personality disorders, particularly 
those disorders within the eccentric cluster (for example, schizoid, 
schizotypal, or paranoid), may be characterized by indifference to 
social relationships and have a restricted affect, bizarre ideation, or 
odd speech but are not psychotic. 

Other psychiatric disorders also must be ruled out, including 
schizophreniform disorder, brief psychotic disorder, factitious dis- 
order with psychological symptoms, and malingering. When symp- 
toms persist for more than 6 months, schizophreniform disorder 
can be ruled out. The history of how the illness presents will help 
to rule out a brief psychotic disorder, because schizophrenia gen- 
erally has an insidious onset and there are usually no precipitating 
stressors. Factitious disorder may be difficult to distinguish from 
schizophrenia, especially when the patient is knowledgeable about 
mental illness or is medically trained, but careful observation should 
enable the clinician to make the distinction between real or feigned 
psychosis. Likewise, a malingerer could attempt to simulate schizo- 
phrenia, but careful observation and history-taking will help to dis- 
tinguish the disorders. With the malingerer, there will be evidence 
of obvious secondary gain, such as avoiding incarceration or severe 
punishment, and the history may suggest antisocial personality 
disorder. 


Dimensional approaches 


A variety of methods have been developed to describe and classify 
the symptoms in schizophrenia. Traditionally, schizophrenia is con- 
sidered to be a type of ‘psychosis, yet the definition of psychosis has 
been elusive. Older definitions stressed the subjective and internal 
psychological experience and defined psychosis as an ‘impairment 
in reality testing. Moreover, psychosis has been defined objectively 
and operationally as the occurrence of hallucinations and delusions. 
Because schizophrenia is characterized by so many different types 
of symptoms, clinicians and scientists have tried to simplify the de- 
scription of the clinical presentation by categorizing symptoms along 
dimensions, rather than discrete categories of psychopathology. 

Recent research suggests there are three symptom dimensions. 
The first—psychoticism—involves hallucinations and delusions, 
the classic symptoms of psychosis. The second—disorganization— 
involves bizarre speech and behaviour and incongruous affect. The 
third—negative symptoms—involves alogia, affective blunting, 
anhedonia, and avolition. The author has developed the Scale for 
the Assessment of Positive Symptoms (SAPS) and the Scale for the 
Assessment of Negative Symptoms (SANS) to evaluate these symp- 
toms [6, 7]. (See Table 57.1 for a description of the frequencies of 
positive and negative symptoms.) 


The psychotic dimension 


This symptom dimension refers to two classic ‘psychotic’ symp- 
toms that reflect a patient’s confusion about the loss of boundaries 
between himself or herself and the external world: hallucinations 
and delusions. Both symptoms reflect a ‘loss of ego boundaries’ — 
the patient is unable to distinguish between his or her own thoughts 
and perceptions and those that he or she obtains by observing the 
external world. 

Hallucinations have sometimes been considered to be the 
hallmark of schizophrenia, although they may occur in other 
disorders, including mood disorders and disorders induced by 
medical illness or various substances. Hallucinations are percep- 
tions experienced without an external stimulus to the sense or- 
gans and with a quality similar to a true perception. Schizophrenic 
patients commonly report auditory, visual, tactile, gustatory, or 
olfactory hallucinations, or a combination of these. Auditory 
hallucinations are the most frequent; they are commonly experi- 
enced as voices but may also be noises or music. The voices come 
from outside the person’s head, and they usually speak words, 
phrases, or sentences. Visual hallucinations may be simple or 
complex and include flashes of light, persons, animals, or objects. 
Olfactory and gustatory hallucinations are often experienced to- 
gether, especially as unpleasant tastes or odours. Tactile hallu- 
cinations may be experienced as sensations of being touched or 
pricked, electrical sensations, or sensations of insects crawling 
under the skin, which is called formication. 

Delusions involve disturbance in thought, rather than perception; 
they are firmly held beliefs that are untrue, as well as contrary to a 
person's educational and cultural background. Delusions occurring 
in schizophrenic patients may have somatic, grandiose, religious, ni- 
hilistic, or persecutory themes. The type and frequency of delusions 
tend to differ according to culture. For example, in the United States, 


Table 57.1 Frequency of symptoms in schizophrenia patients 





NEGATIVE SYMPTOMS POSITIVE SYMPTOMS 
Affective flattening Hallucinations 
Unchanging facial expression 96 Auditory 75 
Decreased spontaneous 66 Voices commenting 58 
movements 
Few expressive gestures 81 Voices conversing 5 
Poor eye contaci 7) Somatic-tactile 20 
Affective non-responsivity 64 Olfactory 6 
Inappropriate affect 63 Visual 49 
Lack of vocal inflections 13 
Delusions 
Alogia Persecutory 81 
Poverty of speech 53 Jealous 4 
Poverty of content of speech 51 Guilt, sin 26 
Blocking 23 Grandiose 29 
Increased response latency 21 Religious 31 
Somatic 28 


Avolition-apathy Delusions of reference 49 


Impaired grooming and 87 Delusions of being 46 

hygiene controlled 

Lack of persistence atworkor 95 Delusions of mind 48 

school reading 

Physical anergia 82 Thought broadcasting 23 
Thought insertion 3] 

Anhedonia-asociality Thought withdrawal 2y 

Few recreational interests/ 95 

activities 

Little sexual interest/activity 69 Bizarre behaviour 

Impaired intimacy/closeness 84 Clothing, appearance 20 


Few relationships with friends 96 Social, sexual behaviour 33 





Peers Aggressive-agitated oT 
Repetitive-stereotyped 28 

Attention 

Social inattentiveness 78 Positive formal thought 
disorder 

Inattentiveness during testing 64 Derailment 45 
Pressure of speech 24 
Tangentiality 5 
Distractible speech 23 
Clanging 3 
ncoherence 2) 
llogicality 23 
Circumstantiality 35 


Adapted from Schizophr Bull, 13(1), Andreasen NC, The diagnosis of schizophrenia, 
pp. 9-22, Copyright (1987), with permission from Oxford University Press. 


a patient might worry about being spied on by the FBI or CIA; in 
sub-Saharan Africa, a Bantu or Zulu patient is more likely to worry 
about persecution by demons or spirits. (See Table 57.2 for a de- 
tailed description of the various types of delusions.) 
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Table 57.2 Varied content in delusions 


| Delusions | Foci of preoccupation 


Grandiose Possessing wealth or great beauty, or having a special 
ability (for example, extrasensory perception); having 
influential friends; being an important figure (for example, 
Napoleon, Hitler) 





Nihilistic Believing that one is dead or dying; believing that one does 
not exist or that the world does not exist 


Persecutory Being persecuted by friends, neighbours, or spouses; being 
followed, monitored, or spied upon by the governmen 





Somatic Believing that one’s organs have stopped functioning (for 
example, that the heart is no longer beating) or are rotting 
away; believing that the nose or other body part is terribly 
misshapen or disfigured 


Sexual Belief that one’s sexual behaviour is commonly known; that 
one is a prostitute, paedophile, or rapist; that masturbation 
has led to illness or insanity; that one’s sexual abilities are 


legendary 





Religious Belief that one has sinned against God; that one has a 


special relationship to God or some other deity; that one 
has a special religious mission; that one is the Devil or is 
condemned to burn in Hell 


Certain types of hallucinations and delusions were considered 
‘first rank by Kurt Schneider [4]. These hallucinations include 
clearly audible voices commenting on a person’s actions, or arguing 
with each other about a patient, or repeating aloud the patient's 
thoughts. The delusions include thought broadcasting, thought 
withdrawal, thought insertion, or being controlled. These symptoms 
commonly occur in schizophrenic patients but are also found in pa- 
tients with psychoses due to mood disorders, medical illness, or the 
effect of substances. (See Table 57.3 for a description of Schneider's 
‘first-rank symptoms.) 


The disorganization dimension 


The disorganization dimension includes disorganized speech, disor- 
ganized or bizarre behaviour, and incongruous affect. 

Disorganized speech, or thought disorder, was regarded as the 
most important symptom of schizophrenia by Bleuler. Historically, 
the types of thought disorder have included associative loosening, 
illogical thinking, over-inclusive thinking, and loss of the ability to 
engage in abstract thinking. Standard definitions for various types of 
thought disorders have been developed that stress objective aspects 
of language and communication (which are empirical indicators of 
‘thought’), such as derailment, poverty of speech, poverty of content 
of speech, or tangential replies, and all have been found to occur in 
both schizophrenia and mood disorders. Manic patients often have 
a thought disorder characterized by tangentiality, derailment (loose 
associations), and illogicality. Depressed patients manifest thought 
disorder less frequently than manic patients but may display poverty 
of speech, tangentiality, or circumstantiality. Other types of formal 
thought disorder include perseveration, distractibility, clanging, 
neologisms, echolalia, and blocking; with the possible exception of 
clanging in mania, none appears to be disorder-specific. 

An example of speech from a schizophrenia patient with a prom- 
inent thought disorder, especially derailment, follows: 


Let’s see, there was one I would have liked if it wasn’t for the in- 
structor, well I go along with his, he was always wanted me to do the 
worse in the class, it seemed, like, and I'd always get bad, the grade, 
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Table 57.3 Schneider's first-rank symptoms 


Thought echo Hearing one’s thoughts being spoken aloud 


Voices commenting earing a voice making a running commentary 
about oneself 
































Voices conversing earing several voices talking or arguing about 
oneself 

Somatic hallucination A hallucination involving the perception of a physical 
experience within the body 

Thought withdrawal Belief that thoughts are being withdrawn or inserted 

or insertion into the patient's mind by an outside force 

Thought broadcasting Belief that the person's thoughts are being broadcast 
so that his/her private thoughts are known to others 

Delusional A true perception, to which the person attributes a 

perceptions alse meaning 

Delusions of passivity Being influenced or forced to do things or want things 
he patient does not wish to do and does not want 

Delusions of control Being made to feel emotions or sensations that are 


not the patient's own 


in my grading, and he tried to make other people like they were good 
enough to be in Hollywood or something, you know Ts be the last 
one down the ladder. That, that’s the way they wanted the grading 
to be in the first place according to whose, theirs, they, they have all 
different reasons than I, I, I think that they use that they want one, 
won't come out.' 


Many schizophrenic patients display various types of disorgan- 
ized motor and social behaviour, another aspect of this dimension. 
Abnormal motor behaviours range from catatonic stupor to excite- 
ment. In a catatonic stupor, the patient may be immobile, mute, and 
unresponsive, yet fully conscious. With catatonic excitement, the pa- 
tient may exhibit uncontrolled and aimless motor activity. Patients 
sometimes assume bizarre or uncomfortable postures, such as 
squatting, and maintain them for long periods. Patients may exhibit 
a stereotypy, which is repeated, but non-goal-directed, movement 
such as back and forth rocking. They may also display manner- 
isms, which are normal goal-directed activities, but are either odd 
in appearance or out of context such as grimacing. Other common 
symptoms are echopraxia, or imitating the movements and gestures 
of another person; automatic obedience, or carrying out simple com- 
mands in a robot-like fashion; and negativism, or refusing to co- 
operate with simple requests for no apparent reason. 

Deterioration of social behaviour often occurs along with social 
withdrawal. Patients may neglect themselves, become messy or un- 
kempt, and wear dirty or inappropriate clothing. Patients may ig- 
nore their surroundings, so that they become cluttered and untidy. 
Patients may develop other odd behaviours that break social con- 
ventions such as masturbating in public, foraging through garbage 
bins, or shouting obscenities. Many of today’s street people suffer 
from schizophrenia. 

Incongruity of affect is another component of the disorganized di- 
mension. Patients may smile inappropriately when speaking of neu- 
tral or sad topics, or giggle for no apparent reason. This symptom 





1. Reproduced from Andreasen NC, The Broken Brain: The biological revolution 
in psychiatry, pp. 61, Copyright (1984), with permission from HarperCollins 
Publishers. 


should not be confused with the nervous smiling or giggling that 
sometimes occurs in anxious patients. 


The negative dimension 


DSM-5 lists two negative symptoms as characteristic of schizo- 
phrenia: affective blunting (diminished emotional expression) and 
avolition. Other negative symptoms that are common in schizo- 
phrenia include alogia (poverty of speech), anhedonia, and at- 
tentional impairment. Taken together, they may be considered to 
represent a new ‘five As. Negative symptoms reflect a deficiency of 
mental functioning that is normally present. 

Affective flattening or blunting is a reduced intensity of emotional 
expression and response. It is manifested by unchanging facial ex- 
pression, decreased spontaneous movements, poverty of expres- 
sive gestures, poor eye contact, lack of voice inflections, and slowed 
speech. 

Avolition is a loss of the ability to initiate goal-directed behaviour 
and carry it through to completion. Patients seem to have lost their 
will or drive. They may initiate a project and then abandon it for a 
few days or weeks, and then fail to appear or wander aimlessly away 
while at work. This symptom is sometimes interpreted as laziness 
but, in fact, represents the loss or diminution of basic drives and the 
capacity to formulate and pursue long-range plans. 

Alogia is characterized by a diminution in the amount of spontan- 
eous speech, as well as a tendency to produce speech that is empty 
or impoverished in content when the amount is adequate. It is the 
external expression in language of the impoverishment of thought 
that occurs in many patients with schizophrenia. Patients may have 
great difficulty in producing fluent responses to questions. 

Anhedonia, or the inability to experience pleasure, is also very 
common. Many patients describe themselves as feeling emotion- 
ally empty. They are no longer able to enjoy activities that previously 
gave them pleasure such as playing sports or seeing family or friends. 
Their awareness that they have lost the capacity to enjoy themselves 
may be a source of great psychological pain. 

Attentional impairment is reflected in an inability to concentrate 
or by stimuli that they cannot process or filter; this, in turn, causes 
them to feel confused or to experience fragmented thoughts. 

Another aspect of this dimension is a reduced intensity of emo- 
tional response that leaves schizophrenic patients indifferent and 
apathetic. 

Depressive symptoms occur in up to 60% of schizophrenic pa- 
tients. Depression is often difficult to diagnose because symptoms of 
schizophrenia and depression frequently overlap. Dopamine antag- 
onists may also cause what may appear to be depression but is actu- 
ally drug-induced akinesia. 


Other symptoms 


Lack of insight is common in schizophrenia. A patient may not be- 
lieve that he or she is ill or abnormal in any way. To the patient, the 
hallucinations and delusions are real, not imagined. Poor insight is 
one of the most difficult symptoms to treat, and it may persist even 
when other symptoms, such as hallucinations or delusions, respond 
to treatment. Orientation and memory are usually normal, unless 
they are impaired by the patients psychotic symptoms, inattention, 
or distractibility. 

Non-localizing neurological soft signs occur in a substantial 
proportion of patients and include abnormalities in stereognosis, 


graphesthesia, balance, and proprioception [11]. A disorder of the 
visual tracking of smoothly moving targets (that is, smooth pursuit 
eye movement) has been observed in both schizophrenia patients 
and their relatives. Other ocular abnormalities commonly include 
the absence and avoidance of eye contact and staring for long 
periods. Decreased or rapid blink rates and bouts of rapid blinking 
may occur. Some patients display disturbances of sleep, sexual 
interest, and other bodily functions. A variety of disrupted sleep 
measures have been reported, but decreased delta sleep with dimin- 
ished stage 4 is the most consistent finding. Many schizophrenia pa- 
tients have inactive sex drives and derive little or no pleasure from 
sexual activity. 


Premorbid personality 


Several early writers, including Kraepelin and Bleuler, observed 
that patients with schizophrenia often had abnormal premorbid 
personalities. A review of early studies of personality and schizo- 
phrenia showed that premorbid schizoid traits were present in 
one-fourth of schizophrenic patients, but one-sixth had a range 
of other personality disturbances [12]. Poor premorbid adjust- 
ment has been shown to correlate with early disease onset, poor 
overall prognosis, negative symptoms, cognitive deficits, and 
poor social functioning. 


Alcohol and drug abuse 


Alcohol and drug abuse is especially common in patients with 
schizophrenia. Drug-using patients tend to be young, male, and 
poorly compliant with treatment; they also tend to have frequent 
hospitalizations. It is believed that many abuse drugs in an attempt to 
treat their depression or their medication side effects (for example, 
akinesia) or to ameliorate their lack of motivation and pleasure. 


Other problems 


People who suffer from schizophrenia are at high risk of commit- 
ting suicidal acts [13]. About one-third will attempt suicide, and 1 in 
10 will eventually complete suicide. Risk factors for suicide include 
male gender, age under 30 years, unemployment, chronic course, 
prior depression, past treatment for depression, history of substance 
abuse, and recent hospital discharge. 

Recent research shows that patients with schizophrenia and other 
severe mental disorders exhibit relatively high rates of violent behav- 
iour and criminality. In the Epidemiologic Catchment Area study 
[14], schizophrenic patients had rates of violent behaviour five times 
higher than persons without mental illness, although the rate was 
about half that seen in persons with alcohol abuse or dependence. 
Schizophrenic patients with coexisting alcoholism are even more 
likely to commit violent acts. 


Course of illness 


Schizophrenia is typically viewed as a chronic disorder that begins 
early in life and has a poor long-term outcome. Its onset generally 
begins with a prodromal phase characterized by social withdrawal 
and other subtle changes in behaviour and emotional responsive- 
ness, peculiar behaviour, deterioration in personal hygiene and 
grooming, and strange ideation. The prodrome varies in length but 
typically lasts from months to years. 
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The prodrome is followed by an active phase, in which psychotic 
symptoms first appear. At this point, the clinical disorder becomes 
evident, and a diagnosis of schizophrenia can be made. This phase 
is characterized by hallucinations and delusions, alarming both 
friends and family members, and often leading to medical interven- 
tion. A residual phase follows the resolution of the active phase and 
is similar to the prodrome. Psychotic symptoms may persist during 
this phase, but at a lower level of intensity, and they may not be as 
troublesome to the patient. Active phase symptoms may occur epi- 
sodically (‘acute exacerbations’), with variable levels of remission 
seen between episodes. The frequency and timing of these episodes 
are unpredictable, although stressful situations may precede these 
relapses or, in some instances, drug abuse. The stages of schizo- 
phrenia are described in Table 57.4. 

Relapses are often preceded by changes in thought, feeling, or 
behaviour noticed by the patient and family members. Symptoms 
preceding relapse may include dysphoria, seclusiveness, sleep dis- 
turbance, anxiety, and ideas of reference. Patients gradually accrue 
increased levels of morbidity in the form of residual or persistent 
symptoms and decrements in function from their premorbid status. 
Relatively severe psychosis is continuous and unrelenting in some 
patients. Schizophrenia may reach a plateau of severity at about 
5 years without further deterioration. 

Several long-term follow-up studies have been published using contem- 
porary definitions of schizophrenia [15, 16]. One of the best known is the 
‘Towa 500} in which 186 schizophrenic patients admitted to the University 
of Iowa Psychiatric Hospital between 1934 and 1944 were followed up [15]. 
Twenty per cent were reported to be psychiatrically well at follow-up, but 
54% had incapacitating psychiatric symptoms; 21% were married or wid- 
owed, but 67% had never married; 34% lived in their own home or with a 
relative, but 18% were in mental institutions (including transitional institu- 
tions such as halfway houses); 35% were economically productive, but 58% 
had never worked. The group experienced excessive mortality from both 
natural and unnatural causes, and more than 10% committed suicide. 

In summary, modern outcome studies show that schizophrenia 
can be a devastating illness that may affect multiple aspects of a 
patient's life. However, some patients with schizophrenia will have a 
relatively good outcome and do not experience the severe deterior- 
ation sometimes considered to be a hallmark of the disorder. 


Outcome predictors 


It is difficult to predict outcome in individual patients based on these 
studies. However, factors associated with a good outcome include 


Table 57.4 Typical stages of schizophrenia 


Typical features 


Prodromal Insidious onset over months or years; subtle behaviour 
changes, including social withdrawal, work impairment, 
inappropriate affect, avolition, and strange ideation 


Active phase Psychotic symptoms develop, including hallucinations, 
delusions, or disorganized speech and behaviour. These 
florid symptoms are alarming and lead to medical 
intervention. Acute phase symptoms may re-emerge 


during the residual phase (‘acute exacerbation’) 


Residual phase Active phase symptoms are absent or no longer 
prominent. There is role impairment, negative symptoms, 


or attenuated positive symptoms 
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Table 57.5 Features associated with good and poor outcome 








Onset Acute Insidious 
Duration Short Chronic 
Mood symptoms Present Absent 
Sensorium Clouded Clear 
Premorbid function Good Poor 

Marital functioning arried Never married 
Psychosexual functioning Good Poor 
Neurological functioning ormal Soft signs present 
Structural brain abnormalities one Present 
Parental education High Low 

Family history/schizophrenia egative Positive 


acute onset, short duration of illness, lack of prior psychiatric his- 
tory, presence of affective symptoms or confusion, good premorbid 
adjustment, steady work history, marriage, and older age at onset. 
Poor prognostic features include insidious onset, long duration of 
symptoms, affective blunting, obsessive-compulsive symptoms, 
assaultiveness, premorbid personality disorder, poor work history, 
celibacy, and young age at onset. Prognostic features in schizo- 
phrenia are summarized in Table 57.5. 

Schizophrenic patients are more likely to experience a good outcome 
now than 100 years ago. There are several possible explanations for this 
finding: (1) the illness has changed; (2) neuroleptic medication and 
other treatments have altered the natural history of the illness; or (3) our 
definitions of good outcome have changed. For example, good outcome 
now may include patients living in care facilities who have minimal 
symptoms but are clearly not well. For reasons that are not well under- 
stood, cross-cultural studies have shown that patients in less developed 
countries tend to have better outcomes than those in more developed 
countries [17]. It may be that a person suffering from schizophrenia is 
better accepted in less developed societies, has fewer external demands, 
and is more likely to be taken care of by family members. Women, in 
general, tend to have a better outcome than men in their response to 
medication and in long-term course. 


Mechanisms of schizophrenia 


Many thinkers have considered schizophrenia to be a neurocognitive 
disorder, with the various signs and symptoms reflecting the down- 
stream effects of a fundamental cognitive deficit. Schizophrenia 
poses special challenges to the development of cognitive models 
because of its breadth and diversity of symptoms. The symptoms 
include nearly all domains of cognitive function: perception (hal- 
lucinations), inferential thinking (delusions), fluency of thought 
and speech (alogia), clarity and organization of thought and speech 
(formal thought disorder), motor activity (catatonia), emotional 
expression (affective blunting), ability to initiate and complete 
goal-directed behaviour (avolition), and ability to seek out and ex- 
perience emotional gratification (anhedonia). 

An initial survey of the diversity of symptoms might suggest that 
multiple brain regions are involved in schizophrenia, in a spotty 


pattern much as once occurred in neurosyphilis. In the absence of 
visible lesions and known pathogens, investigators have turned to 
the exploration of models that could explain the diversity of symp- 
toms through a single cognitive mechanism. The convergent conclu- 
sions of these different models are striking. 

An approach some investigators have taken is to divide the symp- 
toms into three broad groups: (1) disorders of willed action (which 
lead to symptoms such as alogia and avolition); (2) disorders of 
self-monitoring (which lead to symptoms such as auditory hallu- 
cinations of alien control); and (3) disorders in monitoring the in- 
tentions of others (‘mentalizing’) (which lead to symptoms such as 
formal thought disorder and delusions of persecution). These could 
represent a more general underlying mechanism—a disorder of 
consciousness or self-awareness that impairs the ability to think with 
‘meta-representations’ (higher-order abstract concepts that are rep- 
resentations of mental states) [18-24]. 

One popular model suggests that the fundamental impairment 
in schizophrenia is an inability to guide behaviour by representa- 
tions, often referred to as a defect in working memory [19]. Working 
memory involves the ability to hold a representation ‘online’ and 
perform cognitive operations in a flexible manner, to formulate and 
modify plans, and to base behaviour on internally held ideas and 
thoughts, rather than being driven by external stimuli. A defect in 
this ability can explain a variety of symptoms in schizophrenia. For 
example, the inability to hold a discourse plan in mind and monitor 
speech output may lead to disorganized speech and thought dis- 
order; the inability to maintain a plan for behavioural activities 
could lead to negative symptoms such as avolition or alogia; the in- 
ability to reference a specific external or internal experience against 
associative memories (mediated by cortical and subcortical circuitry 
involving frontal/parietal/temporal regions and the thalamus) could 
lead to an altered consciousness of sensory experience that would 
be expressed as delusions or hallucinations. The model also explains 
the perseverative behaviour observed in studies using the Wisconsin 
Card Sorting Test and is consistent with the compromised blood flow 
to the prefrontal cortex in these patients. Overall, this model sug- 
gests a major role for prefrontal regions and their multiple distrib- 
uted cortical, thalamic, and striatal connections in a fundamental 
cognitive function—representationally guided behaviour—that 
permits organisms to adapt flexibly to a changing environment and 
to achieve temporal and spatial continuity between past experiences 
and present and future actions. 

Alternatively (or complementarily), the chapter author has used 
the clinical presentation of schizophrenia asa point of departure, pos- 
tulating that the symptoms arise from ‘cognitive dysmetria [25, 26]. 
This refers to impaired connectivity between frontal, cerebellar, and 
thalamic regions as a consequence of a neurodevelopmental defect 
or perhaps a series of them. Motor dysmetria has been observed in 
schizophrenia since its original description by Kraepelin, and soft 
signs of poor co-ordination are reported in more contemporary 
studies [11]. 

More injurious, however, is the related ‘cognitive dysmetria, 
which produces ‘poor co-ordination’ of mental activities. The word 
metron literally means ‘measure’—a person with schizophrenia has 
a fundamental deficit in taking measure of time and space and in 
making inferences about interrelationships between him- or her- 
self and others, or between the past, present, and future. He or she 
cannot accurately time input and output, and therefore cannot 


co-ordinate the perception, prioritization, retrieval, and expression 
of experiences and ideas. This model has received extensive support 
from work with magnetic resonance imaging and positron emission 
tomography. (See Chapter 60 for more detail.) 

The common thread in these observations is that schizophrenia 
reflects a disruption in a fundamental cognitive process that affects 
a specific circuitry in the brain [23, 24]. Various research teams may 
use different terminology and somewhat different concepts—meta- 
representations, representationally guided behaviour, informa- 
tion processing/attention, cognitive dysmetria—but they convey a 
common theme. Cognitive dysfunction in schizophrenia is an inef- 
ficient temporal and spatial referencing of information and experi- 
ence as the person attempts to determine boundaries between self 
and non-self and to formulate effective decisions or plans that will 
guide him or her through the small-scale (speaking a sentence) or 
large-scale (finding a job) manoeuvres of daily living. This capacity 
is sometimes referred to as consciousness. 

Investigators also converge on similar conclusions about the 
neuroanatomic substrates of the cognitive dysfunction. All agree 
that it must involve distributed circuits, rather than a single spe- 
cific ‘localization, and all suggest a key role for interrelationships 
among the prefrontal cortex, other interconnected cortical regions, 
and subcortical regions, particularly the thalamus and striatum (see 
Chapter 60). 


Pathophysiology and aetiology 


A consensus now exists among many clinicians and research inves- 
tigators that schizophrenia is best conceptualized as a ‘multiple-hit’ 
illness similar to cancer. Individuals may carry a genetic predispos- 
ition, but this vulnerability is not ‘released’ unless other factors also 
intervene. Although the majority of these factors are considered en- 
vironmental, in the sense that they are not encoded in DNA and 
could potentially produce mutations or influence gene expression, 
the majority are also biological, rather than psychological, and in- 
clude factors such as birth injuries or nutrition [27]. Current studies 
of the neurobiology of schizophrenia examine a multiplicity of 
factors, including genetics, anatomy (primarily through struc- 
tural neuroimaging)), functional circuitry (through functional 
neuroimaging), neuropathology, electrophysiology, neurochem- 
istry and neuropharmacology, and neurodevelopment. 


Genetics 


Evidence for a genetic contribution to schizophrenia is based on 
family studies, twin studies, studies of adoptees, and molecular gen- 
etic techniques. Summaries of individual family studies have shown 
siblings of schizophrenic patients to have about a 10% chance of 
developing schizophrenia, whereas children who have one parent 
with schizophrenia have a 5-6% chance. The risk of family members 
developing schizophrenia increases markedly when two or more 
family members have the illness. The risk of developing schizo- 
phrenia is 17% for persons with one sibling and one parent with 
schizophrenia, and 46% for children of two schizophrenic parents. 
Twin studies have been remarkably consistent in demonstrating 
high concordance rates for identical twins averaging 46%, compared 
to 14% concordance in non-identical twins. Adoption studies show 
that the risk for schizophrenia is greater in the biological relatives of 
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index adoptees who had schizophrenia than in the biological rela- 
tives of mentally healthy control adoptees. 

Studies are now under way to illuminate the genetic underpin- 
nings of schizophrenia at the molecular level (see Chapter 61 for 
more details). Researchers have implicated several different gene re- 
gions, but none has emerged as conclusive as yet. Because of impli- 
cations for solving multiple key problems (diagnostic, prognostic, 
treatment choices), this is one of the central topics for schizophrenia 
research and it is likely to remain so for some time [28]. 


Neuropathology 


A better understanding of neurodevelopment has helped shape 
contemporary post-mortem studies. During the second trimester, 
neurons in the fetal brain must migrate to the appropriate layers 
of the cortex, then connect with other groups of neurons to form 
functional networks. Other developmental processes include ex- 
cessive proliferation of cells and dendrites, subsequently followed 
by pruning and programmed cell death (apoptosis) of subplate 
neurons; surviving cells remain as interstitial neurons (or ‘inter- 
neurons’) of the white matter. The resulting network of neurons 
and cytoplasmic processes is called the neuropil. Research suggests 
that schizophrenia could be related to disturbances in any of these 
phases of brain maturation, ranging from migration to apoptosis. 
Failure of the cells to migrate to their proper position may show up 
as ectopic grey matter or neuronal disarray in specific regions of the 
hippocampus. 

Displacement of neurons and a paucity of neurons in the superfi- 
cial layers in the rostral and intermediate portions of the entorhinal 
cortex of the parahippocampal gyrus have also been reported and 
have been attributed to faulty neuronal migration. More recently, re- 
searchers have observed displacement of interneurons in the frontal 
lobe cortex, including decreased cell density in the superficial white 
matter and increased cell density in the deeper white matter, find- 
ings also thought consistent with an alteration in the migration of 
subplate neurons or in the pattern of programmed cell death [29]. 

Neuropathology has also been used to explore whether abnormal- 
ities can be found in key candidate regions such as the thalamus or 
prefrontal cortex. Decreased cell density in the medial dorsal nu- 
cleus of the thalamus, a crucial nucleus that projects to the prefrontal 
cortex, thalamic abnormalities, and increased cell packing density 
in the prefrontal cortex have all been described. The latter finding is 
consistent with a loss of the surrounding neuropil and consequent 
shrinkage of the interneuronal space. A convergence of findings is 
beginning to emerge from the variable perspectives of structural and 
functional neuroimaging and neuropathology. These perspectives 
are all consistent with neurodevelopmental mechanisms and with 
abnormalities in frontal, temporal, and thalamic regions. 


Neurodevelopmental influences 


Several lines of evidence have supported speculation that schizo- 
phrenia is a neurodevelopmental disorder resulting from brain in- 
jury occurring early in life. For example, schizophrenic patients are 
more likely than non-schizophrenic control subjects to have a his- 
tory of birth injury and perinatal complications, which could result 
in subtle brain injury, setting the stage for the development of schizo- 
phrenia [27]. Seasonality of birth in schizophrenic patients has also 
suggested to some a neurodevelopmental aetiology. Throughout the 
temperate northern latitudes, more people with schizophrenia are 
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born in winter months than at any other time. This suggests that 
some schizophrenic persons could have sustained central nervous 
system damage in the womb from a viral illness. 


Evolutionary aspects 


That schizophrenia persists in the human population despite the fact 
that the majority of people who develop it do not marry or procreate 
is fascinating. This observation should perhaps be coupled with 
another that has been less frequently noted and has not yet been 
systematically investigated—a substantial number of highly cre- 
ative individuals have family members who are in the schizophrenia 
spectrum. The association between manic-depressive illness and 
creativity is well established, but most of this work has focused on 
literary or artistic creativity [30]. 

In the case of schizophrenia, there may be a link with scien- 
tific creativity, although the evidence at present is only anecdotal. 
However, an impressive number of Nobel laureates have an as- 
sociation with schizophrenia. Bertrand Russell had a son and a 
granddaughter who suffered from schizophrenia, as well as an 
aunt and an uncle who probably also suffered from this illness 
when it was simply called ‘insanity. Albert Einstein had a son by 
his first wife who developed schizophrenia. John Nash, a recent 
Nobel laureate in economics, developed schizophrenia himself in 
his early 30s and also had a son who suffers from schizophrenia. 
James Joyce was emotionally aloof and cold and became increas- 
ingly disorganized artistically; his daughter Lucia suffered from 
hebephrenic schizophrenia. Isaac Newton was a solitary, chron- 
ically suspicious, and socially aloof man who had a variety of 
unusual interests and beliefs; he would probably be called schizo- 
typal using modern nomenclature; however, he also had a psych- 
otic break at age 40. 

Coupled with the persistence of schizophrenia throughout his- 
tory despite decreased fertility and procreation, this modest associ- 
ation between schizophrenia and ‘genius’ suggests that, perhaps like 
sickle-cell anemia, a predisposition to schizophrenia may convey a 
biological advantage in some situations. 
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Epidemiological research into schizophrenia aims to answer four es- 
sential questions: 


e What is the ‘true’ frequency of the disorder in various popula- 
tions, and how is it distributed across the various groups within 
populations? 

e Do the incidence, manifestations, and course of schizophrenia 
vary in relation to factors of the physical and social environment? 

e Who is at risk, and what forces determine or influence the risk of 
developing schizophrenia? 

e Can the answers to these questions help explain what causes the 
disorder and how to prevent it? 


Schizophrenia has been studied extensively from an epidemio- 
logical perspective since Kraepelin [1] introduced the concept of 
dementia praecox in 1896 [1-6]. In the first half of the twentieth cen- 
tury, epidemiological research into schizophrenia took two diver- 
gent paths. While European studies tended to focus on population 
distributions and genetic risks, North American researchers inves- 
tigated the social ecology of the disorder. A variety of methods were 
explored and successfully applied by the pioneers of psychiatric epi- 
demiology, and the contours of the epidemiological map of schizo- 
phrenia in Europe and North America were effectively laid down 
between the two World Wars. The early studies were carried out by 
dedicated researchers who often spent months or years collecting 
data ‘door-to-door’ in small communities. Close knowledge of the 
respondents, access to multigenerational records from local parish 
registers, and the co-operation of the community resulted in studies 
that remain landmarks of psychiatric epidemiology (Table 5 

During the last several decades, the scope of epidemiological 
studies of schizophrenia has expanded to include populations in 
Asia, Africa, and South America, about which little had been known 
previously. The World Health Organization (WHO) International 
Pilot Study of Schizophrenia [7] and its successor—the WHO 10- 
country epidemiological study [8]—were the first systematic inves- 
tigations of the comparative incidence, clinical manifestations, and 
course of schizophrenia in both developing and developed coun- 
tries. The WHO programme was an impetus for similar studies in 
India, China, the Caribbean, and Australia. Two major studies of 
psychiatric morbidity in the United States—the Epidemiological 








Catchment Area project [9] and the National Comorbidity Survey 
[10]—generated data on the prevalence of DSM-III/IIIR schizo- 
phrenia and related disorders in representative population samples. 
In the 1980s and 1990s, epidemiological studies increasingly utilized 
existing large databases, such as cumulative case registers or birth 
cohorts, to test hypotheses about risk factors and began to include 
methods of genetic epidemiology. There is a current tendency to- 
wards integrating epidemiological approaches with other types of 
aetiological research in schizophrenia. This predicts an important 
role for epidemiology in the era of genetics and molecular biology 
of mental disorders. 




















Measurement of the prevalence, incidence, and disease expectancy 
of schizophrenia depends critically on the sensitivity of the case- 
finding method (that is, its capacity to identify all affected persons in 
a given population) and the availability of a diagnostic instrument 
or procedure that selects ‘true’ cases (that is, those corresponding to 
an established clinical concept). 
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Case-finding designs fall into three broad groups: case detection in 
clinical populations, door-to-door surveys of population samples or 
whole communities, and birth cohort studies. Each method has its 
advantages and limitations. 

While case-finding through the mental health services provides 
relatively easy access to a substantial proportion of all persons with 
schizophrenia, the cases in treatment may not be fully representative 
of all individuals with the disorder. Bias related to gender, marital 
status, socio-economic factors, culture, or ethnicity is known to af- 
fect the probability of being in treatment at a given time in a given 
setting, and generalizations about schizophrenia from hospital or 
clinic samples are liable to error. Some of the deficiencies of case- 
finding through service contacts are avoided in cumulative national 
or regional psychiatric case registers, which cover large, well-defined 
populations and can be linked to other population databases (for ex- 
ample, birth records). This makes registers efficient research instru- 
ments in low-incidence disorders such as schizophrenia. 


Table 58.1 Historical landmarks in the epidemiology of psychoses 


Cases of psychosis and normal 


Koller (1895) [2] The first case-control study of 


psychoses controls 


Luxenburger (1928) [3] Twin concordance analysis 


Brugger (1931) [4] Census (door-to-door survey) Area in Thuringia 


(population 37,561) 


Klemperer (1933) [5] Birth cohort study 


Essen-Moller et al. 
(1956) [6] 


Census and follow-up surveys 


Surveys involve accounting for every person at risk within a de- 
fined community or a population sample in terms of either being 
or not being a case. Face-to-face interviews (and follow-up) of all 
residents in defined communities has been a feature of some high- 
quality research, especially in Scandinavian countries [6]. However, 
since the size of the populations surveyed in this way is limited, the 
number of detected cases of schizophrenia is usually too small to 
generate stable estimates of epidemiological parameters. Surveys of 
large populations involve two basic designs: a single-phase survey of 
a probability sample drawn from the general population; and a two- 
phase survey where a validated screening test is first applied to the 
entire population and only those scoring as screen-positive proceed 
to a full assessment. In the instance of schizophrenia, logistics dic- 
tates a choice between assessing large numbers less rigorously and 
investigating a smaller sample in greater depth. In the absence of 
a simple and valid screening procedure for schizophrenia, such as 
a biological or psychological test, the advantages of the two-phase 
survey may be offset by poor sensitivity or specificity of the screening 
device, which is usually a questionnaire or checklist. 

The study of birth cohorts at ages when their members have passed 
through the greater part of the period of risk for onset of schizo- 
phrenia is usually done by face-to-face interviewing or by analysing 
available case register data. Well-characterized birth cohorts are 
among the best tools for the study of the incidence of schizophrenia 
and associated risk factors [11]. However, even in settings where 
the population is stable and mortality and morbidity are adequately 
monitored, the size of birth cohorts with prospectively collected 
data may not be sufficient for conclusive epidemiological inferences. 

All this suggests that there is no single ‘gold standard’ of case- 
finding for schizophrenia that could be applied across all possible 
settings, and the assets and liabilities of particular case-finding 
procedures need to be evaluated in the context of each study. This 
makes the detailed reporting of case-finding methods a mandatory 
prerequisite for an ‘evidence-based’ epidemiology of schizophrenia. 


Diagnosis 

Variation in diagnostic concepts and practices always explains a pro- 
portion of the variation in the results of schizophrenia studies, es- 
pecially if they involve different populations or different periods. In 
the late 1960s, the WHO International Pilot Study of Schizophrenia 


[7] examined diagnostic variation in schizophrenia across nine 
countries by comparing the diagnoses made by psychiatrists using 


Monozygotic and dizygotic twin pairs 


Random sample (n = 1000) of all 
births in Munich, 1881-1890 


Rural community (n = 2550 + 1013 
new residents during follow-up) 
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Hospital and clinic 
records 


Genealogical inquiry 


Search of birth registers Diagnosis: ‘definite’ and 


probable’ 


1 


Personal examination of 
‘suspected’ cases and controls 


Hospital records and key 
informants 


Tracing of cohort 
members (44% 
successfully traced) 


Personal examination or key 
informant interview (271 
examined) 








Personal examination and 
re-examination of all residents 


Complete census; tracing 
of persons who had 
migrated out 


a clinical interview and diagnostic classification by a computer al- 
gorithm [12] utilizing the same interview data. The results dem- 
onstrated that in the majority of settings, psychiatrists were using 
comparable diagnostic concepts. The introduction of explicit diag- 
nostic criteria and rules with the consecutive editions of DSM and 
the WHO’s ICD-10 improved further the reliability of diagnosis but 
did not resolve all diagnostic issues, with implications for epidemi- 
ology. While ICD-10 and DSM-IV tended to agree well on the core 
cases of schizophrenia, they agree less well on the classification of 
atypical or milder cases. Such differences may be less important in 
clinical practice, but they present a problem for epidemiological and 
genetic studies. By providing more restrictive criteria for schizo- 
phrenia, both classifications aim to identify clinically similar cases 
and to minimize false-positive diagnoses. This is not an unequivocal 
advantage, since applying such criteria at case-finding may result in 
the rejection of potential cases which fail to meet the full set of cri- 
teria at initial assessment. Therefore, it is desirable to develop less 
restrictive screening versions of the DSM and ICD criteria for epi- 
demiological research. 


Instruments 


The diagnostic instruments used in surveys which involve 
interviewing fall into two categories: fully structured interviews such 
as the Diagnostic Interview Schedule (DIS) [9] and the Composite 
International Diagnostic Interview (CIDI) [13], both written to 
match exactly the diagnostic criteria of DSM-IIIR/IV and ICD-10; 
and semi-structured interview schedules, such as the Present State 
Examination (PSE) [12] and the Schedules for Clinical Assessment 
in Neuropsychiatry (SCAN) [14], which cover a broad range of 
psychopathology. 

The DIS/CIDI type of instrument is reliable and capable of 
generating standard diagnoses of common mental disorders in a 
single-phase survey design. Its clinical validity in schizophrenia is 
less certain because symptoms may not be reported accurately or 
impairment may be underestimated by the respondent. In contrast, 
the PSE/SCAN allows a greater amount of psychopathological data 
to be elicited in a flexible clinical interview format, but its use in 
epidemiological studies presupposes the availability of clinically 
trained interviewers. While SCAN and other similar interviews are 
suitable as second-stage diagnostic instruments, there is still a need 
for a relatively simple and effective screening procedure for case- 
finding of schizophrenia in field surveys. 
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The epidemiological description of schizophrenia draws on ex- 
tensive evidence available today on its frequency, age, and sex dis- 
tribution in relatively large populations or geographical areas. Less 
than complete information is available on variations in its epidemio- 
logical characteristics that may be found in unusual or isolated 
populations. 


Prevalence 


Prevalence provides an estimate of the proportion of cases per 1000 
persons at risk present in a population at a given time or over a de- 
fined period. Point prevalence refers to the ‘active’ (that is, symp- 
tomatic) cases on a given date or within a brief census period. 
Since asymptomatic cases (for example, persons in remission) will 
be missed in a point prevalence survey, it is useful to supplement 
the assessment of the present mental state with an enquiry about 
past episodes of the disorder to obtain a lifetime prevalence index. 
In schizophrenia, which tends to be a chronic course, estimates of 
point and lifetime prevalence will be closer to each other than in 
remitting illnesses. 

An overview of selected prevalence studies of schizophrenia span- 
ning over seven decades is presented in Table 58.2 and in two sys- 
tematic reviews of the literature [15, 16]. The studies differ in many 
aspects of methodology, but in the majority, they feature a high in- 
tensity of case-finding. Several studies are repeat surveys in which 
the original population was reassessed following an interval of 10 
or more years (the resulting consecutive prevalence figures are in- 
dicated by arrows). 

Most studies have produced point prevalence estimates in the 
range of 2.1-7.0 per 1000 population at risk and a lifetime prevalence 
of schizophrenia in the range of 15.0-19.0 per 1000. These figures 
should be considered with caution because of demographic differ- 
ences between populations related to factors such as age-specific 


Table 58.2 Selected prevalence studies of schizophrenia 


mortality and migration. A systematic review of 188 studies in 46 
countries, published between 1965 and 2002 [15], estimated the me- 
dian value for point prevalence at 4.6 per 1000 persons and for a 
lifetime prevalence at 7.2 per 1000. 

Certain populations and groups deviate markedly from the cen- 
tral tendency. A strikingly high prevalence of schizophrenia (2-3 
times the national or regional average) has been found in geograph- 
ically and genetically isolated populations, including small com- 
munities in northern Sweden and Finland [19, 25], while at the 
other extreme, virtual absence of schizophrenia and a high rate of 
depression have been claimed for the Hutterites of South Dakota, 
a Protestant sect whose members live in close-knit endogamous 
communities sheltered from the outside world [27]. Negative social 
selection for schizoid individuals who fail to adjust to the lifestyle 
of the majority and eventually migrate without leaving progeny 
has been suggested (but not definitively proven) as an explanation. 
Results of two surveys in Taiwan [28], separated by 15 years, pointed 
to a falling prevalence of schizophrenia (from 2.1 to 1.4 per 1000) in 
the context of major socio-economic change and an overall increase 
in total mental morbidity in the population. 

The question about the extent of true variation in the preva- 
lence of schizophrenia across populations has no simple answer. 
Methodological differences among studies, related to sampling, 
case-finding, and diagnostic assessment, are likely to account for a 
good deal of the observed variation [29]. 

Notwithstanding such caveats in the interpretation of survey findings, 
the prevalence rates are fairly similar in the majority of studies, though 
certain specific populations clearly deviate from the modal value. Even 
in those instances, however, the magnitude of the deviation is modest, 
compared with the 10- to 30-fold differences in prevalence observed 
in other multifactorial diseases (for example, diabetes, ischaemic heart 
disease, multiple sclerosis) across populations. 


Country Population Prevalence per 1000 
population at risk 


Bajholm and Strömgren Denmark Island of Bornholm 

(1983) [17] (n = 50,000) 

Böök (1953) [18] Sweden Genetic isolate 

Böök et al. (1978) [19] (n = 9000); age 15-50 
Crocetti et al. (1971) [20] Croatia Sample of 9201 households 
Dube and Kumar (1972) [21] India Four areas in Agra 





Census interviews, repeat census 3.9 > 3.3 
Census interviews, repeat census 9.5 > 17.0 
Census of hospital records and 59. 
interviews 

Census based on hospital and 2.6 





clinic records 


n = 29,468) 
Keith et al. (1991) [22] United States Aggregated population across five ECA sites Sample survey; interviews 7.0 (point); 15.0 (lifetime) 
Jeffreys et al. (1997) [23] UK London health district Census; interview of sample 5.1 
n= 112,127) (n= 172) 
Jablensky et al. (2000) [24] Australia Four urban areas Census, screen for psychosis; 47 (point); 
n= 1,084,978) patient interviews 5.8° (12 months) 
Perälä et al. (2007) [25] Finland ational sample (n = 8028) Screen for psychosis, interviews, 10.0 (lifetime) 
register and casenote data 
organ et al. (2014) [26] Australia Catchment areas with total population Sample (n = 1642) of patients with 3.1 (point) 
n= 1,464,921: psychoses in contact with mental 
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Incidence 


While prevalence is a proportion, incidence is a rate (an estimate of 
the annual number of new cases in a defined population per 1000 per- 
sons at risk). Incidence is of greater interest for the study of risk factors 
than prevalence, since it represents the so-called force of morbidity (the 
probability of disease occurrence) in a given population and is closer 
in time to the action of antecedent or precipitating factors. The estima- 
tion of incidence depends critically on the ability to determine reliably 
the point of onset of the disorder. In the case of schizophrenia, the long 
prodromal period and the fuzzy boundary between the premorbid state 
and the onset of psychosis make this particularly difficult. In the ab- 
sence of an objective biomarker of the disease, onset is usually defined 
as the point in time when clinical manifestations become recognizable 
and diagnosable according to specified criteria. The first hospital admis- 
sion, which has been used as a proxy for disease onset in many studies, 
is not a robust indicator because of the variable time lag between the 
earliest appearance of symptoms and the first admission across treat- 
ment facilities and settings. A better approximation is provided by the 
first contact, that is, the point at which any psychiatric, general medical, 
or alternative ‘helping’ agency is accessed by symptomatic individuals 
for the first time. A limitation common to both first-admission and 
first-contact studies is that they produce rates of ‘treated’ incidence and 
miss symptomatic cases that do not present for assessment or treatment. 
This limitation can be overcome by periodically repeated door-to-door 
surveys of the same population or by longitudinal cohort studies. 

Table 58.3 presents the essential features and findings of seven 
historical incidence studies of schizophrenia. Although these studies 
vary in details of their methodology, the variation of the reported 
rates in settings as different as Barbados and Dublin is relatively 
modest, in the range of 0.32 [38] to 0.57 [33] per 1000 population at 
risk. Studies using more stringent criteria, such as DSM-IIIR, DSM- 
IV, ICD-10, or Catego S+ [12], have reported incidence rates 2-3 
times lower than those based on ‘broad’ criteria. A systematic review 
of data from some 160 studies from 33 countries, published between 
1965 and 2001 [30] yielded a median value of 0.15 and mean value 
of 0.24 per 1000, with a 5-fold range of the rates and a tendency for 
more recent studies to report lower rates. Similar results have been 
reported in a systematic review and meta-analyses of schizophrenia 
incidence studies in England [31]. 


Table 58.3 Selected incidence studies of schizophrenia 
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Considering the methodological differences among studies, gen- 
eralizing about the incidence of schizophrenia from pooled data 
may be problematic [29]. To date, the only investigation that had 
applied a uniform design and common research tools to generate 
directly comparable incidence data for different populations was the 
WHO 10-country study [8]. Incidence counts in the WHO study 
were based on first-in-lifetime contacts with any ‘helping agency’ 
within defined areas (including traditional healers in the developing 
countries) which were monitored for new cases over a 2-year period. 
Potential cases and key informants were interviewed by clinicians 
using standardized instruments, and the timing of onset was ascer- 
tained for the majority of the patients. In 86% of the 1022 patients, 
the onset of diagnostic symptoms of schizophrenia was within the 
year preceding the first contact, and therefore, the first-contact in- 
cidence rate was adopted as a reasonable approximation to the ‘true’ 
onset rate. Two definitions of ‘caseness’ were used to determine inci- 
dence: a ‘broad’ clinical definition comprising ICD-9 schizophrenia 
and paranoid psychoses; and a more restrictive definition of PSE/ 
Catego S + ‘nuclear’ schizophrenia manifesting with Schneiderian 
first-rank symptoms. The rates for eight of the catchment areas are 
shown in Table 58.4. 

While the differences between the rates for ‘broadly’ defined 
schizophrenia (0.16-0.42 per 1000) were significant (P <0.001), 
those for ‘nuclear’ schizophrenia were not, suggesting that the 
frequency of this diagnostic subgroup varies less across different 
populations. Replications of the design of the WHO 10-country 
study, including its research procedures and instruments, have 
been carried out, with similar results in India [36] and the 
Caribbean [38]. 


Disease expectancy (morbid risk) 


This is the probability that an individual born into a defined popu- 
lation will develop the disease if he or she survives the period of 
risk for that disease. In the instance of schizophrenia, the period of 
risk is usually defined as 15-54 years. If age- and sex-specific inci- 
dence rates are known, disease expectancy can be estimated directly 
by a summation of the age-specific rates within the period of risk. 
Alternatively, disease expectancy can be estimated indirectly from 
prevalence data (‘Table 58.5). 











Odegaard (1946) [32] Cumulative national Total population of Norway All first admissions 1 926-1935 0.24 (hospital diagnoses) 
case register (n = 14,231); statistical analysis 
Walsh (1969) [33] reland City of Dublin First admissions 0.57 (males), 0.46 (females) 
n = 720,000) 
Murphy and Raman (1971) [34] auritius Total population First admissions 0.24 (Africans); 0.14 Indian 
n = 257,000) Hindus); 0.09 (Indian Moslems) 
Helgason (1977) [35] celand Total population First admissions (case register) 0.27 
n= 221,799) 
Rajkumar et al. (1993) [36] ndia Area in Chennai (Madras) Key informants; door-to-door 0.41 
n= 43,097) survey 
Brewin et al. (1997) [37] UK City of Nottingham Two cohorts of first contacts 0.25 > 0.29 
(1978-1980 and 1992-1994) 0.14 > 0.09 
Mahy et al. (1999) [38] Barbados Total population First contacts, interviews 0.32 (ICD-9) 





(n = 262,000) 
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Table 58.4 WHO ten-country study of schizophrenia 





(| ‘Broad’ definition (ICD-9) ‘Narrow’ definition (CATEGO S+) 


Denmark Aarhus 0.18 0.13 0.16 0.09 0.05 0.07 
India Chandigarh (rural area) oB 0.48 0.42 0.13 0.09 0.11 
India Chandigarh (urban area) 0.34 0.35 0.35 0.08 0.11 0.09 
Ireland Dublin 029 0.21 0.22 0.10 0.08 0.09 
Japan Nagasaki 0.23 0.18 0.20 0.11 0.09 0.10 
Russia Moscow 0.25 0.31 0.28 0.03 0.03 0.02 
UK Nottingham 0.28 0.15 0.22 0.17 0.12 0.14 
United States Honolulu 0.18 0.14 0.16 0.10 0.08 0.09 








Reproduced from Psychol Med Monogr Suppl., 20, Jablensky A, Sartorius N, Ernberg G, et al., Schizophrenia: manifestations and course in different cultures. A World Health 
Organization ten-country study, pp. 1-97, Copy-right (1992), with permission from Cambridge University Press. 


The estimates of disease expectancy produced by a number 
of studies are fairly consistent across populations and over time. 
Excluding outliers, such as the northern Swedish isolate [18, 19], 
they vary about 5-fold; in the WHO study [8], they range from 0.59% 
(Aarhus) to 1.8% (Chandigarh, rural area) for ICD-9 schizophrenia 
and from 0.26% (Honolulu) to 0.54% (Nottingham) for Catego S + 
‘nuclear’ schizophrenia. The frequently cited modal estimate of 
lifetime disease expectancy for broadly defined schizophrenia at 
around 1% seems to be consistent with the evidence. 


Associations with age and sex 


Schizophrenia may have its onset at any age—in childhood as well 
as past middle age—although the vast majority of onsets fall within 
the 15-54 years of age interval. Onsets in men peak steeply in the 
age group of 20-24 years; thereafter, the rate of inception remains 
more or less constant at a lower level. In women, a less prominent 
peak in the age group of 20-24 years is followed by another increase 
in incidence in age groups older than 35. While the age-specific inci- 
dence up to the mid-thirties is significantly higher in men, the male- 
to-female ratio becomes inverted with age, reaching 1:1.9 for onsets 
after the age of 40 and 1:4, or even 1:6, for onsets after the age of 60. 
There seems to be no real ‘point of rarity’ between the symptom- 
atology of late-onset schizophrenia and schizophrenia of early onset. 

The sex differences in mean age at onset are unlikely to be an in- 
variant biological characteristic of schizophrenia. For example, 


Table 58.5 Selected course and outcome studies of schizophrenia 


within families carrying high genetic risk (two or more affected 
members), no significant differences in age at onset have been found 
between male and female siblings with schizophrenia. In some 
populations (for example, India and China), the male predominance 
in the frequency of onsets in the younger age groups is attenuated or 
even inverted [39, 40]. 

The question of whether the total lifetime risks for men and 
women are about the same, or different, has not been answered de- 
finitively. In the WHO 10-country study, the cumulated risks for 
males and females up to the age of 54 were found to be approxi- 
mately equal [8]. Scandinavian studies which followed up popula- 
tion cohorts into very old age (over 80) reported a higher cumulated 
lifetime risk in women than in men [41]. 

Male-female differences have been described in relation to the 
premorbid history (better premorbid functioning in women), the 
occurrence of brain abnormalities (more frequent in men), course 
(a higher percentage of remitting illness episodes and shorter hos- 
pital stay in women), and outcome (higher survival rate in the com- 
munity, less disability in women). However, there is no unequivocal 
evidence of consistent sex differences in the symptom profiles of 
schizophrenia, including the frequency of positive and negative 
symptoms. Generally, the sex differences described in schizophrenia 
are more likely to result from normal sexual dimorphism in brain 
development, as well as from gender-related social roles, rather than 
from sex-specific aetiological factors. 











Bleuler M (1978) [107] Switzerland 20% complete remission; 33% ‘mild defect’ 

Tsuang et al. (1979) [108] United States 186 35 46% recovered or improved significantly 

Ciompi (1980) [109] Switzerland 289 Sm 20% recovered, 43% ‘definitely improved’ 

Huber et al. (1980) [110] Germany 502 22 26% recovered, 31% remission, ‘mild defect’ 

Johnstone et al. (1990) [73] UK 530 13 14% ‘excellent’; 18.5% ‘very good social adjustment’ 

WHO (2007) [74] Developing countries 467 15-26 62.7% complete remissions; 55.5% never hospitalized during 
follow-up; 42.9% unimpaired social functioning during 75% 
of follow-up 

WHO (2007) [74] Developed countries 603 15-26 36.8% complete remissions; 8.1% never hospitalized during 








follow-up; 31.6% unimpaired social functioning during 
follow-up 


Fertility, mortality, and comorbidity 


Earlier studies reported low fertility in both men and women diag- 
nosed with schizophrenia. The mean number of children fathered by 
men with schizophrenia in Sweden was 0.9, and the average number 
of live births over the entire reproductive period of women treated 
for schizophrenia in Norway between 1936 and 1975 was 1.8, com- 
pared with 2.2 for the general female population [42]. Yet this phe- 
nomenon is neither universal nor consistent over time. In the WHO 
10-country study [8], the fertility of women with schizophrenia in 
India did not differ from that of women in the general population 
within the same age groups and geographic areas. Although men 
with schizophrenia continue to be reproductively disadvantaged, 
the fertility of women with schizophrenia has increased over the last 
decades, and this trend is likely to be sustained as a result of deinsti- 
tutionalization and the greater number of people with mental dis- 
orders being able to live in the community. 

Excess mortality associated with schizophrenia has been well 
documented by studies on large cohorts. National case register data 
for Norway, 1926-1941 and 1950-1974, indicated that while the 
total mortality of psychiatric patients was decreasing, the relative 
mortality of patients with schizophrenia remained unchanged at a 
level higher than twice that of the general population [32]. A meta- 
analysis of 18 studies [43] estimated a crude mortality rate of 189 
deaths per 10,000 population per year and a 10-year survival rate of 
81%. Mortality among males was significantly higher than among 
females, and the difference was primarily due to an excess in suicides 
and accidents. Unnatural causes apart, the leading causes of death 
among schizophrenia patients are similar to those in the general 
population, with the exception of significantly lower-than-expected 
cancer morbidity and mortality, especially for tobacco-related ma- 
lignancies in males with schizophrenia [44]. This puzzling phenom- 
enon has been replicated by several case register and record linkage 
studies [45, 46] and does not appear to be an artefact. Its causes re- 
main unknown, though protective effects of both genes and anti- 
psychotic pharmacological agents have been proposed. 

The single most common cause of death among schizophrenia pa- 
tients at present is suicide (aggregated standardized mortality ratios 
of 9.6 for males and 6.8 for females), which accounts for 28% of the 
excess mortality in schizophrenia [47]. The suicide rate in schizo- 
phrenia patients is at least equal to, or may indeed be higher than, 
the suicide rate in major depression. Several risk factors, relatively 
specific to schizophrenia, have been suggested: being young and 
male; experiencing multiple relapses and remissions; comorbid sub- 
stance use; awareness of the deteriorating course of the condition; 
and loss of faith in treatment. Data from successive patient cohorts 
in Denmark [48], Scotland [49], and Western Australia [50] sug- 
gested an alarming trend of increasing mortality in first-admission 
patients with schizophrenia. Particularly striking was the standard- 
ized mortality ratio of 16.4 for males with schizophrenia in the first 
year after diagnosis. In the Australian study, suicide risk was highest 
in the first 7 days after discharge from inpatient care. These trends 
seem to parallel the significant reductions in the number of psychi- 
atric beds and the concomitant pressure for early patient discharge. 
Whether the increases in suicide mortality are associated with the 
shift in the management of schizophrenia from hospital to commu- 
nity care remains to be established. 
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Comorbidity 


There is significant comorbidity in schizophrenia, comprising: (1) 
common medical problems and diseases that affect schizophrenia 
patients more frequently than attributable to chance; and (2) certain 
rare conditions or abnormalities which tend to co-occur with the 
disorder. 

Physical disease is common but tends to remain often undetected 
and underdiagnosed. Between 46% and 80% of inpatients with 
schizophrenia, and between 20% and 43% of outpatients, have been 
found in different surveys to have concurrent medical illnesses [51]. 
Persons with schizophrenia, and especially those who are home- 
less or injection drug users, are at increased risk for potentially life- 
threatening communicable disease, such as HIV/AIDS, hepatitis 
C, and tuberculosis [52]. Among the chronic non-communicable 
diseases, patients with schizophrenia have significantly higher- 
than-expected rates of diabetes, arteriosclerosis, and ischaemic 
heart disease, which are increasingly contributing to their high mor- 
tality rate [54-56]. Obesity and concomitant metabolic syndrome 
involving insulin resistance are at present common problems in 
schizophrenia patients. Although a high incidence of diabetes in 
schizophrenia patients had been described long before the intro- 
duction of neuroleptic treatment, a contributing role for some of the 
drugs with antagonist properties at histamine and serotonin recep- 
tors has not been ruled out. 

Some rare genetic or idiopathic disorders, such as metachromatic 
leukodystrophy, acute intermittent porphyria, and coeliac disease, 
as well as dysmorphic features such as high-steepled palate, mal- 
formed ears, and other minor physical anomalies, have also been re- 
ported to co-occur with schizophrenia [57, 58]. On the other hand, 
several studies have found a lower-than-expected rate of rheuma- 
toid arthritis in schizophrenia patients [59]. A recent joint study by 
the Psychiatric Genomic Consortium and the Rheumatoid Arthritis 
Consortium analysed a large sample of cases and controls for the 
two disorders and concluded that the negative relationship between 
them reflects, ‘at least in part’, genetic factors [60]. 


Substance abuse 


Substance abuse is at present by far the most common associated 
health problem among patients with schizophrenia [61] and may in- 
volve any drug of abuse or a polydrug combination. It seems, how- 
ever, that the addictive use of cannabis, stimulants, and nicotine is 
disproportionately high among schizophrenia patients and may be 
linked to the underlying neurobiology of the disorder. In a nation- 
wide sample of patients with psychotic disorders in Australia [24], a 
lifetime diagnosis of comorbid drug abuse or dependence was made 
in 36.3% of males and 15.7% of females with schizophrenia (com- 
pared to 3.1% and 1.3%, respectively, in the general population). In 
addition to poor prognosis of schizophrenia in patients with heavy 
cannabis use, a systematic review of published data on cannabis 
exposure and the onset of schizophrenia [62] concluded that early 
use increased the risk of psychosis in a dose-related manner, espe- 
cially in persons at high genetic risk of schizophrenia [63]. Similarly, 
stimulants tend to exacerbate acute psychotic symptoms in over 50% 
of schizophrenia patients [64]. The prevalence of cigarette smoking 
among schizophrenia patients is, on average, 2-3 times higher than 
in the general population [24], but the evidence regarding any effects 
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of nicotine use specifically on the onset and course of schizophrenia 
is equivocal. 


Geographical and cultural variation 


To date, no population or culture has been identified in which schizo- 
phrenic illnesses do not occur. There is also no strong evidence that 
the incidence of schizophrenia either is uniform or varies widely 
across populations, provided that the populations being compared 
are large enough to minimize the effects of small-area variation. The 
evidence that specific psychosocial or cultural factors play an aetio- 
logical role in schizophrenia is also inconsistent [65-67]. However, 
there are well-replicated findings of variations in the course and out- 
come of schizophrenia across populations and cultures that involve, 
above all, a higher rate of symptomatic recovery and a lower rate 
of social deterioration in traditional rural communities. Data sup- 
porting this conclusion were provided by the WHO studies [7, 8], 
which found a higher proportion of recovering or improving pa- 
tients in developing countries such as India and Nigeria than in de- 
veloped countries. Sampling bias (for example, a higher percentage 
of acute-onset schizophreniform illnesses of good prognosis among 
Third World patients) was nota likely explanation. A better outcome 
in developing countries was found in patients with various modes of 
onset, and the initial symptoms of the disorder did not distinguish 
good-outcome from poor-outcome cases. What causes such differ- 
ences in the prognosis of schizophrenia remains largely unknown. 
The follow-up in the WHO studies demonstrated that the outcome 
of paranoid psychoses and affective disorders was also better in 
developing countries. Such a general effect on the outcome of psy- 
chiatric disorders may result from psychosocial factors such as avail- 
ability of social support networks, non-stigmatizing beliefs about 
mental illness, and positive expectations during the early stages of 
psychotic illness, as well as from unknown genetic or ecological 
(including nutritional) factors influencing brain development. 


Variations in the course and outcome 





of schizophrenia 


The course of schizophrenia is as variable as its symptoms. Since the 
great majority of schizophrenic patients are today receiving pharma- 
cological treatment, current and recent studies may not reflect the 
‘natural’ course and outcome of the disorder. 


The ‘natural history’ of schizophrenia before the 
neuroleptic era 


Studies in urban communities in Scotland [68] and India [69] and 
a study in a rural community in China [70] estimated the propor- 
tions of never-hospitalized schizophrenic patients at 6.7%, 28.7%, 
and 30.6%, respectively. About half of the Scottish patients had 
been prescribed neuroleptics by their general practitioners, while 
the Indian and Chinese patients had been virtually untreated. In all 
three settings, the outcomes of these samples were heterogenous, 
but they did not differ much from the outcomes in treated patients. 
In a historical study of 70 Swedish patients with first admissions in 
1925, lifetime records were retrieved and re-diagnosed in accord- 
ance with DSM-III criteria [71]. The outcome was rated as ‘good’ 
in 33% (but no patient was considered as completely recovered), 
as ‘profoundly deteriorated’ in 43%, and as intermediate in 24%. 


A long-term perspective on the course of schizophrenia over suc- 
cessive generations is provided by a meta-analysis of 320 outcome 
studies on schizophrenia or dementia praecox published between 
1895 and 1992, including a total of 51,800 subjects [72]. Overall, 
about 40% of the patients were reported as improved after an average 
length of follow-up of 5.6 years. There was a significant increase in 
improvement during the period 1956-1985, compared to 1895- 
1955, clearly related to the introduction of neuroleptic treatment, 
but a secular trend towards better outcomes with every successive 
decade had been present for much longer. Coupled with the virtual 
disappearance of the most ‘catastrophic’ forms of schizophrenia (for 
example, ‘lethal catatonia’), these observations suggest that a tran- 
sition to a less deteriorating course of the disorder had occurred 
prior to modern pharmacological treatment. Among the factors 
explaining this shift, one should consider improvements in general 
care and progressive changes in attitudes and hospital regime which 
occurred in a number of institutions on both sides of the Atlantic 
in the 1930s and 1940s, as well as heightened expectations that psy- 
chosocial measures, such as psychotherapy or rehabilitation, could 
result in a cure in some cases. 


Patterns and stages in the course of schizophrenia 


The marked heterogeneity in the course of schizophrenia can be 
reduced to a limited number of patterns into which cases tend to 
cluster over time. In earlier long-term follow-up studies, 8-12 dif- 
ferent categories of course had been described. These classifications 
were derived from empirical observation, rather than statistical 
modelling. They conflated into single categories: the mode of onset, 
longitudinal aspects such as frequency and duration of psychotic 
episodes, remissions, and end states. Treating these various aspects 
of the longitudinal profile of the illness as independent dimensions 
is today recommended [73]. However, the complexity of statistical 
modelling of the course of schizophrenia is such that the develop- 
ment ofa classification of course that would be both useful in clinical 
practice and rigorous in a statistical sense may not be easy to achieve. 
Therefore, a heuristic compromise between these two requirements 
should, as a minimum, define operationally and assess separately: (1) 
the number and duration of discrete episodes of illness; (2) the pre- 
dominant clinical features of each episode (for example, psychotic 
or affective); and (3) the number and length of remissions and their 
quality (presence/absence of residual negative or deficit symptoms 
and signs). By combining these variables, several patterns of course 
have been derived that have found good empirical support in inter- 
national follow-up studies: (1) single psychotic episode followed by 
complete remission; (2) single psychotic episode followed by incom- 
plete remission; (3) two or more psychotic episodes, with complete 
remissions between episodes; (4) two or more psychotic episodes, 
with incomplete remissions between episodes; and (5) continuous 
(unremitting) psychotic illness. With some modifications, these lon- 
gitudinal patterns have been incorporated into research tools such as 
the SCAN [14]. 


WHO studies on course and outcome of schizophrenia 


A transcultural investigation co-ordinated by the WHO [74]—the 
International Study of Schizophrenia (ISoS)—which involved 18 re- 
search centres in 14 countries, achieved the tracing of 75% of cases 
that had been assessed in the earlier WHO studies and also included 
additional patient cohorts from mainland China, Hong Kong, and 


India. Follow-up data were collected on a total of 1633 cases, and 890 
patients were re-interviewed at either 15- or 25-year follow-up since 
their first assessment. The main results were as follows. Firstly, there 
was striking heterogeneity in the course of schizophrenia. Patients 
with similar baseline clinical characteristics developed a spectrum 
of outcomes, ranging from clinical and social recovery after a single 
psychotic episode to chronic unremitting psychosis and severe im- 
pairment. However, the overall rate of recovery was 48% at the 15- 
year follow-up and as high as 54% at the 25-year follow-up. Secondly, 
the frequency of both relapses and remissions increased over time. 
Regardless of this, the cumulative proportion of time during which 
patients had psychotic symptoms tended to remain stable or de- 
crease. At 15- and 25-year follow-up, 42% and 41%, respectively, had 
been free of psychotic symptoms for the past 2 years. Thirdly, while 
the percentage of patients with continuous deteriorating illness was 
similar across the sites of the WHO studies, the outcome was gen- 
erally better in the developing countries. This could not be attrib- 
uted to any particular clinical subtype of the disorder, for example 
to cases of acute onset, since it applied equally to cases of slow, in- 
sidious onset. The main outcome difference across the study areas 
was in the average length of symptom-free remissions. No single 
factor accounting for this difference could be identified, but better 
integration of the mentally ill person in the domestic economy of 
traditional rural communities could be part of the explanation. 
It should be noted, however, that increasing social and economic 
stresses experienced by both urban and rural communities in many 
developing countries may have eroded the traditional support sys- 
tems, resulting in worse outcomes, as suggested by several recent 
studies. Fourthly, whether the better outcome of schizophrenia in 
developing countries is ‘transportable’ following migration to other 
settings remains unclear. Data on immigrants treated for first epi- 
sodes of schizophrenia in the UK suggest that while Asian patients 
have lower relapse and readmission rates than British-born whites, 
Afro-Caribbean immigrants show a higher rate. Marked social and 
family structure differences between the Asian and Afro-Caribbean 
immigrant communities suggest that the likelihood ofa more benign 
course may depend on the degree to which the immigrant group has 
retained its traditional values and intra-group cohesion [75]. 


Factors maintaining the incidence of 





schizophrenia in populations 


Coupled with the evidence that the lifetime risk of the disorder 
(about 1%) is similar across populations, the question about factors 
that sustain the incidence of schizophrenia despite reduced repro- 
ductive fitness remains open. Schizophrenia is a complex polygenic 
disorder, with incomplete or variable penetrance of the genotype 
and widespread locus and allelic heterogeneity. The polygenic model 
implies that loss of risk alleles resulting from the lower reproductive 
fitness of affected individuals would have a negligible effect on the 
overall gene pool in the population. Previous conjectures that de 
novo germ-line mutations inherited from an ageing father may be 
responsible for a substantial proportion of incident cases of schizo- 
phrenia [76] is difficult to reconcile with current knowledge that 
mutation rates for most human genes are within the range of 10%- 
10° per generation, that is, their contribution to the maintenance of 
schizophrenia in the population would be insignificant. Considering 
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that both multiple genes and multiple exogenous factors are likely 
to be involved in the causation of schizophrenia, neither increased 
fertility in asymptomatic carriers of the risk genes nor paternal in- 
heritance of germ-line mutations appear to be either necessary or 
sufficient for the persistence of the disorder. 


Genetic risk: necessary and sufficient? 


Family aggregation of schizophrenia is at present the only 
epidemiologically well-established risk factor for the disorder, with 
a relative risk for first-degree relatives of persons with schizophrenia 
in the range of 6-17% [78]. Allowing for diagnostic variation, the 
risk estimates generated by different studies are similar and suggest 
a general pattern of descending risk as the proportions of shared 
genes between any two individuals decrease. Although heritability 
of schizophrenia (commonly estimated at about 80%) provides the 
basis for the search of specific genes, gene networks, and regulatory 
polymorphisms involved in schizophrenia causation, the extent to 
which genetic vulnerability alone is necessary and sufficient to pro- 
duce the disorder remains unclear. While an environmental con- 
tribution to the aetiology of schizophrenia is highly plausible, the 
evidence in support of it is inferential, typically proceeding from the 
early observation that the concordance for schizophrenia in mono- 
zygotic twins (sharing 100% of their genes) is only about 50%. The 
majority of investigators now agree that genes and environments 
should be studied jointly, and three models of conjunction have been 
proposed [79]: (1) the effects of predisposing genes and environ- 
mental factors are additive and increase the risk of disease in a linear 
fashion; (2) genes modulate the sensitivity of the brain to environ- 
mental insults; and (3) by fostering certain personality traits and as- 
sociated behaviour, genes influence the likelihood of an individual’s 
exposure to stressful environments. 


Environmental insults during gestation and 
fetal development 


A winter-spring excess of schizophrenic births was first described 
in 1929 [80] and since then reported by numerous studies, mostly 
in the northern hemisphere (southern hemisphere data are less con- 
sistent). Though some of these studies did not definitively prove or 
rule out a seasonal effect, winter-spring births were associated with 
a mild, but significant, increase in the relative risk (RR) for schizo- 
phrenia (RR = 1.11; CI 1.06-1.18) in a large population cohort 
from Denmark [81]. Thus, birth seasonality appears to be a robust 
finding, though few biologically testable hypotheses have been ad- 
vanced to explain it. One of them is the seasonally increased risk of 
intrauterine exposure to viral infection in utero. Exposure to influ- 
enza has been implicated as a risk factor since a report that a signifi- 
cant proportion of adult schizophrenia in Helsinki was associated 
with presumed second-trimester in utero exposure to the 1957 A2 
influenza epidemic [82]. Numerous studies, attempting to replicate 
the link between maternal influenza and schizophrenia, have since 
reached conflicting results, with negative findings reported from 
an increasing number of studies based on large population samples 
[83], as well as studies including data on schizophrenia risk in the 
offspring of women with prospectively recorded influenza infec- 
tion during pregnancy [84]. However, positive associations between 
schizophrenia in the offspring and maternal infection during preg- 
nancy has been reported for rubella [85] and toxoplasmosis [86], 
and the issue of prenatal infection contributing to schizophrenia 
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risk merits further study. Maternal obstetric complications, ranging 
from placental abnormalities in the first trimester of pregnancy to 
diabetes, pre-eclampsia, perinatal hypoxia, and low birthweight, 
are widely regarded to be risk factors in schizophrenia. Population- 
based studies [87] using prospectively recorded obstetric data tend 
to report conflicting or inconclusive results, with generally small ef- 
fect sizes (odds ratio of <2) for any positive findings [88]. However, 
several birth cohort studies with long-term follow-up have found 
significantly increased risk of adult schizophrenia in individuals 
who had survived severe, mainly hypoxic perinatal brain damage 
[88]. Birthweight (adjusted for gestation) is another factor that may 
have a complex relationship with schizophrenia risk. A large cohort 
study in Sweden [89] found significant hazard ratios of 7.03 for males 
of low birthweight (<2500 g) and 3.37 for those of high birthweight 
(>4000 g). It remains unclear, however, if severe obstetric complica- 
tions, such as perinatal hypoxia or low birthweight, are capable of 
raising substantially the risk of schizophrenia in an adult in the ab- 
sence of increased genetic risk. Maternal schizophrenia is associated 
with a higher rate of pregnancy complications, including low birth- 
weight [90], but it is not known if the effects of genetic liability and 
obstetric complications on schizophrenia risk in the offspring are 
additive or interactive. It is also possible that genetic predisposition 
sensitizes the developing brain to lesions resulting from randomly 
occurring, less severe obstetric complications. Such gaps in know- 
ledge or inconsistencies among research findings point to the clarifi- 
cation of these issues as a priority for epidemiological research. 


Early developmental antecedents of schizophrenia 


Children at high genetic risk for schizophrenia (that is, having parents 
or other first-degree relatives with the disorder) tend to show early 
signs of aberrant neurodevelopment, including ventricular enlarge- 
ment on computerized tomography [91] and decreased activation in 
the prefrontal and parietal regions of the heteromodal association 
cortex on functional magnetic resonance imaging [92]. The results 
of such imaging studies are limited by small sample size and may 
not be generalizable. However, population-based or cohort studies, 
such as the National Child Development Study in the UK, have dem- 
onstrated a higher incidence of abnormal motor and speech devel- 
opment before 2 years of age and of soft neurological signs (poor 
motor control, co-ordination, and balance), non-right-handedness, 
and speech defects between the ages of 2-15 [93]. Cognitive deficits 
involving verbal memory, sustained attention, and executive func- 
tions, as well as abnormalities in event-related brain potentials and 
oculomotor control [94-96], are common in patients with schizo- 
phrenia and antedate the onset of clinical symptoms. They also 
occur in a proportion of their clinically normal biological relatives 
but are rare in control subjects drawn from the general population. 
Their specificity to schizophrenia needs to be investigated in larger 
population samples. Should such endophenotypes be validated as 
biological markers of schizophrenia, the power of risk prediction 
at the level of the individual is likely to increase substantially. In a 
Swedish cohort study [97] involving a 15-year follow-up of 109,643 
men conscripted into the army at the age of 18-20, the individuals 
who subsequently developed schizophrenia were compared with the 
rest of the cohort on the performance of IQ-related tests and tasks 
at the time of conscription. Controlling for potential confounders, 
the risk of schizophrenia was found to increase linearly with the 


decrement of IQ. The effect was mainly attributable to poor per- 
formance on verbal tasks and tests of reasoning. 


Premorbid social impairment 


Individuals who develop schizophrenia as adults are more likely 
to manifest difficulties in social interaction during childhood and 
adolescence than individuals who do not develop schizophrenia. 
Population-based evidence of early socialization difficulties (school 
problems, social anxiety, and preference for solitary play) in children 
who develop schizophrenia as adults is provided by the prospective 
study of a national birth cohort in the UK [93]. It should be noted, 
however, that the early behavioural traits that tend to be associated 
with schizophrenia in adult life have low specificity and their pre- 
dictive value is limited. 


Early rearing environment 


Support for an effect of the early rearing family environment on the 
risk of developing schizophrenia is provided by a study of a Finnish 
sample of adopted children born to mothers with schizophrenia (a 
high-risk group) and a control sample of adoptees at no increased 
genetic risk [98]. Though the rates of adult psychosis or severe per- 
sonality disorder were significantly higher in the high-risk group, 
compared with the control group, the difference was entirely attrib- 
utable to a subset of the high-risk children who grew up in dysfunc- 
tional or otherwise disturbed adoptive families—a result consistent 
with the gene-environment model of genetic influence on a person's 
sensitivity to psychosocial adversity. 


The urban environment 


Earlier hypotheses that urban environments increase the risk of 
psychosis, either by contributing to causation (the breeder effect) 
or by attracting vulnerable individuals (the drift effect), have been 
more recently revived in the light of epidemiological findings sug- 
gesting that urban birth is associated with a moderate, but statistic- 
ally significant, increase in the incidence of schizophrenia, affective 
psychoses, and other non-affective psychoses [99]. It remains un- 
clear whether the effect is linked to a factor operating pre- or peri- 
natally, or a factor influencing postnatal development. 


Socio-economic disadvantage 


Since the 1930s, numerous studies in North America and Europe 
have consistently found that the economically disadvantaged social 
groups contribute disproportionately to the first-admission rate 
for schizophrenia. Two explanatory hypotheses of social causation 
(‘breeder’) and of social selection (‘drift’) were originally proposed 
[100]. According to the social causation theory, greater socio- 
economic adversity, characteristic of lower-class living conditions, 
could precipitate psychosis in genetically vulnerable individuals 
who have a restricted capacity to cope with complex or stressful 
situations. However, another study [101] found that the social class 
distribution of fathers of schizophrenic patients did not deviate 
from that of the general population, and that the excess of low socio- 
economic status among schizophrenic patients was mainly attrib- 
utable to individuals who had drifted down the occupational and 
social scale prior to the onset of psychosis. As a result, aetiological 
research in schizophrenia in recent decades has tended to ignore 
such ‘macrosocial’ variables. However, the possibility remains that 


social stratification, socio-economic status, and acculturation stress 
are contributing factors in the causation of schizophrenia. 


Migrants and ethnic minorities 


An exceptionally high incidence rate of schizophrenia (about 6.0 
per 1000) has been reported to be found in the African—Caribbean 
population in the UK [102, 103]. The excess morbidity is not re- 
stricted to recent immigrants and is higher in the British-born 
second generation of migrants. Similar findings of nearly 4-fold 
excess over the general population rate have been reported for the 
Dutch Antillean and Surinamese immigrants in Holland [104]. The 
causes of this phenomenon remain obscure. Incidence studies in the 
Caribbean do not indicate any excess morbidity in the indigenous 
populations from which migrants are recruited. Explanations in 
terms of biological risk factors have found little support. A finding 
in need of replication is the significant increase of schizophrenia 
among the siblings of second-generation African—Caribbean 
schizophrenia patients, compared with the incidence of schizo- 
phrenia in the siblings of white patients [105]. Such ‘horizontal’ 
increase in the morbid risk suggests that an environmental factor 
may be modifying the penetrance of the genetic predisposition to 
schizophrenia carried by a proportion of the African—Caribbean 
population. Psychosocial hypotheses involving acculturation stress, 
demoralization due to racial discrimination, and blocked oppor- 
tunities for upward social mobility have been suggested but not yet 
definitively tested. 


Can schizophrenia be prevented? 


Increasing investment in early diagnosis and treatment of schizo- 
phrenia raises important questions of whether people likely to de- 
velop schizophrenia can be reliably recognized at the prodromal 
stage, prior to the onset of symptoms, or even earlier, with a view 
to pharmacological, cognitive, or social interventions aiming to 
primary prevention of the disorder [111]. Early diagnosis and 
treatment of symptomatic cases can have an effect on the short- 
or medium-term outcome, but the population-wide detection of 
people at risk is problematic. Screening young age groups in the 
population for predictors, such as family history of psychosis, ob- 
stetric complications, or abnormal eye tracking, is likely to end up 
in multiple false-positive and false-negative results and a generally 
low positive predictive value. Problems of reliability of measure- 
ment apart, population-based screening will pose huge practical 
and ethical problems of having to treat a large number of individ- 
uals who do not have the disorder and missing many others who 
eventually will develop the disorder. While pre-symptomatic de- 
tection and preventative intervention in schizophrenia do not, at 
present, appear to be feasible, secondary prevention, aiming at op- 
timizing the treatment of early symptomatic cases and supporting 
their social, educational, or occupational functioning, has the po- 
tential for improving the prognosis and outcome of the disorder. 


Summary and conclusions 


After many decades of epidemiological research, essential questions 
about the nature and causes of schizophrenia still await answers 
[106]. However, two major conclusions stand out. 


CHAPTER 58 Epidemiology and course of schizophrenia 


e Schizophrenia is characterized by phenotypic variability and likely 
genetic heterogeneity. While the clinical concept of schizophrenia 
as a broad syndrome with some internal cohesion and character- 
istic patterns of course over time is well supported by epidemio- 
logical evidence, its dissection into modular subtypes with specific 
neurocognitive and neurophysiological underpinnings is begin- 
ning to be perceived as a promising approach in schizophrenia 
genetics. The study of endophenotypes cutting across the conven- 
tional diagnostic boundaries may reveal unexpected patterns of 
associations with symptoms, personality traits, or behaviours, as 
well as genetic polymorphisms, providing epidemiology with rich 
material for future hypothesis testing at the population level. 


No single environmental risk factor of major effect on the inci- 
dence of schizophrenia has yet been established. Further studies 
using large samples are required to evaluate potential risk factors, 
antecedents, and predictors for which the present evidence is in- 
conclusive. Assuming that multiple environmental risk factors of 
small to moderate effect will eventually be identified, the results 
will complement those of genetic research. 
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The main focus of this chapter is the progress that has occurred 
over the past 10 years in identifying molecular genetic risk factors 
for schizophrenia. The recent successes have required persistence 
in the face of considerable adversity. Some of the barriers to pro- 
gress reflect difficulties inherent in genetic studies of almost any 
complex disorder, medical or psychiatric. Others reflect the par- 
ticular challenges of studying heterogenous psychiatric disorders 
where the diagnostic categories are, at best, of uncertain validity (see 
Chapters 7 and 8). That molecular genetic research in schizophrenia 
has been allowed to continue, and ultimately flourish, owes much 
to the work of genetic epidemiologists who, over many decades, 
have consistently documented a substantial heritable component to 
variation in liability to schizophrenia. This section provides a brief 
consideration of that essential underpinning work; readers with a 
particular interest in this work, much of it historical, are directed to 
authoritative reviews elsewhere [1-3]. 


Family studies 


Family studies spanning much of the 1900s demonstrated that first- 
degree relatives of index cases with schizophrenia (probands) had 
a higher risk, about 10-fold, of developing the disorder than the 
population at large [1]. The validity of these early studies was often 
questioned, given methodological shortcomings, for example lack 
of control groups, biased ascertainment, diagnoses that were not 
blind to family relationships, and diagnoses which were unreliable 
in the absence of operational diagnostic criteria. However, from the 
1980s onwards, a wave of more sophisticated studies addressed these 
deficiencies; overall, they consistently supported the earlier work 
[1, 3]. For example, the risk for siblings of probands with schizo- 
phrenia is around 8-10% (Fig. 59.1). At first inspection, the lifetime 
risk of schizophrenia among parents of an affected individual (6%) 
seems low, compared with other first-degree relatives (siblings and 
offspring). However, due to low fecundity associated with schizo- 
phrenia [4], parents as a general class have a low risk of the disorder, 
and when risk is expressed relative to population baseline rates, the 
relative risk is similar (around 9-10) for parents, siblings, and chil- 
dren of an affected individual [5]. 

Compared with earlier plots of morbid risk [1], the inclusion of 
newer data in Fig. 59.1 results in two major differences. One concerns 





the risk in twins (see Twin studies, p. 587). The other concerns the 
risk to offspring of two affected parents, which was previously esti- 
mated to be around 48%, based on a small number of observations, 
but a more recent (large) study puts this at around 28%, increasing to 
39% when schizophrenia-related disorders are included [6]. 





Twir es 


The principles underlying the many twin studies conducted in 
psychiatric disorders are the same. Monozygotic (MZ) twins are 
perfectly correlated for all genetic loci that show variation in the 
population, whereas dizygotic (DZ) twins are, on average, 50% cor- 
related. Assuming that the extent to which MZ and DZ twins share 
their environment is roughly equal, higher concordance in MZ 
twins suggests the disorder is, at least partly, genetic. Concordance 
of less than 100% in MZ twins suggests environmental factors or 
chance (including de novo mutation) also play a role. Early twin 
studies showed higher concordance in MZ twins, compared to DZ 
twins, but as with family studies, methodological concerns left these 
open to criticism. However, the most recent wave of studies have ad- 
dressed earlier methodological limitations. 

In a pooled sample analysis of four modern-era twin studies, 
proband-wise concordance rates for (strict and probable) DSM- 
III-R schizophrenia were 50% in MZ twins and 4.1% in DZ twins 
[7]. Modelling suggested that variance in population risk was best 
attributed to additive genetic factors acting together with specific 
(in the sense of exposures affecting only one twin) environmental 
factors. Heritability, the component attributable to genes, was 88%, 
although a more recent meta-analysis of a larger number of twins 
provided a lower estimate of around 81% [8]. Surprisingly, the life- 
time risk for schizophrenia among DZ co-twins of affected prob- 
ands seems considerably lower than that for non-twin siblings for 
reasons that are not fully established. The substantial difference in 
concordance between MZ and DZ twins is suggestive of influences 
from gene-environment correlation and/or non-additive genetic 
inheritance [9]. 


Adoption studies 


As both the genetic and environmental backgrounds of family 
member twins are, at least in part, correlated, it is difficult to sep- 
arate definitively the contributions of genetic and environmental 
factors through family and twin study designs alone. Moreover, the 
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Fig. 59.1 Lifetime risk of schizophrenia according to the relationship to an affected individual. 

Source: data from Gottesman II, Schizophrenia Genesis: The Origins of Madness, Copyright (1991), WH Freeman; Am J Med Genet, 97(1), Cardno AG, Gottesman II, Twin studies 
of schizophrenia: from bow-and-arrow concordances to star wars Mx and functional genomics, pp. 12-17, Copyright (2000), John Wiley and Sons; Arch Gen Psychiatry, 67(3), 
Gottesman II, Laursen TM, Bertelsen A, et al., Severe mental disorders in offspring with 2 psychiatrically ill parents, pp. 252-7, Copyright (2010), American Medical Association. 


validity of twin studies has repeatedly been criticized, based on the 
possibility of greater correlations between the prenatal and postnatal 
environments of MZ than DZ twins. Thus, a third design based on 
adoption studies has formed an important foundation for the hy- 
pothesis of an important role for genes in the aetiology of schizo- 
phrenia. The premise underpinning the adoption design is that if 
genes play a role in a phenotype, adoptees should show evidence for 
phenotypic correlation with their biological relatives, despite being 
removed from a correlated environment. 

Detailed discussion of the early adoption literature is given else- 
where [1, 10]. Briefly, adoptee studies have shown that the adopted- 
away children of people with schizophrenia have higher rates of 
schizophrenia and schizophrenia spectrum disorder than adoptees 
of controls. Adoptive family studies have shown that the biological 
relatives of affected adoptees have higher rates of the disorder than 
either their adoptive relatives or the relatives of unaffected adoptees. 
And one cross-fostering study showed that adoptees of control 
parents raised by a parent with schizophrenia had a lower risk of the 
disorder than the biological offspring of a person with schizophrenia 
raised by adoptive parents without the disorder. 

Most adoption studies were conducted prior to the use of modern 
operational diagnostic criteria, so particularly notable is a study 
based on the Finnish national register, which used such criteria 
from the outset. Narrowly defined schizophrenia only showed a 
trend level difference in the adopted-away children of mothers 
with the disorder, compared with those of controls (5.3% vs 1.7%, 
respectively). However, the risk for schizophrenia spectrum dis- 
orders (22.46%) among adoptees whose mothers had schizophrenia 
spectrum disorder was much higher than the adoptees of controls 
(4.36%), supporting the hypothesis of a genetic contribution to 
a broader schizophrenia-related phenotype [11]. In the most re- 
cent study incorporating adoption data, adoptees with an affected 


biological parent had an elevated relative risk (13.7) of developing 
schizophrenia [12] in the range expected for first-degree relatives of 
an affected proband. 

Overall, the findings from adoption studies of multiple designs 
are consistent with those from other genetic epidemiological ap- 
proaches; while the validity of the assumptions underpinning each 
study design can be debated, this consistency provides a strong and 
coherent body of evidence for an important contribution of genetic 
inheritance to the aetiology of schizophrenia. 


Genetic epidemiology and risk across disorders 


The challenges of psychiatric diagnosis, and the evidence for the 
validity or otherwise of the boundaries that separate diagnostic 
categories or that delineate subtypes, are discussed in Chapters 7 
and 8. However, to appreciate the context of recent genomic find- 
ings, this topic requires brief consideration. Genetic research has 
generally treated schizophrenia as a fully distinct entity from major 
mood disorders, based on both Kraepelinian tradition and also 
evidence from family studies that suggested the disorders breed 
true (that is, they segregate in different families). The status of 
schizoaffective disorder has historically been unclear. Some have 
considered it to be a categorical error, others a form of bipolar dis- 
order, of schizophrenia, a distinct entity, or a state of comorbidity 
[13]. Moreover, although schizophrenia is now widely considered 
to be a neurodevelopmental disorder, it has been regarded as com- 
pletely distinct from other neurodevelopmental categories such as 
intellectual disability (ID), autism spectrum disorder (ASD), and 
attention-deficit/hyperactivity disorder (ADHD). However, the 
idea that the disorders, in particular schizophrenia and mood dis- 
orders, breed true is not fully supported by historical family studies 
[3] and recent genetic epidemiology now offers substantial chal- 
lenges to this view. 


One study that has been particularly influential was based on 
over 2 million families from Sweden [12]. As expected, first-degree 
relatives of probands with schizophrenia had around a 9- to 10- 
fold increase in the risk for the disorder. However, they also had a 
substantially increased risk (3- to 6-fold, depending on the class of 
relative) of bipolar disorder, as did the adopted-away biological chil- 
dren of parents with schizophrenia. Modelling suggested around 
half of the heritability of schizophrenia comes from genetic effects 
that also contribute to bipolar disorder. Strong evidence for over- 
laps in the genetics of schizophrenia, schizoaffective disorder, and 
bipolar disorder were also found in an innovative twin study [14] 
in which individuals were allowed to have multiple diagnoses (that 
is, symptoms of one disorder were not used to exclude another dis- 
order), and similar conclusions can be drawn from a study showing 
that the heritability of narrowly defined schizophrenia is essentially 
unchanged when other psychotic disorders are included [9]. 

Regarding possible overlaps among neurodevelopmental dis- 
orders, a study of three different data sets supported an earlier 
Danish study in finding that first-degree relatives of people with 
schizophrenia had more than twice the expected risk for ASD [15]. 
The parents and sibs of probands with bipolar disorder also had 
an increased risk for ASD, albeit to a lesser degree. Extending the 
findings to ID, an Australian study found the children of mothers 
with schizophrenia, bipolar affective disorder, and unipolar depres- 
sion to be at an increased risk of ID [odds ratio (OR) 2.9-3.1] [16]. 
As we have discussed, familial clustering does not imply shared 
genes, but the findings are consistent with the hypothesis of genetic 
overlap, a theme we consider later when discussing the molecular 
genetics data. 


Paternal age and fecundity 


People with schizophrenia have fewer children than the general 
population, an effect that is quite substantial, with men and women 
with the disorder reproducing at rates of 75% and 50%, respectively, 
lower than expected [4]. Low fecundity associated with schizo- 
phrenia has led to speculation about how risk alleles, and therefore 
the disorder, remain apparently impervious to removal from the 
population by natural selection [4]. It has also been established that 
children of older fathers are at increased risk of developing schizo- 
phrenia, a recent study estimating the risk to be 1.5-fold higher than 
expected in children of fathers over the age of 45 [17]. A hypoth- 
esis that has tried to link these two observations and has become 
a topic of sufficient interest to deserve special mention is that the 
answer to the persistence of the disorder lies in de novo or new mu- 
tation. The hypothesis is largely founded upon the fact that, like the 
risk of schizophrenia, the rate of de novo point mutation also in- 
creases with paternal age [18]. It is clear that de novo mutation does 
indeed contribute to schizophrenia (discussed later), but this con- 
tribution is minor and is unlikely to explain either the persistence 
of the disorder in the population or the effect of paternal age. The 
true explanations for the paternal age effect are not fully resolved, 
but a credible alternative may be that unaffected men who have a 
higher genetic loading (and therefore whose children are at higher 
genetic risk) may also tend to have children later, perhaps as a re- 
sult of subclinical behavioural or personality traits [19]. This is of 
more than theoretical interest, given the potential to give incorrect 
advice about the risks that might accompany delayed fatherhood, 
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based upon a model where the directions of cause and effect are in- 
correctly specified. 


Molecular genetics 


Linkage, association, and genetic architecture 


Linkage performs best when applied to genetic disorders with 
simple (dominant, recessive, sex-linked) patterns of inheritance 
but rapidly loses power in the face of marked locus heterogeneity 
(that is, there are large numbers of distinct genomic regions with 
pathogenic variants). This is particularly true if the heterogeneity 
cannot be constrained by selecting families with particular pheno- 
typic features. It is also not suited to disorders where the risk alleles 
are numerous and common and have small effects because even in 
a single family, the risk is likely to be due to multiple alleles, none of 
which is required to be co-transmitted with the phenotype. Under 
that scenario, association studies offer a more powerful approach. 
Linkage allows researchers to survey the whole genome for 
disease-causing variants, using only a few hundreds of genetic 
markers, whereas hundreds of thousands of markers are required 
to screen the whole genome by association. In the late twentieth 
century, technological limitations in the speed and cost of genome 
analysis meant that linkage was perceived to have a huge advantage 
since researchers trying to apply association methodologies were 
forced to select specific candidate genes. Candidates were selected 
on the basis of being located within a region of putative linkage (pos- 
itional candidates) or because they were postulated to be involved on 
the basis of their role in biological processes (functional candidates). 


Linkage 


While linkage studies routinely delivered hundreds of pathogenic 
genes for single-gene disorders, they did not deliver replicable find- 
ings in schizophrenia, or indeed many other common medical dis- 
orders. Even a meta-analysis of over 3200 separate pedigrees [20] 
failed to identify high-confidence linked loci. With the benefit of 
hindsight, it is now clear that the genetic and phenotypic architec- 
tures of schizophrenia are probably as unfavourable for linkage as it 
is possible to be. 

It is true that large numbers of statistically significant linkages 
were observed in individual pedigrees or groups of pedigrees (for an 
example, see [21]) and that genuine linkages are difficult to replicate 
in the face of extensive locus heterogeneity. These two factors mean 
we cannot be certain whether the failures to replicate initial link- 
ages reflect the greater-than-expected locus heterogeneity of the dis- 
order or whether they were simply false positives. For now, what we 
can say with certainty is that no Mendelian forms of schizophrenia 
have yet been identified. Interest in applying (modified) principles of 
linkage is likely to be rekindled by the new genome-wide sequencing 
methods which, at single base, rather than mega-base, resolution, 
may deal better with heterogeneity. 


Candidate gene association 


In the 1990s and early 2000s, large numbers of candidate genes and 
variants were tested in schizophrenia. In Table 59.1, we present 
the findings for some of the most widely discussed genes from the 
candidate gene literature [22], based upon results from the largest 
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Table 59.1 Associations to the most widely discussed genes 
from the candidate literature (p values represent the best association 
within, or close to, the gene) 
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Source: data from Mol Psychiatry, 20(5), Farrell M, Werge T, Sklar P, et al., Evaluating 
historical candidate genes for schizophrenia, pp. 555-62, Copyright (2015), Springer 
Nature; Nature, 511, Schizophrenia Working Group of the Psychiatric Genomics 
Consortium (PGC), Biological insights from 108 schizophrenia-associated genetic loci, 
pp. 421-27, Copyright (2014), Springer Nature. 


published genome-wide association study (GWAS) [23] of schizo- 
phrenia, which included most of the data sets that were considered 
supportive of the earlier associations, and the findings can there- 
fore be considered to largely supersede the earlier evidence. In 
the vast majority of cases, the candidate genes are not supported 
at what is now considered an appropriately stringent threshold (p 
<5 x 107). However, there are caveats to this conclusion. In some 
cases (DTNBP1, NRG1, RGS4), the strongest findings depended on 
marker combinations (haplotypes), which are not specifically evalu- 
ated in GWAS data. Whether this is an important caveat is unclear, 
since the high-density coverage of markers in the GWAS is likely 
to have covered the markers presumed to drive the haplotype as- 
sociations. Secondly, for one locus—DRD2 encoding the dopamine 
D2 receptor, there is now strong evidence for an association in the 
vicinity of this gene, although not to an allele that was identified in 
the candidate gene literature. For those particularly interested in 
candidate genes, and the differences in approach that systematic 
coverage of candidate genes has enabled, we recommend two pa- 
pers: one pre-GWAS [22] and one post-GWAS [24]. As the latter 
concludes, the current evidence suggests that the historical candi- 
date gene literature did not yield clear insights into the genetic basis 
of schizophrenia. 

With hindsight, there are several reasons why this phase of gen- 
etic study did not deliver. Major limitations in our understanding of 
schizophrenia meant the choice of candidates did not constrain the 
enormous variation of the genome to a substantially smaller number 
of genes with a high prior probability of being associated. Moreover, 
how small the association effect sizes were likely to be, and there- 
fore what constitutes adequate sample sizes, was not widely appre- 
ciated until the landmark work of the Wellcome Trust Case Control 
Consortium [25]. Together, small effect sizes and the low prior prob- 
ability for any candidate variant meant the studies were substantially 
underpowered to detect true associations at the stringent levels of 
significance necessary to control for type I errors. For example, a 
candidate gene study of 2000 subjects (a large study for most of the 
candidate gene era) has power of only 0.001 to detect an effect size 
at the upper end of the range (OR 1.2) that has subsequently been 


shown to operate for schizophrenia, at genome-wide significance. 
Moreover, even if some of the candidate associations were true, 
subsequent replication studies would have been underpowered to 
confirm them. 


Genome-wide association studies 


Human variant mapping studies, such as The HapMap Project, 
have documented the nature and genomic location of a high frac- 
tion of all common DNA sequence variation that is present in the 
human population. This knowledge, coupled with novel genomic 
technology and statistical methodology, made it possible around 
15 years ago to undertake the so-called GWAS, in which millions 
of SNPs are comprehensively assayed in an individual in a single ex- 
periment. In contrast to earlier association approaches, the GWAS 
design does not rely on prior knowledge of the most likely candidate 
genes or candidate genes, making GWAS seem an ideal tool for dis- 
orders of unknown pathophysiology such as schizophrenia. 

The first GWAS studies of schizophrenia, based on small samples, 
failed to identify associations, leading some to suggest that the class 
of genetic variant accessible to this study design, that is, common 
alleles, plays no role in the disorder. However, an alternative view, 
informed by similar developments in complex genetics as a whole, 
was that the early studies were underpowered; other than suggesting 
that no common risk alleles operate in the disorder with large ef- 
fect sizes (ORs >2; equivalent to APOE in Alzheimer’s disease), the 
main message was that success would require extremely large sam- 
ples that could realistically only be obtained by collaboration. How 
large GWAS studies might have to be was shown by the study of 
O’Donovan and colleagues (2008) [26]. Like those that preceded it, 
this study was based on what would now be considered a very small- 
discovery GWAS sample, but the top findings were pursued sequen- 
tially in large replication cohorts into a final meta-analysis, which 
included 7308 schizophrenia cases and 12,834 controls. This led 
to highly suggestive evidence for association to a locus containing 
the gene ZNF804A, but genome-wide significant evidence was 
only obtained when the affected phenotype was broadened to in- 
clude bipolar disorder, allowing an analysis of 9173 cases and 12,834 
controls. Association to schizophrenia on its own at genome-wide 
significance was not obtained for a further 3 years, by which time the 
analysis was based on almost 60,000 subjects [27]. 

In terms of gene discovery, the early work was disappointing, 
but it did point to some of the key features that have emerged from 
much larger GWAS. Firstly, the effect size of the associated common 
variant at ZNF804A was small, with an OR of 1.1, an effect size typ- 
ical of the findings that have followed. Secondly, while it was difficult 
to obtain genome-wide significance, the study demonstrated an ex- 
cess of nominally significant replications when the top GWAS asso- 
ciations were tested in large independent samples. This suggested 
that large numbers of true associations lay among the markers 
showing moderate evidence of association in GWAS that even large 
meta-analyses were underpowered to unequivocally implicate. And 
thirdly, the enhanced evidence for association between a variant at 
the gene ZNF804A and a combined schizophrenia-bipolar pheno- 
type tends to support the findings from epidemiology for genetic 
overlaps between these two disorders. 

A major insight into the role of common variants in schizo- 
phrenia came not from specific associations, but from patterns of 
data from large groups of markers. Seminal here was the work of 


the International Schizophrenia Consortium [28] that proposed 
large numbers of common risk alleles exist for schizophrenia, but 
in the GWAS sample they had available (about 3000 cases and 4000 
controls), these could not be identified at the required levels of sig- 
nificance due to small effect sizes. To test this, the ISC considered 
all alleles in their study to be possible risk alleles if they were asso- 
ciated with schizophrenia at a very loose threshold of significance 
(for example, p <0.5). The ISC postulated that under a highly poly- 
genic model, although most such associated alleles will be false, 
there would also be large numbers of true risk alleles captured at this 
threshold. They further postulated that if the disorder were poly- 
genic enough, in independent data sets, individuals with the dis- 
order would, on average, carry a larger burden of the presumed risk 
alleles than those without the disorder. In their landmark study, not 
only did the ISC confirm this to be true, they also showed by testing 
a range of models of genetic architecture that it was likely that the 
component of risk conferred by common risk alleles was distributed 
across at least 1000 alleles, and quite possibly many more than this. 
Modelling also suggested that the common polygenic SNP compo- 
nent to the disorder contributed around a third of the genetic risk to 
the disorder. The approach developed by the ISC, variously known 
as polygenic risk scoring (PRS), risk profile scoring (RPS), and gen- 
etic risk scoring (GRS), has, as we shall discuss later, become an 
important tool in psychiatric research extending beyond discovery 
genetics. 

Since these early studies, approximately 20 risk alleles were 
identified from studies conducted by informal groupings of inves- 
tigators and structured consortia (see [29]). However, the main 
breakthrough in detecting common alleles for the disorder re- 
quired the international community to pool their resources into 
a super-consortium, which is now known as the Schizophrenia 
Working Group of the Psychiatric Genomics Consortium. The ef- 
fect of this has been dramatic; in its first paper in 2011, the PGC 
reported ten associations using a sample of 9394 cases, but only 
3 years later, based on a discovery GWAS sample of approximately 
35,500 cases and 45,000 controls and with additional follow-up in 
another sample of 1513 cases and 66,236 controls, 128 independent 
genetic associations spanning 108 physically distinct genomic loci 
were identified [23]. 

Associations were particularly enriched for genes expressed in 
the brain, confirming schizophrenia is primarily a CNS disorder, 
rather than a manifestation of peripheral dysfunction, although 
the findings do not exclude a role for the latter. Some of the asso- 
ciations appear to give support to classical neurotransmission hy- 
potheses of schizophrenia, including associations to loci containing 
DRD2 (encoding the dopamine receptor D2) and genes involved 
in glutamatergic neurotransmission (for example, GRM3 encoding 
metabotropic glutamate receptor type 3, GRIN2A encoding subunit 
2A of the ionotropic NMDA receptor, and GRIA1 encoding subunit 
1 of the ionotropic AMPA receptor). The finding of associations to 
the locus encoding DRD2, the only known effective therapeutic 
target in schizophrenia, also holds out the possibility that other as- 
sociations highlight drug treatment targets. However, the vast ma- 
jority of the findings are not so plausibly interpreted and likely hold 
clues to novel aspects of pathophysiology. In particular, multiple as- 
sociations to genes (CACNA1C, CACNB2, and CACNA1I) encoding 
calcium channels point to the likely importance of this family of pro- 
teins in the disorder. 
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As is characteristic of GWAS, the vast majority of the associ- 
ations (including those to the loci containing the candidate genes 
discussed) could not be credibly ascribed to DNA variants that 
are predicted to change the amino acid sequences of proteins, and 
it seems most likely that the majority of susceptibility alleles exert 
their effects by altering gene expression. This makes interpreting the 
functional significance of the associations a challenging task and re- 
searchers are still mining the findings, aiming to develop a deeper 
understanding of their implications for pathophysiology. 

A recent example of the sort of work required was recently re- 
ported by Sekar and colleagues (2016) [30], who dissected the 
source of an association at the extended MHC region of chromo- 
some 6. This region spans about 8 million bases, contains hundreds 
of genes, and represents the most robustly associated locus in the 
PGC study and the locus with the largest (though still modest) effect 
size (OR = 1.2) [23]. The authors showed that much of the associ- 
ation at this locus was driven by a complex chromosomal structural 
rearrangement, which affects expression of the gene complement 
component C4A. How this finding potentially fits with emerging 
insights into the biological basis of schizophrenia is discussed later. 

Despite the undoubted successes of the PGC study, only a 
small fraction of all common variation has been identified, repre- 
senting 3-4% of the variance for the disorder. However, the PGC 
and GWAS that preceded it do provide a clear strategy for success; 
notwithstanding the enhanced heterogeneity that most investiga- 
tors assume comes from pooling samples from a diverse range of 
ancestries (the PGC study was based on individuals of European and 
Asian ancestry from over 30 countries), diagnostic methods, and as- 
certainment strategies, large samples clearly deliver. It is a certainty 
that in the next few years, the number of associated loci will increase 
considerably. Developing methods and resources that accelerate the 
rate by which those findings can be converted to biological insights 
into the aetiology will be critical if the wealth of findings are to be 
translated into benefits for patients. 


Genomic copy number variants 


CNVs are deletions and duplications of chromosomal segments of at 
least 1000 bases (1 kb); they are highly relevant to autism and other 
forms of ID, as discussed in Chapter 28. Prior to the GWAS era, a 
deletion CNV at chromosome 22q11.2 (22q11.2del) was the only 
DNA variant, common or rare, that was indisputably pathogenic for 
schizophrenia [22]. This deletion is also known to cause a congenital 
syndrome variously called velo-cardio-facial syndrome (VCFS), di 
George syndrome, or Shprintzen syndrome. The fact that carriers 
of this CNV usually have recognizable clinical and dysmorphic fea- 
tures was critical to the early discovery of its involvement in schizo- 
phrenia, since it was possible to demonstrate an elevated rate of 
psychosis in those with the clinical features of VCFS. 

Although few doubted that the rate of psychosis was elevated in 
those with VCFS, it was not uncommon for it to be argued that the 
associated psychiatric phenotype represented a symptomatic form 
of psychosis secondary to ID, rather than a disorder that should 
be considered true schizophrenia. However, it has now become 
more widely accepted that those with VCFS have schizophrenia for 
reasons that we touch on here, as they are relevant more generally to 
any discussion of rare variants in schizophrenia. Firstly, studies using 
research diagnoses showed that the psychosis that occurs in VCFS 
meets operational diagnostic criteria for schizophrenia, the only 
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criterion for schizophrenia that we currently have. Secondly, studies 
of ‘routine’ clinical cases of schizophrenia have demonstrated that 
those with operationally diagnosed schizophrenia also have a greatly 
elevated rate (about 40- to 60-fold) of 22q11.2del [31]. Thirdly, the 
elevated risk of schizophrenia in VCFS is not restricted to those with 
obvious ID. Fourthly, cognitive impairment is increasingly accepted 
as a core feature of schizophrenia, rather than a primary cause of 
the disorder (see Chapter 57). Finally, as the number of CNVs as- 
sociated with both cognitive impairment and schizophrenia has in- 
creased, so the idea that the psychosis in 22q11.2del is distinct from 
the rest of schizophrenia seems less tenable, and more so since it has 
been recently shown that people with CNVs and schizophrenia also 
have elevated burdens of the sorts of common schizophrenia risk 
alleles derived from GWAS in general (that is, non-CNV carrier) 
patient populations [32]. 

Beyond 22q11.2del, the recognition of a wider role for CNVs in 
schizophrenia had to await the development of genome-wide tech- 
nologies that allow a genotype-, rather than a phenotype-led, ap- 
proach. Evidence for a role of CNVs in schizophrenia comes from 
three main sources. 

Firstly, as we already discussed, risk alleles such as CNVs that 
confer large effects on risk of the disorder are likely to be under 
strong negative (natural) selection, leading some to the hypothesis 
that risk alleles of large effect might be maintained by new or de 
novo mutations. Consistent with this prediction, CNVs have been 
observed to occur as new or de novo mutations more frequently 
in cases (about 5% at the current resolution for detecting CNVs) 
relative to controls (about 1-2%) [33]. Secondly, at a genome-wide 
level, large studies have consistently shown that the total burden of 
large (>100 kb) and rare (frequency <1%) CNVs (both deletions and 
duplications) is increased around 1.15-fold in patients with schizo- 
phrenia in comparison with controls [34]. Finally, and most conclu- 
sively, there is now strong evidence implicating multiple specific loci 
in the disorder. 

The earliest genotype-led study to suggest the role of a CNV 
in schizophrenia was a small study of parent-proband trios in 
which the aim was to discover de novo mutations [35]. Two candi- 
date CNVs were identified—one a de novo duplication spanning a 
number of genes, including APBA2, and the other was a transmitted 
deletion at NRXN1. Both CNVs had been previously implicated in 
neurodevelopmental phenotypes, suggesting they may be patho- 
genic. Moreover, APBA2 and neurexin 1 are members of a complex, 
which mediates interactions between pre- and post-synaptic struc- 
tures in the brain. Promising though these findings were, they were 
no more than suggestive, and it took a larger case-control study to 
convincingly implicate deletions at the neurexin 1 locus in the dis- 
order, with a substantial OR of around 9 [36]. 

The most robust early evidence implicating specific CNVs, how- 
ever, came from simultaneous publications from two consortia— 
the SGENE+ consortium [37] and the International Schizophrenia 
Consortium [34]. Hitherto unprecedented in psychiatric genetics, 
the two studies had almost identical results. Both implicated de- 
letions at 15q13.3 and 1q21.1, while the SGENE+ study addition- 
ally reported evidence supporting deletions at 15q11.2 that the ISC 
study had excluded for technical reasons. Since this early work, a 
number of additional, strongly supported schizophrenia-associated 
CNVs have been identified (Fig. 59.2). All the CNV loci implicated 


so far are rare (population frequency usually <0.1%), each occurring 
in fewer than 1% of cases, and those that have been discovered have 
considerably larger effect sizes than SNPs, with ORs ranging be- 
tween about 2 and 60 [38]. It should be noted that power consider- 
ations imply that rare variants can only be convincingly associated 
if they have large effect sizes, and therefore, the profile of effect sizes 
documented to date may be more a feature of the current sample 
sizes than an intrinsic property of CNVs as a mutation class. 

One of the most notable features of schizophrenia risk CNVs is 
that they are also risk factors for other neurodevelopmental dis- 
orders, including ID, ASD, epilepsy, and ADHD, and a range of con- 
genital malformations, that is, they are often pleiotropic. Indeed, 
every CNV known to increase the risk of schizophrenia also does 
so for ID [39]. The theme of shared risk factors across disorders is 
taken up further under Sequencing studies, p. 592 and Applications 
of polygenic methodology, p. 594. 


Sequencing studies 


For the present purposes, although CNVs and other chromosomal 
arrangements are detectable by sequencing, the primary aim of 
sequencing studies is to identify variants similar in structure to those 
detected by GWAS (that is, single nucleotide and small insertion/de- 
letion variants), but which are too rare to be captured by GWAS. Cost 
implications have led researchers to focus on the coding exome, so 
we will refer to these as rare coding variants (RCVs) that change the 
DNA sequence at single or a few nucleotides. As for studies of CNVs, 
exome sequencing studies have been prosecuted along two main de- 
signs: family-based seeking a contribution from de novo RCVs and 
case-control. Even though the largest of the first wave of studies 
were grossly underpowered to detect individual pathogenic single 
nucleotide variants (SNVs), they did show that groups of genes that 
had prevously been identified as likely related to the disorder were 
enriched for very rare, non-synonymous RCVs in cases, compared 
with controls, and among mutations occuring as de novo mutations 
[40, 41]. Just as elevated burdens of rare CNVs [34] and common 
risk alleles [28] heralded the identification of specific risk alleles of 
those classes, this burden of RCVs provided for optimism that the 
same would be true for this class of alleles when sample sizes are 
adequate. 

Sample sizes are still too small to expect large-scale discovery, 
but nevertheless, this optimism has already been shown to be well 
founded. A meta-analysis of 4264 cases, 9343 controls, and 1077 
parent-proband trios obtained strong evidence for a single gene en- 
riched for very damaging (so-called loss of function, or LoF) mu- 
tations in schizophrenia [42]. The gene SETD1A encodes a protein 
that methylates histone proteins, a process believed to be important 
in regulating gene expression. Identifying which genes are regulated 
by SETD1A is the next step in understanding how loss of activity of 
this protein leads to disease. 

Sequencing studies have also recapitulated the findings from 
CNVs of overlaps between neurodevelopmental disorders. As a 
group, genes impacted by LoF de novo mutations in schizophrenia 
are enriched for those affected by this same class of mutation in 
people with ASD and ID [40], while at an even finer level of reso- 
lution, the same LoF mutation in SETD1A that contributes high 
risk to schizophrenia also does so for severe ID and developmental 
delay [42]. 
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Fig. 59.2 Penetrance of schizophrenia-associated CNVs for schizophrenia and other developmental disorders [38]. DD, developmental delay; ASD, 


autism spectrum disorders; CM, congenital malformation. 


Glimpses of the biological basis of schizophrenia 


Most of the genetic risk for schizophrenia is not yet linked to spe- 
cific DNA variants, and even the majority of robust associations do 
not implicate individual genes at high levels of certainty. This is be- 
cause GWAS implicates chromosomal regions, rather than defini- 
tively pointing to only a single gene, and CNVs usually span multiple 
genes. Nevertheless, there is already strong evidence that many of 
the findings converge onto plausible pathophysiological processes. 
System genomics studies (analyses that determine if certain types of 
mutations are overrepresented in groups of genes that have related 
functions) show that in people with schizophrenia, rare mutations 
of all classes (CNVs and RCVs, transmitted and de novo) tend to 
cluster in genes encoding post-synaptic components of excitatory 
synapses. In particular, an excess of mutations have been found re- 
peatedly in genes encoding proteins affiliated with the N-methyl-D- 
aspartate receptor (NMDAR) and those affiliated with the neuronal 
activity-regulated cytoskeleton-associated (ARC) protein which 
regulates post-synaptic strength at glutamatergic synapses [40, 41, 
43, 44]. Studies have also documented that rare mutations and, to 
a lesser extent, common variant associations are enriched among 


genes encoding proteins whose synthesis is inhibited by fragile X 
mental retardation protein (FMRP) [23, 40, 41]. Genes encoding 
voltage-gated calcium channels are similarly implicated both by 
GWAS and by one study of rare genetic variation [23, 41, 45], and 
more recently, evidence has emerged for enrichment of case CNVs 
in genes encoding GABA, receptor affiliated proteins [44], which 
has strong functional links with the glutamatergic system. All these 
sets of genes are important components or regulators of synaptic 
plasticity, a set of processes that are known to be critical for a range 
of cognitive processes, including learning, memory, and, it now 
seems highly likely, schizophrenia [44, 46]. 

These convergent findings point to some general biological hy- 
potheses, but they do not capture the totality of the association 
signal. There is evidence from the association between schizo- 
phrenia and rare variants at SETD1A, as well as from common 
GWAS signals [47] that chromatin modification is also implicated, 
while the identification of complement C4A as a susceptibility factor 
highlights the involvement of immune mechanisms. It is unclear if 
the findings discussed here also converge on the general area of syn- 
aptic plasticity, but it is notable that in the brain, much of C4 protein 
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co-localizes with glutamatergic markers. There is also evidence that 
mice lacking C4 show abnormalities of plasticity in visual pathways, 
perhaps through synaptic pruning. It is therefore possible that these 
apparently disparate biological pathways converge onto a coherent, 
and yet broad, set of pathogenic processes. 


Genetic prediction and testing 


Here we briefly consider the implications of the findings discussed 
previously. In people with schizophrenia entering genetic studies 
(which are biased towards chronicity and severity), the rate of known 
pathogenic CNVs is around 2.5%, implying low sensitivity for pre- 
diction. In the general population, the penetrance for schizophrenia 
in carriers (the proportion of carriers developing the disorder) is 
between 2% and 18% (Fig. 59.2) [38], implying low specificity for 
prediction. While the screening value of CNVs for prediction is 
therefore low for the proportion of cases who are carriers, that know- 
ledge may have value for patients who often are desperate for an ex- 
planation for their illness. Moreover, as we have seen, schizophrenia 
is not the only possible outcome in CNV carriers; considering de- 
velopmental delay, ASD, and congenital malformations, the average 
penetrance for any of these disorders is around 40% (Fig. 59.2). So 
for the 50% of the offspring of CNV carriers who themselves are car- 
riers, the risk for a range of neurodevelopmental outcomes is high. 
CNVs may also lead to comorbidity for medical phenotypes, which 
may be overlooked in people with psychosis who often do not gain 
good access to physical health care but that are treatable if detected 
(for example, cardiac defects, epilepsy, cataracts). Considerations 
such as these have led to calls for CNV testing to be offered to people 
with schizophrenia [48], although at the time of writing, as far as we 
are aware, this is not routine practice in any part of the world. 

The principles of PRS are discussed previously. This method pro- 
vides researchers with a tool that allows an unaffected individual's li- 
ability to the disorder to be estimated. While this has many research 
applications, currently, the degree of liability captured by PRS is not 
of clinical value; its sensitivity and specificity for predicting affected 
status are inadequate, and the predictive power as depicted by area 
under the curve (AUC) analysis is only 0.62 [23]. 


Applications of polygenic methodology 


As perhaps the first valid biomarker for liability to the disorder, PRS 
is opening up interesting areas for research [49]. In the first paper 
to exploit this, the ISC (2009) [28] showed the burden of common 
schizophrenia-associated alleles, as measured by PRS, was not just 
increased in people with schizophrenia, but it was also increased in 
people with bipolar disorder. This finding points to a shared genetic 
contribution between the two disorders and is consistent with some 
of the newer genetic epidemiology discussed earlier in the chapter 
[12]. Subsequently, RPS and other polygenic approaches [49] have 
documented substantial sharing of risk alleles between schizo- 
phrenia and major depression, as well as significant, and perhaps 
more unexpected, overlaps with autism, ADHD, anorexia nervosa, 
obsessive-compulsive disorder, and neuroticism. More controver- 
sially, polygenic risk for schizophrenia (in unaffected people) has 
been linked to poor cognitive performance yet, simultaneously, to 
higher educational performance and creativity [50]. Given the large 
number of risk alleles involved in schizophrenia, and the likelihood 
that these, in turn, influence multiple domains of brain function, 
it is perhaps not surprising that overlaps exist for common alleles 


across multiple psychiatric, behavioural, and cognitive pheno- 
types. Nevertheless, considered alongside results of studies showing 
overlap between neurodevelopmental disorders with respect to rare 
genetic variation [51], these findings pose challenges to the validity 
of current classification systems. 

The ability to measure, imperfectly, genetic liability in unaffected 
people also offers new opportunities for patient stratification. In 
the earliest such application, it was shown that in people with bi- 
polar disorder, loading for schizophrenia polygenic risk was higher 
in people with psychotic features and in those who met criteria for 
schizoaffective disorder [52]. Conversely, in people with schizo- 
phrenia, the polygenic burden of bipolar risk alleles is higher in 
those who have co-occurrence of manic symptoms [53]. Findings 
such as these suggest a biological structure of psychosis corres- 
ponding, in part, to symptom domains, rather than categories, 
although much more work is required to establish this unequivo- 
cally. Many other approaches to stratification based on risk alleles 
are being investigated, including treatment response, outcome after 
first episode, and predicting functional impairment, but none of the 
findings yet have the robustness of those documenting phenotype 
overlaps. 

PRS also offers the potential for looking in unaffected people for 
phenotypes that correspond to increased liability, for studying the 
developmental trajectory of individuals at elevated risk, and perhaps 
for identifying modifiable environmental factors or psychological 
cognitive traits that alter trajectories to illness. Again, these findings 
generally are not yet entirely robust, but it has been shown that ele- 
vated PRS relates to negative, rather than positive, symptom dimen- 
sions, suggesting the former may be better markers for detecting 
those at risk of the disorder [54]. 

Finally, PRS also offers the possibility of evaluating putative inter- 
mediate phenotypes that might mediate the link between genes 
and disorder. In schizophrenia, clear positive findings have yet to 
emerge, and for now, perhaps the most interesting of this type of 
study concerns negative studies. A particularly notable example of 
this, given the size and power of the study, was one that found no 
overlap between genes that influence the size of subcortical brain 
structures and those that influence schizophrenia [55], implying 
that the volume of those brain structures (for example, striatum, 
hippocampus) is not an important mediating factor of genetic risk 
for schizophrenia. 


Conclusions 


After a lengthy period of apparent advances and retreats, molecular 
genetic studies of schizophrenia have recently delivered robust find- 
ings in large number. These, in turn, are beginning to provide in- 
sights into general areas of pathophysiology, challenge the validity 
of psychiatric nosology, and provide novel tools for research more 
widely. Thus far, there has been limited clinical translation of these 
results, though some of the findings implicating rare genetic variants 
may be of direct relevance in the clinic. On a cautionary note, the 
current findings represent only a fraction of the total genetic con- 
tribution to schizophrenia. While there are grounds to believe that 
much more will be uncovered and that the process of doing so will 
accelerate, there remains much to be done to convert the genetic 
findings to a deep mechanistic understanding that will be required 


if genetics is to fulfil its promise of driving the development of novel 


treatments. 
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Neuroimaging has played a pivotal role in the scientific investigation 
of schizophrenia. In particular, imaging provided empirical evidence 
that the brain of patients with schizophrenia exhibits structural and 
functional differences, compared to both controls and patients with 
other diseases. This evidence was crucial for the long-standing de- 
bate of whether schizophrenia has biological roots or is purely social 
and psychological in origin (for example, as hypothesized by the- 
ories of ‘double-bind’ [1] or the concept of the ‘schizophrenogenic 
mother’ [2]). This introductory section provides a brief overview 
of seminal neuroimaging studies of schizophrenia across different 
modalities. 

Historically, an early and extremely influential in vivo demonstra- 
tion of structural brain changes in schizophrenia was provided by 
a computed tomography (CT) study by Johnstone and colleagues 
(1976). This study demonstrated significant enlargement of vent- 
ricles in patients with schizophrenia, compared to matched healthy 
controls; furthermore, ventricular size correlated significantly with 
cognitive impairment [3]. One concern was that these structural 
changes might have arisen from the long-term institutionalization 
of these particular patients and the often invasive treatments they 
had received (including insulin coma therapy). However, the finding 
of enlarged ventricles was subsequently replicated in multiple CT 
and magnetic resonance imaging (MRI) studies (Fig. 60.1) [4-6] 
and confirmed by meta-analyses [7]. 

Early functional neuroimaging studies of schizophrenia examined 
basic physiology, using radioactive measurements of cerebral per- 
fusion and metabolism, respectively, as provided by single-photon 
emission computed tomography (SPECT) and positron emission 
tomography (PET). A seminal SPECT study found that older (but 
not younger) patients with chronic schizophrenia had decreased 
perfusion in their frontal lobes, compared to a control group of alco- 
holic patients [8]. This ‘hypofrontality’ was also detected using PET 
and was related to poor performance in cognitive tests [5]. Together 
with increased metabolic activity in the basal ganglia, hypofrontality 





offered an explanation for the ‘[ ... ] simultaneous generation of 
positive and negative symptoms [.. . ]’ [9]. However, subsequent 
PET studies failed to find hypofrontality [10, 11]; this discrepancy 
may have arisen from several factors, including performance differ- 
ences between patients and controls in particular, but also differ- 
ences in whether studies examined ‘resting’ conditions or cognitive 
tasks and differences in statistical analysis techniques (for discus- 
sions of hypofrontality, see [12, 13]). 

Electroencephalography (EEG) played an important role in early 
brain activity investigations of schizophrenia, providing high tem- 
poral resolution of normal and pathological brain processes. Early 
studies reported multiple abnormalities in the electrical brain ac- 
tivity of patients with schizophrenia, for example epileptic-like 
potentials, reduced alpha activity, and/or enhanced fast-wave ac- 
tivity [14]. The investigation of evoked EEG responses to sensory 
events [event-related potentials (ERPs)] received particular interest. 
Notable findings in schizophrenia patients included reductions in 
N100 [15, 16] and P300 amplitudes [17], irrespective of the sensory 
modality. Since the early 1990s, the auditory mismatch negativity 
(MMN) has been widely investigated in schizophrenia. The MMN is 
the difference potential between responses to unpredictable stimuli 
(‘deviants’) and predictable stimuli (‘standards’). In the first study of 
MMN in schizophrenia, Shelley et al. (1991) found an attenuation 
of the auditory MMN amplitude in schizophrenia patients, as com- 
pared to controls [18]. This finding has since been replicated by a 
large number of studies (for a meta-analysis, see [19]) and continues 
to play a major role in contemporary schizophrenia research. 

Prior to the advent of functional MRI (fMRI), PET played a crit- 
ical role in assessing regional activation changes in schizophrenia 
during cognitive tasks in vivo [20-24]. Given the long-standing no- 
tions of schizophrenia as resulting from abnormal connectivity [25] 
(see The dysconnection theory of schizophrenia, p. 602), particular 
attempts were made to link aberrant activity in neural networks to 
symptoms of schizophrenia [23, 26-28]. For instance, the concept of 
‘cognitive dysmetria—‘the difficulty in prioritizing, processing, co- 
ordinating, and responding to information’ [29]—was linked to al- 
terations of a prefrontal-thalamic-cerebellar circuit measured with 
PET in schizophrenia patients [30]. 
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Fig. 60.1 Structural brain differences in schizophrenia. 

a) An 8-year follow-up CT scan in two patients (panels a and b, and 
panels c and d are the same patients) showing ventricular size and 
configuration. 

Reproduced from J Neurol Neurosurg Psychiatry, 51(2), Illowsky BP, Juliano DM, 
Bigelow LB, et al., Stability of CT scan findings in schizophrenia: results of an 8 year 
ollow-up study, pp. 209-13, Copyright (1988), with permission from British Medical 
Journal. 

b) Upper panel: control subject and outlined regions of interest in the 
emporal lobe. Lower panel: schizophrenia patient; increased ventricle 
size, increased tissue loss in the parahippocampal gyrus imaged 

with MRI. 

Reproduced from New Engl J Med, 327(9), Shenton ME, Kikinis R, Jolesz FA, et al., 
Abnormalities of the Left Temporal Lobe and Thought Disorder in Schizophrenia, 
pp. 604-612, Copyright (1992), with permission from Massachusetts Medical Society. 





Finally, the development of fMRI in the 1990s led to a break- 
through in neurophysiological research on schizophrenia in vivo. 
While the non-quantitative nature of the blood-oxygen level- 
dependent (BOLD) signal and the complexities of data analysis in- 
duced an initial delay in the uptake of fMRI, its considerably higher 
temporal and spatial resolution, compared to PET, as well as its 
radioactive-free nature, radically increased the utility and avail- 
ability of functional neuroimaging for schizophrenia research. The 
earliest fMRI applications to schizophrenia included neurological 
soft signs, as probed by simple motor tasks [31], auditory hallucin- 
ations [32], and working memory [33]. Since then, the vast majority 
of functional imaging studies on schizophrenia have used fMRI, as 
reflected by the literature covered in the following sections. 

Following the early milestone neuroimaging studies described 
previously, the neuroimaging research in schizophrenia has virtu- 
ally exploded, resulting in a vast body of literature that is impos- 
sible to cover in a single chapter. In the following, we partition 
the neuroimaging literature on schizophrenia into two broad 
classes: discovery-oriented and theory-driven approaches. With 
regard to the latter, we use four major contemporary pathophysio- 
logical theories in order to structure the imaging literature on 
schizophrenia and focus on studies with a direct connection to pro- 
posed disease mechanisms. We conclude with an outlook on com- 
putational neuroimaging approaches that strive for mechanistic 
interpretability, as opposed to descriptive accounts of neuroimaging 
data [34]. 





Discovery-oriented neuroimaging approaches 


The introduction of non-invasive neuroimaging with MRI enabled 
the systematic search for structural and functional differences in the 
brain of patients with schizophrenia. Most of the initial work, be- 
ginning in the early 1980s [35, 36], used structural MRI to identify 
neuroanatomical abnormalities. Two main approaches have been 
used: region of interest (ROI) analyses, which are based on a priori 
chosen regions delineated by atlases or manual tracing, and whole- 
brain approaches such as voxel-based morphometry (VBM) [37]. 

Some of the most notable replicated findings include: ventricular 
enlargement [38], cerebral atrophy [39], grey matter reductions in 
the amygdala, hippocampus, parahippocampal gyrus, and anterior 
cingulate gyrus [40, 41], and frontal lobe abnormalities, particu- 
larly in prefrontal and orbitofrontal subregions [42]. Whole-brain 
approaches, such as VBM, that are not biased to one particular re- 
gion, have pointed to grey matter decreases in patients with schizo- 
phrenia, compared to healthy controls, in several regions, including 
the thalamus, insula, superior temporal gyrus (in particular, in the 
left hemisphere, which is associated with auditory information pro- 
cessing and language), anterior and posterior cingulate, and medial 
frontal gyrus [43, 44]. 

Longitudinal studies addressed the question of whether these 
volumetric abnormalities were present before the onset of the illness 
or materialized only after its onset. A particular focus has been on in- 
dividuals at risk for schizophrenia; here, longitudinal studies found 
volumetric reductions of the temporal, cingulate, insular, prefrontal, 
and parahippocampal cortices in those individuals who later devel- 
oped first-episode psychosis symptoms [45]. In a large multi-site 
early prevention study of at-risk mental state (ARMS) individuals, 
larger reductions in grey matter volume of the parahippocampal 
gyrus were detected in ARMS individuals who transitioned to psych- 
osis, compared to those who maintained a subclinical symptom level 
[46]. Grey matter abnormalities also progressed after the onset of 
schizophrenia. Longitudinal studies in chronically ill patients sug- 
gest that progressive brain changes continue for up to 20 years after 
the first symptoms [7, 47]. These changes include lateral ventricle 
volume increases and grey matter density decreases in specific brain 
regions, including the superior frontal gyrus, superior temporal 
gyrus (particularly in the left hemisphere), right caudate nucleus, 
and right thalamus [7, 47]. 

The development of diffusion-weighted imaging (DWI) has made 
it possible to investigate white matter in vivo and derive estimates 
of connectivity by tractography. Meta-analyses of DWI studies in 
schizophrenia suggested that white matter abnormalities are ap- 
parent in two main regions—the left frontal deep white matter tract 
linking the frontal lobe, thalamus, and cingulate gyrus, and the left 
temporal deep white matter tract linking the frontal lobe, hippo- 
campus, amygdala, and temporal and occipital lobes [48, 49]. DWI- 
derived connectivity matrices have fuelled the emerging field of 
connectomics, which uses graph-theoretical methods to examine the 
topology of networks. Applications of the graph theory to DWI data 
from patients with schizophrenia suggested reduced network cap- 
acity for integration, as reflected by longer path lengths [50], reduced 
network efficiency [51], and reduced ‘rich club’ organization [52]. 

A parallel line of research has assessed these putative functional 
integration abnormalities in schizophrenia by corresponding 
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analyses of functional connectivity. While reduced functional 
connectivity—in particular fronto-temporal connectivity [27]—has 
featured prominently in the literature [52, 53], enhanced functional 
connectivity has also been reported [54, 55], even in the presence 
of reduced structural connectivity in patients [56]. However, this 
diversity may, at least in part, also result from methodological dif- 
ferences across studies. For example, Yang et al. (2014) showed that 
global signal regression strongly affected coupling estimates (see 
[56] for discussion). 

Graph-theoretical investigations of functional connectivity 
matrices have provided important insights into network alterations 
in schizophrenia [56]. Notable findings include reductions in clus- 
tering, small worldness, and the probability of high-degree hubs 
[53]. Interestingly, meta-analyses of fMRI activity in schizophrenia 
have shown that abnormal activations are concentrated in hubs of 
a ‘normative connectome’ such as the thalamus, middle cingulate 
gyrus, and the dorsolateral prefrontal cortex (PFC) [57]. 

An alternative approach to examining large-scale functional con- 
nectivity without a priori parcellation is independent component 
analysis (ICA). This method identifies networks as spatially distinct 
maps of voxels which share a common time course. This approach 
has shown across both ‘resting state’ conditions and cognitive para- 
digms that schizophrenia is characterized by altered global connect- 
ivity patterns [58, 59]. 





Theory-driven neuroimaging approaches 


The short summary of discovery-oriented analyses of structural and 
functional neuroimaging data in the previous section outlined alter- 
ations of global brain organization that are largely descriptive and do 
not specify a specific disease mechanism. We now turn to imaging 
studies that were motivated by a particular pathophysiological 
theory. These include: (1) the neurodevelopmental theory; (2) the 
dopamine theory; (3) GABAergic and glutamatergic theories; and 
(4) the dysconnection theory of schizophrenia. 


The neurodevelopmental theory of schizophrenia 


The neurodevelopmental theory of schizophrenia postulates an ab- 
errant development of the central nervous system as the cause of 
the disorder [60, 61]. In brief, this theory assumes that a disruption 
occurs during early (pre-/perinatal) or late (puberty and adoles- 
cence) neurodevelopmental processes that impacts on the biochem- 
istry, function, and structure (including connectivity) of the brain. 
Possible environmental factors include (social) stressors that result 
in endocrine and metabolic changes and/or infections, presumably 
interacting with specific genetic predispositions [62]. 

One starting point for the neurodevelopmental theory was the 
finding, from the early neuroimaging studies described previously, 
of structural brain differences in schizophrenia, especially increased 
lateral ventricles (Fig. 60.1). For example, intrigued by the finding 
that enlarged ventricles were already present at the earliest stage 
of schizophrenia, Murray and Lewis (1987) speculated that these 
changes represented possible consequences of earlier developmental 
disturbances [60]. Furthermore, this structural abnormality was ini- 
tially thought not to be associated with the duration of the disease 
or type of therapy [63]. (Later studies showed, however, that the en- 
largement of ventricles continues during the course of schizophrenia; 


for a meta-analysis and review, see [7, 64], respectively). These struc- 
tural findings were hypothesized to result from the interaction be- 
tween genetic predisposition and environmental insults. One model 
suggested that disruptions in normal neurodevelopment may take 
place in the pre- or perinatal period, through infections or obstetric 
complications which impact globally on the normal development 
of the nervous system, including proliferation and migration of 
neurons, and the growth and pruning of axonal connections [60, 
62, 63]. This was supported by studies showing that individuals who 
developed schizophrenia were more likely to have experienced pre- 
or perinatal adverse events, compared to healthy subjects (for re- 
view, see [65]); mixed evidence (reviewed in [66]) exists for a similar 
risk role of perinatal infections. 

An alternative to a more ‘global lesior is that the early damage 
of circumscribed brain regions may affect the development of other 
regions such as the PFC [61]. One particular candidate for such an 
initial lesion is the hippocampus, which contributes to regulating 
the activity of midbrain dopamine neurons; indeed, animal models 
with prenatal hippocampal lesions found increased activity of dopa- 
minergic neurons in the midbrain [67]. The ensuing change in 
neuromodulatory projections might impact on the development of 
prefrontal (and other) regions. 

A non-trivial challenge for the neurodevelopmental account of 
schizophrenia is to explain why morphological changes are not con- 
fined to the time of the hypothetical insult but continue to unfold in 
time. As noted by Kempton et al. (2010), the finding of continuously 
accelerated enlargements of the lateral ventricles ‘[ ... ] challenges 
an exclusively neurodevelopmental model of schizophrenia’ [64]. 
Beyond simple volumetric changes, several longitudinal structural 
MRI studies have demonstrated that schizophrenia has a progres- 
sive nature, with pronounced grey matter changes occurring over at 
least 20 years after disease onset; comprehensive reviews and meta- 
analyses can be found in [7, 68-70]. For example, an early study by 
Thompson and colleagues (2001) observed accelerated grey matter 
loss over 5 years in very early-onset schizophrenia, compared to 
healthy controls, starting in parietal brain regions and motor and 
supplementary motor cortices, and progressing later to the superior 
frontal gyri, dorsolateral PFC, and temporal cortices, including 
the superior temporal gyrus (Fig. 60.2) [71]. This progressive grey 
matter loss was correlated with an overall decrease in global func- 
tioning in the patient group. In particular, faster loss rates of grey 
matter in temporal regions were associated with more severe posi- 
tive symptoms, and faster loss rates of frontal grey matter loss with 
negative symptoms. 

Consistent with these results, a more rapid grey matter reduction 
relative to age-related loss in healthy controls was found in subse- 
quent longitudinal MRI studies across different clinical stages and 
medication status. This included first-episode schizophrenia pa- 
tients [72], medicated patients [73], and, to a lesser extent, drug- 
naive patients [74] (Fig. 60.2). Generally, medication appears to 
modulate, but not explain, accelerated grey matter loss in patients 
with schizophrenia [68, 75]. 


The dopamine theory of schizophrenia 


The dopamine (DA) theory of schizophrenia was initially based upon 
the thesis that the efficacy of neuroleptic drugs to attenuate psych- 
osis symptoms depended on striatal D2 antagonism [76]. Since then, 
a substantial number of PET and SPECT studies have provided 
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Fig. 60.2 Progressive grey and white matter loss in schizophrenia. 


pronounced in patients using a higher dose of atypical antipsychotics. 


Copyright (2013), with permission from Oxford University Press. 





permission from Elsevier B.V. 


compelling evidence of elevated presynaptic DA levels in the stri- 
atum (for example, [77]). Presynaptic DA can be investigated using 
tracers like '*F-dihydroxyphenyl-L-alanine ('*F-DOPA), which are 
taken up by dopaminergic terminals and probe DA synthesis cap- 
acity. Beyond individual studies, evidence of elevated DA synthesis 
capacity in the striatum was provided by meta-analyses of patients 
with schizophrenia [78, 79]. This elevation was also found in indi- 
viduals at risk for psychosis [80, 81], including medication-naïve 
prodromal individuals [82]. When followed longitudinally, a subset 
of the ultra-high-risk subjects who transitioned from the prodromal 
stage to frank psychosis showed a progressive increase in DA syn- 
thesis [83]. In addition to the striatum, corresponding findings were 
also recently made for the midbrain [84]. 

Another argument in favour of a central role of DA relates to 
the role of the DA system in learning and plasticity. DA modulates 


Dose antipsychotic medication at time of scanning 


a) Significant grey matter loss in normal adolescents and schizophrenia (see Colour Plate section). 


Reproduced from Proc Natl Acad Sci U S A, 98(20), Thompson PM, Vidal C, Giedd JN, et al., Mapping adolescent brain change reveals dynamic wave of accelerated gray matter 
oss in very early-onset schizophrenia, pp. 11650-5, Copyright (2001), with permission from National Academy of Sciences. 


b) Upper panel: total grey matter volume reduction depends on duration of illness; lower panel: total grey matter volume reduction was more 


Reproduced from Schizophr Bull., 39(5), Haijma SV, Haren NV, Cahn W, et al., Brain Volumes in Schizophrenia: A Meta-Analysis in Over 18 000 Subjects, pp. 1129-38, 


c) Regional fractional anisotropy reductions in the deep white matter of the left frontal and left temporal lobes in schizophrenia (see Colour Plate section). 
Reproduced from Schizophr Res, 108(1-3), Ellison-Wright |, Bullmore E, Meta-analysis of diffusion tensor imaging studies in schizophrenia, pp. 3-10, Copyright (2009), with 


glutamatergic plasticity in several ways; for example, activation of 
D1 receptors enhances, and of D2 receptors diminishes, NMDA 
receptor-mediated long-term potentiation [85]. Referring to the im- 
portance of DA in learning and plasticity, one theory of psychosis 
holds that individuals with schizophrenia attribute aberrant salience 
to irrelevant information due to inappropriate, ‘chaotic phasic firing 
of DA neurons [86-89]. This theory posits a key role of the DA system 
in mediating the misattribution of salience [90] and is supported by a 
host of fMRI studies in patients with schizophrenia. Firstly, empirical 
support for this theory comes from Pavlovian conditioning experi- 
ments where neutral cues were associated with appetitive or aver- 
sive events [91-93]. Behaviourally and autonomically, schizophrenia 
patients exhibited reduced differences in responding to paired cues 
(that is, cues that predict reinforcement) relative to unpaired ones. 
These behavioural effects were related to increased activity in the 
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dopaminergic midbrain and dopaminoceptive regions, including 
the ventral striatum, across both aversive and appetitive contexts 
[91, 92]. Secondly, in the context of instrumental learning, BOLD ac- 
tivation in the ventral striatum in response to reward-predicting cues 
was shown to be blunted in medicated patients [94], medication-free 
patients [95-97], unaffected patient siblings [98], and healthy con- 
trols with subclinical psychosis-like symptoms [99]. In some studies 
(for example, [94]), attenuated responses to reward prediction error, 
but increased responses to prediction error, on neutral trials were 
also found in the dopaminergic midbrain. 

Blunted striatal activations in response to task-relevant stimuli 
may be specifically linked to negative symptoms [100], even in 
unmedicated patients [95], whereas enhanced midbrain and stri- 
atal response to task-irrelevant stimuli appear to be related to 
positive symptoms in both unmedicated [94, 101] and medicated 
patients [93]. fMRI studies that specifically tested salience attribu- 
tions [102] provide further support for aberrant signals in the mid- 
brain and ventral striatum, as accounting for positive symptoms in 
schizophrenia [103]. 


The GABAergic and glutamatergic theories 
of schizophrenia 


The NMDA _ (N-methyl-D-aspartate) and GABA (gamma 
aminobutyric acid) receptor hypofunction theory of schizophrenia was 
based on the finding that dissociative anaesthetics, including keta- 
mine, an NMDA receptor (NMDAR) antagonist [104], can induce a 
transient psychosis-like state in healthy volunteers that is character- 
ized by perceptual alterations, delusion-like ideas, thought disorder, 
and blunted affect [105]. Ketamine can also aggravate psychosis in 
patients with schizophrenia [106]. 
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A single dose of ketamine leads to a significant reduction in the 
auditory MMN, an electrophysiological response to rule violations. 
This was first shown using invasive recordings in monkeys [107], 
followed by multiple demonstrations in humans, based on EEG (for 
example, [108-110]). Importantly, the auditory MMN is also sig- 
nificantly reduced in patients with schizophrenia, in a comparable 
manner as under ketamine application [111], and represents one 
of the most robust group-level physiological indices of the clinical 
diagnosis of schizophrenia. Reduced auditory MMN amplitude in 
schizophrenia was first described by Shelley et al. (1991), a finding 
that has since been replicated by dozens of studies (for a meta- 
analysis, see [19]). 

The auditory MMN is reduced not only in schizophrenia patients, 
but also among ARMS individuals with subclinical symptoms, and 
substantially more reduced in those who transition to psychosis 
[112]. A recent fMRI study also demonstrated auditory mismatch 
impairments in schizophrenia patients, as reflected by reduced 
BOLD responses to mismatch blocks in auditory regions (primary 
auditory cortex and superior temporal gyrus), anterior cingulate, 
and medial prefrontal gyri [113] (Fig. 60.3). 

In addition to NMDARs, GABAergic receptors have been im- 
plicated in the pathophysiology of schizophrenia [114-116]. Both 
processes may be closely connected—given that NMDARs pro- 
vide a major source of excitatory input to inhibitory interneurons 
[117, 118], NMDAR antagonism may lead to disinhibition of their 
post-synaptic target neurons, with less coherent pyramidal cell firing 
as a possible consequence (for review, see [119]). Indirect empirical 
evidence for this process in schizophrenia is provided by studies 
examining neuronal oscillations with EEG/MEG recordings [120]. 
For example, gamma-band oscillations in the dorsolateral PFC, 





Fig. 60.3 (see Colour Plate section) Haemodynamic response to mismatch blocks (P <0.05). A corrected cluster size threshold was based on a Monte 
Carlo simulation. First row: healthy controls; second row: schizophrenia patients; third row: difference between healthy controls and schizophrenia 
patients for the contrast mismatch > standard (baseline) condition. Schizophrenia patients exhibited less activity in the auditory cortex and prefrontal 
and salience network (bilateral ACC, medial frontal gyrus, and right insula). Enhanced activity was found in schizophrenia patients in the pre- and post- 
central gyri, compared to the control group. z-co-ordinates refer to the Talairach system. 

Reproduced from Brain, 138(Pt 5), Gaebler AJ, Mathiak K, Koten JW, et al., Auditory mismatch impairments are characterized by core neural dysfunctions in schizophrenia, 


pp. 1410-23, Copyright (2015), with permission from Oxford University Press. 
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which are in the 30- to 80-Hz range and scale with working memory 
load [121], are significantly reduced in schizophrenia patients, com- 
pared to controls [122]. 

A more direct, but static, readout of GABA levels can be obtained 
by magnetic resonance spectroscopy (MRS) (for review, see [123]). 
MRS can be implemented in different ways; most commonly, it fo- 
cuses on hydrogen nuclear spins ('H MRS). These hydrogen nuclear 
spins are reflected by specific radiofrequency signals that are de- 
termined by the chemical environment of the hydrogen spins. This 
allows, in principle, for the detection of certain endogenous metab- 
olites, including GABA and glutamate. 

MRS is increasingly being applied to schizophrenia. For example, 
Yoon et al. (2010) found reduced GABA levels in the visual cortex 
of patients with schizophrenia [124]. This reduction correlated 
with orientation-specific surround suppression, a psychophysical 
measure that is believed to reflect cortical inhibition. Kegeles et al. 
(2012) employed MRS to measure GABA and glutamate levels in the 
dorsolateral and medial PFC, respectively, contrasting both medi- 
cated and non-medicated patients with schizophrenia to healthy 
controls. They found a significant increase of both GABA and glu- 
tamate levels in the medial prefrontal region in unmedicated, but 
not medicated, patients. By contrast, there were no group differences 
in the dorsolateral PFC for either transmitter [125]. 

The practical application of MRS to schizophrenia, however, is 
aggravated by various methodological challenges [126]. One of the 
most obvious practical limitations is that most presently used MRS 
techniques are limited spatially to a small ROI (although alternatives 
are emerging such as chemical exchange saturation transfer (CEST) 
techniques; see, for example, [127]). Variations in the ROIs across 
reports may provide one partial explanation for why the MRS litera- 
ture on alterations of GABA levels in schizophrenia is diverse and 
difficult to reconcile. Put differently, ‘MRS assays of GABA concen- 
tration have yielded equivocal evidence of large-scale alteration in 
GABA concentration’ in schizophrenia [126]. The most comprehen- 
sive review to date that we are aware of (that is, [128]) failed to find 
significant alterations in GABA levels in schizophrenia, as assessed 
by MRS. 


The dysconnection theory of schizophrenia 


The proposition that schizophrenia represents a mental condition 
characterized by cognitive disintegration dates back to Wernicke, 
who proposed that the disorder arises due to a disruption of asso- 
ciation fibre tracts, and later to Bleuler (1911) who described the 
‘splitting’ of mental functions [129]. The dysconnection theory of 
schizophrenia follows this tradition. It arose from the initial thesis 
that positive symptoms in schizophrenia might be explained as a 
consequence of disrupted temporo-prefrontal functional connect- 
ivity, as had been observed in early PET studies [130]. This led to 
conceptualizing schizophrenia initially as a disconnection syn- 
drome [27, 131]. 

However, although most structural and functional connectivity 
analyses in schizophrenia tend to report a decrease in connect- 
ivity, especially with the frontal cortex [132], enhanced connectivity 
is also a frequent finding, as discussed previously. This motivated 
a replacement of the Latin prefix ‘dis’ (apart) by the Greek prefix 
‘dys’ (bad or ill) [133] and led to a more concrete specification of 
the pathophysiology that underlies aberrant functional coupling of 
schizophrenia. Specifically, the dysconnection hypothesis postulates 


that the central disease mechanism in schizophrenia is an aberrant 
neuromodulatory regulation (dopaminergic and/or cholinergic) of 
NMDAR-dependent synaptic plasticity [131]. This explains both 
functional and structural connectivity in schizophrenia as the 
consequences of disturbances in ‘NMDAR-neuromodulator inter- 
actions’ (NNI) [133, 134]. 

Abnormal NNI is consistent with the other pathophysiological 
theories described previously and may offer a unifying theme for 
their integration (for details, see [134]). For example, NUDAR 
hypofunction due to aberrant neuromodulatory regulation can 
explain progressive grey matter loss (as NMDARs provide a po- 
tent neurotrophic signal); it naturally incorporates the NMDAR 
hypofunction and DA theories and it relates to GABAergic dysfunc- 
tion, given the role of the latter in post-synaptic gain control, and 
thus the encoding of precision. Equally importantly, NNI is affected 
by both genetic [135] and environmental (hormonal, immuno- 
logical, and metabolic) risk factors for schizophrenia. This makes 
NNI a key target of gene-environmental interactions and could help 
explain the clinical heterogeneity across patients [134]. 

What distinguishes the dysconnection theory from other the- 
ories of schizophrenia, however, is that it proposes a specific 
computational patho-mechanism—an abnormal hierarchical 
Bayesian inference in the cortex, as a result of aberrant NNI 
[133, 136]. In hierarchical Bayesian theories of brain function, 
such as predictive coding [137, 138], the brain is assumed to con- 
struct a hierarchical model of the world in order to infer the en- 
vironmental causes of its sensory inputs. In this predictive coding 
framework, hierarchically related levels of cortical processing 
streams exchange predictions and prediction errors in order to 
estimate states of the world given sensory input (perceptual in- 
ference) and adjust the model’s predictions (learning). Critically, 
prediction errors are weighted by how uncertain or precise both 
predictions and sensory inputs are. This ensures that perception 
and learning are influenced by the relative reliability of sensory 
information and prior beliefs. 

Importantly, in addition to physiological studies in animals (for 
example, [139]), human pharmacological neuroimaging studies 
have disclosed likely relations between the computational quantities 
in predictive coding and physiological processes (for reviews, see 
[34, 136, 140]). In brief, prediction errors are thought to be conveyed 
by ascending connections via both AMPA (a-amino-3-hydroxyl-5- 
methyl-4-isoxazole propionate) and NMDA receptors, while predic- 
tions are signalled via NMDARs by descending connections. Finally, 
precision weighting is likely implemented by neuromodulatory, as 
well as GABAergic, influences, both of which regulate the gain of 
glutamatergic post-synaptic responses. 

The physiological-computational links suggested by the 
dysconnection theory are important for at least two reasons. Firstly, 
they suggest that the heterogenous spectrum nature of schizo- 
phrenia could be resolved by tools that infer subject-specific abnor- 
malities of NNI from measured brain activity and behaviour [133]. 
Secondly, they highlight the connections of the dysconnection hy- 
pothesis to the other theories of schizophrenia discussed previously. 
In particular, precision provides an important bridge to the theory 
of DA-induced ‘aberrant salience’ in schizophrenia [87]. Put differ- 
ently, precision offers a formal definition of salience that can help 
explain prominent symptoms such as hallucinations and delusions 
[136, 141, 142]. 
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Neuroimaging studies have provided several lines of empirical 
evidence for the physiological (NNI) and computational (hier- 
archical Bayesian inference) aspects of the dysconnection theory. 
Reviews can be found elsewhere [134, 136, 143]; here, for lack of 
space, we focus on three particular aspects from the recent literature. 

Firstly, according to the dysconnection hypothesis, the empiric- 
ally observed reduction in the auditory MMN should be understood 
as resulting from abnormal NNI and reflecting impaired hierarch- 
ical Bayesian inference [133]. This has been corroborated by recent 
studies. For example, computational modelling of empirical EEG 
data demonstrated that a Bayesian inference process [144] provides 
a better explanation of trial-by-trial changes in the auditory MMN 
than conventional theories such as change detection or adaptation. 
Physiologically, reduced MMN in schizophrenia patients is reflected 
by abnormal connectivity in auditory hierarchies, as shown by a 
dynamic causal model (DCM) of EEG data that revealed both ab- 
normal ascending and descending connection strengths at the time 
when surprising stimuli (deviants) were presented [145]. 

Secondly, from the perspective of the dysconnection hypotheses, 
hallucinations would arise naturally when prior expectations (about 
the occurrence of human voices) in higher areas of the auditory hier- 
archy become abnormally tight (for example, due to a cholinergic 
abnormality; cf. [137]). One consequence of such overly precise 
priors would be a down-weighting of prediction errors computed 
in lower areas. This is precisely what was found in a recent fMRI 
study on schizophrenia patients with auditory hallucinations that 
showed a reduced prediction error signal in the auditory cortex 
[146]. Interestingly, the putative hyper-precision of auditory priors 
could reflect a compensatory response to initially abnormally high 
sensory precision. 

Thirdly, an imbalance in the precision weighting of bottom-up sig- 
nals and top-down predictions—in this case, reduced precision of 
predictions relative to sensory precision—would also explain the em- 
pirically observed reduced sensitivity of patients with schizophrenia 
to visual illusions [147]. This has been studied in patients with 
schizophrenia, using an identical ‘hollow mask illusion paradigm for 
both fMRI and EEG. In both modalities, DCM demonstrated that, 
consistent with reduced precision of predictions (in this case about 
facial stimuli), the patients’ pronounced lack of susceptibility to the 
‘hollow mask illusion could be explained by increased strengths of 
bottom-up connections, combined with reduced strengths of top- 
down connections, compared to healthy controls [148, 149]. 


Neuroimaging studies have provided invaluable insights into the 
structural and functional changes associated with schizophrenia. 
In spite of their scientific success, neuroimaging and electrophysio- 
logical techniques have not provided concrete diagnostic tools for 
psychiatry so far [150]. One explanation for this lack of transla- 
tion is that conventional analysis methods have remained mostly 
descriptive—they describe patterns of activations, connections, or 
changes in brain structure, but they do not allow for quantifying the 
probability that a particular disease mechanism is present. This, how- 
ever, is the basis for differential diagnosis and targeted treatment. 

In the following, we briefly outline two complementary strat- 
egies for addressing this problem. A first possibility is to invest in 


the development of neuroimaging techniques that provide more 
direct physiological interpretability than the available structural or 
functional contrasts in MRI. A prime candidate for this purpose is 
MRS. Despite its potential, it has, somewhat surprisingly, not made 
decisive contributions to schizophrenia research so far. Reasons 
include variability in ROIs across studies, as discussed previously, 
but also technical limitations since at lower field strengths, the sep- 
aration of different peaks in the spectrum is difficult. Advances in 
high-field MRI and the development of CEST techniques—which 
make MRI sensitive to concentrations of a host of endogenous me- 
tabolites [151, 152] and provide whole-brain coverage of neuro- 
transmitter systems—are beginning to change the clinical utility of 
MRS, as illustrated by recent applications to epilepsy [127]. 

A second option is to pursue a neuromodelling strategy. This usu- 
ally takes one of two forms: biophysical network models (BNMs) 
and generative models (for review, see [34]). The former consist of 
regional circuit models that are connected by long-range connec- 
tions, typically informed by DWI data (for review, see [153]). While 
BNMs have found application for schizophrenia (for example, [54]), 
their complexity makes parameter estimation from fMRI or EEG 
data difficult; often, only a single global scaling parameter of con- 
nection strengths is estimated. This has restricted the potential of 
BNMs to provide clinical tests. ‘Generative models’ (Fig. 60.4) [163] 
represent an alternative. These describe potential neural mechan- 
isms that could have generated the measured neuroimaging data 
(EEG or fMRI), including noise. Put differently, generative models 
embody a probabilistic mapping from the (hidden) neural states to 
the measured data; this mapping can, in principle, be inverted to 
uncover disease mechanisms in individual patients [154]. One ex- 
ample of such an approach is DCM, which describes the dynamics 
of neuronal populations using differential equations [155]. DCMs 
explain measured brain activity as arising circuit dynamics that is a 
function of: (1) intrinsic connectivity; (2) experimentally induced 
perturbations; and (3) modulatory inputs that invoke contextual 
changes in synaptic strengths (that is, short-term plasticity during 
learning or neuromodulatory influences). In conjunction with suit- 
ably defined observation models, DCMs can be applied to either 
fMRI or electrophysiological data. Importantly, such models re- 
quire a prior distribution, indicating the possible range of parameter 
values. Techniques such as DWI or MRS can be used to provide ana- 
tomical or physiological constraints on the variance of these prior 
distributions in individual patients (for proof of concept, see [156]). 
As has been demonstrated in various validation studies in humans 
and animals (for example, [109, 157-159]), DCM can potentially de- 
tect alterations in glutamatergic, GABAergic, and neuromodulatory 
(for example, dopaminergic and cholinergic) synaptic transmission 
and thus provide a considerably more detailed assessment of patho- 
physiology than a phenomenological account of changes in function 
coupling. 

A complementary approach within a computational approach to 
neuroimaging-based differential diagnosis is provided by generative 
models of behaviour. These can be fitted to individual behavioural 
responses in order to infer on trajectories of computational quan- 
tities such as prediction errors. While this approach has featured 
prominently in the investigation of dopaminergic abnormalities in 
schizophrenia, as discussed previously, recent results based on hier- 
archical Bayesian models have suggested the possibility of assessing 
trial-wise activity in both dopaminergic and cholinergic regions 
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Fig. 60.4 (see Colour Plate section) Generative models for neuroimaging data. 


Generative models applied to neuroimaging data propose that we can in 
Constrained by priors that define neuroanatomical connectivity obtained 


er on neural activity (for example, firing rate) from the measured signal. 


by DWI techniques, generative models comprise neural state equations, 


which describe the hidden dynamics in terms of differential equations that are parameterized, and a forward model that describes a mapping from 
the neural state to the measured signal. To estimate how neural states translate into the measured signal, the model is inverted, that is, fitted to the 
neuroimaging data and the connectivity parameters that generated the observed data are estimated. DWI, diffusion-weighted imaging; M/EEG, 


magneto-/electroencephalography. 
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under the same paradigm [160, 161]. This may provide a powerful 
approach for classifying patients with schizophrenia into different 
NNI subgroups. 

The utility of generative models for detecting physiologically de- 
fined subgroups within schizophrenia has been demonstrated pre- 
viously. For example, Brodersen and colleagues (2014) applied a 
simple DCM of interactions between visual, parietal, and prefrontal 
regions to {MRI data from 41 patients with schizophrenia who per- 
formed a working memory task [162]. Using a variational Gaussian 
mixture model, they found that three clusters provided an optimal 
explanation of variability in connectivity estimates across patients. 
Remarkably, these subgroups, defined in an entirely unsupervised 
way, exhibited significant differences in negative symptoms, thus 
establishing a link between physiology and clinical symptoms. 
While this finding did not constitute a clinically relevant solution, as 
it only predicted symptoms that were already known to the clinician, 
it illustrates the potential of a ‘generative embedding’ approach. The 
hope is that more sophisticated models, such as DCMs capable of 
inferring transmitter-specific synaptic properties from electro- 
physiological data [157], could convey real clinical utility such as 
predictions about treatment responses [34]. 

While generative modelling approaches open up exciting oppor- 
tunities for neuroimaging research on schizophrenia, validation 
studies are needed to determine their clinical utility. In other words, 
regardless of how well a model may capture a putative pathophysi- 
ology, it needs to support differential diagnosis or predict treatment 
response with sufficient accuracy and in individual patients. This can 
only be tested in prospective studies where patients are followed up 
after a clinical intervention and which target clinically relevant ques- 
tions such as predicting the response to specific drugs for psychosis. 
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This chapter focuses on several disorders that show similarities to 
schizophrenia, including schizoaffective disorder, schizotypal per- 
sonality disorder, and brief psychotic states. Areas of emphasis in- 
clude the clinical features, classification, diagnosis, epidemiology, 
course, and prognosis for each disorder. Some aspects will be 
underscored to reflect critical issues such as levels of heterogeneity, 
reliability, and validity. These issues are related to the accurate classi- 
fication of the disorders, which are important for at least three related 
reasons. Firstly, it is essential to develop reliable and valid diagnostic 
criteria in order to study the aetiology of the disorders and then 
utilize that knowledge to develop rational and testable intervention 
strategies. Heterogeneity adds variance to the process that reduces 
both the reliability of diagnosis and also the statistical power of ex- 
perimental designs to detect intervention/treatment effects. 

Secondly, the development of newer generations of psychophar- 
macological treatments holds the promise of matching more ap- 
propriate and efficacious medications with specific syndromes or 
types of symptoms. This trend underscores the importance of dif- 
ferential diagnosis in determining what treatment a patient will re- 
ceive. Heterogeneity within a diagnostic category complicates the 
achievement of this goal. Thirdly, as psychiatry inches closer to the 
inclusion of dimensional and trans-diagnostic models in the con- 
ceptualization, assessment, and treatment of psychopathology, diag- 
nostic reliability and validity are increasingly critical components 
of translational efforts to integrate clinical, behavioural, cognitive, 
biological, and genetic markers/phenotypes across psychiatric dis- 
orders [1, 2]. Each disorder will be considered separately, starting 
with a review of schizoaffective disorder, the most severe of the three 
conditions. 
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Clinical features 


Each edition of the Diagnostic and Statistical Manual of Mental 
Disorders (DSM) described a version of a disorder that afflicts 





patients with both schizophrenic and affective symptoms. The 
term ‘schizoaffective disorder’ first appeared in DSM, third edition 
(DSM-II) in 1980 [3], but without operational diagnostic criteria. 
DSM-II-Revised (DSM-III-R) introduced four diagnostic criteria 
and depressive and bipolar subtypes [4]. DSM-IV retained the diag- 
nostic criteria and added a mixed subtype [5] that did not change in 
the DSM-IV “Text Revision (DSM-IV-TR). Criterion A in DSM-IV 
required an uninterrupted mood episode that could be major de- 
pressive (and must include depressed mood), manic, or mixed in 
nature. During the same period, Criterion B required a period of 
at least 2 weeks with hallucinations or delusions that were not as- 
sociated with prominent mood disturbances. Criterion C specified 
that the symptoms comprising the mood disturbance persisted for 
significant portions of the active and residual stages of the disorder. 
Criterion D required that symptoms of the disorder were not attrib- 
utable to substance use/abuse or to general medical conditions. 

Schizoaffective disorder in DSM-5 does not differ radically from 
its conception in DSM-IV [6, 7] but is defined more stringently. 
Criterion C, in particular, now requires that symptoms of a major 
mood episode must be present for over half of the total duration of 
the active and residual phases of the illness (that is, the entire course 
of illness). This change is also significant because DSM-IV did not 
define the required duration of the mood disturbance. Moreover, 
DSM-5 requires the mood disturbance to meet diagnostic criteria 
for a major mood episode for a majority of the illness. Affective dis- 
orders that do not meet full criteria for a major mood episode do not 
meet the criterion. One effect of the change in Criterion C may be 
to reduce an apparent bias in clinical diagnosis towards less severe 
diagnoses, as shown, for example, by a study that compared clin- 
ical diagnoses with research diagnoses of hospitalized psychiatric 
inpatients [8]. In that study, treating clinicians diagnosed 36% of the 
sample (n = 134) with schizophrenia, 37% with schizoaffective dis- 
order, and 27% with psychotic bipolar disorder. By contrast, diag- 
noses made by trained research personnel using the Structured 
Clinical Interview of DSM-IV-TR included 48% with schizophrenia, 
28% with schizoaffective disorder, and 24% with psychotic bipolar 
disorder. 

Like schizophrenia, Criterion A in DSM-5 schizophrenia must be 
met at some point in schizoaffective disorder. Unlike schizophrenia, 
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Criterion B, which involves functional declines in social, interper- 
sonal relations, or work, and Criterion F, which involves the ex- 
clusion of autistic spectrum disorder or communication disorder 
in childhood, are not required in DSM-5 schizoaffective disorder. 
At any point in time, affective and non-affective symptoms may or 
may not occur simultaneously, which underscores the importance 
of considering the course of the illness longitudinally, in addition to 
its cross-sectional presentation. Symptom clusters that are primarily 
affective, schizophrenic, or mixed predominate at different times. 

Compared to patients with schizophrenia, patients with 
schizoaffective disorder often (though not always) demonstrate 
relatively high levels of premorbid function [9, 10]. This trend 
partly reflects the absence of a required functional decline in 
schizoaffective disorder. Nevertheless, premorbid vulnerabilities in 
multiple cognitive and clinical functions are common [11]. Patients 
with schizoaffective disorder also tend to show more identifiable 
precipitating events. The nature of the precipitating stressor may 
vary widely; for example, it may be physical (for example, recently 
giving birth or experiencing a head injury) or interpersonal (for ex- 
ample, a change in an important relationship). The clinical course 
of the disorder is often characterized by a periodic, rapid onset of 
symptoms that shows a relatively high degree of remission after sev- 
eral weeks or months. As Vaillant pointed out in the 1960s, many 
of these patients ‘recover’ completely after an episode and resume 
their lives at premorbid levels of function [12]. As will be further 
noted later in this chapter, the clinical features of some cases of 
schizoaffective disorder mainly resemble those of schizophrenia, 
while the features of other cases are more similar to those of bi- 
polar disorder. Regardless of the subtype or variant of the disorder, 
however, the mortality rate is of special concern. Rates of death due 
mainly to suicide or accident show elevations in this disorder that 
are similar to those observed in schizophrenia and major affective 
disorders [13]. A recent study showed, however, that even when 
schizophrenia and schizoaffective disorder subjects were matched 
for age, gender, and race, schizoaffective subjects showed higher 
rates of suicide attempts, were hospitalized for suicidality more fre- 
quently, and showed higher rates of anxiety disorders, compared to 
schizophrenia subjects [14]. 

Schizoaffective disorder is more common in females than in males 
[6, 9]. The age of onset varies but tends to be younger than that of 
unipolar or bipolar disorder. Tsuang et al. found the median age of 
onset for schizoaffective disorder was 29 years, which was signifi- 
cantly lower than groups with bipolar or unipolar affective disorder, 
but similar to a group with schizophrenia. Marneros et al. [10] also 
reported that a median age of onset of 29 years for schizoaffective 
disorder was lower than the median age for groups with affective 
disorders (35 years), but reported that it was higher than a group 
with schizophrenia (24 years). In contrast, Reichenberg et al. [11] 
reported no differences in the age of first hospitalization between pa- 
tients with schizophrenia, schizoaffective disorder, or non-psychotic 
bipolar disorder. These differences between studies reflect differ- 
ences in both the diagnostic criteria employed and the heterogeneity 
of the disorder. 


Classification 


The classification of schizoaffective disorder has always been contro- 
versial. Kraepelin reported in 1919 that patients with both affective 
and schizophrenic symptoms complicated the differential diagnosis 


due to the ‘mingling of morbid symptoms of both psychoses. 
Kasanin first employed the term ‘acute schizophrenic psychoses’ in 
1933 to describe a group of patients who experienced a rapid onset 
of emotional turmoil and psychotic symptoms but who recovered 
after several weeks or months [9, 15]. These symptoms appeared 
similar to schizophrenia during periods of exacerbation, but unlike 
schizophrenia, they showed a greater tendency to remit between 
episodes. Kasinin’s view of schizoaffective disorder as a (chronic) 
schizophrenia-like condition with a better outcome was not adopted 
by DSM-5 or earlier editions [7], though it is more consistent gen- 
erally with the notion of acute and transient psychotic disorders de- 
scribed later in this chapter. Nevertheless, ongoing debates about the 
nature of schizoaffective disorder have persisted since Kraepelin’s 
division of psychoses into affective and schizophrenia types, with 
most conceptualizations focusing on the following possibilities: (1) 
it is a type of schizophrenia; (2) it is a type of affective disorder; (3) it 
is a unique disorder that is separate from both schizophrenia and 
bipolar disorder; (4) it reflects an arbitrary categorization of clinical 
symptoms that masks a continuum of pathology between schizo- 
phrenia and affective illness; and (5) it contained a heterogenous col- 
lection of ‘interforms’ between schizophrenia and affective disorder 
(that is, symptoms of both disorders). 

The last possibility is not mutually exclusive of the first four; for 
example, one or more variants of schizoaffective disorder may be 
related closely to schizophrenia, while another may be related more 
closely to an affective disorder. The issue has yet to be resolved. 
Classification problems reflect conceptual difficulties about the na- 
ture of schizoaffective disorder, its diagnosis, and its reliability and 
validity. Compared to schizophrenia, bipolar disorder, and unipolar 
depression, both test-retest reliability and inter-rater reliability are 
low [16, 17]. It is too soon to know whether DSM-5 attempts to 
tighten diagnostic criteria will improve inter-rater reliability, espe- 
cially in clinical settings [8]. Different conceptual assumptions and 
research strategies also lead to different conclusions about the nature 
of the disorder. Several meta-analyses by Baethge and colleagues 
showed, for example, that subjects with schizoaffective disorder 
showed clinical and demographic characteristics that fell between 
schizophrenia and bipolar disorder [18] and between schizophrenia 
and unipolar depression [19]. Levels of heterogeneity were also 
intermediate between schizophrenia and both bipolar disorder and 
unipolar depression [19, 20]. These findings were interpreted as con- 
sistent with the Kraepelinian dichotomy. 

Family and outcome studies provide useful ways of assessing the 
relative merits of each of the possibilities outlined previously. These 
approaches are informative and will be reviewed later in this chapter, 
although interpretations of such studies are complicated at times by 
the use of different diagnostic criteria across investigations. 


Family studies 


Family studies employ a behavioural genetic method that assumes 
that related disorders co-aggregate more frequently in biologic- 
ally related individuals than they do in the general population. 
Thus, a disorder more likely belongs in the schizophrenia spec- 
trum if it occurs more frequently in biological relatives of schizo- 
phrenia patients, compared with appropriate controls. Similarly, 
a disorder more likely belongs in the affective spectrum if it oc- 
curs more frequently among the relatives of patients with affective 
disorders. Consideration of evidence directly pertinent to the 
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question of where schizoaffective disorder belongs in the schizo- 
phrenia spectrum, which includes a growing number of genetic 
studies, is beyond the scope of the current discussion. Instead, 
only representative findings pertinent to the present discussion 
about the clinical classification of schizoaffective disorder will be 
summarized here. 

Bertelsen and Gottesman [21] summarized a series of seven family 
studies published between 1979 and 1993, using structured diag- 
nostic criteria. Analyses of risk to the development of schizophrenia, 
schizoaffective disorder, and affective disorder in the first-degree re- 
latives of patients with schizoaffective disorder were included. In 
all seven studies, the relatives showed a higher risk of developing 
an affective disorder than of developing schizoaffective disorder. 
In five of the seven studies, the risks of developing schizophrenia 
was equal to, or greater than, the risk of developing schizoaffective 
disorder. Thus, the relatives of schizoaffective patients showed gen- 
erally higher risks of developing disorders other than the one with 
which their ill relatives were diagnosed. These findings were con- 
sistent with a heterogenous view of schizoaffective disorder, in 
which individual cases represented subtypes of either schizophrenia 
or affective disorder. The findings were also consistent with the pos- 
sibility that schizoaffective disorder represents a chance collection of 
‘interforms’ between schizophrenia and affective disorder. 

This latter view also found a measure of support in a more recent 
report that utilized the Danish National Register [22] where the rela- 
tive risk of schizoaffective disorder was elevated significantly (rela- 
tive risk = 2.76) if a first-degree relative had any history of mental 
illness. The relative risk increased further if the disorder demon- 
strated by the first-degree relative included schizophrenia (2.57), 
bipolar disorder (3.23), or schizoaffective disorder itself (1.92). 
Notably, the relative risk for schizophrenia was highest when a first- 
degree relative had schizophrenia (3.22), and the relative risk for bi- 
polar disorder was highest when a first-degree relative had bipolar 
disorder (5.19). These findings show further that schizoaffective dis- 
order may be related genetically to both schizophrenia and bipolar 
disorder, either as a subtype or as an interform. 

‘These findings were not consistent with the view that schizoaffective 
disorder represented a continuum between the other two disorders, 
because in that case, the rate of schizoaffective disorder in first- 
degree relatives would have been higher, compared with the rates 
at which these relatives developed schizophrenia or affective dis- 
order. The findings were also inconsistent with the possibility that 
schizoaffective disorder represented a unique disorder that was in- 
dependent of either schizophrenia or an affective disorder. In that 
case, the first-degree relatives of patients with schizoaffective dis- 
order should show relatively high rates of schizoaffective disorder 
itself, but relatively low rates of the other disorders. In a series of 
studies reviewed by Bertelsen and Gottesman [21], the morbid risk 
for schizoaffective disorder itself ranged from 1.8% to 6.1% in first- 
degree relatives of patients with schizoaffective disorder, which was 
still higher than the rate observed in the general population (see 
Epidemiology, p. 612). These results, taken together with the higher 
risks for both schizophrenia and affective disorder, suggest that 
schizoaffective disorder is a heterogenous condition. Recent reviews 
of family studies, including those that considered depressed (that is, 
unipolar) and bipolar subtypes, have also underscored both the het- 
erogeneity of schizoaffective disorder and the controversial nature of 
its classification [23, 24]. 


Outcome studies 


A majority of outcome studies show that schizoaffective disorder has 
a better course than schizophrenia, but a poorer course than affective 
disorder [25-27]. Other researchers reported similar findings. 
Kendler et al., for example, showed intermediate levels of clin- 
ical impairment for schizoaffective disorder in an epidemiological 
family study. Marneros et al. reported on outcomes as part of the 
Cologne Longitudinal Study, using modified DSM-III-R diagnoses 
[28]. The outcomes were measured by symptoms in five dimensions 
(psychotic symptoms, reduction of energetic potential, qualitative 
and quantitative disturbances of affect, and other disturbances of 
behaviour) that persisted for at least 3 years. Consistent with the pat- 
tern described thus far, poor outcomes in the schizoaffective group 
occurred at a rate (49.5% of the sample) that was intermediate be- 
tween those observed in the schizophrenic (93.2%) and affective 
groups (35.8%), and differed significantly from both of them. In a 
more recent study, Jager et al. studied 241 patients at the time of their 
first hospitalization, and then again 15 years later [29, 30]. Again, 
schizoaffective subjects presented a less impaired clinical picture 
than the one shown by schizophrenic subjects, but more impaired 
than the one shown by affective subjects. 

While these studies show schizoaffective disorder to have inter- 
mediate outcomes generally, there are categories in which it resem- 
bles schizophrenia or affective disorder more closely. For example, 
Samson et al. [25] and Reinares et al. [27] noted that outcomes for 
schizoaffective disorder were equivalent to those for affective dis- 
order in several dimensions. Marneros et al. showed that 70% of a 
schizoaffective group were rated as good or excellent on a measure 
of social adjustment, which did not differ significantly from 84% of 
an affective group who received the same rating (reviewed in [27]). 
Both groups differed significantly from a schizophrenic group, how- 
ever, in which only 44% of the group demonstrated good or excel- 
lent outcomes. Moreover, the schizoaffective and affective disorder 
groups did not differ on a rating scale of psychological impairments 
(for example, body language, affect display, conversation skills, and 
co-operation), although both were rated as significantly less im- 
paired than the schizophrenic group. 

Other studies, however, such as Kendler et al. [28], reported simi- 
larities between some types of psychotic symptoms in schizoaffective 
disorder and schizophrenia, including the severity of delusions and 
positive thought disorder and the frequency of hallucinations. Each 
of these groups showed higher levels of these symptoms than an 
affective disorder group. Hizdon et al. reported recently that indi- 
viduals with schizoaffective disorder did not differ from individuals 
with schizophrenia on basic cognitive measures of executive func- 
tion, memory, and processing speed, although the schizoaffective 
group did perform better on measures of social cognition [31]. 
Reichenberg et al. showed that individuals with schizophrenia and 
schizoaffective disorder who were assessed premorbidly performed 
similar to each other, but lower than individuals who later developed 
non-psychotic bipolar disorder, on selected tests of intellectual and 
academic functions [11]. 

These overall differences in outcome serve to validate the clas- 
sification of schizoaffective disorder as a separate syndrome fur- 
ther. Its heterogeneity, however, raises the issue of whether such 
intermediate outcomes might reflect the mean of a combination of 
mainly good and mainly poor outcomes. This, in turn, leads to the 
question of whether schizoaffective disorder can be subtyped in a 
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useful and valid manner. If so, are better and worse outcomes asso- 
ciated with different variants of the syndrome? 

Vaillant suggested in the 1960s that prognostic indicators, 
including a good premorbid level of adjustment, the presence of 
precipitating factors, an acute onset, confusion, the presence of af- 
fective symptoms, and a family history of affective disorder (or the 
absence of a schizophrenic history), could predict remission in 
approximately 80% of cases of ‘remitting schizophrenia [31]. The 
inclusion of affective symptoms and a positive family history for af- 
fective illness on the list contributed (later) to hypotheses that vari- 
ants of schizoaffective disorder were related to affective illness and to 
better outcomes. In contrast, variants associated more with schizo- 
phrenic symptoms or family histories were associated more with 
schizophrenia and relatively poor outcomes. [32]. 

There have been a variety of attempts to subtype schizoaffective 
disorders, based on whether affective or schizophrenic symptoms 
predominate. The validity of many of these attempts, however, is 
inconclusive [21, 28]. Conversely, a latent class analysis of psych- 
otic patients from the Roscommon Study showed that most cases of 
DSM-III-R schizoaffective disorder were categorized in either a bi- 
polar schizomania class (n = 19) or a schizodepression class (n = 13), 
rather than in schizophrenia (n = 1), major depression (n = 0), 
schizophreniform (n = 3), or hebephrenia (n = 3) classes [33]. 
Moreover, Reinares et al. reviewed evidence showing that bipolar 
and depressive subtypes differed from each other in ways consistent 
with differences between bipolar and unipolar affective disorders 
[27]. For example, the bipolar schizoaffective subtype was associ- 
ated with more total episodes, more episodes with shorter periods 
and cycles, and higher frequency of cycles. Higher numbers of cycles 
were associated with poorer long-term outcomes. Taken together, 
these studies show at least some recent support for the subtyping 
of schizoaffective disorder into mainly affective and mainly schizo- 
phrenic variants. 

Other factors associated with poor outcomes include poor inter- 
episode recoveries [28], persistent psychotic symptoms in the ab- 
sence of affective features, poor premorbid social adjustment, 
chronicity, a higher number of schizophrenia-like symptoms [34], 
and the presence of schizoaffective mixed states [27]. 


Differential diagnosis 


The major feature of DSM-5 schizoaffective disorder [6] involves a 
mood disorder occurring concurrently with Criterion A of schizo- 
phrenia for a continuous period of time. Many other disorders that 
present with psychotic or mood symptoms should be considered 
in the differential diagnosis of schizoaffective disorder, including, 
for example, other psychiatric or neurological disorders involving 
psychotic symptoms, acquired substance or toxin-induced psych- 
otic conditions, cognitive disorders, prodromal syndromes, person- 
ality disorders, anxiety disorders, neurodevelopmental disorders, 
and states of delirium. Particular concern involves the differential 
diagnosis between schizoaffective disorder and schizophrenia, and 
between schizoaffective disorder and either bipolar disorder or a 
depressive disorder with psychotic symptoms. In DSM-5, Criterion 
C distinguishes schizoaffective disorder from schizophrenia by re- 
quiring that the prominent mood disorder in schizoaffective disorder 
characterizes over half the period spanning the active and residual 
phases of the disorder. Criterion B distinguishes schizoaffective 


disorder from other mood disorders involving psychosis by re- 
quiring the presence of delusions or hallucinations for at least 2 
weeks in the absence of prominent depressive or manic symptoms. 
Notably, these diagnostic criteria may be met during some phases of 
the disorder, but not others. Thus, the presence of a prominent mood 
disturbance occurring over half the time could shift, for example, to 
occur less than half the time during the course of the disorder, which 
might necessitate a valid change in diagnosis from schizoaffective 
disorder to schizophrenia. 


Epidemiology 
Incidence 


Earlier studies showed that new cases of ‘schizomanic’ patients 
(that is, manic patients who also demonstrated schizophrenic or 
paranoid symptoms) numbered approximately 1.7 per 100,000 per 
year [35]. This was less than the 4 per 100,000 per year shown by 
‘schizodepressive’ patients. The number of schizoaffective cases in 
this study exceeded the number of manic patients and made up half 
of the number of schizophrenia cases. Since then, Tien and Eaton 
analysed data from the Epidemiologic Catchment Area Study for 
three non-overlapping groups with psychotic symptoms [36]. One 
of these groups comprised individuals with ‘psychotic affective syn- 
drome, which was similar to schizoaffective disorder, except that 
most members of the group (59%) demonstrated psychotic symp- 
toms only in conjunction with a mood disturbance (essentially 
DSM-III-R mood disturbance with psychotic symptoms). The in- 
cidence of this disorder was 1.7 per 1000 per year, which was ap- 
proximately equal to the rate for schizophrenia (2.0 per 1000 per 
year). Even if the 59% of the group who met the criteria for a mood 
disorder with psychotic features were excluded, the remaining 41% 
would still comprise a higher incidence rate than that detected by 
earlier studies. Differences in sampling procedures (treated vs non- 
treated samples) may have contributed to the differences observed 
in the rates. More importantly, however, these studies showed that 
schizoaffective disorder occurred at 50-85% of the rate of schizo- 
phrenia, thus confirming that patients with this disorder comprise 
a clinically significant population. One current, but long-standing, 
issue involves questions about the temporal stability of incidence 
rates in schizophrenia-related disorders, as reflected by reports of 
both increases and decreases. 


Prevalence 


Until recently, prevalence estimates for schizoaffective disorder re- 
lied mainly on samples that were treated in clinics or other psychi- 
atric settings. Because a variety of factors influence the decision to 
enter and remain in treatment, the estimates varied substantially. 
For example, Okasha reviewed studies that reported rates varying 
between 2% and 29% [23]. A recent epidemiological study in 
Finland using 8028 people who were at least 30 years old showed a 
lifetime prevalence rate of 0.32% for schizoaffective disorder (com- 
pared to 0.87% for schizophrenia), which accounted for 10.5% of all 
psychotic disorders [37]. This is a lower estimate than many earlier 
studies reported and likely results from a combination of factors 
(including a narrowing of diagnostic criteria and increased utiliza- 
tion of multiple sources of information such as case notes and re- 
gisters, in addition to interview data) that together have improved 
diagnostic accuracy. 
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Review of evidence 


Treatments for schizoaffective disorder are the same as those for 
schizophrenia and affective disorders alone. As the nature and ef- 
ficacy of those treatments are discussed elsewhere, they will not 
be considered here. Rather, this section will focus on management 
issues related to the need to treat symptoms of both disorders simul- 
taneously or sequentially. 


Psychopharmacological treatments may be viewed in terms of DSM- 
5 schizoaffective disorder subtypes, including the bipolar type and 
the depressive type. Treatment of the bipolar type will include dopa- 
mine antagonist medication (for example, clozapine, risperidone, 
quetiapine, ziprasidone, or olanzapine), particularly if psychotic 
symptoms are present. In addition, drugs for unipolar depression, 
mood stabilizers (for example, lithium), or anticonvulsants (for ex- 
ample, valproate or carbamazepine) may be useful with this group. 
It will be necessary in such cases to weigh the potential risks of such 
medications, such as elevated toxicity and metabolic abnormalities, 
against the potential benefits. 

In the depressive subtype, combination treatments may also be 
more effective than a single treatment. Dopamine antagonist treat- 
ments alone may be more efficient, however, if affective symp- 
toms (that is, depression) are largely secondary to the experience 
of having a psychotic condition and its attendant interpersonal, 
social, and financial difficulties. In these cases, remediation of the 
psychotic symptoms may also ameliorate the affective problems. For 
other cases, which include more of a treatment-refractory depres- 
sion, drugs for psychosis may be augmented with a mood stabilizer 
or antidepressant medication. Moreover, electroconvulsive therapy 
may reduce mortality rates in schizoaffective patients. 

As was true in earlier versions of DSM, the bipolar subtype 
(including a manic episode and possibly major depressive epi- 
sodes) may be difficult to distinguish from the depressive subtype 
(including major depressive episodes in the absence of manic epi- 
sodes), especially in the presence of psychotic symptoms. In these 
cases, treatment decisions may rest on the presenting symptoms of 
the patient. At any point, treatment is partly dependent on the pres- 
ence or absence of psychotic symptoms. As noted previously, psych- 
otic episodes in this period are associated with relatively poorer 
outcomes and are likely to require chronic drug therapy. 


Schizotypal personality disorder 


Clinical features 


Like schizoaffective disorder, schizotypal personality disorder (PD) 
is a complex and chronic condition that includes some, but not 
all, of the features of schizoaffective disorder and schizophrenia. 
Changes from DSM-IV to DSM-5 reflect a growing recognition that, 
while this disorder should still be viewed as a personality disorder, 
it should also be classified as a schizophrenia spectrum condition 
[6, 38]. Most notably, persistent psychosis is not part of the syn- 
drome in individuals with schizotypal PD, although milder forms 
of thought disorder may occur such as magical thinking or ideas 


of reference (as opposed to delusions of reference, which indicate 
psychosis). Moreover, brief episodes of psychosis may occur in times 
of stress but will not persist. 

Schizotypal patients show pervasive deficits in social and inter- 
personal traits. They often demonstrate aloofness, poor eye con- 
tact, affective constriction, and suspiciousness. Consequently, close 
interpersonal relationships either are avoided or cause discomfort 
and anxiety. This social anxiety persists despite familiarity and is 
often related to suspiciousness or attenuated psychotic paranoia, 
rather than to fears of negative evaluations by others. Individuals 
with schizotypal PD usually have few friends and demonstrate defi- 
ciencies in social cognition such as problems in sensing social cues 
or affective signals accurately from others. These problems in social 
cognition exacerbate suspiciousness or paranoia and contribute to 
the persistence of anxiety over time. 

Schizotypal patients may show magical thinking, ideas of ref- 
erence, unusual perceptions (for example, sensing the presence of 
another person or that people are talking about them), and/or per- 
ceptual illusions (for example, often perceiving a dimly lit lamp post 
as a person). Both their social deficits and these cognitive-perceptual 
problems contribute to an overall impression of oddness. However, 
this feature may occur independently of other clinical symptoms 
[39] and manifest itself in odd speech or unusual appearance. The 
oddness or eccentricities evident in these patients are often ego- 
syntonic (that is, they are not experienced as problems). Moreover, 
schizotypal patients show deficits in attention, long-term verbal 
memory, and executive functions. These cognitive deficits are quali- 
tatively similar to those seen in schizophrenia (and schizoaffective 
disorder), but like many other clinical manifestations of this dis- 
order, they are quantitatively milder [40, 41]. 

Like schizophrenia, schizotypal PD is often evident by early 
adulthood, but schizotypal traits may be evident in late child- 
hood or adolescence. Once it appears, the disorder tends to show 
a chronic course, but one that includes periodic exacerbations and 
attenuations of symptoms. A study that followed individuals with 
schizotypal PD for 2 years showed that paranoid thoughts and un- 
usual perceptual experiences were among the most stable and least 
malleable DSM-IV symptoms, while the most changeable were odd 
behaviours and restricted affect [40]. The former symptoms were 
thus more trait-like, and the latter were more intermittent. The same 
group also showed that in the course of 2 years (with treatment), 61% 
of schizotypal patients no longer met DSM-IV diagnostic criteria for 
the disorder [42]. With a more stringent definition of improvement 
(12 months with two or fewer symptoms meeting criteria), the rate of 
remission dropped to 23%. These studies show that both the severity 
and the expression of the disorder vary over time and probably as a 
function of treatment. A recent study that assessed twins with either 
schizotypal or paranoid PD over a 10-year period showed much of 
the observed stability of the disorders (about 67%) was attributable 
to shared genetic factors between early and middle adulthood, while 
the effects of environmental risk factors were more transient [43]. 
Consistent with previous studies, subjects with schizotypal PD also 
showed moderate declines in symptom levels over time [44, 45]. 


Classification 


In contrast to the controversy surrounding the classification of 
schizoaffective disorder, family, twin, and adoption studies clearly 
support the view that schizotypal PD is best classified in the 
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schizophrenia spectrum [38, 46, 47]. Nevertheless, it is a complex 
and chronic disorder that, in all likelihood, is also heterogenous. 
Kendler pointed out that this heterogeneity was at least partly related 
to the two primary methods used to study the disorder [48]. One of 
these involves the ‘clinical method; which identifies patients with 
mild forms of schizophrenic or psychotic-like symptoms. This type 
of patient, for example, is often characterized by relatively high levels 
of positive psychiatric symptoms (for example, magical thinking 
and perceptual distortions). Patients identified in this manner often 
comprise a heterogenous group themselves, however, involving pa- 
tients with schizotypal symptoms, together with patients with other 
types of severe personality problems such as borderline personality 
disorder [47]. 

In contrast, the ‘family research method’ identifies relatives of 
patients with schizophrenia who have subtle schizophrenia-like 
symptoms. Features associated more with familial than with clin- 
ical schizotypal PD include a predominance of negative symptoms 
(for example, social withdrawal and impairment, and higher levels 
of anxiety and poor rapport), cognitive or electrophysiological im- 
pairments (for example, impaired language comprehension, eye- 
tracking, and attentional dysfunctions, though all of these also occur 
in clinical PD), and elevated rates of schizophrenia and related dis- 
orders in family members [46]. Thaker et al. reported that familial 
and clinical schizotypal PDs were similar on measures of physical or 
social anhedonia [49]. 

The concept of familial schizotypal disorder is particularly im- 
portant because it may share a common genetic basis with schizo- 
phrenia. Paul Meehl first proposed the term ‘schizotaxia to describe 
the genetic vulnerability to schizophrenia and suggested that indi- 
viduals with schizotaxia would eventually develop either schizotypal 
PD or schizophrenia, depending on the protection or liability af- 
forded by environmental circumstances [50]. As the concept evolved, 
Meehl reformulated it to allow for the possibility that some people 
with schizotaxia would develop neither schizophrenia nor schizo- 
typal PD. In fact, evidence now shows that the clinical symptoms ob- 
served in many non-psychotic first-degree relatives of people with 
schizophrenia are similar to those observed in familial schizotypal 
PD [46, 51]. Psychiatric features in such relatives frequently include 
an aggregation of negative symptoms that are qualitatively similar 
to, but milder than, those often cited in schizophrenia. Positive 
symptoms, however, are usually less evident in these relatives than 
they are in schizophrenia or schizotypal PD. Neuropsychological 
impairments in biological relatives of people with schizophrenia are 
also qualitatively similar to, but milder than, those seen in people 
with schizophrenia [41]. 

Faraone et al. suggested a reformulation of Meehl’s concept of 
schizotaxia that focuses on these features of negative symptoms and 
neuropsychological deficits [46]. Unlike schizotypal PD, which oc- 
curs in less than 10% of adult relatives of patients diagnosed with 
schizophrenia, the basic symptoms of schizotaxia occur in 20-50% 
of adult relatives, suggesting further that the genetic liability to 
schizophrenia does not lead inevitably to schizophrenia, schizotypal 
PD, or schizoid PD. More recent studies showed that operational 
definitions of schizotaxia in non-psychotic adult relatives, based on 
neuropsychological deficits and negative symptoms, were supported 
by cluster analyses and were partially amenable to pharmacological 
intervention [41]. 


Diagnosis and differential diagnosis 


The DSM-5 diagnostic criteria for schizotypal PD do not differ 
substantially from the criteria used in DSM-IV. While an alternative, 
more dimensional model of PDs is included in the DSM-5 manual 
that structures these symptoms somewhat differently, the categor- 
ical system of previous versions remains in place [6]. In this con- 
text, DSM-5 Criterion A includes deficits in personality that involves 
ideas (but not delusions) of reference (A1), unusual ideas or types of 
thought such as magical thinking (unrelated to cultural norms) or 
beliefs in special powers such as clairvoyance (A2), unusual percep- 
tual experiences such as illusions or transitory hallucinations (A3), 
unusual forms of thinking or speech (such as frequent use of vague, 
tangential, metaphorical, or stereotyped language) (A4), frequent 
suspiciousness or paranoid thoughts (A5), constricted or inappro- 
priate affect (A6), odd or unusual behaviours, manners of dress or 
other aspects of appearance (A7), constricted interpersonal rela- 
tionships that often include few, if any, close friends or other non- 
familial relationships (A8), and acute anxiety in social situations that 
does not decline with familiarity and is related primarily to suspi- 
ciousness of others, rather than to self-consciousness or fear of nega- 
tive evaluations by others (as is typical in social anxiety) (A9). 

As in DSM-IV, five out of the nine symptoms are required to 
meet Criterion A. These symptoms emerge by early adulthood and 
occur in a variety of contexts. Criterion B requires that symptoms 
in Criterion A do not occur only during the periods of disorders 
involving sustained psychosis, such as schizophrenia, bipolar dis- 
order, or depression with psychotic features, or autism spectrum 
disorder (ASD). 

The differential diagnosis includes a variety of other disorders. 
A key difference between schizotypal PD and schizophrenia, psych- 
otic mood disorders, or presentations of psychosis in ASD involves 
the transient nature of psychotic symptoms in schizotypal PD. 
It may be distinguished from ASD (though sometimes with diffi- 
culty) by the relatively greater deficits in social awareness, lack of 
emotional reciprocity, or frequent presence of stereotyped behav- 
iours or interests in ASD. Schizotypal PD may be distinguished 
from neurodevelopmental disorders involving language or other 
forms of communication by the presence of relatively more specific 
problems in language reception or production that are usually iden- 
tified by specialized testing. Such problems are not typically present 
in schizotypal PD. It also differs from communication disorders by 
its oddness of its content, rather than by the deficiencies of its form. 

Schizotypal PD may be confused with several other PDs but can be 
distinguished from them. It differs from schizoid and from paranoid 
PD, for example, by its pattern of cognitive and perceptual distor- 
tions and by the eccentricities in appearance, language, or behaviour 
shown frequently by schizotypal patients. Schizotypal individuals 
who show transient psychotic symptoms may resemble individuals 
with borderline PD, but the pattern differs in that psychotic-like 
symptoms and social isolation in schizotypal PD are more likely to 
persist in the absence of affective turmoil, and schizotypal individ- 
uals are less likely to display the impulsive and manipulative traits 
that are often associated with borderline PD. Some individuals with 
narcissistic PD may demonstrate suspiciousness or social alienation, 
but, unlike in schizotypal PD, these symptoms/traits are related to 
underlying fears of exposing imperfections or flaws. Individuals with 
avoidant PD may be aversive to, or anxious in, social interactions, 
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but it reflects a fear of rejection in the context of a desire for such 
relationships in avoidant PD, whereas it reflects suspicion and a lack 
of desire for such relationships in schizotypal PD. 


Epidemiology 
Incidence 


A literature search did not identify incidence studies for 
schizotypal PD. 


Prevalence 


A review by Tsuang et al. showed prevalence rates in non-clinical 
samples ranged from 0.7% to 5.1%, with a median nearing 3.0% [52]. 
Higher rates occurred in clinical samples—2.0-64.0%, with a me- 
dian of 17.5%. More recently, Torgersen et al. reported a rate of 0.6% 
for DSM-III-R in a community sample, which is lower than the rates 
in studies reviewed by Tsuang et al. [53]. In contrast to non-clinical 
samples, the prevalence of schizotypal PD among the relatives of 
schizophrenic individuals is as high as 10%. More recently, the life- 
time prevalence of schizotypal PD in the United States was reported 
to be near 4.0%, with modestly higher rates for men (4.2%) than 
women (3.7%) [54]. 


Treatment 


There is unfortunately a continuing dearth of outcome studies 
involving psychotherapy, psychosocial, or psychopharmacological 
treatments for schizotypal PD [47]. Older published studies often 
show methodological limitations (for example, small samples, 
subjects with mixed diagnoses, inadequate controls, and problems 
with internal validity) or provide outcome data on only limited 
aspects of the disorder. Despite these caveats, it is clear that few 
treatment gains are evident in earlier studies [55]. Recent evidence 
for the efficacy of psychotherapy for PDs is more promising but is 
limited mainly to other PDs [56]. 

Several earlier studies investigated the usefulness of medications 
in treating schizotypal PD, although they typically employed small 
numbers of subjects, combined samples of schizotypal and border- 
line PDs, and showed little clinical improvement. Dopamine antag- 
onist drugs were proposed to reduce positive symptoms or depressed 
mood in times of acute stress, but the high incidence of adverse side 
effects discouraged their widespread use at other times, including 
the more chronic stable (that is, non-crisis) phases of the disorder. 
Other types of medication, including fluoxetine, have shown gener- 
ally non-specific effects of treatment. 

Hymowitz et al. administered a low dose of haloperidol to 17 out- 
patients with DSM-III diagnoses of schizotypal PD for 6 weeks [57]. 
The initial dose of 2.0 mg was intended to rise to 12.0 mg, but side 
effects prevented increases beyond a mean dose of 3.6 mg. Even with 
lower doses, 50% of the sample withdrew from the study because of 
side effects. The 17 subjects who completed 2 weeks of the protocol 
improved somewhat in ratings of ideas of reference, odd communi- 
cations, social isolation, and overall functioning. 

More recently, Koenigisberg et al. employed a double-blind 
protocol to administer low doses of risperidone (0.25-2.0 mg/day), a 
medication with serotonin antagonist properties, to 25 patients with 
DSM-IV schizotypal PD, for 9 weeks [58]. Compared to a placebo 
control group, patients who received risperidone demonstrated 
significant reductions in positive and negative symptoms, with 


no difference in dropout rates between groups. These findings are 
encouraging and consistent with evidence described previously that 
schizotypal symptoms are amenable to change [40, 42]. Hopefully, 
findings like these will stimulate additional research into pharmaco- 
logical treatments for this disorder. 


Management 


Though trust and rapport with a therapist are often difficult to estab- 
lish in schizotypal PD, the therapeutic relationship may be used to 
mitigate the marked deficits in interpersonal relationships that char- 
acterize this syndrome [58]. The frequent occurrence of paranoia 
and suspiciousness, together with social aloofness and constricted 
affect, may make exploratory psychotherapeutic approaches less ef- 
fective than supportive cognitive behavioural therapies. In fact, these 
patients may only seek treatment to alleviate circumscribed prob- 
lems like anxiety or somatic complaints. Approaches that emphasize 
concrete interim goals and stipulate explicit means of attaining them 
thus have the best chances of success. Because individuals with this 
disorder are vulnerable to decompensation during times of stress 
and may experience transient episodes of psychosis, they may also 
benefit from techniques to facilitate stress reduction (for example, 
relaxation techniques, exercise, yoga, and meditation). Fortunately, 
some people with schizotypal features are likely to seek treatment in 
times of stress [59, 60]. In the short term, brief courses of dopamine 
antagonist treatment may be useful if symptoms of psychosis appear. 
It is important to note, however, that individuals with schizotypal 
PD may receive unwarranted diagnoses of schizophrenia and re- 
ceive longer-term treatment [47]. 

Cognitive problems are also frequently amenable to concrete 
goal-oriented approaches to treatment. Patients benefit from under- 
standing their cognitive strengths and weaknesses because it helps 
them confront and cope with long-standing difficulties in their 
lives. For example, problems in attention, verbal memory, or organ- 
izational skills contribute to failures in educational, occupational, 
and social endeavours, while reinforcing negative self-images and 
increasing performance anxiety. Knowledge of circumscribed cog- 
nitive problems allows patients to reframe their difficulties in a more 
positive manner and facilitate the selection of realistic personal, 
educational, and occupational goals. While cognitive deficits in 
schizotypal PD are well established [47], there have been few trials 
of medications to improve cognition, though the administration of 
guanfacine and dopamine agonists may show promise. 


Brief psychotic disorder 


Clinical features 


Brief psychotic episodes have long been a source of interest [61] but 
have defied simple classifications. Consequently, different versions 
of this condition have been proposed and utilized. Brief psychotic 
disorder in DSM-5 involves at least one symptom from a group 
that includes delusions, hallucinations, disorganized speech, and/ 
or grossly disorganized or catatonic behaviour [6]. The onset of 
these symptoms is rapid, with the transition from a non-psychotic 
state to a psychotic one occurring within 2 weeks. Typically, a pro- 
dromal period is not present. The duration of symptoms lasts from 
1 day to 1 month. This diagnosis does not require associated social 
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or occupational dysfunction. Symptoms may not be attributable 
to substance abuse or therapeutic medications or to other medical 
conditions, nor can they be explained better by other psychiatric 
disorders that involve psychotic symptoms such as schizophrenia 
or bipolar disorder. Specifiers may be added that note that the epi- 
sode followed a stressful incident (that is, with marked stressors) or 
did not (that is, without marked stressors). The episode may also be 
specified as post-partum, meaning it occurred during pregnancy or 
within a month of giving birth, and it may be specified as involving 
catatonia. The severity of the episode may be recorded on a 5-point 
scale that ranges from 0 (absent) to 4 for the previous 7 days. At the 
conclusion of the episode, individuals return to premorbid levels of 
function. 

The International Classification of Diseases, tenth edition (ICD- 
10) defines a similar diagnostic category—the acute and transient 
psychotic disorders (ATPDs), with six subtypes [62]. One difference 
involves a longer period, compared to the DSM-5 version (3 months, 
compared to 1 month). ATPDs should also show a fully developed 
episode within 2 weeks. 

In addition to classifications as psychotic disorders, brief psych- 
otic episodes are also classified as clinical high-risk states. The 
Structured Interview for Prodromal Syndromes (SIPS) operation- 
alizes a syndrome [Brief Intermittent Psychotic Symptoms (BIPS)] 
which may last up to 3 months and include at least one symptom 
of psychosis (at least one of the scales P1-P5 is rated a 6) in that 
period. The symptom(s) must last at least several minutes per day at 
least once per month, but less than 1 hour per day for at least 4 days 
per week for the last month [63, 64]. No decline in social or occu- 
pational functioning is required to meet the diagnostic criteria for 
this syndrome. The Comprehensive Assessment of At-Risk Mental 
State (CAARMS) defines a similar clinical high-risk syndrome 
[Brief Limited Intermittent Psychotic Symptoms (BLIPS)] that 
also requires at least one symptom of psychosis (at least one of the 
scales P1, P2, or P4 is rated 6 or P3 is rated 25) [65-67]. CAARMS 
BLIPS symptoms differ from those in the SIPS BIPS syndrome in 
that symptoms should have been evident in the previous 12 months, 
though not for more than 5 years. They occur 3-4 times per week 
when they persist for at least an hour, or they occur daily when they 
persist for less than an hour. Unlike BIPS, CAARMS symptoms are 
associated with at least a 30% drop in functioning on the Social 
and Occupational Functioning Assessment Scale (SOFAS), lasting 
at least 1 month in the previous year, or show a low score for over 
a year. 

The risk of recurrent psychotic features in individuals who meet 
criteria for either the disorders described previously or the clinical 
high-risk syndromes is significant, though not as high as individuals 
who demonstrate first-episode schizophrenia [61, 68]. Differences 
between rates of recurrence in these different brief conditions 
are small. 


Differential diagnosis 


Brief psychotic disorders may be differentiated from chronic dis- 
orders with psychotic features, such as schizophrenia, bipolar 
disorder, and depression with psychotic features, both by the per- 
sistence of the symptoms and the loss of function in multiple do- 
mains and by the failure to resolve fully, even with treatment. It 
is important to note, however, that some individuals, despite the 


generally favourable outcomes associated with brief psychotic symp- 
toms, may develop a chronic psychotic disorder subsequently (espe- 
cially those with longer symptom durations such as BLIPS cases). 
Problems involving substance abuse, traumatic brain injury, de- 
lirium, and other medical conditions should be considered and as- 
sessed. If forensic circumstances exist, the possibility of malingering 
or other forms of symptom magnification should be considered. 
Some PDs may also present with brief psychotic symptoms that may 
be distinguished (for example, borderline PD) by their chronicity 
and symptom history. 


Management 


In line with the brief course of these symptoms/disorders, crisis man- 
agement is a first line of response and may include short-term hos- 
pitalization and treatments with dopamine antagonist medications. 
Behavioural treatments, such as cognitive behavioural therapy, may 
be useful to reduce or minimize stress and emotional dysregulation 
and thereby reduce the likelihood of future episodes. Nevertheless, 
it will be important to monitor individuals who have demonstrated 
brief psychotic disorders or episodes to assess their stability of clin- 
ical functions and the possible emergence of additional brief or 
chronic psychotic disorders. 


Possibilities for prevention 


Current early intervention programmes involve secondary preven- 
tion, which includes early identification and treatment of clinical 
(usually psychotic or psychotic-like) symptoms. While intervention 
is necessary to alleviate clinical symptoms at any point during the 
disorder, it is particularly important early on because it might alter 
the trajectory of illness. Patients treated with drugs for psychosis 
during their first or second hospital admission, for example, show 
better outcomes than those who are not treated until later. 

Primary prevention, which involves treatment before the dis- 
order manifests itself clinically, is not yet available for schizophrenia, 
schizoaffective disorder, schizotypal PD, brief psychotic disorder, 
or other disorders in the schizophrenia spectrum. To develop such 
treatments, it will be necessary to predict first who is most likely to 
develop a disorder and who is not. There are a few encouraging ap- 
proaches, including ongoing genetic ‘high-risk’ studies that follow 
the offspring of schizophrenic parents longitudinally. Such studies 
help to identify traits early in life that predict which individuals 
are most likely to experience emergent clinical symptoms in adult- 
hood. This type of study is particularly important because it can 
facilitate the formation of homogenous high-risk groups, which, 
in turn, can facilitate the development of focused prevention strat- 
egies. Similarly, longitudinal studies of clinical high-risk patients 
also provide critical information about the nature of ‘conversion’ 
to psychosis. 

With our current knowledge, however, it is difficult to justify 
preventive treatments—especially medication—for people without 
symptoms. The authors have argued elsewhere, however, that if 
people in high-risk groups (like first-degree biological relatives of 
patients with schizophrenia, or clinical high-risk patients) show clin- 
ically meaningful symptoms that can be organized into valid liability 
syndromes, then intervention attempts may become appropriate 
[46, 51]. The authors proposed this course of action for people with 
‘schizotaxia and suggested preliminary research guidelines [69]. 
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Delusional disorder is a psychotic disorder, characterized by well- 
systematized and long-lasting delusions, whereas other mental and 
personality domains usually remain intact. 

Although modern diagnostic systems (DSM-5 297.1, ICD-10 F22, 
and presumably also ICD-11) [1-3] categorize it in a spectrum with 
schizophrenia and other psychotic disorders, it is autochthonous 
and autonomous, that is, a disorder by its own and independent with 
its specific clinical features and rules. A shift to schizophrenia in the 
course of the disease is rare [4, 5]. 








There is a plethora of literature dealing with delusional disorders and 
paranoia, most of which is historical or in the form of case reports. 
It is beyond the scope of this chapter to deal with it in its entirety, 
so the basic conclusions drawn from it are included. Many of these 
original papers are quoted within references provided in this chapter 
[4-6]. The concept we nowadays call delusional disorder has a long 
history, as long as psychiatry itself. The term delusional disorder pre- 
sents a new name for the old concept of paranoia and was ‘officially’ 
introduced with DSM-III-R in 1987, but it was created by George 
Winokur as early as in 1977 [7]. Delusional disorder, in fact, is 
nothing else than the latest successor, or the new name, of paranoia 
as defined in their final version by Emil Kraepelin in 1915, which 
was the successor of concepts like ‘monomania’ in French psych- 
iatry and ‘primary insanity’ in German psychiatry [4]. The concept 
of paranoia has a long history which is closely connected with the 
development of psychiatry itself. Nevertheless, the debate on, and 
the interest in, paranoia lost its vim and vigour after the first decades 
of the twentieth century and came to a near standstill in the middle 
of that century. A wave of overdiagnosing schizophrenia was one of 
the main reasons of its marginalization, as well as the beginning of 
operationalized diagnostics and the introduction of new treatments, 
which are mostly refused by patients with paranoia. As mentioned 
previously, George Winokur introduced the term delusional dis- 
order in 1977, aiming to bring clarity to the equivocal and diffuse 
use of the term paranoia, especially in the anglophone literature. 





Winokur’s term covers, in fact, the same as Emil Kraepelin’s concept 
of paranoia. 








The ICD-10 category ‘persistent delusional disorder’ is likely to 
change into ‘delusional disorder’ in ICD-11 [3]. However, the major 
diagnostic criteria, according to ICD-11 Beta Draft, will probably re- 
main unchanged, with the addition of a specifier for the present state 
of the condition (symptomatic—partial remission—full remission) 
(see Box 62.1). 

The diagnostic criteria of DSM-5 are very similar to that of ICD. 
The most important difference regards the duration of delusions 
(ICD at least 3 months; DSM-5 at least 1 month). 








Prevalence and demography of delusional disorder within the gen- 
eral population are not exactly known. The base rate of delusional 
disorder is unclear. The most important reasons why this psych- 
otic condition is difficult to uncover and little is known so far about 
treatment, prognosis, etc. are patients with delusional disorder do 
not feel ill or disabled and many other psychosocial abilities usu- 
ally remain intact, so they do not seek medical or psychological 
help. In addition, the disorder is usually monosymptomatic and 
monothematic, so distinction between delusions, overvalued ideas, 
and reality is difficult, sometimes even impossible. The impairments 
in psychosocial functioning can be more circumscribed, as seen in 
other psychotic disorders; behaviour is not obviously bizzare or odd, 
and patients often show sufficient integration in relevant domains 
of social life, so there is no necessity for their families or authorities 
to initiate contact with mental services. However, boundaries be- 
tween delusional disorder, other psychotic disorders, and paranoid 
personality are often elusive. The fact that delusions are common in 
a number of different mental disorders makes it difficult to estimate 
the prevalence of pure delusional disorder. Epidemiological investi- 
gations reporting on ‘paranoid’ symptoms in the general population 
refer to a great spectrum of phenomena, which are rarely symptoms 
of delusional disorder [4]. 


(210) 
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Box 62.1 Persistent delusional disorder in ICD-10 (F22) 


(presumably delusional disorder in ICD-11) 





A. A delusion or a set of related delusions, other than those listed as 
typically schizophrenic in criterion G1(1)b or d for F20.0-F20.3 
(that is, other than completely impossible or culturally inappro- 
priate), must be present. The most common examples are perse- 
cutory, grandiose, hypochondriacal, jealous (zelotypic), or erotic 
delusions. 

B. The delusion(s) in criterion A must be present for at least 3 months. 


C. The general criteria for schizophrenia (F20.0-F20.3) are not 
fulfilled. 


D. There must be no persistent hallucinations in any modality 
but there may be transitory or occasional auditory hallu- 
cinations that are not in the third person or giving a running 
commentary). 





E. Depressive symptoms (or even a depressive episode (F32) may be 
present intermittently, provided that the delusions persist at times 
when there is no disturbance of mood. 





F Most commonly used exclusion clause. There must be no evidence 
of primary or secondary organic mental disorder, as listed under 
FOO-FO9, or of a psychotic disorder due to psychoactive sub- 
stance use (F1x.5). 

Reproduced from World Health Organization, The ICD-10 Classification of Mental 


and Behavioural Disorders, Copyright (1996), with permission from World Health 
Organization. 











Nevertheless, based on available findings, the lifetime prevalence 
of delusional disorder in the general population may be estimated 
to be somewhat between 0.03% and 0.2%, while their prevalence 
in a clinical population is approximately 0.5-1% of all admissions 
(older investigations reported up to 4%) and approximately 2-3% 
of all psychotic disorders. Reported gender differences are not 
significant. Age at onset is usually middle to late adult life, with 
a peak between ages 35 and 50. Concerning sociodemographic 
characteristics, the majority of patients are married or in a stable 
partnership and they represent all social classes and levels of edu- 
cation, though lower socio-educational levels and, among these 
patients, immigrants, are more frequently represented. Similar 
to other psychotic disorders, many patients come from a broken 
home [4, 8-11]. 


Aetiopathogenesis 


Various efforts are made to explain how delusional disorder arises, 
yet all are mainly speculative. The main approaches include the 
following: 


1. The phenomenological approach describes preconditions and 
pre-delusional states, as well as formation, consolidation, and re- 
covery or chronicity of delusional states [4, 12]. 

2. The psychodynamic approach tries to explain delusional dis- 
orders (paranoia) as a defence against awareness of vulnerability, 
having projection as the main mechanism. Libidinous alloca- 
tions being connected to homosexuality, anality, etc. play an im- 
portant role [4]. 


3. The neuropsychological approach uses cognitive deficit theories, 
involving cognitions, attributions, and emotions as an explana- 
tory model [4]. 

4. Other psychological approaches assume that interactions of spe- 
cial personality features, stressful life events, and the social envir- 
onment are essential for the development of delusional disorder. 
The many findings regarding personality features in patients with 
delusional disorder are inconsistent. According to psychological 
approaches, patients with delusional disorder often show strong 
introversion and anancastic features, little openness to experi- 
ence, lack of concern for others, and lability of self-esteem and of 
self-concepts [4]. 

5. The genetic approach contributes only marginally, mainly be- 
cause of very small samples [13]. 

6. The same applies to other biological approaches. Some inves- 
tigations provide evidence of brain abnormality in the medial 
frontal/anterior cingulate cortex and insula [14]. But it is not 
certain whether such findings are specific for delusional dis- 
order. Some delusional types can be present in brain dam- 
ages, but that is an exclusion criterion for the autochthonous 
disorder [4]. 





Symptoms and differential diagnosis 


The disorder, as already pointed out, is usually monosymptomatic, 
having as main symptom autochthonous, monothematic delusions 
of long duration. This means that: (1) the delusions are not associ- 
ated, derivable, or embedded into a net of other psychopathological 
symptoms; some other symptoms, like hallucinations, can occur, 
although very rarely, but they are transient, not intensive, and de- 
rivable from the delusions—actually they are delusional misinter- 
pretations, and not real hallucinations; (2) the delusions usually 
deal exclusively or predominantly with only one theme, that is, they 
are monothematic; only seldom—in a mixed type—more than one 
topic is evident with equal presence and intensity, none of which 
predominating over the others; the most common theme is perse- 
cution and related ideas; (3) their duration is long—months, years, 
or even decades, sometimes lifelong; (4) the consciousness is always 
clear and signs of dementia or of any other central nervous system 
disease are not present; and (5) they are not due to medical condi- 
tions or substances. 

The first and second of these aspects differentiate delusional dis- 
order from schizophrenia, schizoaffective disorder, psychotic bipolar 
or depressive disorders and (together with the third aspect) the poly- 
morphous type of brief psychotic disorder [4, 6]. If occasionally mood 
episodes occur, the delusions are of longer duration and not deriv- 
able from the mood symptoms. In cases of obsessive-compulsive 
disorder with lack of insight respectively with delusional beliefs, the 
obsessive-compulsive behaviour is the differentiating feature. But 
one has to be aware that some types of delusional disorder, especially 
the querulous-litigious form, can be associated with obsessive be- 
haviour. In paranoid personality disorder—in contrast to delusional 
disorder—paranoid thinking is usually diffuse and affects many 
areas of life; it does not have a priori evidence and is accompanied by 
a general suspiciousness during the lifespan [4]. The fourth and fifth 
aspects differentiate delusional disorder from organic psychosis. 


Subtypes 


The DSM-5 identifies seven subtypes of delusional disorder— 
erotomanic, grandiose, jealous, persecutory, somatic, mixed, and 
unspecified type, while ICD-10 also names a litigious and a self- 
referential type. But according to other views, the litigious form be- 
longs to the persecutory type, whereas the self-referential aspect is 
associated with all types [4, 5]. However, all subtypes basically focus 
on the following three ‘arche-themes, that is, existential, essential, 
and universal groups of concerns, fears, and desires of human beings: 


e Sexuality and mating: focusing on Eros. 

e Fears or desires regarding one’s own body as carrier of health, 
image, performance, and reproduction: focusing on Soma. 

e Survival and safety: focusing on Security. 


Delusional disorder can be therefore divided, according to their 
‘arche-theme;, into three groups [4]: 


e Erotocentric (erotomania, jealousy), 

e Securocentric (persecutory and related delusions, Capgras syn- 
drome, etc.), and 

e Somatocentric (delusions of health threat, parasitosis, dys- 
morphia, own odour, pregnancy, Cotard’s syndrome, etc.). 


Erotocentric group 
Erotomanic type 


Patients with delusions of erotomanic type are imperturbably con- 
vinced of being loved by someone, mostly by a celebrity or someone 
of higher social standing. That person usually is completely un- 
aware and out of reach for the erotomanic, which may cause intense 
stalking behaviour from the patient [4, 5]. 

Although this syndrome has been known since the ancient Greek 
period, it found its first scientific reference as an autonomous syn- 
drome by the German psychiatrist Ernst Kretschmer in 1918 [4]. 
The detailed descriptions of the French psychiatrist de Clérambault, 
however, contributed to connecting erotomania to his name (‘de 
Clérambault syndrome’), although he mainly described patients 
with delusions of being loved as a symptom of schizophrenia 
[15]. Erotomanic and jealous types can occur together, creating a 
common erotocentric group [4]. 

The epidemiology of the erotomanic type is unknown but seems to 
be rare. Delusions of being loved, however, can be found as symp- 
toms in other psychotic conditions. The “Halle study on Delusional 
Disorders’ [4, 10, 11] found that out of 9969 inpatients at the Halle 
Psychiatric University Clinic in Germany between 1994 and 2007, 
and among 4753 non-organic psychotic disorders, only two patients 
had an erotomanic subtype of delusional disorder, both of whom 
were men. Although an earlier study [16] identified 15 patients with 
erotomanic delusions in a catchment area of approximately 400,000 
people within a year, only three of these patients had an erotomanic 
delusional disorder. 

Summarizing the data available, it can be concluded that the 
erotomanic type of delusional disorder is rare, more frequently— 
but not exclusively—occurring in unmarried females, with the first 
manifestation usually after the age of 30. But young men, even when 
married, may also be affected. The object of love usually is het- 
erosexual, sometimes also homosexual. Stalking behaviour is not 
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unusual. The delusion may occur suddenly or develop gradually, 
and although the course and outcome are uncertain, they are usu- 
ally long-lasting; sometimes the disorder may last several decades 
[4, 5, 17]. 


Jealous type 


‘The jealous type is characterized by the delusion that the patient’s own 
sexual partner is being unfaithful. Delusional jealousy can also be a 
symptom of other psychotic disorders. As already mentioned, it can 
co-occur with the erotomanic type, building a common erotocentric 
group [4]. 

A synonym of the disorder is Othello syndrome, after the character 
in Shakespeare's drama Othello who murdered his wife, based on his 
false belief that she had been disloyal. However, it is wrong to name 
delusional jealousy Othello syndrome. Othello’s belief that his wife 
was not faithful to him was not delusional, but based on false facts 
deliberately arranged by others, especially by Jago [4]. Othello never 
had delusions of jealousy. 

Patients’ beliefs are sometimes bizarre or obscure; for example, 
even a high age of 80 or even higher is not an obstacle. Patients change 
their usual behaviour, become irritable, and show rough, aggressive, 
or even violent behaviour towards the partner. Their conviction 
cannot be disproven by any counter-argument or counter-evidence. 

The prevalence of the jealous type is hard to be estimated for 
the same reasons as for other delusional disorders. Presumably 
it is approximately less than 10% within the group of delusional 
disorder. It remains unclear if there is a gender imbalance; some 
studies found more females, others more males. Previous as- 
sumptions that delusional jealousy is exclusively or predomin- 
antly an alcoholic syndrome could not be confirmed by modern 
authors [4, 5]. It is important to keep in mind that the disorder 
tends to be chronic and may be dangerous for the partner or the 
delusional rival. 


Securocentric group 
Persecutory and related types 


The persecutory type is characterized by the delusion of being perse- 
cuted or harmed. The theme of persecution and the kind of persecu- 
tors are very broad. In the eyes of the deluded individual, everything 
can be perceived as an indication of persecution and everybody as a 
persecutor, even persons trying to help. Patients tend to relate mu- 
tual activities of everyday life to themselves (as self-reverential delu- 
sions). Patients experience problems in interpersonal relationships 
and sometimes with the law. In some rare cases, the subject of perse- 
cution is not the patient himself, but a related person, hence ‘perse- 
cutory delusional disorder by proxy. 

The phenomenology of persecutory delusional disorder can be 
classified into three types, according to the patients behaviour and 
reactions [4]: 


1. The flight type. 
2. The fight type. 
3. The height type. 


The ‘flight type’ is the most common and is characterized by 
‘fleeing from the danger, that is avoidance, isolation, and anxiety. 

The ‘fight type’ is characterized by sthenic reactions, aggression, 
and sometimes also violence. It can be divided into two subtypes: 
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1. The ‘assaultive’ subtype is characterized by violent accusations 
against the assumed persecutors. 

2. The ‘querulous-litigious’ subtype is characterized by convictions 
of being treated unjustly and incorrectly and of being discrim- 
inated against, insofar patients feel persecuted by institutions or 
persons, or even by the legal system. 


The ‘height type’ is characterized by compensation of delusional 
persecutions by delusions of grandiosity. 

Regarding prevalence and demography, the majority of cases of 
delusional disorder (approximately 60%) are included among the 
persecutory type, affecting more frequently men. Psychosocial ad- 
versity, like perceived ethnicity discrimination, may play a role in 
the development of persecutory symptomatology [18]. Though the 
diagnosis in many cases is stable over years, the persecutory type 
may change more often into schizophrenia than other types [4]. 


Querulous-litigious subtype 


Although ICD-10 names a querulous-litigious subtype, this can be 
seen as a form of the persecutory subtype, as mentioned previously. 
The affected individual fights against delusional injustice, seeking 
help of the courts or authorities or, in some cases, by debating. This 
behaviour can sometimes be dangerous, showing the ‘Nemesis 
syndrome’—which means a subjective justified penalty, named 
according to Nemesis, the Greek goddess of high and divine ven- 
geance. Patients with querulous-litigious delusions have sthenic and 
combative personalities, not seldom also showing obsessive features, 
and are willing to fight for their own satisfaction and compensation. 
But they are also isolated, because they usually do not win favours 
from their environments. 

In a clinical setting, this subtype is rare. The “Halle study on 
Delusional Disorders’ found a querulous-litigious subtype in only 
three patients (0.03%) out of all 9969 admissions between 1994 and 
2008—that is 7.0% out of all patients diagnosed with having a delu- 
sional disorder, which is in accordance with the findings of other au- 
thors [4]. A decrease in treated cases has been reported after the first 
half of the twentieth century [19]. At the University Clinic of Bern 
(Switzerland), the prevalence of patients with litigious symptom- 
atology was 0.275% between 1922 and 1951, but only 0.09% between 
1964 and 1993. Regarding psychotic querulous-litigious behaviour, 
the decrease in diagnoses was calculated as 74% between those two 
time periods [20]. One reason has to be taken into account—that 
patients with querulous-litigious delusions mostly contact the police 
or legal authorities, not the medical system. Another reason is prob- 
ably that psychiatrists, on the other hand, do not have great interest 
in dealing with querulous-litigious patients because the psychiatrists 
themselves experience a number of difficulties with these patients— 
they fight doctors and their institutions through letters, emails, tele- 
phone calls, legal complaints, and even stalking behaviour. Another 
reason might be the difficulty in precisely distinguishing between 
querulous-litigious delusions, paranoid personality, and reality. 


Grandiose type 


Patients with this type show delusions of inflated worth, power, know- 
ledge, identity, or special relationship to a deity or famous person. 
Although the official diagnostic systems define a grandiose type, 
experienced clinicians and researchers question its independent 
existence [4, 5]. Munro, for example, an expert in the topic, only 


remembers two cases in his long period of clinical experience. Also 
the author of the present chapter encountered, over 40 years, primary 
delusions of grandiosity only in patients with mania, schizomania, 
schizophrenia, ecstatic forms of brief psychotic disorders, and or- 
ganic psychosis, but never a primary grandiose type of delusional 
disorder. Nor did the “Halle study on Delusional Disorders’ find any 
case [4, 11, 12]. Possibly, the phenomenon could be explained by the 
following: 


1. An independent ‘grandiose delusional disorder’ is so extremely 
rare that it practically does not exist. 

2. Patients with grandiose type feel healthy, glorious, and superior 
and are self-confident. Hence, they do not seek help and usually 
do not harm anybody, and their social functioning is not im- 
paired. So there is no reason to contact medical authorities. 

3. There is a change in today’s delusional themes, in comparison to 
earlier times, and grandiosity has become rare or presents only 
in psychotic disorders with manic or hyperthymic mood. 

4. Modern operational diagnostic systems allocate states previously 
diagnosed as ‘grandiose delusional disorder’ to other categories 
like mania, schizomania, schizophrenia, etc. 

5. A thorough analysis of the background of delusion of gran- 
diosity could reveal them as the ‘height’ reaction type or the 
persecutory type. 


Grandiosity and persecution can be the opposite sides of the same 
coin, as illustrated by related and generally inseparable beliefs: ‘I am 
important enough to be singled out for persecution’ and ‘I am sin- 
gled out for persecution because I am so important’; ‘they persecuted 
me, because I am such an important person or ‘they persecuted me, 
because I am such an intelligent persor or ‘such a powerful person, 
etc.; and ‘as I am so superior, they are not able to harm me. There is 
no danger for me. I dont need to be afraid’ A flight into grandiosity 
may be one of the possibilities to compensate not only feelings of 
worthlessness and other negative thoughts and feelings, but also fear 
of persecution. 

Thus, there is strong evidence that the very rare cases of the gran- 
diose type’ are simply a variation of the persecutory type, as it was 
presented previously [4, 5]. 


Delusional misidentification 


Delusional misidentification is characterized by misidentification of 
persons, but sometimes also of animals or inanimate things. 

It is a non-specific phenomenon, symptom of various psychotic 
disorders, but obviously very rare as an independent delusional dis- 
order. The content of the delusion is almost always threatening to 
the existence of the individual, related persons, or even the whole 
human race. Hence it could be mostly allocated to the securogenic 
type. The group of delusional misidentification syndromes do not 
include states in which the patient misjudges his environment due 
to clouded consciousness. 

Delusional misidentification has varying phenomenology, giving 
rise to the following subtypes. 

Capgras syndrome (named according to the French psychiatrist 
Joseph Capgras, one of its descriptors in the twentieth century— 
although it was already repeatedly described in the nineteenth cen- 
tury [4]) is the core of the group of misidentification syndromes. The 
main belief of a patient with Capgras syndrome is that someone in 


their environment, usually a close relative or a significant other, has 
been replaced by an identical-looking double. Only subtle details re- 
veal the double; some hair behind the ear, some small movements, 
some minor details of the smile, etc. may be hints that the person is 
not the real one, but a replacement. The real person who has been re- 
placed is thought to be absent. Patients usually indicate ‘dark powers, 
‘the evil; ‘secret services, ‘the Mafia, or ‘enemies’ as having initiated 
the replacement. The ‘double’ is usually believed to be a spy or to be 
instructed to persecute and harm the patient, the city, the country, 
or even the whole world. The patient is occupied by uncertainties, is 
increasingly suspicious, feels great internal tensions, and is anxious, 
agitated, perplexed, and hostile against the ‘double’ 

Other phenomenological variations of the misidentification syn- 
dromes are extremely rare, or even exotic, like the following: 


Frégoli syndrome: the patient believes that imposters frequently alter 
their facial appearance to resemble familiar people in his environ- 
ment. It is named after Léopoldo Frégolian, an Italian actor famous 
for his ability to transform himself on stage into various characters. 
The patient thus identifies the imposter (usually as the persecutor) 
in several persons, for example the doctor, next time the nurses or 
fellow inpatients, then perhaps in neighbours, the postman, and so 
on. However, the persecutor is always one and the same. 

Intermetamorphosis: a delusion that people around the patient have 
transformed physically and psychologically into one another. 
The transformation may concern also animals, as well as ob- 
jects. It also implies the transformation of one’s own person into 
someone else. 

Subjective doubles: the patient believes in the existence of exact 
doubles of themselves. Although the syndrome of doubles of 
the self has superficial similarities to heautoscopy, the two phe- 
nomena are far from being identical. In cases of subjective 
doubles, the patients are convinced of the existence of another 
person associated with the delusional belief of being his or her 
double, while heautoscopy is a sort of hallucination, seeing one- 
self outside one’s own body. 

Clonal doubles or clonal pluralization: it is characterized by the delu- 
sion that there are several persons looking very much alike, that 
is, as clones of one and the same person. 

Zoocentric misidentification: the patient is convinced that pets have 
been replaced by others. 

Inanimate misidentification: the patient is convinced that things 
have been replaced by other things or by their doubles. 

Misidentification of time: it is characterized by the patient’s convic- 
tion that the present time has been substituted or that the usual 
passage of time has been changed in the sense of permanently 
repeating itself. 


Somatocentric group 


The somatocentric delusional disorders, somewhat broader than the 
somatic type of DSM-5 or ICD-10 (-11), focus on one’s own body 
and its functions. The delusional theme concerns negative alterations 
of one’s own body image, bodily functions, and bodily health, as in the 
following syndromes: 


e Delusional parasitosis. 
e Cotard’s syndrome. 
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e Delusional syndrome of one’s own odour (olfactory reference 
syndrome). 

e Delusional pregnancy. 

e Health threat and dysmorphia delusional syndrome. 

e Lycanthropy and zooanthropy. 


Delusional parasitosis 


Delusional parasitosis is the delusion of being infested with parasites. 
The infested organ is usually the skin, but sometimes also the intes- 
tines, vagina, mouth, nose, etc. 

The syndrome is known under various names: delusions of in- 
festation, Ekbom’s syndrome, prdseniler Dermatazoenwahn (this 
is the original name given to the syndrome by Ekbom and means 
pre-senile delusional parasitosis), chronic tactile hallucinosis, 
delusional infestation, _monosymptomatic hypochondriasis, 
monosymptomatic hypochondriacal psychosis, acarophobia, and 
névrodermie parasitophobique. 

The Swedish psychiatrist Karl Ekbom introduced the German 
term Dermatozoenwahn in 1938, describing seven cases, mainly 
women, aged between 50 and 60, who were complaining of itching 
and paraesthesiae and were stubbornly and fixedly convinced that 
the discomfort was caused by small animals (for example, insects, 
parasites). However, the phenomenon had already been described 
many decades before Ekbom [4]. But it is his main contribution— 
that he described it as a delusional syndrome. 

The phenomenology of delusional parasitosis is very homogenous 
and extremely alike in all affected patients. Almost all patients de- 
scribe very intense itching as the key symptom. They complain 
about little creatures or microorganisms crawling, biting, stinging, 
hopping, or running on or under their skin. They show a high level 
of suffering, but only few report actual pain. The skin sensations usu- 
ally persist the whole day and may also disturb the patient's sleep. 
More than one-third of the patients additionally describe sensations 
inside body openings or in the interior of their body: in their mouth, 
nose, inner ear, anus, intestines, lungs, or genitals. More than two- 
thirds of the patients present illusionary misidentifications or visual 
hallucinations related with the skin or parasites. Some patients even 
approach local health authorities or their general practitioners, 
carrying small containers with the ‘parasites’ with them, attempting 
to provide evidence of the cause of their torment. Thorough investi- 
gations of those objects usually reveal that they are small fragments 
of skin, hair, dried blood, scabs, dried nasal mucus, lint, textile fibres, 
dust, or excrement particles, etc., but in spite of such explanations, 
the patients are utterly convinced that these are parasites or their 
excretions. 

The visual phenomena usually do not fulfil the definition of real 
hallucinations, because they are not perceptions without a stimulus; 
there is a stimulus—the small signs mentioned previously, which are 
misidentified by the patients as parasites. This can be defined as illu- 
sionary misidentifications secondary to the tactile sensations or to the 
delusion of parasitosis. 

Other delusions secondary to the primary delusion (for example, 
delusions of reference) are possible, but based on, and derivable 
from, the primary delusional conviction. Other associated hallu- 
cinatory phenomena or misinterpreted perceptions (for example, 
visual, auditory, olfactory, or gustatory) are rare and without diag- 
nostic relevance. 
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The prevalence of delusional parasitosis is not exactly known. 
Psychiatrists are likely to be the last specialists contacted by pa- 
tients with delusional parasitosis. Most patients first seek help from 
dermatologists, general practitioners, or local health authorities. 
Only when distress and suffering seem to become unbearable do 
they seek a psychiatrist’s advice. Nevertheless, delusional parasitosis, 
with approximately 0.5% of psychiatric inpatients, is not common. 
Females seem to be more frequently affected than males. 

Although delusional parasitosis can manifest itself in every period 
of adult life, it is more frequent after the fifth decade, but not exclu- 
sively ‘pre-senile, as Ekbom supposed. 

Perhaps delusional parasitosis is the most common delusion that 
can be ‘infectious: Although the so-called shared psychosis, mostly 
in the form of ‘folie a deux, is rare, the ratio of shared psychosis in 
delusional parasitosis seems to be higher than in other psychotic dis- 
orders [4, 5]. 

There has been long discussions, mostly in the anglophone litera- 
ture, of whether delusional parasitosis is a type of hypochondriasis 
or whether it is more correct to define it as hypochondriacal delusion 
of parasitosis. However, only a minority of patients with delusional 
parasitosis fulfil the essential criterion of hypochondriasis, that is, 
excessive preoccupation with anxious self-observation and wor- 
ries about having a serious disease. In contrast to such concerns re- 
garding one’s own health, many patients with delusional parasitosis 
deny every kind of disease; they empathetically underline that they 
do not have any physical or mental disorder; they are ‘just attacked 
by parasites. This is why many patients do not seek the help of a 
doctor, but of an exterminator, a parasitologist, or even of local 
health and hygiene authorities. 

But it is also true that some patients, although not the majority, 
have a hypochondriacal background. Also, during the long-term 
course of their illness, some patients may move towards the hypo- 
chondriacal pole. From a theoretical point of view, it seems im- 
portant to differentiate between delusions and tactile hallucinations. 
Some authors believe that delusions are secondary, whereas tactile 
hallucinations are primary. But as mentioned, in contrast to other 
types of hallucinations, it is extremely difficult to define a patient's 
bodily sensation as being hallucinatory or not. 

Summarizing the literature on the topic, it can be concluded that 
both mechanisms—hallucinatory and delusional—may occur in the 
sense of organic hallucinosis or delusional disorder. 

Delusions of parasitosis may manifest as symptoms of schizo- 
phrenia, of organic psychotic disorders such as delirium and cocaine 
abuse, or of other psychotic disorders. The distinction between au- 
tonomous delusional disorder of parasitosis and symptomatic delu- 
sional parasitosis due to other conditions usually is easy, assuming 
the other defining criteria of the previously mentioned disorders are 
fulfilled, that is, the criteria of schizophrenia, psychotic affective dis- 
orders, organic psychotic disorders, etc. 

Treatment is that of the underlying condition (somatic or psych- 
otic). Clinical evidence has shown the effectiveness of dopamine 
antagonists, occasionally in combination with antidepressants, in 
some cases of autochthonous and autonomous delusional disorder 
of parasitosis. 

In regard to prognosis, it can be concluded, that although in some 
cases the syndrome can become chronic, in many cases the prog- 
nosis seems to be benign following adequate treatment. 


Cotard’s syndrome 


Cotard’s syndrome is a rare clinical condition, the most prominent 
symptom of which is the nihilistic delusion of denial of one’s own 
body or parts of the body or even the delusion of being dead. 

The syndrome was called after Jules Cotard, a French psychiatrist, 
to whom we owe the first systematic description in 1880, although 
the syndrome had been known before [4]. 

Patients hold the distinct delusion of being dead, not existing, 
putrefying, or having lost body parts, blood, internal organs, or very 
rarely their own soul. Symptoms of negation sometimes extend to 
the environment and even to the whole cosmos. Initially, the symp- 
tomatology is quite similar to that of derealization and/or deperson- 
alization. The delusional beliefs determine the patient’s behaviour. 
Patients being convinced that they do not have a stomach or any in- 
testinal organs will refuse to eat or drink. Patients with the delusional 
conviction that their legs are dead or ‘lifeless’ will refuse to walk and 
will stay in bed for a long period of time, sometimes extending to 
several months. In severe cases, the patients no longer communicate 
with their surroundings. They present total negativism with mutism, 
in more severe cases even stupor. Thus, it is recommendable in pa- 
tients with severe delusional depression, who develop stupor in the 
form of catatonic depression, to find out whether this is a conse- 
quence or a result of the patients’ delusional conviction of negation. 
Analgesia, or even anaesthesia, has been observed in some patients. 

Cotard’s symptomatology is rare and can accompany severe 
psychotic depression but can also be observed in schizophrenia 
and organic brain damage; as an autonomous delusional disorder, 
it is apparently extremely rare. Insofar treatment of the syndrome 
depends on the underlying illnesses, it is symptomatic. Dopamine 
antagonists, alone or in combination with antidepressants and ECT 
treatment, could be effective. 


Delusions of one’s own odour (olfactory reference syndrome) 


Delusions of one’s own odour can be defined as the unfounded and 
uncorrectable conviction of a patient to emit a bad or foul odour. 

There are several synonyms, which only partially are correct and 
identical with the delusional disorder of one’s own odour: olfactory 
reference syndrome, delusions of bromosis or of smell, delusional 
halitosis, hallucinatory halitosis, imaginary halitosis, etc. 

If the delusions of one’s own odour are the only prominent 
symptom and there’s no indication for another type of psychosis like 
schizophrenia, schizoaffective or depressive disorder, and if they are 
not caused by organic factors, they fulfil the criteria of an autono- 
mous delusional disorder. Delusions of one’s own odour must also 
be distinguished from social anxiety spectrum disorders in which 
the respective believes are not delusional. 

Olfactory delusional disorder is not only very rare, but also very 
difficult to be identified as such. If the patients report on smells from 
other people or things of the environment, it usually indicates ol- 
factory hallucinations. But when the patient reports on own body 
odours, distinction between delusion and hallucination is needed. 
If the patient reports perceptions of smell originating from their 
own body or body fluids and the smell cannot be perceived by other 
people, the patient may have hallucinations. In contrast, patients 
with symptoms of delusions of their own odour normally phrase it 
like: “No, I do not perceive any strange or other extraordinary odour 
coming from my body, but I am sure that I give off such smells. 


I notice it from other people's reactions when I enter a room or when 
I am close to people. They look at me very strangely; their facial ex- 
pressions change; they move away, they talk to each other meaning- 
fully. It can only be related to my own odour, which I personally do 
not perceive. This is logical because people usually do not perceive 
their own odours.” 

The patients also give a reason for their strange and bad odours 
perceived by others: they have done or thought something wrong 
or immoral. For instance, a 28-year-old male patient believed that 
people knew he masturbated because the next day his body smelled 
strange and extraordinary. He never perceived that smell, but could 
conclude it from the reactions of his colleagues in the office. So 
even if a patient reports on own odours it needs to be differenti- 
ated whether it is an olfactory hallucinosis, which—other than 
delusions—often is related to temporal lobe epilepsy or other tem- 
poral lobe damages [4]. 

There are no statistical surveys on the prevalence of the autono- 
mous delusional disorder, but only case reports, and even those are 
very rare. 

Psychogenic factors, especially sensitive and anxious personality 
features, are reported to have pathogenetic or pathoplastic relevance. 

Treatment with drugs for psychosis and psychotherapy may be 
effective, but findings are inconsistent. As a result of their condi- 
tion patients suffer from severe impairment in work and social 
relationships [21]. 


Delusional pregnancy and Couvade syndrome 


Delusional pregnancy (or delusional pseudocyesis) is characterized 
by the delusion to be pregnant (even in women with no sexual rela- 
tions or in non-reproductive phases of life). 

Although delusional pregnancy is not an extremely rare syn- 
drome within major psychotic disorders, especially in schizo- 
phrenia, schizoaffective disorder, or psychotic mood disorders, it 
is extremely rare as autonomous delusional disorder. Delusions of 
pregnancy may last over a course of many years—in some reports 
over 25 years—and also in women over the age of 60. 

Couvade syndrome is defined as the phenomenon in which hus- 
bands, boyfriends, or other exceptionally related persons of preg- 
nant women develop symptoms of pregnancy too. 

Couvade syndrome has apparently been well known since ancient 
times; it was even mentioned in the Greek myth of the ‘Argonauts, 
and 2000 years ago, the Greek geographer Strabon reported on that 
phenomenon too [4]. 

The name Couvade syndrome was derived from the early French 
verb couver, which means to brood or to hatch. Couvade syndrome is 
a very rare syndrome in clinical reality, although some publications 
state the opposite. A reason for this discrepancy might be the fact 
that in some studies, certain symptoms having none or only little 
association with pregnancy (like toothache) are assumed to be an 
indicator for Couvade syndrome. There is no evidence that such 
symptoms are delusional. In contrast to Couvade syndrome, which 
is usually non-delusional, there are only few reports on real delusions 
of pregnancy in men, but as a symptom of psychotic or organic dis- 
orders, among other symptoms, especially in epilepsy, and post-ictal 
psychosis. 

There is a plethora of theories trying to explain the phenom- 
enon, for instance psychodynamic ones, but none of which has been 
proven. 
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Lycanthropy or zooanthropy 


Lycanthropy or zooanthropy is a mainly delusional conviction that one 
has been transformed into a wolf or another animal (lýkos is the Greek 
word for wolf). 

Lycanthropy or zooanthropy is extremely rare. In international 
literature, over the last two centuries, only a few cases reported ly- 
canthropic delusions. They are usually of short duration, lasting a 
few days or weeks, sometimes only hours, and seldom over a longer 
time. Mostly it is considered a symptom of major psychiatric dis- 
orders, especially of mood disorders, schizophrenia, or organic brain 
disorders, also involving alcohol and drug abuse, or psychogenic 
(dissociative or hysterical). Therefore, there is only little support to 
define lycanthropy as an autonomous delusional disorder [4]. 


Delusional disorders of health threat and dysmorphia 
(somatic type) 


Patients with delusional disorder of health threat have unsubstanti- 
ated and uncorrectable convictions of having a severe illness or any 
other threat to their health. 

Hypochondriac delusions is a usual synonym for the disorder, but 
the term is not only discriminating and inexact, but it also does not 
cover the broad spectrum of health threat delusions [4]. 

Diagnosis, especially differential diagnosis, of delusional disorder 
of health threat is really difficult, mainly because the boundaries to 
non-delusional (hypochondriacal) fears are diffuse, uncertain, and 
difficult to draw. 

The spectrum of themes of delusions of health threat are as broad 
and variable as there are organic diseases. Delusional ideas may focus 
on all parts and organs of the human body. They may occur as an au- 
tonomous delusional disorder, but more often they develop as symp- 
toms of a major psychotic disorder (schizophrenia, schizoaffective 
disorder, delusional depression, or organic psychosis). Delusional 
disorders of health threat may be based on the patient's own or the 
family’s experiences. A successfully treated ‘real’ medical condition in 
the past can be the source of the theme ofa later delusional disorder. 

Delusional dysmorphia is characterized by the unsubstantiated 
and uncorrectable conviction that the body schema is unattractive or 
disproportioned. 

Delusional forms of the conviction of being misshapen have to be 
distinguished from phobic fears with that content. Nevertheless, pa- 
tients with delusions of dysmorphia—usually younger than in other 
types of delusional disorder—are very unhappy people. The adverse 
influence of the disorder on their social interactions is massive; pa- 
tients isolate themselves and are irritable, envious, and insecure, 
with low self-esteem. 

Unlike other somatocentric delusions, delusions of dysmorphia 
usually begin early in life, mostly in adolescence, sometimes even 
in childhood. Adolescence is a fruitful soil for ideas of dysmorphia 
or dysmorphophobia, which may, in some cases, develop into delu- 
sions of dysmorphia. 

In summary, it can be concluded that presumably there might 
never be reliable statistics on the prevalence of delusional disorder of 
health threat or of dysmorphia. Most patients from the first category 
will seek help from various physicians, but not from psychiatrists or 
psychologists. And patients not only with delusions of dysmorphia, 
but also any kind of dysmorphophobia, will most likely seek advice 
from an aesthetic surgeon, rather than from a psychiatrist. 
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Delineation of delusional disorders of health threat or of dys- 
morphia from non-delusional health threat syndromes (hypochon- 
dria or hypochondriacal disorder) and body dysmorphic disorder 
remains difficult—this is one of the reasons why we do not have— 
and do not get—reliable epidemiological data. 

Delusions of health threat and of dysmorphia are rare as autono- 
mous and autochthonous disorders but are usually associated with 
depression, schizophrenia, schizoaffective disorder, or disorders of 
the brain. 

Prognosis is largely related to response to treatment, which mostly 
consists of a multimodal therapy setting of drugs for psychosis and/ 
or depression and psychotherapy. Some authors report better re- 
sponse to treatment in somatocentric delusional disorders than in 
other types [5]. 

It is no surprise that in somatocentric syndromes, the relevance 
of culturally accepted fears and beliefs has to be taken into account. 
Some culture-related somatocentric syndromes are phobic, rather 
than delusional—for example, koro syndrome (acute fear or convic- 
tion of patients of having a genital retraction, which leads to death), 
in South East Asia; Kaza basolo (fear of women that their clitoris 
has been stolen, in Central Africa; Dhat syndrome with anxiety and 
somatocentric contents, like weakness and exhaustion, associated 
with convictions that their cause is loss of semen, in India; or taijin 
kyofusho (fear concerning mainly dysmorphia), in Japan. 


The prognosis of delusional disorder is discussed controversially, 
moving between pessimistic and optimistic views [4, 5]. According 
to evidence collected to date, the following can be concluded. 

Delusional disorders are highly stable over many years, that is, 
in the great majority of patients, there is no syndrome shift in the 
course of the illness. 

Symptoms usually persist over years or decades, and in some 
patients lifelong. Nevertheless, cases of some specific types of 
somatocentric delusional disorder, like delusional parasitosis, can 
have a benign prognosis after treatment with neuroleptics. 

Although a number of patients suffer interactional problems with 
the social environment, which leads to early retirement, autarky re- 
mains intact in the great majority. This means patients are not im- 
paired or disabled, live independently, and do not need any kind of 
support. 

There exists comorbidity with depressive episodes, but reports 
vary considerably—approximately 30% to more than 70% of pa- 
tients develop temporarly during course also depressive episodes; 
comorbidity with alcoholism seems to be lower, approximately 10%. 

Controversially discussed in recent literature is an association 
of delusional disorder and dementia. Some studies found patients 
with delusional disorder at higher risk than the general population 
[22, 23], but others not [4, 11, 12]. 


There is a lack of systematic, double-blind, randomized controlled 
trials on this topic, using methods similar to those in other psychotic 


disorders. What we know about the efficacy of treatment is mainly 
from anecdotic reports or very small heterogenous samples. 

Whereas some authors expressed therapeutic optimism mainly 
regarding dopamine antagonists [5], other investigations, as well 
as clinical experience, do not share such over-optimistic views [4]. 
Comparative trials did not find a better outcome in the group ad- 
mitted after the introduction of dopamine antagonists in modern 
psychiatry, in comparison to the time before their introduction [24]. 
Nevertheless, some somatocentric delusional disorders, like delu- 
sional parasitosis, respond positively to treatment with dopamine 
antagonists [4, 5]. 

All reports on electroconvulsive therapy are anecdotal [4], just like 
reviews on the efficacy of psychotherapy in delusional disorder. One 
of the rare comparative psychotherapeutic trials [25] included only 
a small number of patients (n = 11 allocated to a cognitive behav- 
ioural therapy group; n = 6 to an attention placebo control group) for 
a relatively short treatment period of 24 weeks. Completers in both 
groups showed clinical improvement, but those in the cognitive be- 
havioural group improved in more aspects. 

Notwithstanding, long-term investigations proved that in 
spite of occasional or even continuous treatment in the majority 
of patients (up to 80%), the delusions remained for a long time 
unremitted [4, 11, 12]. 

The enormous difficulties associated with the treatment of patients 
with delusional disorder have to be taken into consideration, mainly 
because patients do not feel ill or disturbed and so refuse any kind 
of therapeutic help. Usually, such patients become dissatisfied, non- 
compliant, and suspicious, discontinue therapy, regard hospitaliza- 
tion as a hostile act, and often set off a long-lasting war of nerves 
against the treating staff, even after their discharge from hospital. 

Any patient with delusional disorder might be difficult, but 
querulous-litigious patients are perhaps the most difficult and un- 
pleasant ones—not only for their social environment and the legal 
system, but also for therapists. Their psychotherapists or psychiat- 
rists are frequently persecuted by querulous-litigious patients and 
reported to university authorities, hospital administrations, minis- 
tries, or even legal representatives. 

Nevertheless, some authors recommend an integrated method for 
treating ‘paranoia, using a multimodal therapeutic approach of com- 
bination of psychodynamic (uncovering), interpersonal, cognitive 
behavioural, and pharmacological therapeutic techniques [26]. 
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What is early psychosis? 


Psychosis is a severe mental illness that most often begins in late 
adolescence or early adulthood, though it can affect people across 
the lifespan. Early psychosis refers to the onset of symptoms or pro- 
dromal features that begin during early neurodevelopment. The term 
first-episode psychosis (FEP) is often used to refer to the first time 
psychosis is manifest, though this may be later in life (beyond ado- 
lescence). Psychosis is associated with a distinct break from reality 
and characterized by disorganized thinking and sometimes speech, 
delusions, hallucinations, and disturbances in behaviour and affect. 
It can also be associated with prominent motor/volitional disturb- 
ances, cognitive disturbances, and impaired insight (for discussion 
about the types of psychosis, see [1]). Early psychosis may be pre- 
ceded by prodromal stages before full-blown psychosis is apparent. 
This may include changes to thought and perception, which are 
often vague, yet can be distressing. It is, however, important to con- 
sider these phenomena carefully across the stages, from prodrome 
(and even earlier) to psychosis, so that effective strategies for illness 
prevention and intervention can be put in place. In this chapter, we 
discuss the importance of early identification and treatment of early 
psychosis and its prodrome. 


What are the prodromal symptoms of psychosis? 


The ‘prodrome’ of psychosis (which includes, but is not limited to, 
schizophrenia) constitutes a period of change from baseline pre- 
morbid levels of functioning and may progress to a stage where pa- 
tients are experiencing perceptual, mood, and behavioural changes 
prior to the onset of frank psychotic symptoms. Thus often, these 
are only recognized in hindsight, rather than ahead of time. Patients 
who develop schizophrenia, with the benefit of hindsight, are able to 
recall that they experienced prodromal symptoms about 80-90% of 
the time. Ten to 20% of patients with schizophrenia can develop the 
illness ‘out of the blue; that is, without prodromal symptoms [2, 3]. 
In order to better understand and accurately identify prodromal 
symptoms, various groups have defined sets of phenomena that 





characterize the prodrome and earlier stages. Some of these ap- 
proaches were based on identifying individuals showing higher 
conversion rate, based on genetic risk, together with other vulner- 
abilities, including familial risk and functional decline, multiple 
affected family members, and familial risk together with cognitive 
deficits or negative symptoms [4, 5]. Other approaches aim to rec- 
ognize individuals who are help-seeking due to emerging psych- 
otic symptoms, who have not met psychotic disorder criteria as 
yet but are potentially in the prodromal phase [clinical high-risk 
(CHR) or ultra-high-risk (UHR) for psychosis]. An approach that 
seems to encompass all of these varied symptoms can be found ina 
key paper [6], which describes the contemporary thinking around 
CHR psychosis states. Broadly, there are two key concepts— 
the UHR criteria and basic symptoms (BS), both of which have 
similar criteria, with a goal of identifying those who would be 
more likely to transition to psychosis within a short time period 
(12-24 months). Preceding the CHR phase is the period relating 
to other factors leading to increased vulnerability. This is outlined 
further in Fig. 63.1. 

The notion of very early signs of psychosis has been evident in the 
literature for over 40 years. Gerd Huber introduced the concept of 
BS of psychosis, a relatively well-known concept in Germany, but 
less widely used elsewhere, which describes the early prodromal 
phases of psychosis [7, 8]. BS are subtle and sometimes vague (ill- 
defined) features experienced by the individual and may include elu- 
sive disturbances in thought, speech, mood, or bodily perceptions, 
and can also include cognitive—-perceptive and/or cognitive disturb- 
ances [9]. The individual may use personalized coping strategies, 
such as avoidance of certain situations and social isolation, and may 
overcome them for at least a period of time. The development of 
psychosis and cognitive deficits are often associated with partial or 
complete loss of insight [10]. Together with more recent conceptu- 
alizations of prodromal symptoms, the approach of identifying the 
earliest prodromal stages of schizophrenia has provided the tantal- 
izing possibility for early identification and treatment, with the hope 
of delaying, or even preventing, the illness. This approach also allows 
for the identification of brain changes and potential mechanisms in 
early psychosis and schizophrenia [11]. 
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Fig. 63.1 Model of transition from CHR states to psychosis [6]. 





Reproduced from Eur Psychiatry, 30(3), Schultze-Lutter F, Michel C, Schmidt SJ, et al., EPA guidance on the early detection of clinical high risk states of psychoses, pp. 405-16, 


Copyright (2015), with permission from Elsevier Masson SAS. 


The original concept of UHR for psychosis was defined according 
to the presence of state [psychotic-like experiences (PLEs), brief 
psychotic symptoms] and trait risk factors (schizotypy, family his- 
tory), associated with a clinically significant drop in function, as- 
sessed using the Global Assessment of Functioning (GAF) scale 
[12]. Early studies indicated that 37% converted to psychosis within 
12 months [13]. A meta-analysis of more recent studies using such 
criteria has identified the transition rate to be 22% [14]. The import- 
ance of this notion was the possibility of early/pre-onset identifi- 
cation and intervention to prevent or ameliorate psychosis and the 
possibility to examine neurobiological changes around the transi- 
tion to active illness [11, 15, 16]. Important criticisms of this ap- 
proach, however, are the focus on psychotic-like experiences (PLEs) 
and the lack of a clear boundary for transition to psychosis. Thus, to 
date, it is unclear that there is a discontinuity between the UHR state 
and psychosis, which requires longitudinal mapping of trajectories 
from pre-psychosis onset [17]. More recent conceptualizations sug- 
gest that a dimensional approach may be more appropriate [18]. 

The UHR criteria over the last two decades encompass other 
related concepts such as Brief Limited Intermittent Psychotic 
Symptoms (BLIPS), Attenuated Psychotic Symptoms (APS), 
Genetic Risk and Deterioration Syndrome (GRDS) Subgroup, and 
the Unspecified Prodromal Symptoms (UPS) group. Other groups 


have defined genetic high-risk individuals, which potentially allows 
for an even earlier identification of pre-psychotic individuals at high 
risk for psychosis [19]. There is no consensus as to whether these are 
distinct entities or fall under the umbrella of UHR, as outlined in 
this detailed review [20]. 


Detection of individuals at ultra-high risk and 
conversion rates for psychosis 


In recent years, there has been increasing effort to develop a greater 
focus on specific risk paradigms to identify individuals who have 
a higher risk and transition rate, in comparison to psychometric 
schizotypy, PLEs, and familial high-risk approaches. A number 
of structured interview methods, including Comprehensive 
Assessment of At-Risk Mental States (CAARM) and Structured 
Interview for Psychosis-Risk Syndromes (SIPS), have been devel- 
oped to detect CHR groups [21]. In practice, the predominant pres- 
entation of the majority of individuals meeting CHR criteria is APS 
(85%) in a recent meta-analysis [22]. The same study also found 
that the conversion rate for developing psychosis was considerably 
higher across these risk paradigms, compared with psychometric 
schizotypy, PLEs, and family history (for example, the transition 
risk within 36 months was 20% for APS and 26% for APS + GRDS) 
Transition rate of BLIPS was much higher (39% within 24 months). 
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On the other hand, the positive predictive value of CHR criteria for 
transition to psychotic disorders in non-help-seeking individuals 
was only 5.7%. Similarly, the annual risk of PLEs for psychotic out- 
come was relatively low (0.56% per year) [23]. 


Staging model for psychosis 


Following on from the work in defining UHR and psychosis transi- 
tion, McGorry and colleagues [24] have proposed a staging model 
of psychosis (Table 63.1), which incorporates the earliest stages of 
psychosis and considers the onset and progression of the illness to 
chronicity. This model borrows from notions developed for cancer 
staging and treatment, and is appealing as it suggests that each stage 
may require different interventions/treatment strategies. The central 
concept for the psychoses is that earlier stages are characterized by 
differences neurobiologically and better functional abilities, and re- 
quire different interventions. Thus, earlier stages may warrant less 
extreme measures, while later stages will likely require more radical 
interventions. Further, each stage may be associated with differences 
neurobiologically and there may be identifiable biomarkers by stage 
[25], though it is yet to be demonstrated that there are clear bound- 
aries between the proposed stages. 

McGorry and colleagues [26] have gone on to propose that the 
earliest stages, namely UHR or earlier, are less differentiated diag- 
nostically, prompting them to consider that this represents a 


Table 63.1 Staging model 


Stage Definition Target populations and referral 
sources 








‘pluripotential’ state, with the possibility of a number of diagnostic 
outcomes for the individual. However, it remains unclear if the con- 
dition at this stage is truly ‘pluripotential’ or whether progression to 
one or other psychotic condition has already been pre-determined 
by factors yet to be identified (for example, genetic). Current initia- 
tives are attempting to use machine-learning and other analytical 
approaches to examine if features of the illness at its earliest stages 
may predict the onset of psychosis, diagnosis, long-term course, or 
functional outcomes [27]. 


Why is early intervention for psychosis important? 


In general medicine, early intervention (EI) targeting prevention or 
disease modification has been a topic of interest for some time for a 
number of conditions such as diabetes, cancer, and cardiovascular 
diseases. EI aims to target pathophysiological processes causing a 
disorder, with the aim to cease or slow its progression before symp- 
toms emerge or become chronic and untreatable. While the ideal goal 
of EI is prevention, in most cases, modification of the illness course 
is a more realistic outcome. The proposed staging model for schizo- 
phrenia and other psychotic disorders, as discussed, appears to have 
face validity as an appealing notion, to intervene early and reduce 
the damaging impacts of the illness, in terms of clinical outcome, as 
well as brain structure and function. However, the concept of EI in 
psychotic disorders is perhaps not as straightforward, compared to 


Potential interventions 























Increased risk of psychotic or severe mood disorder First-degree teenage relatives of Improved mental health literacy 
and no symptoms currently probands 2. Fami y education, drug education 
3. Brief cognitive skills training 
la Mild/non-specific symptoms (including neurocognitive Screening of teenage populations. 1. Formal mental health literacy 
deficits) of psychosis or severe mood disorder. Mild Referral by: primary care physicians; 2. Family psychoeducation, formal CBT 
functional change or decline school counsellors 3. Active substance misuse reduction 
1b Ultra-high risk: moderate but subthreshold symptoms, Referral by: educational agencies; 1. Family psychoeducation, formal CBT 
with moderate neurocognitive changes and functional primary care physicians; emergency 2. Active substance misuse reduction 
decline to caseness (GAF, <70) departments; welfare agencies 3. Omega-3 fatty acids 
4. Dopamine antagonists/partial agonists 
5. Antidepressant agents or mood stabilizers 
2 First episode of psychotic or severe mood disorder Referral by: primary care physicians; 1. Family psychoeducation, formal CBT 
Full-threshold disorder with moderate to severe emergency departments; welfare 2. Active substance misuse reduction 
symptoms, neurocognitive deficits, and functional agencies; specialist care agencies; drug 3. Dopamine antagonists/partial agonists 
decline (GAF 30-50) and alcohol services 4. Antidepressant agents or mood stabilizers 
5. Vocational rehabilitation 
3a ncomplete remission from first episode of care; Primary and specialist care services 1. As for Stage 2, but with additional emphasis on 
patient's management could be linked or fast-tracked medical and psychosocial strategies to achieve 
o Stage 4 full remission 
3b Recurrence or relapse of psychotic or mood disorder, Primary and specialist care services 1. As for Stage 3a, but with additional emphasis 
which stabilizes with treatment at a GAF level, or with on relapse prevention and strategies to detect 
residual symptoms or neurocognition below the best ‘early warning signs’ 
evel achieved after remission from the first episode 
3c ultiple relapses, provided worsening in clinical extent Specialist care services 1. As for Stage 3b, but with emphasis on long- 
and impact of illness is objectively present term stabilization 
4 Severe, persistent, or unremitting illness (based on Specialized care services 1. As for Stage 3c, but with emphasis on 
symptoms, neurocognition, and disability criteria). clozapine, other tertiary treatments, and social 
Patient's management could be fast-tracked to this participation despite ongoing disability 
stage at first presentation, based on specific clinical and 





functional criteria (from Stage 2) or because of failure 
to respond to treatment (from Stage 3a) 


Adapted from Aust N ZJ Psychiatry, 40(8), McGorry, PD, Hickie IB, Yung AR, et al., Clinical staging of psychiatric disorders: a heuristic framework for choosing earlier, safer and more 
effective interventions, pp. 616-622, Copyright (2006), with permission from SAGE Publications. 
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other medical conditions, due to the complexity of factors involved, 
the diagnostic difficulties given the reliance on psychopathology, 
which may be ill-defined at illness onset, the variable trajectory, and 
the importance of other comorbid prognostic factors (for example, 
stress, substance use, and other environmental insults). Further, EI 
programmes require substantial resources. Therefore, it is important 
to review the available evidence of efficacy and effectiveness of avail- 
able EI models for psychosis, which should complement resources 
needed for those with established schizophrenia. 


Interventions in the prenatal period and infancy 


The greatest single risk factor for schizophrenia is genetic load 
[28], and recent studies attempt to delineate the genes relevant to 
the disorder [29], which provide clues for a number of potential 
treatment targets. Genetic susceptibility factors are likely to cause 
schizophrenia, with their negative influence on brain develop- 
ment from the prenatal period to later phases of brain development 
[11, 30]. Other critical events during early brain development, such 
as prenatal infection and starvation, and perinatal hypoxia are well- 
established risk factors for schizophrenia, while a range of other in- 
sults during early neurodevelopment, such as vitamin D deficiency, 
are also implicated [31], suggesting that public health measures at 
the population level may be important. Recent studies focus on the 
importance of gene-environment interactions [32], which should 
be examined across neurodevelopment. However, there is a lack of 
preclinical/animal research that examines gene-environment inter- 
actions across development and the impact on brain maturation. 
Such strategies would inform the development of novel pharmaco- 
logical strategies relevant to the emergence of the disorder. 


Public health measures to reduce the burden 
of psychotic illness 


General non-specific public health measures, including nutritional 
health and better prenatal and perinatal care, can have a poten- 
tial effect on reducing the risk for schizophrenia and co-occurring 
neurodevelopmental cognitive deficits. Another approach can be 
prenatal dietary supplementation to prevent schizophrenia, similar 
to the use of folate to reduce the risk of spina bifida and other dis- 
orders related to failure of closing of midline structures. Some au- 
thors have argued that prenatal choline supplementation, based on 
a-7 nicotinic receptor-related effects on neuronal inhibition deficits 
in schizophrenia, can reduce the incidence of schizophrenia. Low 
levels of vitamin D, folate, and retinol and increased homocysteine 
levels in early life have been proposed as risk factors for schizo- 
phrenia and neurodevelopmental disorders [33-35]. However, 
methodological difficulties, particularly the low population preva- 
lence and the late onset around the beginning of the third decade 
of life, provide a significant challenge in testing the role of dietary 
supplementation during early development in reducing the risk of 
schizophrenia. Strategies to examine this question include long- 
term population-based studies where data are available at, or prior 
to, birth and/or targeting high-risk groups of individuals (for ex- 
ample, those with a positive family history). 

A few RCTs have investigated the effect of prenatal choline sup- 
plementation on biological markers that have been associated with 
future schizophrenia risk (cognitive abilities and inhibition of P50 
potential) in infants and children. In a placebo controlled trial of 
choline supplementation in pregnant women, babies in the choline 


group had better P50 inhibition than controls at 1 month, but no 
difference was found at 3 months [36]. Another study found no evi- 
dence of cognitive differences at age of 1 year between choline and 
placebo groups [37]; however, maternal choline intake above 400 mg/ 
day had significant effects on cognition at 7-year follow-up [38]. In 
a Finnish birth cohort study, vitamin D supplementation during the 
first year of life was associated with a reduced risk of schizophrenia 
in males [39]. Another Danish population-based case-controlled 
study found that both increased and reduced levels of vitamin D in 
neonates were associated with a higher risk of developing schizo- 
phrenia, suggesting a non-linear relationship [40]. Correcting ab- 
normal levels of vitamin D and other deficiencies during early life 
may potentially reduce the incidence of schizophrenia. However, 
there is a need for longitudinal studies investigating the role of 
dietary supplements as an EI strategy, which could target high-risk 
subjects (for example, those with a positive family history). 

It is also important to consider the possibility that the interaction 
of familial risk and environmental risks factors (nutritional, ob- 
stetric problems, stress) can increase the risk for infants, as women 
with psychosis are less likely to receive adequate prenatal care 
than healthy women. Multi-disciplinary teams (for example, ob- 
stetricians, neonatologists, nutritionists, psychiatrists, adult- and 
infant-parent-trained psychologists, and social workers) could ad- 
dress problematic prenatal issues, including nutritional deficien- 
cies, lifestyle concerns such as the use of tobacco, alcohol, or other 
drug use, or life stresses including housing or violence in the home 
[41]. Enhancing parenting skills, supporting parents, and effective 
treatment of symptoms and cognitive rehabilitation in parents 
can potentially improve the resilience of a child at genetic risk for 
developing psychotic disorders [41]. Other than children ofa parent 
with schizophrenia, strategies to identify high-risk children (for ex- 
ample, those with 22q11.2 deletion and other schizophrenia-related 
CNVs) might be appropriate for such interventions. 


Intervention for specific and non-specific at-risk groups 
in childhood/early adolescence 


An alternative approach involves interventions targeting non- 
specific risk factors such as trauma and stress, bullying, drug abuse, 
and migration in childhood and teenage years, which are associated 
with an increased risk of mental disorders, including schizophrenia. 
For example, evidence suggests that interventions for teens and their 
parents, which improve family communication and rule setting, can 
reduce the rate of subsequent drug abuse and associated problem 
behaviours [42]. Reducing peer rejection and bullying with social 
(anti-bullying programs) and individual (social skills training) 
interventions might be another potential strategy [43, 44]. 

The use of treatments as early as possible in the premorbid phase of 
the illness may also improve outcomes. Such intervention strategies 
are currently being investigated in those individuals identified as at 
CHR for psychosis; however, such studies are relatively few. Very 
few studies and case reports have provided evidence of short-term 
CBT and a web-based self-help programme in reducing PLEs [45]. 
However, there are some efforts to develop psychosocial interven- 
tion programmes targeting children with familial high and low risk 
presenting with developmental delays, PLEs, and emotional prob- 
lems [46]. Cognitive remediation in children with neuropsycho- 
logical difficulties can also potentially improve resilience of children 
at risk. Studies investigating trajectories of PLEs, social withdrawal, 
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cognitive development, and other potential antecedents of psych- 
osis can help to identify target populations. For example, emerging 
evidence suggests that, while PLEs in children might be mostly be- 
nign and temporary, persistence of such symptoms are more likely 
to precede psychotic disorders and might be indicators of a need for 
interventions. 

The strategy and focus in the last two decades on CHR for psych- 
osis have provided the possibility to explore EI strategies in ado- 
lescence; however, there is currently only a limited focus on early 
(pre-emptive) intervention in childhood. This presents an important 
area of endeavour for future studies. Importantly, most of the pro- 
posed interventions have low risk and can benefit in enhancing re- 
silience in a relatively large population, as childhood antecedents 
of psychosis have low specificity and can be associated with other 
psychopathologies. 


Early intervention in adolescence and young adults 


Goals and challenges for early intervention in individuals 
at clinical high risk for psychosis 


Improving long-term outcome is the main goal for EI in individ- 
uals at high risk for psychosis. It is also hoped that EI in individuals 
at high risk can prevent or delay a first psychotic episode, which is 
often a very stressful experience for the individual and their family. 
Supporting individuals with mental illness and their families during 
late adolescence and early adulthood is also critical for helping in- 
dividuals in pursuing academic or vocational goals, developing 
social and sexual relationships and developing skills necessary for 
independent living. EI programmes also emphasize the concept 
of phase-specific treatment, suggesting that treatments that have 
limited benefits in chronic patients can be helpful in at-risk individ- 
uals (for example, CBT, fish oil). 

There are a number of conceptual and practical challenges in 
developing and implementing EI programmes in individuals at 
high risk for psychosis. The main challenge is the low positive pre- 
dictive value of the current construct of high risk for psychosis. 
The majority of individuals who meet CHR criteria never develop 
psychotic disorders. Even though some of these help-seeking in- 
dividuals might have subclinical psychotic conditions with a good 
prognosis, most (false-positives) would have non-psychotic condi- 
tions, including affective disorders, dissociative symptoms, trauma 
cluster B personality traits, and substance use. The heterogeneity of 
individuals meeting current CHR criteria has important implica- 
tions for research and treatment of early psychosis. The problem of 
high false-positive rates leads to ethical problems, including use of 
drugs for psychosis and stigmatization in individuals who have no 
underlying psychotic disorder. For true positives, the current CHR 
criteria, including the condition of being a help-seeking individual, 
together with the demands of EI programmes, would lead to recruit- 
ment of individuals with milder psychotic disorders, rather than 
individuals with poor insight, non-adherence, and poor prognosis. 
Also, current criteria for detecting CHR omit negative and disor- 
ganized symptoms and cognitive deficits, which might be beneficial 
for identifying those individuals with the highest need for help. The 
heterogeneity of CHR is also a challenge for assessing the efficacy of 
treatment methods, as the individuals with different underlying psy- 
chopathological conditions can respond to different components of 
EI programmes. That it is called ‘phase-specific treatment for CHR 


might be simply a reflection of the relationship between the het- 
erogeneity of individuals with CHR and treatment response. Thus, 
some individuals might respond to treatments that do not work in 
chronic psychotic patients, as they have no underlying psychosis. 
On the other hand, dopamine antagonist drugs would remain as the 
main treatment of choice for true positives, if we were able to detect 
these individuals with sufficiently high accuracy. Stigma associated 
with psychotic disorder is another challenge in managing individ- 
uals with CHR. 


Current evidence for efficacy of early 
intervention programmes 


CHR has been by far the most commonly used high-risk paradigm 
in EI studies. Only a few studies have investigated the role of inter- 
vention in other high-risk paradigms. Several studies with small 
sample sizes have investigated the role of low-dose risperidone 
on first-degree relatives of individuals with schizophrenia pre- 
senting with negative symptoms and/or neurocognitive deficits 
(schizotaxia). These studies have provided evidence for a reduction 
of negative symptoms and cognitive deficits in these individuals [5, 
47, 48]. Another study of standard vs integrated treatment (assertive 
community treatment, social skills training, and psychoeducation 
groups to patients and families) in schizotypal personality disorder 
individuals found lower transition rates to FEP in the integrated 
treatment group at 2-year follow-up (25% vs 48%) [49]. 

In accordance with the popularity of the CHR concept, a number 
of studies have investigated the effects of interventions with psycho- 
social and medication/dietary supplements in help-seeking individ- 
uals meeting CHR criteria. Many of these studies included control 
groups (that is, standard treatment and placebo). Some studies have 
reported reductions in symptom severity in both EI and control 
groups; however, most used conversion to psychosis as their primary 
outcome. Importantly, several meta-analyses of RCTs suggested that 
EI has generally produced significantly reduced conversion rates at 
6- to 48-month follow-up [50, 51]. 

The use of dopamine antagonists in CHR individuals has been 
trialled early but has led to controversy about the ethics of this ap- 
proach at such an early ‘pre-psychosis’ stage. Various drugs have 
been examined, including risperidone, aripiprazole, perospirone, 
and amisulpride, as a standalone or in combination with psycho- 
logical treatment [50, 51], with few of these studies being RCTs [52- 
55]. These studies provided some evidence for efficacy of drugs in 
reducing the transition rate at follow-up. The ethical issues in pre- 
scribing dopamine antagonists in CHR include exposure of false- 
positive individuals to these medications and their side effects. 

Alternative, less invasive pharmacological interventions showed 
initial promise and did not suffer from such ethical issues. These in- 
clude the use of dietary supplementation with long-chain omega- 
3 polyunsaturated fatty acids (PUFAs/fish oil), which do not have 
any clinically relevant adverse effects. Initial findings suggested that 
PUFAs can be beneficial in improving outcomes, as suggested by the 
Vienna Omega-3 Study. This was the first RCT of omega-3 PUFAs 
in CHR, demonstrating that a 12-week intervention was associated 
with a significantly reduced risk of experiencing an FEP during a 
12-month follow-up (5% for omega-3 PUFAs vs 28% for placebo) 
and with significant symptomatic and functional improvements 
[56]. Recently, findings from a longer follow-up (median = 6.7years) 
of the same individuals suggested that benefits of this intervention 
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might be sustained. The cumulative conversion rate to psychosis at 
the longer-term follow-up was 9.8% (4/41) of subjects in the omega- 
3 PUFA group, and 40% (16/40) of subjects in the placebo group. 
This study was followed by two larger replication studies that investi- 
gated the effect of omega-3 PUFAs as an EI strategy. NEURAPRO-E 
is a multi-centre RCT (involving 304 participants), which compared 
outcome in one group treated with omega-3 PUFAs plus cognitive 
behavioural case management (CBCM) with a placebo plus CBCM 
group. Unfortunately, the outcome of NEURAPRO-E has not rep- 
licated the findings of the Vienna trial. In this study, there was no 
advantage of PUFAs, in comparison to placebo, for change in symp- 
toms and functioning and 12-month transition rates (11.2% vs 
11.5%) [57]. Similarly, the rate of psychotic conversion did not differ 
in the omega-3 fatty acid (13%) vs placebo (8%) samples in the re- 
cent NAPLS Omega-3 Fatty Acid Versus Placebo Study [58]. 

As a summary, studies in CHR suggest that psychological treat- 
ments and dopamine antagonists might reduce the incidence of 
developing a full-blown psychotic episode at 6- to 48-month follow- 
up. The findings regarding the effect of PUFAs are controversial; 
however, the larger multi-centre studies would suggest there is 
limited benefit. Importantly, the long-term effects of these psycho- 
social and pharmacological interventions on transition rates, func- 
tioning, and symptoms have not been clarified. Given the ethical 
issues of using potentially invasive treatments to groups of individ- 
uals with relatively low transition rates, psychosocial approaches are 
the mainstay of EI in CHR. 





Early intervention in first-episode psychosis 


Treatment of early psychosis 
Pharmacological 


Dopamine antagonists are the mainstay of treatment of psychosis 
and have an important role in treating acute episodes and reducing 
relapse rates in schizophrenia. Large multi-centre trials provide sup- 
port for their use in early psychosis, as demonstrated by the European 
First Episode Schizophrenia Trial (EUFEST) [59]. In this study 
(N = 498), second-generation antipsychotics (SGAs) (also referred 
to as atypical antipsychotics) were tolerated better and were associ- 
ated with lower discontinuation rates. The term ‘second-generation 
antipsychotic’ was the terminology used at the time to distinguish a 
heterogenous group of new drugs from classical or ‘first-generation’ 
drugs for psychosis. This terminology will be used to describe the re- 
sults of the trial, but it is not otherwise to be encouraged for reasons 
given in Chapter 10. While an initial analysis did not find signifi- 
cant differences between SGAs and low-dose haloperidol [a first- 
generation antipsychotic (FGA), classical or typical antipsychotic], 
subsequent analysis demonstrated a higher response and remission 
rate for most SGAs, compared to low-dose haloperidol [60], while 
there was a similar level of improvement in cognition [61]. A meta- 
analysis of drugs for psychosis in FEP indicated that olanzapine and 
amisulpride were relatively more effective than older drugs [62]. In 
considering the evidence in FEP, however, it is important to con- 
sider issues such as compliance and engagement with treatment 
strategies. Further, there are important decisions about duration 
of use of medication in young people, given the issue of diagnostic 
uncertainty at this stage of illness and the impact of adverse events 


caused by these medications. Thus, milder forms of the illness will be 
apparent at this early stage; symptom remission will be more likely, 
and affective psychoses or substance-related psychotic disorders 
may have differing outcomes. Therefore, smaller doses might be suf- 
ficient in many patients with FEP, compared to chronic patients, and 
early dose reduction, in comparison to maintenance, might be asso- 
ciated with better functional outcome [63] and reduced side effect 
burden. In the short term, younger people might be susceptible to 
developing extra-pyramidal side effects, especially with older dopa- 
mine antagonist drugs, while the newer dopamine antagonists have 
histamine and serotonin-blocking properties associated with weight 
gain and metabolic syndrome. Weight gain is a particularly im- 
portant issue in young people (that is, negative effect on self-esteem 
and peer relations) that can lead to non-adherence. Further, weight 
gain and metabolic syndrome can be associated with increased cog- 
nitive deficits, morbidity, and mortality in the long term. Most drugs 
for psychosis, with some exceptions (for example, ziprasidone), are 
associated with weight gain in FEP [64]. 

Like in chronic schizophrenia, psychological treatments might 
benefit individuals with FEP. Evidence suggests that CBT and family 
interventions are effective in FEP [65]. Recently, a number of studies 
have investigated the role of cognitive remediation in FEP, sum- 
marized in a meta-analysis [66], which confirmed significant, but 
modest, gains for verbal memory and social cognition, although the 
domain being examined may be important to consider [67]. Social 
skills training, psychoeducation, and vocational training have also 
been used in early psychosis [68, 69]. 


Psychosocial interventions 


Most of the available early interventions in CHR studies used psycho- 
logical interventions, with CBT being most commonly employed. 
Others [70] examined family-focused therapy (FFT), including 
a study investigating the effects of psychoeducational multifamily 
group treatment [71]. A number of these studies combined elem- 
ents of various CBT, social skills training, psychoeducation, and 
family intervention and support (integrated treatments) [72-74]. 
A few studies investigating the effect of cognitive remediation in 
CHR [75, 76] have found that psychological interventions and cog- 
nitive remediation are associated with improved outcomes during 
the intervention period. Such studies are consistent with the no- 
tion that less invasive interventions would be appropriate at earlier 
illness stages, as proposed in the staging model. However, it should 
be noted that such interventions may also be useful at later illness 
stages. 


Early intervention services 


Specialized early intervention programmes for FEP 


Programmes providing interventions for FEP are becoming com- 
monplace in anumber of developed countries. These services provide 
specialized assertive EI to improve outcomes in psychotic disorders. 
While implementing these programmes is costly and resource- 
intensive, it has been hypothesized that they can improve long-term 
outcomes (and decrease overall costs) as the first 3-5 years after 
FEP have been suggested to be a critical period for making a mean- 
ingful change to the future course of the illness. Another important 
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concept for EI in FEP is the duration of untreated psychosis (DUP). 
Patients with psychotic symptoms typically seek help for their symp- 
toms after a considerable delay. Evidence suggests that longer DUP 
might be associated with poor prognosis [77]. Therefore, it is hoped 
that reducing DUP can improve long-term outcomes in FEP. EI in 
early psychosis can also reduce complications of onset of illness, 
including substance abuse, suicide, and inter-personal problems 
and family burden. EI programmes include intensive case manage- 
ment with low caseloads, assertive outreach, psychiatric treatment, 
and easy access to staff, providing psychoeducation, family interven- 
tions, and other psychosocial supports, which might include psy- 
chological treatments, social skills training, and vocational training. 
In most of these models, a specialist/standalone multi-disciplinary 
team provides Els. Several alternative approaches, including the 
‘Hub and Spoke’ model and enhanced community mental health 
care, also exist [78]. ‘Hub and Spoke’ models aim to overcome the 
challenges of providing EI in remote and rural areas, in which spe- 
cialist staff within community systems are supported by a central 
‘hub. In the enhanced community mental health care (‘specialist 
within a generalist’) model, staff in the community team have an EI 
role for some clients, in addition to their usual role. 

It is also important to note that integrative treatment for psychotic 
disorders was initially developed in chronic patients. The original 
integrative treatment approach is based on a ‘chronic disease’ model, 
in which individuals need to be supported by intensive case man- 
agement and specialist services, in order to function in the society 
or residential units. Studies on chronic schizophrenia have shown 
benefits of integrative treatment, in comparison to treatment as 
usual [79]. In the chronic disease model, it is not expected that long- 
term functioning of individuals would significantly improve a few 
years following discharge from such services. However, the promise 
of integrative treatment in EI services is changing the illness course 
and improving long-term functioning by intervening in the ‘critical 
period’ and by reducing the need of individuals for such services in 
the coming years. Therefore, it is important to test the long-term 
functioning of individuals and their capacity to maintain gains once 
the EI programme is ceased. 


Evidence for short-term efficacy of El in FEP 


Currently, there are many EI services in Australia, North America, 
Europe, and Asia. A number of RCTs have investigated the efficacy 
of such programmes in comparison to treatment as usual (TAU). 
The Danish OPUS trial, which randomized 547 patients with FEP 
to either a 2-year specialized assertive EI programme or standard 
treatment, is the most comprehensive study to date that has exam- 
ined short- and long-term outcomes [80]. Results of the OPUS 
trial showed positive effects on psychotic and negative symptoms, 
treatment non-adherence, level of functioning, secondary sub- 
stance abuse, level of user, and family satisfaction in the EI group, 
in comparison to the TAU group. There was a reduction in the 
number of days patients used beds in the EI group, compared to the 
TAU group. A more recent EI trial—the Recovery After an Initial 
Schizophrenia Episode Early Treatment Program [NIMH RAISE- 
ETP (NAVIGATE)]—included 404 individuals with FEP. This study 
reported greater improvement in quality of life and symptoms, in 
comparison to usual community care. A number of smaller trials, 
including the smaller Lambeth Early Onset (LEO) study [82], the 
International Optimal Treatment (IOT) study subset [83], the 


Croydon Outreach and Assertive Support Team (COAST) [84], the 
Specialized Treatment Early in Psychosis (STEP) [85], the Psychosis 
early Intervention and Assessment of Needs and Outcome (PIANO) 
[86], and the Early Psychosis Intervention Center (EPICENTER) 
[87] studies, had already reported similar benefits during the inter- 
vention period. Several studies recruited individuals who already 
completed an EI service [OPUS, Early Assessment Service for 
Young People with Psychosis (EASY) and Canadian Prevention and 
Early Intervention Program for Psychoses (PEPP)] and random- 
ized them to prolonged EI (additional 1-3 years) and regular care 
groups [88, 89]. These studies found that prolonged EI was associ- 
ated with sustained benefits, in comparison to the regular care group 
within the intervention period. It has been suggested that EI services 
might be cost-effective, as increased costs of integrated treatment are 
balanced by reduced days of inpatient admissions [85, 90]. In sum- 
mary, the current evidence suggests that EI services are better than 
TAU in ordinary community mental health teams during the inter- 
vention period. This finding is similar to the outcome of integrative 
treatment studies in chronic schizophrenia [91]. 


Evidence for long-term efficacy of El in FEP 
after ceasing intervention 


Evidence suggests that EI can significantly reduce the DUP, however, 
the clinical significance of this finding is unclear. The Treatment and 
Intervention in Psychosis Study (TIPS) attempted to answer this 
question using a quasi-experimental design (92). The TIPS study 
compared the outcome of individuals living in areas where public 
information campaigns and low-threshold early detection teams 
were established, compared to individuals living in other areas. In 
early detection areas, DUP was significantly reduced (from 16 to 
4 weeks), but the clinical significance of this finding was dubious. 
At 10 years, half of the individuals in both areas were not remitted 
[93]. In the same study, the recovery rate was better in individuals in 
the early detection area, as they were more likely to be in full-time 
employment (27% vs 11%). However, it is likely that baseline dif- 
ferences between groups can explain the observed findings, as pa- 
tients from the early detection area had fewer positive and negative 
symptoms at presentation. Thus, low-threshold early detection ap- 
proaches are likely to recruit a higher percentage of individuals with 
milder psychotic illness and shorter DUP. Clearly, better controlled 
studies are required to address this important question. 

RCT studies comparing the long-term outcome of EI services and 
TAU can be more informative. A critical question for EI outcome re- 
search is whether time-limited intervention during a ‘critical period, 
unlike similar interventions in chronic patients, dramatically im- 
proves long-term outcomes, in comparison to TAU, after individ- 
uals are discharged to regular services. To date, several groups have 
reported their findings regarding long-term outcomes. In contrast 
to findings suggesting short-term benefits of EI, a number of 5- to 
12-year follow-up findings reported in the OPUS [80, 94], IOT [95], 
and LEO [96] RCTs have not found evidence for sustained benefits 
in EI groupa, in comparison to TAU groups. Current evidence does 
not support the long-term sustaining benefits of EI after ceasing 
treatment. It was suggested that the duration of treatment in the 
EI service can be important for maintaining outcomes in the long 
term and that 2-3 years of intervention might be insufficient. As dis- 
cussed, several recent ongoing studies investigated the effect of pro- 
longed EI and recently completed prolonged intervention periods 
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[89, 97]. The Jockey Club Early Psychosis (JCEP) project is another 
ongoing study comparing effectiveness of 4 years and 2 years of 
EI and standard care [98]. When completed, 5- and 10-year post- 
intervention outcome findings of these studies may show whether 
prolonged EI, unlike the 2-year intervention, has sustained bene- 
fits after intervention is ceased. The findings to date therefore raise 
important questions about the benefits of EI services, unless such 
intervention can be sustained in the long term, perhaps even beyond 
an extended EI programme. Evidence is needed to inform decisions 
about the configuration of services and the balance between EI vs 
post-EI services. 


Current evidence suggests that integrative treatment in early psych- 
osis is more successful than ‘treatment as usual’ care in reducing 
symptoms and relapse rates and improving engagement. Offering 
high-quality, integrative, intensive, needs-based services that in- 
corporate age-specific/relevant goals at all stages of psychotic dis- 
orders, including early, medium, or late stages, is the best service 
we can offer at this point in time. Emphasizing the importance of 
optimism and instilling hope for symptomatic and functional re- 
covery should be implemented across all stages of psychosis. Early 
detection and intervention services and public awareness cam- 
paigns are important to increase the possibility for individuals 
with psychotic disorders to access help in the early stages of the 
illness. For individuals having potential prodromal symptoms, 
given the significant issue of false-positives identified by CHR cri- 
teria, together with the clinical heterogeneity of this population, a 
generic psychological treatment approach remains the main inter- 
vention for the foreseeable future. Studies investigating the tra- 
jectory of developmental antecedents of psychotic disorders and 
the effectiveness of pre-emptive interventions are clearly needed. 
Continuing research efforts to develop better and highly accurate 
markers of psychotic disorders and other mental disorders at pre- 
morbid and prodromal stages are critical for true EI, which could 
reduce the incidence and/or improve long-term outcomes for 
psychotic disorders. 


Acknowledgements 


Prof C. Pantelis was supported by a NHMRC Senior Principal 
Research Fellowship (ID: 1105825). The authors contributed equally 
in the preparation of this chapter. 


REFERENCES 


1. Thomas N, Pantelis C. Psychotic disorders. In: Cautin R, Lilienfeld 
S (eds). The Encyclopedia of Clinical Psychology. John Wiley & 
Sons, New York, NY; 2015. p. 16. 

2. Yung AR, McGorry PD. The initial prodrome in psych- 
osis: descriptive and qualitative aspects. Aust N Z J Psychiatry. 
1996;30:587-99. 

3. Yung AR, Phillips LJ, McGorry PD, et al. Prediction of psych- 
osis. A step towards indicated prevention of schizophrenia. Br J 
Psychiatry Suppl. 1998;172:14-20. 


10. 


11. 


12: 


13. 


14. 


15: 


16. 


We 


18. 


19: 


20. 


217 


22 


Johnstone EC, Lawrie SM, Cosway R. What does the Edinburgh 
high-risk study tell us about schizophrenia? Am J Med Genet. 
2002;114:906-12. 

Tsuang MT, Stone WS, Tarbox SI, Faraone SV. Treatment of 
nonpsychotic relatives of patients with schizophrenia: six case 
studies. Am J Med Genet. 2002;114:943-8. 

Schultze-Lutter F, Michel C, Schmidt SJ, et al. EPA guidance on 
the early detection of clinical high risk states of psychoses. Eur 
Psychiatry. 2015;30:405-16. 

Huber G, Gross G. The concept of basic symptoms in schizo- 
phrenic and schizoaffective psychoses. Recenti Prog Med. 
1989;80:646-52. 

Schultze-Lutter E Subjective symptoms of schizophrenia in re- 
search and the clinic: the basic symptom concept. Schizophr Bull. 
2009;35:5-8. 

Klosterkotter J, Schultze-Lutter F, Ruhrmann S. Kraepelin and 
psychotic prodromal conditions. Eur Arch Psychiatry Clin 
Neurosci. 2008;258 Suppl 2:74-84. 

Quee PJ, van der Meer L, Bruggeman R, et al. Insight in psych- 
osis: relationship with neurocognition, social cognition and 
clinical symptoms depends on phase of illness. Schizophr Bull. 
201137-292537; 

Pantelis C, Yücel M, Wood SJ, et al. Structural brain imaging 
evidence for multiple pathological processes at different 

stages of brain development in schizophrenia. Schizophr Bull. 
2005;3 1:672-96. 

American Psychiatric Association. Diagnostic and statistical 
manual of mental disorders: DSM-IV-TR, fourth edition, text revi- 
sion. American Psychiatric Association, Washington, DC; 2000. 
Yung AR, McGorry PD. The prodromal phase of first-episode 
psychosis: past and current conceptualizations. Schizophr Bull. 
1996;22:353-70. 

Fusar-Poli P, BonoldiI, Yung AR, et al. Predicting psych- 

osis: meta-analysis of transition outcomes in individuals at high 
clinical risk. Arch Gen Psychiatry. 2012;69:220-9. 

Fusar-Poli P, Radua J, McGuire P, Borgwardt S. Neuroanatomical 
maps of psychosis onset: voxel-wise meta-analysis 

of antipsychotic-naive VBM studies. Schizophr Bull. 
2012;38:1297-307. 

Pantelis C, Velakoulis D, McGorry PD, et al. Neuroanatomical 
abnormalities before and after onset of psychosis: a cross- 
sectional and longitudinal MRI comparison. Lancet. 
2003;361:281-8. 

Cropley VL, Pantelis C. Using longitudinal imaging to map 

the ‘relapse signature’ of schizophrenia and other psychoses. 
Epidemiol Psychiatr Sci. 2014;23:219-25. 

van Os J, Guloksuz S. A critique of the “ultra-high risk” and 
“transition” paradigm. World Psychiatry. 2017;16:200-6. 
Johnstone EC, Ebmeier KP, Miller P, Owens DG, Lawrie SM. 
Predicting schizophrenia: findings from the Edinburgh High- 
Risk Study. Br J Psychiatry. 2005;186:18-25. 

Fusar-Poli P, Borgwardt S, Bechdolf A, et al. The psychosis 
high-risk state: a comprehensive state-of-the-art review. JAMA 
Psychiatry. 2013;70:107-20. 

Yung AR, Yuen HP, McGorry PD, et al. Mapping the onset of 
psychosis: the Comprehensive Assessment of At-Risk Mental 
States. Aust N Z J Psychiatry. 2005;39:964-71. 

Fusar-Poli P, Cappucciati M, Borgwardt S, et al. Heterogeneity 
of Psychosis Risk Within Individuals at Clinical High Risk: 

A Meta-analytical Stratification. JAMA Psychiatry. 2016; 

73- NBEO: 





731 SECTION 9 Schizophrenia and psychotic disorders 





23. 


24. 


25: 


26. 


fe 


28. 


297 


30. 


ale 


32 


33 


34. 


Boe 


36. 


37: 


38. 


39 


40. 


Kaymaz N, Drukker M, Lieb R, et al. Do subthreshold psych- 
otic experiences predict clinical outcomes in unselected non- 
help-seeking population-based samples? A systematic review 
and meta-analysis, enriched with new results. Psychol Med. 
2012;42:2239-53. 

McGorry PD, Hickie IB, Yung AR, Pantelis C, Jackson HJ. 
Clinical staging of psychiatric disorders: a heuristic framework 
for choosing earlier, safer and more effective interventions. Aust 
N Z J Psychiatry. 2006;40:616-22. 

Wood SJ, Yung AR, McGorry PD, Pantelis C. Neuroimaging 
and treatment evidence for clinical staging in psychotic dis- 
orders: from the at-risk mental state to chronic schizophrenia. 
Biol Psychiatry. 2011;70:619-25. 

McGorry PD. Risk syndromes, clinical staging and DSM V: new 
diagnostic infrastructure for early intervention in psychiatry. 
Schizophr Res. 2010;120:49-53. 

Koutsouleris N, Gaser C, Bottlender R, et al. Use of neuroana- 
tomical pattern regression to predict the structural brain dy- 
namics of vulnerability and transition to psychosis. Schizophr 
Res. 2010;123:175-87. 

Gottesman, II, Shields J. A polygenic theory of schizophrenia. 
Proc Natl Acad Sci U S A. 1967;58:199-205. 

Schizophrenia Working Group of the Psychiatric Genomics 
Consortium. Biological insights from 108 schizophrenia- 
associated genetic loci. Nature. 2014;511:421-7. 

Bora E. Neurodevelopmental origin of cognitive impairment in 
schizophrenia. Psychol Med. 2015;45:1-9. 

McGrath JJ, Murray RM. Risk factors for schizophrenia: from 
conception to birth. In: Hirsch SR, Weinberger DR, editors. 
Schizophrenia, second edition. Blackwell Science, Oxford; 2003. 
pp. 232-50. 

Van Os J, Rutten BP, Myin-Germeys I, et al. Identifying gene- 
environment interactions in schizophrenia: Contemporary chal- 
lenges for integrated, large-scale investigations. Schizophr Bull. 
2014;40:729-36. 

Bao Y, Ibram G, Blaner WS, et al. Low maternal retinol as a 

risk factor for schizophrenia in adult offspring. Schizophr Res. 
2012;137:159-65. 

Brown AS, Bottiglieri T, Schaefer CA, et al. Elevated prenatal 
homocysteine levels as a risk factor for schizophrenia. Arch Gen 
Psychiatry. 2007;64:31-9. 

Picker JD, Coyle JT. Do maternal folate and homocysteine levels 
play a role in neurodevelopmental processes that increase risk for 
schizophrenia? Harv Rev Psychiatry. 2005;13:197-205. 

Ross RG, Hunter SK, McCarthy L, et al. Perinatal choline effects 
on neonatal pathophysiology related to later schizophrenia risk. 
Am J Psychiatry. 2013;170:290-8. 

Cheatham CL, Goldman BD, Fischer LM, da Costa KA, Reznick 
JS, Zeisel SH. Phosphatidylcholine supplementation in pregnant 
women consuming moderate-choline diets does not enhance 
infant cognitive function: a randomized, double-blind, placebo- 
controlled trial. Am J Clin Nutr. 2012;96:1465-72. 

Boeke CE, Gillman MW, Hughes MD, Rifas-Shiman SL, Villamor 
E, Oken E. Choline intake during pregnancy and child cognition 
at age 7 years. Am J Epidemiol. 2013;177:1338-47. 

McGrath J, Saari K, Hakko H, et al. Vitamin D supplementation 
during the first year of life and risk of schizophrenia: a Finnish 
birth cohort study. Schizophr Res. 2004;67:237-45. 

McGrath JJ, Eyles DW, Pedersen CB, et al. Neonatal vitamin D 
status and risk of schizophrenia: a population-based case-control 
study. Arch Gen Psychiatry. 2010;67:889-94. 


41. 


42. 


43. 


44. 


45. 


46. 


47. 


48. 


49. 


50. 


pile 


52: 


590 


54. 


53. 


56. 


Newman L, Judd F, Olsson CA, et al. Early origins of mental 
disorder—risk factors in the perinatal and infant period. BMC 
Psychiatry. 2016;16:270. 

Patnode CD, O’Connor E, Rowland M, Burda BU, Perdue LA, 
Whitlock EP. Primary care behavioral interventions to prevent 

or reduce illicit drug use and nonmedical pharmaceutical use in 
children and adolescents: a systematic evidence review for the U.S. 
Preventive Services Task Force. Ann Intern Med. 2014;160:612-20. 
Stoltz S, van Londen M, Dekovic M, de Castro BO, Prinzie P, 
Lochman JE. Effectiveness of an individual school-based inter- 
vention for children with aggressive behaviour: a randomized 
controlled trial. Behav Cogn Psychother. 2013;41:525-48. 
Waasdorp TE, Bradshaw CP, Leaf PJ. The impact of schoolwide 
positive behavioral interventions and supports on bullying and 
peer rejection: a randomized controlled effectiveness trial. Arch 
Pediatr Adolesc Med. 2012;166:149-56. 

Stafford E, Hides L, Kavanagh DJ. The acceptability, usability 
and short-term outcomes of Get Real: a web-based program 

for psychotic-like experiences (PLEs). Internet Interventions. 
2015;2:266-71. 

Uher R, Cumby J, MacKenzie LE, et al. A familial risk enriched 
cohort as a platform for testing early interventions to prevent 
severe mental illness. BMC Psychiatry. 2014;14:344. 

Rybakowski JK, Drozdz W, Borkowska A. Long-term adminis- 
tration of the low-dose risperidone in schizotaxia subjects. Hum 
Psychopharmacol. 2007;22:407-12. 

Stone WS, Hsi X, Giuliano AJ, et al. Are neurocognitive, clinical 
and social dysfunctions in schizotaxia reversible pharmaco- 
logically?: Results from the Changsha study. Asian J Psychiatr. 
2012;5:73=82. 

Nordentoft M, Thorup A, Petersen L, et al. Transition rates from 
schizotypal disorder to psychotic disorder for first-contact pa- 
tients included in the OPUS trial. A randomized clinical trial 

of integrated treatment and standard treatment. Schizophr Res. 
2006;83:29-40. 

Schmidt SJ, Schultze-Lutter F, Schimmelmann BG, et al. EPA 
guidance on the early intervention in clinical high risk states of 
psychoses. Eur Psychiatry. 2015;30:388-404. 

van der Gaag M, Smit F, Bechdolf A, et al. Preventing a first epi- 
sode of psychosis: meta-analysis of randomized controlled pre- 
vention trials of 12 month and longer-term follow-ups. Schizophr 
Res. 2013;14956-62. 

Ruhrmann S, Bechdolf A, Kuhn KU, et al. Acute effects of 
treatment for prodromal symptoms for people putatively in a 
late initial prodromal state of psychosis. Br J Psychiatry Suppl. 
2007;51:s88-95. 

McGorry PD, Nelson B, Phillips LJ, et al. Randomized controlled 
trial of interventions for young people at ultra-high risk of psych- 
osis: twelve-month outcome. J Clin Psychiatry. 2013;74:349-56. 
McGorry PD, Yung AR, Phillips LJ, et al. Randomized controlled 
trial of interventions designed to reduce the risk of progression 
to first-episode psychosis in a clinical sample with subthreshold 
symptoms. Arch Gen Psychiatry. 2002;59:921-8. 

McGlashan TH, Zipursky RB, Perkins D, et al. Randomized, 
double-blind trial of olanzapine versus placebo in patients 
prodromally symptomatic for psychosis. Am J Psychiatry. 
2006;163:790-9. 

Amminger GP, Schafer MR, Papageorgiou K, et al. Long-chain 
omega-3 fatty acids for indicated prevention of psychotic dis- 
orders: a randomized, placebo-controlled trial. Arch Gen 
Psychiatry. 2010;67:146-54. 


577 


58. 


59. 


60. 


61. 


62. 


63. 


64. 


65. 


66. 


67. 


68. 


69. 


70. 


TAL. 


CHAPTER 63 Prevention and early intervention in psychotic disorders | (=:¥/ 


McGorry PD, Markulev C, Nelson B, et al. The NEURAPRO-E 
study: a multicentre RCT of omega 3 fatty acids and cognitive 
behavioural case management for patients at ultra high risk of 
schizophrenia and other psychotic disorders. Schizophr Bull. 
2015;41 (Suppl 1):8322-3. 

Cadenhead K, Addington J, Cannon T, et al. Omega-3 Fatty Acid 
Versus Placebo in a Clinical High-Risk Sample From the North 
American Prodrome Longitudinal Studies (NAPLS) Consortium. 
Schizophr Bull. 2017;43(suppl 1):S16. 

Kahn RS, Fleischhacker WW, Boter H, et al.; EUFEST study 
group. Effectiveness of antipsychotic drugs in first-episode 
schizophrenia and schizophreniform disorder an open random- 
ised clinical trial. Lancet. 2008;371:1085-97. 

Boter H, Peuskens J, Libiger J, et al. Effectiveness of anti- 
psychotics in first-episode schizophrenia and schizophreniform 
disorder on response and remission: an open randomized clinical 
trial (EUFEST). Schizophr Res. 2009;115:97-103. 

Davidson M, Galderisi S, Weiser M, et al. Cognitive effects 

of antipsychotic drugs in first-episode schizophrenia and 
schizophreniform disorder: a randomized, open-label clinical 
trial (EUFEST). Am J Psychiatry. 2009;166:675-82. 

Zhang JP, Gallego JA, Robinson DG, Malhotra AK, Kane 

JM, Correll CU. Efficacy and safety of individual second- 
generation vs. first-generation antipsychotics in first-episode 
psychosis: a systematic review and meta-analysis. Int J 
Neuropsychopharmacol. 2013;16:1205-18. 

Wunderink L, Nieboer RM, Wiersma D, Sytema S, Nienhuis FJ. 
Recovery in remitted first-episode psychosis at 7 years of follow- 
up of an early dose reduction/discontinuation or maintenance 
treatment strategy: long-term follow-up of a 2-year randomized 
clinical trial. JAMA Psychiatry. 2013;70:913-20. 

Tek C, Kucukgoncu S, Guloksuz S, Woods SW, Srihari VH, 
Annamalai A. Antipsychotic-induced weight gain in first-episode 
psychosis patients: a meta-analysis of differential effects of anti- 
psychotic medications. Early Interv Psychiatry. 2016;10:193-202. 
Bird V, Premkumar P, Kendall T, Whittington C, Mitchell J, 
Kuipers E. Early intervention services, cognitive-behavioural 
therapy and family intervention in early psychosis: systematic 
review. Br J Psychiatry. 2010;197:350-6. 

Revell ER, Neill JC, Harte M, Khan Z, Drake RJ. A systematic re- 
view and meta-analysis of cognitive remediation in early schizo- 
phrenia. Schizophr Res. 2015;168:213-22. 

Pantelis C, Wannan C, Bartholomeusz CF, Allott K, McGorry PD. 
Cognitive Intervention in Early Psychosis—Preserving abilities 
versus remediating deficits. Curr Opin Behav Sci. 2015;4:63-72. 
Rinaldi M, Perkins R, McNeil K, Hickman N, Singh SP. The 
Individual Placement and Support approach to vocational re- 
habilitation for young people with first episode psychosis in the 
UK. J Ment Health. 2010;19:483-91. 

Liu P, Parker AG, Hetrick SE, Callahan P, de Silva S, Purcell 

R. An evidence map of interventions across premorbid, ultra- 
high risk and first episode phases of psychosis. Schizophr Res. 
2010;123:37-44. 

Miklowitz DJ, O’Brien MP, Schlosser DA, et al. Family-focused 
treatment for adolescents and young adults at high risk for psych- 
osis: results of a randomized trial. J Am Acad Child Adolesc 
Psychiatry. 2014;53:848-58. 

McFarlane WR, Levin B, Travis L, et al. Clinical and functional 
outcomes after 2 years in the early detection and intervention 
for the prevention of psychosis multisite effectiveness trial. 
Schizophr Bull. 2015;41:30-43. 


Tis 


73. 


74. 


TS, 


76. 


Te 


78. 


79. 


80. 


81. 


82. 


83. 


84. 


85. 


86. 


87. 


88. 





Addington J, Epstein I, Liu L, French P, Boydell KM, Zipursky 
RB. A randomized controlled trial of cognitive behavioral therapy 
for individuals at clinical high risk of psychosis. Schizophr Res. 
2011;125:54-61. 

Bechdolf A, Wagner M, Veith V, et al. Randomized controlled 
multicentre trial of cognitive behaviour therapy in the early ini- 
tial prodromal state: effects on social adjustment post treatment. 
Early Interv Psychiatry. 2007;1:71-8. 

Thompson E, Millman ZB, Okuzawa N, et al. Evidence-based 
early interventions for individuals at clinical high risk for 
psychosis: a review of treatment components. J Nerv Ment Dis. 
2015;203:342-51. 

Loewy R, Fisher M, Schlosser DA, et al. Intensive Auditory 
Cognitive Training Improves Verbal Memory in Adolescents and 
Young Adults at Clinical High Risk for Psychosis. Schizophr Bull. 
2016;42 Suppl 1:S118-26. 

Piskulic D, Barbato M, Liu L, Addington J. Pilot study of cogni- 
tive remediation therapy on cognition in young people at clinical 
high risk of psychosis. Psychiatry Res. 2015;225:93-8. 

Addington J, Van Mastrigt S, Addington D. Duration of untreated 
psychosis: impact on 2-year outcome. Psychol Med. 2004;34:277-84. 
Behan C, Masterson S, Clarke M. Systematic review of the 
evidence for service models delivering early intervention in 
psychosis outside the stand-alone centre. Early Intervention in 
Psychiatry. 2017;11:3-13. 

Lenroot R, Bustillo JR, Lauriello J, Keith SJ. Integrated treatment 
of schizophrenia. Psychiatr Serv. 2003;54:1499-507. 

Kane JM, Robinson DG, Schooler NR, et al. Comprehensive 
versus usual community care for first-episode psychosis: 2-year 
outcomes from the NIMH RAISE Early Treatment Program. Am 
J Psychiatry. 2016;173:362-72. 

Bertelsen M, Jeppesen P, Petersen L, et al. Five-year follow-up of 
a randomized multicenter trial of intensive early intervention vs 
standard treatment for patients with a first episode of psychotic 
illness: the OPUS trial. Arch Gen Psychiatry. 2008;65:762-71. 
Craig TK, Garety P, Power P, et al. The Lambeth Early Onset 
(LEO) Team: randomised controlled trial of the effectiveness of 
specialised care for early psychosis. BMJ. 2004;329:1067. 

Grawe RW, Falloon IR, Widen JH, Skogvoll E. Two years of con- 
tinued early treatment for recent-onset schizophrenia: a random- 
ised controlled study. Acta Psychiatr Scand. 2006;114:328-36. 
Kuipers E, Holloway F, Rabe-Hesketh S, Tennakoon L; Croydon 
Outreach and Assertive Support Team (COAST). An RCT of 
early intervention in psychosis: Croydon Outreach and Assertive 
Support Team (COAST). Soc Psychiatry Psychiatr Epidemiol. 
2004;39:358-63. 

Srihari VH, Tek C, Kucukgoncu S, et al. First-Episode Services 
for Psychotic Disorders in the U.S. Public Sector: A Pragmatic 
Randomized Controlled Trial. Psychiatr Serv. 2015;66:705-12. 
Ruggeri M, Bonetto C, Lasalvia A, et al. Feasibility and 
Effectiveness of a Multi-Element Psychosocial Intervention for 
First-Episode Psychosis: Results From the Cluster-Randomized 
Controlled GET UP PIANO Trial in a Catchment Area of 10 
Million Inhabitants. Schizophr Bull. 2015;41:1192-203. 
Breitborde NJ, Bell EK, Dawley D, et al. The Early Psychosis 
Intervention Center (EPICENTER): development and six-month 
outcomes of an American first-episode psychosis clinical service. 
BMC Psychiatry. 2015;15:266. 

Albert N, Melau M, Jensen H, et al. Five years of specialised early 
intervention versus two years of specialised early intervention 
followed by three years of standard treatment for patients with a 


£}: SECTION 9 Schizophrenia and psychotic disorders 





89. 


90. 


91. 


92% 


93% 


first episode psychosis: randomised, superiority, parallel group 
trial in Denmark (OPUS II). BMJ. 2017;356:16681. 

Chang WC, Chan GH, Jim OT, et al. Optimal duration of an early 
intervention programme for first-episode psychosis: randomised 
controlled trial. Br J Psychiatry. 2015;206:492-500. 

Hastrup LH, Kronborg C, Bertelsen M, et al. Cost-effectiveness 
of early intervention in first-episode psychosis: economic evalu- 
ation of a randomised controlled trial (the OPUS study). Br J 
Psychiatry. 2013;202:35-41. 

Reilly S, Planner C, Gask L, et al. Collaborative care approaches 
for people with severe mental illness. Cochrane Database Syst 
Rev. 20133;11:CD009531. 

Friis S, Vaglum P, Haahr U, et al. Effect of an early detection 
programme on duration of untreated psychosis: part of the 
Scandinavian TIPS study. Br J Psychiatry Suppl. 2005;48:s29-32. 
Hegelstad WT, Larsen TK, Auestad B, et al. Long-term follow-up 
of the TIPS early detection in psychosis study: effects on 10-year 
outcome. Am J Psychiatry. 2012;169:374-80. 


94. 


QS, 


96. 


OW 


98. 


Secher RG, Hjorthoj CR, Austin SF, et al. Ten-year follow-up of 
the OPUS specialized early intervention trial for patients with a 
first episode of psychosis. Schizophr Bull. 2015;41:617-26. 
Sigrunarson V, Grawe RW, Morken G. Integrated treatment 

vs. treatment-as-usual for recent onset schizophrenia; 12 year 
follow-up on a randomized controlled trial. BMC Psychiatry. 
2013;13:200. 

Gafoor R, Nitsch D, McCrone P, et al. Effect of early intervention 
on 5-year outcome in non-affective psychosis. Br J Psychiatry. 
2010;196:372-6. 

Albert N, Melau M, Jensen H, et al. Five years of specialised early 
intervention versus two years of specialised early intervention 
followed by three years of standard treatment for patients with a 
first episode psychosis: randomised, superiority, parallel group 
trial in Denmark (OPUS II). BMJ. 2017;356:16681. 

Hui CL, Chang WC, Chan SK, et al. Early intervention and evalu- 
ation for adult-onset psychosis: the JCEP study rationale and 
design. Early Interv Psychiatry. 2014;8:261-8. 


ipsychotic and anticholinergic drugs 


t Y. Meltzer and William V. Bobo 




















The discovery by Delay and Denicker in 1953 that chlorpromazine 
was highly effective in alleviating delusions, hallucinations, and dis- 
organized thinking was the seminal breakthrough in the treatment 
of schizophrenia, the first agent to produce sufficient relief of core 
psychotic symptoms to permit life outside of institutions for many 
patients with schizophrenia, and even a return to a semblance of 
normal function for those individuals with relatively little cognitive 
impairment. Chlorpromazine and the other typical (first gener- 
ation) antipsychotic drugs introduced over the next 30 years were of 
immense benefit to people who experience psychotic symptoms as a 
component of a diverse group of neuropsychiatric and medical dis- 
orders, as well as drug-induced psychoses. Clinical and basic studies 
of these and the subsequent classes of drugs for psychosis have con- 
tributed to understanding the pathophysiology of schizophrenia 
and other forms of mental illness with psychotic features. The first 
generation of drugs for psychosis led to identifying the role of dopa- 
mine in psychosis, motor function, and some endocrine functions. 
This group of drugs led to the development of drugs with minimal 
motor side effects. These were marketed as ‘atypical antipsychotics, 
as opposed to the older ‘typical’ drugs; they are also sometimes re- 
ferred to as second- and third-generation antipsychotic drugs, 
and clozapine is the prototype. Their pharmacology is much more 
complex and involves, beyond interactions with dopamine recep- 
tors, direct and indirect effects on a variety or neurotransmitters, 
especially serotonin, glutamate, GABA, acetylcholine, histamine, 
and neuromodulators, such as brain-derived neurotrophic factor 
(BDNF), affecting synaptic structure and function. This chapter will 
describe the various classes of antipsychotic agents, with emphasis 
on the drugs with minimal motor side effects, their benefits and ad- 
verse effects, recommendations for use in clinical practice, and their 
mechanism of action (MOA). The drugs used to treat the extrapyr- 
amidal side effects (EPS) produced mainly by the typical dopamine 
antagonist drugs are also considered. 











As explained in Chapter 10, the broad classification of psychotropic 
drugs has usually been on the basis of indication. It has been argued 





there that we should move away from such classification to one based 
on the MOA. However, the present chapter explains how thinking 
around the MOA developed. The indication (antipsychotic) pre- 
ceded the understanding of the MOA. Therefore, the conventional 
terminology is used here to illustrate how the field has developed. In 
most other chapters, which describe the many uses of these drugs, 
the older terminology is avoided where possible. 

Antipsychotic drugs have been classified into two broad 
categories: typical and atypical [1]. Typical antipsychotic drugs are 
those which ‘typically’ produce EPS at clinically effective doses, 
including parkinsonism (muscle rigidity, tremor, bradykinesia), 
acute dystonic reactions, akathisia (restlessness), and tardive dyskin- 
esia. The efficacy of typical antipsychotics is mediated by blockade 
of dopamine D2 receptors in the dorsal striatum and limbic system, 
which includes the nucleus accumbens, stria terminalis, and amyg- 
dala [2, 3]. However, their blockade of D2 receptors in the dorsal 
striatum leads to the motor side effects described previously [3]. 

The typical antipsychotic drugs are members of a variety of chem- 
ical families (Table 64.1). They vary in their affinity for the D2 
receptor, with low-affinity drugs such as chlorpromazine, which re- 
quire high doses for clinical efficacy, to high-affinity drugs, such as 
haloperidol, which are effective at lower doses (‘Table 64.1). Whereas 
typical antipsychotics are associated with EPS at therapeutic doses, 
atypical antipsychotic drugs are associated with a lower (but not ab- 
sent) risk of EPS at clinically effective doses. As such, ‘atypicality’ 
of antipsychotic drugs is based on clinical effects on motor system 
functioning. Kapur and Seeman [4] proposed that the rate of dissoci- 
ation of all antipsychotic drugs from the D2 receptor provides the 
basis for the distinction between typical and atypical antipsychotic 
drugs, with atypical antipsychotic drugs dissociating more rapidly. 
While this is true for clozapine and quetiapine, the atypical drugs 
risperidone, sertindole, olanzapine, and asenapine dissociate no 
more rapidly or even slower than haloperidol. As such, ‘fast dissoci- 
ation’ cannot provide the pharmacological basis for atypicality for 
most of the drugs that are considered atypical. The pharmacologic 
basis for atypicality will be discussed subsequently. Sertindole is a 
phenylindole compound with a receptor profile which places it in 
the class of 5-HT2A/D2 antagonist atypical antipsychotic drugs [5]. 
Concerns about it causing QTc prolongation have limited its use [6]. 

Low-potency typical agents are those in which the usual dose 
range in schizophrenia is 2200 mg/day, whereas mid- to high- 
potency agents are those in which the usual dose range is between 
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Table 64.1 Selected antipsychotic drugs and classification 















































Drug name Trade name (examples from Chemical class General class D2 potency’ 
pharmaceutical companies in the 
United States) 
Aripiprazole Abilify” Dihydrocarbostyril Atypica 
Asenapine Saphris” Dibenzo-oxepino pyrrole Atypical 
Blonaserin Lonasen’” 4-phenyl-2-(1-piperazinyl) pyridine Atypical 
Brexpiprazole Rexulti” 4-piperazin-1-yl-4-benzo[B]thiophene Atypica 
Cariprazine Vraylar” Dichloro-phenyl-piperazine Atypica 
Chlorpromazine Thorazine” Phenothiazine Typica Low 
Clozapine Clozaril” Dibenzazepine Atypical 
Droperidol napsine” Butyrophenone Typica id 
Fluphenazine Prolixin” Phenothiazine Typica igh 
Haloperidol Haldol? Butyrophenone Typical igh 
lloperidone Fanapt™ Piperidinyl-benzisoxazole derivative Atypical 
Loxapine Loxitane” Dibenzazepine Typical id 
Lurasidone Latuda” Piperidinyl-benzisoxazole derivative Atypical 
Mesoridazine Serentil” Phenothiazine Typica Low 
Molindone oban’ Dihydroindolone Typica id 
Olanzapine Zyprexa’ Thiobenzodiazepine Atypical 
Paliperidone nvega” 9-hydroxy metabolite of risperidone Atypical 
Perphenazine Trilafon” Phenothiazine Typical Mid 
Pimozide Orap’ Butyrophenone Typical Mid 
Promazine Phenothiazine Typical Mid 
Quetiapine Seroquel” Dibenzothiazepine Atypical 
Risperidone Risperdal’, Risperdal CONSTA” Benzisoxazole Atypical 
Sertindole Serdolect’ Phenylindole Atypical 
Thioridazine Mellaril® Phenothiazine Typical Low 
Tiotixene Navane” Thioxanthene Typical High 
Trifluoperazine Stelazine” Phenothiazine Typical Mid 
Ziprasidone Geodon’ Benzisothiazole Atypical 


* Classification on the basis of potency of D2 receptor binding for typical antipsychotic drugs only. 


2 mg/day and 175 mg/day. In general, the low-potency drugs are 
associated with more sedative, orthostatic hypotensive, and anti- 
cholinergic adverse effects than the high-potency agents, but also 
have less of a tendency to produce EPS. All typical antipsychotics 
are equally effective to reduce psychosis, but because of secondary 
pharmacologic features, they differ from one another with regard to 
their adverse effect profiles, for example EPS, weight gain, sedation, 
hypotension, etc. [7]. 

Atypical antipsychotics are characterized by a more diverse and 
complex pattern of pharmacological activity. Most are serotonin [5- 
hydroxytryptamine (5-HT)] 2A and dopamine D2 antagonists or 
D2 partial agonists, and many have a variety of activities at other re- 
ceptors whose contribution to their mode of action is still being elu- 
cidated [2]. Substituted benzamides, for example amisulpride, also 
produce low EPS at clinically effective doses and may constitute an- 
other class of atypical agents. The prototypical atypical antipsychotic 
drug is clozapine, a dibenzodiazepine (Table 64.1) [8]. Other atyp- 
ical antipsychotic drugs are shown in Table 64.1 [8-11]. These drugs 
are all more potent 5-HT2A than D2 receptor antagonists, as well 


as multi-receptor antagonists [12, 13], except for aripiprazole and 
brexpiprazole, which act as dopamine D2 receptor partial agonists 
[14, 15]. Amisulpride is a unique atypical antipsychotic drug that acts 
as an antagonist at dopamine D2 and D3 receptors and at the sero- 
tonin 5-HT7 receptor. Cariprazine is a dopamine D2 and D3 receptor 
partial agonist, with high selectivity towards the D3 receptor [16]. As 
will be discussed, the atypical antipsychotic drugs differ not only with 
regard to pharmacological activity and adverse effects, but also with 
regard to efficacy [17, 18]. Atypical antipsychotic agents have been 
shown to have advantages, albeit modest, in treating negative and 
mood symptoms [19-21] and to improve some domains of cognitive 
dysfunction in patients with schizophrenia, related primary psych- 
otic disorders, and perhaps other psychiatric disorders [22-24]. 


Pharmacology 


There is abundant evidence that dopamine plays a key role in the 
aetiology of psychosis and the action of antipsychotic drugs [25]. 


The doses of typical antipsychotic drugs is highly correlated with 
their affinities for D2 receptors. Amphetamine and metham- 
phetamine, which increase synaptic concentrations of dopamine, 
exacerbate delusions and hallucinations in some patients with 
schizophrenia, due to stimulation of a subgroup of D2 receptors in 
mesolimbic nuclei [26, 27] and the dorsal striatum. The cell bodies 
of mesolimbic and dorsal striatal dopamine neurons reside in the 
ventral tegmentum and substantia nigra, respectively [28]. The 
outflow of these regions to the thalamus and the cortex has been 
postulated to mediate psychotic symptoms. As such, blockade of 
mesolimbic and striatal dopamine D2 receptors is thought to be 
the basis for drugs to ameliorate psychotic symptoms, although the 
firing rate of the mesolimbic dopaminergic neurons is subject to 
multiple influences, including stimulatory serotonergic input from 
the median raphe nucleus [29]. 

A group of ventral tegmental dopamine neurons project to 
various regions of the cortex and comprise the mesocortical dopa- 
mine system. There is extensive evidence that these neurons are im- 
portant for cognition, especially working memory [30], as well as 
negative symptoms [31]. Typical drugs occupy 80-95% of striatal D2 
receptors in patients with schizophrenia at clinically effective doses, 
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though much lower occupancy can lead to control of psychosis [32]. 
EPS occurs at above 80% occupancy of these receptors. Blockade 
of D2 receptors in the anterior pituitary gland is the basis for their 
ability to stimulate prolactin secretion [33]. 

The prefrontal cortex has relatively low concentrations of D2 re- 
ceptors and has a higher density of D1, D3, and D4 dopamine re- 
ceptors [27]. The activation of D1 receptors in the prefrontal cortex 
may be especially critical for normal working memory and other 
executive-type functions subserved by this brain region. However, 
no D1 agonists are currently approved. D1-positive allosteric modu- 
lators have shown promise in preclinical studies as cognitive en- 
hancers. Drugs which selectively block D4 receptors have not been 
found to have an antipsychotic effect [34]. There is extensive evi- 
dence that selective D3 receptor partial agonists and antagonists 
show promise to treat cognitive impairment and negative symptoms 
[35, 36]. The typical antipsychotic drugs vary in their in vitro and 
in vivo affinities for receptors such as the dopamine D1, histamine 
H1, muscarinic, al- and a2-adrenergic, and serotonergic receptors 
(Table 64.2), which mediate effects on arousal, extra-pyramidal, 
cognitive, cardiovascular, gastrointestinal, and genitourinary func- 
tion (Table 64.3) [15, 16, 37-42]. 


Table 64.2 Affinities of selected antipsychotic drugs at various neuroreceptors 








Aripiprazole 0.95 181.0 25.0 74.0 29.0 
Asenapine 2.0 15.0 0.8 0.3 11 16.1 29 243 
Blonanserin 0.7 804.0 0.8 26.0 27 530.0 765.0 100.0 
Brexpiprazole 0.3 0.1 0.5 0.2 0.6 19.0 Negligible 
Cariprazine DP 8.6 UN 6.9 6.7 <6.0 76 Negligible 
Chlorpromazine 2.0 23K Bre 26.0 0.28 184.0 0.18 47.0 
Clozapine 431.0 105.0 13.0 29.0 1.6 142.0 2.0 14.0 
Fluphenazine 0.54 145.0 7A 418.0 6.4 314.0 ve 21K 
Haloperidol 2.0 21K 73.0 210K 12.0 21K 23K 210K 
lloperidone Shs) 33.0 0.2 14.0 0.3 3.0 123 >1K 
Lurasidone 17 6.8 2.0 47.9 40.7 >1K >1K 
Loxapine 10.0 22K 38) 21.0 30 ISI 2.8 175.0 
Molindone 63.0 23K 320.0 210K 22K 21K 22K NA 
Olanzapine 72.0 22K 3.0 24.0 109.0 314.0 49 24.0 
9-OH risperidone’ 9.4 637.8 1 100.3 25 47 5.6 210K 
Perphenazine 14 421.0 5.6 132.0 10.0 810.5 8.0 NA 
Pimozide 0.65 650.0 19.0 23K WH 21K 692.0 800.0 
Quetiapine 567.0 431.0 366.0 21K 22.0 23K WS 858.0 
Risperidone 49 427.0 0.19 94.9 5.0 ISI 5,2 210K 
Sertindole 6.6 >1K 0.6 6.0 29 190.0 320.0 >1K 
Thioridazine 10.0 108.0 11.0 69.0 les} 134.0 14.0 33.0 
Tiothixene 14 410.0 111.0 21K 12.0 80.0 1210 210K 
Trifluoperazine 1s 950.0 13.0 378.0 24. 653.7 63.0 NA 
Ziprasidone 40 76.0 28 68.0 18.0 160.0 130.0 210K 
All receptor-binding affinities are reported as Ki (nM) using National Institutes of Mental Health (NIMH) Psychoactive Drug Screening Program (PDSP) certified data, available online at 
http://pdsp.cwru.edu/pdsp.php, unless otherwise specified. In general, the lower the Ki (nM) value, the higher the binding affinity for the drug at a given receptor site. 
A, human cloned receptor data not available. 





“9-hydroxy (9-OH) risperidone is marketed as paliperidone. 
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Table 64.3 Hypothesized therapeutic and adverse effects of receptor 

















occupancy by antipsychotic drugs 


Pharmacological activity Therapeutic effect(s) Adverse effect(s) 


Dopamine D2 Antagonism or partial agonist effects Reduction of positive symptoms Extrapyramidal effects (EPS) 
Hyperprolactinemia 
Serotonin (5-HT) 1A Full or partial agonist effects Cognitive enhancement 
Reduction of mood and anxiety symptoms 
5-HT2A Antagonism Reduction of negative symptoms 
Reduction of EPS 
Reduction of mood and anxiety symptoms 
ncreased deep sleep 
SEAE Antagonism Reduced anxiety symptoms Weight gain 
Adrenergic a-1 Antagonism Orthostatic hypotension 
Dizziness 
Adrenergic a-2 Antagonism Reflex tachycardia 
Histamine H-1 Sedation Sedation 
Drowsiness 
Weight gain 
Muscarinic (cholinergic) Antagonism Reduction of EPS Blurry vision 


M-1 


Exacerbation of acute angle closure glaucoma 
Sinus tachycardia 

Constipation 

Urinary retention 

Memory dysfunction 


Source: data from Psychopharmacol Bull., 35(4), Kelly DL, Love RC, Ziprasidone and the QTC interval: phar-macokinetic and pharmacodynamic considerations, pp. 66-79, Copyright 


(2001), MedWorks Media Global, LLC. 


Thioridazine is a relatively potent antimuscarinic agent. Most of 
the low-potency drugs for psychosis are potent a-1 and H1 antagon- 
ists, which contribute to orthostatic hypotension and weight gain, 
respectively. 

The receptor-binding affinities of the atypical antipsychotic drugs 
have been related to their efficacy and side effect profiles. As noted 
previously, the most important determinant of atypicality for most of 
the currently available agents of this type is that they are more potent 
5-HT2A than D2 receptor antagonists. An exception is aripiprazole, 
which combines potent 5-HT2A antagonism and 5-HT1A agonism, 
with partial D2 receptor agonism. Another exception is amisulpride, 
which is a selective D2/3 antagonist, with little pharmacological ac- 
tivity at 5-HT2A receptors. Combined 5-HT2A and less potent D2 
antagonism is the most consistent principle yet discovered to pro- 
duce a separation between antipsychotic action and interference 
with motor function [43]. The low potential for EPS with clozapine, 
and subsequently olanzapine, quetiapine, risperidone, iloperidone, 
ziprasidone, paliperidone, and asenapine is, in part, due to their rela- 
tively stronger 5-HT2 antagonist and weaker D2 antagonist prop- 
erties. Direct and indirect 5-HT1A partial and full agonist effects 
of the atypical antipsychotic drugs have also been shown to con- 
tribute to their low EPS profile. The atypical antipsychotic agents 
have the ability to increase prefrontal cortical dopaminergic ac- 
tivity, compared with subcortical dopaminergic activity [44]. The 
ability to increase the release of dopamine in the prefrontal cortex 
may be to improve cognition and negative symptoms. It may also 
contribute to decreasing the release of dopamine in the mesolimbic 
region, because prefrontal dopamine neurons modulate the activity 
of corticolimbic glutamatergic neurons that regulate the activity of 
ventral and dorsal striatal DA neurons [30]. Clozapine and some of 


the other atypical antipsychotic drugs that are also potent 5-HT2A 
antagonists also produce marked increases in prefrontal cortical 
and hippocampal acetylcholine efflux [45]. These atypical agents 
also produce marked increases in noradrenaline efflux in the pre- 
frontal cortex, which is correlated in time and magnitude with the 
increase in extracellular dopamine [46]. It is of interest that in ro- 
dents, combining ritanserin (a mixed 5-HT2A/2B/2C antagonist) or 
M-100907 (a selective 5-HT2A antagonist) with a selective D2/3 an- 
tagonist resulted in increased prefrontal dopamine release [47, 48]. 
The importance of serotonin receptors other than 5-HT2A for anti- 
psychotic action has received considerable attention. Activation of 
5-HT1A receptors are believed to have a dopamine-modulating ef- 
fect similar to that of 5-HT2A antagonism [49]. Under experimental 
conditions, 5-HT1A agonists have been shown to stimulate cortical 
dopamine release [50, 51], and in schizophrenic patients who were 
stabilized on haloperidol, the addition of tandospirone, a 5-HT1A 
partial agonist, resulted in improved neurocognitive performance 
[52]. This effect has also been demonstrated for buspirone, another 
5-HT1A partial agonist [53]. Antagonism of 5-HT2C receptors also 
appears to result in cortical dopamine and noradrenaline release, as 
well as in the nucleus accumbens [54]. As is the case with 5-HT1A 
activity, not all atypical antipsychotic drugs are active at 5-HT2C 
receptors (Table 64.2). Like antagonism at histamine H1 receptors 
[55], 5-HT2C antagonist activity may be related to weight gain [56]. 

Clozapine, olanzapine, risperidone, and quetiapine are able 
to block the interference in prepulse inhibition produced by d- 
amphetamine, apomorphine, or phencyclidine at doses that do not 
interfere with locomotor function [57]. Clozapine and M100907 are 
able to block the effects of phencyclidine, an N-methyl-D-aspartate 
(NMDA) receptor antagonist, on locomotor activity in rodents [58]. 


The extent of involvement of other atypical relative to typical drugs 
at NMDA receptors and other glutamatergic targets is an area of 
active interest. Other receptor targets that are of special interest in 
terms of improving cognitive functioning and selected psychotic 
symptoms include M1 muscarinic, a-7 nicotinic, and a-1 and a-2 
adrenergic receptors [59]. 





Administration, pharmacokinetics, and dosage 


Administration 
Typical antipsychotic drugs 


The major use is for the treatment of schizophrenia. Other major 
uses that will be touched on briefly in this chapter include treat- 
ment of mood disorders such as major depression (as an adjunct 
to antidepressants) and bipolar disorder. Indications discussed in 
other chapters of this textbook include Tourette's syndrome and ag- 
gression. The major advantage of the typical drugs is their ability 
to improve positive symptoms in patients with schizophrenia, that 
is, delusions and hallucinations, and to rapidly reduce psychotic 
symptoms, as well as the core non-psychotic signs and symptoms 
of manic episodes (including agitation, hostility, aggressive behav- 
iour, psychomotor agitation, and severe insomnia). Administration 
of typical drugs leads to the complete or nearly complete elimination 
of positive symptoms and disorganization of thought and affect in 
about 60-70% of patients with schizophrenia and an even higher 
proportion of those with psychotic mania and psychotic depres- 
sion [60]. The antipsychotic response in schizophrenia and mania 
is sometimes apparent within a few days in many patients, but max- 
imum benefit usually takes up to several weeks or months to unfold. 
A reasonable duration for a therapeutic trial with one of these agents 
is 4-6 weeks. It is not appropriate to switch medications after 1 or 2 
weeks, even if a response is not apparent, unless side effects pose a 
serious problem. Positive symptoms (delusions and hallucinations) 
do not respond to typical drugs in about 10% of schizophrenic pa- 
tients, even during the first episode [61]. Another 20% of patients 
with schizophrenia develop resistance to these agents during the 
subsequent course of their illnesses [62]. Development of resistance 
to typical drugs may occur at any time during the course of treat- 
ment, even after many years of control of positive symptoms. Such 
patients may respond to clozapine [63] or one of the other atypical 
drugs [61, 62]. The average doses of the typical neuroleptic drugs are 
given in Table 64.4. The best results with these drugs in terms of ef- 
ficacy and side effects may be expected with the lowest dose needed 
to produce control of positive symptoms with the fewest EPS [7, 64]. 

Controlled studies have failed to find benefits from high-dose strat- 
egies or combining two or more typical antipsychotics. Increasing 
the dose of these agents when patients fail to respond rapidly, for ex- 
ample within days, should be avoided. Augmentation with a benzo- 
diazepine may be useful to decrease anxiety until the lower doses of 
drugs produce adequate control of positive symptoms [7, 64]. Patients 
who receive high doses of these drugs adequately are at greater risk 
of hyperprolactinaemic effects, EPS, and tardive dyskinesia and are 
generally better treated with an atypical antipsychotic drug. 

However, improvement in positive symptoms is only one element 
in the treatment of schizophrenia. Additional efficacy factors of 
major importance are summarized in Table 64.5. 
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Tolerability and safety factors, such as compliance, tardive dys- 
kinesia, weight gain, and medical morbidity, are also major elements 
in outcome and influence the choice of a typical or atypical drug. 
Typical drugs are not as effective as atypical drugs for improving pri- 
mary negative symptoms [65]. Abnormalities in specific domains 
of cognition (Table 64.5) are present in first-episode schizophrenic 
patients at a moderate to severe level and show additional deteri- 
oration during the course of illness [66]. Approximately 85% of 
patients with schizophrenia are clinically impaired in one or more 
domains of cognition [67, 68]. Cognition has been shown to be the 
most critical determinant of functional capacity among patients 
with schizophrenia, even more so than positive symptoms [67]. 
Typical antipsychotic drugs usually do not improve cognitive func- 
tion [70]. Development of tardive dyskinesia may diminish the pos- 
sibility of obtaining benefit from atypical antipsychotic to improve 
cognition [71]. 

All of the typical drugs are likely to be equally effective in treating 
either the initial presentation or recurrent psychosis due to break- 
through of symptoms, despite compliance, or because of having 
stopped medication [7, 62, 64]. First-episode patients with schizo- 
phrenia usually require much lower doses than patients with two or 
more episodes, suggesting some progression of the disease process 
or the development of tolerance to the mechanism of action of these 
drugs [72]. Doses for more chronic patients should be in the range 
of 5-10 mg of haloperidol equivalents per day (Table 64.4) for up 
to 4-6 weeks, unless there is a major need for chemical means to 
prevent harm to self or others, to decrease excitement, or to induce 
sleep [73]. Auxiliary medications for anxiety and sleeplessness, for 
example benzodiazepines, may supplement these low doses [74]. 

Parenteral injections of haloperidol, chlorpromazine, or other 
dopamine antagonists/partial agonists may be needed for patients 
who refuse oral medication or where very rapid onset of action is 
needed to control acutely dangerous behaviours if less restrictive 
means either fail or cannot be utilized safely. Commonly, halo- 
peridol (2-10 mg) with or without lorazepam (2-4 mg) is delivered 
intramuscularly every 30-60 minutes, as required, up to three doses. 
Doses of haloperidol given intramuscularly in such situations gen- 
erally should not exceed 18 mg per day. Oral medication should be 
substituted as soon as feasible. If positive symptoms fail to respond 
to a single trial of a typical drug at adequate doses in patients with 
schizophrenia, there is evidence that switching to another typical 
antipsychotic, even of a different chemical class, is unlikely to pro- 
duce greater control [7, 60, 62]. This is likely to be true for other 
indications for the use of antipsychotic agents as well. 

In cases of repeated illness relapse due to poor compliance or 
when patients prefer it, the use of long-acting (for example, depot) 
injectable medications, typically administered once every 2-4 weeks, 
should be used. The use of injectable antipsychotic medication has 
been associated with lower rates of relapse and rehospitalization 
and greater global improvement, compared with oral typical anti- 
psychotics [75], possibly as a result of ensured drug delivery. Long- 
acting injectable drugs should not be given to ameliorate acute 
behavioural disturbances. 

In summary, typical antipsychotic drugs may be less desirable 
than typical antipsychotic drugs for some patients because of their 
greater risk for EPS, tardive dyskinesia, and prolactin elevations and 
lower adherence. On the other hand, typical antipsychotics are less 
expensive, on average, than atypical antipsychotic drugs. 
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Table 64.4 Oral dosing of antipsychotic drugs 






Typical antipsychotic drugs 


Equivalent doses | Starting dose Titration schedule Dose range (mg/day) 
(mg/day) 


























Chlorpromazine* 100 15-50 mg 2-4 times daily clinically indicate 300-1000 (divided once daily to 
four times Ve 
Fluphenazine? 2 0.5-10 mg/day (divided every 6-8 hours) As clinically indicated 5-20 
Haloperidol 2 0.5-5 mg twice daily As clinically indicated 520 
Loxapine 10 10 mg twice daily As clinically indicated 30-100 
esoridazine 50 50-400 
olindone 10 50-75 mg/day divided 3-4 times daily As clinically indicated 30-100 
Perphenazine® 10 4-8 mg three times daily (8-16 mg 2-4 times As clinically indicated 6-64 
daily if hospitalized) 
Thioridazine 100 50-100 mg three times daily As clinically indicated 300-800 
Tiotixene 5 2 mg three times daily As clinically indicated 5-50 
Trifluoperazine 5 2-5 mg twice daily As clinically indicated 5-50 
For elderly patients or those with renal or hepatic problems, the doses of drug may need to be reduced by one-half or more. 
* Short-acting IM formulation may be given 25-50 mg (may repeat after 1-4 hours, as required); may gradually increase the dose up to 400 mg IM every 4-6 hours 
(maximum of 2000 mg/day)—may be needed for severe cases. 
P Short-acting IM formulation may be given 2.5-10 mg/day at every 6- to 8-hour intervals; depot IM formulation may be given 12.5-25 mg every 3 weeks. 


© Short-acting IM formulation may be given 2-5 mg every 1-4 hours; depot IM formulation may be given at approximately 10-20 times the stable oral dose every 4 weeks. 
4 Short-acting IM formulation may be given 5-10 mg every 6 hours (maximum of 30 mg/day). 


Atypical antipsychotic drugs 
| | Startingdose e| Titration schedule Dose range (mg/day) 


























Asenapine 5 mg twice daily As clinically indicated, to usual effective dose range of 5-10 mg twice daily 5-10 mg twice daily 
Aripiprazole*® 10-15 mg daily As clinically indicated, every 2 weeks 0-30 
Blonanserin 4 mg twice daily As clinically indicated, to usual effective dose range of 4-12 mg twice daily 4-12 mg twice daily 
Brexpiprazole mg daily Increase to 2 mg/day on days 5-7, 2 mg/day or 4 mg/day on days 8+ 2-4 
Cariprazine 5 mg daily As clinically indicated, to usual effective dose range of 1.5-6.0 mg/day SeN 
Clozapine 2.5 mg once dailyto Increase by 25-50 mg/day until usual effective dose of 300-450 mg/day after 2-4 weeks 150-600 
twice daily 

lloperidone mg twice daily Increase to 2, 4, 6, 8, and 12 mg twice daily on treatment days 2, 3, 4, 5, 6, and 7 (respectively) 6-12 mg twice daily 
Lurasidone 40 mg daily (with food) As clinically indicated, to usual effective dose range of 40-160 mg/day (all doses with food) 40-160 
Olanzapine“? 5-10 mg daily As clinically indicated, by 5 mg/day every 7 days 0-30 
Paliperidone® 6 mg/day As clinically indicated, by 3 mg/day every 2-4 week increments, up to 12 mg daily 6-2 
Quetiapine 25 mg twice daily Increase by 25-50 mg 2-3 times daily on days 2 and 3, to target dose of 300-400 mg daily 300- 800 

(QD -TID) by day 4. Further increases as clinically indicated by 25-50 mg twice daily every 

2 days 
Risperidone’ 0.5-1 mg twice daily Increase by 0.5-1 mg twice daily on days 2 and 3, with further dose increases thereafter by 2-8 


0.5-1 mg increments every 7 days as required 


Ziprasidone® 20 mg twice daily with Increase by 20-40 mg twice daily every 2 days to target dose of 80 mg (all doses with food) 120-200 
food 


For elderly patients or those with renal or hepatic problems, doses of drug may need to be reduced by one-half or more and titration may be slower. 

è Short-acting IM formulation may be given at 9.75 mg, though the lower 5.25 mg dose may be indicated in some situations. 

° There are two long-acting IM formulations. Aripiprazole monohydrate may be initiated at 400 mg as a gluteal or deltoid injection (continue oral aripiprazole for 2 weeks), with once- 
monthly dosing at 400 mg (160-300 mg once monthly if there are adverse reactions, in pharmacogenetic CYP2D6 poor metabolizers, or if taken with drugs that inhibit CYP2D6 or 
3A4). Aripiprazole lauroxil may be initiated at 441-882 mg (continue oral aripiprazole for 3 weeks), followed by repeated administration every 4 weeks [441 mg (deltoid or gluteal 
injection), 662 mg (gluteal injection only), or 882 mg doses (gluteal injection only)] or every 6 weeks (882 mg dose only). 

€ Short-acting IM formulation may be given 10 mg as required (may be repeated after 2 hours, up to 30 mg/day). 

4 Long-acting IM formulation may be initiated at 210 mg or 300 mg every 2 weeks for a total of 8 weeks, followed by maintenance dosing (150-405 mg every 2-4 weeks). 

€ There are two long-acting IM formulations of paliperidone. The once-monthly long-acting injectable form is may be initiated at 234 mg (deltoid injection), followed by a 156 mg 
deltoid injection on day 8. Maintenance doses are given by deltoid or gluteal injection once monthly in the 39-234 mg dose range. The once-quarterly (every 3 months) long-acting 
injectable form can be considered after successful treatment with the once-monthly form for at least 4 months and may be initiated at 273-819 mg. 

f Long-acting IM formulation may be initiated at 25 mg Q 2 weeks (continue oral risperidone dose for 3 weeks), with increases as clinically indicated every 4 weeks up to a dose of 50 
mg Q 2 weeks. 

8 Short-acting IM formulation may be given 10-20 mg as required (may be repeated Q 2-4 hours as needed, up to 40 mg/day). 
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Table 64.5 Target signs and symptoms for the pharmacological management of schizophrenia 


A pesan 


Positive syndrome 


Negative syndrome 


Hostility/excitement 


Mood and anxiety symptoms 


Hallucinations 
Delusions 

Avolition 

Apathy 

Anhedonia 

Lack of responsiveness 
Poor rapport with others 
Passive social withdrawal 
Poverty of speech 
Affective flattening 


Verbal or physical aggression 


Depressed mood 
Anxious mood 


Typically the most amenable to treatment with all antipsychotic drugs 


Robustly correlated with functional impairment in schizophrenia 


More difficult to treat pharmacologically and may require longer to respond than 
positive signs and symptoms 


Pharmacological adjuncts may be needed, though understudied 


Atypical antipsychotic drugs are believed to be more efficacious than typical 
antipsychotics 


Typically amenable to treatment with all antipsychotic drugs 

Use of parenteral formulation may be required 

Believed to be more responsive to treatment with atypical antipsychotic drugs 
Clozapine has demonstrated superiority for treating chronic suicidality in schizophrenia 





ervousness 
Panic symptoms 
Suicidal ideation 


Cognitive impairment Disorientation 


(psychopathological definition) Problems with abstraction 
Attentional problems 


Preoccupations 





Disorganized thought processes 


Cognitive impairment 
(neuropsychological testing 
definition) 


Working memory 
Attention/vigilance 
Verbal learning/memory 
Visual learning/memory 
Problem-solving 
Processing speed 


Atypical antipsychotic drugs 


There are major advantages for the atypical drugs, and these agents 
are recommended as first-line treatment [76, 77]. They are the 
most widely prescribed drug treatment for schizophrenia, mania, 
and psychotic depression in clinical practice in many parts of the 
world. There are many such drugs which share some pharma- 
cology but also have unique features. New ones are still being added, 
some with markedly different core features. With one exception— 
pimavanserin, recently approved for treatment of the psychosis of 
Parkinson's disease [78], all the atypical antipsychotic drugs directly 
affect dopamine D2 receptors, usually as antagonists, but also as par- 
tial agonists, for example aripiprazole and brexpiprazole. However, 
D2 receptor actions are part of a multi-receptor profile, which in- 
cludes actions on multiple dopamine, serotonin, and other recep- 
tors. Some atypical antipsychotic drugs (for example, clozapine, 
olanzapine, quetiapine in its immediate-release form, risperidone, 
and ziprasidone) are no longer patent-protected and are available 
in less expensive generic brands. Most of the atypical drugs will be 
off patent by 2020. Although the conversion from branded to gen- 
eric (and vice versa) forms of antipsychotic drugs is relatively easy 
in most patients, not requiring cross-titration, close monitoring is 
required during the switch, particularly with clozapine [79]. 

The relative effectiveness of atypical, compared with typical, anti- 
psychotics for the treatment of schizophrenia has been the subject 


Some domains respond favourably to antipsychotic drug treatment, though response is 
often incomplete 


Neuropsychological deficits, like negative signs and symptoms, are robustly correlated 
with functional outcome in schizophrenia 


Very difficult to treat with medication alone 


Atypical antipsychotic drugs are believed to be superior to typical antipsychotics, though 
effect sizes are only mild to moderate for the former 


of some controversy, particularly after the publication of the ini- 
tial phase results of three large studies—the Clinical Antipsychotic 
Trial of Intervention Effectiveness (CATIE) conducted in the United 
States [80], the Cost Utility of the Latest Antipsychotic drugs in 
Schizophrenia Study (CUtLASS) conducted in the UK [81] and the 
European First Episode Schizophrenia Trial (EUFEST) conducted 
at numerous clinical sites across Europe and in Israel [82]. Broadly, 
these trials had less stringent inclusion criteria than those typically 
encountered in experimental randomized trials and focused on out- 
come measures other than symptom change such as time to study 
drug discontinuation for any reason and quality of life. In general, 
these studies failed to demonstrate the superiority of atypical over 
typical antipsychotics across these effectiveness outcomes, although 
the second phases of two of these trials showed superior effective- 
ness of clozapine over comparator antipsychotics for patients con- 
sidered to have treatment-resistant schizophrenia [83, 84], and 
olanzapine was shown to have superior effectiveness to comparator 
drugs in CATIE phase 1 (CATIE-1). Although the pragmatic and 
comparative effectiveness aspects of these trials were innovative, 
their methodologies and results have been subject to criticism, 
based on dosing imbalances between individual antipsychotics in 
CATIE-1, the inclusion of sulpiride (which has atypical properties) 
and long-acting injectable antipsychotics in the ‘first-generation’ 
antipsychotic drug treatment arm (no long-acting injectable agents 
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in the ‘second-generation’ antipsychotic arm) in CUtLASS phase 1 
(CUtLASS-1), questionable sensitivity of the chosen effectiveness 
endpoints to between-drug differences in outcome, the channelling 
of patients with tardive dyskinesia away from treatment with typ- 
ical antipsychotics, and high rates of cross-contamination between 
treatment arms owing to antipsychotic drug switching (CUtLASS-1) 
[85-88]. 


Clozapine 


Clozapine was synthesized in 1959 as part of a project to discover 
antipsychotic drugs with low potential for EPS. It proved to be one of 
the most interesting and clinically important compounds ever dis- 
covered. It was labelled as being ‘atypical’ because of its ability to block 
amphetamine-induced locomotor activity, one of the most widely 
accepted models for antipsychotic effects, without producing cata- 
lepsy in rodents, the leading model for causation of EPS in humans. 
Subsequent clinical studies showed it to have the lowest EPS liability 
of any antipsychotic drug [89]. Clinical trials in the 1960s and 1970s 
demonstrated efficacy for positive symptoms [90]. In 1975, 6 years 
after its introduction in Europe, clozapine was reported to produce 
a high rate of agranulocytosis. Six deaths occurred in clozapine- 
treated patients in a geographically restricted area of Finland over 
a short period of time. Clozapine was withdrawn from general use, 
although it remained available for humanitarian use [91]. 

Clozapine was reintroduced in 1989 after it was demonstrated 
to be superior to chlorpromazine to improve positive and nega- 
tive symptoms in 300 patients who were resistant to the action of 
at least three typical antipsychotics [92]. Thirty per cent of the pa- 
tients treated with clozapine responded after 6 weeks of treatment, 
compared to 4% of the chlorpromazine-treated patients. Subsequent 
studies have shown that 40-60% of patients will respond within 
6 months of initiating clozapine treatment. Other predictors of re- 
sponse include weight gain and absence of atrophy in the prefrontal 
cortex [61]. Clozapine has been shown to uniquely reduce the risk 
of suicide in schizophrenia [63, 93]. It has also been shown to im- 
prove some aspects of cognitive function, especially verbal fluency, 
immediate and delayed verbal learning and memory, and attention 
[22-24]. N-desmethylclozapine is the major metabolite of clozapine. 
It isan M1 muscarinic receptor agonist and has been shown to have 
beneficial effects on working memory [94]. 

Because of the side effect profile of clozapine and because it has not 
been shown to be superior to other drugs for treatment-responsive 
schizophrenia [95], it is not generally used as a first-line drug. On 
the other hand, monitoring neutrophil counts for the development 
of agranulocytosis or granulocytopenia and improved methods of 
treating agranulocytosis have made it much safer to use. Clozapine 
is under-utilized in many parts of the world. All schizophrenia pa- 
tients with persistent psychotic symptoms after two adequate trials 
of other drugs should be considered for a trial of clozapine, which 
should be 6 months in duration, before concluding it is ineffective. 

Clozapine is usually given twice daily, but sometimes more than 
half of the dose or the entire dose is given at bedtime to minimize 
daytime sedation. The daily dosage is gradually titrated to the target 
range described in Table 64.4. Patients who are treatment-resistant 
usually require doses of 400 mg/day or higher, or a minimum blood 
concentration of 350 ug/L [96]. Typical or non-clozapine atypical 
antipsychotic drugs should be discontinued either before beginning 
clozapine or by eliminating them over a 1- to 2-week period as the 


dose of clozapine is increased. Typical antipsychotic drugs would 
be predicted to interfere with some of the benefits of clozapine and 
should not ordinarily be prescribed with clozapine. 

Determination of clozapine plasma levels is useful whenever pa- 
tients are not responding adequately. It may be useful to augment clo- 
zapine treatment with valproic acid or other mood stabilizers (such 
as lithium, carbamazepine, lamotrigine, or topiramate), anxiolytic 
drugs, or an antidepressant [63]. The choice of augmenting agent 
is largely driven by symptomatic considerations, or pharmacokin- 
etic interactions in the case of fluvoxamine. However, none of these 
strategies have strong empiric support. One exception may be the 
addition of sulpiride, which may result in a significant reduction in 
symptom burden when added to clozapine [97]. Electroconvulsive 
therapy (ECT) also resulted in a modest further reduction in symp- 
toms when used in conjunction with clozapine and appears to be 
well tolerated [98]. It is difficult to postulate a rationale for adding 
another atypical drug, with the exception of amisulpride, because 
of their similarity in pharmacology to clozapine, although a modest 
number of short-term studies have shown modest benefit with ad- 
junctive aripiprazole for attenuating psychotic symptoms when 
combined with clozapine [99]. It should be discontinued if side 
effects are intolerable or if there is no apparent response after a 6- 
month trial of clozapine alone and subsequent trials with augmenta- 
tion therapy. Clearly, further studies involving clozapine partial- or 
non-responders are urgently needed. It should be noted that discon- 
tinuation of clozapine can precipitate a severe relapse, even when 
clozapine is slowly tapered [100]. 


Risperidone 


Risperidone is a first-line drug for the treatment of schizophrenia 
[101]. Definitive data are lacking for its efficacy in patients who 
have treatment-resistant schizophrenia, although preliminary evi- 
dence suggests that the use of higher doses of long-acting inject- 
able risperidone [up to 100 mg intramuscularly (IM) every 2 weeks] 
may be effective [102]. Risperidone may be useful in patients who 
fail to tolerate other antipsychotic agents because of side effects not 
shared by risperidone such as anticholinergic effects. Risperidone is 
well tolerated in low doses by the elderly and has been widely used 
in the United States for the treatment of a variety of senile psych- 
oses [103, 104]. Its efficacy against haloperidol was established in 
a series of multi-centre trials which demonstrated advantages for 
risperidone in overall psychopathology in mainly chronic schizo- 
phrenic patients in an acute exacerbation at doses in the 6-8 mg/ 
day range. However, these doses have proven to be higher than is 
needed for most patients in clinical practice, possibly reflecting 
some of the problems in generalizing from controlled clinical trials. 
The doses for schizophrenia most often used in non-elderly adults 
are now 4-6 mg/day. First-episode patients may not tolerate higher 
doses (for example, above 5 mg/day), and some may respond to as 
little as 1-2 mg/day. Treatment-resistant patients have been shown 
to respond to long-acting risperidone at doses of 50 or 100 mg every 
2 weeks [105]. 

Beyond treatment of acute symptoms, risperidone is also ef- 
fective for long-term maintenance phase and relapse prevention. 
For instance, relative to haloperidol, risperidone has also been as- 
sociated with a lower risk of relapse (34% vs 60%) over a minimum 
of 12 months of treatment [106]. In another study that retrospect- 
ively compared rates of rehospitalization for patients who received 


treatment with risperidone, olanzapine, or typical antipsychotics, 
rehospitalization rates for risperidone and olanzapine were similar 
and both were significantly less than those of patients treated with 
typical antipsychotics [107]. 

Risperidone is usually initiated at low doses (for example, 1-2 mg 
daily) and is titrated into the dosage range provided in Table 64.4. 
The medication is often initiated in twice-daily dosing; however, be- 
cause its primary active metabolite 9-OH risperidone is pharmaco- 
logically equivalent to its parent drug and because it has a longer 
elimination half-life, once-daily dosing is also possible. Risperidone 
is available in soluble water and liquid forms, which may be advanta- 
geous for patients who have swallowing difficulties or require taking 
their medication in a non-pill form for other reasons, including their 
own preference. 

For patients who have a history of poor compliance leading to fre- 
quently relapsing illness, or for those who prefer it, a long-acting 
injectable form of risperidone is available (Table 64.1) for admin- 
istration every 2 weeks. Response may be expected to occur in 
the 25-50 mg (per every 2-week dose) range [108]; however, oral 
risperidone must be continued through at least the first 3 weeks of 
treatment with the long-acting injectable form before being slowly 
tapered. Supplementation with oral medication may be required 
when the dose of the long-acting drug is upwardly adjusted due to 
breakthrough psychotic symptoms. 

Risperidone has more of a tendency to produce EPS than other 
atypical antipsychotics, but this can be minimized by using the 
lowest dose which controls positive symptoms and adding an 
anticholinergic drug, if necessary [109]. Among atypical drugs, 
risperidone and paliperidone produce the greatest increases in 
serum prolactin, particularly in women [110]. Elevations in pro- 
lactin levels as a result of treatment with risperidone do not always 
translate into clinical symptoms such as sexual dysfunction or gy- 
naecomastia in men and menstrual changes and breast discharge in 
women [110]. 

A meta-analysis of six studies comparing risperidone to typical 
antipsychotic treatment indicated risperidone was superior for 
treating negative symptoms [111]. Risperidone has also been shown 
to improve cognition in schizophrenia, more so than typical anti- 
psychotic drugs [23]. Improvement in working memory was greater 
than that of other domains. 

In summary, risperidone is a first-line pharmacological treatment 
of schizophrenia. It may have advantages over typical drugs with re- 
gard to negative symptoms, cognition, and EPS, but it does produce 
dose-dependent increases in EPS risk and large increases in serum 
prolactin levels, even compared to typical antipsychotic drugs. The 
long-acting injectable form of risperidone is preferred in patients 
with problems with oral drug compliance. Risperidone in its long- 
acting injectable form has also been shown to be effective for the 
prevention of relapse or hospitalization during bipolar maintenance 
treatment, although it may be more effective for preventing manic or 
mixed episodes than depressive episodes [112]. 


Olanzapine 


Olanzapine is also a first-line treatment for schizophrenia [77]. Some 
treatment-resistant patients respond to olanzapine at doses in the 
20-40 mg/day range, but the side effect burden may be treatment- 
limiting for some patients, especially weight gain and related meta- 
bolic effects [113]. 
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The efficacy of olanzapine in treating psychosis and negative 
symptoms in patients with an acute exacerbation of schizophrenia 
has been established in multiple large-scale, multi-centre trials 
[114]. In these trials, olanzapine at doses of 10-20 mg/day has been 
superior to placebo and equivalent or superior to haloperidol in 
some measures of total psychopathology and positive or negative 
symptoms. For example, in one North American multi-centre trial, 
olanzapine (15 + 5 mg/day) was superior to haloperidol (15 + 5 mg/ 
day) in the treatment of negative symptoms [115]. Olanzapine im- 
proves primary negative symptoms, rather than secondary negative 
symptoms [116]. 

Olanzapine has also been found to be effective as a maintenance 
treatment of schizophrenia [117]. The estimated relapse rates, de- 
fined as the need for hospitalization, during a 1-year period in three 
studies of patients receiving olanzapine for maintenance treatment 
were 19.6-28.6% [117]. 

Olanzapine has been reported to improve cognitive dysfunction 
[23, 24] in patients with schizophrenia or schizoaffective disorder. 
Pharmacoeconomic studies and investigations of medication effects 
on quality of life measures indicate that olanzapine has a beneficial 
cost-outcome profile. For instance, in one investigation, the higher 
cost of olanzapine, relative to haloperidol, was offset by olanzapine 
treatment-associated reductions in rehospitalization and overall 
treatment costs [118]. 

The average clinical dose of olanzapine is 12.5-20 mg/day, but 
many patients respond to lower doses (for example, 10 mg daily) 
[119]. A principle advantage of olanzapine is its once-daily dosing 
and the feasibility of starting the medication at a dose that is clin- 
ically effective for most patients. For acute situations where rapid 
control of agitation, hostility, or other dangerous behaviours is re- 
quired, olanzapine is available as a short-acting injectable medica- 
tion (Table 64.1) [120]. For patients with frequent relapses owing to 
poor adherence to oral medication, a long-acting injectable form of 
olanzapine is available. A small number of patients experienced ex- 
cessive sedation, fatigue, dizziness, or delirium owing to inadvertent 
intravascular injection of long-acting injectable olanzapine [120]. 
Olanzapine long-acting injectable must therefore be administered 
by trained individuals, along with close observation and additional 
safety measures (such as avoidance of driving) that address the risk 
of developing post-injection syndrome symptoms. 

In summary, olanzapine is an effective atypical drug with specific 
advantages in terms of its once-a-day administration, low risk for 
EPS, and efficacy for cognitive dysfunction and negative symptoms. 
Clinically significant weight gain and related dysmetabolic adverse 
effects may be a problem for some patients, as will be discussed. 
Olanzapine has been shown to effectively reduce manic symptoms 
in adults with bipolar I disorder [21] and to effectively reduce de- 
pressive symptoms when used as a pharmacological adjunct for the 
treatment of refractory unipolar major depression and acute depres- 
sive episodes in patients with bipolar I disorder [121, 122]. 


Quetiapine 


Quetiapine has been shown to be as effective as typical antipsychotics 
in the acute treatment of schizophrenia, with low EPS risk and no ef- 
fect on serum prolactin levels [123, 124]. Quetiapine and clozapine 
both appear to bind more loosely to striatal D2 receptors than other 
antipsychotic drugs, and both drugs show antipsychotic activity 
at D2 receptor occupancies that are well below the 60% threshold 
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identified for most other antipsychotic drugs [125]. In spite of this 
similarity with clozapine, quetiapine does not appear to have efficacy 
comparable to clozapine for treatment-resistant patients. 

The efficacy of quetiapine for acute-phase schizophrenia is sup- 
ported by results from several RCTs that documented superiority 
of quetiapine, relative to placebo, across several doses, with some 
patients responding to 150 mg/day and others requiring 750 mg/ 
day [123]. For instance, in one high- (750 mg/day) vs low-dosage 
(250 mg/day) study, both dosage groups evidenced greater reduc- 
tion in positive symptoms, relative to placebo; however, the differ- 
ences were significant only for the high-dose group [124]. In another 
study that assessed multiple fixed doses of quetiapine (75-750 mg/ 
day), compared with haloperidol and placebo, significant differ- 
ences in improvement over placebo for quetiapine were observed in 
the dosage range of 150-750 mg/day [123]. 

Quetiapine'’s effect on negative symptoms continue to be investi- 
gated. One placebo-controlled comparison documented improve- 
ments in negative symptoms with quetiapine treatment across 
a wide range of doses, with the greatest improvement reported at 
300 mg daily [123]. In the high- vs low-dose study reviewed previ- 
ously, the high-dose group also experienced greater improvement in 
negative symptoms, relative to placebo [124]. Like risperidone and 
olanzapine, quetiapine appears to improve depressive symptoms 
and certain cognitive deficits [23, 24] associated with schizophrenia 
or schizoaffective disorder. The improvements in cognition with 
quetiapine appear to be superior to those of haloperidol [126]. 

These results suggest that overall the greatest improvement in posi- 
tive and negative symptoms may occur when quetiapine is used at 
the higher end of its dosage range. The average clinical dose appears 
to be between 300 and 500 mg/day, usually given twice daily, though 
some benefit from the medication when given only once daily. The 
effects of using higher doses for patients who do not respond ad- 
equately to these doses are uncertain. A titration of the dosage is 
required after initiating the medication. From the viewpoint of EPS 
and hyperprolactinaemic effects, quetiapine appears to confer only 
low risk. As such, like clozapine, it appears to be well tolerated, even 
among patients with idiopathic Parkinson's disease [127]. Sedation 
may be a limiting side effect for some, especially during dosage ti- 
tration. Weight-related, metabolic, and other adverse effects will be 
discussed in greater detail. 

Quetiapine is one of the best studied pharmacotherapies for bi- 
polar disorders. In addition to its effectiveness for reducing the acute 
symptoms of mania [128], quetiapine is effective for the acute treat- 
ment of depressive episodes in patients with bipolar I and II dis- 
orders [129, 130]. Quetiapine has also been shown to be effective as 
a bipolar maintenance treatment [131]. 

In summary, quetiapine also appears to be effective for a wide 
range of schizophrenia-associated symptoms and confers a lower 
level of risk in terms of antidopaminergic adverse effects. The 
dosage range of this medication may be quite wide, though pa- 
tients may have a greater chance of benefiting from the medication 
at the higher end of this range. Quetiapine is a first-line option for 
treating acute manic episodes in patients with bipolar I disorder, 
and depressive episodes in patients with bipolar I or II disorder. 
Quetiapine has also been shown to reduce depressive symptoms 
in adults with inadequate response to some antidepressants such 
as selective serotonin reuptake inhibitors (for example, fluoxetine, 


etc.) and serotonin-noradrenaline reuptake inhibitors (for example, 
venlafaxine, etc.) [132]. 


Ziprasidone 


Ziprasidone has a varied receptor occupancy profile. Like most 
atypical antipsychotic drugs, it displays high-affinity 5-HT2A 
binding, coupled with relatively lower-affinity D2 receptor binding. 
Ziprasidone is also a 5-HT1A agonist, as well as both a serotonin 
and a noradrenaline reuptake pump inhibitor [133]. This pro- 
file predicts a wide range of pharmacological activity against core 
psychotic symptoms and negative and affective symptoms, as well 
as neurocognition. 

Ziprasidone, like quetiapine, has been shown to be superior to 
placebo for the reduction of total psychopathology and positive and 
negative symptoms in patients with schizophrenia [134, 135]. There 
is limited evidence to suggest superiority over typical antipsychotics 
with regard to improvement in positive and negative symptoms 
[134, 136]. Studies of multiple fixed doses of ziprasidone vs halo- 
peridol at conventional doses indicate that ziprasidone yields similar 
efficacy to haloperidol for reducing positive symptoms and global 
psychopathology at a dose of 160 mg/day [136]. 

Ziprasidone significantly improved negative symptoms and re- 
duced the risk of relapse, compared to placebo, in a 1-year main- 
tenance study in stable hospitalized chronic schizophrenic patients 
[137]. These maintenance-phase effects were not dependent on 
the daily dose of ziprasidone. In a 28-week comparison with 
haloperidol, the two groups evidenced similar overall effects for 
positive symptoms; however, between-group differences were docu- 
mented, favouring ziprasidone for negative symptoms and EPS 
[138]. Ziprasidone was effective against depressive symptoms as- 
sociated with schizophrenia in one study at a dose of 160 mg/day 
[139]. Significant improvements in multiple cognitive domains have 
been reported among ziprasidone-treated patients in a variety of 
treatment contexts [140]. Such changes appear to be unrelated to 
improvements in other symptoms of schizophrenia. Ziprasidone 
treatment has been associated with significant improvement in 
quality of life measures in one post hoc data analysis [141]. Further 
investigation of the effect of ziprasidone on health-related quality of 
life and similar outcomes are warranted. Ziprasidone treatment of 
schizophrenia appears to be cost-effective, relative to no treatment 
[142]. Further cost-benefit studies are needed. 

The dose range of ziprasidone for acute treatment appears to be 
between 80 mg/day and 160 mg/day, and higher doses within this 
range may be more effective (Table 64.4). Doses greater than 120 
mg/day appear to be required to achieve approx. 60% dopamine D2 
receptor blockade [143], the D2 receptor occupancy threshold that 
appears to coincide with efficacy against positive symptoms, as pre- 
sented earlier. The medication is usually given twice daily, although 
some may take the medication once daily at night-time. Titration 
of the total daily dose into the recommended range is required 
after initiating the medication. One critical aspect of medication 
administration for ziprasidone is the requirement that the medica- 
tion be taken with food. There appear to be profound differences in 
bioavailability at equivalent doses between the fed and unfed state 
[144]. A full meal, as opposed to a light snack, appears to be re- 
quired. Therefore, patients are encouraged to take their medication 
with meals. 


A short-acting intramuscular formulation of ziprasidone has 
been developed, which should be useful in situations where more 
rapid action is needed. This formulation is available in two doses (10 
mg and 20 mg), the preferred dosage being 20 mg due to a signifi- 
cantly greater reduction in agitation, relative to the lower dose [145]. 
Use of the short-acting injectable form can facilitate a transition to 
oral medication and may reduce the time required to titrate the daily 
dose of ziprasidone to one that is likely to be effective. 

Ziprasidone appears to be well tolerated. Treatment-emergent 
EPS burden is low [135, 136]. Initial problems with somnolence or 
behavioural activation are usually self-limited, although temporary 
use of clonazepam or other benzodiazepine at low doses may im- 
prove tolerability, especially during the titration phase, should the 
latter occur. Importantly, data from both short- and long-term 
studies indicated that ziprasidone is not associated with clinically 
significant changes in weight, glycaemic measures, or markers of 
lipid homeostasis [146]. 

Ziprasidone can result in partial blockade of the slow potassium 
rectifier current in the cardiac conduction system, which may re- 
sult in prolongation of the QTc interval [147]. On the other hand, 
ziprasidone was not shown to be associated with an elevated risk 
of non-suicidal mortality, relative to olanzapine, in a large simple 
trial of over 18,000 patients with schizophrenia [148]. Under rou- 
tine circumstances, screening electrocardiograms are not required. 
Nevertheless, caution may be warranted for individuals who are at 
risk for significant prolongation of the QTc interval, including pa- 
tients who take medications other than ziprasidone that prolong the 
QTc interval. Concomitant use of CYP450 3A4 inhibitors does not 
appear to pose a significant risk [149]. 

In summary, ziprasidone appears to be a useful additional atyp- 
ical antipsychotic agent because of its favourable side effect profile, 
including no weight gain—a major problem with olanzapine and 
clozapine—and no prolactin elevation, which is a less serious side 
effect of risperidone. Patients should be instructed to take the medi- 
cation with food. Ziprasidone treatment may result in an increase 
in the QTc interval; however, in a great majority of cases, this is not 
clinically significant. Although it is effective for treating acute manic 
episodes in patients with bipolar I disorder [128], it has not been 
shown to be clearly effective for bipolar depression when used as an 
adjunct to mood stabilizers [150]. 


Aripiprazole 


Aripiprazole is pharmacologically distinct from the drugs reviewed 
previously in that it combines partial D2 receptor agonism with 
high-potency 5-HT2A antagonism. As a partial D2 receptor agonist 
with low intrinsic activity, it will generally reduce dopamine D2 re- 
ceptor stimulation [151]. Conversely, it should act primarily as an 
agonist when endogenous dopamine is limited, as may be the case 
in the prefrontal cortex in patients with schizophrenia [30]. For 
this reason, aripiprazole is sometimes referred to as a ‘dopamine 
stabilizer? Aripiprazole also functions as a potent 5-HT1A partial 
agonist [152]. 

The efficacy of aripiprazole in the treatment of acute schizophrenia 
at doses ranging between 10 mg and 30 mg (taken once daily) was 
established on the basis of four short-term randomized controlled 
studies [153]. Relative to placebo, efficacy against negative symptoms 
was also demonstrated [154]. Long-term superiority of aripiprazole 
(vs placebo) for relapse over 26 weeks [155] and medication 
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compliance and symptom response (vs haloperidol) for up to 52 
weeks has also been established [156]. One study reported on the 
effectiveness of flexibly dosed aripiprazole (15-30 mg daily) among 
patients with schizophrenia with a history of resistance to treatment 
with olanzapine or risperidone [157]. Aripiprazole is an effective 
maintenance treatment with a favourable side effect profile, particu- 
larly for weight gain [158, 159]. However, it does cause akathisia and 
some nausea, particularly early in its usage. Aripiprazole has been 
reported to have beneficial effects on neurocognitive performance, 
but the data are rather limited [160]. 

Treatment with aripiprazole is usually initiated with 10-15 mg 
daily, although some patients may not be able to tolerate these 
doses due to agitation, nausea, or vomiting. Doses as high as 30 mg 
are sometimes indicated. An oral solution form is also available. 
Aripiprazole is also available in a water-soluble, as well as an acute 
intramuscular, form. The acute injectable form appears to be ef- 
fective in the dosage range of 5.25-15 mg [161]. The recommended 
dose is 9.75 mg. 

Several long-acting injectable forms of aripiprazole are available for 
patients who prefer or require long-acting injectable medications [162]. 
The first is aripiprazole monohydrate, a once-monthly injectable form 
of aripiprazole. The recommended starting and monthly maintenance 
dose of once-monthly long-acting injectable aripiprazole is 400 mg in 
the absence of interacting medications [163]. Oral aripiprazole (10-20 
mg daily) must be taken for at least the first 14 days of long-acting 
aripiprazole initiation. More recently, aripiprazole lauroxil, a prodrug 
of aripiprazole, has also become available as a once-monthly injectable 
medication [164]. The recommended starting and monthly mainten- 
ance doses are 441 mg, 662 mg, or 882 mg, which correspond roughly 
to oral aripiprazole daily doses of 10 mg, 15 mg, and 20 mg or more. 
The highest dose (882 mg) can be administered once every 6 weeks via 
gluteal injection. Oral aripiprazole must be taken for at least the first 
21 days of aripiprazole lauroxil. 

For mood disorders, aripiprazole has been shown to be effective 
as an adjunctive treatment in adults with major depressive disorder 
who responded poorly to antidepressants [165]. Aripiprazole has 
been shown to effectively reduce manic symptoms in adults with bi- 
polar I disorder—both alone and as an adjunct to mood stabilizers 
[166]. However, aripiprazole has not demonstrated clear benefit for 
treating bipolar depression [167]. 

Aripiprazole is generally well tolerated, with an adverse effect pro- 
file similar to placebo in short-term studies involving patients with 
acute schizophrenia and in longer-term studies of chronic, stable 
patients [153]. As is the case with all atypical antipsychotic drugs, 
the EPS burden is lower than that of typical antipsychotics. In sum- 
mary, aripiprazole appears to be effective as an acute and long-term 
maintenance treatment for schizophrenia and related psychotic dis- 
orders at recommended doses, though some patients may require 
higher doses. Aripiprazole was initiated in most studies at doses of 
10-15 mg once daily; however, some patients may require a slower 
titration following a lower starting dose. This medication is avail- 
able in many forms, all of which appear to be very well tolerated. 
Aripiprazole is effective as an adjunctive treatment for patients with 
major depression who respond suboptimally to antidepressants and 
as monotherapy or adjunctive treatment for acute manic episodes 
in patients with bipolar I disorder. Important benefits from a toler- 
ability viewpoint include very low rates of prolactin elevation and a 
low risk of weight gain and metabolic adverse effects. 
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Paliperidone 


Paliperidone is the 9-OH metabolite of risperidone, which has a 
longer elimination half-life than the parent compound, as reviewed 
previously. Paliperidone, which is pharmacologically similar with 
regard to receptor occupancy profile to risperidone, is available 
commercially in an extended-release oral form and in two long- 
acting injectable forms. 

The short-term efficacy of oral paliperidone has been established 
on the basis of multiple randomized, placebo-controlled studies 
[168]. In these studies, all doses of paliperidone were superior to 
placebo for reducing global psychopathology and positive symp- 
toms, as well as negative symptoms, anxious/depressive symptoms 
associated with schizophrenia, and hostility/excitement. The recom- 
mended starting dose of oral paliperidone is 6 mg, given once daily. 
Doses may be upwardly adjusted at 3 mg/day increments, up to 12 
mg daily. Elevations in prolactin levels are like those observed with 
risperidone treatment. 

Paliperidone is available in once-monthly and 3-monthly long- 
acting injectable formulations. Long-acting injectable paliperidone 
has been shown to be effective for maintenance treatment in patients 
with schizophrenia [169, 170]. The paliperidone once-monthly long- 
acting injectable formulation is initiated at 234 mg (paliperidone 
palmitate salt) IM (deltoid muscle) on day 1, followed by 156 mg IM 
(deltoid muscle) 7 days later. The typical maintenance dose is 117 mg 
IM (deltoid or gluteal muscle) once monthly, with a range of 39-234 
mg IM 1-monthly, depending on effectiveness and tolerability. 

In summary, paliperidone appears to be safe and effective for 
both short- and long-term treatment of schizophrenia. The EPS 
and prolactinaemic adverse effect burden may resemble that of 
risperidone, but this notion requires prospective investigation. Oral 
paliperidone can be started at a clinically effective dose. Long- acting 
injectable forms are also currently available. 


Amisulpride 


The efficacy of amisulpride for the treatment of positive symptoms 
has been established over a wide dosage range (200-1200 mg daily) in 
treatment studies of up to 12 months’ duration [171]. In general, it ap- 
pears that higher doses (above 400 mg/day) are effective for treating 
patients with predominantly positive symptoms, although efficacy 
against negative symptoms has also been demonstrated [172, 173]. 
Low-dose amisulpride (approx. 300 mg/day) has been reported to 
be effective in treating negative symptoms in schizophrenics with 
predominantly negative symptoms [174, 175]. Amisulpride may im- 
prove primary negative symptoms [174, 175]. Amisulpride produces 
minimal EPS but may result in increased prolactin levels [176]. There 
are some data which suggest it can improve depressive symptoms in 
schizophrenia [177]. There are no data on its efficacy in treatment- 
resistant patients. Because its pharmacology is quite distinct from 
that of the 5-HT2A-based receptor antagonists previously discussed, 
amisulpride may be useful in patients who fail to tolerate that class of 
drugs. Amisulpride has also been demonstrated as being superior to 
haloperidol on quality of life measures and global functioning [178] 
and to significantly improve quality of life over 12 months of treat- 
ment [179] in patients with schizophrenia. Sulpiride is closely related 
to amisulpride and is widely used in the UK and some other regions 
of the world, but it is not licensed in the United States. Information 


about its typical and atypical properties are not as clear as for 
amisulpride and will not be discussed here. 


Lurasidone 


Lurasidone, like most other atypical antipsychotic drugs, exhibits 
potent 5-HT2A blockade, coupled with D2 receptor blockade [39]. 
In addition, lurasidone is a potent 5-HT7 receptor antagonist and 
a 5-HT1A partial agonist, with moderate affinity for a1-adrenergic 
receptors and negligible binding activity at other monoamine recep- 
tors and at dopamine and serotonin transporters [10, 39]. Lurasidone 
is metabolized almost entirely by CYP3A4 and has a mean elimin- 
ation half-life of 18 hours. Similar to ziprasidone, lurasidone must 
be taken with food in order to ensure adequate exposure to the 
medication [10]. 

The short-term efficacy of lurasidone for treating schizophrenia 
is established on the basis of multiple acute-phase randomized, 
placebo-controlled studies [180-183]. In these studies, lurasidone 
was superior to placebo for reducing global psychopathology, posi- 
tive symptoms, and negative symptoms in the 40-160 mg dose 
range. The efficacy of lurasidone (40-80 mg/day) for the mainten- 
ance treatment of schizophrenia was recently demonstrated in a 
randomized, placebo-controlled withdrawal study [184]. During 
the double-blind maintenance phase of the study, lurasidone signifi- 
cantly delayed time to relapse, as compared with placebo. Lurasidone 
is associated with mild dose-dependent akathisia, but a low risk of 
weight gain and associated dysmetabolic effects [185]. 

The recommended starting dose of lurasidone is 40 mg, given 
once daily. The typical effective dose range is 40-80 mg daily. The 
maximum recommended dose is 160 mg daily with meals. 

Lurasidone is also indicated for treatment of depressive epi- 
sodes in patients with bipolar I disorder. The short-term efficacy of 
lurasidone monotherapy for bipolar I depression was demonstrated 
in a 6-week randomized, placebo-controlled trial [186]. Lurasidone 
at both 20-60 mg/day and 80-120 mg/day showed significantly 
greater improvement in depressive symptoms than placebo. In a 
second 6-week randomized trial, lurasidone was superior to placebo 
as adjuncts to lithium or valproate in adults with bipolar I depression 
[187]. Lurasidone was also shown to improve depressive symptoms 
and overall illness severity in adults with major depressive disorder 
and subthreshold hypomanic symptoms in short-term (6 week) ran- 
domized, placebo-controlled trials [188, 189]. 

In summary, lurasidone appears to be efficacious and safe for the 
short- and long-term treatment of schizophrenia. All doses must 
be taken with food to ensure adequate exposure to the medica- 
tion. Its side effect profile appears to be favourable, compared with 
many other atypical antipsychotic drugs. Lurasidone is effective for 
treating acute depressive episodes in patients with bipolar I disorder, 
both as monotherapy and as an adjunct to lithium or valproate. 


Asenapine 


Asenapine is also a potent antagonist at dopamine D2 and sero- 
tonin 5-HT2A receptors [10]. Asenapine also binds with relatively 
high potency to serotonin 5-HT2C, 5-HT6, 5-HT’7, histamine H1, 
and al-noradrenergic receptors where it acts as an antagonist, but 
has negligible affinity for muscarinic M1 receptors [10, 12, 190]. 
Asenapine also binds with high affinity to 5-HT1A receptors where 
it acts as a partial agonist [191]. 


The efficacy of asenapine for acute-phase treatment of schizo- 
phrenia was demonstrated in two randomized, double-blind, 
placebo-controlled trials [192, 193]. In these studies, asenapine at 
doses of 5-10 mg twice daily demonstrated superiority over placebo 
for reducing psychotic and negative symptoms. The longer-term ef- 
ficacy of asenapine in adults with schizophrenia was investigated in 
two published randomized studies [194, 195], including one relapse 
prevention study [196]. In the relapse prevention study, patients who 
were clinically stable at the end of an open-label pre-randomization 
phase were randomized to double-blind continuation treatment 
with sublingual asenapine 10 mg twice daily or placebo for an add- 
itional 26 weeks. During double-blind treatment, relapse rates were 
significantly lower and mean time to relapse was significantly longer 
with asenapine than placebo. 

Asenapine has also been shown to be an effective treatment for 
manic and mixed episodes in adults with bipolar I disorder on the 
basis of four randomized trials [196]. Efficacy for this indication has 
been demonstrated at both the 5 mg and 10 mg twice-daily doses in 
adults [197]. 

Asenapine is available as a rapidly dissolving tablet formulated for 
absorption via the buccal mucosa [10]. Swallowing asenapine tablets 
renders the drug ineffective. For adults, asenapine is initiated at a 
dose of 5 mg sublingually twice daily. The recommended dose range 
across indications in adults is 5-10 mg twice daily. 


lloperidone 


Iloperidone exhibits high affinity for dopamine D2 and D3 and 
serotonin 5-HT2A receptors where it acts as an antagonist [8, 10]. 
Iloperidone binds with moderate affinity to dopamine D4, serotonin 
5-HT6 and 5-HT7, and al-noradrenergic receptors, and with low 
affinity to dopamine D1, serotonin 5-HT1A, and histamine H1 re- 
ceptors [10]. Iloperidone has no significant affinity for muscarinic 
cholinergic M1 receptors, but its antagonism of a1-noradrenergic re- 
ceptors leads to orthostatic hypotension [10]. Thus, even though the 
clinically effective dose range of iloperidone for treating adults with 
schizophrenia is 6-12 mg twice daily, the recommended starting 
dose is low (1 mg twice daily). Dose titration is required in order to 
lower the risk of treatment-emergent orthostatic hypotension [10]. 

The 6-12 mg twice-daily dose range of iloperidone has been 
shown to be effective for treating acute symptoms of schizophrenia 
[198-200]. The long-term safety and efficacy of iloperidone were es- 
tablished in three 52-week, randomized, double-blind, multi-centre 
trials, and pooled data from these studies have been published [201]. 
Iloperidone is available as solid oral tablets. As indicated earlier, 
iloperidone must be started at a low dose (1 mg twice daily) and 
slowly increased to a target dose of 6-12 mg twice daily over several 
days in order to reduce the risk of treatment-emergent orthostatic 
hypotension. 


Brexpiprazole 


Brexpiprazole is a partial agonist at 5-HT1A and D2 receptors and 
an antagonist at 5-HT2A and a1B/2C-adrenergic receptors [202]. 
It acts as a moderate antagonist at serotonin 5HT7 and 5HT2C re- 
ceptors and histamine H1 receptors, and has only negligible binding 
activity at muscarinic M1 receptors [15]. Brexpiprazole thus shares 
with aripiprazole high-affinity partial agonist effects at D2 and 5- 
HTI1A receptors, although brexpiprazole has higher intrinsic ac- 
tivity at 5-HT1A receptors. 
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The efficacy of brexpiprazole for acute treatment of schizophrenia 
in adults is supported by two 6-week randomized trials [203, 204]. 
In one study, only the 4 mg brexpiprazole dose group (not the 1 or 2 
mg dose groups) was significantly more effective than placebo [204]. 
Brexpiprazole was associated with significantly lower relapse rates 
than placebo (14% vs 39%) during a 12-month maintenance period 
in one study [205]. 

The most common and dose-dependent side effects of 
brexpiprazole include nausea, akathisia, headache, and modest 
weight gain. Brexpiprazole was associated with modest and dose- 
dependent increases in prolactin levels. No significant differences 
with placebo in the risk of adverse changes in glucose or lipid pro- 
files were observed in short-term trials. 

The results of randomized short-term trials also support the ef- 
ficacy of brexpiprazole as an adjunct to antidepressants for major 
depressive disorder [206-208]. These studies enrolled patients with 
major depression and inadequate response to antidepressants who 
then failed to respond to a prospective trial of conventional anti- 
depressant plus placebo, before being randomized to adjunctive 
brexpiprazole or placebo for 6 more weeks. In each of these studies, 
adjunctive brexpiprazole (1.5-3 mg) resulted in significantly greater 
reduction in depressive symptoms than placebo. 

The recommended starting dose for schizophrenia is 1 mg/day. 
After 7 days, the dose should be increased to 2 mg/day, with fur- 
ther adjustment based on clinical response and tolerability, up to 
a maximum daily dose of 4 mg/day. For major depression, the re- 
commended starting dose is 0.5 mg/day, followed by weekly dose 
titration up to 1.0 mg/day, and then to an initial target dose of 2 mg/ 
day. The maximum recommended dose for adjunctive treatment of 
major depressive disorder is 3 mg/day. 


Cariprazine 


Cariprazine is a partial agonist at dopamine D3 receptors and a 
weaker partial agonist at D2 and serotonin 5-HT1A receptors. 
Its mechanism of action is therefore distinct from that of most 
other atypical antipsychotics, so it may be a reasonable alterna- 
tive treatment when other drugs fail and the patient is not clearly 
treatment-refractory. Caripirazine is a weak antagonist at 5-HT2A, 
al-noradrenergic, and histamine H1 receptors [16, 209]. Efficacy 
for acute treatment of schizophrenia in adults has been demon- 
strated in multiple randomized trials across a wide range of doses 
(1.5-9.0 mg/day) [210-212]. Cariprazine has been shown to signifi- 
cantly reduce relapse rates in longer-term studies of patients with 
schizophrenia [213]. 

The efficacy of cariprazine has also been demonstrated for acute 
manic or mixed episodes in patients with bipolar I disorder. In these 
studies, cariprazine demonstrated significantly greater reduction 
in manic symptoms than placebo across a variety of doses, ran- 
ging from 3 mg/day to 12 mg/day [214-216] after 3 weeks of treat- 
ment. One 8-week randomized trial also demonstrated significantly 
greater improvement in depressive symptoms at a daily dose of 1.5 
mg than placebo in adults with bipolar I depression [217]. 

In conclusion, cariprazine has demonstrated efficacy in multiple 
randomized trials for treating acutely exacerbated schizophrenia 
and acute manic or mixed episodes in patients with bipolar I dis- 
order. For acute schizophrenia, higher cariprazine doses may be 
more effective than lower doses. Cariprazine has shown promise 
as a treatment for acute bipolar I depression and as an adjunct to 
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conventional antidepressants for patients with major depressive dis- 
order who are responding poorly to treatment with antidepressants 
alone. Additional studies of cariprazine for bipolar depression and 
poorly responding major depression are needed to reach definitive 
conclusions about its role in treating each of these conditions. 


Pharmacokinetics, metabolism, and drug interactions 
Typical antipsychotic drugs 


Typical antipsychotics are well absorbed when administered orally 
or parenterally. IM injection leads to more rapid and higher plasma 
levels. Peak plasma levels are reached in 30 minutes after IM injec- 
tion and 1-4 hours after oral injection. Steady state is achieved in 
3-5 days. The half-life for elimination is in the range of 10-30 hours. 
Metabolism of typical and atypical drugs occurs mainly in the liver, 
via the hepatic cytochrome (CYP)450 enzymes, particularly the 
2D6 and 3A4 sub-families for most drugs. Dosing of typical medica- 
tions are determined by clinical effects, and less by pharmacokinetic 
factors. 

Pharmacokinetic drug-drug interactions at the level of pro- 
tein binding are expected to be minimal, even though most typ- 
ical antipsychotics are tightly bound to plasma proteins. Even so, 
appropriate therapeutic monitoring of drugs that are also tightly 
bound to plasma proteins but have a narrow therapeutic index (for 
example, warfarin, digoxin, phenytoin) when used in conjunction 
with typical antipsychotics is warranted. Interactions at the level 
of the CYP450 system are also thought to be minimal for most 
agents. Because smoking is so common among patients with schizo- 
phrenia and because smoking can be associated with potent induc- 
tion of CYP450 1A2 isoenzymatic activity, dosage adjustments may 
be needed for selected antipsychotic drugs during any changes in 
smoking status. Other combinations with typical antipsychotics 
may be worth avoiding for other reasons such as increased central 
nervous system effects (for example, anxiolytics, other central ner- 
vous system depressants, anticholinergics, certain antihypertensive 
drugs), increased EPS (for example, metoclopramide, D2-blocking 
anti-nausea drugs, caffeine), impaired cardiac conduction (certain 
drugs combined with typical antipsychotics known to prolong the 
QTc interval), and neurotoxicity (lithium), especially among indi- 
viduals who are more advanced in age. 


Atypical antipsychotic drugs 
Clozapine 


There are wide variations in the pharmacokinetics of clozapine in 
patients. The average half-life is 6-12 hours. Plasma concentra- 
tions are higher in Chinese patients than in Caucasian patients, in 
non-smokers than smokers, and in females than males. The me- 
tabolism of clozapine occurs mainly in the liver. Clozapine’s chief 
metabolite is N-desmethylclozapine, which has some biological 
activity. Clozapine is metabolized by CYP1A2, and several poten- 
tial drug-drug interactions are thus possible. When agents that 
induce CYP1A2 are prescribed or ingested, close monitoring of 
patients for a worsening of symptoms is warranted. Plasma levels 
of clozapine of approximately 350 ng/mL are more often associ- 
ated with good response than lower levels [218] and should be 
checked in such cases. Upward adjustment of the clozapine daily 
dosage will typically correct the problem. On the other hand, if a 
CYP1A2 inducer is discontinued (for example, sudden abstinence 


from cigarette smoking, etc.) or a potent inhibitor is added, this 
may result in a rise in clozapine concentration and an increase 
in adverse effect risk [219]. Caution may also be warranted for 
drugs that are potent inhibitors of CYP2C19 and CYP3A4 [220]. 
In addition, caution is warranted when considering concomi- 
tant use of drugs which can also cause bone marrow suppression 
(for example, carbamazepine) or precipitously drop the seizure 
threshold. 


Risperidone 


Risperidone is well absorbed from the gut and is extensively metab- 
olized in the liver by CYP2D6 to 9-hydroxyrisperidone in approxi- 
mately 92-94% of Caucasians [221]. Thus, 9-OH risperidone is an 
active species in the majority of patients. About 6-8% of Caucasians 
and a small proportion of Asians have a polymorphism of the 
CYP2D6 gene, which leads to poor metabolism of risperidone. For 
poor metabolizers of risperidone, the active moiety is mainly the 
parent compound. The half-life of the 9-OH metabolite is about 21 
hours, whereas the half-life of risperidone is about 3 hours. Thus, 
risperidone can be used on a once-a-day schedule for normal met- 
abolizers, whereas multiple doses are needed for those who are 
poor metabolizers. Risperidone should be titrated from 2 to 5 mg/ 
day over at least a 3-day period to minimize hypotensive and neuro- 
muscular side effects. Drugs known to induce or inhibit CYP2D6 
and 3A4 may alter plasma levels of risperidone; thus, close moni- 
toring is advised when such agents are added to ongoing risperidone 
treatment. 


Olanzapine 


Oral olanzapine has a half-life of 24-30 hours, which indicates 
that single daily administration is adequate [222]. The metabolic 
pathways of olanzapine involves CYP2D6, CYP1A2, and flavin- 
containing mono-oxygenases, as well as N-glucuronidation. It has 
a low potential for drug-drug interactions and requires extremely 
high concentrations, not likely to be achieved under clinical con- 
ditions, to inhibit CYP450 systems. Plasma levels of approximately 
9.3 mg/mL have been reported to predict better clinical response 
to olanzapine in inpatients with an acute exacerbation [223]. 
Olanzapine is detectable in plasma immediately following the 
first injection of its long-acting injectable formulation; therefore, 
oral olanzapine is not required during the initiation of olanzapine 
long-acting injection therapy. Drugs that are known inducers 
or inhibitors of CYP1A2 may significantly affect plasma levels of 
olanzapine and alter its clinical effects at a given dosage; thus, ac- 
tive monitoring of symptoms and adverse effects is indicated if such 
agents are added. As is the case with clozapine, gender and smoking 
status may influence olanzapine levels, leading to adjustment in 
dosage [224]. 


Quetiapine 


Quetiapine is well absorbed and is approximately 83% protein- 
bound [223]. Quetiapine is better absorbed after eating [225]. It 
has a half-life of 6 hours and is metabolized in the liver by CYP3A4 
to inactive metabolites. Quetiapine has significant interactions 
with inducers and inhibitors of CYP3A4. Co-administration with 
these agents may require dosage adjustment. Once-daily dosing, 
a common dosing strategy for quetiapine, is also supported in the 
literature [226]. 


Ziprasidone 


Ziprasidone has a half-life of 4-10 hours. Twice-daily administra- 
tion is possible despite this relatively short half-life. Ziprasidone 
should always be taken after eating in order to facilitate absorption. 
About two-thirds of ziprasidone is metabolized by aldehyde oxidase 
into inactive metabolites. The remainder is metabolized by CYP3A4 
and CYP1A2 into inactive metabolites. At the current time, there 
are no known drug interactions with ziprasidone at the level of al- 
dehyde oxidase, since enzymatic activity does not appear to be al- 
tered by co-administered drugs. Although CYP3A4 appears to play 
only a minor role in the metabolism of ziprasidone, potent inhibitors 
or inducers of CYP3A4 may significantly alter plasma concentra- 
tions of ziprasidone [227] and may thus necessitate an adjustment 
in dosage. The use of concomitant medications that may prolong the 
QTc interval should be avoided. Ziprasidone is contraindicated for 
patients with a history of known QT prolongation, recent myocar- 
dial infarction, or uncompensated heart failure. 


Aripiprazole 


Oralaripiprazole is well absorbed from the gut and has an elimination 
half-life of 75 hours. It is metabolized primarily by CYP3A4 and 2D6 
isoenzymes into an active metabolite dehydro-aripiprazole, which 
has a half-life of 94 hours. This pharmacokinetic pattern supports 
once-daily dosing. Aripiprazole is also available in a long-acting in- 
jectable form. The absorption of long-acting injectable aripiprazole 
following IM injection is slow; therefore, oral aripiprazole must be 
co-administered during the first 2 weeks of long-acting injectable 
aripiprazole therapy. The mean elimination half-life of aripiprazole 
is dose-proportional (approximately 30 days following 300-mg in- 
jections provided every 4 weeks; approximately 47 days following 
400-mg injections given every 4 weeks). Because aripiprazole is 
metabolized by CYP3A4 and 2D6, known inhibitors or inducers of 
these isoenzymes may result in increased or decreased clearance of 
aripiprazole and dehydro-aripiprazole [228]. 


Paliperidone 


Oral paliperidone is marketed in an osmotically controlled extended- 
release formulation, which results in steady release of active drug 
over a 24-hour period. Long-acting injectable paliperidone is avail- 
able as a palmitate ester. Because the active moiety paliperidone 
is detectable in plasma after the first injection, co-administration 
of oral paliperidone is not required. The elimination half-life of 
paliperidone palmitate is variable, ranging from 25 to 49 days. 
Regardless of how it is formulated, hepatic metabolism is not con- 
sidered a major route of clearance for paliperidone. Paliperidone is 
converted into metabolites that are not believed to contribute sig- 
nificantly to its overall pharmacological activity. Few significant 
drug—drug interactions at the level of the CYP450 system are there- 
fore anticipated. Even so, the plasma concentration of paliperidone 
may be altered by drug interactions at CYP3A4 [229]. 


Amisulpride 

Amisulpride has a half-life of 10-15 hours. It is well tolerated. As yet, 
there are no known drug interactions. 

Lurasidone 


Lurasidone is available as a solid oral tablet. The absorption of 
lurasidone is significantly increased when administered with food; 
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the area under the curve and Cmax are increased 2- to 3-fold, re- 
spectively, in the fed vs the fasting state [10]. Lurasidone is exten- 
sively metabolized in the liver via the CYP3A4 isoenzyme. The mean 
elimination half-life of lurasidone is 18 hours. 


Asenapine 


Asenapine is available as a rapidly dissolving tablet for sublingual or 
buccal administration. Asenapine is absorbed by the oral mucosa, 
which results in a bioavailability of approximately 35%. Swallowing 
the tablet reduces bioavailability to <2% [10]. The consumption ofli- 
quids within 10 minutes of administration of asenapine can also sig- 
nificantly reduce its bioavailability, so it is recommended that eating 
or drinking be avoided within 10 minutes of asenapine adminis- 
tration. Asenapine undergoes direct glucuronidation by UGT1A4 
and is metabolized in the liver by the CYP1A2 isoenzyme (and, to a 
lesser degree, by CYP3A4 and CYP2D6). 


lloperidone 


Iloperidone has a half-life of 12-15 hours. Its absorption is not af- 
fected by food. Iloperidone is approximately 95% protein-bound 
and is extensively metabolized in the liver via CYP2D6 and 
CYP3A4 [230]. The mean elimination half-life of iloperidone is 18 
hours for CYP2D6 extensive metabolizers, and 33 hours for slow 
metabolizers [10]. 


Brexpiprazole 


Brexpiprazole is rapidly absorbed after oral administration and is 
eliminated primarily by hepatic metabolism (CYP3A4 and 2D6). 
The mean half-life of brexpiprazole at steady-state is approximately 
91 hours. 


Cariprazine 


Cariprazine is rapidly absorbed after oral administration, and it 
is eliminated primarily by hepatic metabolism via CYP3A4 and, 
to a lesser degree, via CYP2D6. The mean elimination half-life of 
cariprazine is 2-5 days in the dose range of 1.5-12.5 mg/day. 


(See Table 64.6.) 


Typical antipsychotics 


The adverse effects that are most routinely concerning for typical 
antipsychotic drug treatment are EPS. High-potency dopamine an- 
tagonist drugs, such as haloperidol and fluphenazine, are more likely 
to produce EPS than low-potency agents such as chlorpromazine 
and thioridazine. There are a wide range of EPS produced by typ- 
ical antipsychotics, including dystonic reactions when first adminis- 
tered, akathisia during the first 2-3 weeks, parkinsonism during the 
first several weeks with variable persistence, neuroleptic malignant 
syndrome at any time point but usually in the initial weeks, and tar- 
dive dyskinesia. 

Dystonic reactions due to dopamine antagonist drugs can be 
treated with parenteral anticholinergic agents or diphenhydramine, 
an antihistamine with some anticholinergic properties. The use of 
anticholinergic and other agents to manage parkinsonism due to 
typical dopamine antagonist drugs will be discussed subsequently. 
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Table 64.6 Adverse effects of selected antipsychotic drugs 


Typical antipsychotic drugs 


Tardive Prolactin Sedation Weight gain | Orthostasis | Anticholinergic | Diabetes 
dyskinesia | elevation exacerbation 
and 
dyslipidaemia 
Chlorpromazine Mild to moderate Moderateto Moderate to high Mild to Mild to Mild to Mild to moderate Low to 
Fluphenazine (for low-potency high (risk higher for moderate (for moderate (for moderate (for high-potency moderate 
i drugs) to high (for high-potency high-potency high-potency (for high- drugs) to high 
Haloperidol high-potency“ drugs) drugs) to drugs) to potency (for low-potency 
Loxapine drugs) high (for low- high (for drugs) to drugs) 
Mecondceina potency drugs); low-potency high (for 
j lowest risk for = drugs); low-potency 
Molindone molindone least for drugs) 
Perphenazine molindone 
Thioridazine 
Tiotixene 
Trifluoperazine 





“See Table 64.1 for list of low-, mid-, and high-potency (with respect to dopamine D2 receptor blockade) antipsychotic drugs. 
Estimates were derived by the chapter authors. 


Source: data from the International Psychopharmacology Algorithm Project (IPAP), Algorithm for the treatment of schizophrenia, Copyright (2008), International Psychopharmacology 
Algorithm Project (IPAP). 


Atypical antipsychotic drugs 
































Tardive Prolactin Sedation Orthostasis | Anticholinergic | Glucose 
dyskinesia elevation dysregulation 
and 
dyslipidaemia 

Amisulpride Low Rare High Low Low Low Rarely Rarely 
Aripiprazole Low (moderate at higher Rarely tolow Rarely Rarely to Low Low to Rarely Rarely 

doses low moderate 
Asenapine Low (moderate at higher ? Low to moderate Low to Low to Rarely Rarely Low 

doses moderate moderate 
Brexpiprazole Low (moderate at higher ? Low (risk may Low Moderate Rarely Rarely Low 

doses increase at higher 

doses) 

Cariprazine Low (moderate at higher ? Rarely to low Low to Low Rarely Rarely Low 

doses moderate 
Clozapine Rarely Rarely Transient High High High High High 
lloperidone Low 2 Rarely to low Low Moderate High Rarely Low 
Lurasidone Low (moderate at higher ? Rarely to low Low to Low Rarely Rarely Low 

doses) moderate 
Olanzapine Moderate (low if <10 Rare Low (if <20 mg/day) Moderate to High Low Low High 

mg/day) high 
Paliperidone Moderate 2 High Low Low to Moderate Rarely Low 

moderate 

Quetiapine Rarely Rarely Rarely Moderate Moderate Moderate Rarely to low Moderate 
Risperidone Moderate (less if <4 mg/ Rarely High Low Moderate Moderate Rarely Low 

day) 
Ziprasidone Rarely to low Rarely Rarely to low Rarely to Low Low to Rarely Rarely 

moderate moderate 
Estimates were derived by the chapter authors. In many cases, only limited data were available, particularly for newer agents. 
Additional information based on references [10, 168, 185, 203, 253, and 254], and US drug label information (available online at https://dailymed.nlm.nih.gov/dailymed/index.cfm) for 





asenapine, brexpiprazole, cariprazine, iloperidone, lurasidone, and paliperidone. 

There are insufficient data regarding the risk of tardive dyskinesia for asenapine, brexpiprazole, cariprazine, iloperidone, lurasidone, or paliperidone. 

Source: data from the International Psychopharmacology Algorithm Project (IPAP), Algorithm for the treatment of schizophrenia, Copyright (2008), International Psychopharmacology 
Algorithm Project (IPAP). 


Akathisia may be the most common EPS, occurring in up to 70% 
of patients treated long term with haloperidol [231]. The term re- 
fers to a subjective uncomfortable experience of motor restlessness 
which is relieved by movement. As such, patients will complain of 
discomfort and manifest increases in psychomotor behaviour. These 
symptoms can be distressing enough to cause agitation or suicidal 
behaviours [232]. Patient age does not seem to influence the risk of 
developing akathisia, but women are believed to be at higher risk. 
Diagnosis of this condition is necessary to prevent inadvertent in- 
creases in doses due to mistaken conclusion that discomfort from 
akathisia is due to worsening psychosis. This effect may be managed 
by reduction in dosage and switching to an atypical antipsychotic 
drug or a drug that is less likely to cause akathisia. Akathisia may 
respond to anticholinergic medications, usually within 3-7 days. 
Other options include low doses of benzodiazepines or -adrenergic 
blockers, assuming no contraindications to either. 

Parkinsonism caused by dopamine antagonist drugs resem- 
bles idiopathic parkinsonism. Diagnostically, severe dopamine 
antagonist-induced parkinsonism may resemble depression or nega- 
tive symptoms of schizophrenia; however, the associated motor signs 
and time course of symptoms in relation to starting antipsychotic 
treatment distinguish the former. Like akathisia, the onset and se- 
verity of drug-induced parkinsonism is related to dosage; thus, a 
lowering of the dose or switching to a medication that is less likely 
to cause this effect may provide significant relief or ameliorate the 
parkinsonian signs and symptoms altogether. When this is not feas- 
ible, anticholinergic medications may provide relief, typically within 
3-7 days. The response to anticholinergic medication is quite vari- 
able, however. 

Tardive dyskinesia emerges at various rates, depending upon 
age, sex, and diagnosis [233, 234]. The rate in younger patients is 
between 3% and 5% per year. It is higher in bipolar than schizo- 
phrenic patients and much higher in people above the age of 60. It is 
related to the dose and will be less likely with lower doses of typical 
antipsychotics. Tardive dyskinesia is ordinarily reversible, although 
irreversible and/or extremely severe and rarely life-threatening 
forms can occur. The best way to minimize its occurrence is to use 
an atypical antipsychotic drug in lieu of a typical agent, since these 
drugs as a class are associated with a much lower risk of tardive dys- 
kinesia [234]. Patients with mood disorders should generally not 
receive maintenance treatment with typical antipsychotic drugs, 
unless clinically necessary. There are no definitive treatments for 
tardive dyskinesia, though some controlled evidence supports the 
use of adjunctive gingko biloba extracts, amantadine, tetrabenazine, 
and valbenazine [235-240]. Generally, the best strategy is preven- 
tion through the use of atypical drugs and periodic screening with 
a structured assessment tool such as the Abnormal Involuntary 
Movement Scale (AIMS). There is some suggestion in the literature 
that continuation of antipsychotic treatment does not worsen tar- 
dive dyskinesia and may eventually result in stabilization and im- 
provement of tardive symptoms. Switching to clozapine is useful to 
control tardive symptoms when they are very severe. 

Neuroleptic malignant syndrome (NMS) is a rare, life-threatening 
side effect related to an apparent compromise of the neuromuscular 
and sympathetic nervous systems [241]. It usually occurs at the ini- 
tiation of treatment with a high-potency dopamine antagonist but 
may occur with any of the typical (or atypical) agents at any point. 
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NMS is characterized by muscle rigidity, breakdown of muscle fibres 
leading to large increases in plasma creatine kinase concentration, 
fever, autonomic instability, changing levels of consciousness, and 
sometimes death. It is treated by immediately discontinuing all 
dopamine antagonist drugs, applying external hypothermia, and 
supporting blood pressure. Administering a direct-acting dopamine 
agonist, such as bromocriptine or pergolide, and dantrolene sodium, 
which blocks the release of intracellular stored calcium ions, may 
be useful adjuncts. After successful treatment of NMS, an atypical 
antipsychotic should be used, even though these agents, including 
clozapine, may also induce NMS. 

The typical dopamine antagonist drugs produce a wide variety 
of other side effects, including weight gain, seizures (especially 
pimozide), sedation, hypotension, elevated liver enzymes, retinitis 
pigmentosa (thioridazine), orthostatic hypotension, prolongation 
of the QTc interval (low-potency phenothiazines, pimozide), and 
anticholinergic effects (mesoridazine, chlorpromazine, thiorida- 
zine). All of the typical dopamine antagonist drugs produce marked 
increases in serum prolactin levels, with increases being greater in 
females than males [33]. Prolactin elevations may affect sexual func- 
tion in both males and females, with difficulty achieving erection or 
orgasm being among the most common side effects [33]. 


Atypical antipsychotic drugs 
Clozapine 
Agranulocytosis 


It has now been reliably established that clozapine produces agran- 
ulocytosis in slightly <1 per 100 patients [242, 243]. The peak of 
agranulocytosis with clozapine occurs between 4 and 18 weeks and 
then falls off sharply. Weekly monitoring of the white cell or absolute 
neutrophil count is required for 26 weeks in most countries, with the 
frequency decreasing to biweekly or monthly thereafter, sometimes 
on a voluntary basis. In the United States, monthly monitoring is 
required, assuming no haematological abnormalities after 1 year of 
treatment. With monitoring, agranulocytosis can usually be detected 
before infection sets in or becomes overwhelming. Discontinuation 
of clozapine, beginning treatment with colony cell-stimulating fac- 
tors, and the usual procedures for treating an infection are usually 
effective in restoring the white cell line. 


Other side effects 


Clozapine produces a wide range of side effects [243]. These can 
generally be managed by dose adjustment and concomitant medi- 
cations. Clozapine produces hypotension because of its potent 
al-adrenoceptor antagonism and must be slowly titrated in most 
patients. Low-dose glucocorticoid treatment may be helpful in some 
patients with severe hypotension. Clozapine rarely, if ever, produces 
significant EPS, although some cases of akathisia and NMS have 
been reported. 

Major motor seizures are another important side effect of clo- 
zapine. They are dose-related, with the incidence being about 2% 
in patients at low doses and 6% at doses >600 mg/day. They are 
sometimes preceded by myoclonic jerks. Valproic acid and dose re- 
ductions are usually effective in preventing the progression of myo- 
clonic jerks or treating major motor seizures. Other anticonvulsants 
can be combined with clozapine, if needed, though caution would be 
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clearly warranted with the use of carbamazepine due to its potential 
for bone marrow suppression. 

Hypersalivation is a common side effect that usually re- 
sponds to anticholinergic therapy or clonidine. Exacerbation of 
obsessive-compulsive symptoms has been reported with clozapine. 
Augmentation with an SSRI or lithium carbonate is usually effective. 

Weight gain is a frequent side effect of clozapine, with about 30% 
of patients gaining >7% of body weight [243]. Dietary changes and 
exercise are useful in minimizing this effect. A related problem is the 
emergence of insulin resistance or type 2 diabetes or exacerbation of 
existing diabetes, with or without atherogenic changes in the serum 
lipid profile. There have also been reports of diabetic ketoacidosis 
that emerged in the context of clozapine treatment. Of the atypical 
drugs, clozapine and olanzapine are associated with the highest risk 
for clinically significant weight gain, as well as abnormalities in gly- 
caemic control and lipid homeostasis [146]. 

Somnolence, tachycardia, hypertension, constipation, and stut- 
tering are also side effects of clozapine. Tachycardia is treated only 
when the pulse is >100 beats/minute. B-blockers are effective to re- 
duce the heart rate but may also result in synergism of hypotensive 
effects [243]. 

There have been reports of clozapine-associated myocarditis and 
cardiomyopathy [244]. The presence of eosinophilia accompanied 
by cardiotoxic signs, such as tachycardia, fatigue, orthostasis, or re- 
spiratory problems (many of which are adverse effects of clozapine), 
should alert the clinician to the possibility of myocarditis and the 
need for medical evaluation. 

Finally, treatment with clozapine may not uncommonly result 
in an asymptomatic mild elevation in hepatic transaminase levels; 
however, there have also been reports of hepatotoxicity in the setting 
of clozapine treatment. Polypharmacy appears to be a risk factor. 
Cases of fulminant hepatotoxicity leading to liver failure are rare. 


Risperidone 


Risperidone is associated with moderate weight gain, comparable 
to that of typical dopamine antagonist drugs in most cases and less 
than that of clozapine and olanzapine [146]. Risperidone also pro- 
duces some postural hypotension because of its al-adrenoceptor- 
blocking properties. Risperidone produces greater increases in 
serum prolactin secretion than any of the other atypical anti- 
psychotic drugs [33]. The increases appear to be at least comparable 
to those of typical antipsychotics [245]. At higher doses, particularly 
above 6 mg daily in most adults, the incidence of EPS also increases 
[246], though typically not to the degree observed when using typ- 
ical dopamine antagonist drugs in clinical practice. Risperidone, like 
clozapine and other agents of this type, can sometimes exacerbate or 
induce symptoms of obsessive-compulsive disorder and tics, prob- 
ably due to its antiserotonergic properties. This can be counteracted 
in some patients by the addition of an SSRI. Risperidone is not asso- 
ciated with agranulocytosis or increased risk of seizures. Because of 
its low affinity for muscarinic receptors, risperidone treatment is not 
associated with significant anticholinergic effects. 


Olanzapine 


Olanzapine also produces dose-dependent EPS, including some 
dystonic reactions in patients with schizophrenia, but these are less 
frequent and severe than those produced by typical dopamine an- 
tagonist drugs or risperidone [245]. Olanzapine is less well tolerated 


than clozapine in patients with Parkinson's disease. Olanzapine, like 
other atypical drugs, is associated with a lower risk of tardive dyskin- 
esia than typical antipsychotics. 

The major side effect of olanzapine is weight gain [245]. Large 
weight gains due to increased appetite occur in 10-15% of 
olanzapine-treated patients during the first 6 months of treatment. 
Another 20-35% gain between 7% and 10% of baseline body weight. 
These gains tend to persist for as long as patients continue the medi- 
cation. Like clozapine, olanzapine is also associated with higher risk 
of insulin resistance, glycaemic changes, and the development of 
atherogenic dyslipidaemias [146]. Cases of diabetic ketoacidosis as- 
sociated with olanzapine treatment have been reported. 

Olanzapine is also associated with an increase in liver enzymes, 
orthostatic hypotension, anticholinergic side effects, and sedation. 
Many of these adverse effects are time-limited and reduce in inten- 
sity or resolve over the first few weeks of treatment with continuous 
use. Olanzapine produces transient increases in serum prolactin 
levels, which are smaller in magnitude than those produced by typ- 
ical dopamine antagonist drugs or risperidone [33, 245]. 

Olanzapine, like other agents of this type, can occasionally exacer- 
bate or induce symptoms of obsessive-compulsive disorder and tics, 
probably due to its antiserotonergic properties. This can be counter- 
acted in some patients by the addition of an SSRI. Olanzapine is not 
associated with agranulocytosis or increased risk of seizures. 


Quetiapine 


Quetiapine appears to have fewer EPS than either risperidone or 
olanzapine [225, 245]. Quetiapine is tolerated in patients with 
Parkinsons disease to a much greater extent than risperidone or 
olanzapine. The incidence of EPS with quetiapine in schizophrenic 
patients appears to be comparable to placebo. The major side ef- 
fects with quetiapine are headache, agitation, dry mouth, dizziness, 
weight gain, and postural hypotension [245]. 

With regard to weight gain and other metabolic effects, 
quetiapine treatment appears to confer moderate risk—similar to 
that of risperidone, but less than that associated with clozapine or 
olanzapine treatment [146]. Far less is known about the long-term 
effects of quetiapine on markers of glycaemic and lipid homeostasis. 
Nevertheless, clinically significant changes in serum lipids have 
been reported. 

Decreased serum thyroid hormone levels, increased hepatic 
transaminases, and elevated serum lipids have been reported. 
Decreases in total and free thyroxine, when they occur, are mild and 
non-progressive and are not believed to be clinically significant. The 
effect may be dose-dependent. Similar to clozapine, asymptomatic 
elevations in hepatic transaminases may be encountered early in the 
course of treatment, followed by a return to baseline values. Animal 
studies suggest an increased risk of cataracts [245]. Periodic oph- 
thalmological screening for lenticular opacities is recommended by 
the manufacturer, though no causal relationship between the use of 
quetiapine and the development of cataracts has been demonstrated 
to date. 


Ziprasidone 


Ziprasidone does not appear to significantly increase serum pro- 
lactin concentration and has a very low risk of EPS, weight gain, and 
changes in markers of glucose handling and lipid metabolism. Its 
major side effects are nasal congestion and somnolence [245], the 


latter of which is usually transient. There has been some concern 
of cardiac arrhythmias related to increased QTc interval; however, 
perusal of the available data does not reveal a significant problem 
in this regard. However, caution is warranted when considering the 
co-administration of ziprasidone with other drugs that are known 
to prolong the QTc interval, since ziprasidone has been associated 
with a significant increase in the QTc interval of 16.6 ms, which was 
greater than that with other atypical antipsychotics and haloperidol, 
but less than that with thioridazine [247]. Screening for electrolyte 
abnormalities and cardiac disease (including recent myocardial in- 
farction, congestive heart failure symptoms, and arrythmias, with 
or without syncope) may be indicated prior to starting ziprasidone. 


Aripiprazole 


Aripiprazole is well tolerated and does not appear to routinely cause 
hyperprolactinaemic changes at recommended dosages, although 
EPS has been reported to occur with aripiprazole, particularly for 
patients with affective disorders (major depression, bipolar dis- 
orders) [248]. Aripiprazole is also not associated with lower risk of 
clinically significant increases in weight or changes in markers of 
glucose handling or lipid homeostasis, relative to several other atyp- 
ical drugs such as clozapine, olanzapine, quetiapine, and risperidone 
[146]. Aripiprazole, ziprasidone, and lurasidone are believed to have 
the most advantageous metabolic risk profiles [146, 185]. 


Paliperidone 


Paliperidone was also well tolerated, with the most common side 
effect being tachycardia. Rates of discontinuation due to adverse ef- 
fects are very low. The risk of hyperprolactinaemia with paliperidone 
appears to resemble that of risperidone, although no head-to-head 
comparisons have been carried out. The changes in prolactin 
levels may be dose-related [168]. The EPS burden associated with 
paliperidone during short-term studies was low for the 6-mg dose; 
however, at higher doses, the incidence of EPS appears to be higher 
[168]. Short-term weight gain in a meta-analysis of 15 randomized 
trials was shown to be +1.2 kg; however, measures of metabolic ef- 
fects with paliperidone showed no significant changes from baseline 
[185]. Similar results were found for paliperidone during medium- 
term treatment [249]. 


Lurasidone 


The most common adverse effects of lurasidone are somnolence, 
akathisia, nausea, and parkinsonism. The occurrence of akathisia 
appears to increase in a dose-dependent manner, up to 120 mg/day 
[10], and appears to be higher with lurasidone than with aripiprazole 
or asenapine, based on indirect comparisons vs placebo [250]. 
Minimal changes in body weight were observed among lurasidone- 
treated subjects in short-term studies [185]. In a pooled analysis of 
longer-term studies, there were only minimal changes from base- 
line in body weight, and the incidence rates of clinically significant 
weight gain (an increase of =7% above baseline) and weight loss (a 
decrease from baseline by the same amount) were 16% and 19%, 
respectively [251]. 


Asenapine 


The most common adverse effects of asenapine are insomnia, som- 
nolence, and EPS, including dose-dependent risk of akathisia [10]. 
Transient oral hypoesthesias (usually resolved within 1 hour) have 
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also been reported. Preliminary data suggest that asenapine may 
be associated with a similar degree of short-term body weight gain 
as paliperidone (+1.2 kg), but with low risk of clinically significant 
short-term changes in metabolic measures [185]. The relative risk of 
clinically significant weight gain (defined as a 27% weight increase 
from baseline) vs placebo was 4.1 (95% CI 2.3-7.4) [185]. 


lloperidone 


The most common side effects from short-term studies of 
iloperidone were dizziness, dry mouth, somnolence, and dyspepsia. 
Iloperidone was not shown to increase the risk of EPS during short- 
term follow-up [10]. Iloperidone was associated with numerically 
higher rates of orthostatic hypotension (20%) than haloperidol 
(15%) and risperidone (12%), although orthostatic hypotension 
with iloperidone generally did not persist past the first 7 days of 
treatment. Statistically significant increases in the QTc interval were 
observed with iloperidone treatment at all tested dose ranges [252]. 
Iloperidone has been associated with a statistically significant in- 
crease in body weight and changes in lipid profiles, relative to pla- 
cebo, but these results were based on analysis of only one clinical 
trial [185]. Additional studies of iloperidone’s anthropometric and 
metabolic effects are needed. 


Brexpiprazole 


Dose-dependent adverse effects of brexpiprazole include nausea, 
akathisia, and headache [253]. Brexpiprazole has also been associ- 
ated with modest weight gain, but as yet no significant differences 
from placebo with respect to adverse changes in glycaemic or lipid 
profiles have been observed in short-term randomized trials [253]. 
There is no consistent evidence yet of clinically relevant short-term 
changes in prolactin concentration with brexpiprazole therapy. 
Brexpiprazole 2 mg has been associated with decreases in prolactin 
levels, while the 4-mg dose has been associated with increased pro- 
lactin concentrations [203]. Additional studies are needed to clarify 
brexpiprazole’s anthropometric, prolactinaemic, and metabolic risk 
profile. 


Cariprazine 


Common adverse effects of cariprazine include insomnia, extra- 
pyramidal symptoms, akathisia, sedation, dizziness, anxiety, and 
gastrointestinal distress [254]. Cariprazine appears to be associated 
with greater risk of clinically significant weight gain than placebo 
[254]; however, no statistically significant differences in adverse 
metabolic parameters or cardiovascular-related events have been 
observed. Additional studies are needed to better define the adverse 
effect profile of cariprazine, particularly over longer-term treatment. 


Indications and contraindications 





The main indication for antipsychotic drugs is the treatment of all 
phases of schizophrenia, including acute, florid symptoms of psych- 
osis, and the prevention of relapse. Important other uses include the 
acute treatment and prophylaxis of mania, the acute treatment of de- 
pressive episodes in patients with bipolar disorders (for some agents 
such as quetiapine, olanzapine, lurasidone, and cariprazine) and 
major depression with psychotic features, the psychosis, agitation, 
and aggression of various dementias, the treatment of psychoses 
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due to L-dopa or other dopamine agonists in Parkinson’s disease, 
Tourette's syndrome, treatment-resistant obsessive-compulsive dis- 
order, self-injurious behaviour, porphyria, anti-emesis, intractable 
hiccoughs, and in some cases as antipruritics. Some current research 
has suggested that antipsychotic drugs may be of use to prevent the 
onset of schizophrenia by administering them to individuals who are 
in the prodromal phase of the illness. Several atypical antipsychotic 
drugs are being trialled on an experimental basis for various char- 
acter disorders such as borderline, schizoid, and schizotypal person- 
ality disorders. Clozapine, which has the lowest incidence of EPS 
of any of the antipsychotic drugs, has some special applications in 
neurological conditions such as essential tremor and the treatment 
of water intoxication syndrome in schizophrenic patients. Uses in 
other psychiatric and neurological conditions may be expected to 
emerge, as the safety profile of these agents is better described. 





Anti-parkinsonian agents 


Anticholinergic drugs 


Anticholinergic, antihistaminic, benzodiazepines, dopamine agon- 
ists, and B-blockers are of importance in the management of EPS. 
They are usually needed with typical dopamine antagonist drugs, but 
some patients will require such treatment when taking atypical anti- 
psychotic drugs. Anticholinergics and antihistaminics (for example, 
diphenhydramine) are used to treat acute dyskinesias and dystonias, 
parkinsonian side effects, and akathisia [255]. These agents block the 
effects of increased acetylcholine release due to D2 receptor blockade 
in the basal ganglia. The most widely used anticholinergic drugs are 
benztropine (1-6 mg/day, usually in divided doses), biperiden (2- 
16 mg/day in two or three doses), procyclidine (5-30 mg/day), and 
trihexyphenidyl (1-15 mg/day in a single or divided doses). 

These anticholinergic agents are competitive antagonists of the 
five subtypes of muscarinic receptors (M1 through to M5). They 
have minimal antagonist effects at nicotinic cholinergic recep- 
tors. Blockade of cholinergic receptors on intrastriatal neurons by 
these agents restores the cholinergic balance, which is disrupted by 
blockade of D2 dopamine receptors. Other central effects include 
impairment of various forms of memory. Elderly patients, in par- 
ticular, may develop anticholinergic-induced agitation, irritability, 
disorientation, hallucinations, and delirium because of the natural 
loss of cholinergic neurons with ageing. 


Side effects 


These agents have some preference for the central nervous system, 
but peripheral anticholinergic effects are to be expected. Blockade 
of vagal tone in the heart produces tachycardia. Other adverse ef- 
fects include decreased bladder function, urinary retention, and 
decreased bowel motility leading to constipation and impaction. 
Decreased saliva and bronchial secretions contribute to dry mouth 
and increased dental caries, while decreased sweating increases the 
risk of heat stroke. Blockade of muscarinic receptors in the eye cause 
pupillary dilatation and inhibition of accommodation, leading to 
photophobia and blurred vision. The muscarinic receptors in the 
basal ganglia are predominantly M2, whereas those in the periphery 
are M1. The rank order of anticholinergic drugs for relative select- 
ivity for the M2 receptor is biperiden, procyclidine, trihexyphenidyl, 
and benztropine. All these agents can cause dry mouth, blurred 


vision, urinary retention, constipation, and increased intraocular 
pressure. They may cause anticholinergic delirium in elderly patients 
or after taking high doses. Biperiden is less likely to cause peripheral 
anticholinergic effects. Benztropine, biperiden, and trihexyphenidyl 
may cause euphoria because of their ability to inhibit dopamine re- 
uptake and they may be subject to abuse. 


Indications 


Anticholinergic drugs or the antihistamine diphenhydramine 
are given IM for the treatment of acute dystonic reactions. They 
are usually effective within minutes and may have to be repeated. 
It is usually not necessary to prescribe an oral anticholinergic fol- 
lowing a dystonic reaction, though some may require their brief 
use, depending on which antipsychotic is prescribed. These agents 
should not be given prophylactically, unless the patient is at estab- 
lished risk for EPS at the dose of antipsychotic which is being started. 
If akathisia or parkinsonism develops following treatment with a 
typical dopamine antagonist drug, the first consideration should be 
whether to continue to use the offending agent, drop the dosage, or 
substitute with an atypical antipsychotic drug. If decreasing the dose 
of antipsychotic drugs is not clinically feasible, substituting with an 
atypical agent is clearly the recommended choice since it avoids all 
the unpleasant side effects of anticholinergic agents. 


Other drugs 


Amantadine, which also has antiviral actions, is able to increase the 
release of dopamine in the basal ganglia, which diminishes the re- 
lease of acetylcholine. It may improve drug-induced EPS [255, 256]. 
It has also been reported to improve sexual function and decrease 
weight gain due to dopamine antagonist drugs. But it may also cause 
worsening psychosis, increased arousal, agitation, and indigestion. 
As such, caution is warranted when using amantadine to treat EPS. 
The usual oral dose is 100-400 mg/day. 

B-blockers, such as propranolol, atenolol, and pindolol, are useful 
for treating akathisia and tremor [255]. They may cause bradycardia, 
and particularly immediate-release forms should not be stopped 
abruptly due to rebound tachycardia. 

Benzodiazepines, such as clonazepam, lorazepam, and diazepam, 
are useful for treating akathisia, acute dystonias, and acute dys- 
kinesias. They can cause drowsiness and lethargy, and have abuse 
potential. 


Conclusions 


Dopamine antagonist/partial agonist drugs are the principle treat- 
ment for schizophrenia, and selected agents are widely used in other 
psychotic and non-psychotic disorders such as bipolar disorder and 
major depression. Their main benefits are to treat psychotic symp- 
toms across various illnesses. The atypical drugs have largely dis- 
placed the typical ones because of lower EPS and tardive dyskinesia 
risk profiles. However, several atypical drugs are associated with 
higher risk of weight gain and dysmetabolic effects. Atypical, but not 
typical, drugs appear to improve negative symptoms and cognition. 
Newatypical drugs for psychosis continue to be developed, including 
those with partial agonist activity on presynaptic dopamine recep- 
tors and more extensive interactions with serotonergic receptors 
[257]. Several atypical drugs have established effectiveness—or 
have shown promise—for treating bipolar depression and major 


depression (as adjunctive therapy). Long-term treatment of affective 
disorders with typical antipsychotics is generally avoided, owing to 
heightened risk of tardive dyskinesia in such patients. 

Dopamine antagonist/partial agonist drugs are useful as both 
acute and maintenance treatment to prevent the recurrence of psych- 
otic symptoms in patients with schizophrenia and related psychotic 
disorders. The greater EPS and tardive dyskinesia risk of the typical 
antipsychotics, coupled with their lesser efficacy to improve negative 
symptoms and cognition, suggest that atypical agents are preferred, 
whenever possible, for long-term treatment. These advantages must 
be balanced against the greater long-term risk of weight gain and ad- 
verse metabolic effects associated with some atypical antipsychotic 
drugs. Clozapine, despite its risk of agranulocytosis and adverse meta- 
bolic effects, is the treatment of choice for patients with treatment- 
resistant schizophrenia. Although they have been considered together 
here asa single group, atypical antipsychotics have somewhat different 
pharmacologic profiles. Outside of clozapine for refractory schizo- 
phrenia, it is not yet clear which of these agents should be trialled in 
a given patient, but ongoing research may clarify this. These com- 
pounds, as well as others expected to be approved for use in the near 
future, will need to be compared with each other to determine if differ- 
ential indications exist. Side effect differences among these drugs, as 
well as the availability of long-acting preparations, may help clinicians 
choose among them. As long as the typical drugs remain in use and for 
some patients who receive atypical agents, anticholinergic and other 
anti-parkinsonian drugs will continue to be necessary to treat EPS. 

Because of the high rates of poor adherence to drugs in patients 
with schizophrenia and other disorders, it is important to develop 
more long-acting injectable atypical antipsychotics. Fortunately, 
several typical and atypical antipsychotic drugs have long-acting 
injectable formulations that are currently available for clinical use. 

While the current group of atypical antipsychotic drugs is pre- 
dominantly characterized by relatively more potent 5-HT2A than D2 
receptor antagonism, or by partial D2 agonist effects, new pharma- 
cological strategies will emerge for drugs with lower EPS liability 
than the typical antipsychotics. Because newer antipsychotic drugs 
are so effective in treating core psychotic symptoms, while limiting 
the risk for EPS, the remaining challenge rests in the development 
of agents that effectively treat other important features of schizo- 
phrenia, such as cognitive impairment and negative symptoms, 
without the metabolic and other side effects associated with these 
agents. Finally, the development of biomarkers, particularly genetic 
measures, to enable personalized medicine and reduce the number 
of therapeutic trials required to find optimal treatments for patients 
requiring drug treatment for psychosis remains a long-sought goal 
in the field that may be achievable in the near term [258-260]. 
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In this chapter, treatment will be defined as ‘medical care given to 
a patient for an illness or injury’ [1]. The treatment section will 
focus on biological aspects of schizophrenia and comorbid med- 
ical illnesses for which efficacious somatic treatments are available, 
including: 


1. Positive psychopathology (delusional beliefs, hallucinatory 
perceptions, and disorganization of thinking and behaviour) 
associated with excessive dopamine release, which responds ad- 
equately to first-line antipsychotic medication (FL-APM.) 

2. Positive psychopathology which does not respond adequately to 
FL-APM, for which a trial of clozapine is indicated. 

3. Comorbid medical illnesses that shorten the lifespan of patients 
with schizophrenia, including those caused or worsened by 
treatment with FL-APM or clozapine. 


In this chapter, management will be defined as ‘the manner in 
which someone behaves toward or deals with someone or some- 
thing’ [1]. The management section will focus on: 


1. Dealing with enduring deficits in cognition, motivation/expres- 
sion, and sensorimotor function resulting from biological pro- 
cesses for which no somatic treatments are currently available, 
that are prominent in a subset of patients. 

2. Dealing with enduring deficits in self-control, manifest in 
comorbid substance use disorder (SUD), impulsive character 
traits, and high-risk and disruptive behaviours, that are prom- 
inent in a subset of patients. 

3. Dealing with enduring distress from adverse childhood experi- 
ences, ongoing life stress, and external and internalized stigma. 


Treatment (medical care) can only be delivered effectively if clin- 
ical care systems modify their expectations of patients’ functional 
capacities and accommodate clinical interactions and environ- 
ments to manage (deal with) the enduring attributes of patients with 
schizophrenia. 





Team-based care, for example, comprehensive first-episode 
psychosis (FEP) treatment programmes, integrated mental health 
care/primary care treatment programmes, and assertive community 
treatment (ACT) programmes, may offer the greatest benefit for the 
majority of patients with schizophrenia. 

The enduring attributes of patients with schizophrenia interfere 
with chronic illness self-management. Family and important others 
(for example, partners, friends, and certified peer specialists) can as- 
sist where patients’ functional capacities are limited; they should be 
welcome as members of care teams. 








Medical care for dopamine-related positive 
psychopathology (FL-APM) 


Dopamine release in the terminal fields of ventral tegmental area 
(VTA) neurons is increased in the majority of unmedicated patients 
experiencing FEP or recurrent psychosis and is correlated with the 
severity of positive psychopathology [2]. Dopamine release in these 
regions is involved in the assignment of salience to items of sensory 
experience and intrapsychic life [3]. 

Patients entering FEP report experiences compatible with in- 
appropriate assignment of salience to random observations and 
thoughts—“There is a brightness and clarity of outline of things 
around me. Last week I was with a girl and suddenly she seemed 
to get bigger and bigger, “Ihe sun seemed too big for me and it was 
coming closer, ‘Big, magnified thoughts come into my head when 
I am speaking and put away the words I want to say, and make me 
stray away from what was in my mind [4]. 

FL-APM act by binding, as antagonists or partial agonists, to 
dopamine D2 receptors in the terminal fields of VTA dopamine 
neurons, shielding them from excessive dopamine. These dopamine 
neurons subsequently reduce their activity, shifting into depolariza- 
tion block [5]. 

The efficacy of FL-APM is greatest early in the illness. Therapeutic 
benefit occurs at lower doses of FL-APM in FEP, and a higher 
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percentage of patients achieve remission of positive psychopath- 
ology (all items rated ‘mild’ or less, that is, they are uncommon 
and non-intrusive and do not drive behaviour), in comparison to 
chronic, multi-episode patients [6]. 

The goal of treatment with FL-APM is the achievement and main- 
tenance of sustained remission, which is associated with better 
quality of life. For most patients, uninterrupted treatment with FL- 
APM is necessary for sustained remission. 

Treatment with FL-APM should be initiated early. Longer dur- 
ation of unmedicated psychosis (DUP) prior to FL-APM for FEP is 
associated with reduced likelihood of remission, more prominent 
negative psychopathology, and poorer functioning [7]. Longer 
cumulative duration of unmedicated relapses during the years 
following FEP is associated with loss of brain volume, more prom- 
inent cognitive impairment, and more FL-APM-resistant positive 
psychopathology [8]. 

The benefits of sustained remission with uninterrupted FL- 
APM are large. The risks of relapse are grievous, including derail- 
ment from trajectories of development (school, work, independent 
living, and relationships), violence, self-injury, hospitalization, or 
incarceration. 

It appears that a minority of patients in sustained remission can 
be withdrawn from FL-APM after FEP and maintain good function 
and quality of life [9]. However, clinicians cannot identify these pa- 
tients prospectively. 

If a remitted patient insists on FL-APM discontinuation, gradual 
tapering over several months, with close observation by others (for 
example, family) and regular clinical contact, may permit early 
detection of relapse and re-initiation of FL-APM before grievous 
consequences occur. 

Non-remitted patients are highly likely to rapidly relapse after 
FL-APM discontinuation, and FL-APM discontinuation should be 
discouraged [10]. 

Interruptions in treatment with FL-APM following FEP are 
common (50% in the first year), delay the achievement of remission, 
and increase the likelihood of relapse 5-fold [11]. Factors associated 
with relapse include comorbid SUD, lack of insight, and lack of en- 
gaged family [12], all acting through the final common pathway of 
interrupted FL-APM. 

Impaired insight, an enduring deficit, is associated with higher 
levels of psychopathology and cognitive impairment, and reduced 
brain volumes, in patients with FEP [13]. 

Long-acting injected (LAI) FL-APM provide real-time, accurate 
information regarding treatment interruption. If patients fail to 
present for, or refuse, scheduled injections, clinicians and engaged 
family/others can express concern. Shared decision-making is 
undertaken with shared facts. 


Tolerability 


There is little difference in efficacy, but substantial differences in side 
effects (and the ease with which side effects can be mitigated) among 
the FL-APM. Many FL-APM and many strategies to mitigate their 
side effects are available. It is reasonable to propose to patients the 
goal of minimal or no subjective distress or objective toxicity with 
FL-APM. 

Clinicians may select 2-3 FL-APM to prescribe regularly, so as 
to become experienced and skilled in their use. Characteristics to 
consider include: 


1. Side effects that can be mitigated by sensible dosing or well- 
tolerated concomitant medications. 

2. Once-daily dosing of oral preparations. 

3. Availability of LAI preparations. 

4. Affordability. 


Clinicians should describe common potential side effects of any 
FL-APM prescribed and strategies to monitor for, and mitigate, 
them. Patients should have contact information for clinicians, so 
that relief of side effects is quickly achieved and FL-APM discon- 
tinuation avoided. 


Early-onset extra-pyramidal side effects (EPSEs) 


Dystonic reactions are spasmodic contractions of the face, neck, 
trunk, and/or extremities that can be frightening and uncomfort- 
able, occurring most commonly in young male patients receiving 
high doses of FL-APM with preponderant dopamine D2 antag- 
onism (for example, haloperidol). Dystonic reactions can be rap- 
idly relieved by intramuscular anticholinergic or benzodiazepine 
medications. Clinicians may then consider switching the FL-APM 
or lowering the dose. 

Restlessness (akathisia) occurs most commonly during treatment 
with haloperidol-like FL-APM or dopamine D2 partial agonists (for 
example, aripiprazole). Unrelenting restlessness has been associated 
with suicide and violence. Clinicians should inquire about, and ob- 
serve for, (pacing, foot-tapping) restlessness. Clinicians may con- 
sider switching the FL-APM. Otherwise, extended treatment with 
B-blocking agents or benzodiazepines may be necessary to mitigate 
restlessness. 

Bradykinesia-rigidity indicates excessive dopamine D2 re- 
ceptor antagonism (calling for reduced FL-APM dose) or in- 
trinsic susceptibility (suggesting consideration of clozapine). 
Tailoring individualized doses of haloperidol-like FL-APM to 
the ‘neuroleptic threshold; at which point minimal bradykinesia- 
rigidity can be detected on examination, but no coarse EPSE is 
present, offers the therapeutic benefit of these agents with good 
tolerability [14]. 

Anticholinergic medications prescribed to treat early EPSE of FL- 
APM have dose-related side effects, including dry mouth, blurred 
vision, constipation, and cognitive impairment [15]. Reduction in 
coarse EPSE produced by anticholinergic medication is not accom- 
panied by a reduced risk for tardive dyskinesia (TD). Dose reduction 
of FL-APM is the preferred approach to coarse EPSE. 


Late-onset EPSE 


Tardive dyskinesia (TD) is manifest as involuntary, purposeless, 
repetitive movements commonly involving the lips (for example, 
pursing or smacking), jaw, and/or tongue, but also the upper regions 
of the face (blinking), distal extremities, and trunk. Patients with TD 
have difficulty not moving. 

The movements can be disfiguring and stigmatizing. ‘Mild’ move- 
ments in a patient whose job requires social interaction may have 
‘severe’ repercussions. 

TD is believed to result from enhanced sensitivity of striatal dopa- 
mine D2 receptors (after years of deprivation of dopamine), which 
respond excessively to ambient synaptic dopamine. 

Patients with FL-APM-resistant schizophrenia, especially those 
who are violent, may receive high doses of FL-APM, leading to TD. 
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Clozapine may relieve FL-APM-resistant positive psychopathology 
and TD in such patients [16]. 

Inhibitors of vesicular monoamine transporter-2 (VMAT2) block 
the uptake of dopamine into presynaptic vesicles of nigrostriatal 
dopamine neurons; these agents significantly reduce the severity of 
TD [17]. 

Neuroleptic malignant syndrome (NMS) is characterized by muscle 
rigidity that may be intense, creating heat and dissipating moisture. 
Immobility prevents fluid intake and increases the risk for infection 
and thromboembolism. Muscle fibre breakdown releases myoglobin 
that can precipitate in renal tubules in the setting of dehydration. FL- 
APM should be discontinued until NMS is resolved. 

Supportive care focusing on hydration, identification and treat- 
ment of infection, prevention of thromboembolism, and treatment 
with dopamine agonists (for example, amantadine) and benzodi- 
azepines reduces morbidity and mortality [18]. 

In severe cases that do not respond to supportive care, dantrolene 
(reduces excitation-contraction coupling in skeletal muscle) or 
electroconvulsive therapy should be considered. Extended muscle 
rigidity may result in persistent contractures. 


Prolactin-related side effects 


Elevated prolactin levels result from antagonism at tubero- 
infundibular dopamine D2 receptors, located outside the blood- 
brain barrier (BBB), by FL-APM that penetrate the BBB poorly (for 
example, risperidone). Breast enlargement/lactation and sexual dys- 
function may result in men and women, and menstrual cycling may 
be suppressed in women. Bone demineralization may result from 
reductions in oestrogen levels [19]. 

Clinicians may consider preferentially prescribing FL-APM that 
do not elevate prolactin. 

Weight gain and metabolic side effects are discussed in Antipsychotic 
medications may cause or worsen modifiable risk factors , p. 671. 


Medical care for positive psychopathology that responds 
inadequately to FL-APM 


Approximately 80% of FEP patients achieve remission of positive 
psychopathology over months of uninterrupted treatment with FL- 
APM; in most, substantial improvement is apparent within days. 
Approximately 20% are treatment-resistant at FEP and remain so 
thereafter [20]. Earlier onset of FEP and longer DUP prior to initi- 
ation of FL-APM reduce the likelihood of remission. 

During the years following FEP, the duration of unmedicated 
psychotic relapses is associated with accrued FL-APM-resistant 
positive psychopathology and reduced brain volume [8]. 
Approximately 33% of patients become FL-APM-resistant, no 
longer demonstrating correlations between dopamine release and 
positive psychopathology [21]. 

Clozapine reduces positive psychopathology and mortality [22] 
in at least 40% of FL-APM-resistant patients, reduces aggression in 
patients selected for aggressiveness [23], and reduces self-injury in 
patients selected for self-injury [24]. Community tenure (avoidance 
of hospitalization or incarceration) is increased with clozapine. 

Clozapine is under-utilized. Clinicians unfamiliar with the use of 
clozapine may consider referring FL-APM-resistant patients to clin- 
icians who are experienced in its use. 

Clozapine should be considered after two FL-APM trials have 
failed to produce adequate therapeutic benefit. Olanzapine may 


offer the greatest therapeutic benefit among FL-APM and should 
be considered for the second trial [25]. For early-onset psychosis or 
if aggression or self-injury is prominent, clozapine should be con- 
sidered after one failed FL-APM trial. Patients who prove to be ex- 
quisitely susceptible to EPSE or who have experienced NMS may do 
best with clozapine. 

Augmentation of clozapine with electroconvulsive therapy may re- 
duce positive psychopathology, aggression, or self-injury in incom- 
plete responders to clozapine alone [26]. 


Clozapine side effects 


Agranulocytosis occurs in approximately 1% of clozapine-treated pa- 
tients, most commonly during the first 3 months of treatment [27]. 
Risk evaluation and mitigation strategies (REMS) guide monitoring. 

Myocarditis occurs in approximately 0.2%, most commonly 
during weeks 3-4 of treatment. Monitoring eosinophil count, in- 
flammation (for example, C-reactive protein), and muscle damage 
(for example, troponin) at baseline and at weeks 1-4 permits early 
detection and discontinuation of clozapine prior to extensive heart 
damage [28]. Slower titration of clozapine dose may reduce risk. 

Seizures occur in 2-6%; risk is greater with prior seizures (for 
example, febrile seizures in childhood), head injury, and higher 
clozapine dose [28]. Therapeutic drug monitoring may avoid exces- 
sive clozapine blood levels. Myoclonus during dose titration fore- 
shadows seizures [29]. 

Salivary pooling, related to increased salivation and decreased swal- 
lowing, may increase the risk for aspiration, especially during sleep. 
Quaternary anticholinergic agents (for example, glycopyrrolate) re- 
duce salivation and do not penetrate the BBB [30]. 

Constipation is common and can lead to adynamic ileus and death. 
Daily stool softeners (for example, docusate) and a weekly osmotic 
laxative can be started when clozapine is initiated [31]. A stimulant 
laxative can be prescribed as needed. 

Venous thromboembolism risk is increased 30-fold with clozapine 
[32]. Daily aspirin should be considered when starting clozapine. 

There is no therapeutic alternative to clozapine in patients whose 
positive psychopathology, aggression, or self-injurious behaviour 
are resistant to FL-APM. Clinicians must strive to mitigate cloza- 
pine side effects for patients who benefit from its use, because few, if 
any, other therapeutic options exist. 


Accelerated mortality 


Patients with schizophrenia die more than 20 years earlier than gen- 
eral population controls, primarily from medical causes of death that 
are similar to those of the general population (heart attacks, strokes, 
cancer, and complications of diabetes mellitus) [33]. Modifiable 
risk factors (MRFs) (for example, insulin resistance, dyslipidaemia, 
hypertension, cigarette smoking) for these causes of death are more 
prevalent and more severe, at any age, among patients with schizo- 
phrenia than among general population controls [34]. 

Children later diagnosed with schizophrenia are less physically 
active than general population children. Medication-naive FEP pa- 
tients have higher rates of obesity, insulin resistance, and smoking 
than general population controls [35]. 

Patients with schizophrenia and comorbid SUD (vide infra) en- 
gage in more high-risk behaviours, resulting in higher rates of in- 
fection with bloodborne viruses [HIV, hepatitis B virus (HBV), and 
hepatitis C virus (HCV)] than general population controls. 
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Rates of suicide, homicide, and accidental death are also increased. 


Antipsychotic medications may cause or worsen 
modifiable risk factors 


FL-APM have varying propensities to cause or worsen weight gain, 
insulin resistance, or dyslipidaemia, or to reduce physical activity 
through sedation or EPSEs [35]. Clinicians may consider preferen- 
tially prescribing FL-APM with minimal propensities to cause or 
worsen MRFs. 

Olanzapine and clozapine have high propensities and should not 
be prescribed for patients who achieve adequate therapeutic benefit 
from more benign FL-APM. If olanzapine or clozapine are neces- 
sary to achieve adequate therapeutic benefit, pre-emptive mitiga- 
tion strategies (see Interventions to mitigate modifiable risk factors, 
p. 671) should be considered when these agents are started. 


Patients with schizophrenia commonly receive poor 
medical care 


Rates of adherence to practice guidelines in general medical care set- 
tings are ‘considerably lower’ for patients with schizophrenia than 
for non-schizophrenic patients [33, 34]. Patients with schizophrenia 
and diabetes mellitus or hypertension [36] are less likely to receive 
guideline-concordant screenings, testing, and treatments. Following 
myocardial infarction, patients with schizophrenia are less likely to 
receive indicated reperfusion procedures [37]. 

Patients with schizophrenia and cancer die sooner than those 
with cancer alone. Including a psychiatrist in the cancer care team, 
ongoing prescription of antipsychotic medications, and prevention 
of relapse after cancer diagnosis significantly reduce the likelihood 
of disruption in cancer care [38]. 

The ‘chronic care model’ for optimal treatment and management 
of chronic biological illnesses envisions ‘informed, activated patients 
interacting productively with ‘prepared, proactive practice teams 
[39]. Patients with schizophrenia have difficulties being informed 
and activated because of distracting positive psychopathology and 
enduring deficits in cognition, motivation, and expression. Few 
medical care systems accommodate to make interactions with pa- 
tients with schizophrenia more productive. 


Interventions to mitigate modifiable risk factors 


Providing uninterrupted antipsychotic medications is associated 
with decreased all-cause mortality in schizophrenia [40]. 

Co-localization of primary care in mental health care facilities 
simplifies the cognitive/motivational challenges that patients with 
schizophrenia must overcome (for example, appointments at mul- 
tiple locations and different times), facilitates communication among 
treating clinicians, and improves primary care outcomes [41]. 

Embedding nurses in mental health care facilities to assist pa- 
tients with schizophrenia in overcoming obstacles to care (for 
example, making primary care appointments and arranging trans- 
portation), instruct patients and their families in chronic illness 
self-management, and co-ordinate prescribing across mental health 
and primary care providers reduces MRFs and mortality risk [42]. 
Certified peer specialists may also assist patients with chronic illness 
self-management, in particular through increased physical activity 
and medication compliance [43]. 

Efforts at smoking cessation should include both individual or 
group psychosocial support and adjunctive pharmacotherapy. 


Efforts without pharmacotherapy are rarely successful in patients 
with schizophrenia. In 12-week trials, varenicline is superior to 
bupropion/nicotine replacement therapy (B/NRT), and B/NRT 
is superior to placebo in achieving smoking cessation in patients 
with schizophrenia [44], but post-trial relapse rates are high. 
Maintenance treatment with varenicline is significantly superior to 
placebo in sustaining smoking cessation [45]. Neither varenicline 
nor B/NRT has been associated with worsening of psychopath- 
ology or suicide, relative to placebo. Clinicians treating patients 
with schizophrenia may consider becoming skilled in the use of 
varenicline and B/NRT. 

Metformin [46] and liraglutide [47] are associated with weight 
loss and reduced insulin resistance in patients receiving dopa- 
mine antagonist medications with high propensities to cause or 
worsen MRFs. Clinicians may consider becoming skilled in their 
use. Topiramate reduces weight gain associated with antipsychotic 
medications [48]. 

Patients with schizophrenia are more sedentary than the general 
population. Diet and exercise programmes targeting weight and fit- 
ness may reduce MRFs in patients with schizophrenia, but they are 
not commonly available. 


Women’s issues 


Women with schizophrenia report more lifetime sexual partners and 
higher rates of being raped or engaging in prostitution, but are less 
likely to have been tested for HIV, than general population women. 
They report fewer planned pregnancies, more unwanted pregnan- 
cies, and more abortions [49]. 

Sexual violence is associated with more severe positive psycho- 
pathology, comorbid SUD, and homelessness. Among homeless 
women with schizophrenia, sexual violence (97% lifetime inci- 
dence) ‘amounts to normative experience’ [50]. 

The benefits of continuing antipsychotic medications through 
pregnancy to prevent relapse outweigh the low risks of teratogen- 
icity. However, antipsychotic medications throughout pregnancy 
is associated with greater weight gain and risks for gestational 
diabetes. Women with schizophrenia have higher rates of 
preterm birth. 

Clozapine treatment during pregnancy is associated with over 2- 
fold increase in gestational diabetes. Infants born to mothers taking 
clozapine are more likely to be ‘floppy’ and to experience seizures, 
and they must be checked for agranulocytosis [51]. 


Enduring deficits in cognition, motivation/expression, 
and sensorimotor function 


A subset of the offspring of patients with schizophrenia, including 
many who will later become psychotic, display deficits in cognition, 
motivation/expression, and /or sensorimotor function in infancy 
and childhood, long before the onset of psychosis [52]. 

In large population cohorts, characterized in childhood and fol- 
lowed through adulthood, a subset of those children who will later 
become psychotic display deficits in cognition, motivation/expres- 
sion, and/or sensorimotor function [53]. 

At adult-onset FEP, prominent impairments in cognition, mo- 
tivation/expression, and/or sensorimotor function are present in a 
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subset of patients and are associated with poor subsequent social 
and occupational function [54]. In early-onset FEP, prominent im- 
pairments are present in >80% of patients, in association with brain 
volume loss and FL-APM-resistant positive psychopathology [55]. 

During the years following FEP, increasing deficits in cognition 
and motivation/expression occur in a subset of patients, accom- 
panied by brain volume loss and increasing FL-APM-resistant posi- 
tive psychopathology [8]. 

Many, but not all, studies suggest that prominent impairments in 
cognition, motivation/expression, and sensorimotor function tend 
to occur in the same patients. Parsimony suggests the unproven pos- 
sibility that the deficits share a common pathophysiology with vari- 
able distribution throughout the central nervous system. The deficits 
are enduring. They do not go away. 


Cognitive deficits 


At FEP, patients perform at a level 1.5 standard deviations below the 
general population, with particular deficits in learning and memory 
[56]. Patients with chronic schizophrenia perform at a level 1.5-2.5 
standard deviations below the general population (in the range of 
the lowest 5-10% of the general population), substantially limiting 
their abilities to engage successfully in real-world tasks [57], for ex- 
ample, self-management of chronic mental and physical illnesses. 

Patients with schizophrenia show pronounced hypoactivation 
of circuitry in the ‘multiple demand network’ of the left lateral pre- 
frontal cortex, including portions servicing initial, rudimentary 
processes (for example, updating, set shifting, and inhibition), com- 
patible with a broad disruption of neuropsychological perform- 
ance encompassing multiple cognitive domains [58]. Structural 
imaging studies also demonstrate disruption of white matter tracts 
‘facilitating information processing in the cortico-basal ganglia net- 
work in patients with chronic schizophrenia [59]. 

The effects of APM on cognition in schizophrenia appear to be 
minimal. Adjunctive pharmacotherapies targeting cholinergic, 
glutamatergic, or serotonergic neurotransmissions have, so far, 
failed to improve cognition. Cognitive remediation has shown 
benefit in improving function in some, but not all, trials and may be 
of greatest value in supported employment settings [60]. Long-term 
durability of effects is not established. 

Cognitive function is associated with cardio-respiratory function 
in patients with schizophrenia. Twelve-week programmes that en- 
gage patients with schizophrenia in vigorous physical exercise ap- 
pear to improve cognitive function [61] and brain volume. These 
gains do not persist after the exercise programmes end, as patients 
resume their pre-programme activity levels. 

Modifications/accommodations to deal with cognitive deficits are 
discussed in Modifications and accommodations, p. 672. 


Motivation/expression deficits 


The assessment of ‘primary’ deficits in motivation/expression should 
be preceded by detection and correction of superficially similar phe- 
nomena, for example, akinesia or somnolence due to prescribed 
medications or SUD, catatonia, social withdrawal due to positive 
psychopathology (for example, paranoia), depression and anxiety, 
or behavioural regression due to isolation or social deprivation [62]. 

Deficits in motivation most powerfully predict social and oc- 
cupational function [63]. Patients with schizophrenia appear able 
to experience pleasure when exposed to pleasant experiences but 


have ‘difficulty using mental representation of affective value to 
guide decision-making and goal-directed behavior [63]. Areas of 
‘dysconnectivity associated with reward responsivity’ have been 
identified in functional imaging studies [64]. 

At present, there is no convincing evidence regarding efficacy for 
any treatment for enduring deficits in motivation/expression [65]. 


Neurological soft signs 


Neurological soft signs (NSS) are ‘non-localizing neurological ab- 
normalities that cannot be related to impairment of a specific 
brain region [66]. Typical NSS include impairments in motor co- 
ordination, the sequencing of complex movements, and sensory in- 
tegration (for example, right-left discrimination). 

NSS are observed in dopamine antagonist medication-naive 
FEP patients and in the majority of patients with chronic schizo- 
phrenia. NSS are strongly associated with cognitive and motiv- 
ation/expression deficits and with FL-APM-resistant positive 
psychopathology [67]. 


Modifications and accommodations 


Modifications are reductions in expectations as to what patients with 
schizophrenia can do, relative to general population controls. For 
example, patients with schizophrenia recall, on average, half as many 
words as general population controls after a single presentation of 
a word list [56]. They may do best with educational processes (for 
example, illness self-management information) that provide them 
with half as much information as the general population would 
be expected to remember. The information must be thoughtfully 
selected and include the critical facts to be transmitted. 

Accommodations are adaptions in clinical interactions (for ex- 
ample, offering multiple brief sessions over time to transmit illness 
self-management information gradually) and in clinical environ- 
ments (for example, co-localization of mental health and primary 
care) to achieve desired outcomes despite enduring deficits. 

Modifications and accommodations are recommended for the 
informed consent process [68] in patients with schizophrenia, 
including adequate time for repeated presentations in minimally 
distracting settings, involving family members, consideration of 
multiple media (written, spoken, figures), and feedback testing: 
‘Professionals ... and patients cannot be assured that they have 
understood one another—especially in the case of persons already 
known to suffer problems of comprehension—without some form 
of checking’ [69]. Clinicians may consider more widespread use of 
such procedures. 

Clinical staff and family/important others may also provide func- 
tions that patients cannot offer (for example, activity therapists 
initiating outings that patients with motivation deficits do not ini- 
tiate but may enjoy). 

Patients with schizophrenia display a wide range of severity of 
enduring deficits. In an optimal system, safe and comfortable housing 
options, offering a range of independence (matched to patients’ cog- 
nitive function and initiative), would be available. However, the type 
of housing provided appears to be mostly driven by regional ap- 
proaches to health planning [70]. Discussions about independence 
should include the risks of loneliness and of relapse [71]. 

In an optimal system, a range of opportunities for productive ac- 
tivities would be available, from sheltered workshops through ac- 
commodated work placements without pay (disability benefits are 
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preserved) and supported employment programmes. Only approxi- 
mately 10% of patients with schizophrenia can support themselves 
through competitive employment, but approximately 70% can par- 
ticipate in productive activities with accommodations [72]. Patients 
who successfully manage their illness can serve as Certified Peer 
Specialists, trained to share their experiences and coping strategies 
with other patients, providing support with illness management, 
communication with clinicians, and overall health—maintaining 
hope and reducing internalized stigma [73]. 


Self-control 


Across the general population, there is a gradient of self-control— 
‘an effortful regulation of the self by the self’ [74]. Self-control, as- 
sessed in early childhood, predicts multiple health, relationship, and 
achievement outcomes in adult life [75, 76]. 

Those in the lowest quintile of self-control display more SUD 
and criminality and poorer mental and physical health than the re- 
mainder of the general population, and they utilize disproportionate 
amounts of society’s health care and criminal justice resources [75]. 

Those in the highest quintile are characterized by con- 
scientiousness—‘a large class of personality traits that include 
responsibility, industriousness, and orderliness’ [74]—which is as- 
sociated with higher incomes and occupational prestige, more stable 
and satisfying relationships, better health, and greater longevity [75]. 

A subset of patients with schizophrenia display enduring deficits 
in self-control, manifest in comorbid SUD, disruptive behaviours 
leading to hospitalization or incarceration, poor engagement with 
treatment, poor illness self-management, poor physical health, and 
accelerated mortality. 

Patients with schizophrenia appear more likely to mate [77, 78] 
and to cohabitate with [79] deviant individuals, including those 
with deficits in self-control. These pairings may result in offspring 
with schizophrenia and deficits in self-control and in environments 
in which high rates of adverse childhood experiences (ACEs) and 
adult trauma occur. 


Comorbid substance use disorder 


At FEP, at least 40% of patients use substances, predominantly mari- 
juana or alcohol. Approximately half of those using substances cease 
without specific interventions. Patients who cease substance use are 
more likely to have engaged families, and their outcomes are as good 
as those of FEP patients without substance use [80]. 

Patients who continue substance use despite FEP, now diagnosed 
with comorbid SUD, have higher rates of first-degree relatives with 
SUD than other FEP patients [81]. They are predominantly male and 
demonstrate impulsiveness, sensation-seeking, and behavioural dis- 
inhibition [82], similar to general population individuals with SUD, 
features that are antithetical to conscientiousness. They are more 
likely to display features of conduct disorder in childhood and anti- 
social personality disorder (ASPD) as adults [83]. Risk for violence 
is modestly increased in schizophrenia, but substantially increased 
with comorbid SUD and ASPD [84]. 

Patients with schizophrenia and SUD have more frequent inter- 
ruptions in antipsychotic medication, more severe positive psycho- 
pathology at cross-sectional assessments, higher rates of relapse 
and service utilization, lower rates of remission, and higher rates 
of completed suicide than patients with schizophrenia alone. Their 


all-cause mortality rate is substantially higher (SMR 8.46 versus 
3.61) than that of patients with schizophrenia alone [85]. 


Management and treatment of comorbid substance 
use disorder 


Patients with schizophrenia and deficits in self-control are not help- 
seeking. Engagement may be initiated during hospitalization or in- 
carceration, or through assertive outreach in the community. 

Sustained engagement is simplified if patients reside with the 
family (who should also be engaged and supported) or have a resi- 
dence (which may need to be provided as an initial priority). 

Non-compliance with prescribed oral medication is to be ex- 
pected. Some trials report reduced hospitalization or incarceration 
with LAI antipsychotic medication [86]. If compliance with oral 
medication and REMS procedures can be assured, clozapine may 
reduce substance use and positive psychopathology [87]. Trials with 
agents such as LAI naltrexone are needed. 

A small percentage of patients with schizophrenia and SUD 
will engage in intensive, team-based approaches, for example 
Behavioural Treatment for Substance Abuse in Severe and Persistent 
Mental Illness (BTSAS) offering small groups twice a week for 
6 months, motivational interviewing, monetary rewards for negative 
urine screens, and social skills training, that have been associated 
with a higher percentage of negative urine screens and fewer hospi- 
talizations than standard care [88]. 

ACT teams bring treatment to patients with schizophrenia and 
deficits in self-control who are not help-seeking. Multi-disciplinary, 
well-staffed teams go out and find (assertive outreach) patients 
who do not come to clinic appointments, are available around the 
clock for crisis support, and directly provide (as opposed to making 
referrals for which patients will not show up) a broad range of 
services— ‘helping with illness management, medication manage- 
ment, housing, finances, and anything else critical to the individual's 
community adjustment’ [89]. In clinical trials, patients assigned to 
ACT teams have fewer hospital days and better patient and family 
satisfaction (and connection) than patients in standard care. 

Incorporation of Certified Peer Specialists in ACT teams may im- 
prove the functioning of patients receiving ACT services and leads 
to fewer days of homelessness or hospitalization [90]. 


Adverse childhood experiences 


In the general population, ACEs (for example, physical or sexual 
abuse, observed domestic violence, parental mental illness, death or 
separation/divorce, or placement in foster care) are associated with 
SUD, depression, and poorer health and quality of life in adulthood 
[91]. Environments in which one ACE occurs may also offer others, 
and ‘accumulating levels of adversity often produce graded decre- 
ments in development and functioning’ [92]. 

Patients with schizophrenia report more ACEs than the general 
population, and more ACEs are associated with more SUD, home- 
lessness, incarceration, suicidal thoughts, and health problems (for 
example, HIV infection) [93]. 


Interventions to mitigate adverse childhood experiences 


Early nutritional [94] and psychosocial enrichment [95, 96] pro- 
grammes supporting mothers and children at genetic and/or envir- 
onmental risk may improve outcomes. 
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Resilience, ‘defined as a dynamic process encompassing positive 
adaptions (not merely the absence of pathology or dysfunction) 
within the context of significant adversity’ [97], may explain vari- 
ance in outcomes after ACEs. Early conceptualizations of resilience 
focused on traits within individuals presumed to impart hardiness. 
Later conceptualizations also highlight ‘affectional ties within the 
family and with extra-familial informal support systems’ [98] that 
can be protective and potentially restorative. Resilience is not a fixed 
set of attributes, but a dynamic process that can change in response 
to new challenges, opportunities, and emerging competencies. 
Resilience-enhancing interventions are discussed in Individual re- 
silience training, p. 675. 


Internalized stigma 


‘Stigma, directly translated, refers to a visible mark usually made by 
a pointed instrument. Current usage derives from the mark tattooed 
or branded into the skin of slaves or prisoners, denoting their un- 
desirable status. 

Internalized stigma, the degree to which patients accept that they 
are marked as less desirable, with less value or power, because of the 
presence of schizophrenia, is associated with greater feelings of de- 
pression, alienation, and perceived discrimination, and greater en- 
dorsement of unflattering mental illness stereotypes [99]. Clinicians 
may directly challenge internalized stigma through cognitive behav- 
ioural therapy [100]. 


Stigma resistance 


Stigma resistance, the degree to which patients reject derogatory 
self-assessments, is associated with higher self-esteem and reported 
empowerment, higher levels of hope and quality of life, and less 
depression [101]. Having social contacts (family and friends) and 
being in outpatient treatment—perceiving that one ‘matters’ to 
others—are associated with greater stigma resistance [102, 103]. 
Better cognitive functioning and higher levels of the person- 
ality trait of self-directedness are associated with greater stigma 
resistance. 

Stigma resistance and resilience overlap. Sustaining connected- 
ness with family and important others and providing the experience 
of productive activities may be important strategies for strength- 
ening stigma resistance. 


Internalized stigma in caregivers 


Caregivers of patients with schizophrenia report more internalized 
stigma than caregivers of patients with bipolar disorder or depres- 
sion. Earlier age of onset of psychosis and longer duration of illness 
(markers of the degree of enduring impairment) are associated with 
higher levels of internalized stigma in caregivers [104]. Higher levels 
of internalized stigma are associated with higher levels of psycho- 
logical morbidity in caregivers [105] and should be a target of family 
psychoeducation and support. 


Assignment of stigma by the public 


The public assigns stigma based upon diagnosis (the most severe, 
for example, schizophrenia, assigned the most stigma), judgements 
of the intensity of treatment needed (for example, hospitalization 
vs outpatient care), and judgements of the level of associated psy- 
chosocial disabilities [106]. Efforts to improve the public’s opinions 
of patients with schizophrenia have shown little effectiveness [107]. 


Non-psychiatric physicians responding to vignettes depicting 
various clinical populations expressed their least positive attitudes 
towards patients with schizophrenia and personality disorder and 
those classified as criminals. Mental health clinicians may preferen- 
tially refer patients with schizophrenia for non-psychiatric care to 
clinicians who have demonstrated more positive views of patients 
with schizophrenia. 


The right beginning: comprehensive FEP 
treatment programmes 


Comprehensive FEP treatment programmes are team-based and 
offer evidence-based pharmacological treatment, involvement of 
family, support for patients’ education/employment goals, and in- 
dividual psychosocial treatments, usually for a period of 2 years. 
Patients assigned to comprehensive FEP treatment programmes are 
more likely to remain in treatment, have less psychopathology and 
fewer hospitalizations, and have improved functioning, while re- 
ceiving lower doses of antipsychotic medications, relative to patients 
assigned to TAU (treatment as usual). They report better quality 
of life and satisfaction with treatment and are more likely to be in 
school or working. Their families report less burden than the fam- 
ilies of patients assigned to TAU [10]. 


Evidence-based pharmacological treatment 


Sustained remission of positive psychopathology is the goal of 
pharmacological treatment. LAI FL-APM may be considered in pa- 
tients with comorbid SUD, lack of insight, and/or no engaged family 
[108]. Use of the fewest medications, at the lowest effective doses, 
may offer treatment, with minimal or no side effects. 

Interruptions in antipsychotic medication that lead to relapse can 
be detected quickly by team members (including family), and anti- 
psychotic medication restarted. The duration of unmedicated re- 
lapse, not the occurrence, is associated with decline [8]. 

Monitoring of physical health and interventions to mitigate MRFs 
begins immediately in FEP treatment programmes [109]. Aerobic 
exercise may assist in managing weight and metabolic side effects 
and may benefit cognition and functioning [110]. 


Engagement of families 


Family members fear relapse [111] and struggle with their relative’s 
enduring deficits. They should be included in shared decision- 
making about medications. They should be encouraged to provide 
cognitive and motivational functioning that patients lack, for ex- 
ample, assisting with illness management and including patients in 
enjoyable activities. 

Family members are a resource that clinicians strengthen through 
knowledge and support through appreciation and encouragement. 
Real-world successes—the prevention of relapse, the achievement of 
productive activity by their relatives—may reduce their internalized 
stigma. 


Education/employment support 


Assistance with education/employment may be the most at- 
tractive aspect of FEP treatment programmes for patients. 
Returning to productive activity (school or work) provides pur- 
pose and supports self-esteem, and offers opportunities for social 
engagement. Work may bring patients and their families add- 
itional income. 
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Current approaches place patients into productive activity rapidly 
as psychosis is controlled, in the hope of forestalling dependency 
and demoralization [112]. With team support, many patients suc- 
cessfully manage the challenges of school and work. 


Individual resilience training 


Patients in individual resilience training (IRT) ‘matter—IRT fo- 
cuses on their strengths and the achievement of their individual 
goals in recovery. Education about the psychotic episode and as- 
sistance in processing that experience are provided, and patients are 
taught strategies to prevent relapse. Overall functioning and illness 
management are approached through psychiatric rehabilitation 
techniques. Self-stigma and residual psychopathology are addressed 
through cognitive behavioural therapy techniques. Additional mod- 
ules are available, as needed, to address SUD, traumatic experiences, 
and health maintenance. A training manual for IRT is available [113]. 


Comprehensive FEP treatment programmes-—terminable 
or interminable? 


At 5 years, that is, 3 years after 2-year FEP programmes have ended, 
the large differences between patients who participated and those 
relegated to TAU are attenuated, and at 10 years, the differences are 
undetectable [114]. Three- and 5-year treatment programmes pro- 
vide extended advantages for patients over TAU [115, 116]. 
Schizophrenia is a chronic illness for which the standard of care 
should be chronic comprehensive treatment to prevent progressive 
decline [117]. Risk for relapse continues, and relapse prevention 
must be sustained. Deficits endure, and accommodations to support 
functioning must be sustained. Traumatic life experiences and med- 
ical illnesses accrue. Family members grow old. The tasks of clin- 
icians are long and arduous but can be rewarded by assisting patients 
to sustain remission and experience productive and satisfying lives. 
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Phenomenology 
o 


Phenomenology precedes diagnosis. Phenomenology reflects not 
only observed signs and symptoms, but also the subjective experi- 
ence of the person with signs and symptoms. Further, phenomen- 
ology involves the interpretation of data obtained in the context of 
our current knowledge and prior theories. Based on phenomen- 
ology, this chapter analyses standard DSM and alternative non-DSM 
diagnostic approaches to mood disorders. 


DSM-based definitions 


The standard DSM-based definition for a depressive episode involves 
the occurrence of 2 weeks or more of depressive neurovegetative 
criteria, defined as the following: sleep alterations (insomnia or 
hypersomnia), appetite alterations (increased or decreased), dimin- 
ished interest or anhedonia, decreased concentration, low energy, 
guilt, psychomotor changes (agitation or retardation), and suicidal 
thoughts. Along with depressed mood, the presence of four of eight 
of these criteria for 2 weeks or longer represents the standard defin- 
ition of a DSM-5 ‘major depressive episode’ 

If no manic or hypomanic episodes in the past are identified, then 
the diagnosis of a current major depressive episode leads to a longi- 
tudinal diagnosis of ‘major depressive disorder’ (MDD), which will 
be discussed in more detail later. Subtypes in DSM-5 for MDD are 
atypical features (which represent mainly increased sleep and appe- 
tite, along with heightened mood reactivity), melancholic features 
(defined by no mood reactivity, along with marked psychomotor re- 
tardation and anhedonia), and psychotic features (the presence of 
delusions/hallucinations). 

Manic episodes are defined as euphoric or irritable mood with 
three or more of seven manic criteria, as follows: decreased need 
for sleep with increased energy, distractibility, grandiosity or in- 
flated self-esteem, flight of ideas or racing thoughts, increased talk- 
ativeness or pressured speech, increased goal-directed activities or 
psychomotor agitation, and impulsive behaviour (such as sexual 
impulsivity or spending sprees). If such symptoms are present for 1 
week or longer with notable functional impairment, a manic episode 





is diagnosed, leading to a DSM-5 diagnosis of type I bipolar dis- 
order. If such symptoms are present for at least 4 days, but without 
notable functional impairment, a hypomanic episode is diagnosed. 
If not a single manic episode had occurred ever, but only hypomanic 
episodes are present, along with at least one major depressive epi- 
sode, then the DSM-5 diagnosis of type II bipolar disorder is made. 

If manic symptoms occur for less than 4 days, or if other specific 
thresholds are not met for manic or hypomanic episodes, then the 
DSM-5 diagnosis for such bipolar-like conditions is ‘unspecified bi- 
polar disorder’. 

Manic episodes can be characterized by psychotic features (pres- 
ence of delusions/hallucinations). If psychotic features are present, 
then hypomania cannot be diagnosed (since such features involve 
notable impairment by definition). Similarly, if a patient is hospital- 
ized, irrespective of duration of manic symptoms, a manic episode is 
diagnosed, not a hypomanic episode, again because hospitalization 
involves notable functional impairment by definition. 

If manic or hypomanic episodes are caused by drugs (like anti- 
depressants), then the diagnosis of bipolar disorder is still made 
in DSM-5. This is an important change from DSM-IV where 
antidepressant-related mania/hypomania was viewed as an exclu- 
sion factor, which could not be used to diagnose bipolar disorder. 
Extensive research in the last two decades has shown that this DSM- 
IV exclusion is not supported scientifically and that mania/hypo- 
mania occur with antidepressants almost exclusively in persons who 
have bipolar illness, rather than in unipolar depression [1, 2]. 





Non-DSM-based definitions 


In a non-DSM based approach to diagnosis, beginning with the phe- 
nomenology of mood disorders, an initial observation is that these 
conditions are not primarily disorders of mood. The term ‘mood dis- 
orders’ is a misnomer, if interpreted, as is commonly the case, to mean 
that mood is central to diagnosis. In fact, mood is variable in the phe- 
nomenology of these conditions, and the most consistent clinical fea- 
tures for diagnosis are psychomotor changes. One can have depression 
without any sad mood at all; instead anhedonia is present. One can 
have mania without any euphoric mood at all; instead irritability is pre- 
sent. Classically, depression was characterized by the slowing down of 
ones thoughts, feelings, and movements. In other words, depression 





SECTION 10 Mood disorders 


involved psychomotor retardation. Classically, mania was character- 
ized by the speeding up of one’s thoughts, feelings, and movements. In 
other words, mania involved psychomotor activation. Thus, psycho- 
motor aspects can be seen as central to these conditions [3, 4]. 

It might be suggested therefore that these conditions should not 
be called mood disorders, a term which was instituted officially by 
DSM-III in 1980. The older term ‘affective’ disorder captures some of 
the larger context of the phenomenology of these conditions. Some 
have suggested another phrase, like ‘mood and movement disorders 
[5], be used instead to capture the importance of the motor aspects 
of these conditions. 

The term ‘disorder’ also can be questioned. It is another official 
term instituted by DSM-III in 1980, with the conscious intention of 
being ‘neutral’ as to aetiology. This neutrality, however, can come at 
the expense of clarity. The term ‘disorder’ is intentionally vague— 
some interpret it biologically to reflect a disease concept; others in- 
terpret it psychologically to reflect thought patterns or emotional 
experiences from the present or past; still others view it socially as 
reflecting medical interpretations of social conflict. The range of af- 
fective conditions is broad enough that it includes disease entities, 
as well as social problems and some forms of psychological devel- 
opmental changes. All these varieties of affective conditions are con- 
flated into one word with many meanings. 


Mixed states 


The most common approach to the description of the phenomen- 
ology of mood conditions contrasts depression and mania. While this 
clinical distinction can and should be made, it is not easy to make. It 
also is important to acknowledge that the phenomenology of mood 
conditions involves the frequent, if not usual, admixture of manic 
and depressive symptoms at the same time—called mixed states. In 
this scenario, a full-blown clinical depression may be present, but 
some manic symptoms may also be present such as rapid thoughts, 
increased sexual drive, and marked anger and mood lability. Thus, 
one can think of mood episodes on a spectrum from pure depression 
to pure mania, with many mixed states in between (Fig. 66.1). 

Another variety involves classic psychomotor retarded depression, 
lasting weeks to months, interspersed with very brief bursts of manic 
symptoms lasting hours to days. A third variety involves a full-blown 
manic episode, but with the presence of sad mood or suicidal thoughts 
or other potential depressive symptoms such as guilt or low self-esteem. 

When defined in this manner broadly, mixed states have been re- 
ported to be the most common presentation of mood states [6]. This 
broad definition includes concepts such as agitated depression and 
dysphoric mania. If defined in this way, mania and depression would 
be seen as uncommon pure variants of mood states, while the most 
common mixed states are a mixture of two. 
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Fig. 66.1 The mood episode spectrum. 


If this phenomenology is correct, then it has important impli- 
cations for the classification of diagnoses. The current diagnostic 
system is based completely on the distinction of depression and 
mania. If manic vs depressive states cannot be distinguished in most 
cases, then the current diagnostic system would be very difficult to 
implement validly. Put otherwise, the frequent presence of mixed 
states would invalidate the diagnostic system built on a mania vs 
depression dichotomy. 


Psychosis 


Classically, the key controversy in the phenomenology of mood dis- 
orders involved psychotic conditions like schizophrenia. This past 
debate thankfully has yielded to a gradual consensus, as the concep- 
tualization of schizophrenia has evolved to a chronic and constant 
disease of delusions or hallucinations. This view, which dates back to 
Kraepelin, still is challenged at times, though often with a misinter- 
pretation of Kraepelin’s view of manic-depressive illness (MDI). His 
original MDI concept involved the basic idea that psychotic symp- 
toms, if present, were not constant. They came and went. In contrast, 
psychotic symptoms were present in dementia praecox all the time. 
Kraepelin can be misinterpreted to hold that mood conditions re- 
cover completely forever. Then contemporary critics point to the fact 
that long-term symptoms are present in mood disorders, as in other 
chronic conditions like schizophrenia. In fact, the Kraepelinian view 
was simply that mood episodes are episodic, which means they have 
a beginning and an end. This episodicity does not imply that once 
an episode ends, there will never be any future episodes, that is, that 
the patient is ‘recovered’ forever. It implies that acute episodes end 
and temporary recovery occurs, unlike schizophrenia where acute 
exacerbations merge into chronic psychosis. Recurrence, not per- 
manent recovery, was the essential feature of MDI [7]. In sum, the 
distinction between mood vs psychotic conditions can be made 
well by focusing on recurrence vs chronicity of symptoms. Cross- 
sectionally, as Kraepelin always held, they can look the same. The 
clinical picture of schizoaffective illness raises the question of occa- 
sional overlap between psychotic and affective conditions, which we 
will discuss later in this chapter. 


Temperament 


The concept of affective temperaments dates back at least 200 years 
and was systematized about a century ago by Emil Kraepelin and 
Ernst Kretschmer. In Kraepelin’s texts, we read about ‘manic tem- 
peraments’ and ‘depressive temperaments’ [8]. Kretschmer de- 
veloped these concepts [9], which eventually led to the concepts 
of hyperthymia and dysthymia, respectively. The main idea is that 
manic and depressive states present not only in episodes of attacks 
of severe symptoms, but also as mild symptoms that are present all 
the time, as part of the baseline personality of an individual. Thus, 
temperaments can be defined as mild versions of mood states, but 
they go beyond that initial concept to include basic differences in 
personality traits and in energy levels, as expressed in sleep patterns 
and behaviours such as sexual and social or work-related activities. 
Hence, the following brief descriptions apply [10]: 

Hyperthymia involves a mild manic state as part of one’s basic 
temperament. Such persons are high in energy, need less sleep than 
most people (often 4-6 hours nightly), have high sex drives, and are 
highly social, extroverted, often workaholics, and frequently hu- 
morous. They are described as the life of the party, as fun-loving, 
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and can engage in risk-taking behaviours that others avoid such as 
skydiving or bungee-jumping or motorcycle or airplane flying. They 
dislike routine and are spontaneous. They can be quite anxious and 
inattentive. 

Dysthymia is the reverse, a mild depressive state as part of one’s 
basic temperament. Such persons are low in energy, need more sleep 
than most people (often 10-12 hours nightly), have low sex drives, 
and are socially anxious, introverted, low in work productivity, and 
not humorous. They avoid risk-taking behaviours, are devoted to 
routine, and can be obsessive. They can be quite anxious, but not 
usually inattentive. 

Cyclothymia involves constant alternation between mild manic 
and depressive states on a day-to-day, or a few days at time, basis. 
Such persons go up and down in mood and energy and activity levels, 
though they can be generally mostly extroverted and productive and 
social. They tend to be risk-takers at times and are unpredictable and 
spontaneous. They can be quite anxious and inattentive. 

In the original view of Kraepelin and Kretschmer, these tempera- 
ments are mild variations of MDI. They are not separate or inde- 
pendent diseases or disorders, as the DSM system sets them up. They 
are part of the same condition, just mild versions, ‘formes frustes’, as 
in the French term. This is no different than saying that mild adrenal 
insufficiency is related to, but not the same as, Addison’s disease or 
that mild hypothyroidism is related to, but not the same as, Graves’ 
disease. Affective temperaments are different in severity, but not in 
kind from severe depressive or manic illness. 

This perspective differs from the DSM-II]-based view of dysthymia 
and cyclothymia as ‘axis I conditions, rather than aspects of person- 
ality. Further successive DSM revisions have ignored hyperthymia 
completely. A useful resource is the TEMPS (Temperament 
Evaluation scale from Memphis, Pisa, and San Diego) scale, which 
is the most validated research scale to assess affective temperaments. 
As an aid to clinical diagnosis, a short self-report TEMPS scale can 
be invaluable [11]. 


Insight and the diagnostic process 


A major aspect to the diagnosis of affective illness is the psychopath- 
ology of insight. In general, insight is preserved in depression, and 
impaired in mania [12]. Specifically, about one-half of patients with 
severe mania and the majority of patients with hypomanic states 
deny having those symptoms despite the actual presence of those 
symptoms observed by researchers. Further, insight appears to have 
a U-shaped curve in relation to severity; it is most impaired in hypo- 
mania and in severe mania but may be more present in moderate 
states of mania. 

Since patients realize they are depressed but are often unaware 
that they are having hypomanic or manic states, they will present to 
clinicians complaining of the former and ignoring or denying the 
latter. Hence, clinicians may misdiagnose such patients as being de- 
pressed, rather than manic or mixed, or as having unipolar depres- 
sion, as opposed to bipolar illness. 

There are many reasons for the problem of lack of insight in mania. 
Part of the phenomenon probably is biological, with frontal or par- 
ietal function being affected in the manic state. Such lack of insight 
is known to occur in parietal lobe strokes and in association with 
frontal lobe injury, as well as other diseases such as Alzheimer’s de- 
mentia and schizophrenia [13]. In addition to this biological basis, 
insight may be further impaired because of the cultural impact of 


stigma against manic states, as well as simple lack of knowledge 
about such states. 

To address this problem of insight and misdiagnosis, it is im- 
portant to include family members and/or friends in the diagnostic 
interviews and even in treatment sessions, so as to get information 
from more than one source. While patients and family members re- 
port depressive symptoms equally, family members report manic 
symptoms twice as frequently as patients [14]. 

The patient, in short, is not a sufficient source of information 
to make diagnoses of affective illness. Confidentiality is not rele- 
vant, since at this stage the clinician is gathering information, not 
divulging it. It is a common clinical error to refuse to engage with 
family at the outset of a treatment relationship. This mistake, which 
is not needed to protect confidentiality, will lead to misdiagnosis in 
many patients. 

In addition to obtaining information from outside sources, clin- 
icians should use interviewing methods that take into account pa- 
tients’ natural inclination to deny manic symptoms. Generally 
speaking, it is best not to ask patients about manic criteria directly, 
but rather to ask open-ended questions that would capture those cri- 
teria. This way, the patient would report manic symptoms spontan- 
eously without realizing that they are doing so. An example might 
be: ‘Other than your depressive states, have you had periods of time 
where you felt the opposite?’ Patients tend to want to discuss their 
depressive states in detail. Clinicians can begin the interview by al- 
lowing such discussion and by going into detail to understand the 
features of the depressive states. Assessment of manic states should 
be left to later in the interview. Further, even if specific manic or 
hypomanic episodes are denied by patients, the clinician should 
then turn to family members present and ask them about the same 
symptoms. If all episodic symptoms are denied by both patient 
and family, then the likelihood of misdiagnosis of bipolar illness is 
lowered notably. 

Even then, clinicians should ask patients and family about their 
baseline temperament, to identify possible hyperthymia or cyclo- 
thymia. Again, open-ended questions are more likely to receive ac- 
curate answers: “Tell me what your personality is usually like, when 
you're not depressed? What’s your usual energy and activity level?’ 
Since ‘energy’ is a vague concept, it is useful to be specific about 
sleep—not just whether the patient feels it is adequate or not, but the 
actual time one goes to bed, the time one wakes up, and the quality 
of sleep during that time. Whether energy is high or low should not 
be based on the patient’s opinion alone, again, but also based on the 
observation of family and friends, and also compared to the norm of 
the patient's peers. 


Biological aspects of diagnosis 


It is stated often that there are no valid biological ‘markers’ for psy- 
chiatric diagnosis. This statement is oversimplified. In the case of 
affective conditions, there are some biological testing methods that 
can be relevant to a more accurate diagnosis. Among these, the 
most important method is assessment of genetics via family history, 
but also in various settings, important evidence is relevant from 
neuroimaging, EEG testing, and neuropsychological testing. 

Family history is very important in the diagnosis of mood con- 
ditions since bipolar illness is highly genetic, almost completely 
so based on twin studies, while DSM-defined MDD is much less 
genetic [15]. Further, other differential diagnostic conditions, like 
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personality disorder, also have much lower genetic heritabilities, 
based on a meta-analysis [16], than bipolar illness. Thus, a positive 
family history for bipolar illness is diagnostically important and 
rather specific for that condition. The absence of such family his- 
tory needs to be investigated, rather than simply being accepted as 
reflecting no such family history. This is the case because bipolar 
illness, and MDI in general, have been proven to be underdiagnosed 
for at least half a century or longer [17]. Thus, even if family members 
sought psychiatric or medical diagnosis, the statistical probability 
would be that bipolar or manic conditions would be diagnosed 
less frequently than would have been truly the case. Further, social 
stigma is high now, but was even greater in previous generations, 
and is greater in non-Western cultures, all of which may be reflected 
in the absence of family histories for mood conditions in prior gen- 
erations or in other cultures. 

Neurological evaluation can inform diagnosis, specifically EEG, 
neuroimaging, and neuropsychological testing. These tests tend to 
be informative for mood conditions in a way that is different than 
how they are used for neurological diagnoses. For instance, EEGs 
are often seen as within the normal range if there is no epileptiform 
activity, but spiking or slowing of EEG activity has been associated 
with mood conditions, depending on location and laterality [4]. In 
general, it has been found that right temporal abnormalities are asso- 
ciated with depression, and left temporal abnormalities with mania. 
Quantitative EEG may be more sensitive to such mild abnormalities, 
as compared to standard EEG [4]. On neuroimaging, depression has 
been associated with hippocampal atrophy, and mania with amyg- 
dala enlargement, among other changes [18]. Further, white matter 
infarcts, often dismissed as variants of normal, are common and spe- 
cific to vascular depression [19]. 

Perhaps the most common, and most misused, diagnostic test 
is neuropsychological testing. Misuse of such testing occurs when 
a cognitive function is identified immediately with a diagnosis; 
for example, attentional impairment is quantified and then ‘diag- 
nosed’ as attention deficit disorder (ADD). In fact, attentional im- 
pairment can occur because of either depression and mania [20]. 
Neuropsychological testing is a snapshot in time and thus reflects 
the current mood state, rather than the longitudinal diagnosis. It can 
provide corroborative evidence for mood diagnoses or other cog- 
nitive conditions though. For instance, learning disabilities or IQ 
abnormalities can be identified. The pattern of abnormalities can 
suggest right vs left hemisphere dysfunction, which, combined with 
quantitative EEG and/or MRI, may support a pure depressive vs a 
bipolar diagnosis, with a predominance of left hemispheric abnor- 
malities for manic or bipolar states, as opposed to right hemispheric 
abnormalities for pure depressive states. 


Classification 


Current classification of mood disorders 


The current classification of mood disorders is based on the 
Diagnostic and Statistical Manual of Mental Disorders, the fifth edi- 
tion of which (DSM-5) was published recently in 2013 [21]. Changes 
in DSM-5, compared to DSM-IV, were minimal for mood disorders. 
Specifically, for the diagnosis of a manic episode, a requirement for 
increased energy was added to the prior criterion of decreased need 


for sleep. This requirement is redundant because the entire concept 
of decreased need for sleep implies increased energy. Nonetheless, 
the energy criterion was made explicit. For MDD, no changes be- 
yond minor modifiers have been made since the basic criteria were 
set with DSM-III in 1980. 

The most important change in DSM-5 was not to the criteria 
for manic or depressive episodes, but rather to the modifier for 
‘mixed features. The change here was that the prior mood epi- 
sode definition for ‘mixed episodes’ was dropped, and the con- 
cept was moved to become a modifier for manic or depressive 
episodes. Thus, in DSM-5, mixed episodes are replaced with the 
terms ‘Bipolar disorder, current manic episode with mixed fea- 
tures’ or ‘Bipolar disorder, current depressive episode with mixed 
features’ and, importantly, ‘MDD, current major depressive epi- 
sode with mixed features. Thus, for the first time, a DSM revi- 
sion acknowledges the diagnostic relevance of manic symptoms 
in persons who do not have bipolar disorder, that is, MDD with 
mixed features, which will be discussed further. 

The DSM-5 concept of mixed features is defined as the presence 
of non-overlapping mood symptoms from the opposite mood states. 
Thus, a depressive episode would be said to have mixed features if 
multiple manic symptoms were present, excluding distractibility 
and irritability and agitation, which are considered ‘overlapping’ 
symptoms since they are part of the definition of a major depressive 
episode as well. The critique has been made that these overlapping 
symptoms are exactly those symptoms that are most common in 
mixed states; thus, excluding them would be like saying that one 
could not use the symptom of headache to diagnose migraine, 
since pain in the head overlaps with other non-migraine headache 
syndromes [22]. 


Critique of DSM-based classification 


Besides specific issues related to mood conditions, as described in 
this chapter, DSM-based classification, in general, has been subject 
to criticism in recent years. The critiques that may have the most sci- 
entific merit are as follows. 

DSM-based diagnosis is ‘pragmatic and is not based primarily 
on scientific studies. Leaders of DSM-III, IV, and 5 have been ex- 
plicit that certain changes based on abundant scientific data, such 
as dimensions of personality traits, would not be included in official 
DSM diagnoses, while other definitions with very limited scientific 
data, like dysphoric mood dysregulation disorder, were included. 
These changes and failures to make change were based on judge- 
ments termed ‘pragmatic, based on preferences of the psychiatric 
profession in the United States. This decision-making process may 
be utilitarian and socially needed, but it does indicate that scientific 
data are not the main sources of decision-making [23]. 

DSM-based diagnosis is overly conservative and resistant to 
change. The publication of DSM-5 in 2013 occurred with minor 
changes to DSM-IV overall (published in 1994), which also reflected 
little change from DSM-III (published in 1980). 

As a corollary, DSM-based diagnosis impedes scientific research 
and hence retards progress in the profession [24]. This observation 
led the US National Institute of Mental Health (NIMH) in 2013 to 
direct that no future funded research should use DSM-based cri- 
teria. Alternative criteria—the Research Domain Criteria (RDoC)— 
have been instituted [25]. 
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Non-DSM-based classification: manic-depressive illness 
and neurotic depression 


The critique described is a rationale for being open to non-DSM- 
based classifications for mood conditions. Three major non-DSM 
approaches involve the diagnostic groupings of MDI, neurotic de- 
pression, and affective temperaments. 


Manic-depressive illness 


The central non-DSM-based classification of mood conditions is the 
pre-DSM III concept of MDI or the post-DSM definitions of mood 
spectra. 

Manic depressive illness (MDI) was defined by Emil Kraepelin in 
1898 and later editions of his textbooks, and used as thus defined for 
about century, until DSM-III in 1980. The Kraepelinian definition of 
MDI was that any recurrent mood epsiodes of any kind, depressive 
OR manic, constituted the diagnosis of MDI [7, 8, 26]. This is not the 
same thing as bipolar disorder, which became official with DSM-III, 
which meant the presence of depressive AND manic episodes, not 
depressive OR manic episodes. In other words, MDI meant bipolar 
illness plus unipolar depressive illness. It represented what is defined 
by DSM terms as bipolar disorder and most of MDD. 

MDI was defined by recurrence of mood episodes as its diagnostic 
standard, not polarity of mood episodes. As long as recurrent epi- 
sodes occurred, that is, they came and went and were repeated, the 
diagnosis was MDI. It did not matter if the diagnosis reflected 100 
depressive episodes and zero manic episodes, or 100 manic episodes 
and zero depressive episodes. The diagnosis of MDI was the same in 
both cases. In other words, bipolar and unipolar courses were sub- 
types of the same illness, MDI, rather than two separate purported 
illnesses, as in the DSM system. 

One reason that Kraepelin de-emphasized polarity and focused 
on recurrent course of illness was his observation, along with others, 
that most mood states were neither purely depressive nor manic, but 
rather mixed. Kraepelin defined six different types of mixed states, 
building on Weygandt [27], as opposed to the two pure depressive 
or manic states. Mixed states were the most common presentation 
of MDI, and thus it was not possible to define the illness by polarity, 
since most mood states were not at one pole or the other. DSM-III 
tried to ignore this problem away by refusing to define mixed epi- 
sodes and by defining manic episodes very narrowly as being se- 
vere and lasting 1 week or longer. This definition, which conflicted 
with a large literature on mixed states and hypomanic episodes, was 
partially rectified by DSM-IV which accepted a definition of milder 
hypomanic episodes and introduced a definition of mixed episodes. 
However, hypomanic episodes were cut off at 4 days, without any sci- 
entific evidence for that cut-off duration, and mixed episodes were 
defined narrowly (requiring the presence of full syndromal mania 
and depression at the same time). DSM-5 has broadened the mixed 
definition somewhat, as noted, but it did not change the hypomania 
duration definition, despite the absence of evidence for the 4-day 
cut-off and the presence of some data supporting a shorter cut-off of 
2 days [28]. Further, the definition of mixed states proposed by DSM- 
5 has been criticized as being conceptual primarily (focused on the 
concept of ‘non-overlapping’ symptoms), as opposed to extensive 
empirical evidence that most mixed states are broader and charac- 
terized primarily by psychomotor excitation—like rapid thoughts, 
marked rage/irritability, agitation, and impulsivity—added to 


depressive features [22]. These empirically based mixed features are 
not part of the DSM-5 definition but are central to the non-DSM 
concept of mixed depressive and dysphoric manic states [29]. 

These DSM debates about duration of hypomania and definitions 
of mixed states are only relevant if the DSM distinction between bi- 
polar disorder and MDD is accepted. If the older concept of MDI is 
accepted, then DSM controversies over details of mood states would 
not change the MDI diagnosis in any case. 


Neurotic depression 


Besides MDI, the other major non-DSM classification in mood 
conditions is neurotic depression [30, 31]. This diagnosis was made 
commonly before DSM-III in 1980, and it was targeted specifically 
for exclusion from DSM-III [32]. It was defined, as with MDI and 
unlike DSM definitions, primarily by its course, rather than cross- 
sectional symptoms; neurotic depression reflected a mood condi- 
tion that was not severe, episodic, and recurrent—like MDI—but 
rather chronic, constant, but mild to moderate in severity. The mood 
symptoms were never manic, but rather depressed and anxious. 
Brief exacerbations could occur to higher severity, usually in the 
setting of psychosocial stress, but these periods of worsening were 
brief, usually days to weeks, followed by a return to a chronic base- 
line. In the United States, the terms ‘neurotic and psychotic’ were 
used in the mid-twentieth century loosely, based on psychoanalytic 
assumptions about the severity of repressed unconscious emotions. 
In Europe, the terms were used phenomenologically, with neurosis 
simply reflecting anxiety with some admixture of mild depressive 
symptoms, and psychosis reflecting the presence of delusions or 
hallucinations. 

There was a robust debate about the validity of neurotic depression 
as a diagnosis in the 1970s, especially in the UK, but also in the United 
States [33]. Since the term was used differently in the two nations, 
the debate was not the same exactly. In the UK, neurotic depression 
became identified with ‘exogenous’ or ‘reactive’ depression, while 
psychotic depression was defined as ‘endogenous. Since MDI was 
always seen as a biological, and thus ‘endogenous, disease, neurotic 
depression was seen in Europe as an entirely different kind of de- 
pressive condition than MDI. However, not all depressive episodes 
in MDI were psychotic, which led to the problem of how to differen- 
tiate non-psychotic depression in MDI (both bipolar and unipolar 
subtypes) from neurotic depression. In the UK debate, the focus was 
on the exogenous vs endogenous distinction, that is, on whether psy- 
chosocial stressors could be shown to differ in non-psychotic depres- 
sive states. As this factor was thrown into doubt based on research, 
the concept of neurotic depression was weakened accordingly. The 
definition of endogenous depression primarily based on the course 
of illness did not receive adequate attention though. 

In the United States, the concept of neurotic depression fell into 
disfavour as being Freudian, even though it did not have psycho- 
analytic roots in Europe. In the course of DSM-III debates and de- 
cisions, as now documented in new historical research [32], the 
evolution of neurotic depression was as follows. It was replaced 
initially by the term ‘minor’ depression, as opposed to ‘major’ de- 
pression which was to be reserved for the more severe mood states 
of MDI (renamed bipolar disorder and unipolar major depressive 
disorder). Psychoanalytic groups in the United States objected to the 
term ‘unipolar’ as being seen as ‘endogenous, and thus leading to 
likely drug, as opposed to psychotherapy, treatments. Those same 
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groups objected to the term ‘minor’ as likely to result in minor in- 
surance reimbursement. Thus, all the proposed terms were lumped 
eventually into the diagnosis of ‘major depressive disorder. There 
was still reluctance in the American clinical community to give 
up the term neurotic depression, so additional diagnoses were 
created or redefined—namely, generalized anxiety disorder and 
dysthymia—to compensate clinicians for giving up the concept of 
neurotic depression. 


Affective temperaments 


The third major non-DSM approach is the concept of affective 
temperaments, namely, dysthymia (constant mild depression), 
hyperthymia (constant mild mania), and cyclothymia (constant 
mild alternation between the two), as described previously. Using 
validated scale measures, about one-half of persons with mood con- 
ditions have an affective temperament [34]. Affective temperaments 
simply are milder versions of affective illnesses, as opposed to being 
categorically different conditions. A patient can have both unipolar 
depression and cyclothymic temperament, or bipolar illness and 
hyperthymic temperament. Indeed, genetic studies find that af- 
fective temperaments are quite frequent diagnoses in family mem- 
bers of persons with affective illnesses [35]. 

The concept of affective temperaments provides a dimensional 
approach to personality, with origins in affective illness, as opposed 
to the DSM-based categorical approach, with origins in psycho- 
analytic constructs. While the DSM revisions have recognized dys- 
thymia and cyclothymia, those conditions have not been viewed as 
personality constructs, but as ‘axis l disorders, equivalent to bipolar 
illness or unipolar depression. This approach is not how those con- 
ditions have been defined and used for over a century. Further, DSM 
revisions have excluded and ignored the condition of hyperthymic 
temperament. 

Like DSM-defined personality disorders, affective temperaments 
are present throughout life, persisting from childhood and adoles- 
cence. Thus, patients with cyclothymia have constant mood lability 
and impulsive sexuality, often interpreted as borderline personality, 
or constant problems with attention, often interpreted as attention- 
deficit/hyperactivity disorder (ADHD), as discussed in this chapter. 


A post-DSM classification: spectrum concepts 


After almost half a century of more research and clinical experience, 
one might ask how well DSM-based definitions of mood disorders 
have held up. 

The main question relates to the radical distinction between MDI, 
as a single broad disease, vs two different and separate bipolar and 
unipolar conditions. The bipolar/unipolar dichotomy was justified 
in the 1970s, leading up to DSM-III, based on differences in the 
classic diagnostic validators of course, genetics, biological markers, 
and treatment response [36]. In other words, the claim that the pres- 
ence or absence of manic episodes marked different diseases was 
supported by observations of differences in course, genetics, biology, 
and treatment between bipolar and unipolar groups. In particular, 
bipolar illness had an early age of onset (mean age of about 19 years vs 
about the late 20s for unipolar depression), brief depressive episodes 
(3 months or less, on average, in bipolar illness vs 6-12 months, on 
average, in unipolar depression), highly recurrent course (more fre- 
quent episodes in bipolar than unipolar illness, with rapid cycling, 
defined as four or more episodes yearly happening in about 25% of 


bipolar illness cases, but in <1% of unipolar depression cases), gen- 
etic specificity (manic episodes were found in families of persons 
with manic episodes, but not in families of persons with unipolar 
depression), and differential treatment (antidepressants for unipolar 
depression vs neuroleptics and lithium for mania) [26]. 

In the intervening half century, all of these distinctions have 
been weakened and/or refuted. Regarding the course of illness, the 
common diagnosis of MDD in children, far below the mean onset 
of the late 20s, conflicts completely with the rationale that bipolar 
illness and MDD differ on that course criterion. Brief depressive epi- 
sodes that occur multiple times yearly are diagnosed in patients with 
MDD commonly [37], whereas such course of illness should be rare 
if MDD was a different illness than bipolar disorder. Genetic studies 
have found high rates of depressive episodes, without mania, in per- 
sons with bipolar illness, and also frequent occurrence of bipolar 
illness in relatives of those with unipolar depression [35, 38, 39]. 
Treatment now overlaps considerably, with neuroleptic agents 
proven effective not only for mania, but also for depression, both in 
bipolar and unipolar types [40, 41]. Lithium has been well known 
to be effective not only for mania, but also for depression, both in 
bipolar and unipolar types [42, 43]. 

In short, the bipolar/unipolar distinction, now calcified in DSM- 
III through 5, has not been supported by further research in mood 
disorders and, in fact, research could be used to support the original 
MDI perspective of one broad mood illness. 

Another perspective taken on this research would be to main- 
tain the bipolar/unipolar dichotomy, but to recognize that it is not 
a strict distinction, but rather that there is one mood spectrum, 
with extremes of classic bipolar and unipolar cases, but with many 
cases in the middle of that spectrum. Some have called this con- 
cept a ‘bipolar spectrum [44] or a ‘manic-depressive spectrum’ 
(Fig. 66.2). 

One might also refer to an ‘MDD spectrum’ which captures the dif- 
ferent subtypes of depression described in this chapter, all of which 
were lumped together in DSM-III into one broad MDD category. 

Such views would incorporate both the original MDI concept 
and the later observations of bipolar and unipolar variations [45] 
(Fig. 66.3). 

The DSM approach to diagnosis generally is opposed to spectrum 
concepts, on a priori ‘pragmatic grounds of wanting to have a high 
threshold for psychiatric diagnosis to avoid ‘overdiagnosis’ [46]. 
Separate from the fact that this belief reflects stigma against psychi- 
atric illness, it is not based on empirical evidence, but rather reflects 
a conceptual assumption. For that reason, the scientific evidence 
supporting spectrum concepts in mood disorders was rejected in 
the DSM-5 process. 

Any future change likely will await a move away from the DSM 
process of diagnositic criteria towards something different, such as 
new research-based clinical criteria for diagnosis. 





Differential diagnosis 


The differential diagnosis of mood conditions involves distinctions 
within the mood dichotomy (bipolar vs unipolar) and distinc- 
tions with other conditions based on overlapping symptoms such 
as personality changes (borderline, antisocial, narcissitic person- 
ality disorders), delusions or hallucinations (schizophrenia and/or 
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Unspecified BP is DSM terminology for bipolar aspects of a case that do not 
meet mania or hypomania criteria, such as recurrent depressive episodes 
with underlying hyperthymia, or recurrent depressive episodes with a 
first-degree relative with bipolar illness. 


Fig. 66.2 The manic-depressive spectrum. 


schizoaffective illness), and cognition (ADD). An important aspect 
to approaching differential diagnosis is the concept of a diagnostic 
hierarchy, which has been rejected by DSM nosology. In the hier- 
archy approach, diagnoses with multiple symptoms should be diag- 
nosed preferentially to diagnoses with fewer symptoms [47]. Thus, 
pneumonia would be the only diagnosis in a case of fever with lung 
infection, as opposed to ‘fever disorder’ comorbidity. Since mood 
disorders cause personality changes, delusions, and cognitive ef- 
fects, they should be diagnosed, instead of assuming the presence 
of personality disorders, schizophrenia, and/or ADD. Further, the 
assumption of ‘comorbidity, which is encouraged by the DSM ap- 
proach, would be challenged by a hierarchy concept. Multiple diag- 
noses would not be made, unless other symptoms could be shown to 
exist, independent of the mood condition. 


Within the mood dichotomy: unipolar vs bipolar 


In the DSM polarity-based nosology of mood disorder, the differ- 
ential diagnosis of unipolar vs bipolar illness rests completely on 
the identification of manic or hypomanic episodes. Most patients 
present for treatment in the depressive phase; thus, the differen- 
tial diagnosis rests entirely on past manic/hypomanic episodes. As 
noted, patients deny having manic episodes about half the time, due 
to lack of insight. Such impaired insight is even worse in hypomania. 
Thus, in most cases, patients will deny manic/hypomanic episodes, 
leading to underdiagnosis of bipolar illness and overdiagnosis of 
MDD, which has been shown empirically [17]. Interviewing should 
include family and friends, who report manic symptoms more ac- 
curately than patients, to correct for this diagnostic bias. Patient self- 
report is not sufficient to rule out past manic/hypomanic episodes. 
A non-DSM-based approach would emphasize all diagnostic 
validators, including the course of illness and genetics, and not 
just symptoms, to differentiate bipolar vs unipolar depression. The 


depressive states themselves are somewhat different, with more mel- 
ancholia, mixed states, and delusions/hallucinations in bipolar than 
unipolar depression [48]. Family genetics of bipolar illness would 
argue against the legitimacy of diagnosing MDD in a depressed 
person. A course of illness with early age of onset (that is, 20 years 
or earlier), brief depressive episodes (3-6 months or less), and high 
recurrence (multiple depressive episodes yearly) all would argue for 
the diagnosis of bipolar illness and against unipolar depression [26]. 

The therapeutic relevance of this distinction is that antidepres- 
sants appear to be mostly ineffective in acute bipolar depression [49] 
and in prophylaxis [50]. They also can cause acute manic/hypomanic 
episodes [51] and, more importantly, have been shown to worsen 
the long-term course of bipolar illness in some subjects, especially 
those with a rapid-cycling course [52]. In rapid-cycling cases, ran- 
domized data show that more mood episodes, including depressive 
states, occur over time with antidepressants than if those agents are 
discontinued or foregone altogether [53]. 


Personality disorders 
Borderline, narcissistic, and antisocial personality disorders 


Bipolar illness and borderline personality share some features and 
differ in others. Shared features include rapid mood swings, un- 
stable interpersonal relationships, impulsive sexual behaviour, and 
suicidality. DSM-III through 5 criteria for borderline personality 
are met with the four criteria described, along with one more out of 
nine total criteria. Thus, using DSM definitions, most persons with 
bipolar illness would meet borderline criteria automatically. This ap- 
parent ‘comorbidity’ is questionable due to overlapping diagnostic 
definitions. As examined in a review [54], true comorbidity would 
be definable with the other five borderline criteria which are less 
common in bipolar illness, especially dissociative states and self- 
mutilations. If sexual trauma is added, which is not even one of the 
DSM criteria for borderline personality, then it has been shown that 
specific differentiation of bipolar illness from borderline personality 
is feasible. Further, the presence of bipolar family genetics would 
strongly argue for bipolar illness and against borderline personality, 
with the former being about twice as heritable genetically than the 
latter. 

When manic, or as part of hyperthymic or cyclothymic tempera- 
ment, persons with bipolar illness have high self-esteem, which can 
get reflected in lack of empathy towards others and a range of selfish 
behaviours. If seen through a Freudian and DSM-based lens, many 
clinicians misdiagnose such persons as having narcissistic person- 
ality disorder (NPD), characterized by a pervasive pattern of grandi- 
osity, a need for admiration, and low empathy. This approach would 
ignore the diagnostic validators that would support mood illness, 
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such as family bipolar genetics, and other manic symptoms that are 
irrelevant to NPD such as high energy levels and decreased need 
for sleep. It is relevant that NPD has been invalidated by research 
with diagnostic validators, being unable to be distinguished from 
multiple other personality disorders (that is, borderline, histrionic, 
dependent, antisocial) [55]. The DSM-5 Personality Disorders Task 
Force recommended the removal of NPD, but this recommendation 
was rejected by the APA Board of Trustees. Thus, the scientific legit- 
imacy of NPD as a diagnosis is questionable. 

In contrast, antisocial personality has been diagnosed and valid- 
ated for a century or more in psychiatry. Again, there is overlap with 
manic behaviours, which can be aggressive and impulsive, leading 
to legal troubles. If those behaviours are viewed solely from a social 
perspective, the antisocial diagnosis may seem relevant. Such an ap- 
proach would be corrected if other diagnostic validators were taken 
into account, again emphasizing family bipolar genetics, but also 
the course of illness with different behaviours in and out of mood 
episodes, as well as the presence of other manic symptoms that are 
irrelevant to antisocial personality such as high energy levels and 
decreased need for sleep. 


Schizophrenia/schizoaffective illness 


When delusions or hallucinations are present, the differential diag- 
nosis of mood conditions includes schizophrenia and schizoaffective 
illness. Delusions or hallucinations occur in about one-half of manic 
states and in about 10% of depressive states, somewhat more in bi- 
polar than unipolar illness [56]. When present, such symptoms are 
qualitatively similar to the delusions or hallucinations of schizo- 
phrenia and thus can be mistaken for the latter. Further, negative 
symptoms of schizophrenia like apathy, lack of affect, low energy, 
and social isolation can appear like symptoms of depression epi- 
sode. In both conditions, cognitive impairment is present. This mis- 
diagnosis was common throughout the twentieth century, when 
schizophrenia was diagnosed cross-sectionally by the presence of 
delusions/hallucinations (as Eugen Bleuler popularized), as opposed 
to longitudinally by a chronic declining course (as Emil Kraepelin 
taught). This course was key, in the Kraepelinian view—dementia 
praecox was chronic and MDI was episodic. The Bleulerian view was 
disproved by the seminal United States—U.K diagnostic project and 
other research, which led to the return to the Kraepelinian course 
criterion with DSM-III in 1980 [57]. 

Some recent genetics studies indicate common genetic aetiolo- 
gies for schizophrenia and affective illness, which some interpret as 
challenging the nosological dichotomy. Other genetic studies differ- 
entiate the two basic diagnostic groupings, as do other diagnostic 
validators such as course and neuroimaging [58, 59]. 

Some patients share features of both conditions, defined in the 
DSM system as schizoaffective disorder. Various studies of the 
classic diagnostic validators suggest that schizoaffective illness does 
not breed true and does not appear to represent a valid third psych- 
otic disease [60]. Instead, it likely is sometimes a more severe variant 
of affective illness and sometimes a less severe variant of schizo- 
phrenia. In some cases, it could represent the chance comorbidity of 
schizophrenia and affective illness in the same person. 


Attention deficit disorder 


Depressive and manic states are characterized by impaired 
concentration, as well as executive function impairment, and 


abnormal working and short-term memory [61]. These cogni- 
tive states are also present in ADD, leading to the diagnosis of 
ADD in persons who have manic and depressive symptoms. This 
misdiagnosis would be common, especially in hyperthymia and 
cyclothymia, where chronic problems with attention are present 
due to constant manic and depressive symptoms. Using the hier- 
archy concept, such persons would be diagnosed preferentially as 
having mood conditions; using DSM definitions, the comorbidity 
of both diagnoses would be asserted. In contrast to bipolar illness 
definitions, which have remained restrictive, DSM-5 broadened 
the diagnosis of childhood ADD, requiring fewer criteria to be 
met and extending the age of onset range from a maximum of 
6 years to 12 years. DSM-5 also formalized a definition of ADD 
in adulthood as well. 

Again differential diagnosis would be assisted by attention to all 
diagnostic validators, not just symptoms; thus, family genetics of bi- 
polar illness would argue against the ADD diagnosis, as would re- 
current depressive episodes. Since amphetamines are used to treat 
ADD and they have been associated with worsening of manic states 
[62], as well neurotoxicity in animal studies [63], the differential 
diagnosis has important practical implications. 
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Unipolar depressive disorder and bipolar disorder are common 
mental diseases that impose a very high societal burden in terms 
of morbidity, mortality, lost productivity, and costs [1]. The Global 
Burden of Disease study, which is a comprehensive assessment of 
disability and mortality from diseases and injuries from 1990 and 
projected to 2020, highlights the importance of mood disorders 
for the world. Using the measure of disability-adjusted life years 
(DALYs), the World health Organization (WHO) revealed that de- 
pressive disorder was the fourth leading cause of disease burden and 
bipolar disorder was the sixth leading cause in the world in 1990, 
and it was projected that, in the year 2020, depressive disorder would 
be the second leading cause of disease burden. In 2010, mental and 
substance abuse disorders accounted for 7% all DALYs worldwide, 
and within mental and substance abuse disorders, depressive dis- 
order accounted for 40% and bipolar disorder for 7% of DALYs [2]. 
Together, mood disorders burden society with the highest health 
care costs of all psychiatric and neurological disorders [3]. One 
DALY represents the loss of a healthy year of life and aggregates the 
years of life lived with disability (YLDs) with the years of life lost 
(YLLs) due to premature mortality. In 2010, depressive disorders 
were the second leading cause of YLDs due to population growth 
and ageing [4]. These estimations have contributed to shifting inter- 
national focus towards mood disorders as a leading cause of burden 
in its own right and also in comparison to more recognized physical 
disorders. 

Mood disorders have received considerable attention in psychi- 
atric epidemiology over the last 25 years. These received particular 
attention in the five-site United States National Institutes of Mental 
Health Epidemiologic Catchment Area Study (ECA), as well as in 
epidemiological studies in other countries around the world that 
used the ECA methodology. Mood disorders also received par- 
ticular attention in the National Comorbidity Survey (NCS) in 
the United States, in the National Psychiatric Morbidity Survey of 
Great Britain, and most recently in the World Mental Health Survey 
(WMH) across many countries. Thus, there are substantial data 
from around the world on the epidemiology of these disorders. 
In addition, many of the population-based case registers and twin 
registries have also paid particular interest in mood disorders, and 








some of these registers have the additional advantage of being able to 
consider genetic, as well as environmental, risk factors. 

Depressive disorder and bipolar disorder hold large similarities 
in pathogenesis, course of illness, and outcomes, and it is estimated 
that approximately 1% of patients convert from depressive disorder 
to bipolar disorder per year as patients develop hypomanic or manic 
episodes [5, 6], approximating 20-40% during the lifetime course. 




















gnostic issues 


While classical bipolar disorder, with episodes of euphoric mania 
interspersed with episodes of depression, is one of the clearest clin- 
ical syndromes in psychiatry, the boundaries of bipolar disorder 
remain contested. As case definition is central to epidemiology, all 
the contested boundaries of bipolar disorder could influence preva- 
lence rates and our understanding of risk factors. Some of the major 
boundary issues for bipolar disorder include the overlap of bipolar 
disorder with psychotic features with schizoaffective disorder and 
schizophrenia, and the overlap of bipolar disorder with unipolar 
depressive disorder when patients who present primarily with de- 
pression have brief or mild episodes of hypomania. There is also an 
overlap of bipolar disorder with apparent personality disorder, es- 
pecially Cluster B personality disorders such as borderline and nar- 
cissism, and the issue of when hyperthymic personality merges into 
bipolar disorder. When bipolar disorder is comorbid with alcohol or 
substance abuse, there are also important diagnostic issues. 
Another important issue in determining caseness of bipolar dis- 
order for epidemiological surveys is symptom pattern and duration. 
A number of the diagnostic instruments for assessing bipolarity in 
population surveys limit the questions on mania toa type ofsymptom 
profile characterized by euphoria, grandiosity, increased energy, and 
decreased sleep. Whether the commonly used epidemiology inter- 
views adequately detect those individuals who have manic episodes, 
characterized by irritability, anger, and activation, is very debatable. 
The other key diagnostic issue is what criteria are used to categorize 
the minimum duration for hypomania; are 4 days too long? Are even 
2 days too long? Furthermore, as insight is sometimes impaired in 
hypomania and mania, and as these are low-prevalence disorders, 
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the accuracy of case detection of bipolar disorders in populations 
remains an issue for further research. 


Prevalence 


Prior population studies, such as the ECA and its related cross- 
national studies and the NCS, reported that the lifetime prevalence 
of bipolar I disorder varied from 0.3% to 1.5% [7, 8]. More recent 
studies, such as the NCS replication study [9], reported data on both 
bipolar I and bipolar II disorders and found lifetime (and 12-month) 
prevalence estimates of 1.0% (0.6%) for bipolar I, 1.1% (0.8%) for bi- 
polar II, and 2.4% (1.4%) for subthreshold bipolar disorder. Broader 
definitions of mania/hypomania have resulted in lifetime prevalence 
rates increasing to about 4% or even higher [10]. 

In bipolar I disorder, the prevalence in males and females is 
similar. This is in contrast to the reasonably consistent female excess 
found in bipolar II disorder and depressive disorder. 

The incidence of bipolar I and II disorders is estimated to be 10- 
30 per 100,000 year (0.01-0.03% per year). 


Comorbidity 


In the NGS, all identified bipolar I individuals suffered from at least 
one, and often up to three or more, comorbid disorder. The most 
common comorbid disorders included the full range of anxiety dis- 
orders, alcohol and drug dependence, and antisocial behaviours. 

Alcohol and drug abuse and/or dependence are commonly 
comorbid with bipolar disorder. Clinical studies find that bipolar 
patients with comorbid substance dependence are less adherent to 
prescribed mood stabilizers and have more frequent hospital re- 
admissions. Stimulant abuse/dependence rates are especially in- 
creased in bipolar disorder. 

Individuals with bipolar disorder have the full range of anx- 
iety disorders, including phobias, panic disorder, and obsessive- 
compulsive disorder. 

A controversial area of comorbidity with bipolar disorder is that 
of childhood attention deficit disorder. Early studies have suggested 
high comorbidity between attention deficit disorder and child and 
adolescent mania/bipolar disorder [11, 12], whereas longitudinal 
data [13-15], as well as studies with more careful diagnostic sorting 
of symptoms [16], have revealed lower comorbidity and supported 
the differentiation of child and adolescent mania/bipolar disorder 
from hyperkinetic disorder/attention deficit disorder. As argued 
by Arnold and colleagues, ‘Indeed, if one automatically count such 
symptoms as hyperactivity and impaired attention towards both 
disorders without noting association with mood episodes, and es- 
pecially if one does not require episodicity for bipolar disorder it 
may artificially inflate the comorbidity rate’ [16]. In this way, lon- 
gitudinal results seem to confirm the observation that even though 
the symptoms may not be so different between mania and attention- 
deficit/hyperactivity disorder (ADHD), the clinical presentation 
and illness course with an episodic course in bipolar disorder and a 
more chronic course in ADHD differ between the disorders [15, 17]. 


Risk factors for bipolar disorders 


In considering the risk factors for bipolar disorder, it is useful to sep- 
arate risk factors into those that are risk factors for lifetime vulner- 
ability (for example, genetic factors) and those that are risk factors 
for the onset of an episode of depression or mania (for example, life 


events). Thus, in determining the risk factors for lifetime vulner- 
ability, genetic factors constitute the largest single risk factor. Bipolar 
disorder is one of the psychiatric disorders with the highest herit- 
ability at about 70-80%. 

However, if one is considering who is vulnerable to an episode 
of mania over the next 6 months, genetic factors will play a rela- 
tively smaller part and predictions may be best based on other fac- 
tors such as past history, childbirth, the presence of subsyndromal 
affective symptoms and cognitive deviances, being treated for de- 
pression with antidepressant medication, and the approach of spring 
or summer. 

Although organic factors, such as some types of central nervous 
system damage, are unusual risk factors in young adults, in late- 
onset bipolar disorder (age of onset more than 50 years), organic 
disease of the central nervous system is an increasing factor for the 
development of mania. 


Age of onset and course of illness 


Early-onset bipolar I disorder is more familial and is associated 
with a higher genetic load. Different population studies show 
that the mean age of onset of bipolar disorder varies from 17 to 
30 years, but there seems to be differences between Europe and 
the United States. European data suggest a mean age in the late 
twenties, whereas United States data suggest a mean age in the 
early twenties. Reasons for these substantial differences are not 
known but may include greater genetic loading due to a higher 
risk of affective illness in those who migrated from Europe to the 
United States, a higher rate of assortative mating in the United 
States, as well as various genetic mechanisms, including antici- 
pation and gene-environment interactions that may also differ 
in importance between the United States and Europe. Further, 
increasing overweight among children and adolescents has been 
suggested as a contributing factor being more prevalent in the 
United States. 

Nevertheless, as age of onset is not normally distributed, the mean 
is a slightly misleading variable, while the mean age of onset may be 
in the twenties; for a large proportion of patients, the age of onset are 
the teenage years. 

The majority of first episodes in bipolar I disorder are depressive; 
approximately 85% of patients have a depressive first episode, 10% 
a manic episode, and 3-5% a mixed episode. Typically, the duration 
of depressive episodes is between 2 and 5 months, although longer 
durations are often seen. The duration of manic episodes averages 
around 2 months. 

Long-term studies suggest that the vast majority of patients 
with bipolar I disorder, approximating 90-100%, will develop 
more episodes after the first manic episode. The course of illness is 
heterogenous—some will develop few episodes, and some many— 
but, on average, the risk of recurrence increases with the number 
of prior affective episodes, suggesting a clinical progression of the 
illness [18, 19]. Long-term studies also show that despite treatment, 
patients with bipolar disorder type I or II suffer from affective symp- 
toms of some severity approximately half of their lifetime. Over the 
course of illness, 80% of affective episodes are of the depressive type 
and 20% of the manic or mixed type. The presence of even milder 
affective symptoms is associated with decreased quality of life and 
functional and cognitive problems. 


Social, functional, and cognitive outcomes 


Approximately 40-60% of patients with bipolar disorder present 
with persistent cognitive deficits in the remitted state and across 
several cognitive domains, including attention, verbal learning, and 
executive function [20, 21]. Such cognitive deficits may be preva- 
lent even among healthy relatives of patients with bipolar disorder 
or early in the course of illness. There is robust evidence from sev- 
eral studies that patients’ persistent cognitive dysfunction is a key 
contributor to their socio-occupational disability, independent of 
mood symptoms [22-24]. Functional impairment is prevalent in 
bipolar disorder, with unemployment rates 4- and 10-fold higher 
than in the general population [25, 26]. Approximately two-thirds 
of the patients are unable to regain premorbid levels of social and 
vocational functioning following a single episode [25], and verbal 
memory and executive function seem moderately related to employ- 
ment outcome [27]. 

Patients with bipolar disorder have 2-3 times increased risk of 
developing dementia in the long run [28-32], and there are some 
suggestions that the risk of developing dementia may increase with 
the number of episodes in patients with depressive disorder and 
those with bipolar disorder [33]. Long-term treatment with lithium 
may decrease the risk of developing dementia [34-36], further sup- 
porting a link between bipolar disorder and dementia. 


Undiagnosed bipolar disorder and use of health services 


There is often an interval between the onset of mood episodes and 
seeking help (on average 8-10 years), and it may take a decade for 
a bipolar patient to receive both the correct diagnosis and be pre- 
scribed mood-stabilizing medication, resulting in an extended 
period of untreated illness [37]. 

Receiving a correct bipolar diagnosis at initial assessment is com- 
plicated by few depressed patients seeking help for hypomanic/ 
manic episodes and more seeking assistance only for their depressive 
episodes, while viewing any hypomanic/manic episodes as neither 
harmful or problematic nor particularly pathological. In combin- 
ation with assessment nuances, this leads to an estimated 20-40% 
of bipolar patients being misdiagnosed as having a unipolar depres- 
sive disorder and administered antidepressant medication that can 
worsen the longitudinal course of bipolar disorder, principally by 
causing more switching or more mixed states [37]. Further, the pro- 
portion of undiagnosed bipolar disorder in epidemiological samples 
diagnosed with depressive disorder is high, quantified as up to half 
of identified depressive cases, depending on the sensitivity of the cri- 
teria from which bipolar disorder can be accurately recognized, as 
well as practice and referral nuances [37]. 

In the ECA study, 39% of those with bipolar I or bipolar II dis- 
order received outpatient psychiatric treatment within 1 year and 
about 10% would receive inpatient treatment within a 6-month 
period. In the NCS study, 45% of those with bipolar disorder had 
received psychiatric treatment in the previous 12 months, although 
93% reported lifetime treatment for their bipolar disorder. However, 
both of these studies suggest that more than half of individuals with 
bipolar disorder are not currently in psychiatric treatment. 

Misdiagnosis or delayed help-seeking has been considered as a 
major contributor to poor outcome in bipolar disorder, with 80% 
of undiagnosed patients experiencing psychosocial problems 
impacting on their relationships, employment, finances, and health. 
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Further consequences of inappropriately treated bipolar disorder 
include an increased risk of suicide, longer hospitalizations, poorer 
social functioning, and general deterioration of the condition. 


Use of pharmacotherapy 


Major changes have taken place since 2000 in prescription patterns 
for bipolar disorder in mental care in Western countries in gen- 
eral [38]. Nation-wide and other population-based longitudinal 
pharmaco-epidemiological studies on prescription patterns have 
shown that lithium is prescribed less and anti-epileptics and dopa- 
mine antagonists are prescribed substantially more in recent years. 
In particular, the prescription of lamotrigine and quetiapine has in- 
creased substantially (from 0-3% in 2000 to approximately 40% of 
all patients in 2011 [38]). In many areas of the world, lithium has 
gone from being the first drug prescribed as maintenance therapy 
for bipolar disorder to being replaced by dopamine antagonists or 
anti-epileptics. The use of drugs for depression has been constantly 
high in Western countries, approximating 70-80% of all patients 
since 2000 [38, 39]. Combination therapy of lithium, anti-epileptics, 
drugs for psychosis (see Chapter 140), and also antidepressants has 
increased substantially during the last decade. 

Overall, the development in prescription patterns during the 
last decade has not been in accordance with the evidence and re- 
commendations in international guidelines in relation to the use of 
lithium [40-44] and antidepressants [43-47] in bipolar disorder. 
Changes in study populations do not seem to explain the changes 
in prescription pattern and use [38]. The high rates of combination 
therapy reflect that, in real life, patient response and remission rates 
are low with monotherapy for acute episodes and relapse rates are 
high during maintenance monotherapy therapy [48]. 


Adherence to treatment 


Pharmaco-epidemiological studies have consistently shown that ad- 
herence to medication is a major clinical challenge in the treatment 
of bipolar disorder. As much as half of the patients do not take medi- 
cation or take it in other durations and doses than prescribed, al- 
though randomized trials show that group-based psychoeducation 
can substantially add to improve adherence. 


Risk factors for affective episodes in patients 
with bipolar disorder 


A range of other biological factors are risk factors particularly rele- 
vant to the onset of episodes of illness, but they may contribute a 
relatively small part to lifetime vulnerability. Many women have 
their first episode of depression or mania in the post-partum period. 
There is substantial evidence that seasonal patterns influence the 
onset of manic and depressive episodes. There are consistent find- 
ings of an excess of manic episodes in late spring and early summer. 
To date, however, the nature of the environmental factors that in- 
fluence this late spring, early summer peak of manic episodes is 
less clear. 

There is also substantial evidence that disruptions of normal bio- 
logical rhythms may precipitate the onset of manic or depressive epi- 
sodes. This has been documented in relation to international travel 
involving east-west or west-east travel with disruption of circadian 
rhythms. Disruption of circadian rhythms through shift work or 
other factors which disrupt the normal sleep cycles may also be im- 
portant triggers to the onset of episodes of mania. 
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Adverse life events have been well documented to be precipitants 
of manic episodes, as well as depression. It appears that life events 
are more likely prior to the first or second episode of mania and are 
less likely later in the course of illness. 


Life expectancy, mortality, and causes of death 


Standardized mortality rates (SMRs) among patients with bipolar 
disorder have been found consistently to be increased 2-3 times, 
compared to the general population [49-51], and life expectancy has 
been reported to be decreased 8-12 years for patients with bipolar 
disorder, compared to the general population [52]. The increased 
mortality is related to increased mortality due to suicide [51, 53], un- 
intentional injuries [49, 51], and comorbid general medical illnesses, 
including cardiovascular disease [49-51, 54], diabetes [51, 55, 56], 
and chronic obstructive pulmonary disease [51, 57]. 

The SMR for completed suicide is estimated to be between 10 and 
15 [58]. The lifetime risk of suicide may have been overestimated 
in some prior studies, due to incomplete follow-up in selected 
samples or rather short-term follow-up of patients with first-time 
treated mental disorders [59]. The absolute risk of committing 
suicide within 36 years after the first psychiatric contact has been 
estimated to be 8% for men and 5% for women [59]. Suicide is a 
prevalent reason for lost life years specifically during adolescence 
and early and mid adulthood, and that risk clearly decreases with 
age [60]. Among 15-year-old patients with bipolar disorder, suicide 
accounted for 23-24% of all lost life years, decreasing to 0-5% for 
patients aged 75 years [60]. However, rather surprisingly, natural 
causes of death is the most prevalent reason for lost life years al- 
ready from adolescence and increased substantially during early and 
mid adulthood, that is, from age of 15 years and during the twenties, 
thirties, and forties [60]. For 15-year-old boys with bipolar disorder, 
natural causes of death have been found to account for 58% of 12 lost 
life years, and for 15-year-old girls, natural causes account for 67% of 
9 lost life years, increasing to 74% and 80% for 45-year-old men and 
women, respectively [60]. 

Reasons for increased mortality due to comorbid general medical 
illness in bipolar disorder may include unhealthy lifestyle, decreased 
health care, adverse pharmacological effects, and biological factors 
[61]. In relation to biological factors, accelerated ageing has recently 
been proposed as a mechanism explaining the increased prevalence 
of comorbid medical illness due to disparate pathophysiological 
changes observed in bipolar disorder such as brain structural al- 
terations, cognitive deficits, oxidative stress imbalance, amyloid 
metabolism, immunological deregulation, immunosenescence, 
neurotrophic deficiencies, and telomere shortening [62]. It is well 
established that, in addition to being differentially affected by 
chronic medical disorders, the age of onset of medical chronic dis- 
orders is much earlier, when compared to individuals in the general 
population [63]. 


Depressive disorders 


Diagnostic issues 


A key issue for the epidemiology of depressive disorders is defining 
the boundaries of major depression and dysthymia. Depressive 
symptoms in the community are common, and defining both the 
symptom count and the duration at which depressive symptoms 


count as part ofa clinical disorder is arbitrary. Provided that one ac- 
cepts the arbitrary definition of major depression, then determining 
the rates of current depressive disorders is not especially problem- 
atic. However, there are major methodological issues involved in 
determining whether an individual has ever had a lifetime episode 
of major depression. Lifetime prevalence rates vary from 4.4% in 
the United States ECA study to 17.1% in the NCS and to over 30% 
in the Kendler’s Virginia twin sample of women. In part, subjects in 
the community may forget or fail to report past episodes of major 
depression (recall bias), and the manner in which the questions 
are asked may importantly influence lifetime rates of depression. 
In the diagnostic interview schedule, which was used in the ECA, 
respondents were asked about lifetime symptoms, a lifetime diag- 
nosis was made, and then recency of the lifetime diagnosis was de- 
termined. Diagnostic interview schedules, such as the Composite 
International Diagnostic Interview, first ask about current depres- 
sive symptoms and then, having ‘primed’ individuals about depres- 
sive symptoms, go on to enquire about past depressive episodes. 
Interviews that follow the schedule of ‘priming’ before asking about 
past episodes appear to obtain considerably higher rates of lifetime 
major depression. Determining lifetime rates of depression with 
greater precision is an important task, as the vulnerability to depres- 
sion conferred by risk factors such as genetic factors and childhood 
experiences may be wrongly estimated if lifetime rates of major de- 
pression are imprecise. 

DSM-5 allows major depression to be further sub-classified into 
subtypes, such as melancholia, atypical, psychotic, and by severity 
and recurrence. Most of the traditional epidemiology studies have 
tended to ignore the issue of subtyping major depression. 


Prevalence 


In the ECA, the 6-month prevalence of major depression across five 
sites was 2.2 per 100. In the NCS, the 1-month prevalence of a major 
depressive episode was 6.1 per 100 [64]. In the National Psychiatric 
Morbidity Surveys of Great Britain, the 1-week rate of a depres- 
sive episode was 2.1 per 100 [65], and in the European Study of the 
Epidemiology of Mental Disorders, the 1-year prevalence of major 
depressive disorder was 3.9% [66]. Together, these studies would sug- 
gest that the current rate of major depression is in the realm of 2-5%. 

Estimates of the lifetime rate of major depression are much more 
variable. The lowest rate reported is 4.4 per 100 from the ECA study, 
while in the study of Virginia twins, the lifetime rate of major de- 
pression is over 30%. It is reasonable to believe that the true lifetime 
rate of major depression is probably in the realm of 10-20 per 100, 
but caution should be exercised in expressing lifetime rates of de- 
pression with undue precision. 

Over the past decade, one of the controversial findings in the epi- 
demiology of major depression has been whether the rates of de- 
pression are increasing and whether it is occurring at a younger age. 
Despite methodological concerns about the reliability of lifetime 
major depression, studies across countries have reasonably consist- 
ently documented an increasing rate of major depression with an 
earlier age of onset [67, 68]. As mood disorders are the single largest 
risk factor for suicide, it is also of note that, in most Western coun- 
tries, the rate of suicide, especially in young adults, increased con- 
siderably from the 1970s to the 1990s, although the suicide rate is 
now declining in many countries. This could, however, reflect better 
recognition and treatment of depression. 


Comorbidity 


Comorbidity is prevalent in depressive disorder. Not surprisingly, 
the most common comorbid disorders are anxiety disorders and 
substance abuse disorders. In the NCS, the anxiety disorders with 
the highest odds ratios indicating comorbidity were generalized 
anxiety disorder, panic disorder, and post-traumatic stress disorder. 
It is also important to note that for most anxiety disorders, with the 
exception of panic disorder, the anxiety disorder usually predates 
the onset of the depressive disorder. This is of considerable import- 
ance, as the risk factors for pure major depression differed from the 
risk factors for comorbid major depression. Furthermore, the cohort 
effects of increasing rates of major depression were largely attribut- 
able to increasing rates of comorbid major depression, rather than 
to increasing rates of pure major depression. These results raise im- 
portant issues for prevention, as it may well be that targeting young 
people with anxiety disorders could be a major step towards preven- 
tion of the development of later major depressive disorders. 

The second key area of comorbidity with major depression is with 
alcohol dependence. Data from the Virginia Twin Register suggest 
that part of this comorbidity is due to shared genetic factors, al- 
though there is also a smaller common environmental risk factor to 
both disorders. 

Another area of considerable comorbidity with depressive dis- 
order is the personality disorders. Comorbidity between depression 
and these disorders is receiving considerable attention in clinical 
samples, but to date, there are only limited data from epidemio- 
logical samples on the importance of these patterns of comorbidity. 


Risk factors 
Genetics 


There is substantial evidence that genetic factors are of major im- 
portance as risk factors for vulnerability to major depression, with 
a heritability at about 40%. Genes for major depression do not ap- 
pear to be unique for depression but overlap with genes for anxiety 
and genes for neuroticism. The greater prevalence of depression in 
women may be due to the strong association of anxiety and neur- 
oticism with depression and the fact that higher rates of anxiety and 
neuroticism in women lead to higher rates of depression. 


Stressful life events and childhood adversities 


Although the experience of a stressful life event is a well-known risk 
factor for developing depression, the associations are complicated. 
Firstly, causation may be unclear, as life events may cause depression 
and depression may result in stressful life events. Further, the type 
and timing of stressful life events play a role, including the experi- 
ence of acute vs more chronic stressors and the experience during 
childhood and the adolescent period or during different age spans 
in adulthood. Secondly, the impact of life events differs in relation 
to the onset of the first depressive episode and to subsequent de- 
pressive episodes. Approximately 60-80% of patients with a first and 
single depressive episode have experienced stressful life events in a 
6-month period prior to the onset of depression, whereas the preva- 
lence of life events is decreased to 20-40% or less in relation to sub- 
sequent depressive episodes. Finally, the experience of life events is 
highly individual and may be stressful for one person or life situation 
but positive for another. Overall, assessment of life events based on 
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self-report may be influenced by recall bias and those based on 
interviews may be influenced by observer bias and interpretation. 


Gene-environment interaction 


In 2003, Caspi et al. showed that risk for depression was related to an 
interaction between the serotonin transporter-linked polymorphic 
region (5-HTTLPR) and stressful life events, with persons carrying 
the short, less transcriptionally active 5-HTTLPR allele more vulner- 
able to depression after stressful life events [69]. The study sparked 
research into interactions between candidate genes and environ- 
mental exposures on risk for psychiatric disorders, but although this 
field of research has now persisted for more than a decade, it has been 
characterized by much controversy. Nevertheless, studies identifying 
stress using objective measures, such as specific medical conditions, 
or observer-based measures, such as in-person interviews, seem to 
have found stronger evidence for a gene-environment interaction 
than studies measuring stress by self-report [70]. 


Gender 


One of the most consistent findings in the epidemiology of major de- 
pression is that the ratio of women to men is approximately 2:1. This 
increased rate of major depression in women arises during puberty, 
as in childhood, there is a slightly higher prevalence of depression 
in boys than girls. The timing of this transition in rates by gender 
is related to biological puberty, rather than just to age. Intriguingly, 
a new population-based cohort study found that use of hormonal 
contraception among adolescent women is associated with an in- 
creased risk of subsequent development of depression, suggesting 
depression as a potential side effect to hormonal contraceptive use 
on a population level [71]. 


Personality 


There has been a long history of interest in the likelihood that people 
with certain personality traits are more vulnerable to depression than 
others. It is likely that those individuals who are unduly anxious, 
impulsive, and obsessional may have increased rates of later major 
depression. The best data exist for neuroticism, which emerges as a 
clear risk factor for the later development of depressive and anxiety 
disorders. However, as already mentioned, the same genes seem to 
contribute to the development of neuroticism and to later anxiety 
and depressive disorders. 


Physical illness 


Having a chronic or severe physical illness is associated with an in- 
creased risk for depression. The mechanisms behind this increased 
risk may vary, depending upon the physical disorder. In disorders 
such as Parkinson's disease, it is possible that there are shared 
neurotransmitter abnormalities between Parkinsons disease and 
depression. In post-stroke depression, there is good evidence that 
the location of the lesion, at least in part, contributes to the rate of 
depression, which suggests a neuroanatomical/neurotransmitter 
connection between the physical illness and the likelihood of de- 
pression. For non-central nervous system disorders, such as acute 
myocardial infarction, diabetes, and cancers, the mechanism for this 
association is less clear. Further, stress associated with a serious or 
chronic physical illness may act by bringing out an individual’s life- 
time vulnerability to depression. 
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An integrated aetiological model 


The ultimate purpose of studying risk factors for depressive dis- 
orders is to contribute to the development of an integrated aetio- 
logical model. The most promising research in this area has been 
performed by Kendler and colleagues on twins from the Virginia 
Twin Register [72] who developed a model that predicted over 50% 
of the variance in the liability to develop major depression in the next 
12 months. The strongest predictors to depression were as follows: 


e Stressful life events. 

e Genetic factors. 

e Previous history of major depression. 
e Neuroticism. 


Approximately 60% of the effect of genetic factors on the liability 
to depression was direct, but the remaining 40% was indirect and 
largely mediated by past episodes of depression, stressful life events, 
and neuroticism. 





Use of health services and untreated depression 


One of the major challenges for psychiatry presented by epidemio- 
logical studies of depression has been the consistent finding that the 
majority of cases of depression in the community are not recognized, 
diagnosed, or treated. In the ECA study, it was found that 65-70% 
of people with depression had visited a health professional in the last 
6 months, but only 15-20% had had a visit for a mental health reason 
and only about 10% had seen a mental health specialist. Among in- 
dividuals with a 12-month diagnosis of pure major depression in the 
European Study of the Epidemiology of Mental Disorders, only 21.2% 
had received any antidepressant treatment within the same period; the 
exclusive use of approved drugs for depression was even lower (4.6%), 
while more individuals took only drugs for anxiety (18.4%) [73]. 

Patients with depression who present with largely somatic, rather 
than psychological, symptoms are unlikely to be recognized by gen- 
eral practitioners. Even if major depression is recognized in the pri- 
mary care setting, it is often not adequately treated. 

The duration of untreated illness (DUI), defined as the time interval 
between first onset of the illness and initiation of (pharmacological) 
treatment, has increasingly been considered to influence the clinical 
course of different psychiatric disorders. Knowledge of the poten- 
tial consequences of postponing medical treatment of depression is 
of crucial importance, when the choice stands between a drug for 
depression, other therapies, or watchful waiting. Unfortunately, the 
effects of DUI have been investigated only scarcely in first-episode 
depression. The largest study found that initiation of appropriate 
drug treatment more than 6 months after the onset of first-episode 
depression reduces the chance of obtaining remission to 50% [74]. 


Use of pharmacotherapy 


Non-adherence to medication is a major challenge concerning up 
to 50% of patients and is associated with an increased risk of relapse 
and hospitalization. 


Age of onset and course of illness 


Age of onset is later for depressive disorder than for bipolar disorder, 
with a mean age of around 30 years—however, with a substantial 


proportion of patients having age at onset of below 21 years. The 
distribution of age of onset is bimodal, with peaks in the thirties and 
fifties. 

Length of episodes is typically 3-12 months (median duration of 
5 months), but 10-20% of patients experience depressive periods of 
longer than 2 years (that is, a chronic depressive episode). Results 
from up to 25 years of follow-up studies show that approximately 
30-50% of patients remain symptom-free after a depressive episode, 
while the remaining 50-70% develop new episodes or chronic de- 
pression. In a recent 5-year follow-up of patients with psychiatric 
hospital contact for the first lifetime episode of depression, 83% 
obtained remission and among these, 32% experienced recurrence 
of depression and 9% converted to bipolar disorder [6]. 

Approximately 1% convert from unipolar depressive disorder to 
bipolar disorder per year [5, 6], approximating 20-40% during the 
lifetime course. 


Social and cognitive outcomes 


Dysfunctions in social interactions remain persistent several years 
after recovery from depressive symptoms and are correlated with 
unemployment, disability, and decreased work performance. Within 
5 years, 20% of psychiatric inpatients and outpatients with major de- 
pression and belonging to the labour force at baseline in Finland 
were granted a disability pension [75]. 

Cognitive dysfunction in attention, memory, and executive func- 
tion is less prevalent in the remitted state of depressive disorder than 
in bipolar disorder and schizophrenia. However, self-reported atten- 
tion and concentration difficulties mediate as much as 25% of the 
impact of depression on patients’ psychosocial impairment. Long- 
term studies rather consistently show that depressive disorder is 
associated with a 2-3 times increased risk of developing dementia 
eventually (28-30, 32, 76]. 


Life expectancy and mortality 


The SMR for completed suicide is estimated to be increased 20-fold 
[58]. In the longest population-based follow-up study, a 36-year 
study after the first psychiatric hospital contact, the absolute risk of 
committing suicide was 7% for men and 4% for women [59]. 

Life expectancy is decreased approximately 12 years and 10 years 
in men and women, respectively [77]. SMR is highest for death 
due to suicide and accidents, but the absolute number of deaths is 
highest for natural causes. 

As bipolar and depressive disorders are associated with long- 
term deficits in social, functional, and cognitive outcomes and 
a substantially decreased life expectancy, there is a need for fu- 
ture studies on early and lifestyle interventions. A recent random- 
ized trial has shown that early intervention, combining optimized 
pharmacological treatment and psychoeducation, following the 
onset of bipolar disorder substantially improves the outcome of the 
bipolar illness [78] and specifically so among young adults below 
25 years of age [79]. 
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Mood disorders, both depressive and bipolar, are seemingly ubiqui- 
tous in modern-day society and, once established, cannot be cured. 
Research efforts in the latter half of the last century focused largely 
on diagnosis and optimal treatment of mood disorders, with the im- 
plicit aim of illness containment, rather than its complete removal. 
But even symptomatic management of mood disorders has been 
less than optimal. Part of the reason for this is that there has been 
a widespread lack of consensus regarding whether psychiatric dis- 
orders are discrete entities or lie on a continuum defined by dimen- 
sional constructs, and though the key features that characterize the 
disorders have been reliably identified, the construction of a mean- 
ingful hierarchy has eluded both researchers and clinicians alike. 
Apart from creating difficulties in diagnosis, this has meant that 
defining targets for treatment, clinically and for research purposes, 
has proven to be challenging. It remains unclear, for example, which 
symptoms or clusters of symptoms are most closely coupled with 
functional recovery and which symptoms are most likely to herald a 
recurrence or re-emergence of the illness. Therefore, treatment has 
generally been symptom-focused and its main goal has been to limit 
acute episodes of illness and to return the individual to wellness. 
Only relatively recently has there been a shift towards longer-term 
prophylaxis and maintenance of well-being, with a growing em- 
phasis on prevention. Clearly, prevention is the ultimate and ideal 
goal, namely, to intercept the illness before it takes hold by stop- 
ping processes that lead to the emergence of mood disorders. But 
to achieve this, a deeper understanding of the pathophysiology of 
mood disorders is necessary. Sadly, our current knowledge of how 
mood disorders evolve is incomplete and insufficient for this pur- 
pose. Nevertheless, attempts at prevention have begun in earnest, 
and emerging evidence suggests that, when such efforts are suitably 
targeted, reasonable benefits can be achieved, in the short term at 
least. Whether these outcomes can be sustained, and for how long, 
is the key question—the answer to which remains elusive. Therefore, 
this chapter briefly discusses some models of prevention, the types 


of interventions and treatments that are currently being trialled, and 
where future insights in this field might lead us. 





By their very nature, mood disorders distress the affected indi- 
vidual, but they also impact their families, friends, peers, colleagues, 
and the broader community. This makes mood disorders a tremen- 
dous burden to bear. Among mental disorders, the highest global 
burden, as explained by disability-adjusted life years (DALYs), 
that is, the cumulative number of years of productive life lost to 
disability, ill health, or early death, occurs in adolescents and the 
young to middle-aged and is associated with depressive and anx- 
iety disorders, while the burden from bipolar disorders is ranked 
fourth overall [1, 2]. This ‘quantitative’ evidence underscores the 
need to avert or reduce the qualitatively palpable burden exerted 
by these disorders at an earlier stage, and it has been suggested that 
evidence-based interventions may be capable of doing just this [3]. 
But presently, and somewhat alarmingly, based on the WHO pro- 
jections for 2030, this burden is not being diminished [4]. So what 
are the impediments to reducing this burden, and how can these 
barriers be overcome? 





Although gaps in the scientific knowledge base regarding preven- 
tion and treatment of mood disorders are gradually narrowing, 
puzzlingly, the actual burden is not diminishing at a commen- 
surate rate. This is attributable to a number of factors, including 
the conflicting and disparate priorities of diverse stakeholders who 
collectively shoulder the burden of mood disorders. Consequently, 
barriers exist at all levels—from individuals and families to com- 
munities, societies, and organizations. To overcome this, an inte- 
grated approach is needed, one that targets modifiable barriers by 
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Table 68.1 Modifiable barriers that hinder efforts to reduce 
the burden of mood disorders, and potential strategies 
to overcome them 


Delayed diagnosis’ 1. Early identification of risk and protective 
factors 
2. Early detection and diagnosis 


Delayed initiation of Use of early intervention strategies (mainly 


treatment” those that have the potential to cause little 
harm) 
Stigmatization (constrains 1. Psychoeducation (factual information 


uptake of services) regarding the nature, causes, risks, and 
protective factors of mood disorders) 

2. Awareness campaigns to enhance attitude 
and behaviour change 


3. Supportive community engagement 


campaigns 
Uneven distribution of 1. Clarity of policies governing resource 
funding allocation 


2. Availability of resources for training needs 


Paucity of evidence-based 
interventions and treatments 


Translating research from the laboratory to the 
real world 


“Especially likely with bipolar disorder where the illness usually manifests with 
depression and is therefore misdiagnosed [41]. 

“A tentative approach to intervention (especially the prescription of medications) is 
understandable where the diagnosis may be speculative. 

Source: data from Ment Health Serv Res, 4(4), Bruce ML, Wells KB, Miranda J, et al., 
Barriers to reducing burden of affective disorders, pp. 187-197, Copyright (2002), 
Plenum Publishing Corporation; Psychiatr Serv, 63(10), Corrigan PW, Morris SB, Michaels 
PJ, et al., Challenging the Public Stigma of Mental Illness: A Meta-Analysis of Outcome 
Studies, pp. 963-973, Copyright (2012); American Psychiatric Association Publishing; 
Biol Psychiatry, 52(6), Wells KB, Miranda J, Bauer MS, et al., Overcoming barriers to 
reducing the burden of affective disorders, pp. 655-675, Copyright (2002), Society of 
Biological Psychiatry. 


taking into consideration contributions from the various stake- 
holders (Table 68.1) [5-7]. 

It is estimated that primary care interventions have the potential 
to reduce the burden of mood disorders by 10-30%, but because of a 
delay in diagnosis and the subsequent initiation of treatment, this re- 
duction may not be sufficient [8]. Another barrier—stigmatization— 
which is largely based on fear, embarrassment, little or insufficient 
knowledge, and restrictive policies for those with mood disorders, 
further limits access to services and their uptake. Therefore, to pro- 
vide sustainable and effective evidence-based interventions, it is ne- 
cessary to identify changes in mental health policies and research 
services and to address inefficiencies in the distribution of funding 
and training needs [7]. At the same time, it is also important to 
understand the causes and underlying mechanisms involved in the 
development of mood disorders, to enable timely identification of 
the factors that contribute to their development. With early identifi- 
cation of risk and protective factors, it may be possible to intervene 
sooner to either prevent onset or delay progression to full-blown 
mood disorders. 


Causes of mood disorders 


To be able to intervene in an effective way so as to prevent and treat 
mood disorders, a deeper understanding of the risk factors that pre- 
dispose young people to developing mood disorders and the pro- 
tective factors that may increase resilience and adequately buffer 


them from the development of mood disorders, or at least their con- 
sequences, is critical. Understanding these factors may reveal targets 
for preventative and early interventions, as well as measurable out- 
comes that allow assessment of their efficacy and effectiveness. 

But despite extensive research in recent years, it has become 
increasingly evident that the causes of mood disorders are multi- 
factorial and that numerous variables contribute to the myriad of 
manifestations encountered clinically. For example, mood disorders 
are often preceded or triggered by life events—implicating a set of 
interactions between environmental and intrinsic risk factors that 
presumably converge on any number of neural systems and path- 
ways. Identification of such intrinsic and extrinsic risk factors that 
increase the probability of developing mood disorders may assist in 
predicting onset and serve to focus prevention endeavours. This is 
exciting because some of these factors may be discernible from a 
relatively early age [9-12], and while many are seemingly irrevers- 
ible, others are modifiable and may potentially serve as targets for 
prevention strategies. 

At the same time, enhancing protective factors may also serve to 
delay or prevent the deterioration of mental health. Thus, to main- 
tain an optimal level of well-being, any imbalance between risk and 
protection needs to shift so as to favour resilience. However, while 
exposure to risks and protective factors is inherent in the course 
of everyone’s life, critically, it is the extent of exposure and the 
individual’s response to this that varies. Interestingly, despite indi- 
vidual differences, there seems to be a developmental trajectory to 
exposure, that is, some risks and protective factors seem to be phase 
(stage of life)-dependent while others are more stable and present 
throughout the course of life. 

In the context of mood disorders, there seems to be a cumula- 
tive effect of risk and protective factors, wherein as the number of 
risk factors and the duration of exposure to these increases, so do 
the odds of developing a depressive or bipolar disorder, whereas a 
greater number of protective factors decreases these odds [13-16]. 
Specifically, emerging evidence suggests that through cumulative 
damage over time or disruptions during particularly sensitive de- 
velopmental periods (phase-dependent), the emergence of enduring 
mood disorders can be traced to exposure to adversity in the early 
years of life, from birth and infancy throughout childhood and ado- 
lescence into young adulthood (Fig. 68.1) [11, 14, 17-24]. At the 
same time, throughout these periods of growth and development, 
protective factors (intrinsic and extrinsic) act as buffers to minimize 
the deleterious effects of such adversity, and from the standpoint of 
prevention, these may also serve as targets that can be potentially 
enhanced as part of preventative strategies. 

For instance, from birth, there is a genetic predisposition, via 
parental transmission, that interacts with environmental risk fac- 
tors to increase the likelihood of developing mood disorders, but 
there is no particular time period during which this interaction 
is destined to exert an effect. As such, through the interaction be- 
tween the cumulative effects of, or exposure to, adversity at ‘critical’ 
time points and the individual’s genetic disposition, a somewhat de- 
layed, but chronic, expression of mood disorders may ensue years, 
or even decades, after the adverse event. These cumulative effects 
are thought to arise as a result of the ‘wear and tear or allostatic load 
of the body and the brain due to persistent exposure to adversity 
at multiple time points [16, 25, 26]. On the other hand, latent, but 
enduring, effects of exposure to adversity during sensitive periods 
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when the brain is presumed to be more responsive to positive 
and negative environmental triggers may become intricately em- 
bedded in underlying neurobiological processes [16, 18]. However, 
though it is likely that there are sensitive or ‘critical’ periods during 
which vulnerability to mood disorders is increased, the difference 
is probably modest, and evidence for such epochs of heightened 
susceptibility has thus far proven to be relatively weak and largely 
circumstantial [16, 18, 19]. 

Nevertheless, as a general rule, the earlier the intervention, the 
better the long-term outcomes are likely to be, and thus far, the evi- 
dence for this also seems to hold [27-30]. Most important, however, 
is to target the risk and protective factors that have a demonstrable 
causal link with mood disorders, and doing so earlier in develop- 
ment, as this has the greatest potential to produce the most robust, 
and possibly long-term positive outcomes [28, 29, 31]. 


Nature of mood disorders 


The major psychiatric taxonomies—the Diagnostic and Statistical 
Manual of Mental Disorders (DSM) and the International 
Classification of Diseases (ICD)—define mood disorders cross- 
sectionally, and they give relatively little consideration to their 
longitudinal course (while they acknowledge that they are chronic 
lifelong illnesses—often with an early age of onset) [24, 32-39]. This 
myopic view is, in part, pragmatic—because the trajectory of mood 
disorders is difficult to predict and the illnesses are intrinsically ca- 
pricious. But, because the propensity for recurrence binds all mood 
disorders, a longitudinal and longer-term perspective is essential. 
In this context, recurrence refers to the repeated occurrence of epi- 
sodes of depression and mania, but in reality, the underlying patho- 
physiology of the illness, once established, is always present; it is 
merely its clinical expression that ebbs and flows. This concept is not 
fully captured in classification systems, which instead emphasize the 
impact of acute exacerbations and infer that in between episodes of 
illness, patients recover as the illness remits. But when considering 





prevention, it is the underlying processes, that is, the subsyndromal 
pathological activity, that need to be targeted (Fig. 68.2). 


Prevention strategies 


When to intervene for maximum benefit is a crucial question that 
requires an understanding of the developmental trajectory of mood 
disorders. For instance, Howes and Falkenberg [40] have identified 
potential points for prevention in the developmental trajectory of bi- 
polar disorder (BD). The earliest point in this schema is when there 
is fluctuation of mood that represents an underlying cyclothymia or 
a mental state of ‘being at risk of developing BD. The second stage 
is when precipitating life events accumulate to the extent they cross 
a threshold and transition the illness into the ‘prodromal phase —in 
other words, the last stage where prevention is possible. However, 
as BD is often initially diagnosed as MDD, and the transition from 
MDD to BD can take up to 5 years, and even then, only a small pro- 
portion of cases make the transition, waiting for the prodromal 
phase before intervening risks altogether missing the window for 
prevention [41-43]. One advance could be to regard the develop- 
mental trajectory of some depressive disorders as the prelude to that 
of BD, and in such instances, by focusing on the prevention of de- 
pression, it may be possible, in fact, to prevent the development of 
BD. Similarly, a step further (backwards) builds on the observation 
that anxiety (and anxiety disorders) often precedes major depression 
and is a common ‘comorbidity’ of both depression and BD [44-48]. 
Therefore, anticipating and targeting anxiety may aid the prevention 
of mood disorders. With this developmental and chronological per- 
spective in mind, it is important to note that while vulnerabilities 
for anxiety and mood disorders may be established in adulthood, it 
is adolescence that seems to be the critical period for their clinical 
emergence and that this is set against a background of considerable 
neurobiological and psychosocial change that begins post-puberty 
[34, 49, 50]. The impact of these changes can be protective or detri- 
mental to long-term health outcomes, and so in either case, targeting 
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Fig. 68.2 The clinical trajectory of mood disorders and their underlying components. The shaded blue zones represent periods of pathological activity 
that drive the core pathological processes underpinning the illness. This leads to damage which then accumulates. Cumulative damage is shown as 

a blue line, and this indicates how, as the illness progresses, it scars brain functioning—impacting reserve and resilience. The underlying pathological 
processes and cumulative damage lead to clinical symptoms. The dashed line depicts clinical symptomatology upon which diagnoses are dependent. 
This transitions from being subsyndromal to eventuating in illness. The troughs and peaks illustrate the seemingly episodic nature of the illness, as it 


traverses the threshold for clinical diagnosis. 








SECTION 10 Mood disorders 


this stage could hold potential to benefit long-term outcomes [50]. 
Therefore, targeting the developmental trajectory of anxiety and de- 
pression may be a productive prevention strategy, with a particular 
focus on early vulnerability markers. 


‘Prevention’ vs treatment 





Prevention is the reduction of risk factors and enhancement of pro- 
tective factors prior to the onset of disease. Prevention interventions 
occur before the onset of a clinical episode and precede treatment 
interventions, and therefore, they lie intermediate on a continuum 
between health promotion and treatment [51, 52]. Effectiveness of 
prevention strategies appears to depend on the identification of risk 
factors for mood disorders and protective factors that produce resili- 
ence and defend against the development of mood disorders—even 
in extreme adversity. Treatment, on the other hand, is the reduction 
of disease-related symptoms and the enhancement of protective fac- 
tors and is usually only prescribed once an illness has formed and a 
clinical diagnosis has been made. 

However, the distinction between prevention and treatment is 
somewhat blurred. Some time ago, the Institute of Medicine recom- 
mended prevention should target only those with early symptoms 
that are not sufficiently severe to warrant a diagnosis of a mental 
disorder [53]. But this somewhat narrow definition overlooks a large 
pre-symptomatic group that may be at increased risk of mental dis- 
orders, and given our current understanding of the aetiology of mood 
disorders, waiting until symptoms emerge, apart from being less ef- 
fective, may primarily be regarded as unethical—especially if timely 
interventions can be shown to have a substantial impact. An alter- 
native perspective on the prevention of depression stipulates that for 
an intervention to be preventive, it has to show a post-intervention 
increase in depressive symptoms in the control group and either no 
increase or an attenuated increase in depressive symptoms in the 
intervention group [54]. This suggests that post-intervention, a re- 
duction in depressive symptoms, in the absence of an increase or no 
change in depressive symptoms in the control group, may be indica- 
tive of a treatment effect, rather than a prevention effect (Fig. 68.3). 
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Rationale for prevention 


The difficulty of diagnosing mood disorders in children and ado- 
lescents is widely acknowledged, and as a consequence, diagnostic 
criteria originally created for adults have been modified to capture 
mood disorders as they evolve in young people. But a lack of specifi- 
city and the variability in presentation mean that depression is often 
misdiagnosed and likely overlooked. Naturally, this impacts treat- 
ment decisions. The efficacy of drugs, in combination with psycho- 
therapy, for the treatment of unipolar depression is well established, 
and medications are a key component of any treatment regimen for 
bipolar disorders. However, the long-term consequences of pharma- 
cotherapy are yet to be fully determined, especially if commenced 
early in life [55-57]. For example, in the treatment of paediatric 
BD, pharmacological agents and cognitive behavioural, as well as 
interpersonal, therapies produce some benefit [intermediate ef- 
fect sizes (ES)], but these are thought to be unsustainable over time 
[56]. Interestingly, randomized controlled studies of psychosocial 
treatments suggest they may be effective in delaying the time to re- 
lapse, facilitating mood stabilization, reducing the severity of symp- 
toms of depression, and improving psychosocial functioning [58]. 
Intriguingly, such evidence points to the possibility that treatments, 
while ameliorating indicated symptoms, do not necessarily prevent 
the onset of mood disorders and the recurrence of future episodes, 
but instead may limit negative consequences. Therefore, while treat- 
ment is often regarded as having a preventive component, in ac- 
tuality, true prevention is rarely achieved with treatment alone. 

Evidence suggests that in primary care patients, prevention is 
more cost-effective than care as usual [59]. This is because the 
earlier the onset of mood disorders, the poorer the long-term 
prognosis—as demonstrated by a poorer overall quality of life, 
significantly dysfunctional interpersonal relationships, a greater 
number of recurrences, with shorter euthymic periods in between, 
greater comorbidity, and an increased likelihood of suicide attempts 
[60, 61]. Thus, it is no surprise that the most effective strategies to 
reduce the health burden of mood disorders are those most likely 
to prevent the onset of mood disorders, particularly in vulnerable 
young people [14, 60, 62-64]. 


TREATMENT 


Intervention 


Pre-intervention Post-intervention 


Fig. 68.3 A comparison of the effects of prevention and treatment interventions in depressed patients vs healthy subjects. Vertical bars represent 
implementation of either a prevention or treatment strategy. Horizontal lines represent risk of depression during prevention, and clinical change in 


response to treatment. An intervention that is an effective treatment decreases depression in depressed patients but has no tangible effect on controls. 
An intervention that is preventive does not impact those that have already established illness (patients) but moderates the risk, and therefore reduces 
the likelihood, of depression in healthy controls. 
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Intended target of prevention strategies 


The main aim of mood disorders prevention strategies is to re- 
duce the likelihood of developing the disorders and, as discussed, 
the benefits should readily outweigh any potential risks [14]. When 
considering prevention strategies, it is important to understand for 
whom they are intended and how they would be implemented, but 
most critical is to identify the target for which they are aiming. In 
this respect, according to the Institute of Medicine report [53], pre- 
vention strategies can be grouped into three categories: universal, se- 
lective, and indicated programmes (‘Table 68.2). 


Impact of prevention strategies 


The success of a prevention strategy is determined by its outcome, 
but measuring this is a complex task. Gauging the efficacy or effect- 
iveness of prevention approaches requires establishing clear out- 
comes, and unfortunately these are not easily defined. For instance, 
using a change in mood state as an outcome is not particularly useful 
in assessing universal and selective prevention strategies, whereas it 
is ideal for indicated prevention. This is because universal and se- 
lective approaches target intervention groups that may include those 
with low mood state levels from the outset and whose mood state 
changes are unlikely to be sufficiently large and clinically mean- 
ingful in comparison to other groups. This is because subthreshold 
depressive symptoms and syndromes seem to be prevalent (ran- 
ging from 10% to 50%) in community samples of youth who are 
the focus of universal and selective interventions [65-67]. On the 
other hand, the indicated approach will reveal clinically significant 
mood changes due to initial subsyndromal mood elevations in the 
intervention group, compared to other groups. Conversely, quality 
of life is an outcome measure that can be applied broadly across 


different types of strategies, but because it is a very general measure 
that is only loosely coupled with the intervention itself, its utility, 
particularly in differentiating different prevention approaches, is 
limited. Therefore, the selection of outcome measures is critical. 
Furthermore, in the absence of clearly defined causal mechanisms, 
it is difficult to identify not only reliable and measurable outcomes, 
but especially the ones that have clinical salience. Another pertinent 
issue regarding prevention approaches is the number of people 
needed to show the benefits of an intervention, in other words, its 
ES. Across the various prevention strategies, the numbers vary, with 
universal strategies usually requiring much larger numbers and in- 
dicated prevention strategies generally requiring smaller numbers to 
demonstrate an effect [51]. 


Professional responsibility 


Globally, mental health services (MHS) are often multi-disciplinary, 
and a number of different disciplines and professionals can as- 
sume responsibility for the prevention of mood disorders and the 
promotion of well-being. For instance, a social worker from child 
services may be in a position to monitor interactions within fam- 
ilies and identify problematic interactions that may be addressed 
with specific strategies to prevent the escalation of maladaptive 
behaviours. Indeed, ideally, the delivery of interventions has to be 
multi-disciplinary but integrated [68]. However, careful justifica- 
tion is needed when considering prevention because there are sig- 
nificant financial implications that affect funding policies governing 
the distribution of resources and training needs across disciplines. 
However, the prevention of mood disorders has been positioned 
as a global priority, and its goals extend beyond the narrow stric- 
tures of interdisciplinary MHS [69]. For instance, the Australian 


Table 68.2 Intervention programmes aimed at preventing or reducing mood disorders 


retires furiera  [sectve  Dindeaed | 


Target Intended for the general public or 
community, regardless of risk status 
or family conflict 
Impact If successful, have a greater public 
health impact 
Risk Degree of individual risk is not taken 
into account ifetime risk 
Risk/cost Ideal when their risk and cost are 
benefit minimal 
high 
Examples School-based and community/ 
primary interventions, for example 
PRP [93, 94], mindfulness- 
based interventions [115, 116], 





beyondblue [123], PSL [95], PRP 
[143], CB and IP interventions [144 


” This is an ongoing intervention evaluation protocol. 





Target a subgroup that is at higher risk, based on 
biological, psychological, or social/environmental 
risk factors, for example offspring of depressed 
parents, those with traumatic or life events such as 
divorce, bereavement, those experiencing marital 


f successful, have the potential to block the 
ransgenerational transfer of mood disorders 


Risk may not always be imminent; it could be a 


Preferred when the risk of the intervention is 
minimal or non-existent while the cost is not too 


Parenting focused and family-based interventions, 
or example ROSE programme [72], Preparation 
or Parenthood/Surviving Parenthood [145], 
FOCUS [76], EFFEKTE-E [77], FGCB [78], Family 
Options [79], KFS [80], FTI [81, 82] 


Target people who display subthreshold signs and 
symptoms of a given disorder. For example: 


For bipolar disorder—those in the prodromal phase 
exhibiting mood dysregulation, fluctuation of energy 
evels, and anxiety symptoms 


For depression—those with subclinical depressive 
symptoms (that do not fulfil the criteria of a diagnosis) 


Proximal to first onset/case identification and treatment 


f successful, can reduce the incidence of mood disorder 
onset and recurrences 


Risk is often imminent, needing swift attention 





ntervention would be deemed necessary, even if the risk 
and cost of the intervention are high 





“ School- and family-based CBT interventions, for 
example [87, 96, 113]; Effective Child and Family 
programme [83, 105, 106], ” STERK [142] 


PRP, Penn Resiliency Program; PSL, Problem Solving for Life; CB & IP, cognitive behavioural and Interpersonal Psychotherapy; ROSE, Reach Out, Stand Strong, Essentials; FOCUS, 
Families OverComing Under Stress; KFS, Keeping Families Strong; EFFEKT-E, Entwicklungsférderung in Familien: Eltern- und Kinder-Training in emotional belasteten Familien; FGCB, 
Family Group Cognitive-Behavioural; CBT: cognitive behavioural therapy; STERK: Screening and Training: Enhancing Resilience in Kids. 








SECTION 10 Mood disorders 


Government National Standards for Mental Health Services [70] 
recommended that MHS should identify individuals who are ‘ac- 
countable for developing, implementing and evaluating prevention 
activities, while ensuring adequate training in the principles of pre- 
vention for the workforce, with appropriate support provided for the 
implementation of prevention activities. However, it is not clear how 
these individuals can be identified. Likewise, the EU Compass for 
Action on Mental Health and Well-being—Prevention of Depression 
and Promotion of Resilience, in its consensus paper, has promoted 
intersectorial collaborations between the health, educational, la- 
bour sector, informational technology, and economic stakeholders 
and the building of capacity through training in order to tackle and 
alleviate the burden of depression in Europe. But in practice, it is 
possible that any number of individuals can be identified for such 
roles, and those need to be determined on the basis of implementa- 
tion requirements [31] (Fig. 68.4). 

Lastly, prevention has to be targeting specific individuals or groups, 
and in that case, the contexts under which the prevention will be 
implemented, for the individual/parents/extended family members/ 
peers/colleagues/schools/communities, have to be well understood. 
In this regard, the developmental and contextual framework that de- 
termines when and how interventions are implemented has to be 
taken into account. For instance, the National Research Council [14] 
recommended that besides multiple contexts, age-related patterns of 


competency and disorders and developmental tasks, as well as the 
interactions between biological, psychological, and social risk fac- 
tors, have to be taken into consideration when designing prevention 
interventions. 

The section on Preventative interventions, p. 706, illustrates the 
complexity of difficulties faced when making these considerations, 
with a summary of interventions implemented to date that directly 
or indirectly target the earliest developmental stage, that is, youth, 
and the setting under which they were implemented. 


Preventative interventions 





Parenting and family-based interventions 


Parenting and family-based interventions target parents or the 
family as a unit in order to reduce risk factors and enhance pro- 
tective factors in the offspring. This encourages changes in maladap- 
tive parenting skills, attitudes to parenting, and interactions between 
the parent and the child. Therefore, adaptive parenting skills and 
functional parent-child interactions are crucial at very early stages 
of child development and can be commenced perhaps during preg- 
nancy and early infancy, so that parents can be prepared emotion- 
ally and feel capable of nurturing their offspring and providing an 
environment that fosters secure attachment. Thus, early childhood 








Family physicians: 


Often first point of contact 











School educators (teachers/school 
counsellors/nurses/psychologists: 


Are proximal to childhood maladaptive 
behaviours such as school truancy, withdrawal 
as a consequence of school bullying, or adverse 
family dynamics 














Children’s services: 


Involved in monitoring of child development within 
families and communities 














Emergency service workers: 


Alternative first point of contact involved in making 
fast and appropriate referrals in aid of early detection 
of adversity or risk factors 














Psychiatrists/psychologists/mental health 
practitioners: 


Providers and evaluators of evidence-based 
interventions through clinical and research inputs 








Legislators/policymakers/service providers: 


Involved in setting up standards, mandating, and 
allocating financial and training resources to support 
interventions 





Fig. 68.4 Professionals that have the potential to contribute to the prevention of mood disorders. Six groups of professionals, among others, have 
the potential to directly contribute to the prevention of mood disorders. Where possible, those involved should be working collaboratively and ideally 


targeting youth populations. 
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environments and social networks have been identified as critical 
in shaping emotional development because strong social supportive 
relationships and networks determine positive and enduring mental 
well-being that extends to adulthood [27]. The ultimate aim of these 
interventions is to limit the transgenerational transmission of mood 
disorders, and several interesting strategies have been developed to 
achieve this goal. 

Core features of parenting and family-based interventions in- 
clude enhancing resilience, building of interpersonal relationships 
and supportive networks, parental skill enhancement, and problem- 
solving, as well as raising awareness of the impact of mental dis- 
orders on the individual and the family, particularly the offspring. 
In this regard, it is encouraging to note that prevention interven- 
tions can be implemented at very early stages of development, that 
is, during prenatal and perinatal stages, and can restrict maladaptive 
or enhance adaptive parenting skills so as to modify neurocognitive 
development in offspring [71]. Indeed, an example of such an inter- 
vention is the Reach Out, Stand Strong, Essentials for new mothers 
(ROSE) programme that targets disadvantaged pregnant women 
deemed to be at risk of post-partum depression [72]. The ROSE pro- 
gramme intervention is an interpersonal therapy (IPT)-based inter- 
vention that puts emphasis on building supportive interpersonal 
relationships. While limited by a very small sample size, the larger 
reduction (20%) in parental incidence of depression in the interven- 
tion group, compared to the smaller reduction in the comparison 
antenatal-only group (4%), is supportive of the potential utility of 
such interventions. Furthermore, psychosocial and interpersonal 
postnatal depression prevention strategies focusing on the perinatal 
phase may at least reduce the incidence of postnatal depression, a 
promising finding for infant well-being and development. Indeed, 
recent evidence for a direct and sustained effect of perinatal ma- 
ternal mood on the offspring’s neuropsychological development is 
strong and has prompted for calls for in utero programming and ac- 
tivation of the stress-immune systems [19, 21, 73]. Although prom- 
ising, the specific effects of such interventions on offspring mood 
states are not known, as the follow-up periods in most studies have 
thus far tended to be relatively brief—lasting typically no more than 
6 months. Therefore, longer-term follow-up of offspring cohorts is 
urgently needed to initially verify and then quantify these effects. 

Fostering adaptive interactions between parents and their ado- 
lescent offspring requires different parenting approaches to those 
employed in rearing infants and children. To identify parenting 
strategies that could potentially prevent depressive and anxiety 
symptoms in adolescent offspring, a poll of experts was under- 
taken using the Delphi Consensus method [74, 75]. This identified 
parenting strategies, such as establishing clear ground rules and con- 
sequences, establishing and maintaining a good relationship with 
the adolescent, and showing affection, as important. These strategies 
could be used in a manualized format, targeting parents as a check- 
list that they employ to help monitor their own behaviours in the 
implementation of interventions. 


Families at risk interventions 


Evidence from other parenting and family-based interventions 
of families at risk because of the presence of parental depression 
seem to show short-term benefits on the functioning of the family 
as a unit. These interventions do not target the offspring but in- 
stead target parents with children aged 18 months to 18 years, for 


example the Families OverComing Under Stress (FOCUS) inter- 
vention for youth aged 3-17 years [76]; Entwicklungsforderung 
in Familien: Eltern- und Kinder-Training in emotional belasteten 
Familien (EFFEKT-E) intervention for children aged 4-6 years 
[77]; Family Group Cognitive-Behavioural (FGCB) for youth aged 
9-15 years [78]; Family Options for children and adolescents aged 
18 months to18 years [79]; and Keeping Families Strong (KFS) 
intervention for youth aged 9-16 years [80]. The core features of 
these interventions are problem-solving and skill building, mainly 
parenting and coping skills, as well as providing psychoeducation 
on various issues related to depression and its impact on the affected 
individual and the rest of the family, particularly the offspring. This, 
in turn, raises awareness and possibly tolerance, thus enhancing re- 
silience and minimizing maladaptive stress responses. 

Generally, these interventions show short-term benefits for both 
the children and their parents, with reductions in parental depressive 
symptoms and an increase in the offspring’s prosocial behaviours 
and emotional expression that are mediated by ameliorating par- 
ental depression. However, one intervention evaluation did not have 
child-specific outcomes [79], and another showed improvements in 
child behaviour within the family that could not be transferred to 
the school environment where an increase in school maladjustment 
was seen [80]. The FGCB intervention, based on clinician ratings of 
mood changes, showed sustained benefits lasting 24 months, while 
the Youth Self-Report (YSR) measure showed intermittent and in- 
consistent benefits that could not be explained [78]. It could be that 
the sensitivity of the outcome measure might be crucial in the detec- 
tion of changes. 

Family therapeutic interventions (FTIs) target both mentally ill 
parents and their offspring aged 8-17 years [81-83]. These inter- 
ventions employ CBT and psychoeducation components. Generally, 
these interventions have been shown to produce variable effects 
on mood state. For instance, FTIs produced inconsistent improve- 
ments in mood that were short term [83]. Interestingly, it is possible 
that these were mediated by improvements in anxiety, which often 
precedes depression and is thought to increase its likelihood and 
advance its onset to an earlier age [84]. Therefore, perhaps meas- 
ures of anxiety should be routinely included when evaluating out- 
comes of prevention interventions [85]. However, FTIs also seem to 
have widespread effects in reducing risk and increasing protective 
factors of mood disorders through facilitation of a stronger child- 
parent relationship. Such improvements in family functioning have 
been shown to be sustained for up to 5 years, but this effect is not 
specific to FTIs and can also be achieved through brief parental 
psychoeducation [81-82]. 

CBT-based interventions have also been shown to have sustained 
reductions in the incidence of depression that is detectable for up to 
75 months post-intervention and is moderated by baseline parental 
depression [86]. This means that, for maximum benefit, it might be 
prudent to treat parental depression prior to the initiation of pre- 
vention intervention programmes. Perhaps the successful treatment 
of parental depression fosters an environment that facilitates the 
preventative effects of interventions on depression. Interestingly, 
this series of evaluations [86-88] also seem to suggest that overall a 
smaller number of adolescents need to undergo an intervention to 
prevent the onset of depression (NNT = 9 at 9 months to NNT = 6 
at 33 months). However, these findings have to be interpreted with 
caution because the inclusion of adolescents with a prior history of 
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depression might, to some extent, be responsible for the observed 
intervention effects—obfuscating whether the intervention is truly 
preventative or simply therapeutic. 


School-based interventions 


There have been an increasing number of school-based interven- 
tions in the last two decades, with similar overarching principles. 
Primarily, they are aimed at preventing the development of mood 
disorders, particularly depressive disorders in children and adoles- 
cents. School-based interventions offer advantages over other inter- 
vention settings because they provide ready access to large groups of 
young people and also remove some of the physical and economic 
barriers to access of services that usually limit their uptake [89]. 
School-based interventions, such as the CBT-based Penn Resilience 
Program (PRP) intervention, beyondblue, Problem Solving for Life, 
Resourceful Adolescent program, Positive Thoughts and Actions, 
and mindfulness-based programmes, have been implemented with 
varying degrees of success (‘Table 68.3). Each of these interventions 
has a number of core characteristic features, with some modest 
variations, involving mostly universal or targeted (selective and in- 
dicated) approaches. A selection of some recent interventions illus- 
trates these similarities and differences. 


Penn Resiliency Program 


The PRP [90] is the most evaluated CBT- and strength-based pre- 
vention programme designed for children and adolescents aged 
10-14. It is aimed at increasing resilience, reducing depressive 


Table 68.3 Examples of school-based programmes 


symptoms, and preventing the development of depressive dis- 
orders. Typically, it is a 12-week programme focused on changing 
cognitions, linking these to emotions, challenging negative ex- 
planations for events, handling and resolving conflict, problem- 
solving, and coping strategies, with weekly homework assignments 
between sessions. It is primarily a school-based intervention but 
has been evaluated in other settings such as primary care clinics. 
Its effectiveness seems to be independent of the expertise of the 
personnel leading its implementation, although the effects are 
slightly larger when implemented by research leaders than trained 
community leaders. It has been evaluated both as a targeted and as 
a universal intervention programme, and the former seems to be 
more beneficial [91-94]. 


Problem Solving for Life 


The Problem Solving for Life (PSFL) [95] intervention is a universal 
intervention that is teacher-led and incorporated into the school- 
term schedule. It is offered in weekly sessions over an 8-week period 
during a term, with each session lasting for 40-50 minutes. All 
sessions use various teaching methods, including interactive exer- 
cises, diary keeping, homework assignments, and the use of cartoons 
and class discussions. The intervention focuses on integrating cogni- 
tive restructuring and problem-solving. Targeted core skills include 
the identification of cognitive styles and challenging maladaptive 
problem-solving tendencies by teaching life problem-solving skills, 
enhancing the development of a positive problem-solving orienta- 
tion, and optimistic thinking styles [95]. 


Programme Prevention type Age range Programme Comparison Number of Evaluation 
content programme sessions duration 





Penn Resiliency Selective (high risk based 10-13 years CBT-based 

Program (PRP) on low socio-economic manualized group 
status) intervention 

PRP Universal (girls onlyand 11-14 years  CBT-based 
co-ed) 

PRP Universal 11-14 years CBT-based 

Problem Solving Universal 12-14 years  CBT-based 

for Life (PSFL) 

Resourceful Universal (RAP- |2elsiveeis [CBT 

Adolescent Adolescent or 

Program RAP-Parent) 

(RAP) 

RAP-Kiwi Universal 13-14 years CBT and IPT 

beyondblue Universal 13-14 years Multicomponent 
Selective (attributional 14-15 years Adaptive coping 
styles) skills training, 
Indicated (subthreshold enhancement 
depressive symptoms of self-esteem 

and personal 
development 
Positive Thoughts Indicated (subthreshold 12-14 years CBT 
and Action (PTA) depressive symptoms 











No intervention Up to 12 and [91, 92] 
24 months 

Co-ed no intervention 12 Up to 12 months [93] 

Penn Enhancement 12 Every 6 monthsup [94] 

Program (identification to 3 years 

of depression-related 

stressor) or no 

intervention 

Monitoring control 8 Up to 1 year [95] 

Adolescent Watch (no RAP-A: 11 Up to 10 months [97] 

intervention) RAP-P: 3 

IPT only 11 Up to 18 months [96] 

Community focus 10 Annually for 3 years [122, 123] 
and up to 5 years 

Assessment only 14 6 and 12 months [105, 106] 

conducted at beginning 

and end of active 

intervention 

Individual Support 12 6 and 12 months [99, 100] 


Program 
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The Resourceful Adolescent Program 


The Resourceful Adolescent Program (RAP) [96-98] is a universal 
school-based intervention designed for adolescents (RAP-A) and 
their parents (RAP-P). The RAP-A component is fully manualized 
and implemented by school personnel at baseline during classes to 
groups of adolescents, aged 14-15, in Year 9. It is implemented in 
11-weekly sessions per term, with each session lasting for 40-50 
minutes. It is strength-based to enhance resilience through imple- 
mentation of CBT and interpersonal therapy techniques. The focus 
is on stress management, cognitive restructuring, problem-solving, 
managing and resolving conflict, and building supportive networks. 
The RAP-P component is offered to parents of a cohort of adoles- 
cents and is usually delivered as a series of two separate 2- to 3-hour 
long workshops. The focus is on building parent's strengths, stress 
management, examination of parental roles in the development of 
their adolescent children while promoting positive family relation- 
ships, and conflict management and resolution. 


Positive Thoughts and Action Program 


The Positive Thoughts and Action Program (PTA) [99-100] is a 
manualized indicated prevention programme that is developmen- 
tally sensitive, targeting early adolescents. It is a CBT-based group 
intervention, focusing on cognitive style, problem-solving, coping 
skills, and the application of learnt skills in social functioning and 
behaviours. It is implemented in weekly sessions lasting 50 minutes 
in groups of 4-6 adolescents. Parental involvement is encouraged 
via two home visits with the child and parents, as well as parent 
workshops. Parent workshops involve psychoeducation regarding 
adolescent development and skills training. 


Beyondblue intervention 


The beyondblue intervention [101] is a flexible universal inter- 
vention established as part of the national depression initiative in 
Australia that fosters a stronger partnership between health and 
education services. It is implemented to Year 8 adolescents aged 
13-14 years and follows them up for the next 3 years, in each year 
building on skills that have been previously taught and learnt. Each 
year, adolescents participate in 10-weekly classroom sessions lasting 
for 30-40 minutes. The intervention has four core features. It is a 
curriculum intervention, which is focused on building supportive 
environments, while building pathways for care and education and 
the use of community forums. As a curriculum intervention, it pro- 
motes well-being and enhances core skills and resilience. During the 
classroom sessions, core skills are taught in an interactive manner 
with the use of role plays, videos, quizzes, group exercises, and dis- 
cussions. Core skills include problem-solving, conflict resolution, 
assertiveness, stress reduction, social skills, emotional education, 
self-efficacy and awareness, building of positive expectations of self 
in relation to the world and the future, greater understanding of 
mental health problems, and capacity to seek help. 


Summary 


The PRP intervention effects have been variable, with some evalu- 
ations revealing short-term benefits that are not sustainable in the 
long term, while others have shown longer-term benefits for up to 
2 years. Interestingly, the longer duration of benefits seems to depend 
on the presence of subthreshold symptoms at baseline and their se- 
verity. This raises the question of whether the PRP was preventative 


or merely a treatment of mild depressive symptoms, which, in turn, 
raises issues regarding conceptualization as a determining factor 
in prevention vs treatment. Furthermore, the strength of interven- 
tion effects seems to be dependent on symptoms at baseline, with 
stronger effects in those with subthreshold symptoms at baseline and 
only trend effects in those who are asymptomatic. It has also been 
suggested that the most reliable PRP effects are produced in research 
settings when the intervention is implemented by the developers 
and their expert colleagues. This means that while it is efficacious, its 
effectiveness is questionable. However, it does seem to have potential 
for those with mild to moderate depressive symptoms. 

The PSFL intervention has also been shown to be superior to 
monitoring in the short term, with reductions in depressive symp- 
toms that are clinically equivalent to the normative sample. The use 
of the Beck Depression Inventory (BDI) as an outcome measure in 
non-clinical populations has not been promoted due to questionable 
sensitivity in this population group [102, 103], but there is a possi- 
bility that booster sessions might have been helpful in maintaining 
benefits from the initial intervention. 

Although, conceptually, the beyondblue intervention has multiple 
components, each of which is evidence-based, it failed to show effect- 
iveness in reducing adolescent depressive symptoms or improving 
their well-being. It could be that, while the efficacy of the cognitive 
and IPT-based components of the intervention is somewhat estab- 
lished, efficacy of the building pathways for care and education and 
community focus components is largely unknown. ‘Therefore, the 
effects of these components, separately and combined, have not 
been well tested and estimated. However, a meta-analytic review has 
suggested that the programme content does not moderate interven- 
tion effects [104]. Perhaps, with so many components to address, 
ten sessions per term is too conservative and does not allow for an 
in-depth exploration and analyses of all the issues raised by parti- 
cipants to their satisfaction. Another challenge brought to the fore 
is the need for adequate and in-depth training of teachers to im- 
plement the intervention with fidelity, which requires significant re- 
sources. This further highlights the difficulty of incorporating and 
integrating such complex multicomponent interventions within the 
school setting. 

Contrary to this, a selective (based on the presence of a cognitive 
risk factor—attributional style) and indicated (based on the pres- 
ence of subthreshold depressive symptoms) prevention programme 
targeting adolescents aged 14-15 years was successfully imple- 
mented in a school setting in Iceland [105, 106]. The intervention 
was aimed at enhancing adaptive coping, self-esteem, and personal 
development, while the comparison was only an assessment con- 
ducted at the beginning and at the end of the active programme, 
although participants in this intervention were allowed to seek treat- 
ment elsewhere at any time, if needed. It was implemented by school 
psychologists and regular school staff trained in its implementation 
and assessment procedures. This intervention showed no new epi- 
sodes of depression/dysthymia in the intervention group, com- 
pared to a 2.5% increase in the usual treatment group, and at 6 and 
12 months’ follow-up, there was a 1.6% and 3.92% increase, respect- 
ively, in new episodes of depression/dysthymia for the intervention 
group, compared to a 13.3% and 20.97% increase, respectively, in the 
treatment as usual group. This increase was twice and three times as 
large in females, compared to males, at 6 and 12 months. This evi- 
dence suggests that sustained gains in the prevention of the onset 
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of depression/dysthymia can be achieved with an intervention that 
enhances adaptive coping and personal development. The authors 
also identified the age of 14 years as potentially a good age to com- 
mence preventive intervention in those at high risk. Interestingly, 
this tallies with neurobiological research that shows the biggest 
changes occurring in 15-year-old girls with emotional symptoms— 
thought to be antecedents [107]. By using school personnel for the 
implementation of the programme, the authors point to its poten- 
tial for real-world utility and effectiveness in significantly reducing 
the onset of depression, further bolstering the potential of targeted 
interventions in preventing or possibly delaying the onset of depres- 
sion/dysthymia by at least a year. 

The universal implementation of the RAP intervention is con- 
sidered to have the potential to either shift about 13% adolescents 
out of depression or prevent them from developing subclinical or 
clinical depression, at least within 10 months of receiving the inter- 
vention. However, although well tolerated by adolescents, teachers 
were less optimistic about the programme, stating the manual was 
restrictive and inflexible for addressing the unique needs of indi- 
vidual classes. Furthermore, the intervention effects seem to vary, 
depending on the outcome measure used. For instance, although 
both the self-report Reynolds Adolescent Depression (RAD) scale 
and the Beck Depression Inventory-II (BDI-II) revealed initial 
intervention benefits, only the RAD revealed sustained effects to 
18 months post-intervention. This again perhaps points to a lack of 
sensitivity of BDI measures in non-clinical populations. However, 
the results have to be interpreted with some caution because the ana- 
lysis was performed based on the assumption that the intervention 
would be more beneficial than placebo. The presence of some deteri- 
oration in mood state for both interventions over time may suggest 
that such an assumption was premature. Statistical analysis should 
have taken this possibility into account but has likely compromised 
the detection of intervention effects. 

A recent systematic review and meta-analysis of school-based 
interventions for depression only and depression and anxiety, respect- 
ively, have revealed very small and short-term intervention effects that 
are seemingly influenced by the type of prevention and the proficiency 
of the personnel delivering them [89]. In terms of how long the effects 
last for, on average, they tend not to last for more than 24 months; in- 
deed, the effects are stronger immediately following the intervention 
(ES, g = 0.23), reducing to g = 0.20, 0.12, and 0.11 in the first 6 months, 
6-12 months, and over 12 months, respectively. It is not clear whether 
it is loss of power to detect long-term effects or a natural process of 
decline that explains the loss of benefit over time. If it is the latter, 
then perhaps booster sessions may be necessary to maintain improve- 
ments long term. However, if the issue is simply that of the studies 
being underpowered, then there is a need for further evaluations that 
are suitably powered to gauge long-term effects involving larger num- 
bers of subjects. Alternatively, other measures that can perhaps de- 
tect more subtle changes over time with relatively smaller numbers 
may be trialled. Interestingly, a recent Cochrane review of CBT- and 
IPT-based interventions has revealed short-term benefits lasting no 
more than 12 months, and an estimated number to treat of 11, which 
is comparable to other public health interventions [108]. 


Internet-based interventions 


Emerging evidence from the few Internet-based interventions reveal 
moderate to large effects of the intervention on reducing depressive 


symptoms that are sustainable for 2.5 years. For instance, the tar- 
geted Competent Adulthood Transition with Cognitive Behavioural 
and Interpersonal Training (CATCH-IT) intervention for adoles- 
cents and young adults aged 14-21 years was developed to establish 
strategies physicians could use to actively engage young people at 
risk of depression [109-112]. As a behaviour change and resiliency- 
building Internet-based intervention, it was implemented by phys- 
icians offering motivational interviewing (MI) or brief advice (BA). 
Both interventions elicited prevention and treatment effects, with an 
increasing percentage of symptom-free individuals from baseline to 
12 months and reductions in clinically significant depressive symp- 
toms over the 2.5-year follow-up. Furthermore, when comparing the 
CATCH-IT to reported face-to-face cognitive behavioural-based 
interventions, it was revealed that they produced larger effects, 
with a lower NNT for depressive episode (Internet NNT = 4.79 vs 
face-to-face NNT = 11.4) [113]. Also, CATCH-IT produced large 
effects (ES 0.96), compared to face-to-face interventions (ES 0.51) 
[113]. These ES are comparable to BDI-measured treatment effects 
in adults [114]. 


Mindfulness-based interventions 


Mindfulness interventions are based on adult mindfulness-based 
stress reduction (MBSR)/mindfulness-based cognitive therapy 
(MBCT) principles and were implemented in a cluster of 50 schools 
in Belgium [115] and in Australian schools [116]. During sessions, 
guided and unguided mindfulness practices and relaxation were 
taught, followed by manualized homework exercises. Practices in- 
cluded breath counting, stopping and being present, mindfulness 
of daily activities, for example walking or watching thought traffic, 
two guided audio (9 minutes long) of seated and lying body scans, 
and breath awareness. The control arm involved normal curricular 
activities. 

Interestingly, mindfulness has significant and sustained benefi- 
cial effects on self-report depressive symptoms, at least in the short 
term of 6 months’ duration, suggestive of preventative and treat- 
ment features in Belgium studies, whereas none of these effects were 
seen in Australian studies. It could be because the Belgian cohort 
was slightly older than the Australian cohort (14-17 years vs 12- 
14 years) [115, 116]. The developmental trajectory of mindfulness 
is not fully understood, and it could be that younger participants 
are at a different developmental stage and are not ready, or able, to 
process and use the concepts of mindfulness. Furthermore, in the 
Australian study, mindfulness seemed to produce anxiety [116]. The 
increase in anxiety in the mindfulness intervention group might be 
associated with, or driven by, an increase in ruminative processes 
where attention is perhaps drawn to issues that would normally be 
ignored or avoided. 


Universal vs targeted interventions 


In general, universal interventions produce smaller effects (g= 0.19), 
compared to targeted interventions (g = 0.32), and indeed a system- 
atic review revealed that targeted CBT-based interventions produce 
moderate to large ES (ranging from 0.21 to 1.40) [117]. Interestingly, 
these effects were achieved only when the intervention was de- 
livered by mental health professionals, while teacher-led interven- 
tions produced significantly smaller effects, suggesting that the 
personnel delivering the intervention and, in particular, the extent 
of their training contribute to the effects. Specifically, school-based 
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personnel achieved the smallest effects (g = 0.17) and their experi- 
ence was variable. This is potentially problematic because it could 
encourage reluctance to incorporate and embed such interventions 
within regular school schedules where teachers and school coun- 
sellors have to invest a lot of time in class preparation, training, and 
supervision for relatively small gains. It has also been suggested 
that the magnitude of these effects may be influenced by the gender 
composition of participants, along with their age, and the duration 
of intervention. Thus far, larger effects have been seen with inter- 
ventions involving more females, older adolescents, and shorter- 
duration programmes with homework assignments. 

At the same time, it has been argued that regardless of the fact that 
the effects are small, they can potentially translate to meaningful im- 
provements at a population level, with enormous implications for 
preventing the onset of mood disorders. Indeed, it has been shown 
that up to 9 months, implementing preventative interventions re- 
duces the risk of developing depression, with RR 0.45, 0.61, and 
0.79 for universal, selective, and indicated interventions, respect- 
ively, although the indicated intervention maintains reductions up 
to 18 months (RR = 0.23) [118]. This therefore supports the notion 
that, despite the smaller ES of universal interventions in reducing 
depressive symptoms, they may have greater utility in preventing the 
onset of depression. Hence, a combination of both universal and tar- 
geted interventions is needed. 


Evaluation duration 


There is a growing need for long-term evaluations not only because 
the long-term sustained effects of prevention interventions have 
been variable, but because the initial effects of interventions have 
been shown to surface years later, during subsequent developmental 
phases, possibly because of having been missed earlier at times of 
greater ‘noise’ [119]. Perhaps initial intervention effects are masked 
by other processes or events occurring simultaneously, such that 
when emotional stability and maturity have been achieved, they 
re-emerge. Indeed, in one series of studies, intervention effects on 
risky behaviour seen at age 13, were no longer discernible for 4 years 
and then resurfaced at age 18 [28, 29]. This poses the quandary of 
whether initial intervention benefits need to be demonstrated before 
one can have some confidence in their likely downstream effects. 
Naturally, in the absence of initial benefits, one would assume that 
the intervention was not successful. The other point to consider re- 
garding duration is the possibility that prevention interventions may 
not necessarily prevent the onset of mood disorders but may merely 
delay their onset [120]. Ifso, it remains to be seen how long this delay 
is in reality, and thus intervention evaluations may provide evidence 
for this alternate hypothesis. Furthermore, longer-duration evalu- 
ations may also reveal whether intervention effects decay after a 
while, and the potential of interim booster sessions to reignite inter- 
vention effects that may well be on their way to decay, and whether 
the strength of initial benefits determines the rate of decay. 


Age effects on interventions 


The question of when to intervene or which age to target is a separate 
question that seems to be dependent on the content of the interven- 
tion. For instance, evidence from mindfulness-based interventions 
seems to suggest intervention efficacy in later adolescence (14- 
17 years), compared to early adolescence (12-14 years) [115, 116]. 
This is most likely because the concepts of mindfulness are too 


complicated for younger adolescents to comprehend and process. 
On the other hand, other school-based interventions that are largely 
based on cognitive behavioural techniques reveal stronger effects 
among children (g = 0.50), compared to early adolescents (g = 0.23) 
and older adolescents (g = 0.22), although the sample of interven- 
tions targeting younger children is relatively small [89]. Together, 
it suggests that each type of intervention is likely to be age-specific. 


The evidence from school-based interventions does not seem to 
be as strong as initially envisaged. Indeed, numerous null findings 
and non-significant trend effects have been reported [94, 121-123]. 
Factors contributing to the inconsistencies need clarification to help 
improve future prevention endeavours. 

At the same time, while universal interventions produce the 
smallest short-term or intermittent effects, targeted interventions 
seem to produce the largest ES. Furthermore, it seems that ES are 
dependent on the type of intervention, and also, there may be an 
issue with the reliability and sensitivity of outcome measures that fail 
to detect significant effects but tend to reveal non-significant trend 
effects. Furthermore, intervention effects tend to be short term, and 
those showing sustained effects tend to be indicated, rather than se- 
lective and universal interventions. Only a few have revealed truly 
prevention vs treatment effects; only targeted interventions show 
small numbers needed to treat. When to intervene and the possible 
need for booster sessions to sustain long-term benefits need further 
clarification. 


Future directions 


It is clear that the potential for prevention interventions is largely 
untapped. This is due to several reasons. Firstly, the issue of whether 
to intervene universally or to target specific populations remains 
unclear, despite superior gains from targeted interventions, as 
compared to universal interventions. This is because the nature of 
mood disorders is unpredictable. For example, although it can be 
predicted, with some confidence, that those who have parents with 
depression or have been themselves exposed to childhood adversity 
are more likely to develop mood disorders at some point in their 
lives, not everyone with these risk factors will go on to do so. On 
the other hand, there are many who develop mood disorders but 
have not been seemingly exposed to these risk factors. But preven- 
tion should aim to benefit both sets of individuals—those at high 
risk who go on to develop mood disorders and those at low risk who 
happen to develop mood disorders because of factors that are as yet 
unknown. Clearly, both groups should benefit from intervention, 
and hence the need for both targeted and universal interventions. 
Specifically, in order to target only those who need the interven- 
tion, it has been suggested that a multi-level, integrated approach 
to prevention is needed—one that takes into account the multitude 
of key risk and protective factors [31, 88]. But this approach, aimed 
at enhancing resilience, needs to be dynamic and contextual, be- 
cause each developmental stage is characterized by the emergence 
of new risks that may override or negate earlier protective factors, 
necessitating ongoing adaptation [119, 124]. Indeed, although ex- 
posure to regular and controlled stress is necessary for adaptation, 
the development of effective coping strategies, and maintenance of 
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allostasis, it is when the stress is uncontrollable, irregular, and un- 
predictable that allostatic overload and ensuing problems arise, and 
are qualitatively compounded by environmental factors and indi- 
vidual differences such as genetic disposition and personality traits 
[23]. This is particularly relevant to children and adolescents who 
have to be taught the best ways of dealing and coping with stress 
in different contexts. For instance, when taught strategies that are 
maladaptive or that become maladaptive in a particular context, a 
flexible and integrated approach is needed for countering elicited 
stress. This is because prolonged exposure to stress potentially re- 
sults in cumulative damage that is undetectable at first but causes 
subliminal pathological activity that eventuates in mood disorders. 
Therefore, neurobiological and psychosocial processes provide im- 
portant substrates that underpin the integrated approach to preven- 
tion [119, 124]. Remarkably, to date, prevention approaches have 
focused largely on psychosocial aspects of mood disorders. 

There is also an incentive to target the prodrome of mood dis- 
orders, because the benefits of intervention reductions in subclin- 
ical symptoms may be easier to measure. This reduction is presumed 
to reflect a slowing of any underlying process, thereby delaying the 
onset of the disorder or instigating the expression of a full-blown epi- 
sode. However, a reduction in symptoms is perhaps more of a thera- 
peutic than a preventive effect. Furthermore, delaying to intervene 
until prodromal symptoms present might also be too late to enact, 
‘true prevention’ —that is, preventing the initiation of underlying 
pathological processes. Such subtle underlying pathological activity 
is undetectable using current tools and measures. However, recent 
technological advances bode well for the prevention of mood dis- 
orders, and perhaps it is time to shift towards more sensitive and 
specific measures that are able to target the underlying pathological 
activity before it escalates and leads to a mood disorder [23]. Lastly, 
there are ongoing protocols that address some of the limitations in 
previous intervention protocols such as the inclusion of quality of 
life and cost-effectiveness measures as outcomes. Also, the compu- 
tation of a high risk index (HRI) as a measure of risk loading might 
be a useful tool to use for stratification. 


Contributions from genetic studies 


A potential avenue that is yet to be fully understood is the gen- 
etic contribution to the risk of mood disorders. The evidence ac- 
cumulated from studies of candidate genes and single-nucleotide 
polymorphisms (SNPs) provided no clear answers, largely because 
of the clinical heterogeneity of mood disorders and the large num- 
bers of data points required for such investigations. 

However, in recent times, with the use of much more powerful 
genome-wide association studies (GWAS), genetic risk predic- 
tion scores for individuals can be obtained that may well lead to 
new strategies for understanding and preventing mood disorders 
[125, 126]. For example, the polygenic risk score (PGRS) is a 
‘measure’ that can be obtained by aggregating effects of SNPs (al- 
leles) associated with disease status present in each individual, and 
this can be robustly applied to small samples [126-128]. Using this 
approach, PGRS has been applied prospectively to predict the risk of 
BD where the PGRS was found to be higher in high-risk young in- 
dividuals (offspring and siblings of individuals with BD), compared 
to controls, and to provide a useful means of identifying traits linked 
to the genetic risk of mood disorders [20, 129-132]. The potential 


to apply the PGRS as an outcome measure in prevention interven- 
tions has already been demonstrated in medical conditions. For ex- 
ample, in a 10-year coronary heart disease risk study, those receiving 
statins had a significant (3%) reduction in relevant PGRS, compared 
to controls. Similarly, in a 30-year risk of cancer, a lifestyle change 
(stopping smoking) had a significant (5.4%) reduction in PGRS. 
Importantly, these reductions were seen in those with a high PGRS 
score, compared to those with a moderate score. Furthermore, the 
risk threshold for screening for breast and colon cancer has been 
found to be reached much earlier by those with a high PGRS and 
family history of the disease [133]. Therefore, primary prevention 
strategies could potentially be facilitated by incorporating PGRS 
and other biological factors to identify those at different levels of 
risk of developing mood disorders. In other words, interventions 
could be tailored according to risk [133]. Furthermore, modifica- 
tion of the PGRS and, in turn, gene expression through psychosocial 
interventions serves as demonstration of the impact and importance 
of epigenetic processes. However, ethical, legal, and organizational 
considerations affecting the perception of genetic testing would have 
to be explored and addressed in order for polygenic predictions to be 
widely adopted in clinical practice. 


Filling in the gaps 


The missing link in prevention programmes that may contribute 
to the less than robust effects of interventions on measured out- 
comes could also be the use of more sensitive and possibly reli- 
able outcome measures targeting individual biology. While social 
processes are fluid and changing practically on a daily basis and 
self-reported functioning may be biased, other measures targeting 
biomarkers may be more subtle and stable. It has been suggested 
that identification of blood biomarkers for mood disorders may 
provide more specific and sensitive outcomes measures, while other 
potentially promising biomarkers of both resilience and vulner- 
ability include measures of hypothalamic-—pituitary—adrenal (HPA) 
axis dysregulation, oxidative stress, inflammation, autonomic re- 
sponses, telomere length, and epigenetic profiles [15, 16, 23, 134]. 
Indeed, the course of stress exposure may not always be current 
or traumatic in its intensity but often can be traced to the prenatal 
period which seems to prime future neural and physiological re- 
sponses, and these effects may persist throughout life [23]. The use 
of such measures in conjunction with the PGRS may eliminate the 
inherent problems of deciding who to target, irrespective of the risk 
status, whether at increased or low risk at the time of testing. Thus, in 
this context, the ultimate aim of prevention is not just to prevent the 
behavioural and clinical manifestations of mood disorders, but also 
to go deeper and redress or modify imbalances at cellular, neural, 
psychological, and behavioural levels. Indeed, there is potential 
utility in targeting significant mediating biomarkers for depression 
such as oxidative stress and inflammation, for instance with life- 
style changes and increasing the consumption of antioxidant-rich 
foods and omega-3 fatty acids and engaging in regular, but mod- 
erate, physical activity, along with ensuring adequate production of 
melatonin through sufficient sleep, but there are several problems 
that need to be solved to make these measures viable. Research in 
this area is still in its infancy and possibilities for the prevention 
of mood disorders are emerging; therefore, these investigations are 
crucial [135-137]. 
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When to intervene 


At the same time as considering what to intervene with, the timing 
of any preventative measures is a key decision—for which there 
is no clear answer. Should preventative measures be instituted in 
early or middle childhood, or later still in adolescence, or are there 
specific critical periods that yield the most beneficial outcomes? 
Perhaps a standard cannot be developed and instead it is neces- 
sary to tailor interventions to individual needs based on specific 
risk markers. This would also mean that monitoring and follow-up 
adopt a personalized approach. Driving the need for prevention/ 
early intervention is the fact that the cumulative effects of stress 
exposure are increasingly damaging, the longer they are present; 
and the earlier an intervention is instituted, the greater the po- 
tential long-term benefits [15, 16]. Prevention strategies that have 
been used so far, with variable success, include mindfulness- and 
CBT-based interventions, IPT interventions, psychoeducation, 
and skill training (parenting, coping, and problem-solving skills). 
Mindfulness prevention techniques have produced less than ro- 
bust effects with respect to depression, and research thus far sug- 
gests that late adolescents may benefit more than early adolescents 
[115, 116]. However, understanding the mechanism of action 
and targeting those mechanisms may be more efficacious than 
self-reported mood changes. It has recently been revealed that 
mindfulness exerts effects on enhancing self-regulation through 
neuroplasticity changes in several brain regions implicated in 
mood disorders and their development, including the anterior 
cingulate cortex (ACC), the insula, the temporo-parietal junction, 
fronto-limbic connectivity, the default mode networks (DMNs), 
and grey matter volume (GMV) increases in neural structures 
involved in emotion regulation, learning, and memory [88, 138- 
140]. This burgeoning body of research provides further evidence 
for the need to incorporate biological consequences of interven- 
tions in the evaluation of such prevention strategies, and their po- 
tential to identify when responsiveness to interventions occurs in 
the developmental trajectory. 


Model generation 


While our understanding remains incomplete, working models of 
the emotional mind are important but need to be constantly re- 
viewed as new knowledge comes to light. It is therefore necessary 
to refine the models underpinning prevention interventions, so as 
to integrate neurobiology, neuropsychology, and social aspects of 
depression/mood disorders. For instance, if there are intervention 
gains in depression but not a significant mediator on which the 
intervention is based, for example cognition, or there is a change 
in cognition but not depression, then the accuracy of the original 
model and the sensitivity, reliability, and validity of the outcome 
measures need to be modified [104]. In this vein, it has also been 
posited that “identification of genetic variants and understanding 
the molecular mechanisms may lead to prevention strategies aimed 
at correcting molecular disturbances in the pathways that lead from 
genes to behaviour, including the molecular pathways that underlie 
the effects of known risk factors for the disorder [14]. However, 
scrutiny of recent systematic reviews and meta-analyses of various 
approaches for the prevention of depression in children and adoles- 
cents published in the decade since this proposal was made reveal 
a huge gap that has not even begun to be narrowed [63, 89, 108, 


117, 118]. Fundamentally, integration of our knowledge is lacking, 
and common and consistent approaches are yet to be developed and 
agreed upon. It may be that such endeavours are under way, but 
based on the few published protocols of ongoing evaluations, this 
seems unlikely [141, 142]. Therefore, renewed efforts are needed 
that integrate the use of biomarkers and epigenetic processes, with 
a view to developing a robust and testable model that traces the 
developmental trajectory of mood disorders within a meaningful 
ecological framework. Ultimately, our focus and energies to such 
an approach is long overdue—especially given that success in 
preventing mood disorders is too great an opportunity to ignore. 
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In our coverage of this area, we emphasize depth, rather than breadth, 
and focus on current leads in the basic mechanisms underlying bi- 
polar disorder and its treatment targets, suggested by recent ad- 
vances, and place these against a background of historical concepts. 
For more encyclopaedic accounts, we point readers in the direction 
of the excellent reviews available [1-3]. We take as our starting point 
the mechanism of action of a drug (lithium) of known efficacy and 
translate this information into a biologically meaningful and uni- 
fying working hypothesis, in which all molecular defects and treat- 
ment targets for bipolar disorder are directly or indirectly linked to 
calcium signalling. We will illustrate the utility of this conceptual 
framework by providing examples of how these pathways encom- 
pass current, and how they could be used to great advantage in the 
future for, drug discovery. 


























Other neuropsychiatric disorders have basic mechanisms and treat- 
ment targets that directly implicate neurotransmitter systems. For 
example, schizophrenia has strong ties to overactive dopamine 
and D2 receptor activation, with D2 receptor antagonists being 
effective antipsychotics. Depression is also empirically linked to 
the serotonergic system as selective serotonin reuptake inhibitors 
(SSRIs) that increase the availability of serotonin in the synaptic 
cleft. In contrast, bipolar disorder does not implicate preferentially 
one neurotransmitter system, but rather is linked to several [2, 4]. 
Moreover, studies have demonstrated that there are changes in al- 
most all the major neurotransmitters in studies using blood samples 
and post-mortem tissues [2]. Therefore, the gold standard treatment 
for bipolar disorder remains lithium (see Chapter 72). 
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Lithium arguably exerts the most specific clinical effect in psychiatry 
and is highly effective in reducing the risk of relapse [3]. Only lithium 











meets the stringent criteria of a true mood stabilizer [5]. Lithium 
works in acute mania and depression and as maintenance therapy, 
and is the only proven anti-suicidal drug [6]. However, mono- 
therapy is rare [4], and though lithium remains the gold standard 
[7], it is often combined with other pharmacological agents such as 
the anticonvulsants valproate, carbamazepine, and lamotrigine, as 
well as dopamine antagonists such as olanzapine and quetiapine [4]. 

The treatment targets of lithium and, by extrapolation, the patho- 
logical targets of bipolar disorder have been the Holy Grail in the 
field ever since lithium’s discovery in 1949. Despite its toxic side ef- 
fects, lithium revolutionized the treatment of bipolar disorder and 
its success has been impossible to replicate with other drugs because 
we do not understand exactly how lithium works at the molecular 
and cellular levels. Lithium’s simplicity—the third lightest element— 
obscures its action on multiple targets and pathways. In the past, 
potential drug targets for lithium have come and gone in response 
to scientific fashion, rather than experimental evidence. This is 
nicely outlined by Belmaker [8] who noted that the suggested tar- 
gets varied through the decades according to the latest knowledge 
of brain function. 

The early years of lithium research focused on neurotransmitter 
systems, and although lithium has definite effects on these, it is now 
becoming increasingly evident that the targets of all mood stabiliza- 
tion lie not at the receptor level, but downstream of it [2]. 

The most effective drugs for treating bipolar disorder share a 
complex pharmacology that modulates second messenger path- 
ways. Evidence for second messenger modulation comes not only 
from lithium, but also from the efficacy of certain anticonvulsants. 
Valproate and carbamazepine both show efficacy towards bipolar 
disorder and affect second messenger systems; in contrast, the anti- 
convulsants topiramate and gabapentin do not show efficacy and 
do not affect second messenger systems, based on current evidence 
[9]. Historically, drugs used to treat bipolar disorder have been 
suggested to act at many intracellular targets within: the adenylyl 
cyclase (AC)/cyclic adenosine monophosphate (CAMP)/cAMP re- 
sponse element binding protein (CREB) pathway; the extracellular 
regulated kinase (ERK)/mitogen-activated protein kinase (MAPK) 
pathway; the phosphoinositide (PI)/inositol-1,4,5-triphosphate 
(IP;)/protein kinase C (PKC)/diacylglycerol kinase (DAGK) 
pathway; and the Wnt/frizzled (Fz)/dishevelled (Dvl)/glycogen syn- 
thase kinase-3 beta (GSK-3B) pathway. Because of the intertwined 
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nature of these signalling networks, with much crosstalk through 
feedback loops, dysfunction in unrelated gene products can give 
rise to similar cellular and neural network dysfunction; similarly, 
drugs with distinct mechanisms of action can all affect a similar 
cellular signalling pathway and yield the same effect on neural ac- 
tivity. Lithium is exceptionally promiscuous because it competes 
for magnesium at binding sites, due to a similar atomic radius, and 
can inhibit all magnesium-dependent processes, depending on its 
concentration. Lithium’s large number of potential targets can be re- 
duced by comparing the concentration of lithium required to inhibit 
a putative therapeutic target with the concentration that is thera- 
peutically relevant (0.6-1.2 mM) [10] and by considering the likely 
biological relevance of a given target. 





Guanine nucleotide-binding proteins (G proteins) 


Substantial evidence implicates G proteins in bipolar disorder 
[2, 3, 11]. G proteins are a large ubiquitous family of proteins that 
transduce signals from the cell surface into the cell (Fig. 69.1). 
The heterotrimeric class of G proteins are made up of three sub- 
units (alpha, beta, and gamma) and couple signals from activated 
G protein-coupled receptors (GPCRs) to enzymes (effectors) that 
generate second messengers. The effector enzyme and second mes- 
senger pathway is largely dictated by the alpha subunit. The alpha 
subunits Gs (stimulatory) and Gi (inhibitory) control the activity of 
adenylyl cyclase, which makes cAMP. Conversely, the alpha subunit 
Gq activates phospholipase C (PLC) beta. The pathways have nu- 
merous points where they can interact to increase or decrease the 


Stimulus 





Fig. 69.1 A simplified mechanism illustrating heterotrimeric G protein 
signalling. 

Adapted from Cell Signalling Biology, 6, Berridge MJ, Module 2: Figure heterotrimeric 
G protein signalling, Copyright (2014), with permission from Portland Press Limited. 


function of the other. In this manner, cAMP can amplify or inhibit 
calcium signals, and vice versa. 

When a GPCR is activated, it causes a conformational change 
that results in guanosine diphosphate (GDP) being exchanged for 
guanosyl triphosphate (GTP), causing dissociation of the G, and 
G,, subunits, which then activate a variety of signalling pathways, 
depending on the location and coupling to second messengers and 
effectors (Fig. 69.1). Eventually, GTP is hydrolysed to GDP by the in- 
trinsic enzymatic activity of the G, subunit or the presence of other 
enzymes that hydrolyse GTP, and it re-joins the Gg, subunit to revert 
to the inactive trimer. This ‘off-mechanism is governed by different 
regulatory enzymes such as GTPase activating proteins which fa- 
cilitate the G, subunit’s intrinsic GTPase activity. For more detailed 
information, see a review [12]. Additionally, there is a family of G 
protein receptor kinases (GRKs) which work to phosphorylate the 
activated receptors directly and prevent the binding of G proteins 
[12] (Fig. 69.1). Evidence suggests that lithium does not act directly 
on G proteins but, after chronic treatment, leads to the stabilization 
of the inactive heterotrimeric state [3, 13]. 

Based on the hypothesis that overactive GPCR signalling is 
causal in bipolar disorder, lithium could dampen overactivation by 
stabilizing the inactive heterotrimeric G protein, leading to a reduc- 
tion in G protein-effector coupling. Neural hyperactivity in bipolar 
disorder is based on several lines of evidence. Post-mortem, as well 
as genetic, studies have shown that there exist regulatory differences 
in GRKs in affective disorders [14]. More specifically, a single nu- 
cleotide polymorphism of the GRK-3 gene has been associated with 
bipolar disorder [2]. Other findings show hyperfunctional G pro- 
tein function in leucocytes [14] and increased G, subtypes in post- 
mortem brain tissue, platelets, and leucocytes [2]. This is suggestive 
of an overall amplification of signalling, as the receptors are less 
likely to be phosphorylated and therefore inactivated. Hence, it is 
feasible that lithium’s therapeutic effects may be related to its ability 
to reduce the increased signalling as a virtue of a polymorphism in 
GRK-3. When tested in vitro in platelets from bipolar manic patients, 
valproate also showed evidence of reducing G protein coupling, and 
thereby attenuating signalling [15]. 


AC/cAMP/CREB pathway 


The initiation of the cAMP signalling pathway occurs when a hor- 
mone or neurotransmitter binds to the receptor and induces either 
an increase (Gas coupled) or a decrease (Gai coupled) in AC ac- 
tivity. The latter is a dimeric enzyme which catalyses the conversion 
of adenosine triphosphate (ATP) to cAMP, which further amplifies 
the signalling by acting on numerous effector proteins (Fig. 69.2), 
the most well characterized of which is protein kinase A (PKA). 
Many physiological effects of cAMP are mediated by PKA (Fig. 
69.2), which phosphorylates its targets to increase or decrease cell 
signalling-related physiological effects. Overall, PKA activation in- 
creases cytosolic calcium via entry through L-type Cav1.1 and L- 
type Cav1.2 channels. Another important action of PKA is its ability 
to induce gene transcription via the transcription factor CREB. The 
catalytic subunit of PKA translocates to the nucleus and phosphor- 
ylates CREB, which, in turn, binds to the DNA promoter region 
and turns on or off the transcription of certain genes. Some genes 
regulated by CREB relevant to bipolar disorder include the ones 
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Fig. 69.2 Summary of the control of cyclic AMP levels and its cellular targets. 
Adapted from Cell Signalling Biology, 6, Berridge MJ, Module 2: Figure heterotrimeric G protein signalling, Copyright (2014), with permission from Portland Press Limited. 


that code for brain-derived neurotrophic factor (BDNF), tyrosine 
hydroxylase, and circadian regulators. 

Lithium produces complex effects on this cAMP signalling 
pathway [2, 3]. It inhibits cAMP accumulation in response to stimu- 
lation; however, paradoxically, it increases basal, and in some cases 
stimulated, AC activity, resulting therefore in an increase in basal 
cAMP [16]. Moreover, lithium’s effects on cAMP and AC are tissue- 
specific [16, 17]. It has also been suggested that chronic lithium 
treatment might be producing these effects on G proteins, rather 
than on AC directly, as these effects were reversible in vitro only 
when increasing concentrations of GTP [18] were added, and not 
by increasing magnesium. If the effect of lithium were mediated by 
inhibition of AC, increasing magnesium would reverse the effect, 
as it does during acute treatment. Overall, chronic treatment with 
lithium showed reduced CREB phosphorylation in rat brains [2], 
indicative of an attenuation of this pathway, at least with respect to 
CREB activity. By comparison, valproate also decreased cAMP pro- 
duction on stimulation, but chronically it did not affect CREB. 


Phosphoinositide pathway 


Much of the early work implicating phosphoinositide signalling in 
the pathogenesis of bipolar disorder and as providing targets for 


drugs was equivocal because of the limitations of the experimental 
models and chemical tools available at the time. Now with new 
models, such as mice with genes knocked out for IMPase 1, IMPase 
2, and myo-inositol uptake, combined with a blood-brain barrier- 
permeant putative lithium mimetic ebselen and with L690330 de- 
livered directly into mouse brains [19], the phosphoinositide area is 
experiencing a renaissance. 

Receptors that employ this pathway are coupled to Gaq, which 
activates phospholipase C (PLC), which catalyses the hydrolysis 
of phosphotidyl inositol 4,5-biphosphate (PIP,) to diacylgycerol 
(DAG) and IP, (Fig. 69.3). Both IP, and DAG then go on to control 
several physiological processes, before being recycled back to PIP, 
via a series of intermediary phosphoinositides. As Gq is linked to 
nearly all of the known neurotransmitter systems (noradrenergic, 
serotonergic, glutamatergic, cholinergic, as well as dopaminergic 
through heterodimers), it is perfectly plausible that lithium might 
be regulating mood via its actions on phosphoinositide signalling. 

Allison and Stewart first showed that inositol levels were de- 
creased in rat brains treated with lithium and postulated that this 
reduction might be due to lithium interfering with active transport, 
synthesis, or incorporation into phosphoinositides [20]. Although 
myo-inositol levels decreased in the cerebral cortex of rats treated 
with lithium, levels of Ins1P increased [21], suggesting inhibition 
of the enzyme inositol monophosphatase (IMPase), which converts 





WZ" SECTION 11 Bipolar disorder 





Stimulus 






PtdIns —» Ptdlns4P ——» Ptdlns4, 5P, 


| © 
Valproate _ 


De novo 


Lithium 





Ox 


Inositol 


monophosphatase 
í PtdIns ; | 








synthesis 
ey o m 


Plasma inositol 
Na* 





Glycerol 
Arachidonic 
acid 





Recycling 


Dietary 
inositol 





Inositol | <— 


© 






PtdIns 
synthetase 






PtdIns CDP.DAG 







Endoplasmic reticulum 







Fig. 69.3 Summary of the phosphoinositol cycle and the possible mechanism of action of lithium and valproate. 
Adapted from Cell Signalling Biology, 6, Berridge MJ, Module 2: Figure heterotrimeric G protein signalling, Copyright (2014), with permission from Portland Press Limited. 


Ins1P into inositol. Hallcher and Sherman demonstrated that at 
therapeutically relevant concentrations, lithium uncompetitively in- 
hibits IMPase [22]. This mechanism of uncompetitive inhibition is 
rare and was an important consideration in the development of the 
inositol depletion hypothesis proposed by Berridge in 1982 [23, 24]. 


Inositol depletion hypothesis 


The inositol depletion hypothesis proposes an elegant scheme for 
lithium’s therapeutic efficacy in bipolar disorder [23, 24]. It is hy- 
pothesized that in bipolar patients, the phosphoinositide signalling 
pathway is hyperactive (Fig. 69.4). Lithium, by inhibiting IMPase, 
impedes this pathway by depleting the cell of inositol, which is re- 
quired for the regeneration of active signalling molecules such as 
IP; (Figs. 69.3 and 69.4). Moreover, lithium inhibits the enzyme 
uncompetitively [22], that is, the level of inhibition intensifies with 
increasing substrate concentration. Therefore, lithium is more ef- 
fective on pathways that are hyperfunctional, and not as much on 
those pathways that are effective at basal levels. Interestingly, IMPase 
is also at the juncture of another inositol recycling pathway. Glucose 
6-phosphate is converted by 1-D-myo-inositol-3-phosphate syn- 
thase (MIPS) into Ins1P [23, 24], which is then converted to inositol 


by IMPase (Fig. 69.3). Therefore, inhibition of IMPase ensures that 
both pathways (inositol recycling and synthesis) in the brain are 
inhibited. 


Inositol and the brain 


The brain is uniquely sensitive to inositol depletion, because poor 
blood-brain barrier permeability of inositol renders it heavily de- 
pendent on recycling (Figs. 69.3 and 69.4), in contrast to tissues in 
the periphery which have easy access to dietary inositol. Although 
this depletion of inositol phosphates may be relatively small, as in- 
ositol phosphates comprise a small component of the cellular ma- 
chinery, the changes they can influence may be of considerable 
proportion. A small change in inositol phosphatases may produce 
an amplification (and flux) leading to substantial cellular signalling 
and gene expression, which ultimately underlies lithium’s thera- 
peutic efficacy in bipolar disorder. 

The premise that inositol signalling is a valid therapeutic target 
in the treatment of bipolar disorder is strengthened by the fact that 
both valproate and carbamazepine also cause a depletion of inositol 
phosphates, although not always by inhibiting IMPase. Valproate 
and carbamazepine inhibit MIPS [25] (Fig. 69.3). Furthermore, all 
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Fig. 69.4 Diagram illustrating the inositol depletion hypothesis whereby overactive signalling in neurons can be corrected by slowing the 


inositol cycle. 


Adapted from Cell Signalling Biology, 6, Berridge MJ, Module 2: Figure heterotrimeric G protein signalling, Copyright (2014), with permission from Portland Press Limited. 


three mood stabilizers lithium, valproate, and carbamazepine in- 
hibit the uptake of myo-inositol from blood into the brain using the 
sodium/myo-inositol transporter 1 (SMIT1) [26]. 


Evidence for inositol depletion 


Besides the observation that lithium, valproate, and carbamazepine 
all deplete inositol, the strongest evidence for the inositol depletion 
hypothesis comes from lithium’s inhibition of IMPase at therapeutic 
concentrations. This inhibition causes a decrease in inositol and PIP,, 
which can be measured. Shimon et al. in 1997 showed decreased 
levels of inositol in the prefrontal cortex in post-mortem brains 


of bipolar patients using gas chromatographic techniques [27]. 
Magnetic resonance spectroscopy has also been extensively used, 
with mixed findings [28], but several studies have found decreased 
levels of myo-inositol in the frontal lobe of bipolar patients treated 
acutely and chronically with lithium, compared to controls [28]. 
Additionally, increased levels of phosphomonoesters (including 
Ins1P) have been shown to be increased in the frontal lobe of both 
bipolar manic and bipolar depressed patients treated with lithium 
[28]. According to the inositol depletion theory, a decrease in in- 
ositol leads to a reduction in phosphoinositides, such as PIP,, a pre- 
cursor for IP}. This was demonstrated by showing that reduced PIP, 
levels were found in platelets from bipolar I patients treated with 
lithium, compared to control volunteers [29]. Preliminary evidence 
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also comes from genetic linkage studies that have suggested a sus- 
ceptibility locus for bipolar disorder on chromosome 18p11, the 
location of one of the genes that codes for IMPase [30]. There is 
a caveat that this locus, like that for GRK-3, has not yet been con- 
firmed in subsequent meta-analyses (see Chapter 70). 


Ebselen: a blood-brain barrier penetrant 


IMPase inhibitor 





Lithium has wanted and unwanted effects that are concentration- 
dependent, but the therapeutic index is small. It does not work in 
all patients; side effects are unwanted and lead to non-compliance, 
and compared to other drugs, there is an increased burden on phys- 
icians as blood levels must be monitored. Therefore, there has been 
an effort to find a ‘lithium mimetic’: a drug with lithium’s advantages 
without its disadvantages. One strategy is to find agents that modu- 
late the target of lithium. However, as described previously, this is 
unknown. An approach can then be taken whereby a putative target 
is explored. 

Taking inositol monophosphatase as a putative target has resulted 
in several efforts, both commercial [31] and academic [32-34], to 
find inhibitors. On this background, an effort in drug rescuing was 
undertaken using the NIH Clinical Collection, which is a library of 
drugs known to be safe and with a history of use in humans but 
that have not been marketed due to clinical trials showing a lack 
of efficacy in a given indication or due to business reasons [35]. 
Using isolated IMPase revealed an inhibitor called ebselen. Ebselen 
was originally designed as an antioxidant and was used in clinical 
trials for stroke and atherosclerosis [36]. In contrast to all previous 
studies with inhibitors of inositol monophosphatase [37], ebselen 
was bioavailable and blood-brain barrier-permeant. As ebselen ex- 
hibits polypharmacology (in drug discovery, this is a term for one 
drug affecting many targets), it alone cannot reveal the mechanism, 
but it inhibits IMPase at 40-fold lower concentrations than GSK36, 
the other most likely target of lithium’s therapeutic action. Ebselen 
exhibited certain lithium-like behavioural effects in animal models 
that were reversed in vivo by the administration of inositol, strongly 
indicating IMPase inhibition as the mechanism [33, 34]. More dir- 
ectly, ebselen reduced myo-inositol in both mouse brains, as well as 
in the anterior cingulate cortex in healthy volunteers, consistent with 
the inositol depletion hypothesis [23, 24]. Ebselen also decreased 
impulsivity [38], which may relate to lithium’s anti-suicidal effects. 


Protein kinase C and diacylglycerol kinase 


PKC is a serine/threonine kinase that phosphorylates substrates to 
activate or inactivate them and thus modulates cell signalling path- 
ways. PKC itself is activated by DAG and calcium (Ca”’). Although 
lithium does not directly inhibit PKC, chronic administration de- 
creases the activity of two isoforms of PKC—PKCa and PKCe—in 
the frontal cortex and hippocampus [39], without affecting the other 
PKC isoforms [40]. PKC, as a component of the calcium signalling 
pathway discussed previously, can modulate glutamatergic, 
noradrenergic, and serotonergic pathways in the CNS, indicative 
that it may be important for mood stabilization [39]. PKC shows 
increased activity in platelets from patients in the manic phase [41] 


and in post-mortem brains [42], which is consistent with a hyper- 
active phosphoinositide cycle in bipolar disorder. Tamoxifen, an 
anti-oestrogenic drug that inhibits PKC at high concentrations, 
showed anti-manic effects in some small trials [43], highlighting 
a possible role for PKC in bipolar disorder. After chronic treat- 
ment, both lithium and valproate show a reduction in PKC activity, 
through differing mechanisms [15, 39, 44]. 

DAGK converts DAG to phosphatidic acid, an essential step in the 
resynthesis of PIP, from inositol (Fig. 69.3). Additionally, DAG also 
activates PKC isoforms, along with calcium. Therefore, hyperactive 
production or reduced removal of DAG may further result in PKC 
hyperactivity. In a genome-wide association study, a strong link be- 
tween bipolar disorder and DAGK was observed [45]. This finding 
has been replicated in some, but not all, studies [46]. Although 
lithium or valproate have not shown any direct effects on DAGK, as 
it is an upstream regulator of PKC, it might be another valid target in 
the treatment of bipolar disorder. 


GSK-3 is another serine/threonine protein kinase that modulates 
substrates by phosphorylation. It has two isoforms GSK-3a and ß, 
which have essentially the same function. GSK-3 phosphorylation 
of glycogen synthase renders it inactive. Lithium was shown to in- 
hibit GSK-3 uncompetitively at therapeutic concentrations with an 
IC, of about 2 mM [47]. This suggests that there is partial inhibition 
of GSK-3 in vivo (therapeutic concentration of approximately 0.8 
mM). However, some evidence indicates that lithium also inhibits 
the phosphatase that activates GSK-3, thereby reinforcing the inhib- 
ition of GSK-3 activity [48]. GSK-3 has a role in neuroplasticity and 
cellular resilience, and it is believed that lithium’s neuroprotective 
effects may be partially mediated by GSK-3 inhibition [49]. 

Circadian rhythm disruptions are seen in bipolar patients, and 
lithium corrects these, possibly through GSK-36 inhibition [49]. 
Moreover, GSK-3 is at the crucial juncture of neurotransmitters, 
like noradrenaline, dopamine, and 5-HT, and gene transcription. 
GSK-3 inhibition may also be involved in the teratogenic effects 
seen with lithium treatment [47]. Overall, GSK-3 has only recently 
been considered as a target for lithium and hence requires further 
investigation to understand lithium’s effects via GSK-36. Given that 
lithium must be dosed chronically to be effective, more studies are 
needed. Valproate does not directly inhibit GSK-3ß at therapeutic 
concentrations [49]. 

Although GSK-3ß has been contested vociferously as a likely 
target for lithium’s efficacy, there are substantial caveats. For an ex- 
cellent review, see [50]. Briefly, the phosphorylation patterns are 
inconsistent with lithium therapy. Additionally, inhibiting GSK-38 
signalling causes stabilization of b-catenin and activation of the ca- 
nonical Wnt signalling pathway [51]. The Wnt pathway is strongly 
associated with oncogenic changes, and even a moderate surge in 
B-catenin promotes oncogenesis [50]. 





Calcium signalling as a central unifying concept 


The best experimental evidence, both historical and current, from 
pathophysiology, genetics, and drugs is converging on calcium 
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signalling. Calcium controls all facets of brain function, including 
axonal growth, memory, excitability, brain rhythms, gene tran- 
scription, neuronal firing, neurotransmitter uptake, hormone re- 
lease, mitochondrial activity, neurogenesis, and apoptosis [51, 52]. 
Thus, calcium can control brain activity in several ways, including 
effects on neural circuit activity generated through interactions 
between excitatory and inhibitory neurons and short-term events 
that underlie neural excitability, as well as long-term processes 
such as gene transcription (Fig. 69.5). Calcium relays and inte- 
grates signals from extracellular messengers (neurotransmitters and 
neuromodulators) and other intracellular signalling pathways, to 
affect neuronal excitability over the short term and gene transcrip- 
tion over the longer term. Therefore, calcium is uniquely placed at 
a key position to influence neural activity, growth, and neural plas- 
ticity. Indeed, excessive calcium signalling has been hypothesized 
to drive the neuronal excitatory-inhibitory imbalance that can ex- 
plain mood swings in bipolar disorder [52]. Although there is great 
complexity in the number and interactions of genes and molecular 
players in calcium signalling (Fig. 69.5), it does not mean that it will 
take a drug for each defect; rather, promisingly, it means that there 
are a great number of ways to intervene with a single drug that would 
affect the entire pathway. 





An overview of calcium signalling 


Unlike other messengers that can be created and destroyed, calcium 
can only be moved around in time and space (Fig. 69.6). Cytosolic 
calcium concentration is at steady state and is the sum of the activity 
of all channels and pumps at both the plasma membrane and intra- 
cellular organelles. Extracellular calcium is approximately 1 mM and 
intracellular cytosolic calcium is approximately 100 nM, so there 
is a chemical gradient of about 10,000-fold, as well as an electrical 
gradient combined to drive calcium into a cell. Inside cells, many 
organelles also store calcium to create approximately the same con- 
centration gradient. Maintaining low cytosolic calcium is essential, 
as calcium itself is inherently toxic as it precipitates phosphates and 
phosphate-containing molecules and induces cell death through 
apoptosis. The cell expends much energy maintaining low cytosolic 
calcium through pumps at the plasma membrane such as the plasma 
membrane calcium-ATPase and a sodium/calcium exchanger. The 
endoplasmic reticulum is the major intracellular calcium store and 
has the sarco(endo)plasmic reticulum calcium ATPase (SERCA) on 
it. Mitochondria take up calcium through an electrogenic uniporter 
that operates in the presence of high cytosolic calcium in micro- 
domains close to channels. 

In the cytosol, calcium is heavily bound by immobile binding pro- 
teins (buffers), which greatly limits its diffusion. This enables calcium 
increases to be very rapid and localized, or as calcium can stimulate 
its own release at certain channels, the responses can be greatly amp- 
lified in time and space. This highlights that not all calcium within a 
cell has the same effect—what matters is the spatial distribution, the 
timing, and the amplitude [51]. Indeed, different frequencies of cal- 
cium spikes can drive different transcription factors. Similarly, the 
amplitude of localized calcium responses in neuronal growth cones 
controls attractive or repulsive migration towards a chemotactic 
substance. There are many families and isoforms of each pump and 
channel to provide for fine-tuning calcium signals and their cellular 


responses (Fig. 69.5). In one example, there is plasma membrane to 
nucleus signalling in which the calcium emanating from a Cav1.2 
channel can result in changes in gene transcription, whereas calcium 
emanating from an NMDA receptor channel is far less effective [52]. 
This selectivity results from better coupling to MAPK and cAMP 
signalling cascades through calcium. 

In neurons, calcium signalling can operate in a primary or 
modulatory mode [53] (Fig. 69.7). In the primary mode, calcium 
enters through channels in the plasma membrane, which may be 
amplified by ryanodine receptors through calcium-induced cal- 
cium release and often drives presynaptic vesicle fusion and neuro- 
transmitter release. In the modulatory mode, calcium signalling 
is controlled through the IP,/calcium pathway (Fig. 69.7). This 
modulatory calcium regulates membrane excitability and, in turn, 
neural networks and mood. Alteration in the function of compo- 
nents in either the primary or the modulatory pathways can con- 
tribute to the onset and treatment of bipolar disorder. 


Plasma membrane calcium channels 


The plasma membrane contains many diverse families of calcium 
channels, all with unique activation and physiological function and 
classified as operating in the primary mode, as defined previously 
(Fig. 69.7). Voltage-operated calcium channels mediate calcium in- 
flux upon plasma membrane depolarization and are classified as L-, 
N-, and P/Q- types (Fig. 69.5). Calcium influx controls physiological 
responses such as neurotransmitter release and gene expression [51- 
53]. Receptor-operated types respond to neurotransmitters such as the 
NMDA receptor that responds to glutamate and the 5-HT;, receptor 
that responds to serotonin. There are also second messenger-operated 
channels (SMOCs) that respond to calcium, cyclic nucleotides [for 
example, the cyclic nucleotide-gated channel (CNGC)], and arachi- 
donic acid (AA), as well as store-operated channels (SOCs) that re- 
spond to the calcium load in the endoplasmic reticulum (Fig. 69.8). 
Intracellular calcium can modulate membrane excitability by acting 
on other channels such as inhibition of the KV7.2/KV7.3 channels 
responsible for the hyperpolarizing M current or stimulation of the 
non-selective cation channel leading to a depolarizing current. 


Intracellular calcium channels 


Calcium signals that arise from intracellular channels operate in 
the modulatory mode and are common to all types of neurons. The 
regulation is also subtler in that it will influence excitability, but 
not actually cause firing of action potentials. There are two well- 
characterized families of intracellular calcium channels: the IP, re- 
ceptor and the ryanodine receptor (Fig. 69.7). The IP; receptor is 
under control of extracellular signals, including neurotransmitters, 
neuromodulators, and hormones when they act through GPCRs or 
receptor tyrosine kinases or through the phosphoinositol pathway, 
as described previously (Fig. 69.5). In this pathway, a soluble mes- 
senger IP; is generated and binds to, and opens, a ligand-gated ion 
channel called the IP; receptor, located on the endoplasmic re- 
ticulum, to release calcium. The ryanodine receptor is primarily 
regulated by calcium through a process termed calcium-induced 
calcium release, but it is also sensitized by the second messenger 
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Fig. 69.5 Calcium signalling plays a central role as a transducer of signal inputs and outputs and can be finely tuned by selecting components 
(channels, pumps, and effectors) from the calcium signalling toolkit. The green boxes illustrate the membrane calcium channels, whereas the red boxes 
are the pumps and exchangers that move calcium either out of the cell or back into the endoplasmic reticulum (ER). The purple boxes represent the 
buffers located in the cytoplasm or in the ER. 

Adapted from Cell Signalling Biology, 6, Berridge MJ, Module 2: Figure heterotrimeric G protein signalling, Copyright (2014), with permission from Portland Press Limited. 
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Fig. 69.6 The ‘on’ and ‘off’ of calcium signals is controlled by channels and pumps at the plasma membrane and on organelles. PMCA, plasma 
membrane Ca**-ATPase; SERCA, sarco-/endoplasmic reticulum Ca**-ATPase. 


Adapted from Cell Signalling Biology, 6, Berridge MJ, Module 2: Figure heterotrimeric G protein signalling, Copyright (2014), with permission from Portland Press Limited. 


cyclic adenosine diphosphate (ADP) ribose. Calcium-induced cal- 
cium release amplifies calcium responses in amplitude, time, and 

space (Fig. 69.9). The IP; receptor also shows this phenomenon. 
Both channels have very large cytosolic heads with sites for regula- 
tion by almost all kinases, to sense calcium, ATP, redox, and other 


metabolites, so that they can integrate many facets of cellular status 
and integrate these into a calcium signal. 


In terms of drugs, most bipolar patients respond to lithium and val- 
proate. As these are mood stabilizers and buffer swings to either pole, 
they are telling us something about the underlying regulation. These 
drugs have different chemical structures and possibly different mo- 
lecular targets. They unify in their action at the level of intracellular 
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Fig. 69.7 In neurons, calcium acts in either a primary or a modulatory mode. The primary calcium signal is generated by receptor-operated channels 
(ROCs) or voltage-operated channels (VOCS) that promote the entry of external calcium. This primary signal can be augmented by calcium-stimulating 
ryanodine receptors (RYRs) to release calcium from internal stores. The modulatory calcium signal is generated by agonists operating through InsP3 
and calcium release from intracellular stores. 


Adapted from Physiol. Rev. 96(4), Berridge MJ, The Inositol Trisphosphate/Calcium Signaling Pathway in Health and Disease, pp. 1261-96, Copyright (2016), with permission 
from the American Psychological Society. 
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Fig. 69.8 Mechanisms of calcium entry through agonist-operated calcium channels on the plasma membrane. The most direct mechanism occurs 
in the receptor-operated channels (ROCs) where the agonist binds directly to the ion channel. Agonists can also activate entry indirectly by recruiting 


various internal signalling pathways to activate second messenger-operated channels (SMOCs) or a store-operated channel (SOC). 
Adapted from Cell Signalling Biology, 6, Berridge MJ, Module 2: Figure heterotrimeric G protein signalling, Copyright (2014), with permission from Portland Press Limited. 
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Fig. 69.9 Calcium-induced calcium release (CICR) amplifies calcium increases in space. Calcium influx activates either ryanodine receptors or 
IP; receptors, which sets off a chain reaction, resulting in a calcium wave (yellow arrows). R, ryanodine receptors; |, IP, receptors; ER, endoplasmic 


reticulum; VOCs, voltage-operated channels. 
Adapted from Cell Signalling Biology, 6, Berridge MJ, Module 2: Figure heterotrimeric G protein signalling, Copyright (2014), with permission from Portland Press Limited. 
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signal transduction, at the pathways leading to and from calcium 
signalling (Fig. 69.5). Efficacious drugs do not induce an all-or-none 
effect, but a subtle modulation of calcium release and uptake which 
creates an environment where variations from the mean are prohib- 
ited. The calcium signalling pathway provides for numerous unex- 
plored potential treatment targets for new drugs. 


Evidence for calcium signalling defects 





in peripheral cells 


Elevated intracellular Ca? is recognized as a marker of bipolar 
disorder [54, 55]. In platelets and lymphocytes from patients with 
bipolar disorder, both baseline and agonist-stimulated levels of cal- 
cium were greater, consistent with overall calcium overload [55, 56]. 
Moreover, thapsigargin, an inhibitor of the SERCA calcium pump 
(Figs. 69.5 and 69.6), also showed an increase, indicating increased 
storage of calcium [55]. Increased cytosolic calcium occurred in 
both depression and mania [56]. However, elevated basal calcium 
concentrations have been detected in transformed B lymphoblasts 
from patients with bipolar I disorder, compared with those with 
bipolar II disorder or major depression or healthy controls [55]. 
Increased total serum and ionized calcium have also been reported 
in euthymic lithium-treated patients, compared to healthy controls 
[57]. Thus, calcium homeostasis appears altered across all mood 
states in bipolar disorder and treatment relating to bipolar disorder 
[2]. The mechanistic origins of these elevations in calcium are un- 
known, but suggestions have included dysfunction of almost all cal- 
cium pumps and channels (Fig. 69.5) [55]. 


Evidence from iPSCs, genetics, and calcium 


antagonist drugs implicates calcium channels 





An obvious limitation of studies with peripheral cells is whether 
calcium signalling in these cells accurately represents signalling 
in neurons. Recent studies with induced pluripotent stem cells 
(iPSCs), fibroblasts dedifferentiated and then redifferentiated into 
neurons, also add support to a defect in calcium signalling that can 
be normalized with drugs used to treat bipolar disorder. For ex- 
ample, iPSC neurons derived from fibroblasts taken from lithium 
responders showed hyperexcitability and signalling defects in both 
calcium and mitochondria [58]. Importantly, this hyperactivity and 
calcium signalling were corrected with lithium when the fibroblasts 
were taken from lithium responders [58]. Additionally, compared 
to controls, iPSC neurons derived from bipolar patients exhibited 
dysregulated calcium transients and wave amplitudes that were cor- 
rected with lithium [59]. 

One of the most promising findings is that a component of the 
calcium signalling pathway appears to be both involved in bipolar 
disorder and a therapeutic target. Specifically, the voltage-operated 
calcium channel Cav1.2, for which the gene CACNAI encodes its 
alpha subunit, which forms the pore, voltage sensor, and drug- 
binding sites (Fig. 69.10), is implicated by several distinct sources of 
evidence [54]. CACNAIC is consistently identified in genome-wide 
association studies (GWAS) [60, 61]. 

As with most findings in bipolar disorder, this hypothesis has not 
been fully tested but is tantalizing and theoretically straightforward. 


Basic mechanisms of and treatment targets for bipolar disorder 


Evidence comes from peripheral cells taken from bipolar patients, 
the efficacy in certain individuals with bipolar disorder of drugs 
that block voltage-operated calcium channels, and GWAS [54]. The 
voltage-operated calcium channel Cavl.2 is encoded by the gene 
CACNA1, and this has turned up reproducibly in several GWAS and 
has the strongest evidence of any GWAS hit [61]. Interestingly, the 
risk locus is within the large third intron (non-coding), and there- 
fore, it likely affects function through the level of expression or al- 
ternative splicing. Indeed, when iPSCs were made from individuals 
with this variant, the cells had elevated mRNA expression and more 
calcium influx [62]. Results from GWAS reveal polymorphisms in 
the alpha 1C subunit of the L-type voltage-gated calcium channel 
gene (CACNAIC) as a risk factor for bipolar disorder. In regard 
to iPSCs, cells derived from bipolar patients and patients with 
CACNA1 (which encodes the L-type calcium channel Cav1.2 al) 
risk genotype exhibit aberrant calcium channel gene expression and 
functional calcium signalling. The risk variant is likely associated 
with increased channel expression and function. 

Consistent with the involvement of Cav1.2 is that calcium channel 
blockers designed to treat cardiovascular disorders have been used 
to treat bipolar disorder. Although all three main groups of cal- 
cium channel blockers have been tested—dihydropyridines (for 
example, nifedipine), phenylalkylamines (for example, verapamil), 
and benzothiazepines (for example, diltiazem), the results are con- 
flicting and lack definitive evidence for efficacy [54]. Problems in- 
clude cohort size and low statistical power, lack of penetration of 
the blood-brain barrier, and lack of selectivity for isoforms found 
in the brain, limiting the dose to that tolerable for peripheral side 
effects. A recent meta-analysis concluded that the current evidence 
is equivocal but deserves proper study, both through design and 
through the development of blockers selective for the brain splice 
variant isoforms [54]. 

Theoretically, the involvement Cav1.2 will have profound effects 
on neural function, as it regulates both excitability and synaptic plas- 
ticity [52]. Indeed, in a model system in which a mutation in Cav1.2 
in mice gives rise to Timothy syndrome, in humans, this shows ef- 
fects on synaptic plasticity and spatial learning [63]. Mutations in 
Cav1.2 result in Timothy syndrome, dysfunction in multiple organs, 
and severe cognitive impairment [52]. 


Future drug discovery 


All drugs used in bipolar disorder were either approved before cur- 
rent rigorous regulatory and ethically sound clinical trials (lithium) 
or were repurposed from other uses as antipsychotics and anti- 
epileptics. The community hopes to pursue rational drug design, 
in which our understanding of the pathophysiological or pharma- 
cological mechanism will reveal targets for which we can develop 
drugs. Impeccable logic, but unfortunately there exist precious few 
examples where this has been achieved. The problem is that, of the 
many suggested therapeutic targets, none have been rigorously 
tested, and as the focus of biological interests in neuroscience has 
shifted, so has interest in possible targets. As none are validated or 
invalidated, we exist in a form of purgatory. Also, all mood stabil- 
izers exhibit polypharmacology, making therapeutic target identifi- 
cation difficult. We need to move forward, using the best imperfect 
knowledge we have. For example, the putative targets of lithium have 
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Fig. 69.10 Structure and regulation of the Cav1.2 L-type calcium channel implicated in bipolar disorder. PKA, protein kinase A; AKAP18, A-kinase- 


anchoring protein; PKC, protein kinase C; CaM, calmodulin. 


Adapted from Cell Signalling Biology, 6, Berridge MJ, Module 2: Figure heterotrimeric G protein signalling, Copyright (2014), with permission from Portland Press Limited. 
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The earliest studies of the genetic basis of bipolar disorder (BD) 
were based on families, twins, and adoptees and used epidemiologic 
methods to draw their conclusions (Fig. 70.1). These studies pro- 
vided the first systematic evidence of a genetic basis for BD, setting 
the stage for the molecular genetic studies of recent years. 


Family li 





The tendency of illnesses to run in families is one of the most ob- 
vious observations giving rise to genetic hypotheses. Familial trans- 
mission of BD was reported long before modern concepts of the 
illness [1, 2] were widely adopted. In one of the earliest systematic 
family studies, Slater [3] described rates of ‘manic-depressive dis- 
order among the parents and children of diagnosed patients that 
were remarkably close to the estimates of modern studies using 
criterion-based diagnoses and family study [4] or population-based 
methods [5]. 

Family studies may reveal patterns of transmission that can give 
clues about the nature of genetic influences. In segregation analysis, 
observed patterns of transmission in families are compared to pat- 
terns that would be expected under various inheritance models such 
as autosomal dominant or X-linked recessive. Many such studies 
were carried out for BD in the 1980s and 1990s (reviewed in [6]). 
While some studies reported transmission patterns that were most 
consistent with a single major (dominant) gene, most could not con- 
fidently reject more complex, multifactorial models. Such incon- 
sistency probably reflects what we now know to be the considerable 
genetic heterogeneity that underlies BD. 

Family studies have also been used to explore the range of pheno- 
types that cluster among close relatives. Such studies can give clues 
to the variable expressivity of shared genetic factors. Most family 
studies of BD have used a proband-wise, case-control approach, 
comparing the rates of various illnesses among the relatives of ‘cases’ 
diagnosed with BD vs relatives of psychiatrically healthy ‘controls. 
Results of the largest studies are reviewed in [6]. Most studies found 
that 10-15% of first-degree relatives of people diagnosed with bi- 
polar I disorder met criteria for a major mood disorder themselves 
when directly interviewed, although the disorder could range from 
major depression to schizoaffective disorder. Increased rates of 
other disorders were also found, including alcoholism and anxiety 





disorders. Some studies also reported an increased rate of schizo- 
phrenia, but this was an inconsistent observation [4, 7, 8]. 

Some family studies have examined clinical features of BD or 
its associated neurocognitive traits. These studies have found that 
age at onset [9], psychotic features [10], frequency of episodes [11], 
polarity (manic or depressive) of the initial episode [12], suicidal 
behaviour [13], and response to lithium treatment [14] are all fa- 
milial. Studies of neurocognitive traits have also reported a number 
of measures that are associated with BD in families [15]. Some traits, 
such as resting state EEG and deficits in facial recognition, also 
seem to occur in unaffected relatives, suggesting that they reflect 
a subthreshold liability distinct from recognized clinical manifest- 
ations of the illness [16, 17]. Daily activity patterns also seem to run 
in families and are associated with BD in some studies [18]. So far it 
has proven difficult to define robust subtypes of BD based on any of 
these traits, but this is an area of active investigation. 

Population-based family studies have only become possible in re- 
cent years, thanks to universal health registries and electronic health 
records. Lichtenstein and colleagues have exploited the electronic 
records of the Swedish health system to link related individuals who 
received psychiatric diagnoses at multiple time points. These studies 
have shown results that largely agree with the proband-wise case- 
control studies, but with much larger sample sizes, reduced recall 
bias, and more complete ascertainment [5]. The results support the 
finding that schizophrenia is increased among the close relatives of 
individuals diagnosed with bipolar I disorder, but at less than half 
the rate of major mood disorders [19]. 

While they have provided some important clues to the familiality 
of BD, its complex mode of transmission, and its familial clustering 
with other diagnoses, family studies in themselves provide only 
weak evidence in support of genetic aetiology. Families share en- 
vironments, life experiences, and habits that could easily contribute 
to illness, with no necessary contribution of genes per se. Other 
study designs are needed to dissociate the effects of genes and family 
environment. 


Twin studies 


Twin studies allow a more unbiased estimate of genetic influences, 
since they control for shared non-genetic influences that compli- 
cate the interpretation of family studies. Twin studies typically com- 
pare monozygotic (MZ or identical) twins—who inherit the same 
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DNA—with dizygotic (DZ or non-identical) twins—who share, on 
average, half their DNA. Since both MZ and DZ twins are expected 
to experience the same family environment and similar early life ex- 
periences, non-genetic influences should be the same in both types of 
twins and traits influenced by inherited genetic factors should show 
more resemblance between MZ than DZ twins. Based on this prin- 
ciple, twin studies can estimate the fraction of trait variability that is 
attributable to genetic differences, a quantity known as heritability 
(for review, see [20]). 

Twin studies have been conducted in BD since at least the 1930s. 
While methods of ascertainment, diagnosis, and statistics have 
changed substantially, heritability estimates have been remarkably 
consistent (reviewed in [6]). Most studies find that BD is 70-90% 
heritable. Such heritability estimates are among the highest of the 
major mental illnesses [21] and underscore the major role that in- 
heritance plays in the aetiology of BD. 

Twin studies have several limitations. Most importantly, they do 
not indicate which genes or how many different genes are involved. 
Even a highly heritable trait like BD seems to be influenced by nu- 
merous genes of small effect (see Polygenic models, p. 737). Twin 
studies may underestimate genetic influences when non-inherited 
sources of genetic variation, such as epigenetic marks and somatic 
mutation, play a major role. Twin studies may also overestimate 
heritability when the assumptions about shared environment are 
violated, when genetic and environmental influences are strongly 
correlated, or when ascertainment is biased or incomplete [22]. 
Despite these inherent limitations, the consistency in heritability 
estimates over many different studies and decades of investigation 
suggests any error should be modest. 


Adoption studies 


Another approach to parsing the influence of genes and envir- 
onment is the adoption study. Typically, adoption studies com- 
pare traits in the biological and adoptive parents of adoptees who 
manifest a trait of interest. This approach thus separates the effects 


of parental environment from those of inheritance, reducing or 
eliminating gene-environment correlations that can complicate the 
interpretation of twin studies [8]. Adoption studies cannot control 
for the effects of the intrauterine environment, which may play a role 
in some mental illnesses such as schizophrenia [23]. 

Three adoption studies of BD have been published [24-26]. 
Although they differed somewhat in methodology and had small 
sample sizes, all three studies found that rates of BD were increased 
in the biological parents of adoptees with BD, while no such increase 
was observed in their adoptive parents. These results support the 
view that inherited influences are most important in BD. 

One population-based study linked electronic health records of 
adoptees with those of their biological and adoptive parents [19]. 
This study achieved a much larger sample size than the traditional 
adoption studies, although parents and adoptees with BD who did 
not seek treatment would be missed by this study design. The results 
again showed that rates of BD were elevated among the biological 
parents of adoptees diagnosed with BD. 


Assortative mating 


Assortative mating describes mating patterns in a population that 
are non-random with regard to particular traits. For example, people 
tend to choose mates of similar height, education, and smoking 
habits. Assortative mating is important since it can concentrate gen- 
etic risk factors in certain families (while reducing them in others), 
increasing the risk or severity of illness in the offspring. 

Assortative mating for mood disorders has long been observed 
in both clinical and epidemiological samples [27-29]. While such 
mating patterns could, in principle, reflect the influence of secondary 
factors associated with mood disorders (such as alcoholism), most 
studies have found that people often choose mates with similar (or 
identical) psychiatric diagnoses. A large electronic health record- 
based study in the Swedish population confirmed these findings 
and revealed strong patterns of assortative mating across a range of 
mental illnesses [30]. 


Molecular genetics 


Genetic linkage in bipolar disorder 


The ultimate goal of genetic linkage studies is to define genetic loci 
that underlie an inherited disorder. A considerable number of genetic 
linkage studies of BD were conducted between the 1980s and 1990s. 
To support this effort, collections of densely affected pedigrees and 
sib-pairs were assembled. The National Institute of Mental Health 
(NIMH) Genetics Initiative (https://explorer.nimhgenetics.org/ 
subject-counts/explore/?c=bipolar-disorder), one of the largest ef- 
forts, established a resource of DNA, immortalized cell lines, and ex- 
tensive clinical phenotype information from over 4000 individuals in 
850 families. The DNA and associated source information are available 
to qualified researchers studying the genetics of mental disorders and 
other complex diseases at recognized biomedical research facilities. 

Early linkage studies depended on phenotypic markers such 
as blood type or colour blindness (reviewed in [31]). Genetically 
mapped DNA markers, including restriction fragment length 
polymorphisms (RFLPs) [32] and highly informative short tandem 
repeat or microsatellite markers [33], were later developed. 

An early study in an extended Old Order Amish pedigree re- 
ported linkage to a locus on chromosome 11 [34], but this finding 
was not supported in an independent set of pedigrees [35] and the 
original evidence for linkage was greatly diminished after a relative 
developed BD [36]. Linkage to chromosome 18 was also reported 
[37], with some support in an independent sample [38], but a causal 
gene was not identified. 

In 1997, the NIMH Genetics Initiative published a genome- 
wide scan of 97 families with 540 individuals [39]. Non-parametric 
methods showed the strongest findings on chromosomes 1, 6, 7, 10, 
16, and 22. To increase power, meta-analytic studies were performed 
that combined several different sets of pedigrees [40-42]. The results 
were stronger, but no more consistent. 

Some studies sought to reduce genetic heterogeneity by focusing 
on clinical subtypes of BD. While most were unsuccessful, increased 
linkage signals were found for bipolar II disorder [43], BD with 
psychosis [10], age at onset [44], and polarity at onset [12]. These 
reports were not followed by clear replications, suggesting that 
the subtype strategy did not identify subsets of families that were 
substantially more homogenous. 


Candidate gene studies 


For many years, candidate gene screening was pursued as an alter- 
native to linkage studies. The aim was to determine whether genes 
postulated to serve important neuronal functions had a role in gen- 
etic predisposition to BD. The burgeoning list of potential candi- 
date genes was presented previously [45]. So far, definitive evidence 
is lacking for the role of three of the most commonly investigated 
brain-expressed genes: brain-derived neurotrophic factor (BDNF), 
catecholamine-O-methy] transferase (COMT), and serotonin trans- 
porter (SLC6A4). Methodological limitations of the candidate gene 
strategy eventually led to its replacement by genome-wide associ- 
ation studies (GWAS) [46]. 


Genome-wide association studies 
Design and theory 


GWAS were initially conceived as a way to overcome many of 
the problems inherent in candidate gene studies [47]. Initially a 
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theoretical idea, GWAS soon became practical with the advent of 
microarrays. Thanks to this technology, it is now routine to genotype 
millions of common genetic variants (known as single nucleotide 
polymorphisms, or SNPs) in thousands of samples, at less than a 
penny per genotype. GWAS have represented a major breakthrough 
for BD (and hundreds of other common, complex traits), for the first 
time revealing robust biomarkers that point towards genes. GWAS 
have also revealed the hitherto underappreciated role of small 
chromosomal deletions and duplications (known as copy number 
variants, or CNVs) in risk for a range of neuropsychiatric disorders. 

Since the typical GWAS assays thousands of independent SNPs, 
statistical testing needs to correct for the large number of hypoth- 
eses tested. This is typically accomplished by use of a Bonferroni- 
corrected significance threshold of P <5 x 10%. One consequence of 
this multiple testing problem is that GWAS need very large samples 
to achieve adequate statistical power. Thousands of cases and con- 
trols are typical; some studies use even more. Such sample sizes are 
difficult to achieve in a single collection, so many GWAS use meta- 
analysis methods to combine the results of different collections. This 
calls for sophisticated detection and correction of biases that can 
arise from even subtle differences in ancestry, genotyping platform, 
or other sample-specific sources. 

GWAS are best powered for detecting association with common 
variants [48]. Such common variants typically confer only a slight 
increase in disease risk, since natural selection keeps higher-risk 
variants from rising to high frequencies under most circumstances 
[49]. The goal of GWAS is to detect genes involved in the trait of 
interest, but the genetic markers used in a GWAS do not unambigu- 
ously implicate particular genes and often fall within large stretches 
of DNA between genes. For a review of the experimental route from 
GWAS hit to disease gene, see [50]. 


Results to date 


The first GWAS of BD were published in 2007, with few genome- 
wide significant markers [51-53]. As samples have grown, findings 
have grown as well. To date, over 20 distinct genetic associations 
with BD have been detected at genome-wide significance (Fig. 
70.2). Some of these have been replicated in at least one independent 
sample. This is perhaps the greatest advance of the GWAS method 
for psychiatry—identification of robust genetic markers for dis- 
orders that had hitherto defied genetic analysis. Given the steady 
pace of new findings—approximately one new genetic association 
for every 1000 cases of BD studied—it seems likely that GWAS will 
yield many additional associations as sample sizes increase [54]. 

As expected, all of the markers found so far are common and each 
confers a modest increased risk for illness (odds ratios <1.2). The 
associated SNPs lie near genes that encode signalling molecules, ion 
channels, neuronal growth factors, and other molecules [55], but as 
noted previously, firm experimental links between associated SNPs 
and particular genes remain to be established. Interestingly, many of 
the same genetic markers have been implicated by GWAS of schizo- 
phrenia, supporting the view that BD and schizophrenia share some 
genetic risk factors [5, 56]. 


Polygenic models 


The large number of associated markers and low risk conferred 
by each marker have led to intensive investigation of the ways 
in which markers found by GWAS may act together to influence 
risk in each individual case. Early expectations of multiplicative 
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Fig. 70.2 Significant associations with bipolar disorder from genome-wide association studies published through December 2016. 
Source: data from NHGRI-EBI GWAS Catalog; graphic generated with NCBI Genome Decoration Page. 


effects—where the risk conferred by a set of distinct markers is 
greater than the sum of each individual marker—have been largely 
unfulfilled. Instead, it appears that common risk variants add to- 
gether to confer risk, in much the same way as predicted by classic 
polygenic models [57]. 

A landmark study [56], published in 2009, offered one way to as- 
sess the effects of many markers of small effect. By assigning each 
allele a weight, based on its association odds ratio, that study showed 
that it is possible to estimate the total risk of illness conferred by the 
full set of alleles carried by each individual. Known as the polygenic 
risk score, this estimate has proven to be useful to gauge the impact 
of common risk variation across studies and traits. Since it does not 
seek to implicate any one particular marker, the polygenic risk score 


can use information from markers that have not yet been implicated 
at the stringent genome-wide significance threshold. 

The polygenic risk score approach shows that common variants 
can account for approximately 5% of the variance in risk for BD [58]. 
The remaining risk remains unexplained, leading to what has come 
to be known throughout the GWAS literature as the ‘missing herit- 
ability’ problem [59]. As sample sizes grow, we expect that the pro- 
portion of risk that can be explained by common variants will grow 
but will probably remain below the inherited genetic risk estimated 
by twin studies. This is one of the rationales for studies of rare vari- 
ation, discussed later in this chapter. 

The polygenic risk score approach has also shown that most of the 
common genetic risk for BD overlaps with that seen in schizophrenia 


and—to a lesser extent—major depression [58]. This important 
finding is again consistent with the idea that the major mood and 
psychotic disorders share many common genetic risk factors. 


SNP heritability 


Since they sample genetic variation across the entire genome, SNPs 
can also be used to estimate distant relatedness between individuals. 
This fact can be harnessed to estimate heritability without some of 
the limitations and challenges inherent in traditional twin studies. 
This has come to be known as the ‘SNP heritability’ [60]. 

Several SNP heritability studies have been published. Heritability 
estimates have generally been far below those of twin studies. SNP 
heritability estimates for BD are typically in the range of 30-40%, 
less than half the heritability estimated from twin studies [61]. 
Additional research is needed to understand the sources of this large 
difference in heritability estimates. 


Copy number variants 


As high-resolution studies of human chromosomes became feasible 
in recent years, it became clear that SNPs were not the only kind of 
variation in the human genome. Copy number variants, or CNVs, 
are small chromosomal segments that are deleted or duplicated 
across individuals in the population [62]. Individually rare, CNVs 
that may span several genes are, as a group, quite common in the 
human genome. Some of these CNVs play a role in human disease. 
For reasons not yet fully understood, CNVs seem to play an espe- 
cially important role in diseases of the central nervous system, such 
as intellectual disability, autism, and schizophrenia, where they can 
contribute to over 5% of cases [63]. 

While it seems clear that CNVs do not play a major role in BD 
[64], several large studies have implicated particular CNVs in a 
range of psychiatric disorders, including BD [65, 66]. The clearest 
example is a duplication on chromosome 16p11.2 that spans about 
650,000 base pairs of DNA and encodes a number of genes [67]. The 
16p11.2 duplication is found in under 1% of people with BD but ap- 
pears to confer a substantial risk of illness—about 10-fold—much 
higher than any other known single genetic event [65]. This same 
duplication also confers a high risk of schizophrenia and intellec- 
tual disability, so it cannot be considered a specific risk factor for 
BD. Further research is needed to understand how this CNV affects 
gene function and neurodevelopment and why it is such a potent 
risk factor for a range of neuropsychiatric disorders. 


Sequencing studies 


Thanks to advances in sequencing technology, it is now possible to 
determine the precise nucleotide sequence of long stretches of DNA 
with great accuracy and relatively low cost. Sequencing can be per- 
formed at the level of individual genes, expressed sequences (known 
as the exome), or whole genomes, including long stretches of DNA 
that are coming to be understood as the major sites of gene expres- 
sion regulation. 

Since they do not rely on discreet genetic markers, sequencing 
studies can uncover the full range of genetic variation, both common 
and rare, that exists in the human genome. Large-scale studies have 
already demonstrated that each individual harbours numerous rare 
variants [68], including 1-3 that arose de novo as new mutations 
not present in either parent [69]. Variants within coding regions 
of genes may change amino acids, alter splice patterns, or shift the 
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reading frame, leading to substantial differences in protein function, 
including complete loss of function. 

Rare alleles could help explain the large portion of heritability 
apparently not explained by common variants [59]. This hope has 
fuelled a growing set of studies aimed at sequencing individuals 
with common neuropsychiatric disorders. Only a few large-scale 
sequencing studies have been performed so far in BD. Both case- 
control and family designs have been utilized [70-73]; one study 
has focused on de novo variants [74]. The results have been largely 
inconclusive, probably owing to the very large samples needed to es- 
tablish unambiguous associations with rare variants [75]. As sample 
sizes grow, rare risk alleles may be found that will contribute to a 
more complete understanding of the full range of genetic variation 
underlying BD. 


Genetic modelling studies 


Animal models 


Modelling psychiatric disorders in animals is challenging, since cur- 
rent psychiatric diagnosis is based on aspects of mood, cognition, 
and perception that are difficult to infer in animals. Despite this 
problem, certain behavioural phenotypes thought to model aspects 
of BD have been widely studied in rodents such as hyperactivity as 
a model for mania, and learned helplessness as a model for depres- 
sion. Animal models offer many advantages such as rapid evaluation 
of therapeutic interventions, genetic engineering, and experimental 
control of environmental risk factors. Critical reviews on animal 
models for neuropsychiatric disorders [76], recurrent or bipolar de- 
pression [77], and bipolar mania [78] have been presented recently. 

General approaches for creating animal models are typically 
based on genetics, pharmacology, environmental manipulation, or 
brain lesions [76]; each of these approaches has its own limitations. 
Evaluating the appropriateness of a model is fraught with many chal- 
lenges. The latter review listed three types of validators: construct, 
face, and predictive validity. Criteria for construct validity could be 
met, for example, by knockout of a highly penetrant single gene, but 
such genes have not been identified in BD. Face validity depends on 
the extent to which the animal model displays biochemical, neuro- 
pathological, and behavioural properties analogous to those ob- 
served in human disease. Predictive validity can be claimed when 
the response to pharmacologic agents in the animal model predicts 
drug response in human patients. 

Numerous attempts at modelling a specific phenotype relevant to 
BD have been published. Mania-like behaviour in animals has been 
induced through administration of psychostimulants such as am- 
phetamine, oubain, or quinpirole [76, 78]. In these animals, hyper- 
activity has been shown to be diminished by mood-stabilizing agents 
such as lithium or valproate. A review of studies modelling mania 
in mice through genetic manipulations has been presented recently 
[78]. Expression profiling of genes changed under mania-like con- 
ditions and during subsequent behavioural rescue with drugs could 
reveal perturbed pathways that may have a role in pathophysiology. 

Some examples illustrate the strengths and weaknesses of this 
strategy. GSK-3B is a well-established target of lithium via the 
Wnt signalling pathway. Overexpression of GSK-36 in mice in- 
duces hyperactivity, lower food intake, and other behavioural 
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abnormalities broadly resembling mania in humans (reviewed in 
[78]). The ClockA19 mutant mouse, which could model some of the 
circadian rhythm disturbances in BD, exhibits various mania-like 
behaviours that can be normalized by chronic lithium treatment (re- 
viewed in [78]). Knockdown or heterozygous disruption of exon 1b 
in the gene ANK3, which has been implicated in BD by GWAS, pro- 
vokes mania-like behaviour in mice that can be reversed by lithium 
(reviewed in [78]). The function of genes implicated in BD is per- 
turbed in each of these models, thus providing support for an impact 
of these individual genes on behaviour, as well as an experimental 
system in which mechanisms can be investigated. However, such 
models cannot capture the additive effects of multiple common vari- 
ants in several genes, even though this is currently the most widely 
accepted genetic architecture for BD and other mental illnesses. 

Animal models that might recapitulate both mania and 
depressive-like behaviours could meet requirements of construct, 
face, and predictive validity. Two such mouse models have been de- 
veloped and studied—the Black Swiss mouse, which was generated 
by crossing outbred Swiss mice with the inbred strain C57BL6/JN, 
and the Madison mouse [78]. Lithium treatment reverses some of 
the mania-like phenotypes in both of these models, suggesting they 
may be exploited to screen new compounds for lithium-like effects. 

Several mouse models of depression have been developed. Most 
are built on the learned helplessness construct [79], such as the 
forced swim test and the tail suspension test (reviewed in [77]). 
Historically, these models have played a valuable role in the dis- 
covery of antidepressants [80], which are often prescribed for people 
with BD, but the heuristic value of the learned helplessness construct 
has been questioned in recent years as the pace of discovering new 
antidepressants has slowed [81]. 


Cellular models 


Various cells, including lymphoblastoid cell lines, fibroblasts, and 
immortalized neuronal cells, have been used to examine diverse 
aspects relevant to BD. In this section, we will focus our discussion 
on induced pluripotent stem cells (iPSCs). The development of iPSC 
technology through the Nobel Prize-winning work of Yamanaka 
[82, 83] has provided the unprecedented power to study previously 
inaccessible live human tissues by reprogramming an individual’s 
adult somatic cells to stem cells. iPSCs could then be differentiated 
into cells of interest, including neural progenitor cells, astrocytes, 
and neurons that could be targeted for various studies. iPSCs and 
their derivatives provide a person-specific cellular template to pursue 
functional, drug response, drug discovery, and toxicology studies. In 
contrast to animal models, studies on iPSC derivatives would deliver 
data on cells that carry the genetic background of the patient or that 
of the person under study. In addition, dish-derived stem cells could 
be tapped for further investigations in vivo, for example in mouse 
models, to either cause or rescue an abnormal phenotype. 

To help assemble, characterize, generate, store, distribute, and pro- 
mote expansion of the iPSC collection, the NIMH Repository and 
Genomics Resource (NIMH-RGR) (https://www.nimhgenetics.org/ 
stem_cells/) established the NIMH Stem Cell Center to act as a re- 
pository for stem cells for neuropsychiatric disorders, assuring the 
research community of the availability of study samples. 

Disease modelling in iPSC and iPSC derivatives in neurologic, 
neurodevelopmental, and neuropsychiatric disorders was recently 
reviewed [84]. The ability to transform stem cells into various 


neuronal derivatives permits the comparative analysis of transcrip- 
tome, proteome, and epigenetic patterns at each stage of differenti- 
ation of cells that originate from affected and unaffected individuals. 
Integration of expression patterns with GWAS findings could 
strengthen support for loci and signalling pathways that contribute 
to disease risk. Cellular phenotypes revealed in the presence or ab- 
sence of various pharmacologic or environmental agents, that could 
distinguish patients from controls, could be harnessed to develop 
functional assays and lead to the discovery of candidate therapeutic 
compounds. Production of isogenic lines using genome-editing 
methods, such as CRISPR-Cas9, might reduce the requirement for 
large sample sizes; however, the lack of a penetrant major effect mu- 
tation in BD presents a key impediment for such an approach. 

Several studies on iPSC neuronal derivatives in BD have claimed 
phenotype differences between cells derived from people with BD 
and those from healthy controls [85-87]. Each study focused on dif- 
ferent cellular phenotypes in small samples, highlighting the need 
for higher-throughput, standardized measures that can be carried 
out and tested for replication in larger samples. If consistent cel- 
lular phenotypes can be identified, they could serve as a basis for 
many important studies such as screening for novel therapeutic 
compounds. 


Future directions 


One of the important near-term tasks is to establish firm links be- 
tween common variants identified by GWAS and specific genes. This 
task has become more tractable, as we learn more about the mech- 
anisms of gene regulation [88]. At the same time, the task has also 
grown more complex. It is becoming increasingly clear that physical 
proximity is a poor indicator of which SNPs regulate which genes 
[89], that each SNP may regulate many different genes, and that the 
relationship between SNPs and the genes they regulate may vary 
by cell type and developmental stage. The identification of specific 
genes is important if we are to succeed in extracting insight from 
GWAS into the biological basis of BD. 

In addition to those genes implicated by GWAS, many other genes 
may harbour rare variants that contribute to BD. While more diffi- 
cult to achieve, firm rare variant associations may be particularly 
informative since they may be more easily linked to specific genes 
and often confer much larger risk than common variants. 

The field of complex genetic disorders still lacks a unified frame- 
work that can fully encompass both common and rare genetic vari- 
ants. How can both forms of variation be used to develop a more 
complete account of the inherited risk of BD? Are certain kinds of 
variants specific to BD, or are they all shared to some extent with 
other major mental illnesses such as schizophrenia or major depres- 
sion? How can genetic variation be leveraged to improve and refine 
our diagnostic categories? 

While it is clear that disease modelling using animal or cellular 
models cannot recapitulate, in its entirety, the complexity of BD, these 
models can illuminate certain aspects of pathophysiology. Recent 
developments in the generation of three-dimensional cultures and 
brain organoids [90] may lead to the creation of more physiologic- 
ally meaningful models. Engrafting patient iPSC-derived neurons 
into rat brain could disclose impairments in connectivity and axonal 
transport that might be operative in neuropsychiatric disorders [91]. 


The potential role of glial cells may be revealed by tools such as chi- 
meric mice whose native glia are replaced by human-derived cells 
[92, 93]. These advances and others that are being developed, such 
as optical electrophysiology [94], could help spur improvements in 
disease modelling and assay development that will form the basis 
for high-throughput therapeutic screening and the identification of 
novel drug treatments for BD. 

As we move towards the identification of genetic variants that, 
taken together, confer substantial risk for BD, we will also have the 
opportunity to investigate more powerfully the role of non-genetic 
risk factors. By contrasting the life histories of individuals at high 
genetic risk who do or do not develop BD, we may finally be able to 
define specific non-genetic events that contribute to risk for bipolar 
illness. This could ultimately lead to preventive approaches, based 
on risk factors that are more easily modifiable than inherited genetic 
variants. 


Conclusions 


The last century has seen enormous progress in our understanding 
of the genetic basis of BD. From early studies aimed at assessing the 
overall contribution of genes to the obvious familial clustering of 
BD, we have now progressed to the point where we have reliably 
identified dozens of common genetic markers for this common 
disorder. Complex inheritance patterns, substantial genetic hetero- 
geneity, and considerable genetic overlap with other major mental 
illnesses remain as real barriers to a complete genetic account of in- 
dividual risk. Much still needs to be done to translate the great pro- 
gress in genetic research into biological insights that can inform an 
understanding of the causes, treatment, and prevention of BD. 
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Emotional over-reactivity and emotion dysregulation are key clinical 
features of bipolar disorder [1]. Given this, it is not surprising that 
the majority of functional neuroimaging studies in bipolar disorder 
examined functioning in neural circuitries supporting emotion 
processing and emotional regulation when participants performed 
emotion processing and emotional regulation tasks. These neural 
circuitries centre on the amygdala, a region important for emotion 
processing, threat, and salience perception [2], and prefrontal cor- 
tical regions important for emotion regulation. Emotion regulation 
has been subdivided into voluntary and automatic (implicit) sub- 
processes where different prefrontal cortical regions, including the 
orbitofrontal cortex, ventrolateral prefrontal cortex, dorsolateral 
prefrontal cortex, and medial prefrontal cortex (including the an- 
terior cingulate cortex and mediodorsal prefrontal cortex), underlie 
these different subprocesses [3]. Here, a predominantly lateral pre- 
frontal cortical system, centred on the dorsolateral prefrontal cortex 
and ventrolateral prefrontal cortex, is thought to be important for 
voluntary emotion regulation, while a medial prefrontal cortical 
system, including the orbitofrontal cortex, anterior cingulate cor- 
tical subregions (subgenual, pregenual, and dorsal subregions), 
mediodorsal prefrontal cortex, and also the hippocampus, is thought 
to be important for automatic emotion regulation [3] (Fig. 71.1a). 
Earlier studies reported abnormally elevated amygdala activity to 
emotional stimuli, and abnormally reduced activity in lateral and 
medial prefrontal cortical regions supporting emotion regulation 
[3] in individuals with bipolar disorder. More recent studies focused 
on elucidating specific fronto-subcortical functional abnormalities 
in adults with bipolar disorder during emotion regulation and in- 
hibitory control [4]. Specifically, these studies showed abnormally 
decreased inferior frontal cortical activity, especially in the ventro- 
lateral prefrontal cortex, during different positive and negative 
emotion processing and emotion regulation tasks in adults with bi- 
polar disorder, even across different mood states. Altered (including 
abnormally elevated and abnormally decreased) activity in the 
amygdala and other emotion regulation circuitry regions was also 








demonstrated in adults with bipolar disorder in these studies (Ta 
71.1; Fig. 71.2). 






yuli 


Other findings highlight abnormally elevated amygdala and stri- 
atal and medial prefrontal cortical activity to positive emotional 
stimuli in individuals with bipolar disorder (Table 71.1), with more 





recent studies demonstrating abnormally increased amygdala and 
medial prefrontal cortex activity [4] to emotional, especially happy, 
faces in adults with bipolar disorder. These findings may reflect a 
dysregulated amygdala response and an underlying attentional bias 
to positive emotional stimuli in individuals with bipolar disorder 
that predisposes to hypomania and mania. 





Several studies reported abnormally increased positive functional 
connectivity (that is, positive coupling of changes in magnitude of 
activity over time) or abnormally reduced inverse functional con- 
nectivity (that is, negative coupling of changes in magnitude of 
activity over time) between the amygdala and the ventrolateral pre- 
frontal cortex in individuals with bipolar disorder, compared with 
healthy individuals, during performance of emotion processing and 
emotion regulation tasks [5] (Table 71.1). For example, studies re- 
ported weaker inverse functional connectivity or abnormal positive 
functional connectivity between the ventrolateral prefrontal cortex 
and the amygdala in individuals with bipolar disorder, compared 
with healthy controls, during emotion regulation tasks. Interactions 
between the amygdala and the ventrolateral prefrontal cortex may 
also be influenced by different types of emotional stimuli in individ- 
uals with bipolar disorder, but not in healthy individuals. 

Other studies focused on functional connectivity between the 
amygdala and the medial regions of the orbitofrontal cortex. Here, 
reduced functional connectivity between the medial orbitofrontal 
cortex and the amygdala has been reported in depressed individuals 
with bipolar disorder, relative to healthy individuals, in both hemi- 
spheres, and on the left relative to depressed individuals with major 
depressive disorder. Functional connectivity between the amygdala 
and other medial regions of the prefrontal cortex is also altered in 
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Fig. 71.1 (a) Key nodes in emotion processing and emotion regulation and neural circuitries in healthy individuals. dIPFC, dorsolateral prefrontal 
cortex; vIPFC, ventrolateral prefrontal cortex; ACC, anterior cingulate cortex. Regions in blue are involved in voluntary emotion regulation processes. 
Regions in purple are involved in automatic emotion regulation processes. In red is the amygdala, a key node in emotion processing, threat, and 
salience perception. 
(b) Key nodes in reward processing neural circuitry in healthy individuals. In orange is the ventral striatum, a key neural region involved in the 
orientation to reward cues and prediction error encoding. In red is the orbitofrontal cortex, a key neural region important for encoding the incentive 
value of stimuli. In purple are different medial prefrontal cortical regions, including different anterior cingulate cortical regions and mediodorsal 
prefrontal cortex, that are implicated in the regulation of appetitive behaviours and regulation of the ventral striatum in rewarding or potentially 
rewarding contexts. In blue is the ventrolateral prefrontal cortex, which, in rewarding contexts, links stimulus representations to reward outcomes and 
encodes the value of different decision-making options. 
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Table 71.1 Main themes from functional neuroimaging studies in bipolar disorder 


Abnormal activity in emotion processing and 1, Abnormally increased amygdala activity to emotional stimuli 
emotion regulation neural circuitry 2. Abnormally decreased ventrolateral prefrontal cortical activity during emotion processing, emotion 
regulation, and response inhibition 


Abnormal activity in emotion processing circuitry to 1. Abnormally increased amygdala, striatal, and medial prefrontal cortical activity to positive emotional 


positive emotional stimuli stimuli 

Abnormal functional connectivity in prefrontal 1. Abnormally increased positive functional connectivity or abnormally reduced inverse functional 
cortical-amygdala circuitries important for emotion connectivity between amygdala and ventrolateral prefrontal cortex during emotion processing and 
processing and regulation emotion regulation 





2. Abnormally reduced or altered functional connectivity between medial orbitofrontal cortex and 
amygdala during emotion processing 

3. Altered functional connectivity between ventrolateral prefrontal cortex and medial prefrontal cortex 
during task performance 

4. Altered functional connectivity between dorsolateral prefrontal cortex and amygdala, and between 
dorsolateral prefrontal cortex, anterior cingulate cortex, and ventrolateral prefrontal cortex during 
emotion processing and emotion regulation 








Abnormal activity and functional connectivity in 1. Abnormally increased amygdala, medial prefrontal cortical, orbitofrontal cortical, and temporal 
emotion processing and emotion regulation neural cortical activity during non-emotional cognitive task performance 

circuitry during performance of non-emotional tasks 

Abnormal activity and functional connectivity in 1. Abnormally increased left ventrolateral prefrontal cortical activity and left orbitofrontal cortical 
reward processing neural circuitry activity during reward anticipation and receipt 


2. Abnormally increased ventral striatal activity to reward anticipation and receipt in euthymic, but not 
in depressed, individuals with bipolar disorder 

3. Abnormally reduced inverse functional connectivity between ventral striatum and medial prefrontal 
cortex 











4, 


Fig. 71.2 Key functional abnormalities in emotion regulation neural circuitry in individuals with bipolar disorder. The most consistent abnormalities 
include: 1. abnormally increased amygdala activity to emotional stimuli; 2. abnormally decreased ventrolateral prefrontal cortical activity during 
emotion processing, emotion regulation, and response inhibition; 3. abnormally increased amygdala and medial prefrontal cortical activity to 

positive emotional stimuli; 4. abnormally increased positive functional connectivity or abnormally reduced inverse functional connectivity between 

the amygdala and the ventrolateral prefrontal cortex during emotion processing and emotion regulation; and 5. abnormally reduced or altered 
functional connectivity between the medial orbitofrontal cortex and the amygdala during emotion processing. Star shapes refer to neural regions that 
are most consistently reported as being affected as described previously in bipolar disorder. In parallel, there are widespread abnormal decreases in 
grey matter volume and cortical thickness in prefrontal cortical regions, decreased grey matter volume in the amygdala and the hippocampus, and 
abnormally decreased fractional anisotropy in white matter tracts connecting the ventral prefrontal cortex and anterior temporal regions. There are also 
abnormalities of regional glucose metabolism and microglial activation in the same regions showing grey matter loss. 








individuals with bipolar disorder during task performance, but 
without a clear direction of effect emerging (Table 71.1). The differ- 
ential pattern of findings regarding functional connectivity between 
the amygdala and the ventrolateral prefrontal cortex and between 
the amygdala and regions of the medial prefrontal cortex may thus 
reflect specific functional abnormalities in lateral prefrontal cortical- 
based voluntary/effortful emotion regulation pathway and medial 
prefrontal cortical-based automatic/implicit emotion regulation 
pathway. Abnormally elevated positive functional connectivity in 
the former pathway may reflect abnormally elevated encoding of 
emotion in the amygdala, resulting in an inefficient, compensatory 
regulation of the amygdala by the ventrolateral prefrontal cortex or 
by abnormally elevated signalling of emotion from the amygdala to 
the ventrolateral prefrontal cortex. By contrast, abnormally reduced 
functional connectivity in the latter pathway may reflect a relative 
disconnectivity pattern, resulting in deficient amygdala regulation 
by medial prefrontal cortical regions (Fig. 71.2). 

There are also observations of altered connectivity between the 
ventrolateral prefrontal cortex and the medial prefrontal cortex in 
individuals with bipolar disorder. The dorsolateral prefrontal cortex 
may also play a role in regulating the amygdala, with alterations 
in dorsolateral prefrontal cortical-amygdala and dorsolateral pre- 
frontal cortical—anterior cingulate cortical—ventrolateral prefrontal 
cortical functional connectivity reported in individuals with bipolar 
disorder (Table 71.1). 


Abnormal activity and functional connectivity 
in emotion processing and emotion regulation neural 
circuitry during performance of non-emotional tasks 


Abnormal patterns of activity in emotion processing circuitry, 
including the amygdala, medial prefrontal cortex, orbitofrontal 
cortex and temporal cortex, during non-emotional, cognitive task 
performance have also been reported in individuals with bipolar 
disorder [4] (Table 71.1). Key findings include abnormally elevated 
amygdala activity, abnormally reduced inverse functional connect- 
ivity between the amygdala and cortical regions, and abnormally re- 
duced activity in prefrontal cortical regions implicated in cognitive 
control processes in adults with bipolar disorder during a variety of 
cognitive and motor tasks. These findings suggest a heightened per- 
ception of emotional salience in non-emotional contexts in bipolar 
disorder. 


Abnormal activity and functional connectivity in reward 
processing neural circuitry 


A number of behavioural and event-related potential studies have 
highlighted heightened reward sensitivity in individuals with bipolar 
disorder. The key role of the ventral striatum (nucleus accumbens) 
in response to reward cues, reward receipt, and prediction error 
(the difference between expected and actual outcomes) is well es- 
tablished [6]. Specific prefrontal cortical regions also have specific 
roles in reward processing. The ventrolateral prefrontal cortex, in 
addition to its role in emotion regulation, is activated during arousal 
in the context of emotional stimuli, encodes the value of different 
decision-making options, and links stimulus representations to spe- 
cific reward outcomes [7]. The orbitofrontal cortex encodes reward 
values. In humans, both of these regions may have excitatory af- 
ferent connections with the ventral striatum, given studies reporting 
excitatory afferent connections between the homologues of these 
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prefrontal cortical regions and the ventral striatum in rodents. The 
medial prefrontal cortex (including dorsal and pregenual anterior 
cingulate cortical subregions and mediodorsal prefrontal cortex) 
regulates the ventral striatum and appetitive behaviours in poten- 
tially rewarding contexts [8] (Fig. 71.1b). 

Functional neuroimaging studies of reward processing neural 
circuitry in individuals with bipolar disorder indicate abnormally 
elevated activity in the ventral striatum and left prefrontal cortex, 
in particular, the left orbitofrontal cortex and the left ventrolateral 
prefrontal cortex, during reward processing [4]. Here, studies re- 
ported abnormally increased left ventrolateral prefrontal cortical 
activity to reward anticipation in adults with bipolar disorder across 
different mood states and abnormally increased ventral striatal ac- 
tivity during reward anticipation in euthymic individuals with bi- 
polar disorder; abnormally elevated left orbitofrontal cortex and 
amygdala activity to reward reversal and elevated left orbitofrontal 
cortex activity to reward in euthymic adults with bipolar disorder; 
and elevated ventral striatal activity to reward cues and outcomes 
in individuals with subthreshold hypomania. By contrast, patterns 
of either no ventral striatal activity abnormality during reward an- 
ticipation or receipt or abnormally reduced ventral striatal activity 
to reward receipt have been demonstrated in depressed individuals 
with bipolar disorder. Another study reported no differential ac- 
tivity in the ventral striatum to reward receipt vs omission in manic 
adults with bipolar disorder. These findings indicate abnormally 
elevated activity in distributed reward circuitry, particularly in the 
left ventrolateral prefrontal cortex, during reward processing in in- 
dividuals with bipolar disorder (Table 71.1; Fig. 71.3). Only a small 
number of recent studies reported altered functional connectivity in 
reward circuitry, including significantly reduced inverse functional 
connectivity between the ventral striatum and medial prefrontal 
cortical regions. This is an area requiring future research effort. 





Intrinsic (resting state) connectivity studies 


These studies provide measures of tonic functional connectivity in 
neural circuitries of interest, rather than stimulus-related, phasic 
functional connectivity in these circuitries, and can thereby iden- 
tify context-independent functional abnormalities that poten- 
tially represent core functional abnormalities in such circuitries in 
a given disorder. The majority of these studies in bipolar disorder 
employed a region of interest approach to examine functional con- 
nectivity among a priori regions and localized neural circuitries 
at rest, measuring, for example, correlations between time series 
of low-frequency fluctuations in activity among these regions. 
Predominant findings from these studies in individuals with bipolar 
disorder parallel findings from task-based fMRI studies [5] and in- 
clude: (1) abnormally decreased, but also abnormally increased, 
positive resting connectivity between medial prefrontal cortical and 
amygdala and striatal regions; (2) abnormal positive and abnormally 
reduced inverse resting connectivity between the medial prefrontal 
cortex and the ventrolateral prefrontal cortex, but also abnormally 
reduced positive resting functional connectivity among medial and 
lateral prefrontal cortical regions; (3) abnormally elevated positive 
resting state connectivity between the ventrolateral prefrontal cortex 
(and the anterior cingulate cortex) and amygdala; and (4) abnor- 
mally reduced inverse resting connectivity between the amygdala 
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Fig. 71.3 Key functional abnormalities in reward processing neural circuitry in individuals with bipolar disorder, including: 1. abnormally increased 
left ventrolateral prefrontal cortical activity and left orbitofrontal cortical activity during reward anticipation and receipt; 2. abnormally increased ventral 
striatal activity to reward anticipation and receipt in euthymic, but not in depressed, individuals with bipolar disorder; 3. abnormally reduced inverse 
functional connectivity between the ventral striatum and the medial prefrontal cortex. Star shapes refer to neural regions that are most consistently 
reported as being affected as described previously in bipolar disorder. In parallel, there are widespread decreases in grey matter volume and cortical 
thickness in prefrontal cortical regions and striatal regions in individuals with bipolar disorder. 


and the dorsolateral prefrontal cortex. Other studies in individuals 
with bipolar disorder show more widespread patterns of abnor- 
mally reduced positive functional connectivity among different 
prefrontal cortical and temporal regions, and reduced positive func- 
tional connectivity among midline cortical regions. Abnormalities 
in amygdala-ventrolateral prefrontal cortical resting functional 
connectivity may be determined, at least in part, by the medial pre- 
frontal cortex. 

Other resting state functional connectivity studies focused on 
larger-scale networks and reported in individuals with bipolar dis- 
order diverse patterns of abnormally increased resting connectivity 
in paralimbic and fronto-temporal/paralimbic networks, abnor- 
mally decreased connectivity in the medial prefrontal cortex, and 
abnormal positive resting connectivity between the medial pre- 
frontal cortex and the ventrolateral prefrontal cortex, and between 
the medial prefrontal cortex and the insula, together with abnormal 
decoupling between the mediodorsal prefrontal cortex and the 
dorsolateral prefrontal cortex. 

Fewer studies used global connectivity methods (for example, in- 
dependent components analyses) to study patterns of whole brain 
study resting functional connectivity in individuals with bipolar dis- 
order. Findings from these studies are somewhat mixed, showing 
diverse patterns of abnormally elevated or reduced resting connect- 
ivity within and among different large-scale neural networks im- 
plicated in salience processing, executive function, emotion, and 
‘default mode’ (non-task performance). 


Recent studies examined the amplitude of low-frequency fluctu- 
ations (ALFF) and the homogeneity of time series within specific 
neural regions, as alternative measures of resting state connectivity. 
These studies reported both abnormally increased and decreased 
ALFF in fronto-temporal-striatal regions in individuals with bipolar 
disorder vs healthy individuals, decreased ALFF in left post-central 
parahippocampal regions in individuals with bipolar disorder vs 
healthy individuals, and greater regional homogeneity in left fronto- 
parietal cortices in depressed bipolar disorder (subtype unspecified) 
vs healthy adults. 

Together, these studies indicate intrinsic, context-independent 
abnormalities, both in functional connectivity between regions and 
in the amplitude and homogeneity of low-frequency fluctuations 
within neural regions, predominantly within prefrontal cortical, 
amygdala, and striatal circuitries implicated in emotion processing, 
emotion regulation, and reward processing in individuals with bi- 
polar disorder [5]. Findings largely parallel those from task-based 
fMRI studies in bipolar disorder and also suggest abnormalities in 
resting state functional connectivity among larger-scale neural net- 
works in the disorder (‘Table 71.2). The relative paucity of resting 
state functional connectivity studies, in comparison with task-based 
fMRI studies, in bipolar disorder, particularly those employing global 
network approaches, suggests that future studies should employ such 
approaches, in conjunction with task-based fMRI studies, to facilitate 
understanding of relationships between phasic and tonic-level func- 
tioning in neural circuitries of interest in bipolar disorder. 


Table 71.2 Main themes from intrinsic resting state connectivity 
studies in bipolar disorder 


Region of interest i 
approaches 


Abnormal changes in resting connectivity 
between medial prefrontal cortical and 
amygdala and striatal regions (both abnormally 
increased and decreased resting connectivity) 
2. Abnormal changes in resting connectivity 
between medial prefrontal cortex and 
ventrolateral prefrontal cortex (both abnormal 
positive and abnormally reduced inverse, 
and abnormally reduced positive, resting 
connectivity) 

3. Abnormally elevated positive resting state 
connectivity between ventrolateral prefrontal 
cortex (and anterior cingulate cortex) and 
amygdala 

4. Abnormally reduced inverse resting connectivity 
between amygdala and dorsolateral 
prefrontal cortex 

5. Larger-scale network abnormalities across 

different prefrontal and temporal cortical and 

subcortical regions 





= 


Global connectivity il, 
approaches 


Diverse patterns of abnormally elevated 

or reduced resting connectivity within and 
among different large-scale neural networks 
implicated in salience processing, executive 
unction, emotion, and ‘default mode’ (non-task 
performance) 


Approaches using 1. Abnormally increased and decreased ALFF in 
amplitude of low ronto-temporal-striatal regions 

frequency fluctuations 2. Abnormally increased regional homogeneity in 
(ALFF) and regional eft fronto-parietal cortices 

homogeneity of time 

series 








Structural neuroimaging studies 


Key findings from earlier studies in bipolar disorder were an in- 
creased number of white matter hyperintensities [9] and enlarged 
amygdala grey matter volumes [10, 11], highlighting the potential 
role of abnormalities in emotion processing neural circuitry in bi- 
polar disorder. More recent studies examined regional grey matter 
volumes in a wider range of cortical and subcortical regions in 
adults with bipolar disorder. Two main themes have emerged from 
these later studies that relate to emotion processing and regula- 
tion neural circuitries [4]. Firstly, findings indicate abnormally 
decreased grey matter volume, decreased white matter volume, 
and decreased thickness in cortical regions implicated in emotion 
processing and cognitive processes important for emotion regula- 
tion, that is, the prefrontal and anterior temporal cortices, and in 
cortical regions underlying salience perception, that is, the insula 
and the dorsal anterior cingulate cortex, in individuals with bipolar 
disorder, with some exceptions. One study implicated the ventrolat- 
eral prefrontal cortex, in particular, in bipolar disorder, reporting 
a negative association between right ventrolateral prefrontal cortex 
grey matter volume and illness duration, and smaller right ventro- 
lateral prefrontal cortex grey matter volume in adults with bipolar 
disorder with long-term illness and minimal lifetime exposure to 
lithium vs healthy adults, but abnormally increased right ventrolat- 
eral prefrontal cortex grey matter volume in relatives of individuals 
with bipolar disorder and in younger adults in early stages of bipolar 
disorder [12]. Other studies replicated the finding of reduced right 
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ventrolateral prefrontal grey matter volume in bipolar disorder. 
Indeed, several findings indicate that prefrontal cortical grey matter 
volumes in general may decrease with illness progression, including 
a greater number of manic episodes, but may normalize (or even in- 
crease) with lithium. These findings of reduced grey matter volume 
in prefrontal cortical regions, especially in the right ventrolateral 
prefrontal cortex in adults with bipolar disorder, suggest a structural 
basis for findings from fMRI studies of decreased activity in this re- 
gion during emotion processing and emotion regulation tasks. 

A second key finding regarding structural findings in emotion 
processing and regulation neural circuitries in bipolar disorder is 
decreased subcortical regional volumes, especially in the amygdala, 
particularly during depressive episodes, which may normalize with 
lithium. For example, a meta-analysis reported amygdala volume 
decreases in youth, but not in adults, with bipolar disorder [13], sug- 
gesting a normalization of amygdala volume over development in 
bipolar disorder, potentially due to medication [12]. Abnormally de- 
creased hippocampal and parahippocampal volumes have also been 
reported in adults with bipolar disorder; such abnormalities may 
also be masked by lithium. 

Recent studies also provide evidence for abnormally increased 
gyrification of the anterior cingulate cortex in adults with bipolar 
disorder and have also employed newer techniques, such as high 
angular resolution molecular diffusion imaging, to examine the 
microstructure of different brain tissues in individuals with bipolar 
disorder. Findings confirm the presence of structural abnormalities 
in the hippocampus and striatum in bipolar disorder. 


Diffusion imaging studies 


The main diffusion imaging measures are longitudinal/axial diffu- 
sivity (AD), the diffusivity of water molecules along the principal 
axis; radial diffusivity (RD), the diffusivity of water molecules 
along transverse directions perpendicular to the longitudinal axis; 
and fractional anisotropy (FA), the ratio of longitudinal vs trans- 
verse diffusivity in white matter tracts. Given the properties of white 
axonal membranes and myelin sheaths, FA will be high in white 
matter tracts with densely packed collinear axons, but low in white 
matter with non-collinear axons. By contrast, RD will be high in 
white matter with non-collinear axons, but also high in white matter 
with damaged axonal membranes and/or myelin sheaths. 

Studies measuring FA, RD, and AD reported specific abnormal- 
ities in the microstructure of white matter tracts in neural circuit- 
ries important for emotion processing, emotion regulation, and 
reward processing in individuals with bipolar disorder. Initial diffu- 
sion imaging studies reported white matter abnormalities in front- 
ally situated white matter tracts in adults with bipolar disorder [4]. 
Recent diffusion imaging studies of adults with bipolar disorder re- 
ported abnormally reduced FA, often paralleled by abnormally in- 
creased RD, in these tracts, especially in tracts connecting prefrontal 
cortical and anterior limbic structures supporting emotion regula- 
tion and in temporal white matter. White matter tracts that most 
consistently show these abnormalities are the anterior regions of 
the corpus callosum, the anterior cingulum, the uncinate fasciculus, 
and the superior longitudinal fasciculus, tracts that interconnect 
frontal regions, and connect frontal regions with more posterior 
regions (including occipital, parietal, and temporal regions). These 
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abnormalities may be more apparent in a depressive episode than in 
remission and may be normalized by lithium. One study reported 
abnormal nodal networks in the left ventrolateral prefrontal cortex, 
left hippocampus, and bilateral mid anterior cingulate cortex in 
adults with bipolar disorder vs healthy adults [14], that may sug- 
gest a specific white matter structural basis for the observed pattern 
of abnormal left ventrolateral prefrontal cortex and orbitofrontal 
cortex activity during reward processing in bipolar disorder. 

Together, findings from diffusion imaging studies in adults with 
bipolar disorder thus suggest either abnormal myelination or ab- 
normal orientation of axons in white matter tracts connecting 
frontal and temporal (and also occipital and parietal) cortices, cor- 
tical regions that are in neural circuitries important for emotion 
regulation and reward processing. Interestingly, a recent diffusion 
imaging study measured magnetization transfer ratio (MTR) to as- 
sess the degree of myelin content in white matter tracts, and dif- 
fusion tensor spectroscopy (DTS) to measure metabolite diffusion 
specifically within axons in these tracts, in individuals with bipolar 
disorder [15]. This study reported specific reductions in MTR, but 
no abnormalities in DTS, in right prefrontal cortical white matter 
in individuals with bipolar disorder. These findings suggest that the 
major finding of reduced FA reported previously may result from 
reduced myelin content, rather than changes in axonal structure, in 
bipolar disorder. 

These data appear consistent with post-mortem histopathological 
evidence showing a reduction in oligodendroglial cell counts and 
in the expression of myelin basic protein (MBP) and other oligo- 
dendroglial genes in the medial prefrontal cortical network in bi- 
polar disorder [16]. In prefrontal cortical and hippocampal tissue of 
bipolar patients’ studied post-mortem (predominantly from medi- 
cated subjects in the depressed or mixed phases prior to death), 
the genes involved in energy metabolism and mitochondrial func- 
tion also are downregulated, while genes involved in immune re- 
sponse and inflammation are upregulated [17]. These data, taken 
together with the evidence reviewed in the following sections that 
bipolar disorder is associated with abnormal energy metabolism and 
neuroinflammation, may relate mechanistically to the white matter 
abnormalities identified using DTI, DTS, and structural MRI. 


Magnetic resonance spectroscopy studies 


Data obtained using magnetic resonance spectroscopy (MRS) tech- 
nologies are relevant to functional and structural neuroimaging ab- 
normalities since they can identify alterations in the bioenergetics 
underlying the metabolic and haemodynamic signals assessed 
during functional neuroimaging. Corroborating the evidence for 
reduced mitochondrial gene expression in bipolar disorder, in vivo 
MRS studies have revealed an abnormal reduction in frontal lobe 
pH in both medicated and unmedicated individuals with bipolar 
disorder, and a shift from oxidative phosphorylation to glycolysis 
in medication-free individuals with bipolar disorder [18]. More 
recently, phosphorus-31 (P-31) MRS studies of bioenergetics in 
bipolar disorder provided additional in vivo evidence of mitochon- 
drial dysfunction in bipolar disorder. 

Furthermore, since cerebral blood flow, glucose metabolism, and 
BOLD-fMRI signals are most heavily influenced by glutamatergic 
transmission, the application of proton (H-1) MRS to quantitate 


glutamate, glutamine, and ‘Glx’ (a composite measure consisting 
mainly of glutamate and glutamine) levels in brain tissue also 
holds relevance for the interpretation of functional imaging data. 
In general, studies applying these methods have shown a pattern 
of abnormally elevated Glx levels in bipolar disorder, in contrast 
to abnormally reduced Glx levels in major depressive disorder. 
Moreover, the glutamine/glutamate ratio appears abnormally de- 
creased in depression (whether unipolar or bipolar), but elevated in 
mania. In contrast to functional neuroimaging measures of cerebral 
glucose metabolism and blood flow, which predominantly reflect 
glutamatergic transmission, the H-1 MRS measures of glutamate 
and glutamine almost entirely reflect the intracellular pools, with 
glutamate residing largely in neurons and glutamine in glia. The ab- 
normal patterns found in mood disorders thus have been interpreted 
to indicate that the glutamate-related metabolite pool is constricted 
in major depressive disorder, but expanded in bipolar disorder, and 
that depressive and manic episodes are characterized by modulation 
of the glutamine/glutamate ratio in opposite directions, suggesting 
reduced vs elevated glutamate conversion to glutamine within glial 
cells, respectively. Since glutamate is converted to glutamine in glial 
cells, and glutamine is converted to glutamate in neurons, these data 
suggest an abnormality in neuronal-glial cell interactions. The MRS 
literature thus appears consistent with the post-mortem findings 
of reduced oligodendroglial cell counts, impaired astrocyte func- 
tion (especially with respect to glutamate transport), and loss of 
GABAergic neurons in structures where the glutamine/glutamate 
ratio also is altered in bipolar disorder (for example, pregenual an- 
terior cingulate cortex) [16]. 


Positron emission tomography studies 





PET studies of bipolar disorder have employed various radioligands 
to measure regional blood flow, glucose metabolism, neuroreceptor 
binding potential, or other parameters. Most of this literature is be- 
yond the scope of this chapter and is reviewed elsewhere [19, 20]. 
A few pertinent studies are discussed here, which converge with the 
literature described in the preceding sections to implicate alterations 
in bioenergetic and neuroinflammation in the pathophysiology of 
bipolar disorder. 

The concept that bioenergetics abnormalities exist in bipolar dis- 
order has face validity because of the clinical observation and symp- 
tomatology indicating that psychomotor activity and subjective 
energy are generally elevated in mania, but reduced in depression. 
Consistent with this concept, glucose metabolism is increased in the 
unmedicated manic phase, but decreased in the unmedicated de- 
pressed phase of bipolar disorder in limbic-cortical regions such as 
the subgenual prefrontal cortex [21]. These data thus appear con- 
sistent with the P-31 MRS data reviewed previously, which sug- 
gested that the glutamine/glutamate ratio is modulated in opposite 
directions in mania vs depression. 

The recent development of PET radioligands for the translocator 
protein (TSPO) on the outer mitochondrial membranes in micro- 
glia has enabled in vivo measures of microglial activation. This 
phenomenon is a central component of neuroinflammation and is 
manifest by a range of phenotypic changes in microglial cell morph- 
ology, as well as upregulation of cellular proteins, including TSPO. 
The first study using PET-TSPO imaging in bipolar disorder found 


that individuals with bipolar disorder type 1 had elevated TSPO 
binding, relative to healthy controls, in the hippocampus [22]. 

It remains unclear whether microglial activation in these regions 
reflects a repair response or instead constitutes a pathological pro- 
cess that disrupts the normal physiology and precipitates disease 
in bipolar disorder. One line of evidence suggests that inflamma- 
tion may contribute to reductions in hippocampal volume, as pro- 
inflammatory drive on the kynurenine metabolic pathway reduces 
the concentrations of the neuroprotective metabolite kynurenic 
acid, relative to levels of the putatively neurotoxic metabolites 3- 
hydroxykynurenine and quinolinic acid. Savitz et al. [23] reported 
that this ratio was reduced in plasma in bipolar disorder individuals, 
relative to controls, and also correlated directly with hippocampal 
volume in bipolar individuals (that is, lower neuroprotective indices 
were associated with smaller hippocampal volumes). Similarly, 
within the anterior cingulate cortex, a post-mortem study of brain 
tissue from a combined sample of unipolar and bipolar depressed in- 
dividuals who died by suicide showed that microglial activation and 
quinolinic acid expression were increased to the greatest extent in 
the subgenual anterior cingulate cortex [24], which has been associ- 
ated with reduced grey matter in both mood disorder subtypes [21]. 


Interim summary 


Neural circuitry abnormalities in bipolar disorder thus comprise: (1) 
dysfunction in bilateral prefrontal cortical (especially ventrolateral 
prefrontal cortex and orbitofrontal cortex)-hippocampal-amygdala 
emotion processing and emotion regulation neural circuitries; 
(2) hyperactivity in predominantly left-sided ventral striatal- 
ventrolateral prefrontal cortex reward processing circuitry; (3) par- 
allel resting state functional connectivity abnormalities in these, and 
more widespread, neural circuitries; (4) abnormalities in regional 
grey matter structure in these neural circuitries; (5) microstruc- 
tural abnormalities in white matter tracts in these neural circuitries, 
with more recent findings suggesting myelin abnormalities in these 
tracts; (6) bioenergetic abnormalities suggestive of mitochondrial 
dysfunction and altered neuronal glial interactions; and (7) abnor- 
malities of regional glucose metabolism and microglial activation in 
the same regions showing grey matter loss. The combination of these 
functional and structural abnormalities may be characteristic of the 
behavioural abnormalities associated with bipolar disorder, that is, 
emotional lability, emotional dysregulation, and reward sensitivity 
(Figs. 71.2 and 71.3). 


Newer neuroimaging research areas 


in bipolar disorder 





(See Table 71.3.) 


Mood state-specific functional abnormalities in emotion 
processing, emotion regulation, and reward processing 
in neural circuitries 


A small number of cross-sectional studies examined individuals 
with bipolar disorder in different mood states during emotion pro- 
cessing and emotion regulation [4, 5]. One finding is abnormally 
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Table 71.3 Newer neuroimaging research areas in bipolar disorder 


Mood state- 1. Cross-sectional studies of emotion processing 
specific and emotion regulation neural circuitries: (1) 
abnormalities differential patterns of activity in orbitofrontal 
cortex, insula, ventrolateral prefrontal cortex, 
amygdala, and dorsolateral prefrontal cortex across 
different mood states; (2) differential patterns 
of emotion processing task-based connectivity 
and resting connectivity between amygdala and 
medial prefrontal cortex, hippocampus, and 
orbitofrontal cortex 
2. Longitudinal studies: differential patterns of 
prefrontal cortical and amygdala activity and 
amygdala connectivity across different mood states 





Different bipolar i 
disorder subtypes 


Abnormally decreased prefrontal cortical activity 
(especially ventrolateral prefrontal cortex) across 
different tasks in bipolar disorder type II 
2. Similar patterns of abnormally elevated activity in 
amygdala and dorsolateral prefrontal cortex during 
emotion regulation and working memory, and 
abnormally elevated left ventrolateral prefrontal 
cortical and ventral striatal activity during reward 
anticipation, in bipolar disorder types | and Il. 
Graded differences in magnitude of activity 
between bipolar disorder subtypes 
3. More widespread reductions in grey matter across 
different cortical regions in bipolar disorder type |, 
relative to bipolar disorder type 











Youth with bipolar 1. 
disorder 


Similar patterns to adult bipolar disorder of 
abnormally increased amygdala activity, decreased 
ventrolateral prefrontal cortical activity, and 
decreased prefrontal cortical-amygdala functional 
connectivity during emotion processing and 
emotion regulation 
2. Abnormally decreased amygdala volumes and 
abnormally reduced prefrontal cortical volumes 
and cortical thickness 
3. Abnormally reduced FA in white matter tracts 
connecting prefrontal cortical and subcortical 
regions 





Adults and youth 1. 
at risk of bipolar 
disorder 


Healthy at-risk adults and youth: (a) abnormally 
increased prefrontal cortical and amygdala activity 
during emotion and cognitive control tasks; 

(b) abnormally increased left ventrolateral prefrontal 
cortical activity to reward; (c) abnormally increased 
frontal and subcortical grey matter volumes; 

(d) abnormally reduced FA in white matter tracts 
connecting prefrontal cortical and subcortical 
regions 
2. Psychiatrically affected at-risk adults and youth: (a) 
predominantly abnormally increased prefrontal 
cortical activity and prefrontal cortical-amygdala 
functional connectivity during emotion regulation 
and cognitive control tasks; (b) abnormally 
increased activity in left-sided prefrontal cortical 
regions and insula, regardless of task; (c) abnormally 
decreased grey matter volumes in prefrontal and 
temporal cortical regions, striatum, and insula; but 
increased volumes in right ventrolateral prefrontal 
cortex and left caudate; (d) decreased white matter 
volume and white matter integrity, especially in 
prefrontal regions 














decreased orbitofrontal cortex activity during emotion processing 
across different mood states. Other findings indicate differing roles 
of the insula and ventrolateral prefrontal cortex in adults with bi- 
polar disorder across different mood states during emotion regula- 
tion; different mood state-specific increases in amygdala activity to 
negative emotional faces; abnormally decreased right dorsolateral 
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prefrontal cortex activity during non-emotional working memory 
across different mood states; increased ventrolateral prefrontal 
cortex-thalamic activity in adults with bipolar disorder in mixed 
mood episode vs depression during response inhibition; and failure 
to deactivate ventral regions of the medial prefrontal cortex during 
a working memory task. 

Cross-sectional studies also reported differences in amygdala- 
medial prefrontal cortical functional connectivity across different 
mood states, including differential patterns of amygdala functional 
connectivity with the medial prefrontal cortex between depressed 
and remitted individuals with bipolar disorder during emotion pro- 
cessing; differential patterns of resting state functional connectivity 
between the amygdala and the anterior cingulate cortex (pregenual 
subdivision) in manic and depressed mood states and between 
manic and euthymic states; and abnormally increased vs reduced 
resting state connectivity between the amygdala and the hippo- 
campus in mania vs depression, respectively. Contrasting patterns 
of variability in large-scale network resting state signal amplitude in 
bipolar depression vs mania are also reported [25]. 

Longitudinal studies reported an abnormally reduced extent of 
activity in prefrontal cortical regions implicated in emotion regu- 
lation during an emotional face-word interference emotion regu- 
lation task in individuals with bipolar disorder, relative to healthy 
controls, but to a greater extent during hypomanic than depressed 
and euthymic mood states; differential patterns of amygdala func- 
tional connectivity in the same individuals with bipolar disorder 
during mania vs depression; and normalized activity in the amyg- 
dala and prefrontal cortical regions in remitted vs manic adults with 
bipolar disorder during reward and motor tasks. 

While there are no clear patterns of mood state-specific func- 
tional neural abnormalities in bipolar disorder, findings suggest 
amygdala-prefrontal cortical functional abnormalities across dif- 
ferent mood states, which normalize with remission. Clearly, more 
longitudinal, within-subject studies are required to identify func- 
tional abnormalities in neural circuitries that predispose to switches 
between different mood states in bipolar disorder. 


Bipolar disorder subtypes 


A small, but increasing, number of neuroimaging studies focused on 
adults with bipolar disorder type II [4]. One study reported abnor- 
mally reduced amygdala and bilateral ventrolateral prefrontal cortex 
activity, and reduced amygdala-orbitofrontal cortex and amygdala- 
dorsolateral prefrontal cortex functional connectivity, during emo- 
tional face processing in depressed adults with bipolar disorder type 
II. Other studies reported abnormally reduced activity in different 
prefrontal cortical regions during working memory and response 
inhibition in depressed individuals with bipolar disorder type II. The 
finding of decreased prefrontal cortical activity, in particular in the 
ventrolateral prefrontal cortex, parallels that shown during similar 
tasks by euthymic and depressed adults with bipolar disorder type 
I and suggests that reduced ventrolateral prefrontal cortex activity 
during emotion processing and emotion regulation may be a trait 
marker of bipolar disorder across types I and II. 

Other fMRI studies directly compared adults with bipolar disorder 
type I and bipolar disorder type II and healthy adults. One study 
compared euthymic adults with bipolar disorder type I, euthymic 
adults with bipolar disorder type II, and healthy controls during a 
working memory task and reported a graded pattern of abnormally 


elevated activity in the dorsolateral prefrontal cortex during the task 
in individuals with bipolar disorder type I, relative to bipolar dis- 
order type II, suggesting greater functional abnormalities in pre- 
frontal cortical recruitment during the task in bipolar disorder type 
I than in type II. Another study reported similar patterns of abnor- 
mally elevated activity in the amygdala and dorsolateral prefrontal 
cortex during an emotion regulation task in both euthymic individ- 
uals with bipolar disorder type I and euthymic individuals with bi- 
polar disorder type II, but abnormally increased inverse functional 
connectivity between the amygdala and the dorsolateral prefrontal 
cortex during the task only in individuals with bipolar II disorder. 

Another study focused on reward circuitry and reported signifi- 
cantly increased ventral striatal and left ventrolateral prefrontal cor- 
tical activity in adults with bipolar disorder type II vs adults with 
bipolar disorder type I and healthy adults during reward anticipa- 
tion, again paralleling previous studies that highlighted abnormally 
increased left ventrolateral prefrontal cortex and ventral striatal ac- 
tivity during reward anticipation in euthymic and depressed adults 
with bipolar disorder type I [26]. Findings from this study are the 
first to suggest that bipolar disorder type II may be characterized by 
a greater magnitude of functional abnormalities in reward neural 
circuitry than bipolar disorder type I, supporting findings associ- 
ating bipolar disorder type II with more disabling functional impair- 
ments in daily living than bipolar disorder type I [27, 28]. 

Regarding other neuroimaging modalities, abnormally elevated 
resting state functional connectivity across several regions within a 
temporo-insular neural network, identified with independent com- 
ponents analysis, was observed in individuals with bipolar disorder 
type II. Other studies reported more widespread grey matter reduc- 
tions in the prefrontal, temporal, and parietal cortices in euthymic/ 
moderately depressed adults with bipolar disorder type I vs de- 
pressed adults with bipolar disorder type II, and greater magnitude 
of reductions in grey matter volume and cortical thickness in in- 
dividuals with bipolar disorder type I relative to bipolar disorder 
type II. Diffusion imaging studies that compared individuals with 
bipolar disorder type I and those with bipolar disorder type II re- 
ported abnormalities in fronto-thalamic-temporal white matter in 
both disorders, although findings across studies are inconsistent, 
with abnormalities only in bipolar disorder type I or, by contrast, 
abnormalities only in bipolar disorder type II. 

There is clearly a need for more neuroimaging studies comparing 
individuals with bipolar disorder type I with those with bipolar dis- 
order type II and those with other bipolar disorder subtypes. 


Neuroimaging studies adopting dimensional approaches 


Guided by the NIMH Research Domain Criteria initiative (see 
Chapter 8), neuroimaging studies are beginning to adopt a dimen- 
sional approach to the study of psychiatric disorders, including bi- 
polar disorder. One focus has been to examine individuals across 
the mood disorders spectrum. One study reported a positive correl- 
ation between reward sensitivity (fun-seeking) and ventral striatal 
activity across adults with bipolar disorder type I, adults with bipolar 
disorder type II, and healthy adults [26]. This study thus associated 
patterns of function in reward circuitry with information processing 
domains that cut across diagnostic boundaries. 

Another focus has been to examine the psychosis spectrum by 
including individuals with bipolar disorder and individuals with 
schizophrenia. Here, studies examined relationships between 


neuroimaging measures of structure and function across the whole 
brain and in neural circuitries of interest and psychotic symptoms, 
regardless of diagnosis. Recent findings include significant rela- 
tionships between psychotic symptom severity and abnormally 
reduced local resting state functional connectivity [29] and re- 
ductions in prefrontal cortical and temporal cortical grey matter 
volume [30] across individuals with schizophrenia and bipolar 
disorder. 

These findings suggest that the conceptualization of mood 
disorders and schizophrenia in terms of different psychi- 
atric spectrum disorders may lead to a better understanding of 
neuropathophysiological processes in these illnesses [31]. Here, 
for example, bipolar disorder type I, bipolar disorder type II, other 
bipolar disorder subtypes, and major depressive disorder could 
perhaps be included along a mood disorders spectrum, and schizo- 
phrenia and bipolar disorder along a psychotic disorders spectrum. 


Youth with bipolar disorder 


Studies reported in youth with bipolar disorder similar patterns to 
those in adult with bipolar disorder of abnormally increased amyg- 
dala activity, decreased ventrolateral prefrontal cortical activity, 
and decreased prefrontal cortical-amygdala functional connect- 
ivity during emotion processing and emotion regulation paradigms 
[32]. Abnormally increased amygdala activity may be more evident 
in youth than in adults with bipolar disorder. Studies in bipolar 
disorder youth also demonstrated abnormally decreased amyg- 
dala volumes; reduced orbitofrontal cortex and anterior cingulate 
cortex grey matter and reduced prefrontal cortical thickness; abnor- 
mally reduced FA in white matter tracts connecting prefrontal and 
subcortical regions, and reduced volume of the corpus callosum; 
and altered resting state and regional homogeneity in prefrontal 
cortical--amygdala-striatal circuitry and large-scale networks. 


Adults and youth at risk of bipolar disorder 


These studies can be divided into those examining healthy adults and 
youth at future risk of bipolar disorder by virtue of having a parent 
or sibling with the disorder, and those examining at-risk adults and 
youth who are already affected with some psychiatric symptoms, but 
not with bipolar disorder (reviewed in [32]). 


Healthy adults and youth at risk of bipolar disorder 


During emotion processing, healthy at-risk youth show abnormally 
elevated amygdala activity, abnormal inverse anterior cingulate- 
amygdala functional connectivity, and abnormally elevated left 
ventrolateral prefrontal cortical-amygdala functional connectivity 
during emotion processing. During emotion regulation and non- 
emotional cognitive control tasks, healthy at-risk adults and youth 
showed abnormally elevated, predominantly right-sided activity in 
frontal control regions (ventrolateral and dorsolateral prefrontal 
cortices) and in the amygdala, and decreased right ventrolateral 
prefrontal cortical-amygdala and dorsolateral prefrontal cortical- 
amygdala functional connectivity. Regarding reward processing, 
healthy at-risk youth showed abnormally elevated left ventrolat- 
eral prefrontal cortical activity to reward, and abnormally reduced 
functional connectivity between the ventrolateral prefrontal cortex 
and the anterior cingulate cortex during reward anticipation. In 
unaffected relatives of adults with bipolar disorder, abnormally ele- 
vated right orbitofrontal cortical activity to reward, but abnormally 
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elevated left orbitofrontal cortical activity to loss, was reported. 
Resting state findings indicate abnormally reduced resting state 
functional connectivity between the amygdala and the anterior cin- 
gulate cortex and between the amygdala and the ventrolateral pre- 
frontal cortex in healthy at-risk youth. 

Regarding structural findings, the main pattern is abnormally in- 
creased grey matter volume in several areas, including the amygdala, 
the bilateral ventrolateral prefrontal cortex, the left dorsolateral pre- 
frontal cortex, the left parahippocampal gyrus, and the left caudate. 
Diffusion imaging studies indicate abnormal, predominantly right- 
sided decreases in FA and volume in white matter tracts connecting 
prefrontal cortical and subcortical regions. 

These findings may represent two separate kinds of markers: those 
representing resilience and those representing risk. Resilience 
markers might include increased prefrontal activity and increased 
frontal and subcortical volumes, given that at-risk individuals who 
demonstrated these patterns were psychiatrically healthy at the time 
of study. Risk markers might include increased amygdala activity 
and abnormal prefrontal white matter, as these patterns are similar 
to findings from studies of adults with bipolar disorder. 


Psychiatrically affected adults and youth at future risk 
of bipolar disorder 


Regarding emotion processing, findings indicate abnormally ele- 
vated amygdala activity, abnormal inverse anterior cingulate- 
amygdala functional connectivity and abnormally elevated left 
ventrolateral prefrontal cortical-amygdala functional connectivity 
during emotion processing, and abnormal functional connectivity 
among the ventrolateral prefrontal cortex, the anterior cingulate 
gyrus, and the dorsolateral prefrontal cortex in affected first-degree 
relatives of individuals with bipolar disorder. During emotion regu- 
lation and non-emotional cognitive control tasks, studies indicate 
abnormally increased dorsolateral prefrontal cortical activity and 
increased ventrolateral prefrontal cortical—bilateral amygdala func- 
tional connectivity in youth with milder-level behavioural and emo- 
tional dysregulation symptoms, and abnormally increased activity 
in the left frontal pole in first-degree relatives who currently have 
depression or substance abuse diagnoses. Such individuals also 
show abnormally decreased activity in cognitive control regions and 
abnormally decreased functional connectivity between prefrontal 
cortical regions implicated in emotion regulation and subcortical 
regions. 

Regarding reward processing, one study reported increased left 
ventrolateral prefrontal cortical/middle prefrontal cortical activity 
to reward, with increasing magnitude of non-specific behavioural 
and mood dysregulation symptoms in youth. A meta-analysis con- 
cluded that high-risk individuals (both those with and without a 
current psychiatric diagnosis) showed abnormally increased activity 
in left-sided prefrontal cortical regions and the insula, regardless of 
task [33]. 

Structural findings indicate abnormally decreased volumes in 
a variety of regions, including the dorsolateral prefrontal cortex, 
orbitofrontal cortex, insula, anterior cingulate cortex, ventral 
striatum, and bilateral frontal and left temporo-parietal regions. 
Increased volumes are also observed in the right ventrolateral pre- 
frontal cortex [12], the left insula, and the left caudate. Diffusion 
imaging studies report decreased white matter volume and white 
matter integrity, particularly in prefrontal regions. 
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Abnormally increased prefrontal cortical activity during emotion 
regulation and non-emotional cognitive control tasks may represent 
resilience factors in non-bipolar disorder-affected at-risk individuals. 
Similarly, findings of abnormally increased prefrontal cortical volume 
may also represent resilience factors, while findings of abnormally 
decreased prefrontal cortical volumes parallel findings in adults with 
bipolar disorder. Risk factors include left-sided subcortical volume in- 
creases that may be associated with left-sided increases in prefrontal 
cortical and subcortical activity during reward and other task per- 
formances observed across at-risk individuals and individuals with 
bipolar disorder. Diffusion imaging studies reporting decreased white 
matter volume in these at-risk individuals may also represent risk fac- 
tors as they, too, parallel findings in adults with bipolar disorder. 

Longitudinal studies are clearly needed to examine developmental 
trajectories of structural and functional changes in prefrontal cortical- 
subcortical circuitry in individuals with bipolar disorder and those 
youth at risk of future mood and psychotic disorders. Such studies will 
help identify abnormal developmental trajectories in this circuitry 
that are associated with having bipolar disorder or other mood dis- 
orders in youth, and biomarkers that can help identify which at-risk 
youth are most likely to develop these disorders in the future. 


Multimodal neuroimaging studies 


A future area for neuroimaging research in bipolar disorder is the 
use of multimodal techniques to identify structure-function rela- 
tionships in the neural circuitry. A very small number of studies 
examined structure-function relationships in the prefrontal 
cortical-amygdala circuitry in adults with bipolar disorder type 
I and bipolar disorder type II, but there is a need for more such 
studies in individuals across the mood disorders spectrum. In par- 
allel, studies are beginning to identify relationships between genetic 
variants and functioning and structure within the neural circuitry 
in adults and youth with bipolar disorder and in individuals at risk 
for bipolar disorder. Ultimately, an integrated systems approach will 
help identify biomarkers that reflect the neuropathophysiological 
processes in individuals with mood, psychotic, and other psychiatric 
disorders that span genetic, molecular, neural circuitry, and behav- 
ioural levels of investigation [31, 34, 35]. 


Neuroimaging studies using pattern 
recognition approaches 


A key criticism of neuroimaging studies is their reliance upon 
group-level statistics, rather than providing data that are useful 
at the individual level. If neuroimaging techniques are to provide 
clinically relevant information, then useful individual-level meas- 
ures of brain structure and function need to be obtained from these 
techniques. One recent advance has been to combine neuroimaging 
with pattern recognition approaches, which develop algorithms 
to automatically learn and recognize complex patterns to inform 
decision-making based on large data sets. Studies combining these 
approaches have been able to help classify individuals, case by case, 
into different diagnostic categories, including bipolar disorder vs 
major depression, based on their patterns of neural activity [36, 37] 
and grey and white matter structure [38]; bipolar disorder vs schizo- 
phrenia, based on grey matter structure [39]; and youth with bipolar 
disorder vs healthy youth, based on grey matter structure [40]; and 
also accurately discriminated between individual healthy youth 
at high genetic risk vs those at low risk for future bipolar disorder 


[41]. Combining neuroimaging with pattern recognition techniques 
thus holds much promise for future elucidation of clinically useful, 
individual-level biomarkers to inform diagnosis, risk identification, 
and personalized treatment choice. 


Limitations of extant studies 


There are many limitations of existing neuroimaging studies in bi- 
polar disorder. Firstly, the majority of studies, especially studies 
employing fMRI or resting state, recruited relatively modest (for 
example, <30) numbers of participants per group, thereby allowing 
only limited conclusions about the generalizability of the find- 
ings to the wider population of individuals with bipolar disorder. 
Similarly, there are few studies comparing individuals with bipolar 
disorder across different mood states and few replication findings, 
especially for fMRI and resting state studies. For resting state studies, 
this is likely due to the different resting state methodologies used, 
in addition to modest sample sizes. Many studies focused solely on 
a priori prefrontal cortical-subcortical regions of interest, with little 
reporting of findings in other regions, thereby limiting inferences 
about potential roles of other neural circuitries in bipolar disorder. 
Furthermore, there is a dearth of neuroimaging studies directly com- 
paring different bipolar disorder subtypes (for example, bipolar dis- 
order type I vs bipolar disorder type II) or bipolar disorder vs other 
major psychiatric disorders (for example, schizophrenia). It is there- 
fore difficult to determine the extent to which bipolar disorder sub- 
types or different psychiatric disorders share, or are distinguished 
by, underlying neural mechanisms. Such studies can identify neural 
biomarkers reflecting these neural mechanisms that can aid diag- 
nosis and treatment choice, particularly for those disorders that are 
often difficult to distinguish, based on clinical assessment alone, for 
example, bipolar disorder type I vs bipolar disorder type II, bipolar 
disorder types I and II vs major depressive disorder, and bipolar dis- 
order vs schizophrenia. There are also few multimodal neuroimaging 
studies examining relationships between structure and function in 
neural circuitries of interest in bipolar disorder, or between resting 
and task-related functional connectivity in these neural circuitries in 
bipolar disorder. These studies will facilitate more in-depth under- 
standing of the neural mechanisms underlying bipolar disorder. 

Another major criticism is the potentially confounding effects 
of psychotropic medication upon neuroimaging measures. An 
increasing number of studies in bipolar disorder suggest that psycho- 
tropic medications either have normalizing effects on neuroimaging 
measures or do not significantly impact these measures [42], al- 
though, as is apparent from these studies, lithium in particular may 
have neurotrophic effects in some neural regions in bipolar disorder, 
while antipsychotic medications are associated with grey matter de- 
creases [43]. Further studies are thus needed to determine the na- 
ture of effects of specific medications on neural circuitries of interest 
in bipolar disorder. Longitudinal neuroimaging studies examining 
individuals pre- and post-medication can address this important 
point, as can large cross-sectional studies comparing medication- 
free individuals with those taking different medication types. 


The field of neuroimaging in bipolar disorder is progressing con- 
siderably, with findings from these studies making significant 


contributions to understanding of the neuropathophysiology of 
bipolar disorder [4]. To move the field forward, the next wave of 
bipolar disorder neuroimaging studies should aim to adopt the 
following strategies. Firstly, studies should examine emotion pro- 
cessing, emotion regulation, and reward neural circuitry functional, 
structural, white matter, and intrinsic connectivity abnormalities as- 
sociated with dimensions of pathological behaviours that cut across 
conventionally defined bipolar disorder and other mood disorder 
diagnostic categories. These studies should also include longitudinal 
designs to identify the extent to which alterations in these neural 
circuitry abnormalities are associated with changes in affective state. 
This approach has potential to identify neural circuitry markers 
that better reflect neuropathophysiological processes in bipolar dis- 
order and other mood disorders and the nature of neural mechan- 
isms underlying abnormal mood switches. Secondly, studies should 
examine developmental trajectories of these neural circuitries in 
individuals with bipolar disorder across the lifespan and in at-risk 
youth, with longitudinal follow-up designs. This approach will iden- 
tify neural circuitry markers that can help identify those individuals 
at highest risk of developing future affective pathology, and thereby 
pave the way forward for studies that utilize these markers to guide 
early intervention and prevention strategies. Thirdly, studies should 
incorporate multimodal neuroimaging techniques and biological 
system level approaches, to examine the impact of genetic variation 
and molecular-level processes upon neural circuitry development 
in at-risk individuals and individuals with bipolar disorder and 
other mood disorders. Fourthly, studies should take advantage of 
advances in the application of pattern recognition techniques to 
neuroimaging to identify individual-level neural circuitry markers 
that not only help classify individuals into present diagnostic groups, 
but also help predict individual-level future clinical course. Fifthly, 
collectively, these four approaches will help elucidate more complex 
neuropathophysiological processes underlying the dimensions of 
abnormal behaviours associated with affective pathology and yield 
individual-level biological markers reflecting these processes that 
have clinical utility for diagnosis, future illness development pre- 
diction, and guiding personalized treatment choice in at-risk and 
mood-disordered individuals. 
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Introduction 


The pharmacological treatment of bipolar disorder (BD) is complex 
because it combines different therapeutic approaches and needs to 
take into account acute phases and long-term management [1, 2]. 
Several factors are involved in the treatment choice such as the pre- 
dominant polarity [3], the risk of suicide, the treatment adherence, 
and the polarity index of a drug [4, 5]. Clinical guidelines have been 
published recently on the latest developments on pharmacological 
treatment of BD [6-10]. A summary of the clinical management of 
BD is synthesized in Table 72.1. 

The challenge of treating bipolar disorder in women in the peri- 
natal period is considered in some detail in Chapter 73. 


Mania and mixed states 


The treatment of mania aims at managing the core manic symptoms 
and psychosis, at reducing any behavioural component typically as- 
sociated with this condition, and at minimizing the risk of side ef- 
fects in order to improve treatment adherence [11, 12]. 

A recent meta-analytic review (N = 13,073) [13] found that dopa- 
mine antagonists/partial agonists are more effective than anticon- 
vulsants for the treatment of manic episodes, making them the most 
appropriate short-term choice for mania. Haloperidol, risperidone, 
and olanzapine (OLZ) were ranked as the most effective. Quetiapine 
(QTP), risperidone, and OLZ showed the best results for tolerability. 

Another recent meta-analysis showed the efficacy for mania of 
aripiprazole (ARP), asenapine (ASN), cariprazine, haloperidol, 
OLZ, paliperidone (PAL), QTP, risperidone, and ziprasidone (ZIP), 
in comparison with placebo [14]. A network meta-analysis per- 
formed by the same author found that ARP, OLZ, QTP, risperidone, 
and valproate (VPA) had lower all-cause discontinuation rates 
than placebo. Moreover, sensitivity analysis by drug class indicated 
similar efficacy profiles for haloperidol, newer dopamine antagon- 
ists/partial agonists, and mood stabilizers [15]. 

The availability of injectable extended-release formulations 
(of OLZ, risperidone, ARP, PAL, and ZIP) may also represent a 








therapeutic option in mania, with the aim to improve treatment 
adherence [16]. 

Combined treatment with lithium or VPA and a dopamine an- 
tagonist/partial agonist should be considered when patients show 
breakthrough mania with the first agent [6-10]. 


Bipolar depression 


In contrast to the treatment of mania, enthusiasm for pharmaco- 
logical strategies in BD depression is tempered by uncertainty, des- 
pite the high burden and prevalence of this phase of BD [17, 18]. 
Currently, none of the available antidepressants (that is, drugs li- 
censed for the indication of major depression in a unipolar illness 
course) is officially recognized as monotherapy for the treatment 
of BD despite their common use in clinical practice. To date, only 
QTP (approved by the FDA and EMA), OLZ-fluoxetine combin- 
ation, and lurasidone (LUR) (approved in some countries, but not 
by EMA) have regulatory approval for BD depression. 

OLZ has been found to be effective in BD depression both in 
monotherapy and in association with fluoxetine [19, 20]. The effi- 
cacy of QTP monotherapy for the treatment of BD depression has 
been demonstrated in two placebo RCTs of similar design [21, 22]. 
Another two subsequent RCTs showed some superiority of QTP 
monotherapy over both lithium and paroxetine in BD depressive 
episodes [23, 24]. Two recent RCTs have demonstrated the efficacy 
of LUR monotherapy or in combination with lithium or VPA in 
BD depressed patients [25]. A more recent study of adjunctive LUR 
found that the significant improvement in depressive symptoms on 
the Montgomery-Asberg Depression Rating Scale (MADRS) from 
weeks 2 to 5 disappeared at week 6 [26]. 

Despite preliminary evidence suggesting a possible beneficial ef- 
fect of adjunctive ARP on BD depression [27], two subsequent RCTs 
failed to find any significant superiority of ARP over placebo [28]. 

With respect to lithium and anticonvulsants, use of lithium in 
monotherapy for an episode of BD depression seems to have no sup- 
port, based on available evidence [23]. Nonetheless, for its proven 
effectiveness as prophylaxis of BD, as an augmentation strategy 
of unipolar depression, and for its effect in preventing the risk of 
suicide [29], lithium continues to be considered as a first-choice 
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Table 72.1 Pharmacological management of bipolar disorder (BD) in mania, depression, and maintenance phases' 


Clinical management’ Advantages 





Mood stabilizers 


Valproate 


Lamotrigine 


Lithium 
Carbamazepine 


Oxcarbazepine 


Antipsychotics 


Aripiprazole 


Asenapine 
Lurasidone 


Olanzapine 


Paliperidone 
Quetiapine 


Risperidone 


Ziprasidone 


Antidepressants 


CYP, cytochrome; EPS, extra-pyramidal symptoms. 


+++ 


+++ 


+++ 


+++ 


+++ 


+++ 


++ 


+++ 


++ 


++ 


++ 


+++ 


++ 


+++ 


+++ 


++ 


++ 


++ 


++ 


+++ 


++ 


++ 


Useful in BD episodes with mixed features 
Prophylactic effect in BD 
Possible effect in bipolar depression 


Predominant depressive polarity, depressive 
recurrences 


Possible effect in bipolar depression 


Anti-suicidal properties, prophylaxis in BD, manic- 
predominant polarity 


Prophylactic effect in BD, less effective than lithium 
and valproate, in BD with non-classical features 
Fewer adverse effects than carbamazepine. 


Prophylactic effect in BD, less effective than lithium 
and valproate, in BD with non-classical features 





anic-predominant polarity, lower rates of 
discontinuation 








ntramuscular long-acting formulation available 





anic-predominant polarity 

Possible treatment of BD with mixed features 
Predominant depressive polarity 

Approved by FDA for bipolar disorder 
anic-predominant polarity, manic recurrences 
ntramuscular long-acting formulation available 


Olanzapine + fluoxetine approved by FDA in 
bipolar depression 


anic-predominant polarity 











ntramuscular formulation available 
inimal liver metabolism 





[he only antipsychotic with indications for both 
acute episodes and maintenance in BD, for manic- 
and depressive-predominant polarity. Approved by 
FDA and EMA for bipolar depression 


Manic-predominant polarity, prevention of manic 
relapses, not depressive (risperidone long-acting 
injectable) 








Manic-predominant polarity 
Intramuscular long-acting formulation available 





Controversial issue. Not in monotherapy in BDI 
disorder. Applicable in resistant bipolar depression 
combined with mood stabilizers in individual cases 


Disadvantages 





CYP450 inhibitor, not recommended 


in women at childbearing age 
Hyponatraemia 


Slow titration 
Risk of rush 


Not recommended in renal failure 


CYP450 inducer 
Effects on leucocyte count 


Hyponatraemia 


Akathisia 


Moderate metabolic syndrome 
Efficacy related to feeding 


Akathisia, sedation 


Severe metabolic syndrome 





igh doses are often needed 











yperprolactinaemia 


Sedation 


Risk of switch to depression, EPS 
Hyperprolactinaemia 


Good metabolic profile 


Risk of switch to mania, mixed 
states and rapid cycling, mood 
destabilization 


' Reported clinical management reflects our interpretation of available evidence and do not necessarily imply regulatory endorsement. For further information, refer to guidelines 


[6-10]. The list includes some clinically significant adverse effects which can be experienced by a fraction of patients exposed. This list is by no means exhaustive and is not meant as a 
comparison between the different drugs. 


? Olanzapine + fluoxetine. 


+++, highly recommended; ++, very recommended; +, recommended; -, not much recommended; - -, not recommended; - - -, not at all recommended. 


strategy in BDI depression by many guidelines on BD, as well as in 
clinical practice [9]. 

Use of VPA monotherapy for BD depression was supported in a 
recent review and meta-analysis of four small studies [30]. The effect 
of lamotrigine (LTG) on BD depression appeared weak ina first ana- 
lysis of five double-blind RCTs [31]. However, more recent studies 
on LTG, mostly as adjunctive treatment to lithium or QTP, found a 
modest effect in BD depression [32-34]. Some clinical guidelines 
consider it as a first- or second-choice therapeutic strategy for the 
treatment of acute depressive phase [7]. 

One of the most controversial issue in BD is the use of ADs, 
since it has been associated with the risk of (hypo-) manic or mixed 
switch, with the risk of suicide or with a rapid cycling course [18, 
35-38]. The International Society for Bipolar Disorders (ISBD) has 
recently assembled an international group of experts to find a clin- 
ically and evidence-based consensus on this issue [39]. They con- 
cluded that the use of ADs to treat depressive phases of BD should 
neither be condemned nor endorsed without carefully evaluating 
individual clinical cases. In general, antidepressant monotherapy 
should be avoided in BD. If ADs are used, they should be prescribed 
along with an anti-manic treatment. In BD types I and II depression, 
with two or more concomitant core manic symptoms and with a his- 
tory of rapid cycling or mood instability, ADs should be used with 
caution, due to the risk of worsening of the illness course. Treatment 
with some ADs in BD type II appears to be otherwise relatively well 
tolerated. Multi-functional ADs acting on serotonergic and adren- 
ergic systems appear to carry a particularly high risk of inducing 
pathologically elevated states of mood and behaviour and should be 
used with caution [39]. 

During the last years, new therapeutic strategies have been inves- 
tigated for the treatment of BD depression. Limited evidence of ef- 
ficacy in BD depression has been found for the dopaminergic drug 
modafinil and its R-enantiomer armodafinil [40, 41]. 

Several lines of research have suggested dysfunction of N-methyl- 
D-aspartate (NMDA) receptors could play an important role in the 
pathophysiology of BD [42], and a single intravenous dose of the 
NMDA receptor antagonist ketamine has been found to be effective 
in treating severe and resistant acute BD depression. Unfortunately, 
this effect is relatively short-lasting, so that finding a place in the 
long-term treatment is problematic [43, 44]. 
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Maintenance phase 


The primary objectives of maintenance treatment of BD are 
preventing new (hypo)manic/mixed or depressive episodes and sui- 
cidal behaviour and improving residual symptoms and treatment 
adherence, but also optimizing functional recovery and quality of 
life [2]. 

In considering long-term treatment, the predominant polarity of 
the individual's illness [45] and the polarity index (PI) of any drug 
[4, 5] should be taken into account. The PI indicates the relative ef- 
fectiveness in preventing mania against the efficacy in preventing 
depression of drugs commonly used for maintenance treatment of 
BD (Fig. 72.1); it is calculated as the ratio of the number needed 
to treat (NNT) for the prevention of depression and the NNT for 
preventing mania, based on the results of relevant RCTs for main- 
tenance treatment of BD. 

A PI >1 indicates a greater relative anti-manic prophylactic po- 
larity; a PI <1 indicates a greater relative antidepressant prophylactic 
polarity, while a PI near to 1 indicates similar relative manic and 
antidepressant prophylaxis, so a possible ideal ‘mood stabilizer’ 

A predominantly anti-manic PI was found for PAL, ARP, 
risperidone (long-acting injectable, RLai), and ZIP, followed by 
OLZ, ASN, and lithium. Of all the agents tested, QTP presented a 
PI closest to 1. A predominantly antidepressant PI was found for 
LTG, oxcarbazepine (OXC), and VPA. From available recent data 
(not fully published), LUR would represent the first dopamine an- 
tagonist with a PI inferior to 1, so predominantly an antidepressant. 
The PI for VPA and OXC appears <1 but should be interpreted with 
caution because clinical trials for these two drugs showed no stat- 
istical superiority over placebo. In general, anticonvulsants (VPA, 
OXC, and LTG) appear to be slightly more effective in preventing 
depressive episodes, while dopamine antagonists/partial agonists 
(with the exception of QTP and LUR) and lithium are more effective 
in preventing mania. In maintenance, monotherapy is the goal, but 
combination treatment is often needed. Typical combinations are 
lithium or VPA with OLZ, QTP, LTG, or ARP, especially in patients 
with a depressive-predominant polarity or in the absence of a pre- 
dominance of polarity. When monotherapy is preferred, in patients 
with a manic-predominant polarity, lithium, risperidone (RIS), ZIP, 
ARP, PAL, ASN, and OLZ can be used, while LTG and LUR should 
be used when depression prevails during the course of the illness. 
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Fig. 72.1 Long-term relative efficacy according to the polarity index. 
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RCT data suggest QTP monotherapy should prevent both phases of 
the illness (Fig. 72.1). 

Regarding the efficacy of mood stabilizers and lithium during 
maintenance, lithium remains the most effective treatment in 
preventing manic relapses and, less effectively, depressive relapses 
and in reducing the risk of suicide and hospitalizations [29, 46]. 

The efficacy of LTG in preventing depression in BD patients with 
a predominantly depressive polarity is considered the true strength 
of this drug [47]. 

VPA monotherapy is considered less effective than lithium in 
the prevention of relapse and more effective in depressive relapses, 
and should not usually be considered for women of childbearing 
potential [48]. 

Carbamazepine (CBZ) monotherapy is less effective than lithium 
in preventing recurrences, and the main risk is the unsafe side effect 
profile and interference with the metabolism of other drugs [17]. 
OLZ has been studied as a comparator to depot risperidone (RLai) 
and showed a reduction in manic and depressive relapses [49]. 

The long-term use of LUR was demonstrated by a recent 28-week 
study of continued treatment with LUR plus lithium or VPA, with a 
trend in reduction in time to recurrence of any mood event, com- 
pared with placebo, and a significant reduction in time to recurrence 
of a depressive episode [50]. ARP in monotherapy has shown super- 
iority over placebo for acute and continuation treatment of mania 
(no depression), with low rates of discontinuation [15, 51]. Similarly, 
ZIP and PAL have demonstrated a positive effect in preventing 
manic recurrences [52, 53]. 

The use of injectable formulations (LAIs) in long-term treatment 
may be considered in cases of poor adherence (that is, more than 
30% of BD patients) [54]. LAIs could be used in BD patients where 
the treatment plan is continuation with dopamine antagonists, but 
adherence to oral medication is poor. The available data for LAI 
risperidone are positive for preventing mania, but not depression 
[49, 55]. PAL extended-release significantly delayed the time to re- 
currence of any mood symptoms vs placebo in a randomized study 
of maintenance treatment in patients with BD after an acute manic 
or mixed episode [53]. 

Evidence for the long-term prophylactic effect of antidepressant 
treatment of patients with BD type I or II remains poorly studied, 
despite common clinical use of antidepressants; the ideal use is al- 
ways in combination with an anti-manic drug [39]. 


Psychosocial management of bipolar disorder 





Given the high rates of recurrence among patients with BD when 
maintained on standard pharmacotherapy [56], effective clinical 
practice recommends combining pharmacotherapy with targeted 
psychotherapy. There is currently no agreement on when psy- 
chotherapy should be initiated or how long it should last. There 
is, however, agreement that targeted psychotherapy should be 
psychoeducational, meaning that patients should be given didactic 
instructions and skill training that will enable them to recognize and 
manage their mood swings, cope with life stress, and maximize the 
quality of life. Several forms of psychotherapy have emerged as ef- 
fective in the stabilization and long-term maintenance of BD in dif- 
ferent age groups [57-59]. 


Psychoeducational approaches 


In structured group psychoeducation, between eight and 12 pa- 
tients with BD meet weekly for a group session led by a mental 
health clinician and focusing on illness awareness, treatment com- 
pliance, early detection of prodromal symptoms and recurrences, 
and lifestyle regularity. Colom et al. [60] assessed the efficacy of 
group psychoeducation among 120 adult patients with bipolar 
type I who had been in remission for at least 6 months. All pa- 
tients received mood-stabilizing medications. One group received 
21 weekly sessions of structured group psychoeducation, and the 
other group 21 non-didactic group support sessions. At the end of 
2 years, fewer of the group psychoeducation patients (67%) than the 
control patients (92%) had relapsed, and fewer had been hospital- 
ized. The group differences remained over a 5-year post-treatment 
follow-up [61]. 

A replication study was carried out in Brazil [62]. A total of 55 
bipolar type I and II outpatients in remission received 16 sessions 
of group psychoeducation (using the Colom et al. model) or 16 
sessions of a ‘matched’ (that is, non-randomized) group placebo 
treatment. Over 1 year, there were no differential effects of group 
psychoeducation on recurrence or psychosocial functioning, al- 
though patients in group psychoeducation subjectively reported 
more clinical improvement. Non-randomized selection of the com- 
parison participants may have attenuated treatment effects. 

Two large-scale studies examined group psychoeducation within 
the context of larger care management programmes. Bauer et al. [63] 
compared ‘collaborative care management’ (CCM) for 306 BD pa- 
tients treated at 11 U.S. Department of Veterans Affairs sites. The 
core of treatment—the Life Goals programme—emphasized illness 
management skills. CCM also included enhanced access to care 
through a nurse co-ordinator and support for physicians to follow 
medication practice guidelines. Over 3 years, patients in CCM spent 
fewer weeks in manic episodes and had greater improvements in 
social functioning and quality of life than patients who received care 
as usual. 

In a group health network, Simon et al. [64] randomly assigned 
441 patients to a similar care management intervention or to treat- 
ment as usual (TAU). Care management consisted of pharma- 
cotherapy, group psychoeducation sessions, telephone-based 
monitoring, interdisciplinary care planning, and relapse prevention 
planning. Over 2 years, patients in care management had signifi- 
cantly lower mania scores and spent less time in manic or hypo- 
manic episodes than those in TAU, but there were no effects on 
depressive symptoms. Both studies strongly support the beneficial 
effects of psychoeducation about illness management. It was not 
possible to determine the unique contributions of each treatment 
component to outcomes. 

A 7- to 12-session programme of individual psychoeducation 
with medication was found to be more effective than routine care 
and medication [65]. Individual psychoeducation, consisting of in- 
struction on how to identify early warning signs of recurrence and 
obtain medical intervention, was associated with longer intervals 
prior to manic recurrences and enhanced social functioning. 

There have been recent attempts to test the effects of 
psychoeducationally oriented clinics on the course of BD. In a 
randomized trial in Denmark, Kessing et al. [66] assigned 158 pa- 
tients to treatment in a specialized outpatient clinic that integrated 


pharmacotherapy with group psychoeducation or to a standard 
care outpatient clinic. There was an overall beneficial effect of 
psychoeducationally oriented clinics on rehospitalization, but also 
a statistical trend for those between the ages of 18 and 25 years to 
benefit more than those over the age of 25. This study may indicate 
that younger (or less recurrent) patients may benefit more from 
psychoeducation than older and more recurrent patients. 

In summary, group psychoeducation and individual relapse pre- 
vention strategies are effective adjuncts to pharmacotherapy in re- 
ducing the risk of mood relapses. Studies that examine treatments at 
the clinic level, rather than the individual patient level, may be quite 
informative for community dissemination. 


Functional remediation 


A multi-site study in Spain has examined the effectiveness of 21 90- 
minute sessions of functional remediation, in comparison with 21 
sessions of structured psychoeducation groups [61] or TAU [67]. 
All patients received medication management. Functional remedi- 
ation addresses neurocognitive issues through exercises to enhance 
memory, attention, problem-solving, reasoning, multitasking, and 
organization. In 183 euthymic patients who completed 21 weeks 
of treatment, functional remediation was more effective than TAU 
and as equally effective as structured psychoeducation in enhancing 
functioning. 


Family intervention approaches 


Family interventions for BD first appeared in the 1970s (for example, 
[68]). Miklowitz and Goldstein [69, 70] developed family-focused 
treatment (FFT) for patients with BD who had just had an episode 
of mania or depression. This approach, given in 21 sessions over 
9 months, consists of psychoeducation about BD for the patient and 
family members (parents, siblings, spouse), to assist in identifying 
and intervening in prodromal symptoms of relapse; communication 
enhancement training, and problem-solving skills training. The core 
assumption of FFT is that enhancing knowledge of BD, reducing 
high expressed emotion (critical, hostile, or overprotective attitudes 
in caregivers), and enhancing family interactions will promote en- 
vironments that are protective against relapses. 

In the Colorado Treatment/Outcome study [71], BD type I pa- 
tients who were in the process of recovering from an acute episode 
(N= 101) were randomly assigned to 9 months of FFT with pharma- 
cotherapy or crisis management (CM) (two sessions of family 
psychoeducation plus crisis intervention sessions) with pharmaco- 
therapy. Over 2 years, patients in FFT were three times more likely 
to complete the study without relapsing (52% vs 17%) and had 
longer periods of stability without relapse (73.5 weeks vs 53.2 weeks) 
than those in CM. They also had greater improvements over time 
in depression, less severe manic symptoms, and better adherence to 
medications than patients in CM. 

Rea and colleagues [72] compared FFT plus pharmaco- 
therapy to an equally intensive (21 sessions), individually focused 
psychoeducational treatment plus pharmacotherapy for BD type 
I patients who had just been hospitalized for a manic episode 
(N = 53). Over a 1- to 2-year post-treatment follow-up, patients in 
FFT had much lower rates of rehospitalization (12%) and symp- 
tomatic relapse (28%) than patients in individual therapy (60% and 
60%, respectively). There were no differences between the groups in 
medication regimens or adherence over 2 years. 
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Family focused treatment for adolescents 
with bipolar disorder 


In a two-site randomized trial, Miklowitz et al. [73] assigned 58 ado- 
lescents with BD type I, type II, or not otherwise specified to a develop- 
mentally appropriate version of FFT (FFT-A) plus pharmacotherapy 
or a 3-session ‘enhanced care’ (EC) family psychoeducational treat- 
ment plus pharmacotherapy. Over 2 years, adolescents in FFT-A 
had more rapid time to recovery from depressive symptoms, less 
time in depressive episodes, and greater stabilization of depressive 
symptoms over 2 years than adolescents in EC. Comparable effects 
were not observed on manic or mixed symptoms. A secondary ana- 
lysis revealed that adolescents in high expressed emotion families 
showed greater reductions in both mania and depression if they re- 
ceived FFT-A than if they received EC [74, 75]. 

A second trial involving three study sites and 145 adolescents with 
BD type I or II did not replicate these results [76]. Adolescents in 
FFT-A and the 3-session EC comparator did not differ in time to 
recovery from their acute episode at study entry or in time to relapse 
over 2 years. However, the stage of treatment had a moderating effect 
on FFT-A; during the 1-year post-treatment phase, adolescents in 
FFT-A had significantly lower mania symptom scores than those in 
EC, and significantly higher quality of life scores. Families may need 
time to absorb communication, problem-solving, and relapse pre- 
vention skills that are the focus of FFT. Once skills are put into use, 
family environments may become more protective against mood ex- 
acerbations in bipolar illness. 


Family focused treatment for children at risk 
for bipolar disorder 


FFT has been tested in children and adolescents who are at risk for 
BD—those with a family history of BD type I or II plus a current 
diagnosis in the child of major depressive disorder (MDD), BD not 
elsewhere classified (NEC: recurrent manic or hypomanic phases 
that do not meet the DSM-5 duration criteria), or cyclothymic dis- 
order. In a 1-year RCT, 40 high-risk children (aged 9-17) with MDD 
or BD-NEC were randomly assigned to either 12 sessions of FFT 
high-risk version (FFT-HR) or a 1- or 2-session education con- 
trol [77]. The FFT-HR focused on mood management, sleep/wake 
rhythm regulation, and communication/problem-solving exercises 
to reduce family tension. The participants in FFT-HR demonstrated 
more rapid recovery from their initial mood symptoms, more weeks 
in remission from mood symptoms, and more improvement in 
hypomania symptoms over 1 year than participants in the education 
control. Once again, the largest treatment effect sizes were among 
children in high expressed emotion families. 

FFT is only one approach to family intervention. Reinares et al. 
[78] examined 12 sessions of group psychoeducation for care- 
givers of adult BD patients (for example, early detection of pro- 
dromes, promoting healthy sleep), stress management strategies, 
and encouraging self-care through expanding one’s social net- 
work, exercise, and other means. Compared to usual care, caregiver 
psychoeducation groups were associated with longer time to manic 
recurrence in patients over 18 months. Caregivers also had increased 
knowledge of how to manage BD, decreases in subjective feelings of 
burden, and fewer attributions of blame towards the patients [78]. 

Other investigators have examined family intervention protocols 
that are more suited to younger bipolar patients. Multiple family 
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groups [79] and a 12-session integrated programme of individual 
CBT and FFT sessions known as the ‘Rainbow’ programme [80] 
were both found to be effective in randomized trials for children 
with bipolar spectrum disorders (aged 12 or under). 


Cognitive behavioural therapy 


CBT aims to modify maladaptive thoughts through restructuring 
of patients’ self-defeating thoughts or beliefs and behavioural com- 
ponents such as scheduling pleasurable events to improve mood. 
Therapy usually incorporates problem-solving and interpersonal 
skills training as well. Several major trials of CBT have been con- 
ducted in BD. These trials have tested CBT manuals that differed to 
some extent (for example, whether or not patients were assumed to be 
in remission or not), which may, in part, explain the disparate results. 

Lam and colleagues [81], compared pharmacotherapy plus 
6 months of CBT (12-18 sessions) with pharmacotherapy plus TAU 
for 103 patients in remission from mood episodes. At 1 year, relapse 
rates were 44% in CBT and 75% in usual care. Patients in CBT also 
spent fewer days in illness episodes. One year to 30 months after 
treatment, CBT no longer prevented relapse, relative to usual care, 
but continued to show a positive influence on mood symptoms [82]. 
Unlike the individual psychoeducation study [65], the effects of CBT 
were stronger on depression than on mania. 

A UK trial conducted across five sites [83] compared 22 sessions 
of CBT plus medication to TAU plus medication for 253 patients 
who began in various clinical states. No effects of CBT vs TAU were 
observed on time to recurrence over 18 months. A post hoc ana- 
lysis revealed that patients with fewer than 12 prior episodes had 
fewer recurrences if treated with CBT than with TAU, whereas the 
opposite pattern was apparent among patients with 12 or more epi- 
sodes. It is not clear whether patients with fewer than 12 episodes 
were less ill/episodic or younger or had been ill for fewer years than 
patients with 12 or more episodes. 

In a four-site Canadian trial [84], 204 patients were randomly as- 
signed to 20 weekly sessions of CBT or to six group psychoeducation 
sessions (using Bauer and McBride's Life Goals [63] programme) 
with pharmacotherapy. There were no differences in relapses or 
symptom severity over 72 weeks. Group treatment was clearly the 
more cost-effective alternative in this study. 

Meyer and Hautzinger [85] examined 76 patients in a variety of 
clinical states who were randomly assigned to CBT plus pharma- 
cotherapy (20 therapy sessions over 9 months) or an equally in- 
tensive individual supportive therapy plus pharmacotherapy. Over 
33 months, there were no differences between the groups in relapse 
or time to relapse. 

Thus, no RCT has shown the superiority of CBT over other forms 
of therapy matched on duration and the number of sessions. Meta- 
regression analyses of the psychosocial treatment literature may be 
able to specify the conditions under which CBT is effective in BD. 
Trials have varied in the heterogeneity of patient symptomatic states at 
entry, the number of prior episodes, the intensity of the comparator, or 
the nature of the outcome variable. It may also be that ‘third-wave’ ap- 
proaches, such as dialectical behaviour therapy or mindfulness-based 
cognitive therapy, will prove effective where CBT was not [86, 87]. 


Interpersonal and social rhythm therapy 


The interpersonal and social rhythm therapy (IPSRT) is based on the 
assumptions that: (1) symptoms of BD are triggered by disruptions 


in daily routines and sleep/wake cycles; and (2) stabilization of these 
routines is essential to mood stabilization. IPSRT begins during, or 
shortly following, an acute period of illness and focuses on stabilizing 
daily and nightly rhythms and resolving interpersonal problems that 
preceded the acute episode. Patients learn to track their routines and 
sleep/wake cycles and identify events (for example, job changes) that 
may provoke changes in these routines. 

In the Pittsburgh Maintenance Therapies study [88], 175 acutely 
ill patients were randomly assigned to IPSRT or active clinical man- 
agement, both given weekly with medication management. Clinical 
management emphasized symptom control and medication adher- 
ence. Once patients were stabilized, they were randomly reassigned 
to IPSRT or active clinical management for a 2-year maintenance 
phase. The findings suggested that IPSRT in the acute phase was 
associated with longer time before recurrence in the maintenance 
phase than was clinical management, regardless of what treatment 
patients were assigned during maintenance care. IPSRT was most 
effective in delaying recurrences in the maintenance phase when pa- 
tients succeeded in stabilizing their daily routines and sleep/wake 
cycles during acute treatment. 

A study of IPSRT in New Zealand [89] examined 100 adolescent 
to young adult patients (aged 15-36) who were randomly assigned 
to 26-78 weeks of IPSRT or supportive care, both given with medica- 
tions. There were no differences between groups—over the interval, 
both groups improved in depressive symptoms, manic symptoms, 
and social functioning. It is unclear whether the degree of change 
was attributable to non-specific elements of the two treatments 
(for example, duration of care or number of sessions), the effects of 
pharmacotherapy, or the passage of time. 

Swartz et al. [90] compared IPSRT with placebo vs IPSRT plus QTP 
in 92 patients with bipolar type II depression, one of the few studies 
to compare psychotherapy to medications directly. Patients in the 
two groups had equivalent rates of treatment response over 20 weeks 
(60% in IPSRT plus placebo vs 74.5% in IPSRT plus QTP). Patients 
in the IPSRT plus QTP group had greater stabilization of Hamilton 
depression scores (without evidence of treatment-emergent manic 
symptoms) but were also more likely to gain weight during the trial. 
Identification of patients most likely to benefit from psychotherapy 
as monotherapy is an important direction for research in BD. 


The STEP-BD study 


In the U.S. NIMH’s multi-site study of BD, the Systematic Treatment 
Enhancement Program for Bipolar Disorder (STEP-BD) [91], in- 
vestigators at 15 sites compared IPSRT, FFT, and CBT (30 sessions 
over 9 months) to a three-session individual psychoeducational 
intervention (collaborative care, or CC) on time to recovery from a 
bipolar type I or II depressive episode. Patients (N = 293) received 
pharmacotherapy with at least one mood stabilizer with or without 
adjunctive ADs, atypical antipsychotics, or anxiolytics. Over 1 year, 
patients under intensive therapy conditions were more likely to re- 
cover from depression (64%) and recovered more rapidly (mean 
169 days) than patients in CC (52%, 279 days, respectively). One- 
year rates of recovery, which did not statistically differ, were 77% for 
FFT, 65% for IPSRT, and 60% for CBT [92]. Patients in the intensive 
therapies were also more likely to remain well in any given month of 
the 12-month study than patients in CC and had better overall func- 
tioning, relationship functioning, and life satisfaction over time [93]. 
The STEP-BD programme suggests that intensive psychotherapy is a 


vital part of the effort to stabilize episodes of depression and enhance 
functioning in BD, although it did not show that any one of these 
approaches was better than the others. 


Conclusions 


The broad emphasis on pharmacological management in BD has 
sometimes obscured the role of adjunctive psychotherapy in modu- 
lating life stressors and enhancing coping mechanisms. Bipolar pa- 
tients may spend over one-third of their lives in states of depression 
[94], and mood-stabilizing medications are generally more effective 
for mania than depression [95]. Thus, integration of pharmaco- 
logical and psychosocial treatments for bipolar depression seems 
increasingly important. The use of psychotherapy to enhance com- 
pliance with mood-stabilizing medications and to develop a relapse 
prevention plan may also augment the protective effects of medica- 
tion maintenance. 

There are three interventions with consistent empirical evidence 
for adults and, in some cases, adolescents: group psychoeducation 
(often with accompanying care management programmes), FFT, 
and IPSRT. The evidence for CBT is too mixed at present to list 
this option as evidence-based for BD. There is beginning evidence 
for functional remediation treatment for bipolar patients with 
neurocognitive impairment. 

We have a considerable amount to learn about what elements 
of psychosocial care are most effective in stabilization or long- 
term treatment. Possibly, strategies like mindfulness meditation, 
smartphone-based mood reporting, and online psychoeducation 
programmes will reduce the costs of care without reducing efficacy. 
However, these programmes are likely to serve as supplements to 
major psychosocial approaches, rather than as substitutes, until ran- 
domized trial data suggest that they are just as effective as the in vivo 
intensive approaches described here. 

Use of psychosocial interventions early in the illness course of BD 
raises the hope that we will one day be able to prevent conversions 
from subthreshold to threshold versions of this disorder in children. 
FFT has been examined in a preliminary way in pre-adolescents and 
adolescents at high risk for BD [77], but participants have not been 
followed long enough to determine the effects on conversion. 

Finally, the subpopulation of patients with BD who respond most 
consistently to different psychosocial treatments needs to be clari- 
fied. Treatments like CBT or group psychoeducation appear to be 
most effective in patients with few prior episodes [78, 83, 97], al- 
though not all studies have shown this (for example, [96]). Ideally, 
predictive algorithms will be developed to select the type and dur- 
ation of treatment that is most likely to bring about clinical improve- 
ment in patients defined by unique characteristics (personalized 
medicine). 
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The links between childbirth and mental disorders have been de- 
scribed for hundreds, if not thousands, of years [1], but post-partum 
episodes are not merely of historical interest. Perinatal mental illness 
remains a great health challenge in the twenty-first century, no better 
illustrated than by the findings of the UK Confidential Enquiries 
into Maternal Deaths, which have found suicide to be a leading 
cause of maternal death [2]. The impact of these conditions is wider 
than merely maternal health, however, with the economic impact of 
perinatal psychiatric disorders in the UK estimated to be approxi- 
mately £8.1 billion for each year’s birth cohort, with the majority of 
costs due to consequences on the child [3]. Despite their undoubted 
clinical importance, perinatal mental illnesses have not received the 
attention, in terms of clinical practice and research, they deserve. 





Terminology and c 


Terminology 





A range of terms are used in this area of psychiatry, including peri- 
natal, peripartum, antenatal, prenatal, preconception, pregnancy, 
puerperal, postnatal, and post-partum. In other branches of medi- 
cine, the term perinatal may be used differently, but in the context 
of ‘perinatal psychiatry; it is generally taken to refer to pregnancy 
and the post-partum—the period following childbirth. Although, 
as discussed in later sections, the common psychiatric classification 
systems define the post-partum as 4 or 6 weeks following delivery, 
in clinical practice, it is common to extend the post-partum period 
to at least 6 months after childbirth or even up to 1 year. Indeed, 
the UK National Institute for Health and Care Excellence (NICE)’s 
Antenatal and Postnatal Mental Health: Clinical Management and 
Service Guidance defined the post-partum period as up to 1 year fol- 
lowing childbirth [4]. In this chapter, we will therefore take the peri- 
natal period to be pregnancy and the first post-partum year. 


Classification 


Before a discussion of individual conditions, it is worth pausing to 
consider how perinatal episodes are treated by the commonly used 
classification systems. Unfortunately, the classification of episodes of 
psychiatric disorder in relationship to childbirth is an area that leads 
to much confusion, both clinically and in research. The Diagnostic 





and Statistical Manual of Mental Disorders (DSM) does not include 
separate categories for perinatal episodes. Rather, DSM-5 allows epi- 
sodes of some disorders occurring in pregnancy or within 4 weeks of 
delivery to be given a peripartum-onset specifier [5]. Turning to the 
International Classification of Diseases (ICD) [6], the way perinatal 
disorders are dealt with in ICD-10 is also problematic. There is a 
category for disorders associated with the puerperium (F53), which 
covers those with onset within 6 weeks of delivery and includes 
categories for mild and severe episodes under which the terms 
postnatal depression and puerperal psychosis are used. However, 
these categories are unique in that ICD-10 states they should only be 
used for episodes which ‘do not meet the criteria for disorders clas- 
sified elsewhere’ The vast majority of perinatal episodes that would 
meet criteria for categories such as depression, mania, or psychosis 
would therefore not be diagnosed as perinatal if the instructions of 
the classification systems were strictly adhered to. Relying on the 
ICD-10 F53 diagnostic categories therefore would grossly under- 
represent the true prevalence of perinatal mental health conditions. 

Thus, both DSM and ICD follow the generally accepted position 
that conditions we will consider in this chapter, such as post-partum 
psychosis and postnatal depression, are not separate nosological 
entities but merely represent episodes of psychiatric disorder trig- 
gered by childbirth. Despite the post-partum labels not having a 
place in the classification systems, they remain in common use by 
professionals, the public, and most importantly women who experi- 
ence episodes of illness at this time. One further problem with the 
nosology of perinatal mental health conditions is the ubiquitous 
use of the term ‘post-partum depressior to refer to all forms of psy- 
chological distress following pregnancy—from mild and transient 
mood changes to some of the most severe psychotic conditions seen 
in psychiatry. This inappropriate usage not only can trivialize severe 
episodes of illness with an underestimation of risk in future preg- 
nancies, but also supports the inappropriate labelling of a normal 
mood variation as a psychiatric disorder. 

Women can be affected by a range of mental health problems in 
pregnancy and the post-partum period, including, but not limited 
to: depression, anxiety, eating disorders, OCD, personality dis- 
orders, bipolar disorder, schizophrenia, and other psychotic dis- 
orders. In fact, all the conditions discussed in other chapters of this 
textbook may present in the perinatal period. In this chapter, we will 
discuss both severe perinatal mental illnesses and more common 
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mental health conditions. We will discuss disorders with first onset 
in the perinatal period and the course and management of women 
with pre-existing illness. We will first consider the presentation, epi- 
demiology, prognosis, and management of a range of conditions 
presenting in relationship to pregnancy and childbirth, before fin- 
ishing the chapter discussing what is known about the aetiology of 
these episodes. However, the most dangerous of the perinatal condi- 
tions are associated with a bipolar diagnosis, and for this reason, the 
chapter belongs in this section of the textbook. 


Severe perinatal mental illness 


Severe episodes of mental illness occurring in the perinatal period 
can be a recurrence of an existing condition, such as bipolar disorder 
or schizophrenia, or may be the first episode of psychiatric illness a 
woman has experienced. 

The perinatal period, as we will see, is a time of high risk for 
new onset of an episode of severe psychiatric illness. We will there- 
fore also deal with the diagnosis and treatment of severe episodes 
of affective psychosis with onset in the immediate post-partum 
period—episodes that have traditionally been referred to as puer- 
peral or post-partum psychosis. In addition, we will consider the 
management of women with pre-existing severe mental illness, such 
as bipolar disorder or schizophrenia, who become pregnant. We will 
examine the impact that pregnancy and childbirth may have on the 
condition, but also consider the implications that the mental illness 
may have on the pregnancy. 


Post-partum psychosis 


The most severe forms of post-partum mood disorder have trad- 
itionally been labelled post-partum (or puerperal) psychosis (PP) 
[7]. These terms are usually used to refer to new onset, although not 
necessarily the first episode, of severe affective psychosis in the im- 
mediate puerperium. Although there is no consensus about what is 
meant by immediate, it is of note that in 90% of cases, the onset is 
within 2 weeks of delivery [8]. While the boundaries of this condi- 
tion are not easy to define, the core concept is the acute post-partum 
onset of mania or affective psychosis. Accordingly, the continuation 
of a more chronic psychosis, such as schizophrenia, would not be 
appropriately labelled as post-partum psychosis. In the majority of 
cases, in fact, the symptomatology and prognosis of PP resemble 
those of an affective disorder, rather than those of schizophrenia. 
Symptoms are those of a severe mood disorder often accompanied 
by psychotic symptoms such as delusions and hallucinations. The 
word ‘psychosis’ in the context of PP can, however, be confusing as 
some women with severe manic episodes with early post-partum 
onset may receive the label but not experience actual psychotic 
symptoms [9]. Mixed episodes, in which manic and depressive 
symptoms occur simultaneously, are common, and the clinical 
picture often shows a constantly changing ‘kaleidoscopic’ picture. 
The latter point raises an important clinical lesson, in the assess- 
ment of women with significant psychiatric symptoms with onset in 
the immediate post-partum period—one evaluation is usually not 
enough, as the picture fluctuates over time and can escalate rapidly 
and severity is sometimes difficult to recognize [7, 10]. In addition, 


confusion and perplexity are common symptoms and some women 
may present in a way typical of the so-called ‘polymorphic or cycloid 
psychosis’ [11]. 

In summary, although, as discussed here, puerperal/post-partum 
psychosis is not a prominent diagnostic category in the major clas- 
sification systems, it remains an important concept. It is commonly 
used in clinical practice and research and, most importantly per- 
haps, by women who experience these episodes, as illustrated by 
‘Action on Postpartum Psychosis’ being the major third-sector or- 
ganization in the UK dedicated to helping women and their families 
with this condition [12]. Although the classification systems appear 
set on relegating the term ‘puerperal psychosis’ to the history books, 
there are advantages in the diagnosis of PP in both research and clin- 
ical practice [13]. 


Differential diagnosis 


In addition to other psychiatric disorders, several physical health 
conditions occurring in the perinatal period can be associated with, 
or even present with, psychosis and are therefore important in the 
differential diagnosis. These include: infections, pre-eclampsia and 
eclampsia, autoimmune, metabolic, encephalitis, and cerebral vas- 
cular diseases and syndromes associated with substance abuse and 
withdrawal [14]. The Confidential Enquiries into Maternal Deaths 
in the UK has reported a number of maternal deaths that were due 
to misattribution of psychotic symptoms to PP, rather than the true 
underlying condition, and therefore emphasize the need to consider 
physical conditions in the assessment of women who have sudden 
onset of a severe psychiatric episode in the perinatal period [2]. The 
Confidential Enquiries into Maternal Deaths have also found that a 
substantial proportion of women who committed suicide following 
PP were misdiagnosed as suffering from anxiety, moderate depres- 
sion, or adjustment disorder and did not receive adequate treatment 
[2]. Untreated severe post-partum disorders are associated with an 
increased risk of both suicide [15] and infanticide [16]. 


Epidemiology 


Although, when compared to other conditions occurring in the 
perinatal period, PP is a less common disorder [13], as a severe and 
potentially fatal condition, it is important to recognize and manage 
appropriately. Studies based on hospital admission in the post- 
partum period have estimated an incidence of 1-2 in 1000 deliveries 
[17]. This estimate, however, does not account for women with PP 
who are not admitted and will, of course, also include women ad- 
mitted for other reasons. 

The specific link between childbirth and the triggering of mania 
or an affective psychosis is well established, with the post-partum 
a period of very high risk [17-21]. For example, data from Danish 
population registries have established that the risk of a first lifetime 
manic episode in the first 4 weeks post-partum is 23 times higher 
than a comparison period 1 year after childbirth [17]. Further work 
from Denmark has established that even if admitted for another psy- 
chiatric disorder, such as major depression, a first admission within 
the first month after childbirth increases four times the likelihood of 
developing bipolar disorder within 15 years, compared to any first 
psychiatric admission outside the childbearing period [20]. 

In more than 50% of women experiencing PP, it is their first psy- 
chiatric episode and therefore difficult to predict [22]. For women 
with a psychiatric history, however, there are a number of factors 


which identify women who are at a particularly high risk of PP. 
Women with a history of bipolar disorder have a 1 in 5 chance of PP 
for each delivery [18], and a previous episode of PP confers an even 
higher risk in excess of 1 in 2 [23]. A family history of PP has also 
been identified as doubling the risk of PP in women with a personal 
history of bipolar disorder [24]. For this reason, establishing a family 
history of perinatal episodes is clearly important in the assessment 
of women in pregnancy with existing mental health conditions. In 
contrast, for women with no personal history that puts them at high 
risk, the importance of a positive family history is less clear. 


Management of post-partum psychosis 


PP is a psychiatric emergency, and admission is necessary in the ma- 
jority of cases [7, 25]. In around 50%, the episode will be the first 
contact with psychiatric services and therefore could not have been 
predicted, emphasizing the need for prompt assessment, diagnosis, 
and treatment [22]. In the UK, NICE guidelines recommend admis- 
sion to specialized mother and baby units (MBUs) [4], but there re- 
mains a postcode lottery in provision; MBU beds are commonly not 
available, necessitating admission for many women to general adult 
wards. Although further evidence for the efficacy of MBUs is clearly 
needed, there is preliminary evidence that MBUs are preferred by 
women [26] and also lead to better outcomes and shorter duration 
of admission [25]. 

The mainstay of the management of PP, in addition to admission, 
is pharmacological treatment, but there is a paucity of specific evi- 
dence to guide us, that is, no RCTs. However, response to medication 
is good. For example, a study from the Netherlands evaluated the ef- 
ficacy of an empirical treatment algorithm for PP, consisting sequen- 
tially of: (1) a GABA-modulating drug (benzodiazepine); (2) adding 
a dopamine antagonist/partial agonist; and (3) adding lithium. With 
adherence to this stepped regime, they observed complete remission 
in more than 98% of cases, with 80% of women maintaining remis- 
sion for 9 months [27]. Significantly higher relapse rates were seen in 
women maintained on a dopamine antagonist/partial agonist alone, 
compared to those treated with lithium [27]. 

Despite no RCTs for electroconvulsive therapy in PP, the evidence 
of its effectiveness is promising [28]. The potential side effects (an- 
terograde amnesia in 18%, prolonged seizures in 11%, according to 
a recent study [29]) need to be weighed against its efficacy in those 
resistant to medication and the potentially very rapid response in 
very severe, life-threatening episodes of illness. 


Prognosis 


In the modern era, the prognosis for promptly identified and 
treated PP is good, with over 95% of patients achieving remission 
within 1 year [27]. The median duration of illness is considerably 
shorter for women treated with pharmacotherapy (40 days) [27] 
than for a historical comparison with those treated prior to the era 
of pharmacotherapy (8 months) [30]. Having an episode of PP has 
a long-lasting impact beyond the acute episode. Recurrence rates 
are around 50% for further post-partum episodes and between 50% 
and 70% for bipolar recurrences not related to childbirth [31, 32]. In 
addition, there may be an impact on decisions about future pregnan- 
cies and on relationships. One retrospective study of 116 women, for 
example, found that only 58% of women went on to have a further 
pregnancy and that 18% of relationships ended after the severe post- 
partum episode [22]. 
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Women with existing severe mental illness 


As discussed in the previous section, although an episode of severe 
perinatal illness may be the first psychiatric presentation, in at least 
50% of cases, the episode will be in the context of a history of severe 
mental illness such as bipolar disorder or schizophrenia. 


Bipolar disorder 


A considerable body of evidence points to the high risk of post- 
partum episodes in women with bipolar disorder. In a large retro- 
spective study of 1212 parous women with bipolar disorder, over 
70% had experienced at least one episode of mood disorder in the 
perinatal period and over 1 in 4 had experienced an episode of PP 
[18]. However, although the relative risk of PP was greatly increased 
in these women with bipolar disorder, it was non-psychotic depres- 
sion that was the most commonly reported mood episode in the 
perinatal period [18]. Regarding the timing of episodes, although 
over 90% of manic or psychotic episodes occur within the first 4 
weeks after childbirth, the pattern of onset is different for non- 
psychotic depression, with 25% having an onset after the first post- 
partum month [18]. 


Management 


For women with a history of bipolar disorder, there are difficult de- 
cisions to be made around becoming pregnant and about manage- 
ment in pregnancy and the post-partum period. Potential pregnancy 
should be considered in all women with bipolar disorder in their 
reproductive years, and women should have access to preconcep- 
tion counselling if they are considering pregnancy [4]. In counsel- 
ling women about pregnancy, the known or potential risks of taking 
medication in pregnancy must always be weighed against the high 
risk of recurrence discussed previously [4, 7, 25, 33]. 

The evidence base to guide these decisions is admittedly sparse, 
and more research is clearly needed. However, there are data that 
can help women and clinicians. For example, it is likely that the re- 
currence risk in the perinatal period is higher for women who stop 
mood-stabilizing medication, with one study finding 2-fold higher 
risk of a recurrence in those who discontinued lithium due to preg- 
nancy [34]. Further guidance on the exact period of risk comes from 
a naturalistic, longitudinal study which found that women with a 
history of PP were at high risk in the post-partum period, but not 
one of the 29 women with a history had a recurrence during a sub- 
sequent pregnancy. In contrast, ten of 41 women with a history of 
bipolar disorder outside the puerperium experienced a recurrence 
during pregnancy, with the risk doubled in those not taking a drug 
to stabilize mood (19% vs 40%, respectively) [22]. The results of this 
study suggest therefore that while prophylactic medication should 
be considered in all women at high risk in the post-partum period, 
in pregnancy, it should be considered in women with a history of 
bipolar disorder not related to childbirth. 

Regarding the reproductive safety of specific medications, the evi- 
dence base is continuously developing and clinical guidelines, such 
as from NICE [4] or the British Association of Psychopharmacology 
[35], may not include the latest research. Advice in individual cases 
may be sought from perinatal specialist services or from websites, 
such as that provided by the UK Teratology Information Service 
(http://www.uktis.org). Although we are not able to provide a 
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detailed consideration of all medications here, there is considerable 
evidence that children exposed to valproate in utero are at high risk 
of developmental disorders and congenital malformations [36]. In 
contrast, lithium does not appear to be as problematic in pregnancy 
as initial studies indicated [37]. 

Finally, although specific evidence for psychosocial interventions 
for bipolar disorder in the perinatal period may be lacking, it is vital 
that these approaches, particularly psychoeducation, are considered 
[4, 25]. 

Readers are referred to Chapter 75 for a more detailed account of 
the treatment options for bipolar disorder. 


Schizophrenia 


Although a considerable body of evidence points to an aetiological 
overlap across the mood psychosis spectrum (see Chapter 59), the 
triggering of episodes by childbirth is an interesting area of dis- 
tinction. We do not see the same massively increased risk of a new 
episode of a schizophrenia-like psychosis in the immediate post- 
partum, as we see for episodes of bipolar disorder [17, 19]. Women 
with schizophrenia may require admission in the first post-partum 
year, however, often to deal with issues around becoming a new 
mother, even in the absence of a severe symptomatic recurrence. 

As for bipolar disorder, specific evidence on the management of 
schizophrenia in the perinatal period is lacking. A preliminary study 
found that collaboration between services and the women’s partner 
in the care of the baby leads to an improvement in symptomatology 
[38]. Motherhood can be very challenging for women with schizo- 
phrenia, and this diagnosis, together with that of personality disorders 
and substance abuse or dependence, has been shown to be associated 
with increased involvement of social services [25]. Tragically, when it 
does occur, the loss of child custody is a traumatic event that can pre- 
cipitate a crisis and has been associated with maternal suicide [2, 26]. 


Common mental disorders 


Having considered the presentation of severe perinatal mental 
illness, we turn now to consideration of other mental disorders that 
may occur in pregnancy and the post-partum. 


Depression 


Depression, the leading cause of disability worldwide, is the most 
common medical complication of maternity [39]. 


Clinical presentation 


The weight of evidence suggests that there is no difference in clin- 
ical presentation between post-partum depression and depression 
occurring at other times [40, 41]. There are, however, issues with 
some somatic depressive symptoms (that is, fatigue, loss of libido, 
appetite, and weight, and sleep changes), as these are commonly ex- 
perienced by all women following childbirth, so they need to be as- 
sessed with care [42]. Thoughts of self-harm are common in perinatal 
depression; an American national survey found 30% of women who 
screened positive for post-partum depression endorsed thoughts of 
self-harm [43]. Differences between the presentation of depression 
in pregnancy and following childbirth have been described, with 
intrusive violent thoughts and psychosis more common in women 
with post-partum onset [44]. 


Differential diagnosis 


Episodes of major post-partum depression need to be distinguished 
from minor mood change, the so-called baby or maternity blues, 
and from episodes of PP as discussed previously [45]. The blues are 
characterized by emotional lability in the first post-partum week. 
This occurs commonly and does not require treatment. Assessment 
of minor mood change in the post-partum period should ensure 
that the woman is not, and does not, become more severely de- 
pressed. Physical conditions associated with low mood in the peri- 
natal period, such as thyroid dysfunction and anaemia, should also 
be assessed. 

It is important to be aware that post-partum depression can be a 
manifestation of a bipolar disorder; for example, over 50% of cases of 
bipolar post-partum depression in one study were initially misdiag- 
nosed as unipolar disorder by clinicians [46], and in another study, 
over 20% of women who screened positive for post-partum depres- 
sion were found to have bipolar disorder [43]. Indeed, women with 
episodes of unipolar depression with post-partum onset have been 
found to be more likely to develop a future bipolar disorder than 
women with depression occurring at other times [20]. Episodes of 
psychotic depression in the immediate post-partum period should 
raise the possibility of an underlying bipolar diathesis, even in the 
absence of manic symptoms [47]. Recognition of the bipolar na- 
ture of depression is important for management. This is illustrated 
by a small study of 34 women with bipolar post-partum depression, 
initially misdiagnosed as unipolar, that showed the discontinu- 
ation of antidepressants (that is, drugs licensed for the treatment 
of unipolar depression) and the introduction of a mood stabilizer 
(drugs licensed for bipolar disorder) improved symptoms in 88% 
of cases [46]. 


Epidemiology 


The point prevalence of major depression in the perinatal period 
varies widely across studies, ranging from 0% to over 30% [42, 48- 
50], and it is likely that methodological differences, particularly in 
the assessment of depression, account for this variability. A sys- 
tematic review estimated the point prevalence of major depressive 
disorder to be between 3.1% and 4.9% during pregnancy and 4.7% 
in the first 3 months post-partum [49]. If minor depression is in- 
cluded, these rates are higher—11% in pregnancy and 13% in the 
post-partum [49]. There has been great debate over the question of 
whether depression is more common in the perinatal period and 
the data are not consistent. It does seem clear, however, that preg- 
nancy is not protective against depression, with both a systematic 
review [49] and a large epidemiological study from the United States 
finding no significant difference in prevalence or incidence of de- 
pression [51]. However, there may be differences for more severe 
episodes with a Danish registry-based study finding the relative risk 
of first admission to hospital for major depression is halved during 
pregnancy, but more than doubled in the first 2 months after child- 
birth, compared to 1 year post-partum [17]. 

The risk factors for perinatal depression show considerable 
overlap with depression occurring at other times. Established risk 
factors for depression occurring in the post-partum include a his- 
tory of mood or anxiety disorders, stressful life events, poor social 
support, and domestic violence [42]. The factors associated with de- 
pression in pregnancy are similar (poor social support, adverse life 


events, domestic violence, a history of mental illness) but include 
some additional factors related to the pregnancy (unplanned or un- 
wanted pregnancy, present/past pregnancy complications) [52]. 


Prognosis 


Although women can be reassured that the prognosis of post-partum 
depression is good, with episodes lasting 3-6 months on average, 
30% of women remain depressed beyond the first post-partum year 
[42]. It should also be discussed with women that they are at an in- 
creased risk of further episodes of depression, with recurrence rates 
of around 40% for both perinatal and non-perinatal episodes [53]. 


Screening 


It is not in doubt that many episodes of perinatal depression go un- 
detected; some estimates suggest this may be up to 50% of cases [3]. 
However, there is considerable debate over the benefit of screening 
all women for depression in the perinatal period. The high preva- 
lence and the potentially serious negative consequences of peri- 
natal depression have led many to advocate for universal screening, 
most recently, for example, by the U.S. Preventative Services Task 
Force [54]. However, others have pointed to potential negative 
consequences of universal screening, including misdiagnosis and 
increasing stigma, and results from economic modelling have sug- 
gested that screening may not meet thresholds to be considered 
cost-effective [55]. Certainly screening tools must not be viewed as 
comprehensive and clinical guidelines emphasize that the clinical 
decision about the presence of depression should always be made on 
the basis of a further comprehensive assessment [4, 56]. A number 
of self-report measures of depression have been advocated to screen 
for perinatal depression; the Edinburgh Postnatal Depression scale 
(EPDS), which has been widely translated, is perhaps most com- 
monly used around the globe. Despite the widespread use of the 
EPDS, NICE [4] recommends that health care professionals in con- 
tact with women in the perinatal period use the Whooley questions: 


e During the past month, have you often been bothered by feeling 
down, depressed, or hopeless? 

e During the past month, have you often been bothered by having 
little interest or pleasure in doing things? 


Although formal screening programmes for maternal depression 
remain controversial, what is not in doubt is that all health profes- 
sionals should be aware of a woman's mental health in the perinatal 
period, in addition to her physical well-being. 


Treatment 


Perinatal depression responds to the same treatments as depression at 
other times. Given issues around the use of medication in pregnancy 
and in breastfeeding women (discussed in the following sections), 
psychological and psychosocial approaches are perhaps of particular 
relevance. A range of psychosocial interventions have been shown to 
be effective in a Cochrane meta-analysis, including peer support and 
non-directive counselling, cognitive behavioural therapy, psycho- 
dynamic psychotherapy, and interpersonal therapy [57]. In addition, 
for some women, further interventions directed at the mother-baby 
relationship may be appropriate if this remains impaired and has not 
responded to treatment of maternal depression [4, 56]. 

For many women, particularly for those with moderate to severe 
episodes of depression, medication will be required. It is clear that 
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the use of drugs for depression in pregnancy has increased consid- 
erably in recent decades, with rates varying widely from around 3% 
in Denmark [58] to over 13% in the United States [59]. Pregnancy 
and breastfeeding are often exclusion criteria in drug trials, and as 
a result, there is little specific evidence for the pharmacologic man- 
agement of post-partum or pregnancy depression [60]. Decisions 
must therefore be extrapolated from evidence for the efficacy of 
approaches accumulated in the non-perinatal context. Medication 
trials in women with severe post-partum depression are possible, 
however, and there may be scope for innovation; a 60-hour hor- 
monal infusion appeared very promising in a small early-stage trial, 
for example, but obviously requires replication [61, 62]. 

For women already on drugs for depression, the data are in- 
consistent regarding the impact of medication being continued or 
stopped. A longitudinal study of 201 euthymic pregnant women 
found that 68% of those who discontinued treatment had a relapse 
during pregnancy, compared with 26% of those who continued 
medication [63]. In contrast, another study of 778 pregnant women 
with a history of major depression found that staying on, or stopping, 
drug treatment had little influence on psychiatric outcome [64]. 
Differences in the severity of depressive histories probably account 
for these conflicting findings. We would emphasize the need to take 
account of the specific history of each woman, the severity of her 
illness, and the evidence for her response to individual medications, 
when making these difficult decisions. 

Other interventions for perinatal depression have a limited evi- 
dence base and include the use of oestrogens and progestins, N-3 
fatty acids, exercise, and integrated yoga [65-67]. 

Just as for pregnancy, decisions regarding medication in breast- 
feeding women must be made following a full risk-benefit analysis 
that considers the options available. As new data on medication 
safety in pregnancy and during lactation are emerging regularly, 
up-to-date advice from specialist services should be sought in 
individual cases. 


Anxiety disorders 


Despite often being overlooked [42] and mislabelled with the per- 
vasive term ‘post-partum depression, perinatal anxiety disorders 
are common [68, 69]. Prevalence rates for anxiety disorders range 
widely from 6% to 39% in pregnancy [70] and from 16% to 50% in 
the post-partum period [71]. In addition, both pregnant and post- 
partum women are at increased risk of developing OCD (risk ratios 
1.45 and 2.38, respectively) [72], with 2.5% of women experiencing 
a clinical episode of OCD in the perinatal period [72]. In making 
a clinical assessment of women with OCD symptoms, it should be 
remembered that obsessional thinking is common in the perinatal 
period, but that some women are much more significantly affected 
with obsessional thoughts and compulsive behaviours that can often 
involve the child, and obsessions need to be distinguished from de- 
lusions that may pose a risk to the baby. 

Perhaps to an even greater extent than for perinatal depression, 
there is a lack of data to guide the treatment of perinatal anxiety dis- 
orders; RCTs are lacking, and even naturalistic studies are rare [42]. 
Again, management decisions must be made on the basis of evidence 
obtained in studies of anxiety disorders not related to pregnancy 
and the post-partum. One group of approaches that have received 
interest in the perinatal period are the so-called ‘mind-body’ inter- 
ventions that encompass a large group of therapies such as hypnosis, 
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meditation, yoga, biofeedback, t’ai chi, and visual imagery. Although 
increasingly popular, there is a lack of rigorous evidence to support 
their effectiveness for the management of perinatal anxiety [73]. 


Post-traumatic stress disorder 


For many years, childbirth was not recognized as a potential trigger 
for PTSD, and although this has changed, perinatal PTSD still has 
not perhaps received the attention it deserves. Studies have examined 
symptoms occurring after a range of pregnancy outcomes, including 
miscarriage, termination, stillbirth [74], and traumatic birth [75]. 
The prevalence of perinatal PTSD is estimated to be between 1% and 
8% [42, 76], with a higher prevalence in low-income countries and 
high rates of comorbidity with major depression, anxiety, and sub- 
stance misuse [42, 77]. Particularly high rates of PTSD have been de- 
scribed in low-income pregnant women exposed to intimate partner 
violence, with rates as high as 40% according to one study [76]. There 
is not a large evidence base to guide management, but consistent with 
data for PTSD more generally, a Cochrane systematic review found 
little or no evidence to support routine psychological debriefing for 
women who have experienced a traumatic birth [78]. 


Eating disorders 


Anorexia nervosa impacts a woman’ fertility [79], but eating dis- 
orders more generally do affect considerable numbers in the peri- 
natal period. One study has reported the prevalence of eating 
disorders during pregnancy to be 7.5% (compared to 9.2% pre- 
pregnancy) [80]. For some women with a history of eating disorders, 
however, pregnancy is associated with symptomatic improvement. 
A study from Norway, for example, reported remission rates of 
between 29% and 78%, depending on the specific condition [81]. 
Eating disorder symptoms have also been reported to increase the 
risk of post-partum depression [82]. There is no trial evidence for 
the prevention of recurrence in women at risk. Management should 
be aimed at maintaining regular eating patterns, supporting realistic 
goals, and optimizing nutrition for the mother and the fetus [42]. 


Personality disorders 


Despite the potential for personality disorders to cause significant 
difficulties in pregnancy and the post-partum period, there has been 
little research conducted to date on their impact and management 
in the perinatal period. The prevalence of personality disorders in 
pregnancy, as assessed by self-report, was found to be around 6% in 
one Scandinavian study [83]. An important point is that personality 
disorders can occur alongside many of the other disorders discussed 
previously and are associated with poor prognosis [42, 83]. 


Substance misuse 


Substance misuse in pregnancy is common, with a study in the 
United States estimating 9.0% of pregnant women aged 18-25 and 
3.4% of those aged 26-44 using illicit drugs or misusing prescription 
drugs. However, these rates were roughly half of those observed in 
non-pregnant women in the same age group [84]. Substance misuse 
is often comorbid with other psychiatric conditions and impact 
negatively on prognosis. In particular, the Confidential Enquiries 
into Maternal Deaths described a number of cases where sub- 
stance abuse is a significant feature in those women who died [2]. 
Women with substance use disorders require intensive and multi- 
disciplinary care in pregnancy. A harm reduction approach, aimed 


at periods of abstinence while recognizing the likelihood of relapse, 
should be adopted [84]. 





Aetiology: understanding the puerperal trigger 


Given the post-partum is a period of high risk for episodes of psychi- 
atric disorder, particularly for severe illness, it has long been seen as 
offering potential clues to the mechanisms underlying such illness. 
PP affects women primarily with a bipolar disorder diathesis, acted 
on by a specific puerperal trigger [25]. Understanding the nature 
of this trigger will be of great benefit, allow for the development 
of novel treatments, and enable the prevention of illness in those 
women at high risk. There are several hypotheses of puerperal trig- 
gering that have been explored. 


Changes in medication 


Perhaps the simplest explanation could be that the high-risk post- 
partum is a result of women stopping medication because of con- 
cerns over the safety of medication in pregnancy. The illness might be 
simply a result of acute withdrawal of medication. Evidence against 
this hypothesis comes from a study that compared the course of 42 
women with previous bipolar episodes who stopped lithium due 
to pregnancy, compared to 59 age-matched non-pregnant lithium 
discontinuers [33]. Rates of recurrence were very similar for both 
groups up to 40 weeks, implying that drug withdrawal per se had 
no effect. However, following delivery, a large and highly significant 
difference in recurrence rates was observed (70% vs 24% recurrence, 
respectively). Thus, withdrawal of medication had an important im- 
pact on the vulnerability to the puerperal trigger but exerted no dis- 
cernible effect on the recurrence risk in itself. 


Psychosocial factors 


Becoming a mother is a complex and often difficult psycho- 
social transition, but psychosocial factors have not been shown to 
play a major role in vulnerability to psychosis in the puerperium. 
A number of studies have found no association between stressful life 
events and PP in women at high risk [85, 86]. 


Genetic factors 


Studies have suggested both that episodes of PP are a marker for a 
more familial form of bipolar disorder [87] and that a specific vul- 
nerability to the puerperal triggering of bipolar illness is familial 
[24]. The relationship between genetic factors influencing puerperal 
triggering and those for the underlying bipolar disorder is still to 
be established, however, and is likely only to be clear when the gen- 
etic variation underpinning vulnerability to post-partum episodes 
are identified. Molecular genetic studies of PP have been conducted 
[88, 89], but in the age of GWAS, no genome-wide associations have 
been reported to date. Sample sizes available for genetic studies will 
have to be far bigger than those available at present to achieve the 
power needed to identify genes of likely small to modest effect (see 
Chapters 59 and 70). 


Hormones 


The abrupt onset of illness during a time of major physiological 
change suggests hormonal factors may play an important role in 
aetiology. It is increasingly recognized that ovarian sex steroids 


have important functions in the central nervous system. Oestrogen 
and progesterone receptors are widespread in the brain where they 
modulate neurotransmission and neuroplasticity via both gen- 
omic and non-genomic mechanisms and regulate not only ma- 
ternal behaviour, but also emotion processing, arousal, cognition, 
and motivation [90-93]. Gonadal steroids have been implicated 
in a number of neuropsychiatric illnesses, including migraine and 
neurodegenerative and premenstrual dysphoric disorders [93, 94]. 

The evidence pointing to reproductive hormones in the aeti- 
ology of post-partum episodes is predominantly circumstantial. The 
study of Bloch and colleagues [95], however, provides more direct 
evidence for the involvement of oestrogen and progesterone in the 
post-partum triggering of mood episodes. This study that simulated 
the supraphysiologic gonadal steroid levels of pregnancy and post- 
partum withdrawal found five of eight women with a history of post- 
partum depression and none of eight of the women with no history 
of post-partum depression developed significant mood symptoms 
during the withdrawal period. It is possible therefore that women 
vulnerable to post-partum episodes do not show gross abnormal- 
ities in endocrine physiology, but rather an abnormal response to 
the normal hormonal fluctuations of pregnancy and childbirth. 

The role of gonadal steroids in the pathogenesis of perinatal mood 
disorders is still to be established, and its clinical implications are 
unclear. At present, however, there is no support for the routine use 
of hormonal treatments in the prevention or management of peri- 
natal mood disorders [65]. 


Sleep 


It is well established that sleep and circadian rhythm disruption can 
trigger the onset of manic episodes very rapidly [96]. Sleep loss is 
also, of course, very common in pregnancy and the post-partum. 
It is surprising therefore that this potential trigger has not received 
more attention, with only a few studies addressing this very plausible 
hypothesis [97]. 


Other potential factors 


Several obstetric-related factors have been explored, but the only ro- 
bust risk factor identified is primiparity [98, 99], which has held up 
even in analyses that take account of women with severe post-partum 
episodes being less likely to go on to have further pregnancies. The ef- 
fect of primiparity has been hypothesized to be due to the biological 
differences between first and subsequent pregnancies and has raised 
the possibility of an aetiological link with other medical conditions 
showing a similar increase in first pregnancies such as pre-eclampsia 
[99]. Intriguingly, pre-eclampsia and PP have both been associated 
with immune dysregulation, for example the observation of a marked 
increase in the rates of post-partum autoimmune thyroiditis and im- 
mune biomarker alterations in women with PP [100]. In addition, 
pre-eclampsia and PP share other overlaps; for example, both have 
been found to be inversely correlated with tobacco smoking [101]. 
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The diagnosis of major depressive disorder (MDD) relies on the pres- 
ence of a certain number of signs and symptoms, including feelings 
of guilt, hopelessness, dysphoria, cognitive dysfunction, persistent 
sleep, and appetite abnormalities. These signs and symptoms overlap 
with other conditions such as anxiety, bipolar, and seasonal affective 
disorders. In this chapter, we provide an overview of the basic neuro- 
biological mechanisms underlying MDD and its treatment. There 
are several alterations in the molecular pathways and neuronal net- 
works associated with MDD. We focus here on: gene x environ- 
ment interactions, dysfunctional brain circuitries, neurotransmitter 
alterations, maladaptation in neurotrophins and neuroplasticity, 
hypothalamus-pituitary-adrenal (HPA) axis dysfunction, ab- 
normal immune system responses, circadian arrhythmicity, and 
sleep disturbances (Fig. 74.1). We briefly describe the mechanisms 
of actions for approved antidepressant therapies and also discuss re- 
cent insights into the pathophysiology of MDD and future possible 
therapy targets. 











There is increasing evidence that MDD is best understood in a 
lifelong perspective as changes in neuronal circuit function through 
stressful life events acting in genetically predisposed individuals. 
The genetic liability of MDD is 30-40% [1, 2]. It is likely that sev- 
eral genes, and their protein products, are involved both in the 
pathophysiology of the disease and in mediating antidepressant 
responses. Several genetic polymorphisms have been associated 
with depression, but in most instances, these studies have not been 
replicated. According to a large-scale GWAS, 15 loci and 17 inde- 
pendent SNPs were related to depression [3]. GWAS of both MDD 
and bipolar depression share 11p11.2 and 12p13.33 polymorphisms, 
including an SNP in CACNAIC encoding the alC subunit of the 
voltage-gated L-type calcium channel Ca,1.2 [4, 5]. A recent paper, 





which identified genetic determinants associated with personality 
traits, describes a genetic dimension which relates introversion and 
neuroticism to MDD [6]. 

A polymorphism in the 5-HT transporter results in long or short 
alleles, the short allele being less functional [7]. Influential G x E 
studies in humans [8] have reported a positive relationship be- 
tween the number of self-reported early life stressors (for example, 
childhood maltreatment) and increased depression risk among in- 
dividuals who had one or two copies of the short allele, compared 
with those homozygous for the long allele [8]. Similar results were 
observed in experimentally stressed non-human primates [9]. 
However, it should be noted that there have been difficulties in 
replicating these findings [10]. 

Epigenetics is an area of research intimately related to G x E inter- 
actions. It is defined as heritable cell characteristics (phenotype), 
which are not determined in the DNA sequence (genotype), but ra- 
ther are determined by G x E interactions [11]. Environmental fac- 
tors may lead to DNA methylation in cytosine bases and methylation, 
phosphorylation, or acetylation of histones, resulting in changes of 
gene expression via activation or inactivation of specific gene pro- 
moters [11]. In studies of MDD, most of the epigenetic changes 
have been experimentally induced by stress, especially early in life. 
Stressors, among other things, decrease the gene expression of both 
brain-derived neurotrophic factor (BDNF) and the glucocorticoid 
receptor [12] (Fig. 74.1). Epigenetic states might be reversible, 
even in adulthood. This has important implications for potential 
pharmacological treatment; for example, drugs that inhibit histone 
deacetylation (HDAC), such as valproic acid, could be used to reverse 
epigenetic changes [13, 14]. No human studies have been performed, 
but preclinical research has demonstrated that local administration 
of HDAC inhibitors in the nucleus accumbens can reverse depressive 
behaviour in rodents [13]. In addition, DNA methylation changes 
(for example, by acetyl-L-carnitine) can promote antidepressant-like 
behaviour in rats [15]. It is important to note, however, that drugs 
regulating epigenetic mechanisms have pleiotropic actions in dif- 
ferent cell types, thus increasing the risk for side effects [12]. 
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Fig. 74.1 Neurobiological mechanisms underlying major depressive disorder. Stress, particularly in early life, causes epigenetic changes which interact 


with genetic vulnerability, resulting in several pathophysiological processes. S 


uch G x E interactions affect circadian rhythms, immune activation, 


cellular arborization, HPA axis activity, and signalling mechanisms. Alterations in these neurobiological mechanisms contribute to the development of 


depression. 


Dysfunctional brain circuitries 





Neuroimaging approaches, including volumetric magnetic reson- 
ance imaging (MRI), diffusion tensor imaging (DTI), functional 
MRI, and "F-fluorodeoxyglucose positron emission tomography 
(FDG-PET), provide a versatile platform to characterize struc- 
tural and functional connectomes. Imaging studies have improved 
neurocircuitry models of mood, emotionality, and behavioural 
regulation [16, 17]. 

Studies have reported morphological changes in MDD patients, 
in both neurons and glia in the hippocampus, amygdala, and pre- 
frontal cortex (PFC) [16, 18]. The decrease in hippocampal volume 
is directly proportional to the number and duration of depres- 
sive episodes, especially in early-onset MDD [19]. The decrease of 
hippocampal volume seems to be related to prolonged increased 
levels of cortisol, which may relate to the fact that the hippocampus is 
the brain region with the highest levels of receptors for glucocortic- 
oids [16, 20]. The hippocampus is critical for learning and memory 
processes. In depressive patients, memory formation is skewed 
towards negative events; this is known as negative bias. Negative 
bias is characterized by an enhanced focus on negative stimulus, 


enhanced attention towards potentially threatening stimuli, and the 
attribution of negative emotional value to environmental stimuli 
that are considered to have neutral valance by healthy individuals 
[21]. Negative bias-related memories involve the amygdala, hippo- 
campus, anterior cingulate cortex, PFC, and caudate-putamen [21]. 
In addition, the lateral habenula (LHb) may be a key subcortical 
structure in the generation of negative cognitive biases. LHb plays 
a crucial role in encoding aversive states [21, 22]. Deep brain stimu- 
lation (DBS) of the LHb can reverse depressive-like behaviours in 
rodents [23], reinforcing a role of the LHb in depression. DBS has 
also been examined in MDD patients. Based on the observation 
that some brain regions (including the subgenual cingulate cortex 
or Brodmann area 25) are metabolically overactive in treatment- 
resistant depression, DBS was applied to Brodmann area 25. This 
resulted in a striking and sustained remission of depression in some 
previously treatment-resistant patients in an open study [24, 25]. 
However, there is no published randomized controlled trial (RCT) 
to confirm this antidepressant effect. Another observational study 
showed an antidepressant effect of DBS in the ventral striatum [26], 
but a subsequent RCT failed to demonstrate an antidepressant effect 
of DBS in this region [27]. More advanced strategies for directly and 
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focally altering neural activity, such as gene therapy, chemogenetics, 
and optogenetics, are successfully being developed in animals. This 
line of research has revealed that control of projection-specific dy- 
namics is well suited for modulation of behavioural patterns that 
are relevant to a broad range of psychiatric diseases, including de- 
pression. For example, optogenetic targeting of projections from the 
medial PFC to the dorsal raphe nucleus will control mobility in the 
forced swim test, while targeting projections from the basolateral 
amygdala to the nucleus accumbens produces an appetitive or aver- 
sive response, depending on the type of stimulus [28]. 


Neurotransmitter alterations and therapy targets 





A critical role of monoamines in depression was initially supported 
by two independent serendipitous findings in drug discovery. 
Iproniazid was developed against tuberculosis but was found to ex- 
hibit antidepressant properties via monoamine oxidase (MAO) in- 
hibition [29]. Several irreversible and non-selective MAO inhibitors 
were then developed, but they all have cardiovascular side effects due 
to the accumulation of tyramine. Accumulation of this trace amine 
depends upon MAO-B inhibition, and thus more recent antidepres- 
sant MAO inhibitors, such as moclobemide, target MAO-A and are 
reversible. Imipramine was found during efforts to develop an anti- 
psychotic agent. Surprisingly, imipramine showed antidepressant 
properties and was approved as the first tricyclic drug for depression 
[30]. Pioneering work showed that imipramine could counteract 
noradrenaline uptake into presynaptic neurons [31]. It was later 
found that desipramine, a metabolite of imipramine, is a selective 
noradrenaline reuptake inhibitor (NRI). This finding, together with 
data showing lowered activity of noradrenergic neurons in depres- 
sion, led to the catecholamine hypothesis of depression [32]. It was 
subsequently found that several tricyclic drugs for depression also 
inhibit serotonin (5-HT) reuptake, and a serotonin hypothesis of 
depression was conceived [33]. Currently, agents that selectively 
inhibit the reuptake of either noradrenaline (that is, reboxetine) or 
serotonin (SSRIs; for example, fluoxetine, paroxetine, sertraline, and 
citalopram), or both (that is, venlafaxine and duloxetine) are com- 
monly used as drugs for depression and anxiety [34] (Fig. 74.2). 
Several serotonin and noradrenaline receptors are implicated 
in antidepressant actions and side effects of reuptake inhibitors 
[34]. Many receptors are now targeted by more recently developed 
drugs for depression and anxiety. The 5-HT1A somatodendritic 
autoreceptor, which negatively controls 5-HT release, is an 
interesting target, since its blockade augments extracellular sero- 
tonin levels [34]. Accordingly, vilazodone is a combined 5-HT1A 
partial agonist and an SSRI, and is used for the treatment of de- 
pression [34]. Another compound that both inhibits serotonin re- 
uptake and targets 5-HT receptors is vortioxetine, which binds to 
5-HT1A (agonist), 5-HT1B (partial agonist), 5-HT3 (antagonist), 
and 5-HT7 (antagonist) receptors. In addition to these serotonin re- 
ceptors, antagonism at 5-HT2C receptors plays an important role in 
the antidepressant actions of mirtazapine and agomelatine [34, 35]. 
Mirtazapine is also a noradrenergic a2A autoreceptor antagonist and 
enhances fronto-cortical serotonergic and noradrenergic transmis- 
sion to elevate mood. Agomelatine combines 5-HT2C receptor an- 
tagonism with melatonin receptor agonism and has antidepressant 


properties, along with beneficial effects on sleep, due to the restor- 
ation of circadian rhythmicity. 

Antidepressant responses can also be obtained by enhancing 
dopaminergic transmission. Dopamine enhances motivation, re- 
ward response, and plays a crucial role in the reinforcement systems, 
often dysfunctional in MDD [36]. Bupropion, a dopamine and nor- 
adrenaline reuptake inhibitor, is an antidepressant drug with ener- 
gizing and mood-elevating properties [34]. Several studies have also 
reported that D2 agonists may present antidepressant actions, some 
of which (for example, pramipexole) are used in the treatment of 
Parkinson's disease. A large proportion of patients with Parkinson's 
disease suffer from depression and, interestingly, depressive symp- 
toms often precede the onset of motor symptoms and diagnosis, sug- 
gesting a largely neurochemical cause [37]. 

An approach to study the role of monoamine transmission in 
depression and antidepressant effect is monoamine depletion. 
Serotonin levels can be reduced by para-chlorophenylalanine 
(PCPA), an inhibitor of tryptophan hydroxylase, the rate-limiting 
step in 5-HT synthesis, or by acute depletion of tryptophan (an 
essential amino acid that is the precursor of 5-HT). On the other 
hand, inhibition of tyrosine hydroxylase by a-methyl-para-tyrosine 
decreases noradrenaline and dopamine synthesis. These depletion 
procedures induce a depressive relapse in remitted MDD patients 
taking drugs for depression, and there is a correlation between the 
target of the procedure (5-HT or noradrenaline/dopamine) and the 
site of action of the drug [SSRI or NRI/dopamine reuptake inhibitor 
(DARI)] [38]. Moreover, while depletion of monoamines lowered 
mood in normal volunteers with a family history of MDD, it did not 
affect normal volunteers without a family history of MDD. These 
results can be interpreted as an indication that monoamines may be 
related to a biological vulnerability to MDD, rather than to mood 
state [38, 39]. 

There is a delay in therapeutic onset after the start of antidepres- 
sant treatment, indicating that the antidepressant action is not 
strictly related to an increase in monoamine levels, but rather to 
molecular and cellular processes initiated by them [40]. Several 
molecules, including p11 (also termed $100a10), BDNF, glial cell- 
derived neurotrophic factor (GDNF), vascular endothelial growth 
factor (VEGF), insulin-like growth factor-1 (IGF-1), fibroblast 
growth factor-2 (FGF-2), and glutamate receptors, are induced 
by monoaminergic antidepressants and mediate beneficial effects 
on neuronal plasticity, such as increased spine density and neuro- 
genesis, correlating with antidepressant responses [16, 40-42] 
(Fig. 74.2). 

In contrast to monoaminergic drugs, some compounds acting 
via glutamate receptors possess rapid antidepressant effects. Studies 
using magnetic resonance spectroscopy (MRS) to measure the con- 
centration of intra- and extracellular glutamate, glutamine, and 
GABA have shown changes in the glutamate system (for example, 
reduced levels of glutamine, the precursor of glutamate) in MDD 
[43-45]. It has been reported that glutamate-reducing agents, such 
as riluzole and lamotrigine, exert antidepressant actions by lowering 
extracellular levels of glutamate [46-48] (Fig. 74.2). Moreover, it has 
been established that blockade of NMDA receptors by a single in- 
jection of the non-competitive antagonist ketamine causes a rapid 
(within hours) and long (weeks) antidepressant effect [45, 49, 50]. 
However, the psychotomimetic, anaesthetic, amnestic, and addictive 
properties of ketamine preclude usage on a larger scale, and intense 
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Fig. 74.2 Cellular mediators altered in depression and by antidepressant agents. Schematic drawing showing cross-talk between neurons, microglia, 
and astrocytes. Selected neurotransmitters, immune mediators, and glucocorticoid hormones involved in depression are shown. Targets for 
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research is focused on finding alternative ways of interfering with 
glutamate neurotransmission to achieve fast antidepressant actions 
without severe side effects. Antagonism of the NMDA/NR2B sub- 
unit presents antidepressant-like effects in animal models, but has 
so far failed to show antidepressant actions in clinical trials [51]. 
Another approach has been to develop antidepressant agents acting 
at the glycine modulatory site of the NMDA receptor such as the 
partial agonist GLYX-13 (rapastinel) [45]. Moreover, a metabolite 
of ketamine—(2R,6R)-hydroxynorketamine—has shown promising 
antidepressant-like effects in animal models [52]. The mechanism of 
action of (2R,6R)-hydroxynorketamine remains to be fully under- 
stood, but reported data indicate that it is not an NMDA receptor 
antagonist, but rather an AMPA receptor potentiator [52]. In this 
context, it is interesting to note that tianeptine, an atypical drug for 
depression, enhances 5-HT reuptake, potentiates AMPA receptor 
function, and promotes synaptic plasticity [53]. Moreover, it has also 
recently been demonstrated that tianeptine acts as a u opioid receptor 


agonist. In addition to targeting ionotropic glutamate receptors, an- 
tagonism at metabotropic glutamatergic 2, 3, or 5 receptors have 
shown antidepressant actions in rodent models [45] (Fig. 74.2). 

There is also considerable evidence that depressed patients have 
reduced GABA levels in the brain and that the subunit composition 
of GABA(A) receptors are changed following stress and in MDD, 
particularly in interneurons [54]. 


Maladaptation in neurotrophins 





and neuroplasticity 


BDNF plays a central role in the neurotrophic theory of depres- 
sion [20, 55]. BDNF is highly expressed and inducible in the hippo- 
campus and PFC where it critically regulates neurotransmitter 
release, neurogenesis, neuronal survival, and synapse plasticity 
[56]. In accordance, peripheral blood markers from both unipolar 
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and bipolar depression patients and depression-like animals have 
decreased levels of BDNF [20, 56, 57]. In contrast, treatment with 
different antidepressant drugs, for example SSRIs and ketamine, in- 
creases the levels of BDNF in the PFC and hippocampus [56, 57]. 
In the nucleus accumbens, low levels of BDNF seem to have anti- 
depressant effects, being therefore the opposite from other areas, 
which complicates therapy development [16]. In addition to BDNF, 
other neurotrophins have been implicated in depression. Several 
studies have shown changes in GDNF and VEGF levels in depres- 
sion [58]. FGF-2 level is decreased in depression, while treatment 
with drugs for depression increases FGF-2 levels [59]. Likewise, 
FGF-2 exerted an antidepressant-like effect in the chronic mild 
stress model, and furthermore, FGF-2 antagonists blocked the 
antidepressant effect of imipramine and fluoxetine [60]. Besides 
pharmacological treatments, cognitive behavioural therapy (CBT), 
electroconvulsive therapy (ECT), and exercise seem to be effective 
in the treatment of depression by stimulating neuroplasticity. CBT is 
effective both in the short and long term [61, 62], although its mech- 
anisms of action are not yet well understood. Regarding ECT, the 
literature reports different procedures on how to perform it (uni- or 
bilateral and brief or ultrabrief pulse), and all are effective [63, 64]. 
The antidepressant action of ECT can be related to 5-HT, noradren- 
aline, and dopamine neurotransmission potentiation, hippocampal 
neuroplasticity, glutamate neurotransmission, and rapid eye move- 
ment (REM) sleep suppression [65, 66]. These multiple-target ac- 
tions have been associated with the greater and faster effect of ECT, 
compared to monoaminergic drugs [63, 64]. Some data suggest that 
exercise can exert an antidepressant effect and may improve anti- 
depressant drug effects [67, 68]. Exercise has been reported to af- 
fect growth hormone, inflammatory cytokines, neurotrophins, and 
neurogenesis [67]. 


HPA axis dysfunction 


Exposure to prolonged stress can lead to persistently changed 
responsivity of the HPA axis. Since the pre- and postnatal periods 
are fundamental to the development of the central nervous system, 
stress during these periods can promote long-term changes in the 
brain function, increasing the risk of depression. Accordingly, 
studies have shown that poor maternal care during early life can in- 
duce epigenetic changes in the neuron-specific glucocorticoid re- 
ceptor NR3C1 promoter [69]. 

The HPA axis activity is governed by different hor- 
mones: corticotropin-releasing hormone (CRH) and vasopressin 
(AVP) from the hypothalamus; adrenocorticotrophic hormone 
(ACTH) from the pituitary; and glucocorticoids (cortisol) from the 
adrenal cortex. Increased levels of CRH and AVP will lead to in- 
creased ACTH that will, in turn, increase the secretion of cortisol. 
Cortisol has a negative feedback action on CRH, AVP, and ACTH 
release, thereby decreasing the activity of the HPA axis. Cortisol is 
fundamental to homeostatic and allostatic control of responses to 
the environment such as cognitive and affective coping mechanisms. 
Numerous studies have shown that chronic administration of gluco- 
corticoids leads to changes in mood and cognition [70]. 

A significant number of depressive patients show increased levels 
of glucocorticoids and glucocorticoid resistance, that is, a lack 
of feedback suppression of the HPA axis by glucocorticoids [40, 


70-72]. Glucocorticoids can activate two distinct types of recep- 
tors: mineralocorticoid (MR) and glucocorticoid (GR) receptors; 
MRs are related to a basal tone of activation of the HPA axis, while 
GRs play a more important role in the response to stress and in the 
negative feedback loop of HPA activation [70]. Studies have shown 
an imbalance in the expression of GRs and MRs in depressive pa- 
tients, leading to prolonged HPA axis activation, with resistance of 
the negative feedback loop [70, 72]. 

Activity of the HPA axis is fundamental to the control of several 
body functions, including metabolism, the immune system, and 
several brain functions such as neuronal survival, neurogenesis, and 
memory acquisition. In rodents, repeated glucocorticoid adminis- 
tration has been shown to reduce hippocampal neurogenesis and 
volume [73]. Depressed patients with psychotic features showed 
high evening cortisol levels, which is associated with poor cogni- 
tive performance [74]. Moreover, Cai and co-workers [75] showed 
that early life stress can lead to increased mitochondrial DNA 
(mtDNA) dysfunction and shortening of telomeric DNA, and that 
these changes were tissue-specific due to glucocorticoid secretion. 
Drugs for depression can improve HPA axis abnormalities through 
modulation of GR function [73]. Several studies in depressive pa- 
tients have reported increased levels of cortisol in saliva, plasma, 
and urine, along with hypertrophic pituitary and adrenal glands 
[70, 76]. Based on the HPA axis dysfunction theory of depression, 
mifepristone, a GR and progesterone receptor antagonist and an MR 
receptor agonist, has been studied in patients with bipolar depres- 
sion, with mixed results (for example, [77]). 


Abnormal immune system responses 


Glucocorticoids regulate not only the HPA axis, but also metab- 
olism and immunity. There are data suggesting that increased levels 
of pro-inflammatory cytokines can induce glucocorticoid resist- 
ance and promote HPA axis hyperactivity in depressive patients 
[72, 73, 78] (see also Chapter 15). Cytokines also activate microglia, 
causing neuroinflammation and amplification in inflammatory sig- 
nals through release of reactive oxygen and nitrogen species and 
chemokines [78]. This activation leads to sustained long-lasting 
brain inflammation that may contribute to depressive-like states and 
sickness behaviour. 

Studies have shown changes in the regulation of both pro- and 
anti-inflammatory cytokines in depression, and it is proposed 
that such changes are due to external (for example, family loss) 
or internal (for example, changes in gut flora) stressors [40, 79]. 
Preclinical research using administration of lipopolysaccharide 
(LPS), a major component of the outer membrane of Gram-negative 
bacteria, showed increased brain levels of tumour necrosis factor 
alpha (TNFa) up to 10 months after exposure [80]. Moreover, LPS 
can induce depressive-like behaviours in rodents, which can be at- 
tenuated by anti-inflammatory drugs [81]. Increased plasma cyto- 
kines (IL-6 and TNFa) and depressed mood are also observed in 
normal volunteers who received a low dose of this endotoxin [82]. 
Although there are some discrepancies between studies regarding 
changes in certain cytokines in depressed patients, elevated IL- 
6 has been found repeatedly [83, 84]. Increased levels of IL-6 
have been linked to an increase in C-reactive protein (CRP), bac- 
terial translocation in the gut, increased oxidative and nitrosative 
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stress, hyperactivity of the HPA axis, and a shift in the tryptophan 
pathway to the TRYCAT (tryptophan catabolite) pathway [85]. Due 
to its many pro-depressive features, antagonism of IL-6 may be a 
possible target to treat depression. Indeed, addition of the IL-6 re- 
ceptor (IL-6R) antagonist tocilizumab to drugs for depression could 
be useful [86]. Recently, a proof-of-concept RCT of infliximab, a 
TNFa-specific monoclonal antibody, has reported improvements in 
patients with treatment-resistant depression characterized by high 
inflammation at baseline [87]. Other promising therapies include 
antibodies against signal transducer and activator of transcription 3 
(STAT3) or soluble glycoprotein 130 (sgp130) [88]. 

In bipolar depression, a recent meta-analysis indicated that anti- 
inflammatory drugs exert antidepressant effects and can potentiate 
the efficacy of monoaminergic antidepressant drugs. Accordingly, 
N-acetylcysteine (NAC) shows an antidepressant effect; NAC also 
has an antioxidant effect, which may contribute to its effect on bi- 
polar depression. On the other hand, omega-3 fatty acids and ethyl- 
eicosapentanoate (EPA), which are also antioxidants, have shown 
mixed results in adjunctive treatment studies [88]. However, not all 
depressed patients show cytokine changes, and positive results with 
drugs that target inflammation appear to be restricted to a subset of 
patients with increased inflammatory biomarkers [78]. 

Increased oxidative and nitrosative stress is associated with in- 
creased inflammation, and studies have reported that even in the 
absence of a depressive phenotype, long-term stress exposure, es- 
pecially in early life, can lead to an increase in pro-inflammatory 
markers in blood [89, 90]. An increase in oxidative and nitrosative 
stress can also lead to decreased levels of neurotransmitters such as 
serotonin, noradrenaline, dopamine, and glutamate [91]. 


Circadian arrhythmicity and sleep disturbances 





Circadian rhythms are responsible for our adaptation to a 24- 
hour day schedule and to keep the homeostasis of the organism. 
Functions of the brain, peripheral organs (for example, heart 
and liver), hormone release, and the immune system all present 
a circadian rhythm. Several findings suggest an association be- 
tween depression and circadian rhythm changes. For example, 
sleep change (insomnia or hypersomnia) is one of the criteria for 
MDD diagnosis, and it is present in 60-90% of depressive patients. 
Furthermore, depressive symptoms can exhibit a circadian fluctu- 
ation, generally being more prominent in the morning than in the 
evening [92]. 

Each cell in our body contains a set of circadian genes that are 
differently expressed along the day. The main pacemaker, respon- 
sible for keeping the whole body in synchronicity, is located in the 
suprachiasmatic nucleus (SCN) in the anterior hypothalamus. The 
main circadian genes (circadian locomotor output cycles kaput— 
Clock; brain and muscle ARNT-Like 1—Bmal1; Period—Per1,2,3; 
Cryptochrome—Cry1,2) work in transcriptional/translational feed- 
back loops, that is, a positive and a negative loop that will stimulate 
and inhibit each other to keep the cycle going [93]. More specific- 
ally, increased levels of the heterodimer CLOCK-BMALI will in- 
crease the transcription of Per and Cry (that will heterodimerize) 
and REV-ERBa and RORa (retinoic acid-related orphan nuclear re- 
ceptors) (another heterodimer). Once the expression of these two 
heterodimers increases, they will inhibit the expression of Clock and 


Bmall. However, during the day, they will be degraded, resulting in 
increased levels of CLOCK and BMAL] again [93, 94]. 

The homeostatic functionality of the circadian system is dependent 
on the genes that control it, the levels of several neurotransmitters, 
including serotonin, noradrenaline, dopamine, and acetylcholine, 
and hormones [95, 96]. In homeostatic conditions, serotonin, that 
has its synthesis peak in the morning and presents seasonal rhyth- 
micity fluctuations, is the precursor to the synthesis of melatonin, 
one of the main controllers of rhythms. Serotonergic projections go 
from the raphe to the SCN, and vice versa. Noradrenaline, in turn, 
regulates melatonin synthesis, while dopaminergic neurons from 
the ventral tegmental area play a role in the control of REM sleep 
and adaptation to light [95-98]. Acetylcholine release is highest 
during awake time and REM sleep, but reduced during non-REM 
(NREM) sleep, and consequently loss of cholinergic neurons leads 
to sleep disturbances [99]. Regarding hormonal regulation, growth- 
hormone releasing factor (GHRF) (that is predominantly expressed 
in the first half of the night and is secreted in the hypothalamus and 
arcuate nucleus) plays a role in the control of NREM sleep [100]. 
Increasing levels of cortisol and CRH, which build up during the 
night, increase the frequency of REM sleep in the second half of the 
night [101, 102]. In MDD, the reduction of REM latency seems to 
be related to decreased cholinergic and monoaminergic activity, 
while increased NREM duration appears to be related to low levels 
of CRH, and shorter sleep episodes are related to decreased cortisol 
[101, 102]. This could be due to crosstalk between HPA dysfunction 
and sleep disturbance in depressive patients. 

Depressive patients seem to present arrhythmicity or length- 
ening of the circadian period and disruption in sleep archi- 
tecture, and it is proposed that part of the effect of drugs for 
depression, particularly agomelatine, can be due to re-establishment 
of normal circadian rhythm by shortening of the circadian period 
[103]. Polysomnographic studies have shown changes in the 
electroencephalographic profile of individuals with MDD. Depressed 
patients presented diminished sleep efficiency, shorter sleep period, 
and increased REM time with shorter REM latency [102]. 

Regarding the use of circadian rhythm interventions to treat de- 
pression, sleep deprivation (SD) is a promising therapy. Several 
different procedures for SD have been proposed: total SD, par- 
tial SD, and selective REM SD [104]. A few hours of SD appear to 
improve mood in depressive patients. SD appears to potentiate 
monoaminergic transmission, influence glutamatergic neurotrans- 
mission, increase metabolism in specific brain areas (ventral/lat- 
eral cingulate cortices and medial PFC), and increase neurogenesis, 
while resetting the circadian rhythm of the patient [104, 105]. 

Another non-pharmacological therapy employed to treat depres- 
sion is light therapy. Light therapy is frequently used to treat sea- 
sonal affective disorder (SAD) or MDD with seasonal patterning 
[106]. A defining characteristic of SAD is its seasonality, with de- 
pressive episodes commonly occurring during wintertime [106]. 
SAD is a complex disease and appears to be polyfactorial and poly- 
genetic and involves circadian arrhythmicity [106]. There is a cor- 
relation between latitude and the prevalence of SAD. Indeed SAD 
is more prevalent in northern latitudes [106, 107]. SAD appears to 
respond to light therapy, although data are limited [108]. The idea 
behind light therapy is that exposure to a strong light pulse daily 
at a specific time will help to keep the circadian rhythm by syn- 
chronizing the SCN [109]. The main components of light therapy 
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are intensity, wavelength, and duration and time of exposure. In pa- 
tients with SAD, light therapy reduced 5-HT transporter binding in 
the PFC [110]. 


Concluding remarks and future perspectives 


The discovery of iproniazid and imipramine and its molecular tar- 
gets resulted in the monoamine hypothesis of depression. Several 
improvements of monoamine-based drugs have been made, but 
their efficacy is still limited, with less than 50% of depressed patients 
remitting. It is therefore crucial to identify relevant targets for novel 
antidepressant treatment strategies to alleviate this serious, unmet 
medical need. Fundamental knowledge of the neurobiological sys- 
tems and molecular targets related to the pathophysiology of de- 
pression has improved significantly over the past years. As briefly 
reviewed in this chapter, abnormal G x E interactions, dysfunc- 
tional brain circuitries and neurotransmitter systems resulting in 
maladaptative neuroplasticity, and HPA axis dysfunction, along with 
abnormal immune system responses and circadian arrhythmicity, 
all play a role in depression. The glutamatergic system, inflamma- 
tory pathways, and neuromodulation of specific brain circuitries 
are all promising new targets for novel therapies. In addition, sev- 
eral preclinical studies suggest that neuropeptides, such as galanin, 
neuropeptide Y (NPY), and CRH, are targets for drug development. 
However, there are major species differences in the expression levels 
and anatomical location of neuropeptides and their receptors, which 
complicates translation of preclinical data to the clinic. 

With the advancement of genetics, epigenetics, and neuroimaging, 
stratification of patients with distinct diagnostic subtypes, therapy 
responses, dosing regimens, and likelihood for side effects will de- 
velop over the coming years. Moreover, gender is an important 
and largely ignored factor in depression. Indeed two-thirds of pa- 
tients with unipolar depression are female, and furthermore, the 
hormonal status of female patients can impact depression symp- 
tomatology [111-113]. Despite the fact that some initial studies sug- 
gested that depressed female patients would respond better to SSRIs, 
while male patients would respond better to tricyclic drugs, the in- 
fluence of gender in drug treatment response is poorly understood 
(112, 114, 115]. 

It is likely that precision medicine utilizing recent neurobiological 
insights and pharmacogenomic approaches will result in more per- 
sonalized therapies for a large proportion of depressed patients. This 
development will also have a major impact on our view of depressive 
disorder. 
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The familial nature of major depression has been well established. 
The results of twin, family, and adoption studies demonstrated that 
both genetic and environmental factors contribute to the aetiology 
of depression. More recently, progress in molecular biology has led 
to greater insight into the specific genetic factors that may underlie 
depression. This chapter updates information on the genetic epi- 
demiology of depression and reviews results of molecular genetics 
studies of depression. The sub-discipline of genetic epidemiology 
focuses on the identification of the role of genetic factors and their 
joint influence with environmental factors in disease aetiology. 
Genetic epidemiology employs traditional epidemiologic study de- 
signs, including case-control and cohort studies, to evaluate the ag- 
gregation in groups as closely related as twins or as loosely related as 
migrant cohorts. 

Prior to the molecular genetics era, study designs in genetic epi- 
demiology were devised to infer genetic causation by controlling for 
genetic background while letting the environment vary (for example, 
migrant cohorts, half siblings, separated twins) or, conversely, con- 
trolling for the environment while allowing variance in the genetic 
background (for example, siblings, twins, adoptees, non-biologic 
siblings). Measures of risk in genetic epidemiology include familial 
relative risk (disease risk in relatives of cases vs controls) and genetic 
attributable risk (the proportion of a particular disease that would 
be eliminated if a particular gene or genes were not involved in the 
disease). As described in the next section, sophisticated methods 
have been developed to compare combinations of genetic markers 
between cases and controls (for example, polygenic scores) and to 
estimate the proportion of phenotypic variance explained by genetic 
variants (typically SNPs) for complex traits [that is, genome-wide 
complex trait analysis (GCTA)] [1]. 

Major depressive disorder (MDD) is characterized by low mood 
and energy, the inability to experience enjoyment, changes to eating 
and sleep patterns, feelings of guilt or worthlessness, and suicidal 
thoughts. MDD is the second leading cause of disability worldwide 
[2]. Depression is highly heterogenous, and there are eight major 
subtypes of MDD thatare associated with different levels of disability 








and different patterns of familiality (the tendency of a trait to occur 
among members of a family, usually by heredity), comorbidity, and 
heritability (the proportion of variability that is genetic in origin, 
that is, the ratio of the genetic variance of a population to its pheno- 
typic variance). 


Genetic epidemiology of depression 


There has been a substantial body of research that investigates the 
familial and genetic factors underlying depression. There are several 
reviews of family, twin, and adoption studies of depression over the 
past decade [3-5]. Increased recognition of the role of biologic and 
genetic vulnerability factors for psychiatric disorders has led to re- 
search with increasing methodological sophistication over the past 
two decades. 

Family studies 

Familial aggregation is generally the first source of evidence that 
genetic factors may play a role in the aetiology of a disorder. The 
most common indicator of familial aggregation is the relative risk 
ratio, computed as the rate of a disorder in families of affected per- 
sons divided by the corresponding rate in families of controls. The 
patterns of genetic factors underlying a disorder can be inferred 
from the extent to which patterns of familial resemblance adhere 
to the expectations of Mendelian laws of inheritance. The degree 
of genetic relatedness among relatives is based on the proportion 
of shared genes between a particular relative and an index family 
member or proband. 

The familial aggregation of major depression has been exam- 
ined in numerous controlled studies. Aggregate estimates of the 
familial associations for major depression, based on reviews and 
meta-analyses of controlled family study [5] and registry [6] data, 
are substantially lower than those of bipolar disorder, with familial 
risk ratios averaging about 2.5-2.8, familial heritability of 0.32, and 
average twin heritability of 0.3-0.4 [7]. 


Adoption studies 


Adoption studies have been a major source of evidence regarding the 
joint contribution of genetic and environmental factors to disease 
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aetiology. However, with recent trends towards selective adoption 
and the diminishing frequency of adoptions in the United States, 
adoption studies are becoming less feasible methods for identifying 
genetic and environmental sources of disease aetiology (http:// 
www,johnstonsarchive.net/policy/adoptionstats.html). 

As reviewed by Sullivan et al. [5], few adoption studies of depres- 
sive disorders have been completed and report conflicting findings. 
These studies are difficult to carry out and often involve indirect 
sources of diagnostic information, for example using a sick leave 
registry as the source of clinical information [8] or obtaining only 
limited data on the biological parents [9]. More recent analyses of 
Swedish registry data revealed that both genetic and environmental 
factors had influences on depression in adopted offspring [10]. The 
most compelling finding from adoption studies, however, is the dra- 
matic increase in completed suicide among biological, as opposed to 
adoptive, relatives of mood disorder probands [11, 12]. 


Twin studies 


Twin studies that compare concordance rates for monozygotic twins 
(who share the same genotype) with those of dizygotic twins (who 
share an average of 50% of their genes) provide estimates of the de- 
gree to which genetic factors contribute to the aetiology of a disease 
phenotype. Path analytic approaches that estimate the proportion 
of variance attributable to additive genes and common and unique 
environment have been the standard method of analysis of data 
from large twin studies. The twin family design is one of the most 
powerful study designs in genetic epidemiology because it yields 
estimates of heritability, but also permits evaluation of multigen- 
erational patterns of expression of genetic and environmental risk 
factors. Finally, twin studies may inform the spectrum of expression 
of diseases and disease subtypes through identification of the com- 
ponents of the phenotype that are the most heritable. 

Twin studies of depression have been conducted on both clinical 
(for example, [13]) and community (for example, [14-16]) samples, 
or both (for example, [17]). Regardless of the method of data collec- 
tion, the heritability point estimates were broadly similar, and the 
95% confidence intervals were quite wide; there were no obvious 
gender differences, and shared environmental influences seemed 
to have little impact on liability to depression. Instead, the greater 
proportion of variance was due to individual specific environmental 
effects [5]. 

Heritability in twin studies of depression parallels the familial 
risks, demonstrating the contribution of genetic factors to the aeti- 
ology. During the past decade, there has been a shift from small 
clinical studies to studies with larger samples from registries and 
population samples, that has increased our ability to identify shared 
familial risk across these conditions, as well as patterns of familial 
specificity. The newer generation of family studies has also begun to 
expand phenotypic assessments to include dimensional measures of 
phenotypes and biologic measures that may be closer manifestations 
of the underlying genetic factors [18, 19]. 


Molecular genetics 


The major approaches that have successfully led to gene identifica- 
tion for Mendelian disorders are linkage and association studies. 
These approaches were highly successful for rare diseases that fol- 
lowed traditional modes of transmission, including autosomal dom- 
inant, recessive, or sex-linked patterns in families. With completion 


of the human genome project, nearly all of the genes underlying rare 
Mendelian disorders have been identified. 


Linkage 


Genetic linkage is the tendency of DNA sequences that are located 
close together on a chromosome to be inherited together during the 
meiosis phase of sexual reproduction. This tendency is exploited in 
linkage analysis, a genetic mapping technique that identifies regions 
of chromosomes that are likely to contain a risk gene, and is meas- 
ured by the percentage recombination between loci. The logarithm 
of the odds (LOD) score is the statistical estimate of whether two 
genes, or a gene and a disease gene, are likely to be located near each 
other on a chromosome and are therefore likely to be inherited to- 
gether. A LOD score of 3 or higher is generally understood to mean 
that two genes are located close to each other on the chromosome 
and indicates a statistically significant result. After linkage is estab- 
lished, fine mapping must be completed to attempt to isolate which 
gene or genes may be driving the linkage signal. Multiple genome- 
wide linkage studies of depression have been completed and report 
variable results. Statistically significant or suggestive results have 
been presented for depression on chromosomes 2, 3, 7, 8, 10, 15, and 
17 [20-33], with few replications. 


Association 


Although there have been dozens of studies of candidate genes 
in case-control studies of depression, none stood the test of inde- 
pendent replication [34]. These studies were intuitively appealing, 
but the low a priori probability rate that any particular locus could 
have a strong association with depression led to high false-positive 
rates that often misled the field [35]. The lack of identification of 
these candidate genes in genome-wide association studies (GWAS) 
was also disappointing. However, few of these studies were ad- 
equately powered to identify genes of small effect. A candidate gene 
approach, based on replicated findings identified in GWAS, may still 
be a promising future tactic to dissect the genetic architecture of 
subgroups of depression. 


Genome-wide association studies 


With advances in molecular biology, there has been a flood of in- 
formation on the contribution of genetic risk factors to complex 
diseases. This has transformed the sub-discipline of genetic epidemi- 
ology that previously relied solely on inferences based on pheno- 
typic disease manifestations in relatives. These advances are largely 
attributable to GWAS that identify common genetic variants or 
SNPs (common DNA variants with >1% population frequency) that 
are significantly more frequent in cases than in controls [36], with an 
a priori statistical significance threshold of 5 x 10°. Structural vari- 
ation, such as segmental duplications and deletions, or copy number 
variants (CNVs) can also be identified in GWAS. 

During the past decade, there has been dramatic growth in large- 
scale case-control studies to identify genetic markers associated 
with major mental disorders. To date, ten GWAS of MDD have been 
published [37-46], with only three loci of genome-wide signifi- 
cance reported [41, 46]. It is notable that many of the studies contain 
overlapping samples, especially since the advent of large inter- 
national data-sharing initiatives such as the Psychiatric Genomics 
Consortium (PGC) (http://www.med.unc.edu/pgc/) (Table 75.1). 
The PGC has completed the largest GWAS to date of MDD and found 
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Table 75.1 Genome-wide association studies of major depressive disorder published through 2016 
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Sullivan 2009 etherlands Study of Depression Lifetime DSM-IV 
and Anxiety (NESDA), MDD 
etherlands Twin Registry (NTR), 
etherlands Mental Health Survey 
and Incidence Study (NEMESIS), 
Adolescents at Risk for Anxiety and 
Depression (ARIADNE) 
Lewis 2010 Depression Case Control [DeCC] Recurrent 
study, depression 
Depression Network [DeNT] study, 
Genome-Based Therapeutic Drugs 
or Depression [GENDEP] 
Muglia 2010 ax-Planck Institute of Psychiatry © DSM-IV or ICD-10 
recurrent MDD 
CoLaus Recurrent MDD 
Meta-analysis 
Rietschel 2010 Department of Psychiatry, DSM-IV MDD 
University of Bonn, Germany 
Kohli 2011 unich Antidepressant Response At least a moderate 
Signature (MARS) depressive episode 
Shi 2011 Genetics of Recurrent Early-Onset Recurrent early- 
Depression (GenRED) onset MDD 
Shyn 2011 Sequenced Treatment Alternatives MDD 
o Relieve Depression (STAR’D) 
Wray 2012 Queensland Institute of Medical MDD 
Research (QIMR, Australia), 
The Netherlands Study of Anxiety 
and Depression (NESDA), 
The Netherlands Twin Registry 
(NTR), 
The University of Edinburgh (Uk), 
The Molecular Genetics of 
Schizophrenia study (controls only, 
USA 
Ripke 2013 ajor Depressive Disorder DSM-IV lifetime 
Working Group of the Psychiatric MDD 
GWAS Consortium 
Direk 2017 CHARGE consortium, PGC Lifetime MDD 
CHARGE consortium, PGC MDD 











no statistically significant hits [45]. More recently, however, the PGC 
presented the results of an analysis that stratified the sample by age 
at onset (AAO) and identified one replicated genome-wide signifi- 
cant locus associated with adult-onset (>27 years) MDD (rs7647854; 
odds ratio 1.16; 95% confidence interval 1.11-1.21; P = 5.2 x 10™') 
[47]. The technique of subsetting sample cases on the basis of char- 
acteristics, such as AAO, recurrence, or episodicity of depressive epi- 
sodes, and treatment response may help to disentangle the genetic 
heterogeneity of depression. Also, by examining a broad depression 
phenotype that included both MDD and depressive symptoms with 
data from the PGC and the Cohorts for Heart and Aging Research 
in Genomic Epidemiology (CHARGE) Consortium, a significant 
hit was found through a meta-analysis [46], whereas another study 


























Perlegen Sciences 1738 1802 No hits [37] 
high-density 
oligonucleotide arrays 
Illumina Human610- 1636 1594 No hits [38] 
Quad BeadChip 
Illumina 926 866 o hits [39] 
HumanHap550 SNP 
chip 
Affymetrix 500K SNP 492 1052 o hits 
chip 
1359 1782 o hits 
Illumina HumanHap 604 1364 o hits [40] 
550v3 (controls), 
Illumina human 610 
W quad BeadChips 
(cases) 
Illumina 100k, 300k 353 366 rs1545843 [41 
Beadchips 
Affymetrix 6.0 1020 1636 No hits 42 
Affymetrix 6.0, 5.0, 1221 1636 No hits 43 
500K, and Perlegen 
Illumina and Affymetrix 2431 3673 No hits 44 
platforms 
Multiple arrays 9240 9519 No hits [45] 
Multiple arrays 9240 S519) rs9825823, [46] 
6718 13,453 | 


examining depressive symptoms in African American and Latina 
women found no genome-wide significant hits [48]. Subsequent to 
the submission of this chapter in March 2016, six additional GWAS 
studies of depression have been completed and published; details of 
these are summarized by Ormel et al. [48a]. 

Despite the enthusiasm generated by the positive findings for 
many of these disorders, the total proportion of variance explained 
by even the largest international collaborative studies with hundreds 
of thousands of cases is still quite small (17-29% in cross-disorder 
analyses and much lower in others) [49]. This has led to substantial 
discussion regarding the so-called ‘missing heritability’ in GWAS. 
However, the findings that common genetic variants explain only 
a limited proportion of the variance is not surprising in light of 
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growing evidence regarding the role of undetected rare variants, 
environmental factors, and sources of misclassification of cases and 
controls in GWAS, including aetiologic and clinical heterogeneity 
within cases, misclassification of controls, and other factors that 
might reduce the power of these studies. The clinical samples from 
these studies have been highly heterogenous in terms of sampling 
source and diagnostic characteristics, and few of the control samples 
would meet traditional criteria for controls in epidemiology [50]. 
Future studies will require large systematic samples that are either 
directly recruited for a cohort study or existing registries and/or 
biobanks that have sufficiently large and well-characterized samples 
of cases, as well as built-in controls without the index conditions. 


Sequencing 


More recently, international collaborations have focused on 
genome-wide or exome (protein-coding) sequencing techniques, 
and this has been proposed as a tool that can be used in risk predic- 
tion and lead to a greater understanding of the aetiology, prognosis, 
and treatment response in psychiatric disorders [51]. Moreover, the 
analytic challenges will be quite complex, and novel techniques are 
in development [52]. Success of the sequencing approach has been 
shown in schizophrenia, where an exome sequencing study of 2536 
schizophrenia cases and 2543 controls showed polygenic burden 
from rare (<1 in 10,000), disruptive mutations distributed across 
many genes [53]. The first major success of genome sequencing 
in MDD was reported by the CONVERGE (China, Oxford, and 
Virginia Commonwealth University Experimental Research on 
Genetic Epidemiology) Consortium, which used low-coverage 
whole-genome sequencing of 5303 Chinese women with recurrent 
MDD and 5337 controls, and reported two loci contributing to risk 
of MDD on chromosome 10: one near the SIRT1 gene (P = 2.53 x 
10°), and the other in an intron of the LHPP gene (P= 6.45 x 10°”). 
Additional analyses of 4509 cases with melancholia yielded an in- 
creased genetic signal at the SIRT1 locus [54]. 


SNP-based polygenic approaches 


Several statistical approaches that take advantage of markers identi- 
fied in GWAS have also advanced our understanding of the genetic 
architecture of psychiatric disorders, notably genomic profile (or 
polygenic) risk scores and GCTA. 


Genomic profile (or polygenic) risk scores 


Polygenic scores summarize the genetic effects in a GWAS by com- 
puting a weighted sum of associated ‘risk’ alleles within each subject. 
Initially, markers (typically SNPs) are selected based on their evi- 
dence for association, typically their P-values, using a ‘training’ or 
discovery sample, and the weighted score is then constructed in 
an independent ‘testing’ or replication sample. If an association is 
found between a trait/disorder and the polygenic score, one assumes 
that a genetic signal is present among the selected markers. Later, 
this score can then be used for prediction of individual trait values 
[55]. The original use of polygenic scores has now been extended to 
include detecting shared genetic aetiology among traits or to infer 
the genetic architecture of a trait, to establish the presence of a gen- 
etic signal in underpowered studies, and can act as a biomarker for 
a phenotype [56]. Recently, Power et al. [47] used polygenic score 
analyses to show that earlier-onset major depression is genetically 


more similar to other major psychiatric disorders, e.g. schizophrenia 
and bipolar disorder, than later-onset depression is. 


Genome-wide complex trait analysis 


GCTA is used to estimate the proportion of phenotypic variance ex- 
plained by genetic variants (typically SNPs) for complex traits, and 
has been used to better understand the genetic architecture of com- 
plex traits [1]. It can be completed genome- or chromosome-wide, 
and using genomic-relatedness-matrix restricted maximum likeli- 
hood (GREML). It provides an estimate of narrow heritability that 
does not rely on the assumptions defined in standard twin studies. 
Instead it assumes that environmental factors are uncorrelated, with 
differences in the degree of genetic similarity for individuals who are 
not in the same extended families, and estimates genetic relatedness 
directly from the SNP data. This is contrary to the method used in 
standard behavioural genetics studies where pedigree-defined re- 
latedness is assumed [57]. 

In addition to defining the genetic relationship from genome- 
wide SNPs, GCTA can also be used to predict the genome- 
wide additive genetic effects for individual subjects and for 
individual SNPs, to estimate the linkage disequlibrium (LD) struc- 
ture encompassing a list of target SNPs and to estimate the genetic 
correlation between two traits or diseases using SNP data (http:// 
www.complextraitgenomics.com/software/gcta/index.html). This 
method was first successfully applied to analyse the genetic con- 
tribution to human height [58], but its use has now expanded to 
psychiatric disorders, including depression. Using this method, 
Ferentinos et al. [59] reported an SNP-based heritability estimate of 
0.17 for the presence/absence of early AAO of depression. Slightly 
higher estimates have been reported in a sample from China where 
common SNPs explained between 20% and 29% of the variance in 
MDD risk, and the heritability in MDD explained by each chromo- 
some was proportional to its length (r = 0.680; P = 0.0003), sup- 
porting a common polygenic aetiology [60]. 

It should be noted that polygenic scoring and GCTA assume addi- 
tive genetic variance, which does not take into account potential 
multiplicative gene-gene (G x G) interactions nor do they consider 
gene-environment (G x E) interactions. Furthermore, polygenic 
scoring and GCTA estimates are typically derived from SNPs, but 
other types of genetic variants (CNVs, segmental duplications, etc.) 
may also underlie disease aetiology. 


Pharmacogenetics 


Pharmacogenetics is the study of genetic differences in drug meta- 
bolic pathways that can affect drug response. Given that treatment 
response with selective serotonin reuptake inhibitors (SSRIs) and 
other antidepressant medication varies considerably between pa- 
tients but demonstrates familial aggregation [61, 62], and with only 
one-third of patients reaching remission [63], there has been a push 
to establish more rigorous biological markers of treatment response. 
The pharmacogenetics of depression has been reviewed by Fabbri 
et al. [64], Hamilton [65], Lin and Lane [66], and others. A number 
of GWAS [67-73] and meta-analyses [74, 75] have been completed 
to investigate the genetics of antidepressant response, as well as side 
effect profiles [76, 77]. Generally, these studies have not yielded 
positive results, and this has been attributed, in part, to the heter- 
ogenous nature of depressive disorders, the lower rates of heritability 
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when compared to other psychiatric disorders, and poor medication 
adherence [78]. 


Challenges in identifying genetic factors 
underlying depression 


The heterogeneity of depression has been widely documented. The 
concept of unipolar depression from early therapeutic and genetic 
research referred to severe cases of depression that led to hospital- 
ization. As the concept was extended to MDD that only required 
a 2-week episode, depression was characterized by substantially 
greater heterogeneity in terms of broadened symptom expression, 
comorbidity with anxiety and other conditions, and stronger asso- 
ciation with environmental stressors. Both genetic epidemiologic 
and molecular genetic studies of depression have systematically 
investigated these sources of heterogeneity in order to define more 
homogenous subgroups that may be more likely to reflect common 
underlying genetic architecture. 


Subtypes 


The major subtypes of depression in the Diagnostic and Statistical 
Manual of Mental Disorders (DSM) system are melancholic and 
atypical subtypes. There is substantial evidence from twin and family 
studies that the atypical subtype is more familial and may reflect in- 
flammatory processes that lead to elevated body mass index (BMI) 
and other cardiovascular risk factors. Other subtypes that have been 
examined in genetic studies include persistent depressive disorder 
(dysthymia), bipolar disorder, seasonal affective disorder (SAD), 
psychotic depression, perinatal/peripartum/post-partum depres- 
sion, and premenstrual dysphoric disorder (PMDD). The overlap 
across these subtypes and the lack of specificity in families, however, 
has not yielded greater specificity of genetic factors underlying these 
subtypes. 


Endophenotypes 


Another widely employed approach to reduce the heterogeneity 
of major depression has been the exploration of endophenotypes, 
defined as phenotypic traits or markers that may represent inter- 
mediate forms of expression between the output of underlying genes 
and the broader disease phenotype [79-81]. Studies of the role of 
genetic factors involved in these systems may be more informative 
than studies of the aggregate psychiatric phenotypes because they 
may represent more closely the expression of underlying biologic 
systems. A recent meta-analysis of psychiatric endophenotypes [81] 
suggests that currently identified endophenotypes are not superior 
to conventional phenotypic disease definitions. 


Incorporation of environmental factors 


To date, most of the molecular genetics studies in psychiatry have 
not incorporated environmental factors as a source of variance in 
aetiologic models. Most of the evidence for the role of the common 
and unique environment has been based on residuals from path ana- 
lytic models employed in twin studies. Stressful life events have been 
consistently associated with increased risk of mood disorders, but 
mood disorders also elevate the risk of life events, so the directional 
links have been complex [82]. More recent research has identified 
infections [83] and inflammation or immune response [84], which 
may elevate the risk of depression. However, this association may 
be related to comorbid conditions or may actually be bi-directional, 


as demonstrated in recent prospective studies of large population- 
based samples [84]. 

Although intriguing, studies that have examined candidate genes 
that may interact with stressful life events in the aetiology of de- 
pression have generally not been replicated. The widely cited study 
of Caspi et al. [85] has not been replicated in similar prospective 
studies, even in New Zealand [86], and meta-analyses have dem- 
onstrated that the original findings reflected a false-positive re- 
port [87, 88], with the most recent meta-analysis of more than 
30,000 subjects concluding that there is no interaction between the 
SHTTLPR locus with life events in the aetiology of depression [89]. 
Duncan and Keller [90] concluded that low power, along with low 
prior probability that a G x E hypothesis is true, suggests that most 
or even all positive candidate gene G x E findings represent type 
I errors. Methods to combine genetic susceptibility factors with en- 
vironmental exposures will be a major challenge in the next phase of 
psychiatric genetics, as described in the following section. 


Combining genetic and environmental factors 


The next phase of research in genetic epidemiology will require 
integration of research on the genetic and environmental risk fac- 
tors already described. Traditional study designs in genetic epi- 
demiology that can be used to study the joint influence of genetic 
and environmental factors include case-only studies and cross- 
sectional cases-control studies, as well as cohort studies on gene- 
environment interaction [91-93]. These study designs can now be 
extended to condition upon broad or specific genotypic similarity, 
based on GWAS, to identify environmental exposures that influence 
gene expression. 

There are a growing number of studies that have incorporated 
findings from genetic studies to identify the effect of environmental 
factors for diseases that have well-established genetic risk factors. 
The large scope of studies that will be required to detect the joint 
impact of genetic with environmental factors is daunting [94]. 
Although gene-environment interaction is generally assumed to be 
a key mechanism for links between genetic susceptibility and envir- 
onmental exposures in complex diseases, analyses of several of these 
cohort studies have yielded additive, rather than interactive, influ- 
ences of genetic and environmental risk factors. Likewise, studies 
of environmental factors among those with the APOE-e, genotype 
that confers increased risk of cognitive decline and Alzheimer’s dis- 
ease have found additive, rather than interactive, influences of other 
environmental risk factors [95, 96]. Incorporation of phenotyping 
that taps the domains underlying broad diagnostic categories, based 
on knowledge of underlying biologic pathways, coupled with built- 
in hypothesis-based environmental exposures, will facilitate the 
integration of advances in molecular genetics and environmental 
science. 


Summary and conclusions 


The results of twin, family, and adoption studies demonstrated that 
both genetic and environmental factors contribute to the aetiology 
of depression. Depression is highly familial, with familial risk ratios 
averaging about 2.5-2.8, a familial heritability of 0.32, an average 
twin heritability of 0.3-0.4, and a SNP-based heritability of 0.17 for 
the presence/absence of early AAO of depression. To date, no genetic 
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markers have been associated with aggregate major depression nor 
its major subtypes in GWAS of tens of thousands of people; however, 
stratification of the sample by AAO allowed for the identification of 
one replicated genome-wide significant locus associated with adult- 
onset MDD. The first major success of genome sequencing was re- 
ported by the CONVERGE Consortium, which reported two loci 
contributing to risk of MDD. 

Challenges to studying the genetic underpinnings of depression 
include its heterogeneity, multifactorial aetiology, and pervasive 
comorbidity with other conditions. Over time, as the concept of 
MDD was broadened, depression was characterized by substantially 
greater heterogeneity in terms of symptom expression, comorbidity 
with anxiety and other conditions, and stronger association with en- 
vironmental stressors. Both genetic epidemiologic and molecular 
genetics studies of depression have systematically investigated these 
sources of heterogeneity in order to define more homogenous sub- 
groups that may be more likely to reflect a common underlying 
genetic architecture. Incorporating environmental factors has been 
a challenge, and though stressful life events have been consistently 
associated with increased risk of mood disorders, mood disorders 
also elevate the risk of life events, so establishing the directional links 
has been complex. 

The next phase of research in genetic epidemiology will require 
integration of research on the genetic and environmental risk fac- 
tors already described. Building upon traditional study designs in 
genetic epidemiology that can be used to study the joint influence 
of genetic and environmental factors include case-only studies and 
cross-sectional case-control studies, as well as cohort studies on 
gene-environment interaction, which can now be extended to con- 
dition upon broad or specific genotypic similarity, based on GWAS, 
to identify environmental exposures that influence gene expression. 
This has led to increased recognition of the role of biologic and gen- 
etic vulnerability factors for mood disorders and to research with 
increasing methodological sophistication over the past two decades. 
This review also demonstrates the heterogeneity of depression and 
its multifactorial aetiology that will require future research that em- 
braces sources of complexity, including genetic, biologic, and envir- 
onmental factors. 
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A range of neuroimaging techniques have the potential and, indeed, 
are contributing to the understanding of the aetiology, progression, 
and treatment of affective disorder. In the case of unipolar depres- 
sion, they have underlined the message that depression is, in some 
sense, a brain disease. At the same time, they have reinforced the 
continuity with studies of emotion and cognition in the general 
population and in other disorders. However, they have also high- 
lighted some of the weaknesses of a biological approach based on 
small samples and inadequate mechanistic understanding. There are 
important overlaps between this chapter and Chapter 71 on imaging 
studies in bipolar disorder (BD). 











The majority of imaging studies relevant to major depression have 
employed case-control designs in which a group of patients with, or 
at risk of, depression are compared to a control group. These studies 
report group-level differences in one or more neuroimaging meas- 
ures, quantifying the significance of their findings using statistical 
approaches which estimate the probability of the group differences 
arising by chance. The aim of this approach is to identify features of 
the brain’s structure or function which differ between groups from 
which inferences about illness mechanism may be drawn. However, 
as discussed in Chapter 75, it is likely that a number of distinct 
mechanistic processes lead to the expression of the major depres- 
sive phenotype. The ability to determine an imaging signature of a 
diagnosis is clearly bounded by the degree to which individuals with 
the diagnosis share common aetiological processes; the absence of 
the same [1] will make it much more difficult to identify clear neural 
signatures when comparing groups of depressed and non-depressed 
individuals. A second limitation to case-control designs is that 
they provide only weak evidence that the identified neuroimaging 
processes are causally related to pathology. Group differences in 
neuroimaging outcomes between patients with depression and con- 
trols may arise because: (1) changes in the identified neural system 
are causally related to the disorder; (2) having the disorder causes 
changes (for example, taking medication or leaving the house less) 





which then leads to the observed neuroimaging effect; or (3) a third 
factor causes both the expression of the illness and, separately, the 
observed neuroimaging finding. These limitations are widely ac- 
knowledged and can be mitigated by imaging studies using alterna- 
tive designs, the most prominent of which are briefly summarized in 
the following sections. 


li 
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Prospective studies 


Prospective cohort studies can demonstrate whether a neural process 
temporally precedes illness and may have a causal role in outcomes. 
Studies ofincreasing ambition have recently begun to be established, 
with early results relevant to depression already published [2]. 


Experimental studies 


Experimental studies measure the effects of interventions which 
manipulate neural systems potentially relevant to pathology. They 
are able to provide strong evidence on the causal role of the system in 
disease. While the capacity to target and manipulate specific neural 
circuits is clearly limited in human, compared to animal, models, a 
degree of anatomical specificity can be achieved in human subjects 
using electrical or magnetic stimulation, and some biochemical spe- 
cificity is possible using pharmacological manipulations. For ex- 
ample, increased serotonergic function reduces amygdala activity in 
response to negative stimuli in both non-depressed control [3] and 
patient [4] populations, with the degree of this reduction predicting 
later response in patients [5]. 





Lastly, there has been increasing recent interest in study designs 
in which neuroimaging techniques are used to classify patients. 
Broadly, two procedures have been used. Firstly, unsupervised clas- 
sification attempts to find structure in the imaging data itself, for 
example by asking whether the data from depressed patients can be 
clustered into separate groups (see later in this chapter for detailed 
examples). A particular advantage of this approach is that it is less 
a hostage to heterogeneity than case-control designs. Unsupervised 
classification techniques could conceivably identify the neural signa- 
tures of different mechanistic processes which lead to major depres- 
sion. Secondly, supervised classification uses some sort of ground 
truth value to determine the group to which a person belongs (for 
example, has a patient responded or not to their treatment?). The 
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supervised classification procedure then attempts to uncover a 
rule by which the neuroimaging data may be used to arrive at this 
ground truth. 

‘The statistical underpinning of these classification designs is quite 
different from that of case-control studies. Whereas case-control 
studies examine the mean differences between groups of patients 
and controls, asking ‘what is the probability that this group differ- 
ence occurred by chance?; classification studies develop a rule to 
separate individuals into different groups and then ask ‘does my rule 
work when I apply it to a different set of data? Thus, the measure 
of whether a classification process is meaningful is its capacity to 
predict the structure of new data (sometimes called ‘out-of-sample 
data’). The focus on predicting the properties of new data means that 
there is potential application in clinically relevant situations. For 
example, a classification rule which accurately predicts response to 
treatment may, in principle, be used to personalize treatment choice 
for individual patients. 

The neuroimaging literature for major depression is split into 
studies which have examined genetic risk factors for major depres- 
sion, those which have examined the broader risk phenotype, and 
finally those which have examined the phenotype itself. This litera- 
ture is reviewed later in this chapter, with findings which progress 
beyond case-control evidence particularly highlighted. Imaging 
studies in psychiatry have often been too small and have reported ef- 
fect sizes that are potentially inflated by post hoc statistical methods 
and publication bias [6]. In addition, cases and controls ought to be 
recruited using comparable methods, which is a challenge for clin- 
ical case series. To allow confidence in what we describe here, we will 
describe pooled analyses of data from many comparable studies, in 
preference to single studies, where possible. 


Imaging the brain at risk for major 


depression: ‘imaging genomics’ 





As Chapters 70, 71, and 72 illustrate, a major depressive episode 
arises on a background of risk determined by genetics and early ex- 
perience. The genetic basis of MDD implicates many genes of modest 
effect, whose identity is only now emerging from the very large sam- 
ples required for statistical confidence in association studies. Such 
genes might be operating through anxiety proneness, since the per- 
sonality trait of neuroticism is a strong predictor of the risk of MDD. 
Alternatively, new genes may implicate new mechanisms related to 
decompensation in the face of life stresses or ‘events. Imaging can 
identify brain structure and brain function, and both must be, in 
part, under genetic control and vary accordingly. Historically, there 
have been two distinct phases of ‘imaging genomics, as it has been 
ambitiously named. The first focused on candidate genes that had 
been suggested might be contributory to genetic risk, based on con- 
temporary understanding of neurobiology. The second, into which 
we are now emerging, can be described as discovery-based and relies 
on post hoc associations derived from GWAS. 





1. Reproduced from Mol Psychiatry, 18(4), Murphy SE, Norbury R, Godlewska 
BR, et al., The effect of the serotonin transporter polymorphism (5-HTTLPR) on 
amygdala function: a meta-analysis, pp. 512-20, Copyright (2013), with permis- 
sion from Springer Nature. 


Of the candidate gene approaches, the serotonin transporter 
(SERT) has long seemed a potential locus for a genetic effect. Indeed 
the first putative association between a polymorphism in the regula- 
tory region of the SERT gene and anxiety trait appeared over 20 years 
ago. It was apparently supported by other very highly cited research 
[7], which suggested an interaction between the polymorphism and 
MDD, modulated by exposure to adverse life events. In support of 
these findings, both functional and structural studies of the brain in 
healthy volunteers suggested an association between the SERT poly- 
morphism and brain function. The original study suggested that 
individuals with one or two copies of the short allele of the SERT pro- 
moter polymorphism exhibited greater amygdala neuronal activity, 
as assessed by BOLD, in response to fearful stimuli, compared with 
individuals homozygous for the long allele [8]. Further studies re- 
ported reduced grey matter volumes in the limbic system of healthy 
volunteers in association with the short allele [9] and that MDD was 
associated with the polymorphism [10]. The apparent coherence of 
these findings led to an increasingly optimistic view that imaging 
provided endophenotypes that were easier to study and more valid 
than ‘disorders’ for genetic analysis. On this view, an endophenotype 
could provide a target between the genotype and any disease; this 
could speed the development of precision medicine and would also 
be easier to model in animal studies [11]. 

The subsequent developments around this topic have been highly 
contested. In essence, the best known findings linking the SERT ei- 
ther with MDD or with abnormal brain function were shown to be 
at best overstated, and at worst, artefacts of enthusiasm. It has high- 
lighted the general problem of statistical power, excess positive find- 
ings, and the problem of reproducibility in such studies. In relation 
to the BOLD imaging findings, a recent review concluded “... there 
was considerable between-study heterogeneity, which could not 
be fully accounted for by the study design and sample characteris- 
tics that we investigated. In addition, there was evidence of excess 
statistical significance among published studies. These findings in- 
dicate that the association between the 5-HTTLPR and amygdala 
activation is smaller than originally thought, and that the majority 
of previous studies have been considerably under powered to reli- 
ably demonstrate an effect of this size” [12]. Moreover, the assump- 
tion that endophenotypes will usually turn out to demonstrate 
larger genetic effect sizes appears to go beyond the evidence from 
the existing animal models as well [13]. The imaging genomics field 
continues to struggle with the paradox of a continuing plethora of 
positive findings from studies that are simply underpowered to de- 
tect reliably what is often claimed from post hoc analysis of data. 

Discovery-based approaches will inform imaging in two ways. 
Any GWAS or CNV-based variant showing a strong association 
with a phenotype of interest is likely to be a better choice of can- 
didate gene than, for example, the SERT (which does not emerge 
reliably from GWAS). Secondly, pooled imaging data can be directly 
related to GWAS data in discovery-based designs. For example, 
the Enhancing NeuroImaging Genetics through Meta-Analysis 
(ENIGMA) Consortium has identified common genetic variants 
associated with hippocampal volume. Their philosophy for pooling 
many data sets promises results for genes with clear effects on brain 
development [14]. However, a link through this approach to major 
depression has not yet been reported and the power of even very 
large genetic studies to detect depression-related genes suggests cau- 
tion (see Chapter 75). 





Imaging the at-risk phenotype in major depression 


Imaging young people at risk for depression is an obvious way to 
relate risk for depression to brain structure and function. Perhaps 
surprisingly, such studies have not been very numerous, but system- 
atic reviews suggest they have been generally consistent. Being at 
risk may mean by virtue of familial risk or early adverse experience. 
The children of a proband with MDD are at risk for MDD. However, 
they are also at risk for anxiety disorders. Moreover, the child of a 
parent with BD may also be at risk for MDD and anxiety disorders 
(and not necessarily BD). Thus, the state of being ‘at risk’ may not be 
very specific. As a further complication, some children will prove 
to be resilient, so not actually at risk at all. These complexities con- 
found all such studies designed on the basis of family history, unless 
there is a prolonged follow-up to determine actual outcomes. This is 
not always feasible. Most studies of at-risk offspring are conducted 
in teenagers. 

Being at risk due to early abuse and neglect risks confounding by 
psychiatric illness in the parent. Thus, the at-risk phenotype may 
be at risk by virtue of both the early abuse and neglect inflicted by 
the parent(s) and by virtue of genes inherited from the parent. In 
practice, it will always be difficult, if not impossible, to control for 
the latter. 


Structural imaging: brain volumes and white 
matter integrity 


Volumetric studies of the developing human brain across adoles- 
cence show early cortical grey matter volume (CGMV) increases, 
followed by decreases, and monotonic increases in cerebral white 
matter volume (CWMV) [15]. There are no consistent reports of an 
abnormal growth trajectory for either CGMV or CWMV in young 
people at risk for depression, but the fact that the brain is changing 
over the critical decades makes such studies additionally difficult. 
Differences in the brain of adolescent patients with MDD, compared 
with controls, are seen in the cortex. 

Pooling of studies of subjects at risk by virtue of childhood abuse 
and neglect offers some preliminary findings. In such an analysis, in- 
creased exposure to childhood adversity was associated with smaller 
caudate volumes bilaterally, only in girls. All subcategories of child- 
hood adversity showed the effect, independent ofan MDD diagnosis 
[16]. The absence of the effect in boys is a striking example of sexual 
dimorphism, but it is not paralleled by different causal pathways to 
depression in men and women [17]. There was no effect of child- 
hood adversity on hippocampal volume. 

Diffusion-weighted MR (magnetic resonance) imaging (diffusion 
tensor imaging, or DTI) is being used increasingly to determine the 
integrity of white matter in MDD. DTI resolves the diffusibility of 
water in different directions to give a measure called fractional an- 
isotropy (FA). If diffusion is highly constrained, as along the axons 
of a white matter tract, FA values are high. Unconstrained diffusion, 
as in saline solution or the cerebrospinal fluid, has zero FA. Reduced 
FA may imply white matter disorganization, either as a develop- 
mental or as an acquired property of the brain. 

A small, but growing, number of studies have addressed ‘at-risk’ 
populations. Participants with a family history of depression have 
reduced FA in the cingulum and other white matter tracts [18]. 


CHAPTER 76 Imaging of depressive disorders 


Indeed, reduced FA has been a common finding in a range of at-risk 
studies—for psychosis [19], BD [20], and even antisocial personality 
disorder [21]. There has been usually a quite surprisingly diffuse ef- 
fect across the white matter skeleton. Most studies have been indi- 
vidually underpowered, but a substantial overall effect is obvious. 


Functional imaging: resting states and brain activation 


Functional imaging studies might be predicted to be a sensitive 
method to distinguish high-risk from low-risk subjects. Resting 
state connectivity and the patterns of activation in cognitive tasks 
have provided convincing methodology. However, differences in 
experimental design often make data pooling difficult. While still 
preliminary, the published studies have highlighted the insula 
cortex and its connections and the inferior frontal regions as key 
components within the networks processing emotional experience. 
Lesion studies had already suggested that the insula is important for 
integrating cognitive, affective, sensory, and autonomic information 
to create a conscious experience of feeling [22]. Frontal areas have 
been implicated as possible brain regions mediating cognitive con- 
trol or feedback to brain regions with more reflexive reactions to 
threat such as the amygdala. 

Young relatives of bipolar probands are at risk for both MDD and 
BD. Those who subsequently developed MDD demonstrated rela- 
tively increased activation in the insula cortex in a challenge task, 
compared to controls and high-risk subjects who remained well. In 
the latter groups, this region demonstrated reduced engagement with 
increasing task difficulty. The high-risk subjects who subsequently 
developed MDD did not demonstrate normal disengagement [23]. 
Insula involvement may not be specific; young unmedicated partici- 
pants with BD type II had increased coherence across several brain 
regions at rest in a temporo-insular network, including the bilateral 
insula and putamen [24]. 

Other work has highlighted the inferior frontal cortex as a locus 
showing weaker connectivity at rest with other regions (including 
the insula and temporal gyrus in both BD and young people at risk 
for bipolar and other mood disorder [25]. The same group had shown 
reduced activation of the inferior frontal lobe when inhibiting re- 
sponse to fearful faces in at-risk subjects [26] and reduced cortical 
thickness in the left pars orbitalis of the inferior frontal gyrus (IFG), 
compared with controls [27]. So far, at-risk studies have not differ- 
entiated well between bipolar and unipolar risks. Indeed, at a brain 
systems level, they may be very similar. 

The risk for future episodes of depression is increased in pa- 
tients who have remitted from previous episodes. Thus, remitted 
patients may also be considered an ‘at-risk’ group. Interpretation 
of neuroimaging findings comparing remitted patients and never- 
depressed control subjects is complicated by the possibility that 
persistent effects between episodes of the illness may be related to 
trait-like risk processes or to a scarring effect of previous episodes. 
Resting connectivity in such patients has been reported to be re- 
duced within cognitive control networks [28], as has default mode 
activity during effortful tasks [29]. Additionally, activity within 
reward-related circuitry has been reported to be disordered [30] and 
amygdala responses to faces increased [31]. 

There have been also a limited number of functional neuroimaging 
studies of children and adolescents exposed to early neglect and/ 
or maltreatment (physical, sexual, and emotional). In reviewing 
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these findings, there have been reports of heightened and reduced 
amygdala responses to threat in maltreated samples, blunted stri- 
atal response to anticipation and receipt of rewards, and increased 
activation in the anterior cingulate cortex during processing of emo- 
tional stimuli and executive control tasks [32]. 


Imaging the MDD phenotype 


Structural imaging: brain volumes and white 
matter integrity 


Patients with MDD are probably a highly heterogenous group, es- 
pecially when studied in their maturity with an established pattern 
of illness. Thus, one may anticipate an impact from the current 
illness itself, any vulnerability factor as described in the previous 
paragraphs, and the uncertain effects of physical illness and life- 
style choices. All or any of these cumulative influences may produce 
acquired brain changes in mature patients. In addition, current or 
previous drug treatment may contribute to the picture either as an 
ongoing drug effect’ or as a consequence of a relatively recent with- 
drawal. The impact of these confounding factors has been greatly in- 
creased by the use of relatively small samples drawn from individual 
sites. However, this problem has now been well recognized. 
Systematic reviews of published data provide one solution. 
Judicious exclusion criteria can then identify very comparable 
studies that control for one or more of the obvious confounds. For 
example, 14 published studies of about 400 medication-free cases 
with MDD showed reduced grey matter in the prefrontal and limbic 
cortices (including the hippocampus bilaterally) [33]. The alterna- 
tive to controlling for variables in this piecemeal way is to pool as 
many individual data sets as possible and allow statistical power 
to overcome the confounds in a single mega-analysis. This is the 
principle behind the ENIGMA project (http://enigma.ini-usc.edu). 
Thus, three-dimensional brain imaging data from 1728 MDD pa- 
tients and 7199 controls (15 research projects worldwide) showed 


that patients had significantly lower hippocampal volumes (Cohen's 
d = -0.14; % difference = -1.24). The effect was driven by patients 
with recurrent MDD (Cohen's d = -0.17; % difference = —-1.44), so 
there were no differences between first-episode patients and con- 
trols. These effects are, of course, small. They may imply that loss 
of hippocampal volume is an acquired feature of MDD, related to 
the illness course. However, no cross-sectional study can prove that. 
Earlier age of onset was associated with a smaller hippocampus and 
a trend towards a smaller amygdala and larger lateral ventricles. 
Current symptoms, use of drugs for depression, and methodology 
had no effect [34]. The absence of effect in first-episode MDD pa- 
tients echoes findings in individual, relatively well-powered, and 
controlled studies [35]. 

The ENIGMA group’ study of cortical volumes in 2148 MDD 
patients and 7957 controls showed thinner cortical grey matter in 
the orbitofrontal cortex, anterior and posterior cingulate, insula, 
and temporal lobes in adults aged over 21 years (effect sizes -0.10 
to -0.14) (Fig. 76.1). These effects were more pronounced in first- 
episode and early-onset cases. By contrast, effects in adolescents 
were larger but occurred as changes in regional brain volume, not 
cortical thickness. The affected brain areas included medial and 
superior frontal areas and somatosensory and motor areas (effect 
sizes -0.26 to -0.57) and were most striking in recurrent MDD [36]. 
This complex pattern implies different developmental trajectories 
for the cerebral cortex in early-onset cases, compared with controls. 

It is an open question whether improved-quality multimodal 
imaging can refine methodology to produce more precise results in 
smaller samples. A pooled analysis from a selection of such studies 
employing peak co-ordinates for MR structural data showed re- 
duced grey matter in the amygdala, dorsal fronto-median cortex, 
and right para-cingulate cortex. In the same patients, PET demon- 
strated increases in glucose metabolism in the right subgenual and 
pregenual anterior cingulate [37]. The connectivity of the subgenual 
region had previously been shown to suggest projections to the 
nucleus accumbens, amygdala, hypothalamus, and orbitofrontal 
cortex [38]. 
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Fig. 76.1 (see Colour Plate section) Meta-analysis of magnetic resonance scans from patients with major depression, compared with controls, from 
the ENIGMA working group [34]. Effect sizes for regions with significant (PFDR <0.05) cortical thinning are shown in red. Negative effect sizes d indicate 


cortical thinning in MDD, compared to controls. 


Reproduced from Mol Psychiatry, 21(6), Schmaal L, Veltman DJ, Van Erp TGM, et al., Subcortical brain alterations in major depressive disorder: findings from the ENIGMA 
Major Depressive Disorder working group, pp. 806-12, Copyright (2016), Macmillan Publishers Ltd. Reproduced under the Creative Commons Attribution License (CC BY 4.0). 


Functional imaging: resting state and task-based studies 


In one sense, resting state data are well suited to aggregation in sec- 
ondary analyses; the lack of any task for subjects to perform during 
scanning means that their experience inside the scanner should be 
reasonably consistent across studies. However, a large variety of stat- 
istical approaches are used in the first-level analysis of resting state 
data; the choice can profoundly influence the connectivity metrics 
generated. The two common methods for analysing resting state 
data involve either correlating BOLD signal against a range of seed 
regions (with different sets of seeds used in different laboratories) 
or using data-driven independent component analyses of whole 
brain data. It is difficult to draw firm conclusions from such variable 
approaches, although increased connectivity in depressed patients 
within the default mode network is perhaps the most consistently 
reported finding [39]. While it is an advantage of resting state data 
that the same networks can be identified with reasonable reliability 
across studies, it is less clear how differences in these networks re- 
late to cognitive function or why they may be related to symptoms 
[40]. This explanatory gap means we do not understand what role 
increased connectivity within the default mode network plays in 
major depression. 

In contrast, task-based fMRI studies in depressed patients have 
been influenced by dual-process, mechanistic models of emotional 
and cognitive control. These models suggest two neural systems to 
be key—firstly, a bottom-up, limbic-based system which responds 
relatively automatically to salient stimulus features, and secondly, a 
top-down frontal-based control system which responds more flex- 
ibly and is able to modify activity of the limbic system. Published 
studies have tended to report altered activity of frontal regions 
during emotional or effortful tasks, which has been interpreted as 
reflecting reduced or deficient function of the top-down control sys- 
tems [41, 42]. This goes along with evidence of increased activity of 
limbic regions, particularly in response to negative affective stimuli 
[43]. One attraction of this proposal is that it links imaging findings 
not only to symptoms, but also to cognitive abnormalities observed 
in patients such as reduced executive function or processing biases 
for negative information. However, very similar ‘hyperactive limbic 
system combined with deficient control system’ models have been 
proposed for a range of psychiatric diagnoses such as anxiety, BD, 
and attention-deficit/hyperactivity disorder (ADHD). Whether this 
explanatory overlap between diagnoses is a result of common neural 
mechanisms across disorders, a reflection of the current technical 
limitations of neuroimaging modalities, or of a relatively under- 
developed imaging literature is not clear. Whatever the explanation, 
current imaging-based mechanistic models of major depression 
do not have the specificity to distinguish between different illness 
phenotypes. 


Classification studies in major depression 


The RDoC project has exemplified the search for objective meas- 
ures, such as those from neuroimaging, to inform novel categorical 
schemes in mental health and disease (see Chapter 8). It has led to 
an increased focus on studies which recruit patients with a range 
of difficulties and diagnoses in order to identify common processes 
across conventional symptom-based diagnoses. A slightly less am- 
bitious approach attempts to identify subcategories within existing 
diagnoses. The common observation that patients with major de- 
pression show heterogenous illness courses, response to treatment, 
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and risk factors has prompted a number of early attempts to identify 
subgroups of depressed patients such as those with a melancholic 
vs a reactive illness course. Historically, it proved challenging to 
demonstrate that such subgroups were clinically meaningful, for 
example that they provide useful information on prognosis or re- 
sponse to treatment. A recent influential study provides a first hint 
that neuroimaging measures may be useful in this project. In the 
study illustrated in Fig. 76.2, Drysdale and colleagues amassed over 
1000 resting state scans from depressed patients and controls [44]. 
Using an unsupervised classification approach, based on resting 
state connectivity measures and symptom score measures, the au- 
thors claim to have identified four distinct subgroups of patients. 
The groups displayed a reasonably specific pattern of resting state 
connectivity and associated cluster of symptoms. They then demon- 
strated that they were able to identify patients with these symptom 
clusters using just the connectivity data, in a sample of 400 new par- 
ticipants. However, the validity of the reported clustering of patients 
has been questioned in recent work (44a) and, even if the clusters do 
turn out to be reliable, this does not not guarantee that they will be 
clinically useful. The authors provided important initial reassuring 
evidence on this point. The four clusters of patients responded dif- 
ferently to repeated transcranial magnetic stimulation (rTMS). 
Further assessment of the utility of this clustering technique will 
require replication of the cluster identification process by other re- 
search groups and confirmation that cluster assignment is associated 
with response to more mainstream treatments or to general illness 
course. However, the study demonstrates a potentially exciting de- 
velopment beyond symptoms; this has been long anticipated, but 
not hitherto delivered. 

Supervised learning, the second approach to classification, has 
most commonly been employed to classify depressed patients into 
responders vs non-responders to treatment. The majority of pharma- 
cological and psychological treatments for depression take weeks to 
fully influence subjective symptom measures. A significant propor- 
tion of patients do not respond to the initial treatment regime. As 
a result, there is often a long delay before patients are started on ef- 
fective therapy. A number of studies have looked at whether an indi- 
vidual will respond to a specific treatment, based on structural and/ 
or functional neuroimaging data collected before treatment is initi- 
ated or over the first few days of treatment. The rationale is that such 
measures could pave the way to personalized therapy in which the 
neuroimaging outcomes would be used to select the treatment that 
is most likely to help an individual patient. Published studies to date 
have employed modest sample sizes and have been unable to test 
the performance of the classifiers using a fully held-out sample [45]; 
within-sample accuracies (that is usually estimated using ‘leave- 
one-out’ procedures) have tended to range between 60% and 70%. 
Response rates to antidepressants in unstratified clinical trials are 
often around 50%, so a 10-20% improvement appears promising for 
clinical settings. Ongoing projects are collecting larger sample sizes, 
which will allow more robust out-of-sample validation of classifier 
performance. 

Improvements in methodology may well identify reliable sub- 
groups of patients or classifiers which identify treatment responders 
vs non-responders. The next stage will be to test the use of these 
classifiers in RCTs. Imaging is still both unwieldy and relatively ex- 
pensive, and other methods of objectifying behavioural phenotypes 
will be developed from self-rating and wearable devices, which may 
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Fig. 76.2 (see Colour Plate section) Definition of subtypes of depression using resting state fMRI data taken from [44]. Resting state data from 
depressed patients and control subjects were collected. (a) Regions of interest were defined across the brain, and the mean BOLD time series from each 
of these regions was extracted for each subject. (b) For each subject, the correlation between all pairs of time series was calculated as a measure of 
connectivity. The figure illustrates the correlation between the regions of interest which are arranged depending on the neural system of which they are 
a member. (c) and (d) A data reduction method (canonical correlation; CC) was then used to find simple relationships between the imaging data shown 
in (b) and patient depression scores measured using the Hamilton Depression Rating Scale. CC looks for common factors in the imaging and symptom 
score data sets; here two were found: (c) an ‘anhedonia’-related scale and (d) an ‘anxiety’-related scale. The brain images display the regions of interest 


provide cheaper means of patient stratification. Whether imaging 
will find a role generating classifiers in clinical practice is still uncer- 
tain. Its widespread adoption will require evidence for meaningful 
and cost-effective improvement in outcomes. 


White matter integrity and neurodevelopment 


As already noted, diffusion-weighted MR imaging (DTI) offers a 
sensitive way to determine the integrity of white matter. Apparent 
abnormalities are seen in at-risk samples. A quantitative voxel-based 
pooled meta-analysis of FA of almost 400 first-episode, drug-naive 
MDD revealed reductions in the corpus callosum, bilateral anterior 
internal capsule, and right inferior temporal and right superior 
frontal gyri. FA reductions in some of these areas correlated with 
symptom scores and duration of depression [46]. In a similar large 
pooled analysis comparing over 500 more heterogenous MDD pa- 
tients and controls, there were trends to reduced FA in the corpus 
callosum (CC) and inferior fronto-occipital fasciculus (effect sizes — 
0.19 and -0.20, respectively) [47]. 

These findings in people early in the illness course suggest de- 
creased fibre coherence or glial distribution [48] to be associ- 
ated with MDD. Given similar findings in at-risk populations 
and first-episode cases, the abnormality appears likely to be 
neurodevelopmental, rather than acquired, as a result of illness 
course or medication. FA increases monotonically during adoles- 
cence, but at different rates in different brain structures; thus, the 
splenium appears to stabilize by age 15, while the uncinate fasciculus 
appears still to be changing at age 30 [49]. Therefore, development 
of the splenium would be complete by age 20 and the difference in 
FA would then be enduring. Some prospective longitudinal data in 
young people with first-degree relatives with BD suggest no catch- 
up of the decreased FA when first seen, over the next 2 years [50]. 
This finding clearly complements, and must partly explain, the find- 
ings of reduced FA in mature patients. Studies of twins and siblings 
suggest that FA in multiple cortical regions is under genetic control 
[51]. Therefore, diffuse FA abnormalities in white matter appear to 
be a neurodevelopmental marker of vulnerability to MDD and other 
psychiatric disorders, including BD (see Chapter 71), ADHD (see 
Chapter 35), schizophrenia (see Chapter 60), and impulse control 
disorders (see Chapter 123). How alterations in white matter micro- 
structure can be such a general marker of neurodevelopmental ab- 
normality is not established. White matter investigation using more 
advanced MR methods will clearly be increasingly possible. 

Given the findings in young people at risk of MDD or with early- 
onset psychiatric disorder, it is unsurprising that mature patients 
show the same pattern of diffuse reductions in white matter FA. It 
could be an enduring mark of their early vulnerability. A prelim- 
inary description of DTI data from the pooled ENIGMA MDD co- 
hort showed the largest differences in white matter between cases 
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and controls in the body and genu of the CC [52]. The effect sizes for 
different areas varied up to a maximum of -0.34 in the body of the 
CC. Such effects for the white matter microstructure are stronger 
than those reported in their similar meta-analysis of subcortical 
grey matter volume in MDD described previously. The sample size 
is still modest, and moderating factors remain to be studied in larger 
samples. 

In an early study of patients with onset of MDD over 60 years, FA 
reductions were marked, even when grey matter volume changes 
were undetectable [53]. In addition, these patients performed worse 
than appropriate controls in tests of executive function, processing 
speed, episodic memory, and language. There were meaningful 
correlations between FA in individual brain areas and cognitive 
impairments: reduced FA of the anterior thalamic radiation and un- 
cinate fasciculus with executive function; the genu of the CC with 
processing speed and anterior thalamic radiation; and the genu 
and body of the CC and the fornix with episodic memory. While 
exploratory, these findings supported the internal validity of the 
findings in a small study. Moreover, they predicted a closer link be- 
tween acquired or progressive white matter disruption and cognitive 
deficits than for the superficially similar reductions seen in younger 
patients with MDD. 

For the late-onset group, acquired white matter disruption may be 
a particular risk factor. This is supported firstly by the observation 
that DTI changes are more striking in late-onset patients, compared 
with age-matched early-onset patients [54]. Secondly, there was an 
association between subthreshold depressive symptoms and FA re- 
ductions in an intensively studied cohort of now ex-civil servants 
aged almost 70 years (effect size -0.2) [55]. This group gave no his- 
tory of depressive episodes. In this study, there was no association 
with grey matter volume, and the sample size was 350. However, 
only 10% of the total sample had excess depressive symptoms. The 
anatomy of the largest effects was diffuse, involving particularly the 
CC and the inferior and superior longitudinal fasciculi. Vascular 
risk factors are believed to contribute to the development of DTI 
changes in the elderly. Scores for vascular risk (Framingham Stroke 
Risk Profile) correlated with DTI measures in late-life MDD [56]. 

Reduced FA has been a highly consistent finding in studies of 
more heterogenous groups of older patients. In a review of nine such 
studies, the dorsolateral prefrontal cortex and uncinate fasciculus 
in patients with MDD had lower FA than controls (effect sizes 0.7 
and 0.23, respectively), although heterogeneity and publication bias 
were evident for the dorsolateral prefrontal cortex [57]. 


White matter integrity, neurodegeneration, and 
cerebral lesions 


White matter abnormalities were originally described in mood 
disorder patients in the earliest studies with CT and MR imaging 





Fig. 76.2 Continued 
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between which resting state correlations were most closely related to the specific factor. (e) The strength of these two factors was estimated for each 
patient, and then an unsupervised classification procedure—hierarchical clustering—was used to try and find clusters of patients who differed on the 
wo measures. (f) Four clusters of patients were found, with each cluster defined by a different combination of the two factors. The scatter plot shows 
data from patients colour-coded by the cluster to which they are assigned; grey dots represent patients who could not be confidently classified. 

The authors next demonstrated that these clusters could be reliably detected in a second data set and that cluster membership could be assigned using 
just the neuroimaging data. Lastly, it was shown that the clusters predicted treatment response to transcranial magnetic stimulation (TMS) therapy—a 
greater proportion of patients in clusters 1 and 3 responded to TMS than those in clusters 2 and 4. 

Reproduced from Nat Med, 23(1), Drysdale AT, Grosenick L, Downar J, et al., Resting-state connectivity biomarkers define neurophysiological subtypes of depression, pp. 23- 
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as ‘white matter hyperintensities’ [58, 59] and are most often seen 
in older patients [60] (see Chapter 66). They are clearly correlated 
with vascular disease [53]. They do not, in themselves, explain any 
of the findings already described for older patient samples, because 
white matter changes appear to be so diffuse. Notwithstanding the 
foregoing emphasis on neurodevelopmental abnormality in young 
people and stable FA reductions in mature patients with MDD, white 
matter disruption appears also to be a manifestation of progressive 
brain changes in ageing. Correlates of microstructure abnormality 
with cognitive function are more evident in a healthy ageing popu- 
lation; progressive reduction in FA was seen with age and correlated 
selectively with working memory among several tests of executive 
function and processing speed [61]. 


Brain diseases or injury and depression 





Post-stroke depression (PSD) is a common complication of re- 
habilitation after stroke. The location of brain injury predisposing 
to depression is of obvious interest, although not without contro- 
versy [62,63]. A recent systematic meta-analysis aimed to determine 
the mapping between PSD and lesion location. Forty-three studies 
involving 5507 patients suffering from stroke were included [64]. 
Only studies with 1-6 months post-stroke group showed a statistical 
association between right hemisphere stroke and risk of depression 
(OR 0.79, 95% CI 0.66-0.93). This contrasts with earlier claims that 
lesions of the left hemisphere were associated with an increased risk 
of depression after stroke [63]. 

Rather differently, mood and anxiety disorders may be presenting 
symptoms of brain tumours or other brain disease. Imaging can be 
highly relevant to neurological diagnosis. Depressive illness may be 
the only complaint in the presentation of meningioma for as many as 
20% of the cases occurring in the fifth decade of life. As many as 75% 
of some series of patients with multiple sclerosis experience a delay 
in multiple sclerosis diagnosis due to symptoms of MDD. A missed 
or delayed diagnosis is clinically significant if it postpones effective 
intervention, increases the burden of disease/disability, and reduces 
the quality of life. Rather similarly, major depression may precede 
the diagnosis of Parkinson's disease in as many as 37% of younger, 
especially female, patients presenting with motor signs [65]. 

Traumatic brain injury (TBI) is receiving increasing attention 
because of the impact on survivors of head trauma in civilian and 
military life and in contact sports. In a review of almost 100 publi- 
cations, 27% of cases of TBI were diagnosed with MDD/dysthymia 
using formal criteria and 38% reported clinically significant depres- 
sion with self-report scales, which makes it a common complication 
of TBI [66]. Interestingly, DTI investigations of white matter integ- 
rity can effectively differentiate patients with TBI from controls [67]. 
There also appears to be a relationship between DTI measures and 
TBI outcomes. However, a specific relationship with depression in 
this patient group, perhaps mediated by white matter damage, has 
not yet been described, even if it can be anticipated from what has 
been foregoing in this chapter. 

As with other MR measures, DTI may prove useful in patient 
stratification and prediction of treatment response. Lower FA has 
been related to treatment resistance [68] and reduced resilience to 
MDD after early-life adversity [69]. 


Isotope-based imaging 


Isoptope-based imaging (PET and SPET or SPECT) has assumed 
marginal importance in recent years. PET and SPECT provide the 
initial evidence, based on cerebral blood flow, for functional abnor- 
mality in the brain in depression [70, 71]. These methods remain 
the only way in which specific binding to brain receptors can be 
measured in the living human. We do not propose to review here 
the literature relating to the 5-HT1 receptor [72], the D2 receptor 
[73], or the serotonin transporter [74], all of which are relevant to 
depression. 

The findings in isotope imaging studies are limited by the costs 
of such studies (which result in small samples) and the complicated 
context for their interpretation (provided by genotype, develop- 
ment, early experience, previous exposure to medication or drug/ 
alcohol use, data from post-mortem studies, and our understanding 
of animal experiments on the same receptors). 


Conclusions 


MR measures have an obvious direct appeal for neuroscience, and 
the biology of MDD has proved no exception. To date, definitive 
advances that change practice have not been made. As a marker of 
neurodevelopmental abnormality, the white matter may prove to 
be an important tissue for understanding behavioural pathology; 
as such, it provides a candidate target for understanding gene and 
gene x environment interactions on the one hand and impairment 
of emotional control and cognition on the other. 
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‘But one need not sound the false or inspirational note to stress the 
truth that depression is not the soul’s annihilation; men and women 
who have recovered from the disease—and they are countless— 
bear witness to probably what is probably its only saving grace: it is 
conquerable? [1]. William Styron here elegantly expresses the no- 
tion that the prognosis of most depressive episodes is eventually fa- 
vourable, and indeed much depression in the community resolves 
without specific treatment. However, depression can be persistent 
and disabling, and in these circumstances, the use of effective ther- 
apies is appropriate and important. 

This chapter will first examine evidence for the efficacy of the wide 
range of treatments employed in depression. The subsequent section 
(see Management of depressive disorders, p. 810) will deal with the 
way in which these treatments are best integrated into specific clin- 
ical management plans. It should be noted that controlled trials of 
treatment usually involve patients with a depressive episode diag- 
nosed according to DSM or ICD criteria and in the moderate range 
of clinical severity. There are less controlled data available about the 
treatment of milder depressions or those that are particularly severe. 
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nent of acute major depression 


The first widely used drugs for depression were tricyclic antidepres- 
sants (TCAs) such as imipramine and amitriptyline. Their mech- 
anism of action in depression was eventually shown to be due to 
blockade of the reuptake of noradrenaline (NA) and serotonin [5- 
hydroxytryptamine (5-HT)] at nerve terminals, thus prolonging the 
action of these monoamines in the synapse. The majority of drugs 
developed subsequently for depression share this therapeutic mech- 
anism [2] (Table 77.1). Despite their efficacy, TCAs have significant 
safety problems, particularly potential lethality in overdose through 
cardiotoxicity. In addition, their anticholinergic properties can 





make tolerance of a therapeutic dose challenging and, for example, 
place older male patients at risk of urinary retention [2]. 

Newer drugs, in particular selective serotonin reuptake inhibitors 
(SSRIs), have replaced TCAs as first line [3]. SSRIs are better toler- 
ated and much safer in overdose than TCAs, but commonly cause 
gastrointestinal and CNS adverse effects (Box 77.1). Serotonin and 
noradrenaline reuptake inhibitors (SNRIs), such as venlafaxine and 
duloxetine, are primarily 5-HT reuptake blockers but also have some 
modest effects to inhibit the reuptake of NA. Their side effect pro- 
file is similar to that of SSRIs, but venlafaxine appears less safe in 
overdose [2]. A number of other antidepressant drugs are available, 
including mirtazapine, which has antagonist properties at 5-HT,,, 
and NA a,-adrenoceptors, and trazodone, which is a weak 5-HT re- 
uptake inhibitor and is metabolized to a 5-HT receptor agonist. Both 
latter drugs have a sedating profile [2]. Agomelatine is licensed for 
depression in Europe; its main action is stimulation of melatonin re- 
ceptors, but it may also antagonize 5-HT,c receptors [4]. 

Other recent developments have led to drugs that block 5-HT 
reuptake, while having additional effects on a variety of 5-HT re- 
ceptor subtypes. For example, vilazodone has partial agonist activity 
at the 5-HT,, receptor, while vortioxetine binds to several other 
5-HT receptor subtypes (5-HTisasno 5-HT;, 5-HT,). Whether 
these agents have advantages over SSRI treatment is not fully clear, 
though vilazodone is claimed to produce less sexual dysfunction and 
vortioxetine to have particular benefits in depression-related cogni- 
tive impairment [5, 6]. 

All the drugs listed in Table 77.1 are efficacious in the acute treat- 
ment of major depression, with the largest effects, relative to placebo, 
being seen in patients with major depression whose symptoms are 
of at least moderate severity. Short-term response rates in controlled 
trials are about 50% for patients on active treatment, and about 
30% for those on placebo, with the number needed to treat (NNT) 
being between 5 and 6 [3]. Several meta-analyses have assessed the 
differences in efficacy and general tolerability between the newer 
agents. There are suggestions that, among the SSRIs, sertraline and 
escitalopram are the most effective, while venlafaxine is slightly 
more efficacious than SSRIs, though less well tolerated [7]. Whether 
these differences are clinically important has been disputed [8]. In 
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Box 77.1 Some side effects of SSRIs 


Gastrointestinal Common: nausea, appetite loss, dry mouth, 
diarrhoea, constipation, dyspepsia 
Uncommon: vomiting, weight loss 


Central nervous Common: headache, insomnia, dizziness, 





system anxiety, agitation, fatigue, tremor, somnolence 
Uncommon: extra-pyramidal reaction, seizures, 
mania 

Other Common: sweating, delayed orgasm, 


anorgasmia 


Uncommon: rash, bleeding, pharyngitis, 
dyspnoea, serum sickness, hyponatraemia, 
alopecia 


Drug interactions NSAIDs, aspirin, anticoagulants (bleeding), 
ithium, triptans, MAOIs (including selegiline), 
ryptophan, linezolid, pethidine, St John’s Wort 
5-HT toxicity). Some SSRIs (fluoxetine, fluvox- 
amine, and paroxetine) strongly inhibit hepatic 
metabolizing enzymes and increase levels of 
other drugs; numerous examples reported, 
including antipsychotic drugs, anticonvulsants, 
and antiarrhythmics 











Adapted from Cowen P, Harrison P, Burns T, Shorter Oxford Textbook of Psychiatry, Sixth 
Edition, Copyright (2012), with permission from Oxford University Press. 











some meta-analyses, the selective NA reuptake inhibitor reboxetine 
performs poorly in terms of tolerability and efficacy [7]. 
Monoamine oxidase inhibitors (MAOIs) were among the first 
drugs discovered that treat depression, but they are little used now- 
adays because of their liability to cause dangerous reactions with 
other drugs and tyramine-containing foods. However, controlled 
trials have shown that MAOIs can be effective in depressed patients 
who have not responded to numerous other pharmacological and 
non-pharmacological treatments, and for this reason, they retain 
some specialist use [9]. The reversible type A MAOI moclobemide 
has the advantage of not requiring adherence to a special diet. 
However, it can still produce hazardous interactions with other 


Table 77.1 Pharmacological properties of some antidepressants 


drugs, and it is doubtful whether moclobemide at standard doses 
is as effective as conventional MAOIs for patients with resistant 
depression. 


Other depressive conditions 


Drugs for major depression, in particular SSRIs, have been shown 
to be effective in dysthymia, with an NNT of 3-4 [10]. This sug- 
gests that medication can be effective in milder depressive condi- 
tions of lengthy duration. However, for milder depressions of more 
recent onset, antidepressant medication does not seem specifically 
useful [11]. 


Longer-term treatment 


By convention, the term ‘relapse’ is used to describe the worsening 
of depressive symptoms after an initial improvement, following the 
acute treatment of a depressive episode. Recurrence, on the other 
hand, refers to a new episode of depression that develops after a 
sustained period (of at least several months) of complete recovery. 
Treatment to prevent relapse is called ‘continuation treatment; while 
that employed to prevent recurrence is known as ‘prophylactic’ or 
‘maintenance’ therapy [3]. 

It is well established that stopping antidepressant treatment soon 
after an acute treatment response is associated with a high risk of re- 
lapse. Placebo-controlled studies indicated that continuing effective 
acute phase treatment for 6 months halves relapse rates [3]. For pa- 
tients at high risk of recurrence (for example, those with three or 
more previous depressive episodes), extending antidepressant medi- 
cation after this period is highly effective in lowering recurrence 
rates. A systematic review found that prescription of maintenance 
medication, relative to placebo, diminished the recurrence rate over 
the next 1-2 years from 40% to 18% (NNT = 4.5) [12]. Maintaining 
the dose at the level which was required to achieve acute symptom- 
atic remission, if tolerability permits, appears the most effective 
strategy. 


Electroconvulsive therapy 


Controlled trials indicate that electroconvulsive therapy (ECT) 
is more effective than simulated ECT (anaesthesia with electrode 


[Drug 5-HT reuptake NA reuptake Anticholinergic Sedating Overdose toxicity Sexual dysfunction Weight gain 
+ ++ +++ +++ +++ + +++ 





Amitriptyline + 
Lofepramine + +44 ca 
Fluoxetine apt 0 0 
Paroxetine ait 0 + 
Venlafaxine +++ tp 0 
Reboxetine 0 +++ + 
Bupropion 0 + 0 
irtazapine 0 0° 0 

Trazodone + 0 0 

0, minimal effect; +, some effect; ++, moderate effect: +++, marked effect. 

1 Indirect anticholinergic effects through NA potentiation. 

? Also blocks dopamine reuptake to some extent. 

3 Increases NA through a,-adrenoceptor blockade. 

“ Rarely causes priapism. 
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application, but no passage of current) in patients with major de- 
pression. The overall response rate is about 70% for ECT, and 40% 
for simulated treatment (NNT = 3.5) [13]. Randomized trials also 
suggest that ECT is slightly more effective and more rapid-acting 
than antidepressant medication, at least in the short term [13]. 
Generally, ECT is employed in patients who have not responded to 
other antidepressant therapies, and it may often prove effective in 
these circumstances; however, relapse rates in the months following 
treatment are high [14, 15]. 

ECT is generally a safe treatment. However, there is concern about 
its deleterious effects on memories for more remote personal events 
(autobiographical memory loss). While objective measures suggest 
that this loss largely resolves within 6 months, subjective reports by 
patients indicate that the loss can extend for longer and might some- 
times be permanent [2]. 


Other neurostimulatory treatments 


A number of other neurostimulatory treatments have been tested 
in depressed patients, often where conventional therapies have not 
been helpful. Deep brain stimulation is discussed in Chapter 20. 
Other neurostimulatory approaches include vagal nerve stimulation 
(VNS) and transcranial magnetic stimulation (TMS), both of which 
have been approved for the treatment of depression in the United 
States. TMS has also been approved in several other countries for the 
management of resistant depression. 

VNS has a slow onset of antidepressant action, over many months, 
which means that long-term follow-up studies are needed to assess 
its effectiveness. A meta-analysis of six outpatient multi-centre trials 
involving patients with refractory depression showed higher re- 
sponse rates in the VNS group at 96 weeks than a matched group 
of patients receiving standard treatment (32% vs 14%, respectively; 
NNT = 5.5). However, most of the studies were not randomized [16]. 

TMS uses a powerful magnetic field to produce a current flow 
in neural tissue. The use of appropriately shaped coils allows local- 
ization of stimulation to the main cortical regions. TMS has been 
used successfully to relieve depressive states, but there are still un- 
certainties about the best coil placement and stimulus parameters. 
The usual mode of administration is to administer stimulation to 
the left prefrontal cortex, with treatments being given 5 days a week 
for 2 weeks. 

In a meta-analysis of trials involving about 1300 depressed pa- 
tients randomized to either active or sham TMS, there was a greater 
clinical response in patients receiving active treatment (36% vs 15%, 
respectively; NNT = 9) [17]. However TMS appeared less effective 
than ECT in producing remission in severely depressed patients 
(34% vs 52%, respectively; NNT = 5.5) [18]. Nevertheless, TMS 
would be expected to be relatively free of the personal memory 
deficits induced in some patients by ECT. 


Bright light treatment 


The use of phototherapy, or artificial bright light, as a treatment for 
depression was first studied systematically by Rosenthal and col- 
leagues who used morning bright light to treat patients with the 
newly identified syndrome of ‘seasonal affective disorder’ [19]. Since 
then, phototherapy has become the mainstay of treatment of ‘winter 
depression, particularly in patients whose clinical features include 
hyperphagia, hypersomnia, and diminished energy. 


A meta-analysis of eight randomized studies found that morning 
bright light was significantly superior to a dim light control in re- 
ducing symptomatology in patients with winter depression, with an 
effect size of about 0.5 [20]. In winter depression, the therapeutic 
effects of bright light, fluoxetine, and cognitive behavioural therapy 
(CBT) appear generally equivalent, although cognitive therapy may 
have a better long-term outcome [3]. 


Psychotherapy 
Treatment of acute depression 


Individual CBT is the most studied psychological treatment in the 
management of acute depression, and there is strong evidence for its 
efficacy. Not surprisingly, the effect size of CBT is greatest in studies 
that compare it to a wait-list’ control or ‘treatment as usual’ (TAU). 
Direct comparisons of CBT to other structured psychological treat- 
ments, such as interpersonal psychotherapy (IPT) and behavioural 
activation, tend not to show important differences in outcome. 
Similarly, most studies show similar efficacy when CBT is compared 
to antidepressant medication in the treatment of acute depression. 
However, the combination of CBT with antidepressant medication 
is probably better than drug treatment alone [21, 22]. 

Many psychiatrists do not consider that CBT is effective for pa- 
tients with severe depression, but this view does not have support 
from randomized trials. However, defining ‘severity’ by means of a 
score on a rating scale may not capture specific factors that could 
compromise the efficacy of psychological treatment such as signifi- 
cant cognitive impairment and psychomotor retardation. Also the 
expertise of the therapist and their adherence to treatment protocols 
significantly influences outcome in CBT trials [23]. 

Systematic reviews indicate that other individual structured psy- 
chological treatments, such as IPT, behavioural activation, and 
problem-solving therapy, are effective in treating depression, in com- 
parison to both placebo treatment and TAU [22]. However, the effi- 
cacy of short-term psychodynamic therapy is less certain [24]. The 
utility of behavioural activation and problem-solving therapy is im- 
portant because such therapies require less therapist training and 
are easier to implement than CBT. Because of the high prevalence of 
depression in primary care and the preference of many patients for 
psychological treatments, there has been much interest in developing 
ways of increasing the availability of psychological therapies. 
Approaches range from group CBT formats to self-help booklets and 
therapy delivered via the Internet. Generally, in mild to moderate de- 
pression, all these treatments have some efficacy, relative to wait-list 
control, and there are no clear-cut differences between them [25]. 

This is also a convenient place to discuss the efficacy of exercise 
in treating depression. While there has been some controversy, the 
more recent systematic reviews suggest large effects of supervised, 
moderate-intensity aerobic exercise to decrease depressive symp- 
tomatology, compared to no treatment, and a moderate effect in 
comparison to TAU [26]. Again such treatment is likely to be most 
beneficial for patients with mild to moderate depression. 


Other depressive conditions 


Where depression has persisted for a considerable period, and par- 
ticularly in dysthymia, CBT appears less effective than antidepres- 
sant medication [27]. There are few trials of other psychological 
treatments in chronic depression, but psychodynamic therapy 
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may be superior to TAU in patients with long-term resistant 
depression [28]. 


Continuation and maintenance treatment 


There is evidence that in patients randomized to CBT during the 
acute phase of depression, there is sustained improvement in depres- 
sive symptomatology over the next few months, even after the end of 
treatment. The risk of relapse after CBT is significantly less than that 
of patients withdrawn from pharmacotherapy (NNT = 5). There is 
also good evidence that CBT given in the continuation phase lowers 
relapse rates, compared to CBT given in the acute phase only (10% 
vs 31%, respectively; NNT = 5) [29]. 

Mindfulness-based cognitive therapy (MBCT) integrates CBT 
with meditation techniques designed to lower stress by facilitating 
acceptance and self-compassion. There is evidence that MBCT 
lowers the risk of recurrence in patients with a history of frequent 
episodes of depression and is as effective as maintenance antidepres- 
sant treatment in this respect [29, 30]. However, reliable identifica- 
tion of the specific elements of treatment and patient characteristics 
that may be linked to therapeutic response has proved difficult. 

Combining IPT with medication in the treatment of acute epi- 
sodes decreases relapse rates over the following 12 months [29]. In 
patients who achieve remission from depression, with IPT as sole 
treatment, maintenance therapy is also helpful in preventing recur- 
rence. In a 3-year follow-up of elderly depressed patients, the com- 
bination of nortriptyline and IPT was more effective in preventing 
relapse (20% relapse rate) than nortriptyline alone (43% relapse 
rate), which itself was more effective than IPT alone (64% relapse 
rate) [31]. 


Management of depressive disorders 


Assessment 


The key aspects of the assessment of depression are summarized in 
Box 77.2. The critical factors that facilitate assessment are careful 
history taking and mental state examination. These also provide an 
invaluable opportunity to build a rapport with the patient, which 
is essential if constructive management plans are to be mutually 
agreed. The availability of a close friend or relative, able to provide 
collateral information, can be very helpful. 

In the history, particular attention should be paid to previous epi- 
sodes of mood disturbance, which can offer important clues to the 
likely prognosis of the present disorder. A careful search for symp- 
toms of hypomania and mania should be made, as the presence of 
bipolar disorder indicates that a different kind of management of 


Box 77.2 Assessment of a depressive condition 


e Is the diagnosis major depression (DSM-5) or depressive episode 
(ICD-10)? 

e How severe is this episode, and what is its duration? 

e What is the past history of mood disturbance? 

e What is the risk of suicide? 

e What are the important causal factors? 

e What are the patient's social resources? 

e How is the depression affecting others? 











the depressive episode is likely to be necessary (see Chapter 72). The 
severity of the depressive disorder will have important implications 
for the kind of treatment that might be suggested first. Here, it is 
necessary to gauge both the intensity and the range of current symp- 
toms, as well their duration. Identification of melancholic depres- 
sion or depressive psychosis will also influence treatment selection. 
The risk of suicide should be assessed in every patient presenting 
with depression. 

Coming to an understanding with the patient about the aetiology 
of the depressive episode is important in management. As pointed 
out by Aubrey Lewis [32], in most patients, the depressive episode 
appears to be the result of a combination of personal predisposition 
and current life difficulties, though, with the latter, it is important to 
try to establish which problems have arisen as a result of the depres- 
sive condition—often a difficult task. It is important to remember 
that depressive symptoms may be secondary to another psychi- 
atric disorder or an underlying general medical condition—or its 
treatment. 

Another important part of the assessment is learning about the 
patient’s current social resources, for example how well they are sup- 
ported in their home and work life and whether they have an under- 
standing confidant. It is also important to consider the effects of the 
depressive condition on other people, both those close to the patient, 
such as dependent children, or others who might be affected by the 
patient's occupation, for example coach drivers or medical workers. 


Treatment setting 


The majority of patients with depression are treated in primary care, 
and many continue to work and carry out family and social activ- 
ities, albeit with reduced efficiency and enjoyment. Patients with 
more severe depressive disorders are often quite unable to function 
and may be in states of considerable anguish and distress, with sig- 
nificant suicidal ideation. An early treatment consideration there- 
fore is how far patients need additional support and what form this 
should take. A few patients, perhaps because of poor self-care and 
worrying levels of suicidal thinking, will require inpatient admis- 
sion. However, if sufficient family support is available, even very ill 
patients may be managed at home, in conjunction with frequent 
home visiting and monitoring by a community psychiatric team. 
The next question is whether the patient should continue to work. 
If the disorder is of mild to moderate severity, work can provide a 
valuable distraction from depressive thoughts and be a source of 
companionship. If the disorder is more severe, slowness, poor con- 
centration, and low energy are likely to impair work performance, 
which can increase feelings of hopelessness and may pose risks to 
others. If patients are not engaged in work, it is important to plan 
day-to-day activities because depressed patients often give up activ- 
ities and withdraw from other people. As a result, they become de- 
prived of social stimulation and potentially rewarding experiences, 
which increases feelings of depression. It is therefore important to 
make sure that the patient is occupied adequately, although they 
should not be pushed into activities which will be too demanding. 
The appropriate kind of psychological treatment should also 
be decided in each case. All depressed patients require support, 
encouragement, and an explanation that they are suffering from an 
illness, and not some form of moral or other personal failure. Similar 
counselling of partners and other family members is often useful. It 
is also important to try and tackle any psychosocial problems that 
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Box 77.3 Stepped-care management of depression 


1. Patients with short-lived mild depression who may recover quickly 
without treatment should be offered an early review (‘active 
monitoring’). 

2. Antidepressants are not recommended for the treatment of mild 
depression 

3. Patients with persistent mild depression should be recommended 
a guided self-help programme, based on cognitive behavioural 
therapy. Group cognitive behavioural therapy is an alternative. An ex- 
ercise programme can also be recommended. 

4. For patients with persistent mild depressive symptoms who do not 
respond to these measures, consider drug treatment with an SSRI or 
higher-intensity psychological treatment. 

5. Patients who present with moderate or severe depression should be 
treated with a combination of antidepressant medication and a high- 
intensity psychological intervention. 

6. Patients who respond to antidepressant medication should continue 
treatment for at least 6 months. Patients at high risk of relapse should 
be advised to continue antidepressant treatment for 2 years. 

7. Consider cognitive behavioural therapy for patients who have re- 
lapsed despite antidepressant treatment, or mindfulness-based cog- 
nitive therapy for patients who are well but who have experienced 
three or more previous episodes of depression. 

Source: data from National Institute for Health and Care Excellence, CG90 Depression 


in adults: recognition and management, Copyright (2009; updated 2018), National 
Institute for Health and Care Excellence. 









































are making an important contribution to the depression. For ex- 
ample, couple therapy can be a helpful addition in depressed patients 
where problems with a partner are playing a role in maintaining the 
disorder. Other more specific psychological treatments will be indi- 
cated for some (see Box 77.3). 


The stepped-care approach 


The majority of patients with depression present in primary care 
with mild to moderate symptoms. It is generally agreed that de- 
pressive symptomatology in the community follows a dimensional 
distribution, with no clear cut-off between the mild and transient 
depressive feelings common in everyday life and the more severe 
depressive states that are an indication for some form of medical 


Table 77.2 SSRIs available to treat depression 


intervention. In practice, the latter judgement is made on the basis 
of the number and severity of clinical symptoms, the subjective level 
of distress, the level of functional impairment, and the length of time 
for which the symptoms have persisted [33]. 

Current guidance from the National Institute for Health and 
Care Excellence (NICE) advises a stepped-care approach, so that 
depressed patients are treated with the least intrusive approaches 
first, with more intensive treatments being introduced where im- 
provement does not occur [25, 34] (Box 77.2). Patients with milder 
symptoms can be helped by the less intensive approaches described 
earlier, including self-help treatment, group CBT, or exercise. It 
should be noted, however, that in people with a history of recur- 
rent depressive episodes, even mild symptoms may be a sign of an 
impending relapse and more active treatment may be indicated. 

For patients with moderate levels of depression or greater, NICE 
recommends a combination of high-intensity psychological treat- 
ment (which generally would be individual CBT) and antidepressant 
medication. This treatment is suggested because systematic reviews 
indicate that a combination of CBT and medication is superior to 
medication given alone. However, in practice, it may be difficult 
to access expert CBT in a timely way, in which case starting treat- 
ment with antidepressant medication is appropriate. If, however, 
individual CBT or IPT are available from a suitably experienced 
therapist, it is reasonable to use psychotherapy as sole first-line treat- 
ment in moderate depression, if the patient prefers. 


Antidepressant medication 


Antidepressant medication can be considered for most patients with 
a major depressive syndrome of at least moderate severity, and par- 
ticularly those with features of melancholic depression in DSM-5 or 
somatic depression in ICD-10. Dysthymia is also an indication for 
antidepressant medication [10, 27]. 

Most guidelines recommend initial treatment with an SSRI [3, 34]. 
Several preparations are available, and the differences between them 
relate largely to the pharmacokinetic profile and the risk of drug 
interaction (Table 77.2). There have been concerns that early in 
treatment, SSRI use may be associated with an increased risk of hos- 
tile and suicidal behaviour, but meta-analyses of placebo-controlled 
trials in adults suggest no increase in fatal suicide outcomes. SSRIs 
may be associated with a small increased risk of non-fatal self-harm, 


Drug Half-life Withdrawal syndrome CYP inhibition’ Cardiotoxicity 
(daily dose range) a 


Citalopram 

(20-40mg) 

Escitalopram 30 + 
(10-20mg) 

Fluoxetine 90 (plus long-acting 0 
(20-60mg) metabolite) 

Fluvoxamine 20 +++ 
(100-300mg) 

Paroxetine 24 + 
(20-50mg) 

Sertraline 26 ++ 
(50-200mg) 


1 Hepatic metabolizing enzymes. 


Prolongation of QTc at higher doses 


0 Prolongation of QTc at higher doses 
2D6/2C9 0 
1A2/2C9/2C19/3A4 0 
2D6 0 
2D6 (modest) 0 
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but the number need to harm (759) is much greater than the NNT 
(about 5) [35]. However, the first few weeks in the clinical manage- 
ment of depression do seem to be a time of increased risk of sui- 
cidal behaviour across a range of treatment modalities [36]. Careful 
and frequent monitoring of patients over this period is therefore 
prudent. 

Patients should be warned about the delayed onset of action of 
SSRI, as well likely side effects, including nausea and some restless- 
ness during sleep—a forewarned patient is more likely to continue 
with medication. A number of patients become more anxious and 
agitated early during SSRI treatment; therefore, it is important to 
explain that such effects are sometimes experienced during treat- 
ment but do not mean that the underlying depression is worsening. 
If the patient persists with treatment, such anxiety and agitation usu- 
ally diminish, but short-term treatment with a benzodizapine may 
be helpful, particularly if sleep disturbance is a problem. Patients 
should be reviewed frequently during the first few weeks of treat- 
ment, when support and advice are helpful both to maintain morale 
and to ensure compliance with medication. 

If the use of SSRI medication is not suitable or not well tolerated 
(Box 77.1), a number of other options are available. Mirtazapine 
has a sedating profile. Less sedating alternatives include the NA 
and dopamine reuptake inhibitor bupropion, which is licensed for 
treating major depression in the United States, but not in Europe. 
In Europe, it is possible to use lofepramine, a tricyclic drug which is 
predominantly an NA reuptake inhibitor and appears relatively safe 
in overdose [2]. 


Special situations 
Depression in pregnancy 


The management of depression in pregnancy involves careful assess- 
ment, together with the provision of accurate information about the 
risks of depression to the mother and baby and the risks and benefits 
of different forms of treatment. If a woman has a history of recurrent 
moderate to severe depression and has been effectively maintained 
on antidepressant medication, it is generally better to continue this 
treatment during pregnancy because the risk of relapse is high if 
medication is withdrawn and there are risks to the fetus of untreated 
maternal depression. However, women with a history of less severe 
depression can often withdraw antidepressant medication success- 
fully [37, 38]. 

Women not taking antidepressant medication who become de- 
pressed during pregnancy should be provided with psychoeducation 
and psychological treatment in a stepped-care manner, depending 
on the severity of their condition (Box 77.3). If the depression is se- 
vere, or if a moderate depressive episode fails to respond to psycho- 
logical management, antidepressant medication should be offered, 
usually an SSRI [38]. Generally, the risks of SSRI treatment in preg- 
nancy are low and are outweighed by those of untreated depression. 
However, self-limiting neonatal withdrawal syndromes have been 
described, although the incidence across studies varies widely. SSRI 
use in later pregnancy may also be associated with a small increase 
(1-2 per 1000 live births) in the risk of pulmonary hypertension of 
the newborn, a potentially fatal condition [39]. Paroxetine has been 
associated with cardiac defects and should therefore be avoided in 
pregnancy; other SSRIs appear to be safe in this respect [38]. There 
may also be an increased risk of autism in offspring born to mothers 


taking SSRIs in pregnancy, but whether this is a causal effect of SSRI 
treatment has yet to be determined. 

Antidepressant drugs should also be prescribed cautiously to 
women who are breastfeeding. The amount of drug that enters breast 
milk varies according to the individual agent. In terms of SSRI treat- 
ment, sertraline is present in milk in only small amounts and breast- 
feeding may continue while the baby is observed for effects such as 
sedation and feeding difficulties [2]. 


Depression in childhood and adolescence 


For children and young people with persistent depressive disorders 
that have not responded to simple supportive measures, psycho- 
logical treatment is regarded as the first-line approach. A system- 
atic review found that both CBT and IPT were superior to wait-list 
control, whereas psychodynamic therapy was not. IPT and CBT 
were also superior to problem-solving therapy, but IPT had fewer 
all-cause discontinuations than CBT [40]. 

Antidepressant medication seems less effective in child and ado- 
lescent depression than in adults, and there is greater evidence 
for an increased risk of self-harm with SSRIs. A network meta- 
analysis of several drugs, including TCAs, SSRIs, venlafaxine, and 
mirtazapine, found that only fluoxetine was reliably better than 
placebo [41]. Another meta-analysis of 27 placebo-controlled trials 
of SSRI therapy in young people found no completed suicides, but 
a small significant increase in suicidal ideation and self-harm at- 
tempts in association with SSRI treatment (number needed to 
harm = 143) [42]. 

There have been few direct comparisons of medication and psy- 
chological treatment in adolescent depression, but the Treatment 
for Adolescents with Depression Study (TADS), sponsored by the 
United States National Institute for Mental Health (NIMH), found 
that after 12 weeks of treatment, response rates for pill placebo (34%) 
and CBT (43%) were significantly less than for fluoxetine alone 
(61%) and the combination of CBT and fluoxetine (71%). By 18 
weeks, the response to CBT was similar to that found with fluoxetine 
alone, while the response to combination treatment was still su- 
perior. Fluoxetine treatment alone was associated with an increased 
risk of suicidal ideation and self-harm relative to placebo; however, 
this increase was not seen in patients in whom fluoxetine was com- 
bined with CBT [43]. TADS suggests that the best current treatment 
for moderate to severe depression in adolescence is a combination 
of fluoxetine and CBT. 


Psychotic depression 


Determining when the negative thinking of a severely depressed 
patient has reached delusional intensity is not easy. However, rec- 
ognizing patients suffering from psychotic depression is important 
because antidepressant medication is unlikely to be effective as sole 
treatment. There is a long-held clinical view that pharmacological 
treatment of depressive psychosis requires combination therapy 
with antidepressant medication and a dopamine antagonist drug— 
the latter given at antipsychotic doses. A meta-analysis of nine trials 
confirmed that such combination treatment was superior to either 
treatment alone (NNT = 7 vs antidepressant monotherapy; NNT = 5 
vs antipsychotic monotherapy) [44]. It is also worth noting that 
TCAs may be more effective than the newer more pharmacologic- 
ally selective drugs in the treatment of depressive psychosis [45]. 
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Use of electroconvulsive therapy 


ECT will, very rarely, be part of the first-line treatment of depression 
and, in such circumstances, will usually be considered only where 
there is a need to bring about improvement as rapidly as possible. 
In practice, this applies mainly to patients who are unable to drink 
enough fluid to maintain an adequate output of urine, including 
the rare cases of depressive stupor. In depressive psychosis, ECT is 
considerably more effective than an antidepressant given alone, but 
probably about the same therapeutic effect can be achieved, albeit 
more slowly, if a combination of an antidepressant drug and an anti- 
psychotic drug is used [46]. 

The main use of ECT is in severely depressed patients who have 
failed to respond to other antidepressant treatments. In these cir- 
cumstances, a response rate of at least 50% can be expected, though 
the relapse rate over the next 6 months is high, emphasizing the 
importance of continuation of pharmacological and psychological 
treatment [9]. 


Failure to respond to first-line antidepressant treatment 


If a moderate to severe depressive condition does not respond 
within a number of weeks to psychological treatment or antidepres- 
sant medication, the diagnosis should be reviewed carefully and 
further assessment made as to whether continuing psychosocial dif- 
ficulties are perpetuating the disorder. Substance misuse, which is 
commonly comorbid with depression, may also hinder the response 
to treatment and may need to be separately addressed (Box 77.2). 

If a patient has failed to respond to appropriately delivered psy- 
chological treatment, then a trial of medication can be considered. 
Where there has been a lack of response to antidepressant treatment, 
it is important to check that the patient has been taking medication 
as prescribed. If not, the reasons for this should be sought. The pa- 
tient may be convinced that no treatment can help or may find the 
side effects unpleasant. If this enquiry is unrevealing, antidepres- 
sants should be continued at an increased dose, if possible. In gen- 
eral, SSRIs do not have well-described dose-response relationships, 
although some patients respond to higher doses, particularly if a 
partial response has been observed at a standard dose. If psycho- 
therapy has not been offered at this point, a structured psychological 
treatment, such as CBT or IPT, should be considered if it is available 
in a timely way. 

The large pragmatic Sequenced Treatment Alternatives to Relieve 
Depression (STAR*D) study found similar response rates when 
SSRI-non-responsive patients were randomized to CBT or to 
pharmacological augmentation; however, the pharmacological ap- 
proach worked more quickly [47]. A study of primary care patients 
who had failed to respond adequately to antidepressant medication 
found that CBT (a median of 11 sessions given over 6 months) was 
markedly superior to TAU in terms of response rate at 6 months 
(46% vs 22%, respectively; NNT = 4) [48]. 

If it becomes clear that the patient is not getting better, a number 
of further steps can be taken to improve pharmacological treatment 
(Box 77.4). There is not a fixed order in which treatments should 
be offered. Prescribing decisions should be made in collaboration 
with the individual patient, taking into account factors such as spe- 
cific symptomatology, how far the condition may have shown a 
partial response, and the side effect profile of the various treatment 
options [46]. 


Box 77.4 Some pharmacological treatments 
for resistant depression 


e Increase antidepressant to the maximum dose, if tolerance permits; if 
the patient has depressive psychosis, add an antipsychotic drug; try a 
different class of antidepressant drug, including venlafaxine and tri- 
cyclic antidepressants. 

e Try an antidepressant combination (for example, an SSRI or 
venlafaxine with mirtazapine). 

e Add an atypical antipsychotic drug to an SSRI or venlafaxine. 

e Add lithium to antidepressant drug treatment. 

e MAOIs (can be usefully combined with lithium). 

JECT 


Reproduced from Cowen P, Harrison P, Burns T, Shorter Oxford Textbook of Psychiatry, 
Sixth Edition, Copyright (2012), with permission from Oxford University Press. 














Switching antidepressants 


If a patient does not respond to one antidepressant, the first step 
is usually to stop the first medication and try another. If a pa- 
tient has not responded to one kind of antidepressant, it would 
seem sensible to switch to an antidepressant that has different 
pharmacological properties. However, a meta-analysis showed 
that switching from an ineffective SSRI to a different class of anti- 
depressant (mirtazapine, venlafaxine, or bupropion) was only 
marginally better in terms of remission rate than switching to a 
second SSRI (28% vs 23.5%, respectively; NNT = 22) [49]. Overall, 
however, around 30% of patients unresponsive to a first anti- 
depressant medication will show some benefit from a switch to a 
second compound. 

Both amitriptyline and venlafaxine appear somewhat more ef- 
fective than SSRIs in patients with severe depression; these drugs are 
therefore worth trying at some point in the management of patients 
who are unresponsive to initial medication trials [9]. While MAOIs 
are also clearly beneficial in some patients with resistant depression, 
the food and drug restrictions required mean that these drugs are 
likely to be used as something of a last resort, unless a patient has 
responded well to them in the past [9]. 


Combination/augmentation of antidepressants 


When switching antidepressant preparations, withdrawal of the first 
compound may not be straightforward. For example, patients may 
have gained some small symptomatic benefit from the treatment 
and this may be lost. Also, if the first medication is stopped quickly, 
withdrawal symptoms can result. On the other hand, a protracted 
changeover in medication may be distressing for a patient who is in 
despair about ever feeling better. 

For this reason, in patients who are unresponsive or partly re- 
sponsive to first-line medication, it may be more appropriate to add 
a second compound to the antidepressant. This is called ‘combin- 
ation therapy’ where the second compound is itself considered to be 
an antidepressant. The term ‘augmentation’ refers to the addition of 
a drug that is not by itself regarded as an effective antidepressant but 
is nevertheless able to produce an therapeutic response when added 
to ineffective antidepressant medication. 

Combination therapy is widely used in patients who have failed 
to respond to first-line antidepressant treatments, with drugs such 
as mirtazapine and bupropion commonly being added to ineffective 
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SSRI and SNRI treatment. However, the evidence that these are ef- 
ficacious strategies is not well supported by large randomized trials 
[9]. There is more evidence for the benefit of augmentation therapy, 
particularly using low-dose atypical antipsychotic drugs. A meta- 
analysis involving about 3500 patients found that, relative to placebo, 
the addition of antipsychotics such as aripiprazole and quetiapine to 
SSRI treatment was significantly more likely to result in clinical re- 
mission (NNT = 9) [50]. However, atypical antipsychotics are not 
well tolerated because of sedation and weight gain in the case of 
quetiapine, and restlessness and agitation with aripiprazole. 

Lithium also appears effective in augmentation treatment when 
added to ineffective antidepressant medication. Meta-analyses of 
randomized studies suggest an NNT of around 5, but overall the 
total number of patients studied is relatively small [51]. In addition, 
many lithium augmentation studies involved a primary TCA treat- 
ment and the efficacy of lithium in augmentation of newer anti- 
depressants is less firmly established. Because of the potential for 
serotonin toxicity, the addition of lithium to SSRIs and SNRIs should 
be carried out cautiously, starting at low lithium doses. 


Ketamine 


Ketamine is a dissociative anaesthetic that blocks the N-methyl-D- 
aspartate (NMDA) subtype of the glutamate receptor. Recent con- 
trolled studies have shown that ketamine, given intravenously at a 
sub-anaesthetic dose, produces a striking and rapid remission of 
depressive symptoms in patients resistant to conventional pharma- 
cotherapy. The improvement in depression begins about 1 hour after 
ketamine administration, as dissociative symptoms wane, and can 
last up to 7 days. A systematic review of controlled studies involving 
180 patients with depression found higher rates of clinical response 
to ketamine than to intravenous saline or intravenous midazolam at 
1, 3, and 7 days, with an NNT of 3-5 [52]. 

The rapid and striking antidepressant effect of ketamine in resistant 
depression is of great interest. However, it has not proved possible 
in most patients to maintain the antidepressant effect with the use 
of oral glutamatergic agents such as riluzole or memantine, which 
means that repeated administration of intravenous ketamine is often 
necessary to maintain the therapeutic response. A form of intranasal 
ketamine has been developed that appears to have acute antidepres- 
sant effects and that might be feasible to use for repeated treatment. 
However, there are concerns about possible ketamine toxicity after 
repeated use, including bladder changes, as well adverse psycho- 
logical effects such as dependence [9]. Nevertheless, the antidepres- 
sant effect of ketamine has focused attention on the role of glutamate 
in antidepressant action and might conceivably lead to novel classes 
of agents for the common clinical problem of resistant depression. 


Continuation and maintenance treatment 
Continuation treatment 


After symptomatic remission, follow-up for several months is usu- 
ally recommended. If the improvement in depression appears 
to have been brought about by an antidepressant drug, that drug 
should usually be continued for about 6 months and then grad- 
ually withdrawn over a period of several weeks, provided longer- 
term maintenance treatment is not indicated. If residual depressive 
symptoms are still present, it is safer not to withdraw medication [3]. 
At follow-up interviews, a careful watch should be kept for signs of 


discontinuation reactions or relapse. It is helpful to discuss with the 
patient the possible early signs of relapse and to develop a plan of 
action, should any of these signs appear. Involving relatives in this 
plan can be helpful. 

If a patient has responded to CBT or IPT, having some additional 
sessions during the continuation phase lessens the risk of relapse. 
It is known that patients with residual depressive symptoms are at 
greater risk of relapse, and the use of CBT, whatever the primary 
treatment, can have a useful prophylactic effect in this situation [29]. 
As noted earlier, there is a high risk of relapse in the 6 months after 
the use of ECT for resistant depression, and the best continuation 
therapy is not yet determined. Some benefit has been claimed for a 
continuation regime of lithium and nortriptyline [15]. The STAR*D 
study showed that, in general, depressed patients who required sev- 
eral treatment steps to achieve remission had high relapse rates over 
the next year [53]. In such patients, additional psychological treat- 
ment may have a useful role in improving outcome [9]. 


Maintenance treatment 


Major depression is often recurrent, and long-term maintenance 
treatment may need to be considered. It is estimated that, among pa- 
tients who have had three episodes of major depression, the likelihood 
of another episode is 90%. The usual recommendation is that main- 
tenance drug treatment should be considered if a patient has had two 
previous episodes of depression within a 5-year period, particularly 
if there is a family history of recurrent major depression or personal 
and social factors predictive of recurrence [3]. In addition, the clin- 
ician will need to take into account factors such as the likely impact 
of a recurrence on the patient's life and the previous response to drug 
treatment, as well as the patient's view of long-term drug therapy [46]. 

For patients taking antidepressant medication, the longer-term 
treatment choice is most readily based on the response to acute phase 
therapy. If, however, a change needs to be made because of adverse 
effects (for example, sexual dysfunction with SSRIs), an alternative 
choice can be decided on the basis of the side effect profile. Lithium 
is not generally employed as monotherapy in longer-maintenance 
treatment of depression but can be added to antidepressant treat- 
ment if the response is inadequate or there is a high risk of suicide [3]. 

For patients who do not do well with these measures or where 
a patient at high risk of relapse wishes to discontinue antidepres- 
sant medication, there is growing evidence for a role of mindfulness 
CBT [29, 30]. Sometimes, it appears that the depressive disorder is 
related to continuing life stressors such as overwork, complicated 
social relationships, or substance misuse. In this case, various forms 
of psychotherapy may help adjustment to a lifestyle that is less likely 
to lead to further illness. 

Another important aspect of longer-term follow-up is attention 
to the physical health of the patient, bearing in mind that people 
with depression are at substantially increased risk of medical 
comorbidities, particularly cardiovascular disease and metabolic 
syndrome [54]. Regular monitoring of relevant aspects of physical 
health is therefore an important part of overall management. 
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Exposure to stressful adverse events has always been an important 
part of human life, and the notion that exposure to such events can 
have a profound impact on our emotions and behaviour was de- 
scribed by philosophers, writers, and artists long before it became 
the focus of mental health professionals and researchers. Prior to 
the introduction of the post-traumatic stress disorder (PTSD) diag- 
nosis into the third edition of the Diagnostic and Statistical Manual 
of Mental Disorders (DSM-III) [1], there was widespread recogni- 
tion that experiencing stressful life events could have numerous 
negative effects on physical and mental health [2, 3], but there was 
little to no focus on distinguishing between potentially traumatic vs 
other stressor events. The PTSD Criterion A (the stressor criterion) 
serves an important gatekeeping function by distinguishing between 
stressor events that are and are not considered to be potentially trau- 
matic and capable of producing PTSD. Thus, it is not surprising that 
one of the most controversial and challenging aspects of the PTSD 
diagnosis has been defining potentially traumatic events (PTEs) and 
distinguishing PTEs from other stressor events (OSEs) [4]. 

PTSD was initially categorized as an anxiety disorder, whereas 
other disorders that are clearly precipitated by exposure to adverse 
stressor events [for example, adjustment disorders (ADs) and re- 
active attachment disorder (RAD)] were placed elsewhere in the 
DSM. An unintended consequence of this distinction between PTEs 
and OSEs and the placement of PTSD and other stressor-related 
disorders in different chapters of prior additions of the DSM was 
that it obscured the extent to which exposure to stressor events can 
produce a broad range of mental disorders and health problems. 
Perhaps the most important change regarding PTSD in DSM-5 [5] 
was its removal from the ‘Anxiety disorders’ category and reclassi- 
fication within a new diagnostic category that captures problems 
that emerge following exposure to PTEs and OSEs. Although PTSD 























may be expressed as a fear-based anxiety disorder, as defined in both 
DSM-III [1] and DSM-IV [6], it may be expressed also as an anhe- 
donic/dysphoric, externalizing, or dissociative disorder [7]. 

Given abundant evidence that post-traumatic psychopath- 
ology is characterized by a wider range of cognitions, emotions, 
and behaviour than seen in an anxiety disorder, DSM-5 created a 
new classification for PTSD and related disorders—‘Trauma- and 
stressor-related disorders. All diagnoses included in this new chapter 
have two things in common: (1) a discrete traumatic/adverse event 
or experience that preceded the onset or aggravation of symptoms; 
and (2) a wide range of cognitions, emotions, and behaviours em- 
bedded within DSM-5 diagnostic criteria for each disorder. This 
new category includes five specific disorders: PTSD, ASD, AD, RAD, 
and disinhibited social engagement disorder (DSED). This chapter 
reviews each of these disorders with respect to DSM-5 diagnostic 
criteria, separated into adult- and child-specific disorders, describes 
notable changes from DSM-IV, and discusses theoretical and clin- 
ical implications of these changes. 

The DSM-5 revision process adopted a very conservative ap- 
proach and required very strong evidence to modify, delete, or add 
any symptoms to any diagnostic disorder [8]. As detailed later in 
this chapter and elsewhere [8, 9], major changes for PTSD were re- 
visions of the stressor criterion (Criterion A1), deletion of Criterion 
A2, reconceptualizing PTSD’s symptom meta-structure as a four- 
factor model (rather than three factors, as in DSM-IV and DSM- 
III), revising specific diagnostic criteria, addition of a pre-school 
subtype, and addition of a dissociative subtype. As with PTSD, ASD 
diagnostic criteria were modified to recognize the variability of post- 
traumatic distress responses. Further, the DSM-IV stipulation that 
three dissociative symptoms were required to meet diagnostic cri- 
teria was eliminated from ASD. AD, previously a residual diagnostic 
category in a class by itselfin DSM-IV, was moved into the “Trauma- 
and stress-related disorders’ category because onset is preceded by 
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an adverse event (PTE or OSE). Finally, the absence of adequate 
caregiving (for example, extreme insufficient care) is a prerequisite 
for two childhood disorders—RAD and DSED. 

Recognition of the heterogeneity of post-traumatic distress in 
both PTSD and ASD, with consequent removal of these diagnoses 
from the ‘Anxiety disorders’ category raised a number of important 
theoretical issues. This is highlighted by the very different PTSD 
conceptualization offered by DSM-5’s broad construct, in contrast 
to the WHO’s forthcoming ICD-11 [10], which proposes a very 
narrow construct of PTSD. The broad DSM-5 approach seeks to 
‘provide clinicians with a menu of symptoms and symptom clusters 
that would adequately cover the most typical clinical presentations’ 
[8, p. 550], but ICD-11’s narrow approach restricts PTSD diagnostic 
criteria to symptoms that are unique to PTSD, and not found in 
other anxiety or mood disorders [10]. Therefore, symptoms, such as 
insomnia, irritability, cognitive impairment, dysphoria, alienation, 
and detachment, which occur in DSM-5’s PTSD, as well as in other 
DSM-5 disorders, are eliminated in ICD-11’s PTSD criteria. There 
are 20 PTSD symptoms in DSM-5, with a minimum of six symptoms 
needed to meet the diagnostic threshold, whereas ICD-11 only has 
six PTSD symptoms and requires a minimum of three symptoms 
to meet the diagnostic threshold. As stated elsewhere, to extend the 
ICD-11 approach to medical diagnoses, ‘one would eliminate symp- 
toms such as fever, pain, and edema from the diagnostic criteria of 
a specific disease because they are found in so many other diseases’ 
[8, p. 550]. 





Categorical vs dimensional diagnostic approaches 


Both the DSM and ICD systems take categorical approaches towards 
PTSD and other diagnoses (for more information, see Chapter 7). 
That is, a person either has a diagnosis based on meeting symp- 
toms and functional impairment thresholds or they do not have a 
diagnosis because they do not meet these thresholds. This categor- 
ical approach has the advantage of simplicity by making it easier to 
study groups of individuals with and without diagnoses and to de- 
termine eligibility for services or compensation based on diagnosis. 
However, categorical diagnostic approaches have disadvantages and 
have been criticized on theoretical, empirical, and practical grounds. 

The case can be made that the latent construct of PTSD is more 
dimensional than categorical. Two lines of evidence support this 
contention. Firstly, a series of studies examined the latent structure 
of PTSD, using a sophisticated set of statistical procedures called 
taxometric analysis which used national probability samples of 
adult women and adolescents in the United States [11], randomly 
selected combat veterans in Australia [12], and United States combat 
veterans [13]. Taxometrics is a branch of applied mathematics that 
determines whether the latent structure of a phenomenon is a taxon 
(that is, a latent category whose members are qualitatively different 
from non-members of that category) or non-taxonic (that is, the la- 
tent structure is continuous in nature and the differences are quan- 
titative, not qualitative). Analysis of the latent structure of PTSD in 
these three studies obtained indicators of PTSD symptomatology 
from individuals that were subjected to taxometric analyses to deter- 
mine whether the data fit a categorical or a continuous solution. All 
studies found that the latent structure of PTSD is continuous in na- 
ture, not a taxon or distinct category. Secondly, studies of individuals 


with PTE exposure with several PTSD symptoms, but not enough to 
meet diagnostic criteria, still have elevated rates of suicidal behav- 
iours and other problems in functioning [14, 15]. This suggests that 
it may be more useful to think about people having degrees of PTSD, 
as opposed to either having PTSD or not. 

Although the DSM-5 planning process recognized some of the 
limitations of its current categorical approach to diagnosis and con- 
ducted preliminary work investigating the feasibility of adopting a 
more dimensional approach, a decision was made that this was pre- 
mature for DSM-5. Therefore, PTSD remained a categorical diag- 
nosis in DSM-5 and will remain so in ICD-11. 


Post-traumatic stress disorder 





DSM-5 PTSD description and revisions from DSM-IV 


The PTSD diagnostic criteria have evolved incrementally with each 
DSM revision. However, the core features of PTSD have remained 
similar throughout these evolutionary changes. The underlying as- 
sumption of this diagnosis is that exposure to PTEs has the potential 
to produce a characteristic set of symptoms, resulting in substan- 
tial psychological distress and/or functional impairment. Many in- 
dividuals exposed to PTEs are resilient and do not develop PTSD 
[16], confirming that PTE exposure is a necessary, but not sufficient, 
condition for PTSD development and highlighting the importance 
of understanding factors associated with PTSD. DSM-5 contains 
PTSD diagnostic criteria, as well as detailed text that describes and 
elaborates upon these criteria and symptoms. 

In DSM-5, Criterion A defines eligible PTEs as an event or events 
that involve exposure to: (1) death (either actual death or threats of 
being killed); (2) serious injury; or (3) sexual violence. Exposure to 
PTEs can happen in one or more of the following four ways: (1) ex- 
periencing the event directly; (2) witnessing the PTE happening to 
someone else in real time, in person, and not via electronic media; 
(3) learning that the PTE happened to a family member or close 
friend (note: if the PTE involved the death of a family member or close 
friend, the cause of death must have been violent or accidental); and 
(4) experiencing extreme or repeated exposure to disturbing details or 
aspects of PTEs experienced by other people, usually in a work-related 
context (for example, collecting human remains following combat, 
crime, or disasters; law enforcement personnel viewing photos of, or 
hearing, depictions of child murders or child pornography cases; mili- 
tary drone operators viewing death, destruction precipitated by drone 
attacks, and psychotherapists exposed to details of their patients’ trau- 
matic experiences). In the latter type of exposure, there is exclusion of 
cases in which exposure occurs through electronic media, television, 
movies, or pictures, unless the exposure is work-related. 

The DSM-5 text gives numerous examples of the types of PTEs 
that are included in Criterion A. These include serious accidents and 
natural and man-made disasters, including terrorist attacks; combat 
or war zone exposure; physical or sexual assault; exposure to haz- 
ardous chemicals; torture; witnessing dead bodies or parts of dead 
bodies; witnessing physical or sexual assaults; threats of injury or 
death to family members or close friends; deaths of family members 
or close friends due to violence, accidents, or disasters; and the types 
of extreme or repeated exposure to disturbing aspects of PTEs ex- 
perienced by others described previously. Medical procedures can 
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also qualify as PTEs if they ‘involve sudden, catastrophic events (for 
example, waking during surgery, anaphylactic shock)’ [5, p. 274]. 

Four points are important to note about PTEs described in DSM- 
5. Firstly, the text indicates that the list of PTEs described in the text 
is not inclusive, so other events can qualify as PTEs, even if they 
are not specifically listed. Secondly, the Criterion A text and ac- 
companying explanation clearly acknowledge the possibility that 
many individuals have experienced more than one PTE. Thirdly, the 
DSM-IV Criterion A (for example, A2) requirement that an event 
must produce fear, helplessness, or horror to qualify as a PTE was 
removed from DSM-5, which would likely result in an increase in 
the number of events qualifying as PTEs in DSM-5 vs in DSM-IV. 
However, DSM-5 also excluded some events that were included as 
PTEs in DSM-IV (for example, non-violent deaths of family mem- 
bers or close friends; observing violent events such as mass shoot- 
ings or terrorist attacks exclusively via electronic media, unless the 
viewing was work-related), which would reduce the number of 
events qualifying as PTEs in DSM-5, as compared to DSM-IV. One 
study with a national sample of United States adults compared the 
prevalence of lifetime exposure to PTEs using DSM-5 vs DSM-IV 
Criterion A definition and found that lifetime prevalence was high 
using both definitions, but that lifetime prevalence of exposure 
was slightly lower using the DSM-5 Criterion A definition [16]. 
This suggests that the DSM-5 revisions of Criterion A had largely 
offsetting effects on the proportion of individuals with a qualifying 
PTE. However, this study did find that the biggest reason individ- 
uals had PTSD with DSM-IV, but not DSM-5, criteria, was exclu- 
sion in DSM-5 of deaths that were not violent or accidental. Finally, 
it is extremely important to understand that defining an event as a 
qualifying PTE does not mean that everyone exposed to that event 
will develop PTSD or that all qualifying PTEs have an equal prob- 
ability of producing PTSD if they are experienced. Some PTEs have a 
much higher probability of increasing the risk of PTSD than others, 
as is described in the DSM-5 text. 

Criterion B symptoms are: (1) recurrent memories of the PTE that 
are involuntary, intrusive, and distressing; (2) distressing dreams of 
the PTE or something that is closely related to the PTE; (3) flash- 
backs or other dissociative reactions in which the person feels as if 
the PTE is happening again (note: these dissociative reactions are 
on a continuum, ranging from feelings that are reliving the PTE all 
over again to experiencing fragmentary sensory or perceptual sen- 
sations that occurred during the PTE); (4) intense or prolonged 
psychological distress that occurs when someone encounters an in- 
ternal or external cue that reminds them of the PTE or something 
that is closely associated with it; and (5) physiological reactions of 
an extreme nature that occur when a person encounters internal or 
external cues that remind them of the PTE or something closely as- 
sociated to it. Criterion B is met if an individual has one or more of 
these symptoms. Criterion B in DSM-IV was revised in very minor 
ways, so this criterion is virtually the same as in DSM-5, with the 
exception of minor changes in language. 

Criterion C has two symptoms that measure persistent avoidance 
of stimuli that are associated with a PTE: (1) avoiding, or attempts 
to avoid, distressing memories, thoughts, or feelings about a PTE 
or something that is closely associated with a PTE; and (2) avoiding 
or attempting to avoid external reminders of the PTE that arouse 
distressing memories, thoughts, or feelings about the PTE or 
things that are closely associated with it. Examples of such external 


reminders include people, places, conversations, activities, objects, 
or situations. Criterion C is met if an individual has at least one of 
these symptoms. 

One of the biggest revisions of DSM-IV criteria in DSM-5 was 
constructing a new Criterion C that contains only two active avoid- 
ance symptoms by pulling these symptoms out of DSM-IV Criterion 
C that included both avoidance and numbing symptoms. DSM-IV 
Criterion C had two avoidance symptoms and five numbing symp- 
toms, and three Criterion C symptoms were required to meet the 
threshold. This meant that someone could meet the DSM-IV 
Criterion C threshold of three symptoms without having any ac- 
tive avoidance symptoms. Numerous factor-analytic studies found 
that DSM-IV Criterion C symptoms were split into an avoidance 
symptom factor and a numbing symptom factor. This provided 
ample justification for creating a separate criterion for active avoid- 
ance symptoms [7]. Some critics of DSM-5 have expressed concern 
that requiring the presence of at least one active avoidance symptom 
to obtain a PTSD diagnosis in DSM-5 will result in some people not 
having PTSD using DSM-5 criteria who would have PTSD using 
DSM-IV criteria [17]. Others argue that avoidance has always been 
an important part of the PTSD construct and clinical picture [16] 
and that the presence of active avoidance distinguishes PTSD from 
other disorders, including major depression [12]. The Kilpatrick 
et al. study [16] did find that failure to have at least one active avoid- 
ance symptom was the second biggest reason for persons meeting 
DSM-IV but failing to meet DSM-5 criteria for PTSD, but this oc- 
curred in a very small number of cases. Taking all of this into con- 
sideration, the DSM-5 revision requiring active avoidance is well 
justified because avoidance has always been a key component of the 
PTSD construct. 

Criterion D has seven symptoms measuring negative alterations 
in cognition and mood. These symptoms must be associated with 
the PTE and must have either begun after, or been aggravated by 
exposure to, the PTE. Criterion D symptoms are: (D1) inability to 
recall important aspects of the PTE, typically due to dissociative am- 
nesia and not due to other factors such as head injury or heavy al- 
cohol or drug use; (D2) negative beliefs and/or expectations about 
others or the world that are persistent and exaggerated; (D3) dis- 
torted cognitions about the cause or consequences of the PTE that 
result in the person unreasonably blaming themselves or others for 
what happened; (D4) negative emotional states such as feelings of 
fear, horror, anger, guilt, or shame; (D5) diminished interest or par- 
ticipation in important life activities; (D6) feeling detached or es- 
tranged from other people; and (D7) inability to experience positive 
emotions such as happiness, satisfaction, or loving feelings. At least 
two of these symptoms are required to meet Criterion D. 

ICD-11 proposed criteria for PTSD have no negative cognition 
or mood symptoms [10, 18]. However, there is substantial evidence 
that exposure to PTEs can produce these alterations and that ex- 
posed individuals who develop PTSD are significantly more likely to 
have negative alterations in cognition and mood. Two papers using 
data from a large national sample of United States adults addressed 
this issue. The first paper compared the prevalence of D2 and D3 
symptoms among individuals with no PTE exposure, PTE exposure 
that did not result in PTSD, and PTE exposure that did result in 
PTSD [19]. Participants identified the type of cognitive distortion 
experienced if D2 or D3 was endorsed. No significant differences 
were found between groups with no PTE exposure or PTE exposure 
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that did not result in PTSD. However, those with PTSD were sig- 
nificantly more likely to believe that they were a bad person; that 
they could not trust anyone; that nothing good would ever occur; 
that they had no chance for a career, relationship, marriage, or chil- 
dren; that their soul or spirit was permanently damaged; and that 
the world was completely dangerous. The group with PTSD was also 
significantly more likely to exhibit beliefs that the PTE happened be- 
cause of something they did wrong; that they should have been able 
to prevent it; that the event happened because of something about 
them; and that they blamed themselves for not being able to recover 
and get on with their lives. 

The second paper examined associations between specific nega- 
tive emotions in symptom D4 and a DSM-5 PTSD diagnosis among 
adults who had experienced a PTE involving a physical or sexual 
assault [20]. Compared to individuals with assault histories who did 
not develop PTSD, individuals with assault-related PTSD were sig- 
nificantly more likely to have several negative emotions, including 
anger, guilt, shame, and horror, as well as fear. This study also found 
that the emotion of anger among those with PTSD was more preva- 
lent than the emotion of fear. 

Criterion E consists of major alterations in arousal and reactivity 
associated with the PTE. There are six symptoms, two or more of 
which are required and most of which are identical to those in DSM- 
IV Criterion D. However, two symptoms are new or substantially 
modified. Symptom E1 is irritable behaviour and angry outbursts 
that are expressed as verbal or physical aggression towards other 
people or objects. This symptom requires some behavioural mani- 
festation of anger and aggression, not just an angry mood, and 
there is considerable evidence that individuals with PTSD have a 
substantially higher prevalence of the symptom than those without 
PTSD [20, 21]. Symptom E2 is reckless or self-destructive behaviour 
such as dangerous driving, excessive alcohol or drug abuse, or risky 
sexual behaviour. The behaviours described in E1 and E2 have long 
been a part of the clinical picture of many patients with PTSD and 
have also been an important focus of clinical treatment efforts. The 
remaining Criterion E symptoms are hypervigilance, exaggerated 
startle response, problems with concentration, and sleep difficulties. 

As was the case with DSM-IV, PTSD in DSM-5 cannot be diag- 
nosed unless the symptoms persist for at least 1 month (Criterion 
F). Likewise, PTSD cannot be diagnosed unless symptoms produce 
significant distress and/or functional impairment (Criterion G). 

PTSD in DSM-5 has two other additions of note. Firstly, there is a 
new explicit set of PTSD criteria for children aged 6 years or younger. 
Child trauma professionals had observed that young children had 
extremely low rates of PTSD, using DSM-IV criteria, primarily due 
to adult-centric phrasing of symptoms and the requirement to have 
an inappropriately high number of symptoms within the DSM-IV 
Criteria C and D to meet the diagnostic threshold. Consequently, 
the DSM-5 child PTSD diagnostic criteria use a more child-centric 
language focus on observable behaviours, while eliminating sub- 
jective symptoms, and reduce the number of symptoms per cri- 
terion to meet the diagnosis. Secondly, there is a new dissociative 
subtype of PTSD, defined on the basis of repeated experiencing of 
either depersonalization or derealization. This addition was based 
on evidence of neurobiological differences among individuals with 
PTSD with and without these dissociative symptoms [22], as well 
as evidence that treatment outcome is worse among PTSD-positive 
patients with dissociative symptoms [7, 23]. 


PTSD prevalence 


Obtaining accurate estimates of PTSD prevalence requires the use 
of a probability sample of the population in question, compre- 
hensive assessment of PTE exposure, and thorough assessment of 
PTSD symptoms and symptom-related distress/functional im- 
pairment [24, 25]. Several good studies provide information about 
PTSD prevalence using DSM-IV criteria. The National Comorbidity 
Survey-Replication (NCS-R) [26] found that the lifetime prevalence 
of PTSD among a national probability sample of United States adults 
was 6.8% overall, 7.9% among women, and 3.6% among men. The 
National Epidemiologic Survey on Alcohol and Related Conditions 
(NESARC) [27] reported that the lifetime prevalence PTSD was 
7.3%. A study of PTSD among a national household probability 
sample of United States adolescents found that the past 6-month 
prevalence of PTSD was 6.3% among female adolescents and 3.7% 
among male adolescents [28]. 

Limited data are available about the impact of the DSM-5 revi- 
sions on PTSD prevalence. However, one study that compared 
PTSD prevalence using DSM-IV and DSM-5 criteria in a large na- 
tional sample of United States adults found slightly lower prevalence 
using DSM-5 vs DSM-IV criteria for lifetime (8.3% vs 9.8%, respect- 
ively), the past 12 months (4.7% vs 6.3%, respectively), and the past 
6 months (3.8% vs 4.7%, respectively) PTSD [16]. Another major 
study with a large national probability sample of United States adults 
reported that the lifetime and past-year prevalence of PTSD using 
DSM-5 criteria were, respectively, 6.1% and 4.7% [29], but this is 
clearly an underestimate of DSM-5 PTSD prevalence due to the use 
of an incorrect diagnostic algorithm requiring three symptoms each 
for Criteria D and E, instead of the two symptoms in each that are 
actually required. Another study of United States Army soldiers by 
Hoge and colleagues [30] confirmed that PTSD prevalence using 
DSM-5 vs DSM-IV criteria was slightly lower (11.5% and 12.3, re- 
spectively). In summary, the data are clear that DSM-5 PTSD is 
a prevalent disorder, with a prevalence that appears to be slightly 
lower overall than when using DSM-IV criteria. 


PTSD in ICD-11 


The ICD-11 revision is scheduled for completion in 2019, so no 
proposed changes have been finalized. However, it is apparent 
from a draft of the proposed criteria and descriptions of the pro- 
posed approach that the ICD-11 version of PTSD is likely to be a 
radical departure from the DSM-5 version of PTSD, as well as from 
the DSM-IV and ICD-10 version of PTSD [10, 18]. The most rad- 
ical proposed change separates PTSD into two disorders: PTSD and 
complex PTSD. The stated rationale is to narrow the scope of the 
PTSD construct, reduce the number of symptoms from 20 in DSM- 
5 to six in the ICD-11 proposal, increase diagnostic reliability, and 
eliminate a large number of symptoms that overlap with other dis- 
orders such as depression [10, 18]. 

The ICD-11 proposed definition of Criterion A describes ex- 
posure to an extremely threatening or horrific event or a series of 
events which is substantially less specific than the DSM-5 definition. 
The ICD-11 PTSD proposal focuses on three core elements: (1) re- 
experiencing of the trauma; (2) avoidance of reminders of the event; 
and (3) a heightened perception of threat and arousal. This ICD-11 
proposal for PTSD is operationalized by requiring at least one (of 
two) re-experiencing symptom (that is, flashbacks or nightmares), 
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one (of two) avoidance symptom (that is, avoidance of internal or 
external reminders), and one (of two) hyperarousal symptom (that 
is, hypervigilance or exaggerated startle response) [10, 18, 31]. The 
ICD-11 proposal omits the entire DSM-5 Criterion D and omits four 
of the DSM-5 Criterion E hyperarousal symptoms (that is, E1 irrit- 
able behaviour and angry outbursts; E2 reckless and self-destructive 
behaviour; E5 concentration difficulties; and E5 sleep difficulties). 
This PTSD proposal, like DSM-5, but unlike ICD-10, requires the 
symptoms to produce distress and/or functional impairment. 

Complex PTSD is a controversial diagnosis which has been criti- 
cized on conceptual and empirical grounds [27, 31]. Under the 
ICD-11 proposal, the first requirement for complex PTSD is to 
have the diagnosis of PTSD. Complex PTSD, as defined by ICD-11 
proposal [10, 18], requires at least one symptom in each of three 
complex PTSD symptom clusters: (1) affect dysregulation (that is, 
emotional reactivity, dissociation, anger, aggression, and emotional 
numbing); (2) negative self-concept (that is, negative beliefs about 
the core value of the self, along with feelings of guilt and shame); and 
(3) interpersonal disturbances (that is, avoidance of relationships, 
estrangement, and lack of emotional intimacy in relationships). 
Complex PTSD was not included in DSM-5 due to a lack of sup- 
porting evidence [7]. 

An obvious question is the effect these extensive revisions might 
have on PTSD prevalence and whether there is an impact on the 
percentage of those exposed to a PTE who meet diagnostic criteria 
for PTSD, because having a diagnosis is often required to qualify for 
treatment services or other benefits. The few head-to-head compari- 
sons of ICD-11 vs DSM-5 prevalence and caseness indicate that the 
ICD-11 proposal may produce a dramatic reduction in PTSD preva- 
lence and caseness. Wisco and colleagues [32] compared past-month 
PTSD prevalence using DSM-5 and ICD-11 criteria in a national 
sample of United States adults and a sample of United States mili- 
tary veterans. The prevalence of past-month PTSD prevalence using 
ICD-11 criteria was lower than when using DSM-5 criteria in both 
national adult and veteran samples. O'Donnell and colleagues [33] 
also reported the same pattern of findings in a probability sample of 
injury patients from four hospitals. A study from Denmark [34] also 
found that PTSD prevalence was lower using ICD-11, as opposed 
to DSM-5, criteria. An international study conducted in 13 coun- 
tries [15] did not actually assess PTSD using DSM-5 and ICD-11 
criteria but approximated these diagnoses using data from DSM-IV 
assessments. This study reported a slightly higher PTSD prevalence 
estimate when using the ICD-11 approximation (3.2% vs 3.0% for 
DSM-5), but the DSM-5 estimate was clearly lower than it should 
have been due to the three new symptoms not having been meas- 
ured. Although more research on this topic is needed, findings sug- 
gest that the proposed ICD-11 PTSD criteria are likely to have the 
unintended consequence of reducing the prevalence of PTSD and 
the number of people with a PTSD diagnosis that could qualify them 
for treatment and other services. 


Acute stress disorder 


Acute stress disorder (ASD) was first introduced in DSM-IV and 
significantly updated in DSM-5. The DSM-5 definition includes ex- 
posure to a Criterion A PTE, as described previously in the PTSD 
criteria. The key difference between PTSD and ASD is time and 


markers of distress. ASD is defined as distress occurring after 3 days, 
but within 1 month ofa Criterion A PTE. Further, ASD requires nine 
or more symptoms in the five major categories of intrusion, nega- 
tive mood, dissociative, avoidance, and arousal symptoms (DSM-5). 
Consistent with other mental health disorders, the symptoms must 
cause clinically significant distress or functional impairment. 

DSM-III [1] did not permit a trauma-related diagnosis prior to 
1-month post-traumatic event exposure, perhaps due to the norma- 
tive symptoms associated with experiencing traumatic events. The 
introduction of ASD as a mental health disorder within DSM-IV 
[6] allowed for individuals who have marked distress due to a PTE 
within 1 month of the PTE to receive clinical services that may po- 
tentially decrease distress and the development of PTSD. However, 
the DSM-IV definition of ASD gave excessive weight to dissocia- 
tive symptoms. Although other symptoms, including anxiety and 
arousal, were part of the diagnosis, individuals needed three or more 
dissociative symptoms in order to meet the criteria. In fact, several 
other acute symptoms are associated with the development of PTSD, 
rather than only peritraumatic dissociation [35]. When developing 
DSM-5, several critiques regarding DSM-IV definitions of ASD 
were considered, including questioning the predictive value of ASD 
in predicting the development of PTSD, differentiating ASD from 
other mental health disorders, the utility of ASD to enhance early 
intervention, and the general utility of the ASD diagnosis. These cri- 
tiques have been described elsewhere in more detail [35-37], but 
recommendations for DSM-5 primarily focused on the broadening 
of symptoms to remove DSM-IV’s unjustified emphasis on dissocia- 
tive symptoms. 

The DSM-5 definition of ASD differs from that of the DSM-IV 
definition in several ways. Firstly, the definition of a PTE was up- 
dated to be consistent with the new definition of a PTE, as described 
in PTSD. The most marked change in symptom presentation was the 
elimination of DSM-IV’s requirement of three or more dissociative 
symptoms, while giving greater weight to a broader set of symptoms 
in DSM-5. Although DSM-5 does recognize that some people may 
experience dissociative symptoms, others may now meet ASD cri- 
teria without a single dissociative symptom. In other words, there 
has been a shift in our understanding of ASD from a predominantly 
dissociative disorder to one that includes a greater variability in 
symptom presentation. 

Preliminary evidence suggests that both DSM-IV and DSM-5 
criteria have equal predictive value related to the development of 
PTSD, but DSM-5 criteria have greater specificity, compared to 
DSM-IV criteria [35]. According to DSM-5, there are higher rates 
of ASD among individuals who have experienced interpersonal 
PTEs (20-50%), compared to non-interpersonal PTE (<20%) [5, 
p. 284]. Therefore, ASD is quite common and should be regularly 
assessed for among populations who have experienced recent PTEs. 
Although ASD is not always predictive of PTSD, individuals with 
ASD (and those without) should also be assessed for PTSD if they 
continue (or develop) significant symptoms of distress or impair- 
ment 1 month following a PTE. 


Adjustment disorders 





Until DSM-5, ADs constituted a cluster of residual diagnoses that 
occupied their own space in DSM-III or DSM-IV. Divided into 
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anxiety and depressed and disturbed conduct subtypes (or some 
combination of these), ADs were perhaps the most poorly character- 
ized set of diagnoses within DSM. There were no specific symptoms 
needed to make the diagnoses. Indeed, all that was necessary was 
prior exposure to an adverse event (PTE or OSE) and a clinician’s 
judgement that ‘significant distress or functional impairment’ had 
its onset because the exposure took place. In many cases, ADs have 
been considered subthreshold anxiety or mood disorders, although 
it has never been clear how disturbed conduct relates to better expli- 
cated DSM diagnoses. 

The AD diagnosis seems to be relatively common. According to 
DSM-5, ‘prevalence may vary widely as a function of the population 
studied and the assessment methods used’ [5, p. 287]. In outpatient 
cohorts, the prevalence ranges from 5% to 20%. In hospital psychi- 
atric consultation settings, the prevalence may reach 50% [5]. 

When considering ADs, the DSM-5 workgroup was faced with 
various definitions of AD—was it a bona fide diagnosis in its own 
right, a subthreshold anxiety/mood disorder (like dysthymia and 
cyclothymia), or something else? Given the lack of rigorous research 
and poor validation for the various AD subtypes, it was unclear 
how and whether they related to one another [38]. Some investi- 
gators formulated the diagnosis as a subthreshold PTSD syndrome 
[39], but again, evidence was lacking. Finally, given the DSM-5 
ground rules that no diagnostic criterion could be added, deleted, 
or modified without strong empirical evidence and given the almost 
complete lack of such evidence (because of the lack of published re- 
search) when it came to ADs, it is not surprising that the DSM-5 
version of ADs resembles the DSM-IV version very closely. 

There were two important decisions regarding ADs that should 
promote research, which should usefully guide future revisions of 
DSM-5. Firstly, ADs were removed from their own category and 
moved into the “Trauma and stressor-related disorders’ chapter of 
DSM-5. Secondly, ADs were reconceptualized as a stress-related dis- 
order that might share a common pathophysiology with PTSD and 
ASD, since failure to cope with an overwhelming stressor appears to 
characterize all of these disorders. 


Child-specific symptoms and disorders 





In addition to the mental health disorders described, there are spe- 
cific disorders for children within the trauma- and stressor-related 
disorders section of DSM-5. This includes PTSD, RAD, and DSED. 

PTSD can be diagnosed among children; however, as discussed, 
DSM-5 outlines several modifications for children under the age 
of 6 (the preschool subtype). These child-specific symptoms were 
suggested after compiling 15 years of research suggesting that there 
may be specific developmental manifestations of symptoms [13]. 
Specifically, intrusive memories may manifest as re-enactment of 
the traumatic event; children may not be able to connect the content 
of nightmares to the traumatic event, and irritable behaviour may 
manifest as extreme temper tantrums. It is important to consider 
these child-specific manifestations of symptomology related to trau- 
matic event exposure, as they differ from adults. 

RAD and DSED are both diagnoses for young children older than 
9 months. Both RAD and DSED are thought to be caused by social 
neglect, defined by DSM-5 as ‘the absence of adequate caregiving 
during childhood’ [5, p. 265]; therefore, this is a requirement for 


both diagnoses. DSM-IV included only one disorder—RAD—with 
two subtypes (inhibited and disinhibited), but now the two subtypes 
are considered separate disorders in DSM-5. RAD includes predom- 
inantly internalizing symptoms, while DSED includes predomin- 
antly externalizing symptoms. Both disorders are considered to be 
rare, and the prevalence, even among severely neglected children, 
ranges from 10% for RAD to 20% for DSED [5]. 

Diagnostic criteria for RAD include symptoms of emotional 
withdrawal towards an adult caregiver and marked social and 
emotional disturbance, both presumably caused by social neglect. 
Children who meet criteria for RAD must engage in both minimal 
comfort-seeking and minimal responding to comfort from the care- 
giver when distressed. Further, children must exhibit two of the 
following: social withdrawal prior to the age of 2, persistent lack of 
positive mood, and ‘episodes of unexplained irritability, sadness, or 
fearfulness’ with their caregivers [5, p. 265]. Children are eligible for 
this diagnosis between the ages of 9 months and 5 years. 

In contrast, DSED occurs when a child regularly interacts with 
unfamiliar adults in a disinhibited manner. Children who meet cri- 
teria for DSED must engage in two behaviours related to interacting 
with unfamiliar adults, including: (1) approaching unfamiliar adults; 
(2) in a disinhibited manner, including being overfamiliar with the 
unfamiliar adults; (3) without checking in with the caregiver; and 
(4) leaving with the unfamiliar adults. Similar to RAD, there must be 
a pattern of social neglect from the caregiver. Although both child- 
specific disorders are rare, they have the potential to have a signifi- 
cant impact on the children’s lives if left untreated. 


Other specified trauma- and 


stressor-related disorders 





In addition to the disorders described, there are several disorders 
that are considered to be trauma- and stressor-related but did not fit 
the previously referenced diagnoses. These disorders cause signifi- 
cant distress and functional impairment but are either not specified 
within the already described disorders, are culturally specific dis- 
orders, or require further study in order to determine their existence 
as separate disorders. These include adjustment-like disorders with 
delayed onset or prolonged duration, ataque de nervios and other 
culturally specific disorders, and unspecified trauma- and stressor- 
related disorders. Persistent complex bereavement disorder (PCBD) 
embodies symptoms otherwise categorized as complicated grief or 
prolonged grief disorder. Because it has not been established which 
symptoms best characterize pathological grief reactions, PCBD now 
only appears only in the DSM-5 appendix as a presumptive disorder 
that requires future research (for more information, see Chapter 84). 

Although there is currently not a specific diagnosis for those who 
have PTSD symptoms but fail to meet all of the diagnostic criteria 
of PTSD, this ‘diagnosis’ is currently best captured within the ‘Other 
trauma-related disorder’ category. This is sometimes referred to as 
subthreshold, subsyndromal, subclinical, or partial PTSD, but the 
term partial PTSD has achieved the most widespread usage [15]. 
Partial PTSD has been operationally defined several ways, but they all 
require individuals to have several PTSD symptoms. Unfortunately, 
there is no current consensus on a case definition for partial PTSD 
in DSM 5, since different investigators have used different criteria 
in their research. Achievement of such a case definition should be a 
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high priority for research in this field. As noted in Schnurr’s recent 
review [15], partial PTSD is prevalent, and individuals with partial 
PTSD often have lower rates of functional impairment/distress than 
those with full PTSD, but higher rates than their counterparts who 
were exposed to PTEs but who had no PTSD symptoms. 

Such findings are strikingly similar to what would be predicted 
if PTSD is continuous, as opposed to categorical. Also, many indi- 
viduals with partial PTSD seek clinical services or attempt to enrol 
in PTSD research studies. Lacking one PTSD symptom can some- 
times make the difference between having PTSD and having partial 
PTSD, and the former would be eligible for treatment services or 
research studies recruiting individuals with PTSD, whereas the latter 
would not. 


Co-occurring disorders 


Trauma- and stressor-related disorders often have several key co- 
occurring disorders. Many disorders may co-occur partially due to 
overlapping symptomology (that is, which include both depressive 
and anxiety disorders). Other disorders co-occur possibly as a means 
to cope with the symptoms associated with experiencing stress and 
trauma (that is, substance use disorders). Other disorders may also 
be related to PTE exposure (that is, borderline personality disorder 
and dissociative identity disorder). When the psychological trauma 
is accompanied by head injury (for example, combat and motor ve- 
hicle accidents), traumatic brain injury is often a co-occurring con- 
dition. Although a thorough discussion of co-occurring disorders 
is beyond the scope of the current chapter, it is important to men- 
tion that co-occurring disorders can significantly complicate treat- 
ment and therefore should be thoroughly assessed. Historically, it 
was thought that disorders like PTSD should not be treated if there 
are other disorders that may complicate treatment (that is, substance 
use disorders or borderline personality disorder). However, recent 
treatment outcome research indicates that concurrent treatment 
of trauma-related disorders and other disorders should be strongly 
considered due to its efficaciousness [27, 40, 41]. 


What is needed to move the science forward 


in understanding trauma- and stressor-related 
disorders? 





For a better understanding of how to best classify, assess, and treat 
trauma- and stressor-related disorders, more research is needed in 
several areas. More work is needed to integrate the dimensional ap- 
proach into the current categorical diagnostic criteria. With respect 
to PTSD, there are three major ways to accomplish this. Firstly, it is 
important to conduct taxometric analyses with DSM-5 PTSD symp- 
toms to confirm the continuous nature of PTSD latent structure using 
the new criteria. Secondly, there is great merit to utilize continuous, 
as well as categorical, measures of PTSD symptoms/symptom fre- 
quency/symptom intensity in research and clinical practice. Thirdly, 
both research and clinical practice should make more use of the par- 
tial PTSD construct, because it is clear that individuals not meeting 
full diagnostic criteria still have significant problems that could 
benefit from increased research and clinical attention. Moreover, in- 
creased use of the partial PTSD construct might also be useful in 


addressing discrepancies in PTSD caseness using DSM-5 vs DSM- 
IV criteria. To the best of our knowledge, no research has been done 
to date, in which comparisons of the overlap between PTSD cases 
using DSM-IV and DSM-5 criteria also included individuals with 
partial PTSD. Our hypothesis is that the inclusion of partial PTSD 
cases would substantially increase the diagnostic concordance and 
reduce discrepancies. 


PTSD assessment 


There is a substantial literature on specific structured interview and 
self-report measures that have been used to assess PTSD [42, 43], 
so we will make three general points, rather than provide a detailed 
discussion of assessment measures. Firstly, an accurate assessment 
cannot be conducted without a thorough assessment of PTE ex- 
posure across the lifespan. In clinical, as well as research, settings, 
this requires asking a series of behaviourally specific questions about 
all types of relevant PTE exposure. 

Secondly, you cannot accurately assess for PTSD or partial PTSD 
unless your assessment asks about all 20 DSM PTSD symptoms. It 
is relatively common for some PTSD assessment instruments to use 
screening questions or skip-outs in which individuals are not asked 
about all PTSD symptoms once it becomes clear that they cannot 
meet the full diagnostic criteria. However, these skip-outs make it 
impossible to determine whether someone has partial PTSD, and we 
believe that a strong case can be made that skip-outs should never be 
used in PTSD assessment. 

Thirdly, the typical approach to PTSD assessment in multiple PTE 
exposure cases is to assess PTSD symptoms in reference to one index 
PTE (for example, the most recent PTE, the most serious or worse 
PTE, a PTE of particular interest such as rape or military combat, 
or a randomly selected PTE). However, exposure to multiple PTEs 
can result in composite PTSD, defined as meeting PTSD diagnostic 
criteria in which the PTSD symptoms incorporate content related 
to more than one PTE [16]. This requires a different assessment ap- 
proach that first determines whether individuals have each PTSD 
symptom and then determines which PTE or PTEs caused or ag- 
gravated the symptom. Not surprisingly, composite PTSD was more 
prevalent than PTSD to a single event, and its prevalence increased 
as a function of the number of PTEs that were experienced [16]. We 
believe that a challenge to PTSD researchers and clinicians will be 
to develop assessment strategies that better address the issue of how 
multiple PTE exposure relates to PTSD symptomatology. Further, 
more research concerning the other trauma- and stressor-related 
disorders, specifically with respect to PCBD and ADs. Research is 
needed regarding the diagnostic criteria of ADs and PCBD and how 
these disorders may or may not overlap with other disorders within 
this diagnostic cluster. 

It remains to be seen whether there are similar or overlapping psy- 
chobiological consequences of traumatic event exposure in PTSD 
and ASD, adverse experiences in ADs, and insufficient care in RAD 
and DSED. Research on biomarkers and underlying pathophysio- 
logical abnormalities promises to improve our understanding of how 
effective psychobiological mechanisms for stress and adaptation are 
dysregulated in the five trauma- and stressor-related disorders. 


Are there different PTSD phenotypes? 


By the time DSM-6 is published, it is possible that there will be no 
longer a PTSD diagnosis, as we have known it since DSM-III. When 
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first introduced in 1980, PTSD was revolutionary because it valid- 
ated the notion that an overwhelming stressor can produce stable 
alterations in cognitions, emotions, and behaviours that result in 
recognizable symptom clusters. The explosion of research, since 
that time, has made it abundantly clear that there are a variety of 
post-traumatic clinical presentations that meet diagnostic criteria 
for PTSD. Currently, this is best exemplified in DSM-5 ASD, as dis- 
cussed previously. Indeed, DSM-5 PTSD has been criticized because 
it has so many different symptoms, in contrast to most other psychi- 
atric disorders such as major depressive disorder and most anxiety 
disorders [17, 44]. 

To us, the best explanation for such complexity is that we have 
tried to sweep all significant post-traumatic symptomatology under 
a single PTSD rug. Indeed, based on clinical phenomenology alone, 
there appear to be four different PTSD phenotypes: (1) an adren- 
ergic fear-based anxiety disorder; (2) an anhedonic/dysphoric dis- 
order; (3) an externalizing disorder; and (4) a dissociative disorder 
[45]. It is likely that the ongoing Research Domain Criteria (RDoC) 
approach will identify different genetic or other biomarkers that 
may be more strongly associated with one phenotype than another 
(see Chapter 8 for more on RDoC). For example, the dissociative 
phenotype (currently categorized as PTSD’s dissociative subtype) 
is characterized by unique brain imagery patterns and treatment 
responsivity [22]. Evidence for an adrenergic subtype is indicated by 
selective responsivity to the adrenergic antagonist prazosin among 
soldiers with PTSD who recorded the highest blood pressure [46]. 
A specific biological marker dopamine-beta-hydroxylase (DBH), 
which catalyses the conversion of dopamine to noradrenaline, is 
uniquely associated with Criterion B intrusion symptoms; indeed, 
preliminary data have shown that plasma DBH levels appear to be 
causally related to the development of PTSD intrusion symptoms 
[44]. We expect that progress along these fronts may eventually re- 
sult in explication of a number of more specific post-traumatic syn- 
dromes, rather than the very complex array of clinical presentations 
currently embodied in the solitary DSM-5 PTSD diagnosis. A list of 
potential phenotypes might include: post-traumatic hyperarousal/ 
hyperadrenergic, post-traumatic anhedonic/serotonin-deficient, 
post-traumatic dissociative, etc. syndromes. If and when we can 
confidently identify different PTSD phenotypes, it is likely to have 
two major implications for diagnosis and treatment. Firstly, it may 
result in a spectrum of post-traumatic syndromes differentiated by a 
more concise set of diagnostic criteria that will include phenotype- 
specific biomarkers. Secondly, it may facilitate treatment matching, 
so that individuals with a certain phenotype will receive treatments 
best suited for their specific post-traumatic phenotype. 

In short, DSM-5 has opened the way for considering PTSD as a 
spectrum disorder, in which research on phenotypes is the inevitable 
next step. We look forward to the future identification of distinct post- 
traumatic diagnoses with different underlying pathophysiologies 
that will respond best to different optimal treatments. 


Conclusions 


The diagnostic criteria and classification for trauma- and stressor- 
related disorders have undergone significant changes with DSM-5 
and are proposed to include even more drastic changes within ICD- 
11. The iterative changes in diagnostic criteria and classification 


highlights not only the extensive research that has gone into under- 
standing these phenomena, but also the challenges that arise when 
examining disorders that stem from PTEs and OSEs. Given the cur- 
rent knowledge, there are several needed next steps to understand 
how to best classify trauma- and stressor-related disorders which in- 
cludes, but is not limited to: (1) further clarifying partial PTSD and 
deciding whether classification under a dimensional vs a categorical 
approach would be beneficial; (2) continuing to refine assessments 
to ensure that PTEs and OSEs are thoroughly assessed and symp- 
toms after these experiences are best captured; and (3) continuing 
to examine potential phenotypes of trauma- and stressor-related 
disorders. 
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‘Stress’ is a commonly used word that generally refers to experi- 
ences that cause feelings of anxiety and frustration, because they 
push us beyond our ability to successfully cope. ‘Stress’ has be- 
come a major focus of modern life, but what do we mean by ‘stress’? 
Besides time pressures and daily hassles at work and home, there 
are stressors related to economic insecurity, loneliness, poor health, 
and interpersonal conflict. More rarely, there are situations that are 
life-threatening—accidents, natural disasters, violence—and these 
evoke the classical ‘fight-or-flight’ response and can cause traumatic 
memories and post-traumatic stress disorder (PTSD). 

The most common stressors are the ones that operate chronic- 
ally, often at a low level, and that cause us to behave in certain ways. 
For example, being ‘stressed out’ may cause us to be anxious and/or 
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depressed, to lose sleep at night, to eat comfort foods and take in more 
calories than our bodies need, and to smoke or drink alcohol exces- 
sively. Being ‘stressed out’ may also cause us to neglect seeing friends, 
or to take time off from our work, or to reduce our engagement in 
regular physical activity as we, for example, sit at a computer and try 
to get out from under the burden of too much to do. Often we are 
tempted to take medications—anxiolytics, sleep-promoting agents— 
to help us cope, and, with time, our bodies may increase in weight 
and develop other symptoms of an unhealthy lifestyle. Thus, health- 
promoting and health-damaging behaviours must be included when 
we discuss ‘stress, and this leads us to the concepts of ‘allostasis’ and 
‘allostatic load and overload’ that will be discussed in this chapter. 
The brain is the organ that determines what is stressful and the be- 
havioural and physiological responses, whether health-promoting 
or health-damaging (Fig. 79.1). And the brain is a biological organ 
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Fig. 79.1 Central role of the brain in allostasis and the behavioural and physiological response to stressors [122]. 
Reproduced from New Eng J Med, 338(3), McEwen BS, Protective and Damaging Effects of Stress Mediators, pp. 171-9, Copyright (1998), with permission from Massachusetts 


Medical Society. 
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that changes in its architecture, molecular profile, and neurochem- 
istry under acute and chronic stress and directs many systems of 
the body—metabolic, cardiovascular, immune—that are involved 
in the short- and long-term consequences of being ‘stressed out’ 
and the consequent health-damaging behaviours. This chapter 
summarizes some of the current information, placing emphasis on 
how the stress hormones and other hormones can play both pro- 
tective and damaging roles in the brain and body, depending on 
how tightly their release is regulated and how harmoniously the 
network of stress mediators functions. The chapter discusses impli- 
cations for mental, as well as physical, health and describes some of 
the approaches for dealing with stress in our complex world. 


Types of stress 


This chapter will use the following classifications of the types of stress: 
good stress, tolerable stress, and toxic stress (Box 79.1). See (http:// 
developingchild.harvard.edu/library/reports_and_working_papers/ 
policy_framework/) for paper related to toxic stress. 

Good stress is a term used in popular language to refer to the ex- 
perience of rising to a challenge, taking a risk, and feeling rewarded 
by an often positive outcome. A related term is ‘eustress. Good self- 
esteem and good impulse control and decision-making capability, 
all functions of a healthy architecture of the brain, are important 
here! Even adverse outcomes can be ‘growth experiences’ for indi- 
viduals with such positive, adaptive characteristics that promote re- 
silience in the face of adversity. 

“Tolerable stress’ refers to those situations where bad things 
happen, but the individual with a healthy brain architecture is able 
to cope, often with the aid of family, friends, and other individuals 
who provide support. These adverse outcomes can be ‘growth ex- 
periences’ for individuals with such positive, adaptive characteristics 
and support systems that promote resilience. Here, ‘distress’ refers to 
the uncomfortable feeling related to the nature of the stressor and 
the degree to which the individual feels a lack of ability to influence 
or control the stressor [1-3]. 

Finally, ‘toxic stress’ refers to the situation in which bad things 
happen to an individual who has limited support and who may also 


Box 79.1 Levels of stressful experiences: their causes, 


consequences, and why we experience them! 





e Result: sense of mastery and control 
e HEALTHY BRAIN ARCHITECTURE 

— Good self-esteem, judgement, and impulse control 
Tolerable stress 
e Adverse life events buffered by supportive relationships 
e Result: coping and recovery 
e HEALTHY BRAIN ARCHITECTURE 
— Good self-esteem, judgement, and impulse control 





Toxic stress 
e Unbuffered adverse events of greater duration and magnitude 
e Result: poor coping and compromised recovery 
e Result: increased life-long risk for physical and mental disorders 
e COMPROMISED BRAIN ARCHITECTURE 

— Dysregulated physiological systems 


























have a brain architecture that reflects the effects of adverse early life 
events that have impaired the development of good impulse control 
and judgement and adequate self esteem. Here, the degree and/or 
duration of ‘distress’ may be greater. With toxic stress, the inability 
to cope is likely to have adverse effects on behaviour and physiology, 
and this will result in a higher degree of allostatic overload, as will be 
explained later in this chapter. 


Definition of stress, allostasis, and allostatic load 





In spite of the further definitions of the types of stress, the word 
‘stress’ is still an ambiguous term and has connotations that make 
it less useful in understanding how the body handles the events that 
are stressful. Insight into these processes can lead to a better under- 
standing of how best to intervene, a topic that will be discussed at 
the end of this article. There are two sides to this story—on the one 
hand, the body responds to almost any event or challenge, whether 
or not we call it ‘stress, by releasing chemical mediators, for example 
catecholamines that increase the heart rate and blood pressure, that 
help us cope with the situation; on the other hand, chronic elevation 
of these same mediators, for example chronically increased heart rate 
and blood pressure, produces chronic wear-and-tear on the cardio- 
vascular system that can result, over time, in disorders such as strokes 
and heart attacks. For this reason, the term ‘allostasis’ was introduced 
by Sterling and Eyer in 1988 to refer to the active process by which the 
body responds to daily events and maintains homeostasis (allostasis 
literally means ‘achieving stability through change’). Because chron- 
ically increased allostasis can lead to disease, we introduced the 
term “allostatic load or overload’ to refer to the wear-and-tear that 
results from either too much stress or from the inefficient manage- 
ment of allostasis, for example not turning off the response when it is 
no longer needed. Other forms of allostatic load are summarized in 
Fig. 79.2 and involve not turning on an adequate response in the first 
place or not habituating to the recurrence of the same stressor and 
thus dampening the allostatic response. 


Protection and damage as the two sides of the 





response to experiences 


Protection via allostasis and wear-and-tear on the body and brain 
via allostatic load/overload are the two contrasting sides of the 
physiology involved in defending the body against the challenges of 
daily life. Besides adrenaline and noradrenaline, there are many me- 
diators that participate in allostasis, and they are linked together in a 
network of regulation that is non-linear, meaning that each mediator 
has the ability to regulate the activity of the other mediators, some- 
times in a biphasic manner [4]. Glucocorticoids, produced by the 
adrenal cortex in response to ACTH from the pituitary gland, is the 
other major ‘stress hormone’ Pro- and anti-inflammatory cytokines 
are produced by many cells in the body, and they regulate each other 
and are, in turn, regulated by glucocorticoids and catecholamines. 
Whereas catecholamines can increase pro-inflammatory cytokine 
production, glucocorticoids are known to inhibit this production. 
And yet, there are exceptions—pro-inflammatory effects of gluco- 
corticoids that depend on the dose and cell or tissue type [5, 6]. The 
parasympathetic nervous system also plays an important regulatory 
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Fig. 79.2 Four types of allostatic load. The top panel illustrates the normal allostatic response, in which a response is initiated by a stressor, sustained 
for an appropriate interval, and then turned off. The remaining panels illustrate four conditions that lead to allostatic load: top left, repeated ‘hits’ from 
multiple stressors; top right, lack of adaptation; bottom left, prolonged response due to delayed shutdown; and bottom right, inadequate response that 
leads to compensatory hyperactivity of other mediators (for example, inadequate secretion of glucocorticoid, resulting in increased levels of cytokines 


that are normally counter-regulated by glucocorticoids) [122]. 


Reproduced from New Eng J Med, 338(3), McEwen BS, Protective and Damaging Effects of Stress Mediators, pp. 171-9, Copyright (1998), with permission from Massachusetts 


Medical Society. 


role in this non-linear network of allostasis, since it generally op- 
poses the sympathetic nervous system and, for example, slows the 
heart and also has anti-inflammatory effects [7, 8]. 

What this non-linearity and interaction among mediators mean is 
that when any one mediator is increased or decreased, there are com- 
pensatory changes in the other mediators that depend on the time 
course and level of change of each of the mediators. Unfortunately, 


we cannot measure all components of this system simultaneously, 
and we must therefore rely on measurements of only a few of them 
in any one study. Yet the non-linearity must be kept in mind in 
interpreting the results. 

A good example of the biphasic actions of stress, that is, ‘pro- 
tection vs damage, is in the immune system, in which an acute 
stressor activates an acquired immune response via mediation by 
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catecholamines and glucocorticoids and locally produced immune 
mediators; and yet a chronic exposure to the same stressor over sev- 
eral weeks has the opposite effect and results in immune suppres- 
sion [9, 10]. Acute stress-induced immune enhancement is good 
for enhancing immunization, fighting an infection, or repairing a 
wound, but it is deleterious to health for autoimmune conditions 
such as psoriasis or Crohn's disease. On the other hand, immune 
suppression is good in the case of an autoimmune disorder and dele- 
terious for fighting an infection or repairing a wound. In immune- 
sensitive skin cancer, acute stress is effective in inhibiting tumour 
progression, while chronic stress exacerbates progression [11, 12]. 


Stress and disease 


Cardiovascular disease and depression are recognized as conse- 
quences of ‘toxic psychological stress, using the definition of toxic 
stress discussed earlier [13]. There are conditions that lead to 
allostatic load and overload and exacerbate pathophysiology, besides 
the well-known factors associated with diet, alcohol consumption, 
and smoking and other unhealthy behaviours (Box 79.2). We shall 
discuss physical activity and lack thereof below, in connection with 
interventions. In particular, the social and physical environments are 
huge factors. Human beings are social creatures, and loneliness is a 
contributor to allostatic overload. Likewise, neighbourhoods matter 
and living in ugly, noisy, and dangerous neighbourhoods with a lack 
of green space contribute to allostatic overload. 


Circadian disruption 


One of the key systems in the brain and body that regulate homeo- 
stasis of these varied physiological and behavioural variables is the 
circadian system. Based in the suprachiasmatic nucleus (SCN) of 
the hypothalamus, the brain’s clock controls the rhythms in the rest 
of the brain and body through both neural and diffusible signals. 
Disruption of these key homeostatic systems contributes to allostatic 
overload [14]. Reduced sleep duration is associated with increased 
body mass and obesity, and sleep restriction to 4 hours per night 
increases blood pressure, decreases parasympathetic tone, increases 
evening cortisol and insulin levels, and promotes increased appetite, 
possibly through the elevation of ghrelin, a pro-appetitive hormone, 
along with decreased levels of leptin. Moreover, pro-inflammatory 
cytokine levels are increased with sleep deprivation, along with de- 
creased performance in tests of psychomotor vigilance, and this has 
been reported to result even from a modest sleep restriction to 6 
hours per night [14, 15]. 

Circadian disruption has sometimes been overlooked as a sep- 
arate, yet related, phenomenon to sleep deprivation. In modern 


Box 79.2 Causes of allostatic load/overload [122] 


e Loneliness [123] 

e Circadian disruption: jet lag, shift work, sleep deprivation [14, 15, 17, 82] 

e Lack of physical activity [124] 

e Ugly, noisy, dangerous living environment and lack of green space 
[2, 23, 24] 

e Health behaviours: type and quantity of food, alcohol consumption, 
smoking 











industrialized societies, circadian disruption can be induced in nu- 
merous ways, the most common of which are shift work and jet lag 
that contribute to weight gain and obesity (reviewed in [14]). Animal 
models have provided additional insights; for example, normal 
C57B1/6 mice housed in a disrupted 10-hour light:10-hour dark 
cycle showed accelerated weight gain and disruptions in metabolic 
hormones and also showed cognitive inflexibility and shrinkage of 
dendrites in the medial prefrontal cortex [16], similar to reports of 
long-recovery vs short-recovery flight crews where short-recovery 
crews had impaired performance in a psychomotor task, reacting 
more slowly and with more errors [17]. 





Key role of the brain in response to stress 


The brain is the key organ of the stress response because it deter- 
mines what is threatening, and therefore stressful, and also controls 
the behavioural and physiological responses (Fig. 79.1). There are 
enormous individual differences in the response to stress, based 
upon the experience of the individual early in life and in adult life. 
Positive or negative experiences in school, at work, or in romantic 
and family interpersonal relationships can bias an individual to- 
wards either a positive or a negative response in a new situation. 

Early life experiences carry an even greater weight in terms of how 
an individual reacts to new situations. Early life physical and sexual 
abuse carry with it a life-long burden of behavioural and patho- 
physiological problems [18]. Cold and uncaring families produce 
long-lasting emotional problems in children [19]. Some of these ef- 
fects are seen on the brain structure and function and in the risk for 
later depression and PTSD [20]. One of the biological consequences 
of early life adversity is prolonged elevation of inflammatory cyto- 
kines, as well as poor dental health, obesity, and elevated blood pres- 
sure, in children and young adults [21]. Harsh language is among the 
components of early life adversity and has been shown to increase 
inflammatory markers [22]. 

And the physical environment makes a huge difference, with 
crowding, noise, and ugliness, along with physical danger, being 
major contributors to allostatic overload both during development 
and throughout adult life [2, 23, 24]. 

Animal models have been useful in providing insights into be- 
havioural and physiological mechanisms. Individual differences 
in anxiety-like behaviours are evident [25, 26]. Early life maternal 
care in rodents is a powerful determinant of life-long emotional re- 
activity and stress hormone reactivity, and increases in both are as- 
sociated with earlier cognitive decline and a shorter lifespan [27]. 
Effects of early maternal care are transmitted across generations by 
the subsequent behaviour of the female offspring as they become 
mothers, and methylation of DNA in key genes appears to play a role 
in this epigenetic transmission [28, 29]. Yet, the mother is not the 
sole determinant of offspring emotional and physical development, 
but rather modulates it by her behaviour towards the infant, par- 
ticularly in the immediate aftermath of infant experiences of novelty 
inside or outside of the home cage [30]. 

Furthermore, in rodents, abuse of the young is associated with an 
attachment, rather than an avoidance, of the abusive mother, an ef- 
fect that increases the chances that the infant can continue to obtain 
food and other support until weaning [31]. Moreover, other con- 
ditions that affect the rearing process can also affect emotionality 
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in offspring. For example, uncertainty in the food supply for rhesus 
monkey mothers leads to increased emotionality in offspring and 
possibly an earlier onset of obesity and diabetes [32, 33]. 

Besides the important role of the social and physical environment 
and experiences of individuals in health outcomes, genetic factors 
also play an important role. Different alleles of commonly occurring 
genes determine how individuals will respond to experiences. For 
example, the short form of the serotonin transporter is associated 
with a number of conditions such as alcoholism, and individuals 
who have this allele are more vulnerable to respond to stressful ex- 
periences by developing depressive illness [34, 35]. In childhood, in- 
dividuals with an allele of the monoamine oxidase A gene are more 
vulnerable to abuse in childhood and more likely to themselves be- 
come abusers and to show antisocial behaviours, compared to indi- 
viduals with another commonly occurring allele [36]. Nevertheless, 
in a positive, nurturing environment, as formulated by Suomi and 
by Tom Boyce and colleagues [37-39], these same alleles may lead 
to successful outcomes, which has led them to be called ‘reactive or 
context-sensitive alleles, rather than ‘bad genes: 


Brain as a target of stress 


The hippocampus 


One of the ways that stress hormones modulate function within the 
brain is by changing the structure of neurons. The hippocampus is 
one of the most sensitive and malleable regions of the brain and is 
also very important in cognitive function and mood regulation [40]. 
The dentate gyrus (DG)-CA3 system, which is delicately balanced 
anatomically, and thus vulnerable to overstimulation, as in seizures, 
is believed to play a role in the memory of sequences of events, al- 
though long-term storage of memory occurs in other brain regions 
[41]. Moreover, the anterior part of the hippocampus has strong 
connections to the amygdala and prefrontal cortex and is a nexus of 
vulnerability to depression. But, because the DG-CA3 system is so 
delicately balanced in its function and vulnerability to damage, there 
is also adaptive structural plasticity, in that new neurons continue to 
be produced in the dentate gyrus throughout adult life [42], and CA3 
pyramidal cells undergo reversible remodelling of their dendrites in 
conditions such as hibernation and chronic stress [43, 44]. The role 
of this plasticity may be to protect against permanent damage. As a 
result, the hippocampus undergoes a number of adaptive changes in 
response to acute and chronic stress via a host of cellular and mo- 
lecular mechanisms [45] and also shows positive effects of regular 
physical activity on hippocampal volume and memory [46]. 


Prefrontal cortex and amygdala 


Repeated stress also causes changes in other brain regions such as 
the prefrontal cortex and amygdala. Repeated stress causes den- 
dritic shortening in the medial prefrontal cortex [47] but produces 
dendritic growth in neurons in the amygdala, as well as in the 
orbitofrontal cortex. Excitatory amino acids and BDNF are involved 
[47-49]. 


Contrasting effects 


Acute stress induces spine synapses in the CA1 region of the hippo- 
campus, and both acute and chronic stress also increases spine 


synapse formation in the amygdala, but chronic stress decreases it 
in the hippocampus. Moreover, chronic stress for 21 days or longer 
impairs hippocampal-dependent cognitive function and enhances 
amygdala-dependent unlearnt fear and fear conditioning, which are 
consistent with the opposite effects of stress on hippocampal and 
amygdala structure [49]. Chronic stress also increases aggression 
between animals living in the same cage, and this is likely to reflect 
another aspect of hyperactivity of the amygdala [50]. Behavioural 
correlates of remodelling in the prefrontal cortex include impair- 
ment in attention set shifting, possibly reflecting structural remod- 
elling in the medial prefrontal cortex [51]. 


Sex differences 


Animal models of stress effects on the brain show that females and 
males respond differently to acute and chronic stressors because of 
developmental factors involving both epigenetic effects of hormones 
along with genes in the sex chormosomes themselves [52]. Sex dif- 
ferences in the brain are subtle but widespread [53], and yet males 
and females do many things equally well; for example, in human 
subjects taking tests on empathy, men and women do equally well, 
but the brain activation patterns during the tests show different 
brain regions are activated [54]. This is reminiscent of an animal 
model study in which, despite no overall sex differences in fear con- 
ditioning freezing behaviour, the neural processes underlying suc- 
cessful or failed extinction maintenance are sex-specific [55]. Given 
other work showing sex differences in stress-induced structural 
plasticity in prefrontal cortex projections to the amygdala and other 
cortical areas [56], these findings are relevant not only to sex differ- 
ences in fear conditioning and extinction, but, according to Gruene 
et al. ‘also to exposure-based clinical therapies, which are similar in 
premise to fear extinction and which are primarily used to treat dis- 
orders that are more common in women than in mer [55]. 


Translation to the human brain 





MRI and fMRI imaging studies of stress, depression, 
and Cushing's disease 


Much of the impetus for studying the effects of stress on the struc- 
ture of the human brain has come from the animal studies sum- 
marized thus far. Although there is very little evidence regarding 
the effects of ordinary life stressors on brain structure, there are 
indications from functional imaging of individuals undergoing or- 
dinary stressors, such as counting backwards, that there are lasting 
changes in neural activity [57], anda 20-year history of chronic per- 
ceived stress has been linked to smaller hippocampal volume [58]. 
Moreover, the study of depressive illness and anxiety disorders has 
also provided some insights. Life events are known to precipitate 
depressive illness in individuals with certain genetic predisposi- 
tions [34, 59, 60]. Moreover, brain regions such as the hippo- 
campus, amygdala, and prefrontal cortex show altered patterns of 
activity in PET and fMRI and also demonstrate changes in volume 
of these structures with recurrent depression—decreased volume 
of the hippocampus and prefrontal cortices and the amygdala 
[61-64] (Fig. 79.3). Interestingly, amygdala volume has been re- 
ported to increase in the first episode of depression, whereas 
hippocampal volume is not decreased [65, 66]. It has been known 
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Fig. 79.3 Non-linear network of mediators of allostasis involved in the stress response. Arrows indicate that each system regulates the others in 
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for some time that stress hormones, such as cortisol, are involved 
in psychopathology, reflecting emotional arousal and psychic dis- 
organization, rather than the specific disorder per se [67]. We now 
know that adrenocortical hormones enter the brain and produce a 
wide range of effects upon it. 

In Cushing’s disease, there are depressive symptoms that can be 
relieved by surgical correction of hypercortisolaemia [68, 69]. Both 
major depression and Cushing's disease are associated with chronic 
elevation of cortisol that results in gradual loss of minerals from 
bone and abdominal obesity. In major depressive illness, as well 
as in Cushing’s disease, the duration of the illness, and not the age 
of the subjects, predicts a progressive reduction in volume of the 
hippocampus, determined by structural MRI [61, 70]. Moreover, 
there are a variety of other anxiety-related disorders, such as PTSD 
[71, 72] and borderline personality disorder [73], in which at- 
rophy of the hippocampus has been reported, suggesting that this 
is a common process reflecting chronic imbalance in the activity 
of adaptive systems, such as the HPA axis, but also including en- 
dogenous neurotransmitters such as glutamate. 


Glucose regulation 


Another important factor in hippocampal volume and function is 
glucose regulation. Poor glucose regulation is associated with smaller 
hippocampal volume and poorer memory function in individuals 
in their 60s and 70s who have ‘mild cognitive impairment’ (MCI) 
[74-76], and both MCI and type 2, as well as type 1, diabetes are rec- 
ognized as risk factors for dementia [77-80]. Moreover, depression 
is often associated with insulin resistance (IR) and metabolic syn- 
drome, and treatment with the peroxisome proliferator-activated 
receptor (PPAR) gamma agent pioglitazone showed improve- 
ment in depressive symptoms, but only in subjects with IR (81). 
Inflammation is also a factor related to many disorders of modern 
life, and elevation of IL-6 has been linked to reduced hippocampal 
volume [82], as well as to poor sleep (83). Regarding allostatic over- 
load, a bifactorial model for calculating its impact on poor health 


shows that mediators of inflammation, glucose level, and lipids are 
the most salient [84]. 


Insulin resistance 


The original hypothesis that IR is a missing link between mood dis- 
orders and dementia [80, 85] has recently been supported by data on 
all components of the model, starting with the connection between 
depressive disorders and IR [86, 87] and also confirming a role of 
depressive disorders in accelerating the onset of dementia [88-90]. 
As IR is a modifiable metabolic pro-inflammatory state underlying 
type 2 diabetes mellitus, cumulative data indicate that, in middle- 
aged adults, IR is associated with disrupted memory and executive 
function and a corresponding metabolic decline in the medial pre- 
frontal cortex, reductions in hippocampal volumes, and aberrant 
intrinsic connectivity between the hippocampus and the medial pre- 
frontal cortex [91-93]. These findings are supported by recent work 
in animal models, in which antisense inactivation of the insulin re- 
ceptor in the hippocampus leads to cognitive impairment without 
systemic consequences [94], whereas antisense inactivation of the 
hypothalamic insulin receptor creates systemic insulin resistance 
and dyslipidaemia and also insulin resistance in the hippocampus, 
along with depressive-like behaviour and cognitive impairment 
[95]. Remarkably, these changes are reversed by dietary restriction 
[96, 97], indicating that the brain can be resilient. 


Progression towards dementia 


Yet there is, at some point, a ‘switch’ that triggers irreversible changes 
that lead towards amyloid beta (Abeta) toxicity and dementia [89]. 
These authors point out that synaptic NMDA receptor activation 
normally has an antioxidant role by suppressing FOXO1 transcrip- 
tion factor in the hippocampus, but abnormal and excessive NMDA 
activation in the insulin-resistant state appears to enable FOXO1 
translocation to the cell nucleus, leading to the generation of reactive 
oxygen species and possibly also activation of stress kinases, which 
further impairs insulin signalling. Moreover, Abeta production is 
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accelerated and Abeta oligomers enter into a vicious cycle, leading 
to further damage [89]. Mitochondrial function declines under 
these conditions and contributes to the positive feedback cycle of 
toxicity [98]. 


Epigenetics: two meanings that are both important 


for prevention and treatment 





‘Epigenetics’ now refers to events ‘above the genome’ that regulate 
the expression of genetic information without altering the DNA 
sequence. Besides CpG methylation described earlier, other mech- 
anisms include histone modifications that repress or activate chro- 
matin unfolding [99] and the actions of non-coding RNAs [100], 
as well as transposons and retrotransposons [101] and RNA editing 
[102]. For prevention and treatment, in the spirit of integrative 
medicine, it is important to let the ‘wisdom of the body’ prevail and 
to focus upon strategies that centre around the use of targeted be- 
havioural therapies, along with treatments, including pharmaceut- 
ical agents, that ‘open up windows of plasticity’ in the brain and 
facilitate the efficacy of the behavioural interventions [103]. This is 
because a major challenge throughout the life course is to find ways 
of redirecting future behaviour and physiology in more positive and 
healthy directions [104]. In keeping with the original definition of 
epigenetics [105] as the emergence of characteristics not previously 
evident or even predictable from an earlier developmental stage 
(for example, think about a fertilized frog or human egg which look 
similar and what happens as each develops!), we do not mean ‘re- 
versibility’ as in ‘rolling back the developmental clock’ but rather fre- 
direction as well as ‘resilience’, which can be defined as achieving a 
successful outcome in the face of adversity’. 


Interventions that change the brain and 





improve health 


Can the effects of stress and adverse early life experiences on the 
brain be treated and compensated, even though there are no ‘magic 
bullets’ like penicillin for stress-related disorders [104]? For psychi- 
atric illnesses such as depression and anxiety disorders, including 
PTSD, it is necessary to complement, and even replace, existing 
drugs and adopt strategies that centre around the use of targeted 
behavioural therapies, along with treatments, including pharma- 
ceutical agents, that open up ‘windows of plasticity’ in the brain 
and facilitate the efficacy of the behavioural interventions [103, 
106, 107]. To that extent, meeting the demands imposed by stressful 
experiences via various coping resources can lead to growth, adapta- 
tion, and learning to promote resilience and improved mental health 
[108, 109]. BDNF is a mediator of plasticity and, while it can facili- 
tate beneficial plasticity (for example, see [48]), it should be noted 
that BDNF also has the ability to promote pathophysiology, as in 
seizures [110-112]. 


How the brain gets ‘stuck’ 


Depression and anxiety disorders are examples of a loss of resili- 
ence, in the sense that changes in brain circuitry and function, 
caused by stressors that precipitate the disorder, become ‘locked’ in 


Box 79.3 Reactivating and redirecting brain plasticity 





e ‘Releasing the brakes’ that retard structural and functional plasticity 
[105]. Example amblyopia first done using fluoxetine [121] and cal- 
oric restriction [125], in which reducing inhibitory neuronal activity 
appears to play a key role. Replicated by putting cortisol in drinking 
water, instead of caloric restriction [125]. NOTE: ultradian fluctuations 
of cortisol according to a diurnal pattern modulate turnover of some 
spine synapses, in relation to motor learning and possibly other 
forms of learning [126, 127]. 

e Opening windows of plasticity with physical activity: regular phys- 

ical activity, which has actions that improve prefrontal and parietal 

cortex blood flow and enhance executive function [128], increases 

hippocampal volume in previously sedentary elderly adults [46, 129] 

and complements work showing that fit individuals have larger 

hippocampal volumes than sedentary adults of the same age range 

[130]. Regular physical activity is an effective antidepressant and 

protects against cardiovascular disease, diabetes, and dementia 

[131, 132]. Moreover, intensive learning has also been shown to in- 

crease the volume of the human hippocampus, based on a study on 

medical students [133]. 

Successful cognitive behavioural therapies, which are tailored to individual 

needs, can produce volumetric changes in both the prefrontal cortex 

in the case of chronic fatigue [134] and in the amygdala in the case of 

chronic anxiety [135], and in a brainstem area associated with well- 

being [136]. Mindfulness-based stress reduction (MBSR) practice has 
been shown to increase regional brain grey matter density in the hippo- 
campus, cerebellum, and prefrontal cortex, the brain regions involved 
in learning and memory processes, emotion regulation, self-referential 
processing, and perspective-taking [137]. Enhancing self-regulation of 

mood and emotion appears to be an important outcome [138, 139]. 

Meditation is reported to enlarge the volume of the hippocampus 

and to do so differently in men and women, suggesting to the au- 

thors that mindfulness practices operate differently in males and 

females [140]. 























a particular state and thus need external intervention. Indeed, pro- 
longed depression is associated with shrinkage of the hippocampus 
[62, 113] and prefrontal cortex [63]. While there appears to be no 
neuronal loss, there is evidence for glial cell loss and smaller neur- 
onal cell nuclei [114, 115], which is consistent with shrinking of the 
dendritic tree, described earlier, after chronic stress. Indeed, a few 
studies indicate that pharmacological treatment may reverse the de- 
creased hippocampal volume in unipolar [116] and bipolar [117] 
depression, but the possible influence of concurrent cognitive be- 
havioural therapy in these studies is unclear. 

Even in adulthood, gene expression in the brain continually 
changes with experience [118] and there is loss of resilience of the 
neural architecture with ageing [119] that can be redirected by ex- 
ercise [46] and by pharmacological intervention [120, 121]. Beyond 
ageing, there are new approaches to ‘opening windows of plasticity’ 
and redirecting the brain towards a more health-promoting state, 
and these are summarized in Box 79.3. 


Conclusions: what can one do about being 


‘stressed out’? 





If being ‘stressed out’ has such pervasive effects on the brain, as well 
as the body, what are the ways in which individuals, as well as policy- 
makers in government and business, can act to reduce the negative 
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effects and enhance the ability of the body and brain to deal with 
stress with minimal consequences? The answers are simple and ob- 
vious, but often difficult to achieve. 

From the standpoint of the individual, a major goal should be 
to try to improve sleep quality and quantity, have good social sup- 
port and a positive outlook on life, maintain a healthy diet, avoid 
smoking, and have regular moderate physical activity. Concerning 
physical activity, it is not necessary to become an extreme athlete, 
and moderate physical activity helps, as noted earlier, in relation to 
enlarging hippocampal volume. 

From the standpoint of policy, the goal should be to create in- 
centives at home and in work situations and to build community 
services and opportunities that encourage the development of the 
beneficial individual lifestyle practices. 

As simple as the solutions seem to be, changing behaviour and 
solving problems that cause stress at work and at home are often 
difficult and may require professional help on a personal level, or 
even a change of job or profession. Yet these are important goals 
because the prevention of later disease is very important to increase 
‘healthspan’ and promote full enjoyment of life and also to reduce 
the financial burden of disease and disability on the individual and 
on society. 
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This chapter will highlight the genetic architecture of trauma- and 
stress-related disorders, exemplified by post-traumatic stress dis- 
order (PTSD), which itself overlaps in symptom dimensions with 
acute stress disorder and adjustment disorder. Within DSM-5, this 
nosological group shares, as part of their diagnosis, the specific call 
for an environmental component in the form of direct or indirect 
exposure to a causative traumatic incident [1]. This unique position 
among psychiatric disorders is particularly relevant for studying the 
interaction of an organism's internal biological component with ex- 
ternal stimuli in the pathogenesis of a mental disorder. 

Here, we will review the genetics of stress-related disorders, fo- 
cusing on clinical genetic approaches such as family and twin 
studies and molecular genetic studies, that is candidate gene and 
hypothesis-generating genome-wide association studies (GWAS), 
as well as gene-environment and epigenetic approaches. Due to 
the close phenomenological relationship between the categories 
of trauma- and stress-related disorders and anxiety disorders, the 
genetics of cross-disorder phenotypes and cross-cutting symptom 
measurements will build an additional focus. 
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The requirement for trauma exposure as a key factor in the diag- 
nosis of PTSD poses a challenge to estimates of familial aggregation, 
since the exact degree of trauma exposure is likely to vary between 
otherwise comparable relatives. The task to distinguish the heritable 
components of PTSD is difficult. Indeed, biological factors could af- 
fect an individual's liability to PTSD in several ways, including, but 
not limited to an increased susceptibility to trauma exposure (gene- 
environment correlation), an increased susceptibility to PTSD inde- 
pendent of the trauma, an influence on risk factors potentially linked 
to PTSD (for example, the personality trait of increased risk-seeking 
behaviour), and/or mediation of a pleiotropic effect on several of 
these. For example, in one family study, PTSD lifetime symptoms 
of mothers were significantly associated with PTSD symptoms in 





children in a dose-dependent fashion, while similarly predicting in- 
creased trauma exposure in children, which accounted for 74% of 
the increased PTSD incidence rate [2]. 

Quantitative genetic analysis of Vietnam-era veteran monozy- 
gotic and dizygotic male twin pairs overcame some of the problems 
of unequal trauma exposure. It revealed a substantial genetic in- 
fluence on the susceptibility to all assessed PTSD symptoms; a bi- 
variate genetic model, adjusted for combat exposure, estimated the 
heritability of re-experiencing symptoms to range between 0.13 and 
0.30 [3]. The heritability of avoidance and arousal symptoms varied 
between 0.30-0.34 and 0.28-0.32, respectively, with no evidence for 
a shared environmental component [3]. In a study of genetic and 
environmental effects on exposure to assaultive and non-assaultive 
trauma and occurrence of PTSD symptoms in male and female twin 
pairs of non-veteran volunteers with trauma exposure, the variance 
of reported symptoms was best explained by genetic factors for as- 
saultive trauma (overall heritability of 0.53), with genetic factors 
particularly contributing to symptoms of re-experiencing (0.36) and 
numbing (0.36), whereas no genetic component was found in non- 
assaultive trauma (mainly explained by non-shared environmental 
factors) [4] (Table 80.1). 

While both genetic and non-shared environmental influences 
affected PTSD symptoms, and the severity of PTSD symptoms has 
been linked to the number of assaultive traumatic events suffered, 
the influence of genetic factors has been suggested to decline with 
increase in total trauma number [5]. In other words, there appears 
to be a threshold beyond which the quantitative summation of en- 
vironmental factors supervenes [5]. Another population-based 
twin sample estimated the magnitude of additive genetic influences, 
based on trauma quality, and reported a greater genetic contribu- 
tion in high-risk trauma (0.60; trauma reported as most disturbing), 
as compared to low-risk trauma (0.47), with a genetic correlation 
between high-risk trauma and PTSD of 0.89, whereas for low-risk 
trauma and additive genetic and non-shared environmental influ- 
ences, a lower correlation of 0.57 was observed [6]. 


Structural and rare variant studies 


No systematic research efforts have been published linking PTSD to 
recognized neurodevelopmental disorders, structural genetic vari- 
ations, or rare single-nucleotide polymorphisms (SNPs). 
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Table 80.1 Trauma- and symptom-focused perspectives on the heritability of stress-related disorders 





Trauma exposure 
Assaultive trauma* 0.53 0.40-0.67 
on-assaultive trauma? = = 
Low-risk trauma? 0.47 0.35-0.57 
High-risk trauma? 0.60 0.49-0.71 
Clinical presentation 
Re-experiencing symptoms? 0.36 0.23-0.51 
Avoidance symptoms* 0.28 0.14-0.44 
umbing symptoms 0.36 0.22-0.50 
Hyperarousal symptoms* 0.29 0.15-0.44 
Total PTSD symptoms? 0.38 0.24-0.52 
PTSD diagnosis” 0.46 0.31-0.62 





= = 0.42 0.33-0.60 
0.32 0.21-0.45 0.68 0.55-0.80 
= = 0.53 0.43-0.65 
= = 0.40 023-051 
= = 0.64 0.54-0.80 
= = 0.72 0.56-0.86 
= = 0.64 0.50-0.78 
= = 0.71 0.56-0.85 
= = 0.62 0.48-0.76 
= = 0.54 0.38-0.69 





Assaultive trauma comprised captivity and physical/sexual assault. Non-assaultive trauma comprised natural disasters, motor vehicle 
accidents, and the occurrence of a sudden death in the family. Low-risk trauma comprised combat exposure and accidents. High-risk 
trauma comprised rape, sexual molestation, and childhood neglect/abuse. Heritability estimates not adding up to 1.0 are derived from 


rounding up/down effects. 





PTSD, post-traumatic stress disorder; Cl, confidence interval; a”, additive genetic factors; c¢?, shared environmental factors; e?, unique 


environmental factors. 
*[4]. 
” [6]. 


GWAS, candidate genes, gene-environment interaction, 
and ‘therapy genetics’ studies 


The first GWAS of PTSD was performed in a small trauma-exposed 
cohort of African-American veterans and their intimate partners 
(295 cases, 196 controls) and yielded several SNPs with suggestive 
associations, with one polymorphism in the RAR (retinoic acid 
receptor)-related orphan receptor alpha gene (RORA) reaching 
genome-wide significance (rs8042149, C allele) [7]. The rs8042149 
finding failed to replicate in two independent replication samples, 
but interestingly, other RORA SNPs were implicated as PTSD risk 
variants [7] . Notably, a subsample analysis of the initial GWAS co- 
hort, screening 606 SNPs spanning the RORA gene, significantly as- 
sociated rs8042149 with the fear sub-factor of PTSD comorbidities 
(outcome of a 3-factor confirmatory factor analysis, next to an in- 
ternalizing and an externalizing sub-factor) [8]. 

A GWAS conducted in a cohort of European-Americans (300 
cases, 1278 controls) yielded rs406001 on chromosome 7p12 as 
a genome-wide significant hit, the nearest gene being the protein 
cordon-bleu (COBL) and the second-best hit located in the first in- 
tron of the tolloid-like protein 1 (TLL1), barely missing genome- 
wide significance (rs6812849, P = 2.99 x 107) [9]. A SNP replication 
analysis of the top GWAS hits in another cohort and a subsequent 
combined cohort analysis resulted in a more favourable overall 
genome-wide statistical outcome for TLL1 rs6812849 [9]. An ana- 
lysis of the main effects of COBL SNPs in an African-American 
civilian population did not replicate, but a significant genotype- 
environment interaction with childhood trauma was found for 
COBL rs406001 [10]. The T allele mediated higher PTSD symptom 
intensity and was associated with significantly decreased fractional 
anisotropy in left hemisphere structures (inferior fronto-occipital 
fasciculus, uncinate fasciculus, orbitofrontal white matter tracts, and 
inferior longitudinal fasciculus) [10]. 


Screening for genome-wide significant association in a sample of 
trauma-exposed African-American women (94 cases, 319 controls) 
identified the minor G allele of rs10170218, a SNP within the long 
intergenic non-coding RNA sequence AC068718.1 gene located at 
2q32.1 [11]. This was replicated in a second independent cohort of 
European-American women [11]. A functional enrichment analysis 
of the nominally significant GWAS findings highlighted pathways 
related to the maintenance of telomere length [11]. 

Another GWAS (3167 cases, 4607 trauma-exposed controls) 
found a genome-wide significant hit in the ankyrin repeat domain- 
containing protein 55 gene (ANKRD55, rs159572) in subjects of 
African-American ancestry and one close to the zinc finger pro- 
tein 626 gene (ZNF626, rs11085374) in the European-American 
subsamples [12]. 

A study analysing 3742 SNPs of over 300 genes previously re- 
ported as PTSD candidate risk genes in trauma-exposed European- 
American women (845 cases, 1693 controls) in a categorical and 
dimensional fashion found the most significant association between 
the intronic synaptic vesicular amine transporter (SLC18A2, in- 
volved in the ATP-dependent neurotransmitter storage) rs363276 
A allele and a PTSD diagnosis [13]. In a similar experimental design 
examining the association of 3755 candidate genes and PTSD in a 
gene-environment interaction approach with childhood trauma in 
a cohort of male soldiers of European ancestry (810 samples), an 
association was identified between the beta-2 adrenergic receptor 
(ADRB2) rs2400707 G allele and increased childhood adversity 
load, resulting in heightened PTSD symptom intensity, which was 
subsequently replicated in an independent sample of predominantly 
female African-American civilians (2083 samples) [14]. 

A GWAS in trauma-exposed United States marines of mixed an- 
cestry (940 cases, 2554 controls) resulted in a significant association 
for the phosphoribosyltransferase domain-containing protein 1 
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(PRTFDC1); the rs6482463 minor A allele was under-represented 
in PTSD. A replication attempt in an independent military cohort 
failed, and although a candidate gene screen, including RORA, TLL1, 
and AC068718.1 SNPs, yielded nominally significant associations, 
no finding survived correction for the number of polymorphisms 
tested per gene [15]. In general, results of hypothesis-generating 
studies in such small samples must be viewed with caution, since it 
is difficult to exclude chance findings. 

Finally, the largest GWAS of PTSD to date (5182 cases; 15,548 con- 
trols, 13,638 of which were trauma-exposed) suggested a genome- 
wide significant finding in the kelch-like protein 1 gene (KLHL1, 
rs139558732) detected in its African-American subsample (2520 
cases, 7171 controls) [16]. However, recent GWAS investigations in 


a cohort of mixed-ancestry Iraq—Afghanistan-era veterans [17] and 
of the DSM-5 dissociative subtype of PTSD in a cohort of European- 
American United States military veterans [18] did not report any 
genome-wide significant results. 

In addition to GWAS, the impact of specific genetic variation in 
PTSD has focused on genes related to monoaminergic neurotrans- 
mission (Fig. 80.1) analogous to candidate studies performed in 
anxiety disorders (see Chapter 89) and neuroendocrine functioning, 
in particular the hypothalamic—pituitary—adrenal (HPA) axis. A re- 
cent meta-analysis of trauma-exposed samples found only a trend 
towards significance between PTSD and the short (s) allele of the 
serotonin transporter (SLC6A4, chromosome 17q11.2) 5-HTTLPR 
(5-hydroxytryptamine transporter-linked polymorphic region; 
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Fig. 80.1 Multi-level evidence linking the serotonergic system to trauma- and stress-related disorders, cross-disorder phenotypes, and anxiety 


disorders. The short allele/low transcriptional activity variants of the 5-HTTLPR were chosen as seed node. Arrows represent domain interrelations 

of systems analysis mediating interactions between genetic and environmental factors, potentially leading to observable meta-phenomena crossing 
classic nosological boundaries and connecting different spectra of psychopathology. Note that some authors include the rs25531 polymorphism in 
their analyses, with the minor G allele rendering transporter expression levels of the long 5-HTTLPR allele more closely to the short allele. Refer to the 
text for detailed supporting literature. ACC, anterior cingulate cortex; AMY, amygdala; CBT, cognitive behavioural therapy; HIP hippocampus; IN, insula; 
MPFC, medial prefrontal cortex; PCC, posterior cingulate cortex; PD, panic disorder; PTSD, post-traumatic stress disorder; SAD, social anxiety disorder; 


SSRI, selective serotonin reuptake inhibitor; TH, thalamus. 


the s allele is linked to decreased expression/transporter activity, 
compared to the long [l] variant) [19]. However, the s/s genotype 
and high trauma exposure were associated with PTSD, arguing for 
a further exploration of gene-environment interaction to clarify 
the polymorphism’ role in trauma-related psychopathology [19]. 
Soldiers with one or two copies of the low-functioning 5-HTTLPR/ 
rs25531 alleles (the minor rs25531 G allele reduces transporter ex- 
pression levels like the s variant) reported increased symptoms of 
PTSD and anxiety in response to heightened exposure to war zone 
stressors, as compared to carriers of high-functioning alleles [20]. 
Accordingly, the genotype with low transcriptional efficiency was 
associated with PTSD diagnoses and PTSD symptom severity in vet- 
erans [21]. Interestingly, the low-functioning 5-HTTLPR/rs25531 
alleles mediated an interaction effect of increased PTSD symptoms 
during battle deployment only in individuals with greater atten- 
tional avoidance of threats, rather than in individuals with vigi- 
lant threat appraisal [22]. Similarly, following a university campus 
shooting, the low-functioning 5-HTTLPR/rs25531 genotypes pre- 
dicted increased acute stress symptom intensity 2-4 weeks after the 
event, even when correcting for shooting exposure; symptoms of 
avoidance and hyperarousal, but not of re-experiencing, were the 
most implicated [23]. 

Remarkably, a pharmacogenetic study examining the 4- and 12- 
week PTSD treatment response to sertraline linked 1 allele homozy- 
gosity to reduced dropout rates due to adverse effects and increased 
efficacy; most strikingly, there was a 48% response rate (defined as 
a minimum symptom reduction of 30%) by week 12, as compared 
to a 0% response rate in s allele carriers [24]. Correspondingly, low- 
expression 5-HTTLPR/rs25531 genotype carriers displayed a sig- 
nificantly worse 6-month treatment outcome following an 8-week 
exposure-based cognitive behavioural therapy (CBT) programme, 
controlling for pre-CBT symptom severity and the number of treat- 
ment sessions [25]. 

The catechol-O-methyltransferase COMT (chromosome 
22q11.21) rs4680 A/A genotype has been associated with PTSD, 
particularly in patients who had experienced childhood adversities 
and possessed high neuroticism scores [26]. The minor A allele re- 
sults in an exchange of amino acid 158 valine (major G allele) to 
methionine and significantly increased degradation of catechol- 
amine neurotransmitters—often referred to as the val158met poly- 
morphism. Surprisingly, it has been indicated further that, while 
COMT rs4680 G (val) allele carriers showed a classic dose-response 
relationship of lifetime trauma abundance and PTSD incidence rate, 
met/met genotype carriers exhibited an increased risk of PTSD, in- 
dependent of trauma load [27]. 

The peptidyl-prolyl cis-trans isomerase FKBP5, commonly re- 
ferred to as the FK506 binding protein 5 (FKBP5, chromosome 
6p21.31), is probably one of the most extensively characterized 
candidate genes associated with PTSD. FKBP5 is an immunophilin 
family member with isomerase and chaperone functions; it interacts 
with unligated steroid receptor heterocomplexes through heat shock 
protein 90 (HSP90), influencing intracellular trafficking behaviour, 
receptor affinity, and nuclear translocation. Since FKBP5 transcrip- 
tion is activated by glucocorticoid receptors via steroid response 
elements, a direct negative feedback loop is created [28]. Several 
SNPs within the FKPB5 gene (rs1360780 T allele, rs3800373 C allele, 
rs9296158 A allele, and rs9470080 T allele) do not directly predict 
PTSD symptom outcome or interact with non-childhood trauma 
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[29]. Instead, they appear to interact with the severity of child 
abuse to predict increased levels of adult PTSD symptom levels and 
a greater suppression of cortisol release during a pharmacological 
challenge with dexamethasone [29]. A study evaluating FKBP5 
SNPs in a trauma-exposed African-American cohort (1963 sam- 
ples) reported a significant association between the rs1360780 A al- 
lele (located in a functional enhancer region in intron 2 488 bp away 
from a glucocorticoid response element) and childhood trauma on 
PTSD symptoms [30]. The FKBP5 rs1360780 T allele has been inde- 
pendently reported to predict an increased risk of long-term PTSD 
symptom relapse following narrative exposure therapy [31]. In the 
same African-American cohort, the rs9470080 T allele increased the 
risk to be diagnosed with PTSD in an interaction with childhood 
maltreatment; yet surprisingly, T/T genotype carriers without child- 
hood trauma had the lowest risk of PTSD [32]. In studies with diffu- 
sion tensor imaging and resting state functional magnetic resonance 
imaging (fMRI), rs1360780 T allele carriers with PTSD displayed 
a decreased structural, as well as functional, connectivity between 
the anterior cingulate cortex and the hippocampus, which has been 
suggested as a source of the pathologically increased neuronal sali- 
ence observed in PTSD [33]. Moreover, in an attempt to evaluate dif- 
ferent biologically distinct subtypes of PTSD, it has been found that 
rs9296158 A allele carriers display increased plasma cortisol levels 
[34]. These, in turn, correlated with symptom intensity and signifi- 
cantly influenced peripheral mRNA levels of genes carrying steroid 
hormone transcription factor binding sites in whole blood [34]. 
Other work has focused on different elements of neurohormonal 
stress response regulation such as the gene coding for the pi- 
tuitary adenylyl cyclase-activating polypeptide type I receptor 
(ADCYAP1R1, chromosome 7p14.3), a G protein-coupled receptor 
implicated in the modulation of adrenocorticotrophic hormone 
(ACTH) secretion. The C allele of ADCYAPIR1 1s2267735, located 
in a putative oestrogen response element, has been linked to PTSD 
symptom intensity and PTSD diagnosis in females and reduced dis- 
crimination of danger and safety cues ina fear conditioning paradigm 
[35]. Subsequent gene-environment studies found significant inter- 
actions of the ADCYAP1R1 rs2267735 C allele and the total amount 
of suffered traumata [36], or the degree of experienced childhood 
maltreatment [37], in predicting increased female PTSD symptom 
intensity. Imaging has revealed decreased hippocampal activity 
during contextual, but not during cued, fear conditioning in female 
ADCYAPIRI1 rs2267735 C allele carriers [38] and an increased re- 
activity of the amygdala and hippocampus to threat stimuli, as well 
as decreased functional connectivity between the amygdala and the 
hippocampus during passive viewing of threatening stimuli [39]. 
Moreover, with regard to HPA axis regulation, two SNPs (rs8192496 
and rs2190242) in the corticotropin-releasing hormone receptor 2 
(CRHR2, chromosome 7p14.3) gene have been linked to a decreased 
risk of suffering from PTSD in female trauma-exposed veterans, a 
proposed effect of an attenuated neurohormonal stress response 
[40]. Finally, in a prospective emergency department setting, a com- 
posite additive risk score derived off genes related to stress response 
and arousal, including, among others, FKBP5, ADCYAPIR1, the 
CRH receptor 1 gene (CRHR1, chromosome 17q21.31), and COMT, 
predicted increased PTSD symptoms in subjects without early post- 
trauma intervention classified as ‘risk’ carriers, as compared to ‘low- 
risk and ‘resilience’ genotypes [41]. Additionally, PTSD symptom 
intensity 12 weeks after trauma only correlated with the genetic risk 
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load in the group without intervention, but not in patients with im- 
aginal exposure therapy to trauma memory, with the overall number 
of risk alleles significantly associated with an increased PTSD inci- 
dence rate [41]. 


Epigenetic studies 


The potential relevance of maladaptive changes in chromatin struc- 
ture and DNA methylation as mediators of the influence of trau- 
matic experiences cannot be underestimated. An impact can be 
expected from the single cell level up to cognitive patterns and be- 
haviour in the organism’ allostatic biological load in general, and in 
manifestations of PTSD in particular. 

A large microarray approach evaluating DNA methylation 
of more than 14,000 genes in 23 PTSD-affected and 77 PTSD- 
unaffected individuals suggested traumatic events to induce 
downstream alterations in immune system functions by reducing 
methylation levels of genes related to cytokine production and in- 
nate immunity [42]. 

At the candidate gene level, overall SLC6A4 promoter region 
methylation influenced the risk of PTSD, independently of the 5- 
HTTLPR genotype, with low methylation levels correlating with an 
increased PTSD incidence rate in people who experienced multiple 
traumata [43]. Similarly significant associations were discerned be- 
tween SLC6A4 hypomethylation and increased PTSD symptom in- 
tensity and symptom quantity [43]. 

A study in a trauma-exposed African-American cohort found 
that FKBP5 rs1360780 influenced chromatin conformation with 
respect to the transcription start site—A allele carriers exposed to 
childhood trauma displayed decreased methylation levels close to, 
and within, glucocorticoid response elements in intron 7, inde- 
pendent of recent lifetime trauma exposure [30]. It was hypothe- 
sized that this hypomethylation in risk allele carriers would imply 
an increased induction of FKBP5 by glucocorticoid receptor acti- 
vation and enhancement of the ultra-short negative HPA feedback 
loop, leading to increased glucocorticoid receptor resistance [30]. 
Additionally, structural imaging data revealed a significant negative 
correlation between peripheral FKBP5 intron 7 methylation and 
hippocampal volume [30]. Thus, epigenetic changes in these genes 
not only alter the sensitivity of peripheral glucocorticoid receptors, 
but are also associated with morphological changes in limbic areas 
already believed to be indicative of a higher stress hormone system 
reactivity [30]. 

Transgenerational effects of trauma exposure have been explored 
in a study of Holocaust survivors and their offspring; survivors 
showed significantly increased FKPB5 methylation, yet decreased 
levels in their offspring [44]. At the same time, significant inter- 
actions of the FKBP5 rs1360780 T allele and CpG-specific methy- 
lation levels with either parental trauma intensity or the severity of 
self-experienced childhood trauma were reported, providing in- 
sights into the psychophysiological epigenetic integration of trauma 
across age groups [44]. 

Finally, in a cohort of children undergoing CBT for anxiety dis- 
orders, there was a significant correlation between the change 
in FKBP5 methylation and therapy response, with responders 
demonstrating the greatest decrease in methylation, an effect which 
was specific to FKBP5 rs1360780 T allele carriers [45]. 

Additional epigenetic evidence links other HPA regulators to 
PTSD. Remarkably, while increased ADCYAPIR1 methylation was 


associated with a PTSD diagnosis, rs2267735 C allele carriers dis- 
played reduced cerebral mRNA expression levels [35]. Furthermore, 
decreased glucocorticoid receptor (NR3C1, chromosome 5q31.3) 
promoter region methylation in peripheral blood mononuclear cells 
was detected in veterans with PTSD [46]. It was further associated 
with increased cortisol levels following the dexamethasone suppres- 
sion test, reports of psychiatric distress, peritraumatic dissociation, 
and poor sleep quality [46]. In Holocaust survivors’ offspring with 
both parents suffering from PTSD, lower NR3C1 methylation was 
observed, compared to offspring with only paternal PTSD, and lower 
methylation levels correlated with increased post-dexamethasone 
cortisol suppression [47]. 

Interestingly, the methylation state of DNA methyltransferases 
themselves has been investigated following trauma exposure, with 
probands developing PTSD later on showing increased methylation 
of DNA (cytosine-5)-methyltransferase 1 (DNMT1), whereas low 
pre-trauma methylation of DNA (cytosine-5)-methyltransferase 
3B (DNMT3B) predicted increased levels of PTSD symptom 
severity [48]. 


Genetics of cross-disorder phenotypes of post- 





traumatic stress and anxiety 


Heritability estimate studies suggest a shared genetic signature 
underlying PTSD and anxiety spectrum conditions, namely spe- 
cific phobia (SP), social anxiety disorder (SAD), agoraphobia (AP), 
panic disorder (PD), and generalized anxiety disorder (GAD) 
[49]. Acommon diathesis of PTSD is further suggested with inter- 
mediate phenotypes, such as fear generalization, avoidance behav- 
iour, anxiety sensitivity, state/trait anxiety, harm avoidance, and 
physiological hyper-responsiveness to certain stimuli, supported 
by their genetic correlation and joint cross-disorder familial risk 
[50, 51]. 

Unsurprisingly, in addition to an overlap in the genetic architec- 
ture of trauma- and stress-related disorders and anxiety disorders, 
both diagnostic categories are based upon a selection of highly 
connected, psychometrically assessable constructs, outlining, for 
example, the gradual development of normative stimulus-specific 
fears into functionally impairing generalized anxiety. Primarily, 
the study of these cross-disorder post-traumatic stress and anxiety 
phenotypes has focused on continuous traits, which are hypothe- 
sized to represent different forms of neuropsychiatric vulnerability 
for the diagnosis of the corresponding disorders during develop- 
ment. As outlined in later sections, various intermediate pheno- 
type concepts have been suggested to affect the onset, therapy, or 
disease course of trauma-inflicted psychological stress. (Also see 
Chapter 89 for additional remarks on potentially relevant cross- 
disorder phenotypes.) 


Serotonergic system 


Fig. 80.1 depicts the multi-level, cross-disorder genetic vulner- 
ability mediated by several intermediate phenotypes and environ- 
mental factors for SLC6A4/5-HTT gene variation. A meta-analysis 
of cortisol reactivity (1686 cases) to acute psychosocial stress re- 
ported a moderating effect of the 5-HTTLPR polymorphism, with 
homozygous s allele carriers demonstrating increased cortisol 
reactivity [52]. Increased vigilance towards negatively valenced 


stimuli in carriers of low-active 5-HTTLPR alleles was further con- 
firmed via a meta-analysis (807 cases) of studies focusing on atten- 
tional biases [53]. However, the low-expression forms of 5-HTTLPR 
could rather represent a plasticity factor capable of developing 
stronger biases for both negative and positive affective stimuli [54]. 
Thus, it has been suggested that the increased amygdala response 
to aversive stimuli in carriers of the short 5-HTTLPR allele is not 
mediated by 5-HTT availability in adults, but rather by amygdala 
size, with smaller amygdala volumes correlating with increased 
neural activation, suggesting a neurodevelopmental influence of 
the serotonergic polymorphism [55]. Another meta-analysis (9361 
cases) of gene-environment studies in children and adolescents re- 
vealed that Caucasian carriers of the 5-HTTLPR s allele are not only 
significantly more vulnerable to negative environmental influences, 
but also more receptive to positive input than 1/1 carriers [56]. In 
a cohort of healthy subjects, the 5-HT'TLPR 1/1 genotype resulted 
in higher anxiety sensitivity scores in an interaction with child- 
hood trauma exposure, in particular with regard to somatic sub- 
dimensions of anxiety sensitivity [57]. Nevertheless, despite most 
studies focusing on risk factors, few have also included protective, 
resilience-increasing factors such as self-efficacy. When childhood 
trauma and self-efficacy were assessed in relation to trait anxiety, 
social anxiety, and agoraphobic anxiety in healthy adults genotyped 
for the 5-HTTLPR/rs25531, s allele carriers showed a positive cor- 
relation of trauma and anxiety, independent of self-efficacy [58]. 
Conversely, 1,/1, homozygotes with low self-efficacy scored signifi- 
cantly higher in all available anxiety assessments when they had 
experienced childhood trauma but were resilient when they pos- 
sessed high self-efficacy [58]. 


Hormone system 


In relation to the HPA axis, carriers of the CRHR1 rs110402 A al- 
lele, which has been implicated in elevated cortisol reactivity to 
acute stressors, displayed blunted basolateral amygdala habitu- 
ation in a fear extinction paradigm, and an increased risk to be 
diagnosed with any anxiety disorder [59]. The CRHR1 rs878886 
minor G variant has been associated with reduced fear-conditioned 
responses to a threat cue and a tendency towards anxious gener- 
alization and increased contextual anxiety in an interaction with 
the low-activity alleles of the 5-HTTLPR [60]. A replication study 
confirmed the acquisition deficit and increased contextual anxiety 
in rs878886 G allele carriers and suggested a tri-allelic interaction 
with low-expression 5-HTTLPR/rs25531 variants on cued fear ac- 
quisition [61]. In imaging genetic approaches, several SNPs in both 
HPA-relevant genes, CRHR1 and NR3C1, have been linked to in- 
creased amygdala activation during the acquisition phase of a fear 
conditioning paradigm and decreased prefrontal activity during 
the extinction phase, respectively [62]. Interestingly, to note from 
an epigenetic viewpoint, currently healthy subjects with a lifetime 
anxiety disorder diagnosis have lower levels of NR3C1 promoter 
region methylation, while the number of experienced childhood 
adversities negatively correlated with promoter region methyla- 
tion only in subjects without a lifetime diagnosis [63]. A micro- 
satellite marker [64] and several SNPs [65] within the CRH gene 
itself have also been linked to behavioural inhibition in children 
with a familial loading of PD. Moreover, carriers of the FKBP5 
rs1360780 T allele have been shown to display heightened levels of 
harm avoidance [66] and an attention bias towards threat cues and 
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increased hippocampal activation during threat evaluation [67], 
while amygdala activation was increased in risk allele carriers ex- 
posed to emotional neglect in childhood [68]. 


Catecholaminergic system 


Evaluation of the COMT vall158met polymorphism in association 
with childhood maltreatment demonstrated that val/val allele car- 
riers displayed a significantly increased startle response to aver- 
sive pictures, while met/met carriers showed blunted responses to 
unpleasant stimuli [69]. The effect in val carriers was discerned to 
be potentiated by increased amounts of maltreatment, which had 
no interaction effect in met allele carriers [69]. In a double-blind, 
placebo-controlled design, it has also been shown that val allele car- 
riers displayed an increased startle potentiation towards aversive 
stimuli, independently of an acute injection of levodopa (L-dopa), 
whereas in met allele carriers, the startle response to unpleasant pic- 
tures was only enhanced under L-dopa administration, but not by 
placebo [70]. This would support the influence of catecholaminergic 
neurotransmission on the emotion-potentiated startle reflex either 
by increased phasic dopamine abundance, as conferred by the high- 
activity val allele, or by pharmacologically enhanced dopamine 
availability, eventually leading to a maladaptive predisposition of the 
emotional valence system. 


Clinical challenges and future perspectives 





The gene-environment interactions reported in relation to PTSD 
highlight not only very appealing candidates, such as the serotonergic 
or HPA system, but also underline the importance of individual vul- 
nerability windows in personal development, for example reflected 
by ‘plasticity polymorphisms’ influencing gene expression or protein 
activity, which interact with childhood experiences, yet are not in- 
fluenced by recent life events. Due to the special position of PTSD 
in psychiatric nosology, genetic association and gene-environment 
studies pose the challenge that causative and non-causative trauma 
exposures are correlated and difficult to disentangle [71]. While 
trauma exposure is a diagnostic criterion of PTSD, one must dis- 
tinguish that category which is sufficient to cause PTSD and that 
which increases the risk of a future PTSD diagnosis (but, by itself, is 
not enough to warrant a diagnosis) [72]. Another aspect, not to be 
underestimated, is the influence genetic polymorphisms may have 
on the exposure to trauma per se, mediated by a gene-by-environment 
correlation. Thus, a polymorphism might not only increase the sus- 
ceptibility to trauma (and thereby to develop subsequent PTSD), 
but also rather influence the risk of a PTSD diagnosis via an in- 
creased trauma incidence rate (either of causative or risk-increasing 
trauma). With the ever growing feasibility and accessibility of large 
sample numbers in psychiatric genetics, the identification of rare 
variants with high penetrance could further substantiate efforts 
at multilocus predictions, by simultaneously integrating an array 
of molecular variations, next to other biological, developmental, 
environmental, epidemiological, or temperamental risk factors. 
Large-scale machine-learning algorithms have, for example, already 
been applied to identify the most significant predictors of PTSD, 
including demographic factors (females, married), traumatic ex- 
perience (almost exclusively violence-related, including organized 
violence, interpersonal violence, and relationship/sexual violence), 
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and psychiatric lifetime diagnoses (attention-deficit/hyperactivity 
disorder, separation anxiety disorder, SP, GAD, and prior unre- 
lated PTSD) [73]. Moreover, genetic findings in stress-related inter- 
mediate phenotypes point towards a bottom-up, multi-dimensional, 
cross-disorder taxonomy, for example, by suggesting a considerable 
degree of genetic correlation between trauma-related and anxiety 
disorders, spanning categorical classifications, subsyndromal di- 
mensional constructs of symptoms, temperament and environ- 
mental risk, and behavioural, neurophysiological, and neuronal 
circuit readouts (Fig. 80.1). This could result in specified functional 
domains of behaviour built upon aggregated cross-discipline evi- 
dence and grouped into higher-level processes of emotional and 
cognitive functioning. These may then inform major constructs, 
like, for example, the negative valence system within the Research 
Domain Criteria (RDoC) initiative [74]. Finally, future studies 
will have to systematically explore the potential of genetic/epigen- 
etic tools for treatment response prediction, aiming at establishing 
a biomarker-driven personalized, individually tailored ‘precision 
medicine’ approach in stress-related disorders [75]. 


Conclusions 


Stress-related disorders display moderate heritability, as outlined 
by family and twin studies. Molecular genetic studies have begun 
to elucidate to what extent genetic and epigenetic variations shape 
the onset, clinical presentation, and therapy of stress-related dis- 
orders. So far, these have notably highlighted the relevance of the 
serotonergic and catecholaminergic systems, as well as HPA axis 
regulation in stress-related disorders (Table 80.2). GWAS and subse- 
quent cross-sample meta-analyses in large cohort sizes complement 
the findings on candidate gene and gene-environment approaches. 
The potentially artificial boundaries created by a categorical diag- 
nostic system are challenged by interdisciplinary studies gathering 
multi-level evidence for a common, cross-disorder genetic trunk. In 
light of the apparent intricacy of the implicated principal compo- 
nents and the heterogeneity of interaction effects and genetic plei- 
otropy, tools offered by systems biology could promote the synthesis 
of candidate genes into functional molecular processes linking en- 
vironmental and psychological readouts. A central equivalent of 


Table 80.2 Overview of genetic variants implicated in PTSD and cross-disorder post-traumatic stress and anxiety phenotypes. The overall 
degree of confidence was ranked from ‘+’ = ‘low’ to ‘++++’ = ‘high’, based on the reviewed evidence (refer to the individual sections in this 


chapter for detailed supporting literature) 


Top risk variant Evidence from structural, rare variant, GWAS, candidate gene, gene-environment, and 
epigenetic studies 


Meta-analytical evidence for increased PTSD risk [19] 

Meta-analytical evidence for interaction with high trauma exposure in PTSD risk [19] 
Increased PTSD symptoms in response to increased trauma exposure [20] 

Increased PTSD incidence rate and symptom severity [21] 

Interaction with attentional avoidance of threats, leading to increased PTSD symptoms [22] 





SLC6A4 Serotonin 
transporter 
polymorphic region 
(5-HTTLPR) short 
allele 


Overall degree 
of confidence 


Increased acute stress symptom intensity [23] 
Decreased response to sertraline treatment [24] 


Decreased response to exposure-based CBT [27] 


+444 


Decreased promoter methylation associated with increased PTSD incidence rate and symptom 


intensity [43] 


FKBPS rs1360780 


T allele 





hippocampus [33] 





ncreased harm avoidance [67] 








evaluation [68 


ADEWAPIRI | EZ207735 


C allele ear conditioning [35] 





KLHL1 





rs139558732 
C allele 


rs2400707 
G allele 


rs4680 
G allele (val158met, 
val variant) 


date [16] 
ADRB2 


COMT 





maltreatment [70] 








nteraction between CpG-specific methylation levels and parental or childhood trauma [44] 
Decreased methylation associated with increased CBT therapy response for anxiety disorders [45] 


ncreased female PTSD symptom intensity in interaction with trauma intensity [36] 
ncreased female PTSD symptom intensity in interaction with childhood maltreatment [37] 
ncreased amygdala and hippocampus activity during threat processing [39] 
ncreased methylation associated with increased PTSD incidence rate [35] 


ominal genome-wide significance in the African-American subgroup of the largest PTSD GWAS to 


Top hit in a multi-candidate gene association study in interaction with childhood trauma exposure 
and replication in an independent cohort [14] 


eta-analytical evidence for increased cortisol response to psychosocial stress [52] 

eta-analytical evidence for attentional bias to negatively valenced stimuli [53] 

eta-analytical evidence for increased vulnerability to negative environmental influences, but also 
increased receptiveness to positive influences [56] 


ncreased PTSD symptoms in interaction with childhood trauma [29] 
ncreased PTSD symptom relapse following narrative exposure therapy [31] 
Decreased functional and structural connectivity between the anterior cingulate cortex and the 


++++ 


ncreased attentional bias towards threat cues and increased hippocampal activation during threat 


ncreased amygdala activation in interaction with emotional neglect during childhood [69] 


ncreased PTSD symptom intensity and PTSD incidence rate in females and reduced discriminative 





++ 


++ 


ncreased PTSD incidence in interaction with lifetime trauma abundance [27] 
ncreased startle response to aversive pictures, especially in interaction with childhood + 


PTSD, post-traumatic stress disorder; GWAS, genome-wide association study; CBT, cognitive behavioural therapy. 


this integration between organism-internal molecular layout and 
organism-external environmental impact is emerging with psychi- 
atric epigenetics. The potential of a dynamic gene-specific structural 
change in covalent DNA modifications or histone assembly could 
finally bridge the gap between genetics and environment. Besides 
its relevance for innovative and personalized treatments, the clin- 
ical potential of psychiatric genetics/epigenetics with regard to 
stress-related disorders lies in the prospective characterization of 
disease risk allowing for targeted prevention strategies, designed 
to counteract genetically determined at-risk states with resilience- 
increasing interventions and thus lower the incidence of stress- 
related disorders. 
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Over the past two decades, researchers have used structural and 
functional neuroimaging techniques and neuroscience-based para- 
digms to study the mediating neurocircuitry of stress-related psy- 
chiatric disorders, with a focus on post-traumatic stress disorder 
(PTSD). The ultimate goals of such investigations are to identify 
neurocircuitry abnormalities, to determine whether those abnor- 
malities are useful in diagnosis or in predicting treatment response, 
and to elucidate treatments that could address the abnormalities and 
ameliorate symptoms. In the text that follows, we will review the 
imaging methods, paradigms, and findings of neuroimaging studies 
of PTSD and acute stress disorder (ASD). We will present the re- 
sults in the context of a neurocircuitry model of PTSD that emerged 
from basic neuroscience studies of fear conditioning and extinction. 
Finally, we will identify current gaps in our knowledge regarding 
brain structure and function in PTSD and discuss how future studies 
might address those gaps. 




















The structure and function of the human brain have been assessed 
using neuroimaging techniques, including magnetic resonance 
imaging (MRI), functional MRI (fMRI), and positron emission 
tomography (PET) (for a review, see Chapter 12). 

MRI generates three-dimensional images of the brain struc- 
ture [1]. MRI data can be gathered and analysed in different ways, 
including traditional morphometry, voxel-based morphometry 
(VBM), and diffusion tensor imaging (DTI). In traditional morph- 
ometry, the borders of specific brain regions of interest are traced, 
and the volumes of these regions are calculated and compared be- 
tween patients and comparison groups [2]. In VBM, grey matter 
density is measured at each voxel (that is, three-dimensional pixel) 
in the entire brain, averaged across participants, and compared 
between patient and control groups [3]. By measuring the linear 
diffusion of water molecule protons along axons, DTI can indi- 
cate the location, thickness, health, and orientation of white matter 
tracts [4]. 











fMRI measures blood oxygen level-dependent (BOLD) signal, 
which is the difference in magnetic susceptibility in oxygenated vs 
deoxygenated blood and an indicator of brain activity [1]. During 
emotional or cognitive tasks, brain structures become metabolically 
active and consume oxygen. The flow of oxygenated blood to that 
specific brain region is increased, creating a measurable magnetic 
resonance (MR) signal [5]. {MRI has good spatial and fair temporal 
resolution and does not use ionizing radiation, thus making it a 
highly desirable psychiatric neuroimaging technique. 

PET is a functional neuroimaging technique that allows re- 
searchers to assess regional cerebral metabolic rate for glucose 
(rCMRglu), regional cerebral blood flow (rCBF), or receptor oc- 
cupancy [6]. Due to its poor spatial and temporal resolution and 
because PET requires the injection of radioactive isotopes, it is not 
the preferred method for measuring the brain’s responses to stimuli 
or tasks. However, PET is the preferred method for measuring 
rCMRglu and receptor occupancy. 











Functional neuroimaging studies of stress-related disorders com- 
monly use behavioural or cognitive paradigms such as fear condi- 
tioning and extinction, symptom provocation, cognitive activation, 
and resting state methods. For example, fear conditioning and ex- 
tinction paradigms elucidate the neurobiological basis of failure to 
extinguish fear responses to stimuli that formerly predicted threat 
but no longer do—a model for some PTSD symptoms. Fear con- 
ditioning involves repeated predictive pairings of a previously neu- 
tral conditioned stimulus (CS) (for example, a coloured light) with 
an aversive unconditioned stimulus (US) (for example, a shock), 
to create an association resulting in the CS alone eliciting a condi- 
tioned response (CR) (for example, increased skin conductance) [7]. 
Later, during extinction, the CS is repeatedly presented without the 
US, typically resulting in a diminished CR. Then 24 hours later, the 
memory of extinction training can be assessed by presenting the CS 
without the US. In symptom provocation paradigms, researchers 
present participants with audiotaped descriptions of traumatic 
events (known as ‘scripts’) or trauma-related pictures, odours, or 





sounds during functional neuroimaging in order to elucidate the 
brain regions and networks that mediate the symptomatic state [8]. 
In cognitive activation paradigms, researchers implement cognitive 
tasks that are known to activate specific brain regions or networks in 
order to test their function in PTSD vs comparison groups. Finally, 
resting state paradigms measure fMRI BOLD signal or rCMRglu in 
the absence of a task [9]. Resting state fMRI permits the examin- 
ation of functional connectivity, that is, associations between dif- 
ferent brain regions. 


Neurocircuitry model 


Early psychophysiological and neuroimaging studies began gath- 
ering evidence for a fear conditioning model of PTSD, which posits 
that the principles of Pavlovian fear conditioning and extinction 
underlie the acquisition and maintenance of anxiety and stress- 
related disorders (reviewed in [7, 10]). Through the lens of the fear 
conditioning model, sustained symptoms of PTSD and related con- 
ditions can be conceptualized as an overexpression of fear to cues 
that once predicted a threat but should now be considered safe (for 
example, a veteran who was once injured in a loud combat explo- 
sion and who now experiences fear upon hearing loud fireworks). 
Neuroimaging studies of PTSD have demonstrated structural and 
functional abnormalities in the brain circuits that underlie fear con- 
ditioning and extinction, as well as emotion regulation and memory 
(Fig. 81.1). 

Consistent with its role in the acquisition and expression of con- 
ditioned fear and related defensive behaviours, the amygdala is 
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Fig. 81.1 Structural magnetic resonance images showing brain 
regions of interest in PTSD. (a) A sagittal section showing the dorsal 
anterior cingulate cortex (dACC), as well as structures that comprise 
the ventromedial prefrontal cortex (vmPFC) such as the rostral anterior 
cingulate cortex (rACC), medial frontal gyrus, and subgenual anterior 
cingulate cortex (sgACC). (b) A horizontal section showing the bilateral 
insular cortex. (c) A sagittal section showing the hippocampus and 
amygdala. In PTSD, the amygdala, dACC, and insula tend to be hyper- 
responsive, and the vmPFC tends to be hyporesponsive. Findings 
regarding the hippocampus have been more mixed, as they may depend 
on the specific experimental paradigm. 
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generally hyper-responsive in PTSD, and in many studies, its ac- 
tivation is positively correlated with symptom severity. The struc- 
tures comprising the ventromedial prefrontal cortex (vmPFC) 
generally hold inhibitory influence over the amygdala and are 
key to the learning and subsequent recall of fear extinction. These 
vmPFC structures, such as the rostral anterior cingulate cortex 
(rACC), subgenual anterior cingulate cortex (sgACC), and medial 
frontal gyrus (MFG), are hyporesponsive in PTSD, indicating 
functional deficits in learning and remembering safety informa- 
tion and inhibiting fear responses. Conversely, the dorsal anterior 
cingulate cortex (dACC) is implicated in the expression of learned 
fear and is hyper-responsive in PTSD, and its activation often posi- 
tively correlates with amygdala activation. Likewise, the insular 
cortex tends to be hyper-responsive in PTSD. Given its role in 
monitoring bodily states, insula activation in PTSD may be related 
to subjective experiences like anxiety and pain. Findings regarding 
the hippocampus in PTSD are mixed, with some studies showing 
hyper-responsiveness and some showing hyporesponsiveness. In 
PTSD, difficulty identifying safe contexts and updating contextual 
cues to include safety information, as well as more general deficits 
in declarative and episodic memory, may reflect hippocampal 
dysfunction. 

While the fear conditioning-based neurocircuitry model of PTSD 
has been quite fruitful (for example, [7, 10]), it cannot directly 
account for several important aspects of PTSD symptomatology, 
including endocrine abnormalities, deficits in executive cogni- 
tive function, and ongoing anxiety, in the absence of a conditioned 
stimulus or context (for example, [11, 12]). However, one of the most 
striking aspects of the functional neurocircuitry abnormalities seen 
in PTSD is that they are expressed not only in studies that directly 
assess fear conditioning and extinction, but also in symptom provo- 
cation, cognitive activation, and resting state studies [13]. The fol- 
lowing sections will review some of these studies. 


Amygdala 


Function 


Exaggerated amygdala activation in PTSD is now a well-replicated 
finding that can be observed during fear conditioning paradigms, 
for example in response to an electric shock US [14] and to the 
CS during acquisition of conditioned fear (for example, [15, 16]). 
Furthermore, in PTSD, the amygdala continues to be hyper- 
responsive to the CS, even during or after extinction (for example, 
[16, 17]). These findings are evidence that, relative to controls, the 
amygdala in those with PTSD is hyper-responsive to threat and to 
cues that predict threat and continues to be hyper-responsive to 
such cues after they no longer predict threat. The amygdala also 
shows exaggerated responses to trauma recollection (for example, 
[18, 19]) and trauma-related sensory stimuli (for example, [20-22]). 
Interestingly, amygdala hyper-responsivity can occur in response 
to trauma-unrelated emotional stimuli such as emotional facial ex- 
pressions (Fig. 81.2) (for example, [23-27]) and aversive images (for 
example, [28, 29]). Individuals with PTSD even show exaggerated 
amygdala activity at rest [30] or in response to tasks using neutral 
stimuli [31]. However, not all studies have found increased amyg- 
dala activation in PTSD (for example, [32-34]). 
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Fig. 81.2 (see Colour Plate section) Example of functional magnetic 
resonance imaging (fMRI) data. (a) Amygdala activation to fearful vs 
happy facial expressions was greater in the PTSD group vs control group. 
(b) MR signal change in the amygdala in each condition for each group. 
Error bars represent the standard error of the mean. 


Adapted from Arch Gen Psychiatry, 62(3), Shin LM, Wright Cl, Cannistraro PA, et al., 
A functional magnetic resonance imaging study of amygdala and medial prefrontal 
cortex responses to overtly presented fearful faces in posttraumatic stress disorder, 

pp. 273-81, Copyright (2005), with permission from American Medical Association. 


Within PTSD groups, amygdala activation is positively correl- 
ated with PTSD symptom severity (for example, [18, 23, 26, 35- 
37]), and symptomatic improvement is associated with a decrease 
in amygdala activation (for example, [38-40]). Although these 
findings might suggest that exaggerated amygdala activation is 
a state-like, acquired characteristic of PTSD, other findings sug- 
gest that it may reflect a trait-like vulnerability factor—pre-trauma 
amygdala responses predict greater PTSD symptoms following 
trauma (for example, [41, 42]). Finally, pretreatment amygdala ac- 
tivation appears to predict a less favourable response to treatment 
(for example, [43, 44]). 

Far less research has been conducted on amygdala function in 
ASD. In a small longitudinal study, Reynaud and colleagues (2015) 
found that firefighters who developed ASD showed a greater in- 
crease in amygdala activation post-trauma, relative to pre-trauma 
baseline, despite not showing differences in amygdala activation be- 
fore trauma [45]. However, given the small sample in the ASD group 
(n = 2), this study requires replication. 


Structure 


Findings regarding amygdala structure in PTSD are somewhat 
mixed. Several studies have found smaller amygdala volumes or 
reduced grey matter density in PTSD [46-50], with some studies 
reporting a negative correlation between amygdala volume and 
symptom severity (for example, [46, 51, 52]). However, some 
studies have found greater amygdala volumes in PTSD (for ex- 
ample, [53-55]) or no group differences (for example, [56-60]). 
These discrepancies could be attributed, in part, to variability in 
imaging methods and/or in participant samples across studies. 
Indeed, one meta-analysis suggested that diminished amygdala 
volumes are significant when PTSD groups are compared to 
trauma-unexposed control groups, but not trauma-exposed con- 
trol groups [61]. In general, the latter is a more appropriate com- 
parison group as it controls for exposure to trauma. Furthermore, 
because attempts to relate structural and functional abnormalities 
in the amygdala in PTSD have been lacking, it is difficult to deter- 
mine whether structural abnormalities in the amygdala in PTSD 
are the origin of corresponding functional abnormalities. More 
studies examining amygdala structure and function within the 
same cohort are needed. 

In the only study to examine amygdala structure in ASD, no ab- 
normalities were found [62]. 





Ventromedial prefrontal cortex 


Function 


Individuals with PTSD show relatively diminished vmPFC activa- 
tion during extinction learning and recall (for example, [15, 17, 63]), 
during the recollection of traumatic events (for example, [18, 33, 64- 
71], but see [72]), in response to other trauma-related stimuli (for 
example, [21, 67, 73-75]), in response to trauma-unrelated emo- 
tional stimuli such as fearful facial expressions or unpleasant scenes 
(for example, [22, 24, 27, 76-78]), and even in response to neutral 
stimuli or at rest (for example, [31, 79-85]). Finally, successful treat- 
ment with cognitive behavioural therapy (CBT), eye movement 
desensitization and reprocessing (EMDR), or a serotonin reuptake 
inhibitor (SSRI) appears to be related to increased prefrontal cortex 
activation (for example, [39, 40, 43, 86]). 

However, not all studies have reported diminished vmPFC ac- 
tivation in PTSD; some have reported no group differences or 
greater activation in PTSD (for example, [87, 88]). The direction 
of the functional abnormality in the vmPFC may depend on the 
state of the participants and the nature of the task completed in 
the scanner. For example, greater vmPFC activation can be ob- 
served when PTSD groups experience a dissociative state (for ex- 
ample, [87, 89]) or view stimuli below the threshold of conscious 
awareness [90]. 


Structure 


Structural MRI studies of PTSD have reported abnormalities in the 
vmPFC in the form of reduced volumes (for example, [50, 53, 57,91- 
96]), reduced white matter tract integrity (for example, [97, 98]), and 
reduced grey matter density (for example, [99-108]). In addition, 


relatively smaller pretreatment grey matter density in the pregenual 
ACC predicted a less favourable response to CBT [43]. Kasai and 
colleagues used monozygotic twin pairs discordant for combat ex- 
posure to determine whether grey matter density reductions ob- 
served in PTSD are familial vulnerability factors for developing 
PTSD or an acquired characteristic of PTSD. Results showed that 
grey matter volume density reductions in the pregenual ACC were 
an acquired characteristic of PTSD [102]. 

The only study of vmPFC structure in ASD did not find 
abnormalities [62]. 


Dorsal anterior cingulate cortex 


Function 


Neuroimaging studies have shown dACC hyperactivation in PTSD, 
relative to trauma-exposed control groups, during conditioning, 
early extinction learning, and extinction recall (for example, [17, 
63, 109]). Furthermore, dACC activation is negatively correlated 
with extinction recallin PTSD and trauma-exposed non-PTSD con- 
trols [17], and dACC resting metabolism in PTSD is positively cor- 
related with both activation during extinction recall and symptom 
severity in PTSD [109]. 

dACC hyper-responsivity in PTSD has also been found in studies 
presenting trauma scripts [18] and words [110], emotional facial ex- 
pressions [81], neutral stimuli (for example, [31, 82, 111]), and in 
resting state studies [112]. In one study, greater dACC activation in 
response to negative vs neutral photographs predicted a poorer out- 
come following treatment [44]. 

The findings of two twin studies have suggested that dACC 
hyperactivity and hyper-responsivity are familial vulnerability 
factors for developing PTSD following trauma exposure. In the 
first, Shin and colleagues assessed rCMRglu in combat-exposed 
veterans with PTSD and their identical co-twins unexposed 
to combat, as well as combat-exposed veterans without PTSD 
and their identical co-twins unexposed to combat. Veterans 
with PTSD and their co-twins had significantly greater resting 
rCMRglu in the dACC/mid cingulate cortex (MCC), compared 
with veterans without PTSD and their co-twins (Fig. 81.3) [112]. 
Similarly, using fMRI and the Multi-Source Interference Task, 
Shin and colleagues found greater dACC activation in combat- 
exposed veterans with PTSD and their unexposed co-twins, as 
compared to combat-exposed veterans without PTSD and their 
co-twins [111]. 

Whether the dACC is hyperactivated in ASD is not clear. In one 
symptom provocation study, Cwik and colleagues found greater 
dorsal medial frontal cortex activation in participants with ASD, 
compared to healthy trauma-unexposed control participants [113]. 
However, this increased activation occurred not in dACC proper, 
but in a region just dorsal to that. 


Structure 


Reduced grey matter volume in the dACC has been observed in 
trauma survivors with PTSD, compared to those without PTSD (for 
example, [103, 114]). 
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Fig. 81.3 (see Colour Plate section) Example of positron emission 
tomography (PET) data. (a) Greater resting regional cerebral 
metabolic rate for glucose (rCMRglu) in the dorsal anterior 
cingulate cortex (dACC)/mid cingulate cortex (arrow) in combat- 
exposed twins with PTSD and their identical co-twins, compared 
with combat-exposed twins without PTSD and their identical co- 
twins. (b) Group rCMRglu means. Error bars represent the standard 
error of the mean. 
Adapted from Arch Gen Psychiatry, 66(10), Shin LM, Lasko NB, Macklin ML, et al., 
Resting metabolic activity in the cingulate cortex and vulnerability to posttraumatic 


stress disorder, pp. 1099-107, Copyright (2009), with permission from American 
Medical Association. 





Function 


The insula is hyper-responsive in PTSD during the recollection of 
traumatic events (for example, [64, 115, 116)], the presentation of 
other trauma-related stimuli (for example, [21, 110, 117, 118]), the 
presentation of emotional, but trauma-unrelated, stimuli such as 
negative words, fearful facial expressions, and aversive photographs 
(for example, [76, 87, 119, 120]), and at rest [85]. In addition, greater 
insula activation in response to negative vs neutral photographs pre- 
dicted a poorer outcome following treatment [44]. However, not all 
studies have reported insula hyper-responsivity in PTSD (for ex- 
ample, [22, 29, 68]). 
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Structure 


Insula volume [57] and grey matter density (for example, [100, 102, 
104, 121]) are reduced in PTSD, relative to comparison groups. 
Furthermore, a twin study has shown that diminished grey matter 
volume in the insula appears to be an acquired characteristic of 
PTSD and is negatively correlated with re-experiencing symptoms. 
Thus, this abnormality appears to be state-like and hence could po- 
tentially be used in diagnosis or as a biological indicator of symp- 
tomatic improvement [102]. 


Hippocampus 


Function 


Functional neuroimaging studies of the hippocampus in PTSD 
have been mixed. Some studies found that the hippocampus is 
hyporesponsive in PTSD, for example during extinction recall [17], 
script-driven imagery (for example, [40, 69]), trauma-unrelated and 
neutral stimuli (for example, [73, 122-124]), and at rest (for example, 
[125, 126]). However, other studies show that the hippocampus is 
hyper-responsive in PTSD during tasks with trauma-related stimuli 
such as words [127] and emotional and neutral stimuli (for example, 
[28, 79, 90, 123, 128]). These mixed findings may be accounted for 
by differences in task demands upon the hippocampus or imaging 
modality [129]. 


Structure 


Many studies have reported reduced overall hippocampal volume or 
grey matter volume in PTSD, compared to control groups (for ex- 
ample, [48, 57, 80, 93, 108, 130-141]). In addition, one study found 
that relatively smaller hippocampal volume predicted a less favour- 
able response to treatment with prolonged exposure [142]. However, 
a few studies have reported larger hippocampal volume in PTSD (for 
example, [53, 143]) or no differences in hippocampal volume (for 
example, [50, 144-148]). 

The results of at least one study suggest that diminished 
hippocampal volume is a familial vulnerability factor rather than an 
acquired characteristic of PTSD. In a twin design similar to those 
described previously (for example, [111, 113]), Gilbertson and col- 
leagues found smaller hippocampal volumes in combat veterans 
with PTSD and their identical co-twins, relative to combat veterans 
without PTSD and their identical co-twins. Furthermore, severity 
of PTSD in the combat veterans was negatively correlated with 
hippocampal volume of their identical co-twins, consistent with a 
vulnerability factor [149]. 


Future directions 


Over the past 20 years, an enormous amount of progress has been 
made in our understanding of the neurocircuitry of PTSD. Despite 
this, many questions remain either unanswered or entirely unad- 
dressed. For example, we do not yet know whether neurocircuitry 
abnormalities differ by trauma type, duration of trauma, and age of 
trauma onset. Part of the problem lies in the fact that single studies 
tend to focus on just one trauma type, duration, and age of onset. In 
the future, different imaging data sets, all using the same paradigm 


(for example, fear conditioning/extinction), might be combined to 
permit a proper examination of these variables. 

Most studies that we reviewed have included participants with 
chronic PTSD, and whether the findings extend to acute PTSD is 
undetermined. On a related note, as shown in our literature re- 
view, very little is currently known about the neurocircuitry of ASD. 
Practically speaking, ASD is difficult to study because an ASD diag- 
nosis must be made within the first month after a traumatic event. 
This requirement makes it challenging to find and enrol such parti- 
cipants for imaging studies. 

Another question that is not yet fully answered relates to the origin 
of neurocircuitry abnormalities in PTSD. Most current studies are 
cross-sectional and hence cannot determine whether these abnor- 
malities reflect pre-existing familial vulnerability factors or acquired 
characteristics of the disorder. If an abnormality, such as dACC 
hyper-responsivity, is a familial vulnerability factor and can increase 
the risk for developing PTSD, it could potentially be used to screen 
individuals who are likely to experience trauma and develop PTSD 
such as police officers and firefighters. In contrast, ifan abnormality 
is an acquired characteristic of PTSD, it could potentially be used as 
a biological marker in diagnosis or in the assessment of treatment 
response. In the future, studies implementing longitudinal designs 
(that include a baseline assessment of participants prior to trauma 
exposure and follow participants over time after trauma) or twin 
designs will help clarify the origin of neurocircuitry abnormalities 
in PTSD. 

In the United States, the National Institute of Mental Health 
(NIMH) is now requiring that all psychiatry research it funds be 
conducted in a new framework that focuses on symptom dimen- 
sions, rather than on categorical diagnoses. This means that future 
studies of PTSD will be based on correlational analyses associating 
symptom severity with other measures like brain activation, psy- 
chophysiology, behavioural performance, and self-report. One dif- 
ficulty with this change in approach is that the results of new studies 
may be difficult to fully reconcile with those of existing studies that 
took a more categorical approach. 

One of the goals of neuroimaging studies is to elucidate the 
brain regions or networks that function abnormally in PTSD. 
Such information could lead to new treatments that are focused 
on ‘normalizing’ that brain function. In the future, researchers will 
undoubtedly use non-invasive neuromodulation tools and tech- 
niques such as transcranial direct current stimulation (tDCS) and 
transcranial magnetic stimulation (TMS) to determine whether 
modulating brain function can ameliorate symptoms. 

Another important potential clinical application of neuroimaging 
data in psychiatry is the prediction of response to treatment. For 
example, although CBT is arguably the most effective in the treat- 
ment of PTSD, not all patients respond well, and finding a good 
treatment fit can take months to years. Thus, patients and clinicians 
would benefit from having objective pretreatment measures that can 
be used to predict treatment response. Neuroimaging could be such 
a measure. Furthermore, researchers are beginning to use sophisti- 
cated machine-learning algorithms (which include multiple meas- 
ures such as imaging, genetic, psychophysiological, behavioural, 
clinical, and demographic) to best distinguish between treatment 
responders and non-responders and hence improve the prediction 
of treatment response. 


Neuroimaging studies of stress disorders (predominantly PTSD) 
have revealed functional abnormalities consistent with a fear con- 
ditioning model. The results reviewed herein suggest that the amyg- 
dala and dACC are hyper-responsive in PTSD, consistent with their 
involvement in fear learning and expression. In contrast, the vymPFC 
is hyporesponsive, as might be predicted, given its role in inhibiting 
the amygdala during the successful extinction of fear. The insula is 
hyper-responsive in PTSD, likely reflecting the processing of intero- 
ceptive cues such as anxiety. Finally, functional abnormalities in the 
hippocampus have been more mixed in PTSD, and further research 
ought to clarify its role in memory and identifying safe contexts in 
this disorder. 

Overall, although much progress has been made in our under- 
standing of the neurocircuitry of PTSD, many questions remain 
unanswered. Future research will be needed to clarify the trauma- 
related factors that affect neurocircuitry abnormalities, the origin of 
such abnormalities, and the role of neuroimaging in assessing and 
predicting treatment response. 
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In this chapter, we will focus on prevention of disorders that 
largely arise after single traumatic events. Such events can strike 
many people at once, such as disasters or terrorist attacks, or only 
single or few individuals such as severe motor vehicle crashes 
or a sexual assault. We focus on strategies to help prevent post- 
traumatic stress disorder (PTSD) and acute stress disorder (ASD). 
These are the two most well-researched disorders within the new 
Diagnostic and Statistical Manual of Mental Disorders, fifth edi- 
tion (DSM-5) “Trauma- and stressor related disorders’ chapter [1]. 
Further, a unique aspect of both these disorders within DSM-5 
is the presence of an index traumatic event or events. Therefore, 
once we know that such an event has occurred, PTSD and ASD 
are mental disorders for which we could, and should, seek strat- 
egies to prevent the emergence of the full-blown disorder. We first 
discuss what prevention means in this context. Next, we consider 
diagnostic features of PTSD and ASD to understand what it is that 
could be prevented after trauma. Then, we consider the associated 
epidemiology and risk indicators of developing these disorders, 
that is, the scale of the prevention challenge and who we might 
target. Given the limited prevention strategies to date, consider- 
able further research is needed to advance the field. 





In DSM-5, a new chapter “Trauma- and stressor-related disorders’ 
was formed that includes seven diagnoses (Box 82.1) [1]. The 
focus in this chapter is on PTSD and ASD as the two most well-re- 
searched disorders within this new DSM-5 chapter [1]—and the 
ones for which there is a clear clinical demand for prevention. 
Critically, a unique aspect of both these disorders within DSM-5 
is the occurrence of an index traumatic event or events. A trau- 
matic event here is defined as exposure to actual or threatened 
death, actual or threatened serious injury, or actual or threatened 
sexual violence [1]. Therefore, once we know that such an event 





e Acute stress disorder 

e Post-traumatic stress disorder 

e Reactive attachment disorder 

e Disinhibited social engagement disorder 

e Adjustment disorders 

e Other specified trauma- and stressor-related disorder 

e Trauma- and stress-related disorder not otherwise specified 


Source: data from Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition, 
DSM-5, Copyright (2013), American Psychiatric Association. 











has occurred, PTSD and ASD are mental disorders for which we 
could, and should, seek strategies to prevent the emergence of the 
full-blown disorder. 

Regarding the other diagnoses in the “Trauma- and stressor- 
related’ section of this book (see Chapters 78-85), reactive at- 
tachment disorder and disinhibited social engagement disorder 
are viewed as general consequences of inadequate child care or 
maltreatment. The diagnosis of adjustment disorder reflects the 
case when someone is in a serious crisis and experiences very 
distressing reactions to, and cannot cope with, any of a great vari- 
ation of stressors, often related to life events such as unemploy- 
ment or divorce. The crisis is expected to resolve within a few 
months. The diagnoses of ‘other specified’ or ‘not otherwise speci- 
fied’ trauma- and stressor-related disorder are clinically relevant 
when someone presents with, for example, a debilitating reaction 
to a sudden death in the family that is beyond a crisis or when 
the presenting problems do not match any other trauma-related 
disorder. All of the last three disorders in Box 82.1 are of clin- 
ical utility but have not been researched to any greater extent— 
because of their great variation in what they encompass, they are 
difficult targets for prevention. The childhood-onset disorders re- 
quire preventive strategies targeted towards parents and are not 
described further in this text. 
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Prevention strategies adapted to trauma- and 


stressor-related disorders 





Prevention is, broadly speaking, defined as stopping ‘something’ 
from happening or arising [2]. Preventive interventions can be im- 
plemented in many different ways. A long-used medical model of 
prevention is the ‘three-level concept of prevention’ developed by 
Leavell and Clark [3, 4] in the late fifties. This model makes a dis- 
tinction between primary, secondary, and tertiary prevention inter- 
ventions from a physical health perspective. The primary prevention 
level refers to health promotion and protection from a disease or 
disorder, for example immunization from a disease through vaccin- 
ation. The secondary prevention level refers both to early detection 
and treatment of a disease in order to avoid chronicity (for example, 
prostate-specific antigen screening to detect prostate cancer). 
Tertiary preventions are interventions that aim to reduce as much 
damage as possible from a disease and can be considered equivalent 
to physical rehabilitation. 

Prevention for mental disorders also has as its aim the reduction 
of symptoms, and ultimately the disorder [4]; the term primary 
prevention can encompass a wider range of definitions, compared 
with this medical model [3, 5]. Primary prevention for mental dis- 
orders has been described as including interventions which are 
‘universal’ (targeting the whole population), ‘selective’ (targeting 
population subgroups or individuals who are at higher risk than 
the average population), and ‘indicated’ (targeting high-risk people 
who are identified as ‘having minimal detectable signs or symptoms 
foreshadowing mental disorder or biological markers indicating a pre- 
disposition but who do not meet diagnostic criteria for the disorder at 
that time’ [3, 5]. Of note here is that, according to this latter aspect, 
primary prevention in a trauma context could include targeting 
those with early symptoms (such as intrusive memories of the trau- 
matic event) which may precede the diagnosis of a full-blown dis- 
order such as ASD and PTSD. Ina mental health context, secondary 
prevention has been described as seeking to lower the prevalence 
through early detection and treatment of a diagnosable disease. 
Tertiary prevention interventions include those to reduce dis- 
ability and prevent relapse. The World Health Organization [3, 5] 
uses similar terms regarding mental health and notes that given that 
while the traditional medical model ‘works well for medical disorders 
with a known etiology. Mental disorders, on the other hand, often 
occur due to the interaction of environmental and genetic factors at 
specific periods of life. It becomes difficult even to agree on the exact 
time of onset of a mental disorder. However, as discussed, PTSD and 
ASD are unique in DSM-5 in that their aetiology is clear and forms 
part of the diagnosis—the presence of an index traumatic event. 

In relation to aetiology, another question of importance in pre- 
vention research is when a preventive intervention should be ad- 
ministered. The definition of primary prevention requires further 
thought when applied to psychological trauma. Several terms are in 
use synonymously. Some argue that primary prevention for trauma- 
and stress-related disorders should only encompass interventions 
before trauma exposure, while any intervention after trauma ex- 
posure, but before the development of PTSD (for example, before 
1 month of symptoms), is to be considered secondary prevention 
[6]. However, as described, a form of prevention focusing on early 
symptoms prior to a diagnosis may have utility; traumatic events 


are often unpredictable, which restricts knowledge of who to target 
in advance in the reality of limited health care resources. Therefore, 
there would be benefits for the mental health field in developing ‘se- 
lective’ intervention strategies and tools to offer soon after a trau- 
matic event where the applicability is restricted to a group with 
higher known risk of the disease emerging (see Epidemiology of 
ASD and PTSD, p. 862). Thus, another related view is that, in the 
mental health field, it may also be possible, by analogy, to provide a 
form of ‘therapeutic vaccination after an index traumatic event has 
occurred (that is, a post-event preventive intervention). For example, 
in physical medicine, after a patient is bitten by a dog, a ‘therapeutic 
vaccine’ [7] is given to protect against rabies. After a traumatic event, 
an equivalent ‘primary’ prevention approach would be the metaphor 
of a ‘cognitive therapeutic vaccine’ that could be administered soon 
after the event—whether psychological or pharmacological. We are 
not at the stage where there is a strong evidence base for this ap- 
proach, but the analogy of preventing the worsening of symptoms 
soon after an index stressor is a useful one, both clinically and to 
guide research. 

While the remainder of this chapter will focus on pragmatic and 
research approaches to prevention from a mental health perspec- 
tive, it should be noted that many other fields have a role to play 
in achieving a wider prevention agenda related to trauma. For ex- 
ample, public health and efforts in society in general should seek to 
prevent potentially traumatic events such as road traffic accidents, 
assault with weapons, rape, and so forth. Reducing the incidence 
of potentially traumatic events would clearly have a major role to 
play in reducing the incidence of disorders such as ASD and PTSD. 
Examples of such intervention include implementation of blood al- 
cohol limits for drivers, addition of separation barriers to freeways, 
strengthening of community-based support services, and laws 
limiting access to firearms. 


About post-traumatic stress disorder 


We first consider the background, the definition of traumatic 
events, and the diagnostic features of PTSD to understand the 
nature of symptoms from this particular disorder that could be 
prevented after trauma. PTSD is defined by a set of criteria that 
include the event characteristics, four groups of symptoms, a dur- 
ation criterion, and a distress/disability criterion [1]. The disorder 
is seen as the cardinal psychopathological consequence of trau- 
matic events, although such events may also contribute to several 
other forms of psychopathology [8]. PTSD is often associated with 
high levels of distress, a high degree of chronicity, and high levels 
of disability [9-11]. 

The traumatic event that precipitates PTSD is defined as the ‘index 
event’ and means that the individual is exposed to actual or threat- 
ened death, actual or threatened serious injury, or actual or threat- 
ened sexual violence [1]. Thus, everyday stressors, like losing a job or 
going through a divorce, do not meet the criterion for a potentially 
traumatic event in DSM-5. The exposure to the event could have 
occurred directly or indirectly (learning that a close relative/friend 
has been through a trauma or being exposed to aversive material 
through work). It is not possible to receive a diagnosis of PTSD after 
being exposed indirectly through electronic media, television, or 
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movies (that is, by watching a frightful film clip on the Internet), un- 
less it occurred in the line of work (such as by police reviewing trau- 
matic crime footage). It is important to note that no set of objectively 
discerned individual characteristics of any kind of event have been 
found to consistently lead to PTSD [12-14]. The subjective experi- 
ence of the threatening nature of the event appears to be key factor 
in the ‘toxicity’ of an event [15]. That is, people who experience high 
levels of threat peritraumatically have higher risk for later PTSD 
[15-17]. 

The four symptom clusters of PTSD in DSM-5 include intrusion 
symptoms, avoidance, cognitive and mood changes, and arousal. 
The intrusion symptoms include intrusive memories of the trauma, 
flashbacks, nightmares, and distress in response to reminders of the 
trauma. 

The core clinical feature of PTSD is arguably the recurrent, invol- 
untary, and intrusive distressing memories of the traumatic event 
[18]. Such memories are intrusive in that they spring to mind un- 
wanted and unbidden. They typically take the form of discrete epi- 
sodes from within a trauma—so called ‘hotspots’ in the memory 
[19]. They are recurrent in that it is often the same selection of 
hotspot scenes that appear repeatedly (rather than, say, the whole 
trauma from beginning to end). Such memories are ‘seen in the 
mind’s eye’ and ‘heard in the mind’s ear’—that is, they comprise 
typically visual, but multisensory, mental images [20]. 

The next symptom cluster of PTSD involves avoidance. PTSD pa- 
tients often try to avoid stimuli that are associated with the traumatic 
event. They therefore often avoid thoughts and feelings or situations/ 
persons/places that are reminders of the experienced trauma (for ex- 
ample, car crash survivors often avoid driving or using public trans- 
portation). Consequently, many trauma survivors live restricted 
lives, constantly avoiding reminders of the trauma. 

The third symptom cluster involves marked changes in cogni- 
tions and mood after trauma. Some trauma survivors with PTSD 
report a persistent negative emotional state (e.g. shame), or that 
they feel numb and cannot get in touch with their emotions as they 
did before (e.g. happiness). They can experience persistent cog- 
nitive symptoms, with negative beliefs about themselves (for ex- 
ample, Tm a bad person’), others, and the world (for example, “The 
world is unsafe’), and distorted perceptions about why the trauma 
occurred are also common (for example, ‘It was my fault that the 
trauma occurred’). 

The fourth symptom cluster of PTSD involves arousal symp- 
toms. PTSD patients often describe themselves as in a chronic state 
of alert, constantly scanning the environment for potential threats. 
This group of symptoms also includes insomnia, irritability, and 
concentration—these symptoms can follow from the hypervigilance 
and disrupted stress responses. 

The symptoms must have been present for at least 1 month since 
the index event. This is called the ‘duration criterion, and was origin- 
ally introduced to reduce the risk of overdiagnosis of common acute 
responses to distressing events. The symptoms should cause clin- 
ically significant distress or impairment in social, occupational, or 
other important areas of functioning, and they should not be better 
explained by another psychiatric or somatic disorder. Hence, it is 
important to note that the symptoms must not be attributable to the 
effects of a substance (for example, medication, alcohol) or another 
psychiatric or medical condition, stating the importance of a good 
differential diagnosis. 


About acute stress disorder 


ASD was introduced in DSM-IV as a way to describe a distressing 
response within the first month after the event that could identify 
patients at high risk for developing PTSD [21]. The diagnostic cri- 
teria of ASD are similar to those of PTSD, except that the presenting 
problems do not have to be as many as for PTSD and they should be 
present during the first month. 

ASD was originally thought to be an antecedent to PTSD, and 
although it has some bearing on how to identify trauma survivors 
who are at high risk for developing PTSD, it has become evident 
that many patients with ASD never develop PTSD. In fact, of those 
diagnosed with ASD, less than half will meet criteria for PTSD after 
12 months [22]. 


Assessing PTSD and ASD 


The assessment of ASD and PTSD is slightly different from the as- 
sessment of other mental disorders in that the clinician must as- 
sess not only the presence and severity of the symptoms, but also 
whether or not the individual’s experience satisfies the traumatic 
event criterion and if the presenting symptoms started or worsened 
after the event. It should be evident to the reader that in order to 
make a diagnosis of ASD or PTSD, it requires both trained asses- 
sors and time for the assessments. The clinician should be trained 
and preferably use well-described methods such as the Acute Stress 
Disorder Interview [23] or the Clinician-Administered PTSD Scale 
[24]. Next, we consider the scale of occurrence of traumatic events 
and stress-related disorders. 


Epidemiology of ASD and PTSD 


Prevalence rates 


Traumatic events are common in the population. In the large World 
Mental Health (WMH) surveys, where more than 70,000 respond- 
ents were interviewed between 2001 and 2012, the lifetime exposure 
to potentially traumatic events was estimated to be 70% across 24 
countries [25]. 

How prevalent is PTSD? One influential study was the National 
Comorbidity Survey, conducted in the United States in 1995, in 
which the lifetime prevalence rate of PTSD in the United States 
was estimated to be 8% [10]. Within the UK, the best estimate 
of the whole population PTSD point prevalence is 3%, which is 
derived from the 2007 Adult Psychiatric Morbidity Study [26]. 
In the WMH surveys, the lifetime prevalence of PTSD was esti- 
mated to be 3.9%, with two-thirds having an adult age of onset 
[25]. Interestingly, in populations with a high risk of exposure, 
the prevalence rates vary and are not necessarily higher than the 
general population estimates. For example, United States military 
veterans seem to be at higher risk of combat-related PTSD than 
UK veterans [27]. Among UK veterans, the prevalence was 3-6%, 
which is close to the estimates of PTSD in the general population. 
A review of resettled refugees concluded that resettled refugees in 
western countries could be approximately ten times more likely 
to have PTSD than the general population in those countries 
[28], which is perhaps unsurprising, given the dose-wise effect of 
trauma exposure and the multiple exposure of different events, for 
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example, war, torture, and rape, in this group. Additional studies 
of rescue and recovery workers suggest that the risk of PTSD de- 
creases, the more training and familiarity the personnel in high- 
risk occupations have with the task [29]. 

Some research studies have pointed to evidence that certain types 
of traumatic events make it more likely to develop PTSD. Notably, 
high rates of occurrence are seen after rape and torture. For example, 
the risk of developing PTSD is approximately ten times higher in 
populations exposed to assault, compared to populations that sur- 
vive a motor vehicle crash [30]. It should be noted that there are 
challenges with comparing the conditional prevalence across event 
types; for example, people at risk of an assault can be more likely than 
others to have a history of potentially traumatic events. Prevention 
strategies aimed towards victims of high-risk events such as inter- 
personal violence would therefore benefit from taking into account 
the influences of past stressors [31]. 


The time course of trauma symptoms 


As previously mentioned, it is common to experience some adverse 
reactions such as intrusive memories, avoidance, and hyperarousal 
in the immediate aftermath of trauma. For many these symptoms 
disappear without professional help [32]. This is thought to be the 
case in most trauma populations, including motor vehicle accidents 
[33], rape victims [34], New York City residents following the ter- 
rorist attacks of 11 September 2001 [35, 36], and for individuals who 
experienced the 2004 South East Asia tsunami [37]. 

The high proportion of exposed individuals who either are re- 
silient or experience a natural recovery after trauma means that the 
timing of preventive interventions is important to consider. High 
natural recovery rates mean that many people that are encompassed 
by unselective prevention strategies shortly after an event could re- 
cover anyway, raising issues of cost and efficiency. Interventions 
that target high-risk groups among survivors need to consider the 
time frame. For example, an initial screening to select people for 
preventive interventions may be based on current distress levels, 
though more research on screening would be needed to inform 
evidence-based practice. 

The risk of negative consequences of intervening in people who 
otherwise would recover anyway is also important to consider. 
Adverse effects of psychological interventions have only recently 
received greater attention, driven particularly by the discovery of 
adverse consequences from some early interventions for trauma 
survivors and bereaved individuals [38, 39]. Several reviews 
has shown that critical stress incident debriefing, a widely dis- 
seminated intervention after trauma, may increase, rather than 
decrease, the risk of developing PTSD [40]. In summary, to es- 
tablish successful prevention strategies of trauma-related dis- 
orders in populations exposed to highly distressing events, there 
is a need to consider not only what the strategies should be, but 
also to pay careful attention to the target population and the time 
course of action. 


Trauma and risk indicators 


There has been extensive research in the last decades on risk fac- 
tors in the trauma population and identifying individuals most 
vulnerable following exposure to a traumatic event. A common 
classification of different types of indicators of risk are pre-trauma, 
peritraumatic, and post-trauma indicators. 


Pre-trauma risk indicators 


Gender is perhaps the most well-known pre-trauma risk factor. 
A consistent finding has been that women have more or less twice 
the risk of PTSD, as compared to men overall [41-44]. For example, 
one large study found a lifetime prevalence of PTSD of 20.4% in fe- 
male trauma survivors and 8.2% of male trauma survivors [10]. 
However, studies are also needed which control for exposure to the 
same type of event. Low socio-economic status, previous exposure 
to highly distressing events, previous psychiatric symptoms, and 
high levels of neuroticism also increase the risk of later developing 
PTSD [16, 45]. New research suggests that there may be genes that 
make certain people susceptible to PTSD. Genetic research has ex- 
panded greatly the last decade, and twin studies estimate the herit- 
ability of PTSD to up to 35%, but this varies when individual PTSD 
symptoms are examined [46]. 


Peritraumatic risk indicators 


What happens during the event plays an important role in the de- 
velopment of PTSD. People who thought that they would be killed 
or severely injured during the event have a higher risk of developing 
PTSD [16, 17], and very high levels of distress emotions or dissocia- 
tive symptoms during, and immediately after, the trauma are strong 
indicators of PTSD [16, 47, 48]. In contrast, the degree of phys- 
ical injuries of the trauma, sometimes thought of as an objective 
measure of exposure severity, does not seem to affect the probability 
of developing PTSD [16, 49, 50]. 


Post-trauma risk indicators 


Low socio-economic status plays a role post trauma as well as pre 
trauma in the development of PTSD [16, 51, 52]. Perceptions of a 
lack of social support is one of the strongest indicators of PTSD [16]. 
Catastrophic appraisal of the traumatic event in the long-term after- 
math of trauma is also considered a risk factor [32]. 

Some studies have elucidated a relationship between biomarkers 
and the development of PTSD [53]. One recent meta-analysis 
showed that higher heart rate after trauma exposure predicted 
long-term PTSD symptoms. Neither cortisol nor blood pressure 
was associated with any elevations in PTSD symptoms [54]. Heart 
rate, cortisol, and systolic blood pressure played a significant role 
in developing PTSD in younger patients, but not in the older popu- 
lation [54]. Inflammatory responses in the aftermath of trauma has 
also been investigated in several studies, showing heightened con- 
centrations of inflammatory signals, including cytokines and C- 
reactive protein [55]. 

Although there has been considerable research on the deter- 
minants of trauma and PTSD, science has not yet found decision 
algorithms that can reliably distinguish patients who require inter- 
vention from those who will recover on their own. 


Primary prevention of PTSD 


Primary prevention strategies for PTSD have traditionally used two 
types of approaches—they have either targeted all individuals ex- 
posed to a trauma (for example, a disaster) or targeted those indi- 
viduals who are displaying symptoms of ASD. The most effective 
approach remains unclear. Although several risk indicators for 
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PTSD have been identified, their clinical application is underdevel- 
oped for targeting preventive interventions, and more evidence is 
needed. 


Screening in the aftermath of trauma 


The diagnostic accuracy of screening instruments in the immediate 
aftermath of traumatic events is unclear at this point. Although there 
have been several studies investigating the usefulness of PTSD/ASD 
questionnaires in the immediate aftermath of trauma, results have 
shown overall low positive predictive values [56, 57], meaning that 
if we would screen all individuals who were afflicted by an event by 
using a cut-off value on a self-report screening measure, the number 
of false positives could far outnumber the true cases that would be 
detected. Consequently, the development of European guidelines for 
psychosocial services following a disaster [58] reached a consensus 
against screening everyone affected because of the absence of evi- 
dence for the effectiveness of screening and scarce resources could 
be better allocated elsewhere. 


Pharmacological prevention 


Several pharmacological treatments have been proposed as 
possible preventive strategies of PTSD. The first trials testing 
pharmacological agents in a preventive context for PTSD have 
used benzodiazepines. However, these trials showed negative re- 
sults. If anything, there were tendencies on harmful effects, that is, 
benzodiazepines could potentially increase the risk of developing 
PTSD [59, 60]. 

Another pharmacological preventive approach that has been 
tested is the B-adrenergic blocker propranolol. Propranolol is 
thought to block and disrupt the consolidation of newly acquired 
trauma memory via post-synaptic B-adrenergic receptors in the 
basolateral nucleus of the amygdala [61]. One double-blinded 
trial conducted in 2002 with 41 participants found positive results 
in preventing PTSD with propranolol in an emergency room set- 
ting [62]. However, a subsequent trial comparing propranolol with 
gabapentin and placebo (N = 38) administered propranolol within 
48 hours of trauma exposure and found no difference in PTSD rates 
in physically ill patients [63]. One study randomized 29 children 
to propranolol or placebo; results showed a tendency to negative 
effects in girls and no significant difference in PTSD rates overall, 
compared to placebo [64]. One double-blinded trial with 41 par- 
ticipants attempted to prevent PTSD by administering propranolol 
within 6 hours after exposure to trauma in an emergency room 
context. Results were overall negative, but patients randomized to 
propranolol showed less psychological reactivity, compared to the 
placebo group, when confronted with a trauma reminder 3 months 
later [65]. 

The administration of glucocorticoids is thought to result in the 
impairment of retrieval of declarative memory and to decrease the 
hyperactive fear response that is associated with the newly acquired 
trauma memory [66]. Schelling et al. performed two RCTs exam- 
ining the effects of stress doses of hydrocortisone in preventing 
PTSD symptoms, following septic shock and cardiac surgery, in in- 
tensive care units. They found a significant reduction of PTSD symp- 
toms in long-term survivors, as well as improvements in quality of 
life outcomes [67, 68]. Another RCT in an emergency room con- 
text on high-dose hydrocortisone treatment administered in the 
first hours after trauma exposure was associated with a lower risk of 


developing PTSD [69]. Administration of low-dose hydrocortisone 
in another RCT was found to prevent the onset of PTSD, compared 
to placebo, particularly in acutely injured trauma victims without a 
history of significant psychopathology [70]. 

There have also been attempts to test other pharmacological 
agents such as morphine [71], SSRIs [72], and ketamine [73], but 
high-quality randomized trials are currently lacking. Two natural- 
istic studies of morphine administration following trauma exposure 
in civilian and combat-related injury have suggested that morphine 
may prevent later PTSD [71, 74]. One RCT on the effect on early ad- 
ministration of imipramine and fluoxetine in thermally injured chil- 
dren showed no effect in preventing PTSD [72]. Intranasal oxytocin 
has also been evaluated as a prevention strategy for PTSD, showing 
effect in patients with a high acute clinician-rated PTSD symptom 
severity [75]. Ketamine administration in naturalistic studies on 
moderately injured accident victims has been shown to decrease the 
incidence of PTSD [76, 77]. 

To summarize, although some important steps have been taken, 
more research on pharmacological agents to prevent the develop- 
ment of PTSD is still needed. Some studies have shown promising 
effects, but it is too soon to provide any definite recommendation in 
clinical practice yet. Importantly, one systematic review concluded 
that the small number of studies and their limited methodological 
quality cast uncertainty about the effects [78]. 


Psychological prevention 


A range of different early psychological interventions have been 
developed in order to prevent negative psychological reactions 
after trauma. However, as in the pharmacofield, high-quality clin- 
ical trials have been lagging considerably behind. Descriptions of 
the most well-researched interventions are given in the following 
sections. 


Critical incident stress debriefing 


Critical incident stress debriefing (also commonly known as ‘de- 
briefing’) takes place in the first hours following a traumatic event 
and typically consists of one individual or group session (1-3 hours), 
in which the participants describe what they experienced, their 
thoughts and feelings during the event, and how they think of how 
this event will influence them in their future [79]. Interestingly, there 
are several reviews of this intervention that have found that it may 
actually increase, rather than decrease, the risk of developing PTSD, 
that is, interfering with the natural psychological recovery after 
trauma [80, 81]. This is of some concern, as this is a commonly used 
intervention worldwide by, for example, first responders, govern- 
ment organizations on victims of mass disasters, and responders to 
traumatic incidents in the work setting. A summary of the Cochrane 
review even stated that ‘compulsory debriefing of victims of trauma 
should cease [40]. 


Cognitive behavioural interventions 


Trauma-focused cognitive behavioural therapy (CBT) has been 
tested as a secondary preventive strategy (that is, treating patients 
with ASD with the aim to lower the incidence of PTSD) in several 
trials, showing a decreased incidence of PTSD, in comparison to 
supportive counselling [82, 83, 84]. In this treatment, the patient 
usually receives 4-5 sessions covering standard CBT techniques 
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(psychoeducation, anxiety management, exposure, and cognitive 
restructuring) a couple of weeks after the traumatic event. 

In the case of mental health primary prevention (that is, treating 
individuals immediately after trauma exposure), the evidence base is 
more limited. Resnick et al. [85] developed a psychological interven- 
tion for rape victims that consists of a 17-minute video presentation 
consisting of general psychoeducation and coping strategies for anx- 
iety. This intervention was shown to decrease both PTSD symptoms 
and marijuana abuse compared with standard care in a randomized 
trial with 140 participants with sustained long-term effects [86]. 

Another primary prevention strategy tested is ‘prolonged ex- 
posure’ (PE) which is a form of CBT. In PE treatment, PTSD symp- 
toms are seen as signs of an incompletely processed trauma memory, 
making the memory associated with high levels of fear and stress. 
The high level of distress drives avoidance behaviours that prevent 
the person from facing their memories and situations that, since the 
trauma, are perceived as dangerous, although objectively safe, and 
from overcoming their problems. Thus, in this model, the patient 
is asked to reactivate the trauma memory repeatedly in a safe and 
non-stressful context (that is, visualizing the event in their mind’s 
eye several times—the so-called imaginal exposure). The aim of this 
is to allow the fearful responses associated with the memory to grad- 
ually decline and afterwards to verbally process the revisiting of the 
traumatic memory. The patient is also asked to confront objectively 
safe but feared situations (the so-called exposure in vivo). 

There is, to date, one randomized trial that has tested prolonged 
exposure as an early intervention after trauma [87]. In this study, 137 
emergency trauma care patients received three sessions with a clin- 
ical psychologist, starting the intervention 12-24 hours after trauma 
exposure in the emergency department. Results showed that, com- 
pared to a control group consisting of assessment only, the modified 
PE intervention decreased PTSD symptoms at 12 weeks later [87]. 
Trials that aim to replicate and extend these promising findings are 
needed. 


Cognitive science-driven interventions 


A new line of enquiry from experimental psychopathology studies 
has sought to create psychological prevention techniques ‘de novo’ 
from cognitive science—that is, suggesting that by translating 
emerging neuroscientific insights and experimental research, we 
begin to develop new low-intensity psychiatric intervention that 
could prevent intrusive memories following trauma [88]. It dif- 
fers from the application of PE or CBT techniques in several ways. 
First, this line of research focuses on targeting a core clinical fea- 
ture of post-traumatic distress, rather than a whole disorder—here 
the recurrent intrusive visual memories of a traumatic event which 
are distressing and disruptive. Second, the preventive behavioural 
intervention is derived from various combined aspects of cogni- 
tive neuroscience (rather than being an adaptation of an existing 
talking therapy). The hypothesis is that engaging in a visuo-spatial 
task during memory consolidation competes for working memory 
resources with visual imagery and interferes with the formation of 
intrusive memories. Dual-task experiments also indicate that when 
similar cognitive tasks compete for shared resources, they interfere 
with each other and thereby impede memory processing; for ex- 
ample, a visuo-spatial pattern-tapping task interfered with holding a 
visual mental image in mind (making it become less vivid and emo- 
tional) [89]. An example of such a visuo-spatial task is the computer 


game Tetris [90, 91]. Critically, a competing task needs to be done 
within the time period of memory consolidation—within an esti- 
mated 6 hours following an event [92], during which time memory 
for events is still labile and vulnerable to disruption. Several la- 
boratory experiments were done using healthy volunteers before 
bringing this approach to ‘the real world’ 

As a first proof-of-concept step towards clinical translation, 
Iyadurai et al. [88] compared a Tetris-based intervention (trauma 
memory reminder cue plus about 20 minutes of Tetris game play) 
vs attention-placebo control (written activity log for the same dur- 
ation), both delivered in an emergency department within 6 hours 
of a traumatic motor vehicle accident. The primary outcome of the 
RCT was the number of intrusive trauma memories (core clin- 
ical feature post-trauma) measured daily in the subsequent week. 
Results supported the efficacy of the Tetris-based intervention, 
compared with the control condition; there were fewer intrusive 
memories overall, while time-series analyses showed that intrusion 
incidence declined more quickly. Convergent findings were found 
on a measure of clinical post-trauma intrusion symptoms at 1 week, 
but not on other symptom clusters or at 1 month [88]. This proof- 
of-concept study opens the way for a larger trial, powered to detect 
differences at 1 month. 

A similar pattern of results has been found in a conceptual rep- 
lication by Horsch et al. [93] who tested the Tetris intervention on 
women in hospital who had a traumatic childbirth in the previous 
few hours [93]. The treatment group received usual care plus the 
Tetris-based intervention (20 minutes of Tetris game play, but no 
reminder cue as mothers were in the same hospital context as where 
the index trauma occurred, so a reminder was not deemed neces- 
sary) vs a control group of usual care. The reduction in the number 
of intrusive memories was almost two-thirds in the following week 
in mothers who played Tetris, compared with the control group. At 
1 month, there was an indication of a lower rate of PTSD in the treat- 
ment condition. 

Advantages of this type of cognitive science-driven approach in- 
clude the fact that it is brief (approximately one half-hour session), 
it does not require a highly skilled therapist or talking about the 
trauma—and it could be readily scalable, which is required for pre- 
ventive approaches post-trauma. 


Other psychosocial interventions 


Psychoeducation delivered via a self-help booklet has also been 
investigated in two trials but has not demonstrated any effect in 
preventing PTSD [94, 95]. An RCT using an early eye movement de- 
sensitization and reprocessing (EMDR) protocol delivered either 48 
or 72 hours after exposure to trauma showed a decreased incidence 
of PTSD symptoms 3 months later in patients exposed to workplace 
violence [96]. However, the control group in this study consisted of 
debriefing, and as debriefing can potentially be harmful (see Critical 
incident stress debriefing, p. 864), the results should be interpreted 
with caution. 

Psychological first aid (PFA) is a set of practices that were com- 
piled in their current form by the National Child Traumatic Stress 
Network and the National Center for PTSD to reduce PTSD in ado- 
lescents, adults, and their families. The model is based on a set of 
principles: sense of safety, calming, sense of self, and community ef- 
ficacy, connectedness, and hope [97]. Importantly, despite the wide- 
spread acclamation of PFA as an ‘evidence-informed’ intervention 
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[98], comprehensive literature reviews have not yet found evidence 
of its efficacy in the prevention of PTSD. 

To summarize, as in the case of pharmacological agents, there are 
to date as yet no psychological interventions that have shown con- 
sistent, reliable results in preventing the development of PTSD, with 
the exception of a form of trauma-focused CBT with patients who 
already have ASD for which there is some evidence. Importantly, 
there are too few published high-quality large-scale trials to give any 
consistent recommendation of what psychological treatment to use 
for trauma survivors in the early aftermath of trauma. 


A considerable amount of research has looked at risk factors for 
PTSD, but no stable and reliable decision algorithm has been de- 
veloped that can distinguish patients who require intervention 
from those who will recover on their own. Some early trials on pri- 
mary intervention have shown promising effects, but it appears it 
is too soon to provide strong recommendations for clinical prac- 
tice. After high-profile events such as a disaster, immediate post- 
trauma support can be demanded—it is as important to know the 
evidence on what interventions not to give (for example, debriefing 
is counterindicated). Meanwhile, active research efforts are needed 
to develop a range of interventions that could be used soon after 
trauma. Given the presence of an index traumatic event—a known 
aetiology—prevention efforts based on science and clinical practice 
should be within our reach. 
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Mental disorders where trauma or stress plays a central role are 
grouped together in “Trauma- and stressor-related disorders’ in the 
Diagnostic and Statistical Manual of Mental Disorders, fifth edition 
(DSM-5) [1] and in ‘Disorders specifically associated with stress’ 
in the beta version of the International Classification of Diseases, 
eleventh revision (ICD-11) [2]. This group of disorders can only be 
diagnosed following exposure to a stressful or traumatic event. In 
DSM-5 and the beta version of ICD-11, these include reactive at- 
tachment disorder, disinhibited social engagement disorder, post- 
traumatic stress disorder (PTSD), and adjustment disorders (ADs). 
DSM-5 also includes acute stress disorder (ASD), whereas the beta 
version of ICD-11 distinguishes between PTSD and complex PTSD 
and includes prolonged grief disorder. The beta version of ICD-11 
specifies an acute stress reaction, but this is dissimilar from ASD as 
it is viewed as a normal response to trauma that subsides within a 
few days. Here we will focus on the management of PTSD, ASD, and 
ADs in adults. The management of ASD is discussed in the context 
of prevention of PTSD. 


















































Prevention 


Both ASD and PTSD develop following exposure to a traumatic 
event involving actual or threatened death, serious injury, or sexual 
violence; ASD occurs 3 days to 1 month following the trauma, and 
PTSD is diagnosed when the symptoms have lasted at least 1 month 
[1]. Their clear relation to an instigating event(s) provides an op- 
portunity to institute strategies to prevent the development of ASD 
and PTSD. Preventing the development of PTSD is of major societal 
significance, as the disorder is often chronic and disabling once it 
sets in. Although several individuals who have ASD go on to develop 
PTSD, there are also many who develop PTSD without a prior diag- 
nosis of ASD [3]. In the context of PTSD, primary prevention refers 
to preventing exposure to traumatic events or strategies aimed at 





enhancing resilience (for more information on primary prevention, 
see Chapter 82) [4]. Secondary prevention entails interventions in- 
stituted after trauma exposure, with the intention of preventing the 
development of PTSD. Once PTSD symptoms are present, the aim of 
tertiary prevention is to treat symptoms early and try to avert the de- 
velopment of chronic PTSD. Prevention strategies can be universal, 
aimed at all individuals exposed to a traumatic event, or can be tar- 
geted towards individuals at higher risk of developing PTSD, such as 
individuals with ASD. 


Psychological interventions 


Various psychological interventions have been investigated 
in the prevention of PTSD, as well as in the treatment of ASD. 
These include single-session interventions, such as debriefing, 
and multiple-session interventions such as counselling, multiple- 
session debriefing, cognitive behavioural therapy (CBT), memory 
restructuring, and psychoeducation. Debriefing is a psychological 
intervention usually provided in a group format, as a single session, 
within a few days after exposure to a traumatic event. Debriefing 
sessions usually involve a discussion of the traumatic event and en- 
courage participants to recollect their experience and to express 
their emotional reactions to the event. The intention of debriefing 
is to reduce acute distress and prevent the development of psychi- 
atric disorders—PTSD, in particular. Systematic reviews and meta- 
analyses have consistently found that debriefing does not reduce 
the incidence of PTSD nor does it lead to improvements in PTSD 
symptoms, depression, anxiety, or functioning [5, 6]. There is even 
some evidence that debriefing may worsen PTSD outcomes [6]. 
Similarly, a Cochrane review found that early multiple-session psy- 
chological interventions were not superior to control conditions 
for any PTSD-related outcomes when administered to all trauma 
survivors [7]. There is, however, evidence from meta-analyses that 
trauma-focused CBT (TF-CBT) is superior to waitlist/treatment 
as usual (TAU), as well as supportive counselling in the treatment 
of ASD (TF-CBT is described in Psychotherapy, p. 870) [5, 8]. 
Although various other psychosocial interventions have been in- 
vestigated, there is insufficient evidence at this stage to support any 
of them in the prevention of PTSD. 
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Pharmacotherapy 


Medications that have been investigated in the prevention of PTSD 
include steroids (hydrocortisone), beta-blockers (propranolol), 
drugs for depression (fluoxetine, escitalopram, sertraline, and 
imipramine), benzodiazepines (BZDs) (temazepam, alprazolam, 
and clonazepam), anticonvulsants (gabapentin), opioids (mor- 
phine), beta-adrenergic agonists (albuterol), NMDA receptor ant- 
agonists (ketamine), and omega-3 fatty acids. The only drug that 
has demonstrated efficacy in preventing PTSD in meta-analyses 
is hydrocortisone. In two meta-analyses (n = 5 studies; n = 164), 
hydrocortisone was superior to placebo in reducing the incidence 
of PTSD [number needed to treat (NNT) = 7-13], as well as in 
improving PTSD symptom severity [9, 10]. In the studies, hydro- 
cortisone was usually administered within 6-12 hours of trauma 
exposure at doses of between 20 mg and 140 mg. Glucocorticoids 
are hypothesized to work by facilitating extinction learning and 
impairing memory consolidation of the traumatic event. The 
known adverse effects of hydrocortisone, such as affective changes, 
increased susceptibility to infection, and detrimental cardiovas- 
cular outcomes, remain a concern. Another drug investigated in 
PTSD prevention—propranolol—showed promise in early trials 
but has not demonstrated efficacy in meta-analyses [9, 10]. Beta- 
blockers are thought to disrupt memory consolidation and to 
dampen sympathetic tone and, in so doing, to reduce hyperarousal 
and intrusive symptoms. The few studies that have evaluated drugs 
for depression, BZDs, anticonvulsants, and omega-3 fatty acids 
have largely been negative [9, 10]. Morphine administered fol- 
lowing trauma exposure, in cohort studies, may reduce the risk of 
PTSD presumably by impairing amygdala fear conditioning [4, 9]. 
Similarly, there is limited evidence from cohort studies that keta- 
mine and albuterol administration following trauma may mod- 
erate the incidence of PTSD [4, 9]. At this stage, there is insufficient 
evidence to recommend the routine clinical use of any pharmaco- 
logic intervention. 


Other interventions 


Another approach examined in the prevention of PTSD involves 
collaborative stepped care where trauma-exposed individuals are 
assigned case managers and referred for evidence-based treatments 
(CBT and/or pharmacologic treatments), according to individual 
requirements. Although studies are limited, collaborative care ap- 
pears to lead to improved outcomes, compared to usual care [4, 5]. 


Conclusions 


Overall, there is a lack of evidence to draw clear conclusions for 
most interventions [5]. The only intervention that can be firmly 
recommended is TF-CBT, but only once symptoms of ASD are 
present. No universal interventions aimed at all trauma-exposed 
individuals can currently be supported. A collaborative care ap- 
proach could possibly assist in identifying individuals with PTSD 
symptoms earlier and increasing access to evidence-based inter- 
ventions. Without clear evidence to guide interventions, clinicians 
should employ a pragmatic approach. This entails not interfering 
with spontaneous recovery, but rather providing practical assist- 
ance and support and monitoring for the emergence of symptoms 
of ASD or PTSD and timely referral for appropriate treatment 
if symptoms persist or are distressing. Providing information 


regarding responses to trauma and when to seek help to trauma- 
exposed individuals and encouraging them to engage with their 
available sources of social support may also be helpful. Further re- 
search is required to improve detection of individuals at increased 
risk of PTSD to allow for targeted interventions. Well-designed 
trials are required to determine the efficacy of interventions and 
to determine individual factors influencing treatment outcomes. 
Timing of preventive interventions may be of particular import- 
ance, especially if the aim is to disrupt memory consolidation of 
the traumatic event. 


Psychotherapy 
Trauma-focused cognitive behavioural therapies 


TF-CBT is the most frequently studied form of psychotherapy in 
PTSD. TF-CBT is a broad category of therapies where the principles 
of CBT have been adapted to focus on behaviours and cognitions 
central to the disorder. Some TF-CBTs are primarily behavioural, 
while others are predominantly cognitive, and others have a com- 
bined behavioural and cognitive approach. TF-CBT is usually pro- 
vided in weekly sessions lasting 60-90 minutes for around 8-12 
weeks and can be delivered individually or in groups. All TF-CBTs 
incorporate psychoeducation regarding PTSD, particularly during 
initial sessions, and frequently also impart anxiety management 
techniques. The broad aim of TF-CBT is to alter maladaptive 
thinking and behaviours that develop in relation to the traumatic 
event and thereby alleviate symptoms and improve functioning. 
Cognitive-based therapies focus on altering trauma-derived cog- 
nitive distortions and negative beliefs. Specific manualized vari- 
ants of cognitive therapy have been developed, such as cognitive 
restructuring and cognitive processing therapy (CPT), which 
allow for a standardized approach. The behavioural-based inter- 
ventions usually involve exposure therapies, which are founded on 
the principles of extinction learning, whereby repeated exposure 
to trauma-related cues in a safe environment leads to extinction 
of the fear response. The most frequently studied form of exposure 
therapy in PTSD is prolonged exposure (PE), a manualized inter- 
vention. Exposure therapy involves either imaginal or in vivo ex- 
posure to trauma-related stimuli, while anxiety and distress are 
managed with a relaxation technique such as controlled breathing. 
In imaginal exposure, the person recalls memories of the event, 
while in vivo exposure involves exposure to situations that the 
person avoids because they trigger reminders of the trauma. Recent 
meta-analyses have demonstrated the efficacy of individual TF- 
CBT in changing PTSD diagnostic status (NNT 2-4), improving 
PTSD symptoms, depression, anxiety, and functioning, as com- 
pared to waitlist/TAU [11-14]. The results remain similar, whether 
the focus is on cognitive therapy, exposure therapy, or mixed TF- 
CBT approaches, and effects appear to persist up to 12 months fol- 
lowing completion. Group TF-CBT has also demonstrated efficacy 
for PTSD-related outcomes, although the number of studies are 
limited [11]. Gender appears to influence outcomes, as effects are 
larger in women-only studies [11]. Meta-analyses have reported 
higher dropout rates for TF-CBT than for waitlist/TAU, which 
raises concerns regarding the acceptability of TF-CBT [11]. 
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Non-trauma-focused cognitive behavioural therapies 


Other CBT-based therapies that do not focus on the specific trauma 
have also been investigated in PTSD. The most frequently studied 
non-TF-CBT is stress inoculation training (SIT), which focuses on 
increasing an individual’s ability to manage stress and to prevent fu- 
ture episodes of anxiety. It incorporates coping skills training and 
relaxation techniques, as well as broad CBT-based approaches. The 
frequency and duration of sessions are similar to those used in TF- 
CBT. Although fewer studies have investigated non-TF-CBT, results 
indicate efficacy in improving PTSD symptoms, depression, and 
anxiety directly following treatment [11, 14]. Other meta-analyses, 
however, cite insufficient evidence for the efficacy of SIT [12]. 


Eye movement desensitization and reprocessing 


Eye movement desensitization and reprocessing (EMDR) has fea- 
tures in common with other TF-CBTs and is one of the most com- 
monly studied forms of psychotherapy in PTSD. During EMDR 
sessions, individuals bring to mind the traumatic experiences and 
negative cognitions associated with the trauma, while performing 
saccadic eye movements as they follow the therapist's finger from left 
to right for a few seconds. The process is repeated until the person 
becomes desensitized to the distressing content, after which the 
individual practises holding positive self-efficacious statements in 
mind, while imagining the trauma. The saccadic eye movements are 
thought to interfere with working memory and to reduce distress re- 
lated to the trauma and thus reduce avoidance [15]. Treatment with 
EMDR also usually involves 8- to 12-weekly sessions, lasting around 
60-90 minutes each. Meta-analyses have reported that EMDR 
is superior to waitlist/TAU in changing PTSD diagnostic status 
(NNT = 2), improving PTSD symptoms, depression, and anxiety up 
to 3 months following treatment [11-14]. 


Other psychotherapies 


Psychodynamic psychotherapy, which is usually a longer-term 
intervention, employs traditional psychodynamic principles of 
bringing unconscious content into conscious awareness where it can 
be processed and conflict surrounding the trauma can be resolved. 
Brief eclectic therapy (BET) is a manualized therapy provided over 
16 sessions that combine CBT and psychodynamic approaches. 
Interpersonal therapy (IPT) is usually provided in 10-20 sessions 
and concentrates on improving interpersonal relationships to en- 
hance the management of interpersonal distress. Narrative exposure 
therapy (NET) combines features of exposure therapy, as well as 
narrative therapy, to allow desensitization and reframing of the 
traumatic experience within the person's life story. Person-centred 
therapy and supportive counselling are non-directive and allow 
the person to talk about their problems and feelings, with support 
from an accepting therapist. Supportive therapy is often used as an 
active comparator in clinical trials. A Cochrane review that com- 
bined other therapies’ reported superiority to waitlist/TAU in PTSD 
symptoms, depression, and anxiety directly following treatment; 
however, dropout rates were higher than waitlist/TAU [11]. There is 
some evidence of efficacy, based on a limited number of studies, for 
NET and BET in improving PTSD-related outcomes above waitlist/ 
TAU [12, 13]. IPT was not superior to waitlist/TAU in one meta- 
analysis [14], but group IPT demonstrated efficacy in a single trial 
[13]. Psychodynamic psychotherapy, resilience therapy, and hypno- 
therapy have demonstrated efficacy only in single small trials [13]. 


A meta-analysis of six hypnotherapy trials found large effects in im- 
provements of PTSD symptoms post-treatment but did not report 
outcomes, as compared to control conditions [16]. 


Comparative effectiveness of psychotherapies 


In terms of efficacy, the largest body of evidence, supported by the 
largest effect sizes [13], is for TF-CBT, followed by EMDR. Although 
there is some evidence for non-TF-CBT and other psychotherapies, 
these results are less clear. Effect sizes for all psychotherapeutic 
interventions are larger when compared against waitlist/TAU than 
against active comparators [13]. Fewer head-to-head comparisons 
for psychotherapies exist. One meta-analysis found no difference 
between PE and cognitive-based TF-CBTs, SIT, or EMDR [17]. 
Exposure therapy has demonstrated superiority to relaxation in re- 
ducing PTSD symptoms [12]. There is some evidence that TF-CBT 
and EMDR are superior to non-TF-CBT and ‘other therapies’ for 
certain PTSD-related outcomes [11]. One meta-analysis found that 
EMDR was superior to TF-CBT only in improving anxiety symp- 
toms directly following treatment [17]. Only one study observed 
non-TF-CBT was superior to ‘other therapies’ [11]. 


Pharmacotherapy 


The most commonly studied medications are drugs for depression, 
SSRIs in particular, followed by drugs for psychosis. 


Drugs for depression (antidepressants) 


Meta-analyses have demonstrated small, but significant, effects 
favouring these drugs overall [13], and SSRIs as a class [13, 18], above 
placebo. Although guidelines often combine SSRIs as a drug class in 
treatment recommendations, the evidence reveals that this may not 
be appropriate in the management of PTSD. Currently, paroxetine 
and sertraline are the only two SSRIs that are FDA-approved for 
PTSD. Of the individual SSRIs, paroxetine and fluoxetine have 
consistently been found superior to placebo in meta-analyses [13, 
14, 18]. There are some inconsistencies in meta-analyses regarding 
the efficacy of sertraline, with some reporting superiority to placebo 
[13, 14] and others not [18]. These inconsistencies can be explained 
by varying inclusion criteria in clinical trials (that is, some more 
stringent than others) and by whether or not unpublished results 
were included. Citalopram and escitalopram have failed to demon- 
strate efficacy in a few studies [13, 14, 18]. The serotonin and nor- 
adrenaline reuptake inhibitor (SNRI) venlafaxine has demonstrated 
superiority over placebo in meta-analyses including two RCTs [13, 
14, 18]. Meta-analyses have not shown efficacy for the monoamine 
oxidase inhibitor (MAOI) brofaromine, the tricyclic antidepressants 
(TCAs) desipramine and imipramine, and the noradrenaline/dopa- 
mine reuptake inhibitor (NDRI) bupropion [13, 14, 18]. The results 
for the MAOI phenelzine, the TCA amitriptyline, the noradrenergic 
and specific serotonergic antidepressant (NaSSA) mirtazapine, 
and the discontinued serotonin antagonist and reuptake inhibitor 
(SARI) nefazodone have been inconsistent in meta-analyses [13, 
14, 18]. Even in those reporting superiority over placebo, the results 
were based on single trials. 


Drugs for psychosis (antipsychotics) 


In two meta-analyses, dopamine antagonists were superior to pla- 
cebo overall, but with a small effect [13, 14]. In two meta-analyses, 
olanzapine failed to demonstrate efficacy [13, 18], but in another, it 
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was superior to placebo as monotherapy, but not when used as aug- 
mentation of an antidepressant [14]. Risperidone was superior to 
placebo, both as monotherapy and when used in augmentation in 
two meta-analyses [13, 14], but failed to demonstrate any efficacy in 
another [18]. Based ona single trial, aripiprazole was not superior to 
placebo [14]. Despite these small positive effects, their use is limited 
by their significant adverse effects. 


Anticonvulsants 


Anticonvulsants as a class have not demonstrated efficacy 
[13, 14]. One meta-analysis found that topiramate was superior 
to placebo as monotherapy in two trials and as augmentation in 
a single trial [13], whereas others did not observe evidence of its 
efficacy [13, 18]. The other anticonvulsants valproate semisodium, 
tiagabine, and lamotrigine have failed to demonstrate efficacy in 
limited trials [13, 14, 18]. 


Benzodiazepines 


Systematic reviews and meta-analyses have consistently found that 
BZDs are ineffective in preventing or treating PTSD symptoms 
[13, 14, 18, 19]. Furthermore, BZDs increased the likelihood of 
developing PTSD and were associated with other negative sequelae 
such as worsened PTSD severity, psychotherapy outcomes, depres- 
sion, anxiety, aggression, and substance use [19]. 


Adrenergic drugs 


Although there have been some negative findings [13], the alpha-1 
adrenergic receptor antagonist prazosin has been found to be su- 
perior to placebo in meta-analyses in improving PTSD symptoms 
and sleep quality and in reducing nightmares [14, 20]. Prazosin pre- 
sumably works by dampening noradrenergic activity, and thereby 
decreasing hyperarousal, as well as by disrupting reconsolidation of 
fear memories [19]. Prazosin is usually initiated at 1 mg and titrated 
up every 2-3 days up to mean daily doses of between 3 mg and 15 
mg. It is usually well tolerated, although dizziness is the most com- 
monly reported adverse effect. The alpha-2 agonist guanfacine has 
not demonstrated efficacy in PTSD thus far [13, 14]. 


Table 83.1 Evidence-based treatments for stress-related disorders 


Other emerging drugs 


Efficacy has been demonstrated for GR205171 (a neurokinin-1 an- 
tagonist) [18] and the herbal drug gingko biloba [13] in single RCTs. 
Due to its positive effects in depression, ketamine has received in- 
creased interest in the treatment of PTSD. Ketamine, commonly 
used as an anaesthetic for surgical procedures, was associated in 
a case series with both worsened and improved PTSD symptoms. 
A pilot RCT (ketamine vs midazolam) and a case study reported im- 
provements in PTSD symptom severity [21]. A meta-analysis of 19 
intranasal oxytocin trials in various mental disorders, including two 
PTSD trials, reported a small effect favouring oxytocin; however, of 
the individual disorders, significant improvements were only found 
for autism spectrum disorders. The two trials evaluating oxytocin in 
PTSD were, however, very small (n = 18-30), one of which showed 
promising results [22]. Both oxytocin and ketamine warrant further 
investigation in PTSD. 


Combined psychotherapy and pharmacotherapy 


A Cochrane review evaluated whether combined pharmaco- 
therapy and psychotherapy was superior to either treatment mo- 
dality alone [23]. Only four trials were eligible for the review, and 
all of the trials utilized SSRIs and TF-CBT. Two trials compared 
combined treatment to pharmacotherapy alone and found no evi- 
dence that combined treatment was superior. Two trials, one in 
adults (n = 25) and one in children (n = 24), compared combined 
treatment to psychotherapy alone and found no difference between 
the groups. A more recent RCT ( = 228) similarly found that the 
combination of PE and paroxetine was not superior to either alone 
in achieving remission; PE, however, was superior to paroxetine 
alone in adults with PTSD secondary to a motor vehicle accident 
(MVA) [24]. In contrast, another RCT (n = 37) found combined 
paroxetine and PE to be superior to PE alone in improving PTSD 
symptom severity and in achieving remission [25]. No clear con- 
clusions on the benefits of combination therapy can be drawn until 
larger well-designed RCTs are executed to address this question 
(Table 83.1). 


Second line Third line (reserved for specialist practice | Interventions demonstrating 
and treatment resistance) potential 


PTSD 


Prevention No evidence for any universal 
preventative interventions 
TF-CBT for symptomatic 
individuals or those 
diagnosed with ASD 


ME CBM 
EMDR 


Treatment Other psychotherapies (group 
TF-CBT, non-TF-CBT, NET, BET) 

Drugs for depression (paroxetine, 
fluoxetine, venlafaxine, 
sertraline) 

Combined pharmacotherapy and 


psychotherapy 


Problem-solving therapy Other brief psychotherapies 


Collaborative care approaches 
Hydrocortisone 


Distance-delivered TF-CBT 
VRET 

indfulness-based therapies 
Drug-assisted psychotherapies 
TMS 

Exercise 


Other medications [prazosin, risperidone 
(monotherapy and augmentation), 
olanzapine (monotherapy), topiramate] 


Targeted pharmacotherapy Resiliency training programmes 
indfulness-based therapies 


Exercise 





ASD, acute stress disorder; BET, brief eclectic therapy; EMDR, eye movement desensitization and reprocessing; NET, narrative exposure therapy; PTSD, post-traumatic stress disorder; 
TF-CBT, trauma-focused cognitive behavioural therapy; TMS, transcranial magnetic stimulation; VRET, virtual reality exposure therapy. 
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Treatment selection 


Effects sizes for TF-CBT and EMDR, whether compared to waitlist/ 
TAU or comparator conditions, are significantly larger than those 
found in pharmacotherapy trials, which are generally small [14]. 
Psychotherapy benefits also persisted after therapy completion, 
whereas medication had to be continued to maintain efficacy [14]. 
The effect sizes for drugs, however, are equivalent to those observed 
in depression trials [18]. Based on the available evidence, TF-CBT 
and EMDR are first-line interventions and other psychotherapies 
and psychopharmacological interventions can be considered as 
second-line interventions. Drugs with the best evidence for efficacy 
are fluoxetine, paroxetine, and venlafaxine. Even though the find- 
ings of meta-analyses have been inconsistent, sertraline was equiva- 
lent to venlafaxine in one RCT [14] and can thus be considered as 
well. In individuals with prominent sleep difficulties, prazosin may 
be an option, and atypical antipsychotics, as monotherapy or aug- 
mentation, should rather be reserved for treatment-resistant cases. 

Generally, dropout rates for pharmacotherapy trials have not been 
significantly different from placebo, suggesting that overall psy- 
chopharmacological treatment was well tolerated. Acceptability of 
psychotherapies is of concern, as dropout rates in studies are often 
high, particularly in TF-CBT, which may be related to avoidance of 
trauma-related reminders. A systematic review evaluating patient pref- 
erences regarding treatments for PTSD largely found that individuals 
preferred psychotherapy above medication; they, however, preferred 
medication to receiving no treatment [26]. Medication was generally 
preferred when there was comorbidity (depression, insomnia) and 
when there were practical concerns such as perceived faster effect with 
pharmacotherapy. Predictors of psychotherapy preference were PTSD 
as a primary diagnosis, perceived threat to life, previous psychiatric 
history, traumatic injuries, sexual assault, and female gender. 

There is a lack of evidence to guide treatment selection based 
on trauma type or individual-level characteristics. For both psy- 
chotherapy and pharmacotherapy, studies with women only had 
larger effect sizes and studies with veterans smaller effect sizes [13]. 
Concerns regarding methodological quality and publication bias 
have been raised for both psychotherapeutic and pharmacologic 
interventions [13, 14]. Of note, individuals in psychotherapy trials are 
often on concomitant medications and tend to have had more prior 
treatments than those in pharmacotherapy trials [14]. Another factor 
that influences treatment choice is the availability of various treat- 
ment options. Access to psychotherapy is limited in many settings. 
Perceived barriers to treatment that have been identified include 
stigma, shame, and fear of negative social consequences, low mental 
health literacy, doubts regarding treatments, limited resources and 
time, and apprehension regarding re-experiencing the trauma [27], 
whereas perceived facilitators of treatment include social support 
and acceptance, severity of the disorder, and recognizing a need to 
change, positive experiences with health professionals, and avoiding 
negative social interactions [27]. Individual preference, along with 
current best evidence, should guide treatment selection. 


Special populations 
Children and adolescents 


Meta-analyses have demonstrated evidence favouring psycho- 
logical therapies overall in the treatment of PTSD in children and 


adolescents (C&A) [28, 29]. Psychological interventions studied 
included TF-CBT (including PE), EMDR, narrative therapy, sup- 
portive counselling, psychodynamic psychotherapy, child-centred 
therapy, multi-disciplinary treatment, meditation, and classroom- 
based interventions. Effect sizes overall were larger when compared 
to waitlist/TAU than to active controls. Psychological therapies were 
superior to control conditions in changing PTSD diagnostic status 
and improving PTSD symptom severity up to 1-year follow-up, as 
well as depression and anxiety in the short term. The only individual 
therapy type that was superior to both TAU/waitlist and supportive 
counselling was TF-CBT. A Cochrane review of psychological ther- 
apies for preventing PTSD in trauma-exposed C&A also showed a 
benefit for psychological treatments as a whole in preventing PTSD 
and in reducing PTSD symptoms, when compared to waitlist/TAU/ 
no treatment (NNT = 6.3), for up to a month follow-up [30]. There 
was a small effect demonstrating superiority of TF-CBT over EMDR, 
play therapy, and supportive therapy in improving PTSD symptoms. 

There have been very few trials evaluating pharmacotherapy in the 
management of PTSD in C&A. A recent systematic review of RCTs 
in C&A identified only two pharmacotherapy trials, both evaluating 
sertraline; they could not perform a meta-analysis due to the limited 
number of studies [28]. Another systematic review that evaluated 
psychopharmacology in the management of PTSD in C&A reported 
that in three RCTs of SSRIs (two sertraline, one fluoxetine), there 
was no benefit to SSRI treatment, and in one brief RCT of imipra- 
mine, the latter was superior to chloral hydrate in improving ASD 
symptoms in child burn victims [31]. They also found small open- 
label studies that demonstrated improvements in PTSD symptoms 
when using citalopram, quetiapine, carbamazepine, high- vs low- 
dose valproate semisodium, and the alpha-2 agonists clonidine 
and guanfacine (clonidine improved re-enactment symptoms, and 
guanfacine improved nightmares). However, beta-blockers did not 
prevent PTSD or reduce symptoms severity in an open study. In 
C&A with PTSD, pharmacotherapy is not routinely indicated and 
should be reserved for the management of treatment resistance in 
specialist practice. 


Psychiatric comorbidity 


There is very little evidence to inform the management of PTSD 
with psychiatric comorbidity, although PTSD is highly comorbid 
with depressive, anxiety, and substance use disorders (SUDs). Many 
of the psychotherapies investigated in PTSD improve associated de- 
pression and anxiety symptoms as well. The same drugs with evi- 
dence in PTSD can also be utilized in the management of depressive 
and anxiety disorders because of their broad spectrum of action. 
A Cochrane review of psychological therapies for comorbid SUD 
and PTSD found that TF-CBT was superior to TAU in improving 
symptoms of PTSD and SUDs, but not when compared to active 
comparators [32]. Treatment completion rates were lower in TF-CBT 
groups. Non-TF-CBT-based therapies were not superior to TAU, 
whether delivered in a group/individual format. A meta-analysis 
evaluating integrated treatment for SUD and PTSD, which involves 
simultaneous treatment for both disorders by the same treatment 
team, found that integrated programmes improved PTSD and SUD 
symptoms, but integrated programmes were not superior to non- 
integrated programmes [33]. Thus far, there is no clear evidence of 
specific pharmacotherapy interventions that improve symptoms of 
both PTSD and SUDs in individuals with comorbidity [34]. 
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Emerging interventions 
Alternative psychotherapy delivery modes 


Due to problems with access to evidence-based psychotherapies, 
efforts have been under way to examine alternative methods of 
delivering psychotherapies. Existing therapies have been adapted 
into remotely delivered interventions (Internet, telephone, email, 
and videoconferencing). Meta-analyses have found that Internet- 
delivered CBT was superior to waitlist/TAU in improving PTSD 
symptom severity and depressive and anxiety symptoms, but not 
when compared to active controls [35-37]. Virtual reality exposure 
therapy (VRET) follows the same principles as traditional exposure 
therapies, but the person is exposed to cues related to the trauma in 
a computer-generated virtual reality (VR). In VR, the person can 
interact with the environment, allowing for an immersive experi- 
ence. A systematic review of VRET in PTSD identified ten studies, 
seven of which demonstrated that VRET was superior to waitlist, 
although no difference was found between VRET and traditional ex- 
posure therapies [38]. Although preliminary, these results suggest 
the potential utility of VRET, particularly as VR headsets have be- 
come increasingly available and affordable. 


Mindfulness-based therapies 


Over the last three decades, there has been a proliferation of research 
into mindfulness-based interventions. Mindfulness has its origin in 
Buddhist contemplative teachings, and current therapies have been 
adapted in accordance with medical and scientific practice. The prac- 
tice of mindfulness involves a non-judgemental moment-to-moment 
observation of internal and external experiences, with the aim of 
enhancing awareness. Through sustained practice, mindfulness aims 
to decrease distress by developing greater acceptance of things as they 
are and enhance self-regulation and positive qualities such as com- 
passion and equanimity [39]. Mindfulness has been incorporated 
into various therapeutic approaches such as mindfulness-based stress 
reduction (MBSR), mindfulness-based cognitive therapy (MBCT), 
acceptance and commitment therapy (ACT), and dialectical behav- 
iour therapy (DBT) [39, 40]. MBSR and MBCT are manualized pro- 
grammes delivered in eight weekly group sessions, lasting around 
2 hours, with one full-day session. Both MBSR and MBCT incorp- 
orate mindfulness meditation and yoga practices. MBCT also has 
a cognitive component where participants are instructed to change 
their relationship to their thoughts, rather than to challenge the con- 
tent, as is commonly done in CBT. A recent systematic review of 
mindfulness-based interventions, mostly MBSR, in trauma-exposed 
adults reported that overall studies demonstrated positive effects and 
improvements in PTSD symptoms, particularly avoidance subscales 
[40]. However, the results were based on small studies with variable 
methodological rigour, and further studies of higher methodological 
quality are required before any clear conclusions can be drawn. 
Another meta-analysis reported that meditation-based interventions 
(MBSR, yoga, and mantra repetition programmes) were superior 
to waitlist/TAU in improving PTSD and depressive symptoms [41]. 
A benefit of mindfulness-based interventions is that they are often 
delivered in groups and thus increase accessibility. 


Drug-assisted psychotherapies 


There is an increased interest in the therapeutic potential of 
methylenedioxymethamphetamine (MDMA), commonly known 


by its recreational use name ‘ecstasy, in the treatment of PTSD 
[42]. The psychoactive properties of MDMA were first investi- 
gated in the late 1970s, and prior to it being classified as an illegal 
Schedule 1 drug in the mid-1980s, it was utilized by clinicians as 
an adjuvant to psychotherapy. For more than 20 years, research 
into MDMA was limited to investigations of the risks and poten- 
tial neurotoxicity of MDMA, particularly with regard to its rec- 
reational use. Despite extensive research, the debate on whether 
MDMA is neurotoxic persists. There are, however, known poten- 
tial dangers associated with recreational ‘ecstasy’ use such as de- 
hydration and brain oedema [42]. MDMA-assisted psychotherapy 
(MDMA-AP) typically involves 1-3 drug-assisted psychotherapy 
sessions, lasting 8 hours each, with non-drug psychotherapy 
sessions provided prior to, and following, the MDMA-AP sessions. 
MDMA putatively assists in the treatment of PTSD by decreasing 
anxiety and increasing the individual's ability to trust, and thus 
strengthening the therapeutic alliance, by providing a sense of in- 
creased insight, and by enabling memory retrieval and reconsoli- 
dation [42]. This is partly due to its known physiological effects 
such as increased serotonin, dopamine, noradrenaline, oxytocin, 
and vasopressin release. In a preliminary meta-analysis of two 
RCTs (n = 35), the authors demonstrated large effects sizes for 
PTSD symptoms, favouring MDMA-AP [43]. They also compared 
the effect sizes and dropout rates against those reported in a meta- 
analysis of PE17 and found that the effect sizes were slightly larger 
for MDMA-AP, and that dropout rates were lower in MDMA-AP 
(12.7%) than in PE (27.0%). 

Most of the studies evaluating cognitive enhancers of CBT have 
used d-cycloserine (DCS). DCS is thought to enhance extinction 
learning due to its partial agonism of the N-methyl-D-aspartate 
(NMDA) receptor complex. A recent Cochrane review evaluated 
the efficacy of DCS augmentation of CBT in anxiety disorders, 
including PTSD and included 21 RCTs, five of which were on PTSD 
[44]. Overall, DCS-augmented CBT was not superior to placebo- 
augmented CBT in both adults and C&A. The studies were heter- 
ogenous, and the quality of evidence overall was low. The subgroup 
analysis revealed that DCS-augmented CBT was superior to placebo 
in achieving remission of PTSD at 1-6 months’ follow-up and in 
improving comorbid anxiety symptoms. In C&A, however, placebo- 
augmented CBT was superior to DCS-augmented CBT in improving 
anxiety and depressive symptoms in one study (n = 57). 


Transcranial magnetic stimulation 


Transcranial magnetic stimulation (TMS) is a non-invasive 
neuromodulation modality. Electromagnetic pulses are delivered 
by a TMS stimulator via a coil placed over the scalp. TMS presum- 
ably works by inducing changes in the electrical field in the focal 
brain region being stimulated and through altering function down- 
stream via connected neural circuits. Repetitive TMS (rTMS) is a 
type of TMS usually utilized in treatment protocols and can vary 
according to the stimulation site and the intensity, frequency, dur- 
ation, and sequence of pulses. Low-frequency (LF) rTMS (<1 Hz) is 
generally considered to have inhibitory effects, and high-frequency 
(HF) rTMS (25 Hz) to be excitatory. rTMS is generally considered 
safe when administered according to accepted safety guidelines 
[45]. The most frequently reported adverse effects include head- 
aches and facial pain, and the most severe adverse effect is the in- 
duction of seizures, though the risk is very low. Since the late 1990s, 
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rTMS has been investigated as a potential treatment for PTSD in 
different patient populations, utilizing varying protocols [46]. Due 
to its role in complex cognitive and behavioural functions, the 
dorsolateral prefrontal cortex (DLPFC) has been researched as a 
treatment target in PTSD and several other neuropsychiatric dis- 
orders. A meta-analysis and systematic review evaluating the effi- 
cacy of rTMS to the DLPFC in PTSD identified three RCTs (n = 64) 
of HF and LF rTMS to the right DLPFC, and one RCT evaluating 
HF rTMS to the left DLPFC [47]. All rTMS protocols were admin- 
istered over ten sessions, with a mean total number of 6250 pulses. 
Their meta-analysis demonstrated statistically significant improve- 
ments in PTSD symptoms, anxiety, and depression, favouring ac- 
tive rTMS to the right DLPFC. Dropout rates did not differ between 
active and sham rTMS, although individuals in the active rTMS 
group had more severe PTSD symptoms at baseline. The RCT 
investigating the left DLPFC had similar results. A more recent 
meta-analysis pooled the results of five RCTs (n = 108) for HF rTMS 
and LF rTMS to the right and left DLPFCs, as well as HF deep TMS 
to the bilateral medial prefrontal cortices (mPFC) [48]. They also 
demonstrated an overall benefit favouring active rTMS, and a meta- 
regression identified no specific variables that influenced outcomes. 
Although preliminary, the results suggest that rTMS holds promise 
as a treatment option for PTSD. Further studies with larger sample 
sizes are required to prove efficacy for routine clinical practice, and 
ideal treatment protocol parameters still need to be defined. 


Exercise 


Akin to many other psychiatric disorders, cardiovascular disease 
(CVD) risk is higher in individuals with PTSD. One meta-analysis 
revealed that individuals with PTSD had almost double the risk 
of metabolic syndrome than matched general population controls 
[49]. A recent systematic review and meta-analysis evaluated the 
efficacy of physical activity as a treatment for PTSD [50]. They in- 
cluded four RCTs (n = 200); two trials evaluated active yoga, one 
evaluated an aerobic intervention (stationary cycling) and one 
evaluated a combined resistance and aerobic intervention. The dur- 
ation of the interventions varied between 6 and 12 weeks, with 1-2 
supervised sessions per week. The meta-analysis found that the ex- 
ercise interventions were superior to the comparator interventions 
in improving PTSD and depressive symptoms, with no significant 
adverse effects. There were insufficient data to evaluate the effect on 
anthropometric measurements. Although preliminary, the results 
suggest that exercise interventions hold potential as an adjunct in 
the treatment of PTSD. 


Other interventions 


Biofeedback involves a process whereby an individual can gain volun- 
tary control of a physiological process by receiving electronic moni- 
toring feedback. When the feedback incorporates brain function, 
such as electroencephalography (EEG), it is called neurofeedback. 
A recent systematic review including three studies (one RCT) re- 
ported improvements in PTSD symptoms and associated neurobio- 
logical changes [51]. A meta-analysis of hyperbaric oxygen therapy 
(HOT) (100% oxygen being delivered at an increased atmospheric 
pressure) in traumatic brain injury (TBI) found that although HOT 
led to improved Glasgow coma scale (GCS) scores, there was no ef- 
fect on PTSD severity [52]. Stellate ganglion block, an anaesthetic 
procedure where a local anaesthetic is injected next to the cervical 


sympathetic ganglion, has shown promise as a treatment for PTSD 
in case studies and case series [53]. 

Deep brain stimulation (DBS) of brain regions involved in PTSD 
pathophysiology could be a potential treatment option for refractory 
PTSD. Preclinical studies have demonstrated promising results with 
stimulation of the basolateral amygdala (BLA), ventral striatum, 
hippocampus, and PFC, and one case study reported an improve- 
ment in PTSD symptoms in a veteran treated with BLA DBS [54]. 

Of six studies identified in a systematic review of art therapy in 
traumatized adults, two studies reported significant improvements 
in PTSD-related symptoms and two in anxiety, compared to waitlist/ 
TAU or an art control condition [55]. A recent systematic review 
evaluated animal-assisted therapy in the treatment of trauma- 
exposed individuals; three studies were in adults, and seven in 
C&A. The animals most frequently utilized were dogs, followed by 
horses. Although very preliminary, results overall were promising, 
demonstrating improvements in PTSD symptoms, depression, and 
anxiety [56]. 


Adjustment disorders 


An AD involves a maladaptive response, either emotionally or be- 
haviourally, or both, to an identifiable stressful situation or mul- 
tiple stressors. ADs usually have their onset within 1 [2] to 3 [1] 
months after the onset of a stressful event and usually resolve within 
6 months after the stressor has terminated. The types of stressful 
events that can lead to an AD are broad and can include severe 
traumas or more common events such as financial difficulties, inter- 
personal conflict, illness, or work-related stressors. ADs cannot be 
diagnosed if the person meets criteria for another mental disorder, 
such as major depressive disorder (MDD) or PTSD, and is distin- 
guished from a normal response to a stressful situation by significant 
functional impairments or severe distress, out of proportion to what 
would be accepted culturally. Although ADs are frequently encoun- 
tered and can be a useful clinical diagnostic classification, research 
specifically evaluating ADs is scant. Here we expand the discussion 
by including interventions in non-clinical populations or in subclin- 
ical depression and anxiety. ADs are associated with heightened sui- 
cide risk, and thus suicide risk assessment should form part of the 
routine management of ADs [57, 58]. 


Psychological therapies 


Due to the time-limited nature of ADs, brief therapies are usually 
preferred, unless the stressor is more chronic in nature. Interventions 
can be aimed at reducing or removing the stressor, facilitating adapta- 
tion, or reducing distress and functional impairment [57]. Problem- 
solving-based approaches can assist in directly dealing with the 
stressor. Adaptation to the stressor can be facilitated by approaches 
that reframe the stressor and with practical interventions. The spe- 
cific symptoms of ADs can be addressed by similar psychological 
interventions used to treat depression, anxiety, and stress-related 
disorders such as CBT, relaxation therapies, IPT, and supportive and 
psychodynamic therapies [57]. Resilience-enhancing approaches 
can assist both with reducing the risk of developing ADs as well 
as with the treatment thereof. A Cochrane review evaluated CBT 
and problem-solving therapy (PST) to assist individuals with ADs 
to return to work [59]. PST involves identifying problems and then 
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generating, implementing, and evaluating solutions to the problems. 
There was evidence that PST, but not CBT, had significant effects on 
AD-related symptoms and on the time to return to partial, but not 
full, work. 

A meta-analysis evaluating group CBT in subclinical depression 
reported that group CBT was superior to waitlist, but not to com- 
parator interventions, and did not reduce the risk of incident de- 
pression [60]. Meta-analyses of the efficacy of resiliency training 
programmes in diverse clinical and non-clinical populations found 
these programmes had a moderate effect on improving resilience 
overall, and trauma-focused resilience training programmes, spe- 
cifically, had a moderate effect in reducing stress and depression 
[61, 62]. Meta-analyses of mindfulness-based interventions in di- 
verse psychiatrically or medically ill and non-clinical populations 
have demonstrated improvements in anxiety, depression, stress, and 
quality of life [63, 64]. Meditation-based programmes were, how- 
ever, not superior to active comparators. Self-help-based mindful- 
ness interventions, such as Internet-based therapies and books, have 
also been demonstrated superior to control groups in improving 
stress, depression, anxiety, and mindfulness in meta-analyses 
involving non-clinical populations [65, 66]. Relaxation training, 
such as progressive relaxation, autogenic training, and meditation, 
has also demonstrated efficacy in a meta-analysis of mixed clinical 
and non-clinical samples [67]. 


Pharmacotherapy 


There are very few trials evaluating pharmacotherapy for ADs. 
Intervention that have been studied include BZDs, drugs for de- 
pression, and other anxiolytics. Single studies have found that 
etifoxine, a non-BZD anxiolytic, was superior to lorazepam, while 
others demonstrated no differences between drugs for depres- 
sion (mianserin, tianeptine, viloxazine, trazodone) and BZDs (al- 
prazolam, clorazepate, lormetazepam) and s-adenosyl methionine 
in improving anxiety [57]. Herbal medicines, such as kava-kava, val- 
erian, gingko biloba, and plant extracts, have also demonstrated su- 
periority in improving anxiety, compared to placebo, in one or two 
studies [57, 58, 68]. In practice, drugs for depression are frequently 
prescribed for ADs, despite the available evidence [68]. A meta- 
analysis of their efficacy in subthreshold depression demonstrated 
a small, but significant, effect favouring the SSRIs, fluoxetine, and 
citalopram, but not paroxetine, above placebo [69]. A Cochrane re- 
view also reported that kava was significantly superior to placebo 
in improving anxiety symptoms; however, concerns regarding liver 
toxicity limit the use of kava [70]. 


Exercise 


A recent meta-analysis found that physical activity improved de- 
pression and anxiety in non-clinical adults [71]. Exercise is postu- 
lated to have beneficial effects on anxiety and depressive symptoms 
due to neurophysiological effects such as HPA axis regulation, in- 
creased monoamine levels, and increased neurogenesis in limbic 
structures [72]. Yoga has also been demonstrated to improve ADs 
in a single study [68]. 

Overall, the quality of evidence for treatments in ADs is poor, and 
itis difficult to draw clear conclusions. In general, practical problem- 
focused approaches are supported by the evidence and make clinical 
sense (Box 83.1). 


Box 83.1 Practical approach to the management of PTSD 


General guidelines 

e Routinely screen for a history of trauma exposure and for symptoms 

of PTSD. 

Assess for ongoing trauma and ensure individual safety. 

e A sensitive and empathic approach is required in dealing with 
trauma-exposed individuals. 

e Respect autonomy, and involve the individual in treatment choices 
and in their management plan. 

e Be aware of existing treatment options for PTSD in your community 

of practice, and build a referral database, including organizations that 

provide support to trauma-exposed individuals. 

Follow available treatment guidelines and evidence-based practices. 

Avoid potentially harmful interventions such as debriefing or pre- 

scribing benzodiazepines. 











Following trauma exposure 

e Do not interfere with spontaneous recovery. 

e Provide practical support and assistance. 

e Provide information regarding responses to trauma and when to 
seek help. 

e Encourage utilization of social support structures. 

e Monitor for symptom emergence. 

e If ASD is diagnosed, refer for TF-CBT. 


Management of PTSD 

e Assess for suicidality and other risk behaviours. 

e Confirm that PTSD is present, and exclude other differential 
diagnoses. 
Assess for comorbidity (medical/psychiatric/substance use), and 
manage comorbidity appropriately. 

Refer on to the appropriate level of care, depending on severity and 
complexity. 
e Address potential barriers to treatment. 

Consider risks and benefits of treatment, alongside individual factors, 
when formulating a management plan. 
e Institute first-line treatments, unless specific reasons exist for an alter- 
native treatment option that is not first line (for example, availability, 
individual preference, comorbidity). 
e Monitor adherence, response to treatment, and tolerability, and ad- 
just the management plan accordingly. 
e If first-line treatment has failed, confirm that the diagnosis is correct 
and assess for any complicating factors before instituting second-line 
interventions. 
e Treatment-resistant individuals should ideally be managed in spe- 
cialist practice. 
































ASD, acute stress disorder; EMDR, eye movement desensitization and reprocessing; 
PTSD, post-traumatic stress disorder; TF-CBT, trauma-focused cognitive behavioural 
therapy. 
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Bereavement 


Beverley Raphaelt, Sally Wooding, and Julie Dunsmore 





Bereavement is the complex set of reactions that occur with the 
death of a loved one: the emotions of grief with yearning, angry 
protest, and sadness; the cognitive processes of understanding and 
making meaning of the finality and nature of death; and the social, 
cultural, spiritual, and religious contexts of adaptation. Grief may 
also result from other losses such as health, home, country, and safe 
worlds. There have been investigations into potential neurobio- 
logical substrates, without, as yet, a consensus about the explana- 
tory model. In Mourning and Melancholia, Freud [1] described the 
psychological processes of mourning which involved the gradual 
relinquishment of bonds with the deceased, and how mourning 
differed from melancholia. Lindemann [2] described the symptom- 
atology and management of acute grief in his classic paper on his 
experiences assessing and treating the survivors of a nightclub fire. 
Engel [3] asked, ‘Is grief a disease?’ and concluded in the negative. 
Bowlby’s work on attachment, separation, and loss [4-6] has been 
the most influential in informing research and clinical practice, with 
many studies of both adults and children utilizing such concepts. 
Early research focused chiefly on bereavement following the death 
of a spouse, describing normal, high-risk, and pathological patterns 
of grief [7-9]. There is also a number of excellent reviews of theory 
and research, including those of Stroebe’s group [10, 11]. 








Common phenomena of the grief experience of adults, identified 
through many research studies [12-14], relate to similar domains 
influenced by developmental trajectories through childhood and 
adolescence. Adult studies indicate consistent patterns: numbness 
and disbelief; yearning, angry protest, and ‘searching’ behaviours 
representing separation distress; sadness with reviewing of mem- 
ories of the lost relationship, with a range of associated emotions; 
and progressive acceptance of the death and changed circum- 
stances, sometimes referred to as resolution. Bonanno et al. [14] 
has shown that resilient trajectories, defined by low overall distress, 
are common. Other transient phenomena described by clinicians 
working with bereaved people [15] include: identificatory symp- 
toms, reflecting the deceased’s illness; and a sense of the deceased’s 


"Tt is with regret that we report that Beverley Raphael died on 21 September 2018. 





ongoing presence, at times as though seeing the face, hearing the 
voice, or feeling the touch of the dead person. ‘Yearning’ is con- 
sidered to be the most pathognomonic of these grief phenomena, 
which usually settle over the first year but may continue, triggered 
by anniversaries or specific memories. Older people who have had 
a long relationship with a spouse may continue this relationship in 
their minds for the comfort of ‘talking’ with the person and a need 
for the ongoing closeness [16]. Recent research [17] has modelled 
sequential peaks of the reactive phenomena: disbelief, yearning, 
anger, and depression, which bereaved people more usually describe 
as sadness. Grief may be a precipitant of depression in those with 
pre-existing or bereavement-related vulnerabilities, and the differ- 
entiation of normal and more pathological forms of grief from de- 
pression is important clinically [15, 18]. Intense grief and the peaks 
of distress identified in this chapter do not usually continue beyond 
the first 4-6 months [12, 13]. Continuing ‘acute’ grief beyond this 
time suggests the possibility of a pathological response, as do other 
risk indicators, although some phenomena may continue intermit- 
tently for many years. Comparative studies have demonstrated that 
the intensity of adult grief is likely to be greatest for the death of a 
child, then spouse or partner, then parent [12, 19]. 
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Recent research has examined the neurobiology of grief through 
studies using functional magnetic resonance imaging of grief [20] 
and brain activity in women grieving the break-up of a romantic 
relationship [21]. Workers have attempted to develop a theoret- 
ical model, based on a wide range of relevant data, encompassing a 
‘neurobiopsychosocial’ framework for sadness and loss [22]. Stress 
hormones [23] and psychoimmune function is a further area of re- 
search. A comprehensive model integrating the relevant research 
findings is yet to be established. 


Risk and p 





and out 


Pre-existing vulnerabilities that may influence the course and 
outcome have been reviewed, alongside other risk factors [24]. 
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These include personality vulnerabilities related to relationship 
styles such as avoidant and insecure attachments. Genetic factors 
do not appear to have been directly studied, but it is likely that 
the short allele of the serotonin gene promoter polymorphism 
of SHTTLPR, which influences response to adversity, may con- 
tribute through gene-environment interactions [25]. Prior loss 
and adverse experiences may add vulnerability, for instance mul- 
tiple losses faced by indigenous peoples, with loss of culture, land, 
and loved ones, with multiple premature deaths and separations 
[26]. Separation anxiety in childhood, as well as pre-existing 
psychiatric disorder, family psychiatric disorder, and substance 
abuse, may add to vulnerability. Successful negotiation of earlier 
losses, mature defence styles, and optimism may be protective. 
The nature of the lost relationship has been identified in a number 
of studies as being a significant factor [15, 27]. The special re- 
lationship between parent and child is associated with greater 
vulnerabilities, including an increased risk of psychiatric hospi- 
talization and even death by suicide. Patterns of distress differ by 
gender with stillbirth, neonatal deaths, and sudden infant death 
syndromes, perhaps suggesting different attachment patterns 
[28]. The death of an adolescent child is not infrequently by acci- 
dent, suicide, or risk-taking with illicit drugs, bringing the extra 
complexities of adaptation for the grieving parents. A great deal 
of research has explored the grief associated with the death of a 
partner or spouse, both young and old. High levels of depend- 
ence and ambivalence have been shown to complicate grief and 
to be associated with more difficult bereavement [15, 27], and 
prolonged or complicated grief may be more likely. Family mem- 
bers may have different relationships with the deceased, and thus 
varying patterns and trajectories of grief, which may cloud the 
recognition of children’s and others’ needs. Adults’ loss of an older 
parent appears to be the least distressing, although there is still 
sadness plus the recognition of one’s own mortality. Here, as at any 
age, intense fantasies of reunion with the deceased may indicate a 
risk of suicide, especially for older widowed men. Circumstances 
of the death may influence outcome. When dying is prolonged, 
as in the later stages of a terminal illness, the dying person may 
experience grief over his or her own life and the loved ones who 
will be lost to him, alongside the anticipatory grieving of family 
members. While palliative care systems may provide bereavement 
programmes, families have complex dynamics and may require 
family-focused interventions [29]. Sudden unexpected deaths 
bring an extra level of emotional shock [30], especially if also un- 
timely, as with children’s death. When violent death occurs, as 
with homicide or the mass violence of terrorism or war, those be- 
reaved may experience a complex mixture of traumatic stress re- 
actions and grief reactions, sometimes called traumatic grief [31]. 
The specific issues facing those bereaved by violent deaths of loved 
ones have been reported in a recent volume by Rynearson [32], 
which deals with homicide, terrorism, and other violent deaths. 
The prolonged and difficult grief in such circumstances is high- 
lighted by findings from September 11, Oklahoma, and Bali bomb- 
ings. When people are missing, believed dead, the uncertainty and 
other stressors, including complex legal and evidentiary processes 
(for example, Disaster Victim Identification requirements), may 
lead to alternating hope and dread. When there are no remains, it 
will be more difficult for those bereaved to accept the reality of the 
death. Seeing and ‘saying goodbye’ to the dead person have been 


shown to help those bereaved in disasters. If remains are much 
disfigured, as with burns, it is important that those bereaved are 
supported in their choice about this. Social support, particularly 
the perceptions of the supportiveness of the family and social 
network, are likely to be protective and assist the bereaved psy- 
chologically [33], while perceptions of unhelpfulness may be as- 
sociated with more negative outcomes [9]. Cultural requirements 
for social support may differ, as may the delineation of the period 
of mourning, the roles of the bereaved, and associated spiritual 
and religious needs [34]. Multiple other adversities may occur, ei- 
ther coincidentally or as a consequence of the circumstances and 
the loss of the person, for instance financial difficulties, loss of 
resources, changed status, loss of meaning and identity, or other 
profound stressors of illness, injury, or other bereavements. Such 
additional stressors may increase vulnerability [15]. In terms of 
prolonged grief disorder (previously known as complicated grief 
disorder, and initially traumatic grief), Prigerson et al. [35] have 
carried out extensive research to refine this syndrome. Bringing 
together the views of international researchers, they have devel- 
oped consensus criteria for a distinct psychiatric disorder to be 
considered for inclusion in DSM-5. This definition requires that 
the reaction to loss encompasses one of three symptoms of sep- 
aration distress (for example, yearning) and a minimum of five 
from a total of nine other symptoms, experienced at least daily, to 
a distressing or disruptive level. These include: shock; emotional 
numbing; avoidance of the reality of the loss; difficulty accepting 
the loss; feelings of meaninglessness; difficulty moving on with 
life; bitterness over the loss; mistrust; and a diminished sense of 
self. Such symptoms would need to last at least 6 months and be 
associated with significant levels of functional impairment. These 
findings fit well with earlier research identifying more chronic 
patterns of grief [12, 13]. 


Physical and mental health consequences 


of bereavement 





A recent valuable review [36] outlined the evidence of increased 
mortality for bereaved spouses, particularly males, which is most 
pronounced in the first 6 months and includes a range of conditions 
such as heart disease, leading to death from a ‘broken heart’ It is also 
more pronounced for those younger. Death of a child is associated 
with an even greater mortality risk, particularly for mothers. Suicide 
is one of the heightened risks, especially for mothers and older 
males. Physical health impairments are also found [36], with a var- 
iety of physical symptoms, as well as greater use of medical services 
and medications. Further research is needed to clarify the nature of 
any increased rate of specific diseases. Changed health behaviours, 
the impact of loss of a health-supporting partner, functional or 
social changes, or shared environments of risk may contribute. With 
regard to mental health, there may be an increased level of anxiety 
and depressive symptoms. There may be a heightened risk for some 
bereaved individuals for exacerbations of pre-existing conditions or 
the precipitation of new illnesses, including post-traumatic stress 
disorder when there is a violent death [32]. Other anxiety disorders, 
major depression, substance use disorder, and bipolar disorder may 
be precipitated by bereavement. Complicated or prolonged grief dis- 
order may also represent a psychiatric consequence [35]. 





Assessment and management 


Most bereaved people do not require counselling, so assessment 
must be a basis for intervention. Assessment should be simple and 
synchronous with need and do no harm, addressing the death and its 
circumstances, the relationship, and the bereaved’s experience since, 
including social support [31]. A more structured format, potentially 
including grief measures [37], can clarify the presence of PTSD, 
depression, or prolonged grief, or other health needs, including 
physical health changes or problems, establishing the basis for inter- 
vention. Initial management of acutely bereaved persons requires 
empathic, compassionate support, and responding to any acute 
needs in ways that are protective of their mental health, recognizing 
the ‘rollercoaster’ of emotions that may occur, and facilitating nat- 
ural resilience. Concepts of psychological first aid are also valuable 
in the immediate period after the death [38]. Dealing with concerns 
about the deceased's suffering and support to view the dead person's 
remains, should the bereaved choose to, are likely to be helpful. Most 
evidence-based interventions focus on psychotherapeutic methods, 
ranging from preventive counselling of those with demonstrated 
heightened risk [39] to self-help guided interventions [40], inter- 
personal psychotherapy modifications for traumatic grief [41], 
integrated cognitive behavioural therapy models [42], or psycho- 
dynamically informed models [43] and web-based treatments [44]. 
Counselling models [45] and psychoeducation have also focused on 
those bereaved through specific deaths such as those of infants [46]. 
Other models deal with grief work and tasks [47], as well as specific 
treatment for morbidity of complicated grief [48] and depressive or 
anxiety disorders, including the use of pharmacotherapy for such 
conditions when indicated [49]. Rynearson’s [32] work with ‘restora- 
tive retelling’ following violent deaths emphasizes the narrative story 
which is central to much bereavement counselling and testimony. 
A practical approach to assessment and counselling may be initi- 
ated with some gentle queries such as: ‘Can you tell me a little about 
your loss?, What happened with ‘John’s’ death? ‘Can you tell me 
about Johr and your relationship?’ ‘What’s been happening since?’ 
If there is intense continuing distress, circumstances of death which 
are untimely or traumatic, a complex relationship with the deceased, 
disruption of family functioning which is impacting on the needs of 
children, inadequacies of social support, ‘unresolved’ earlier losses, 
and multiple additional stressors, the bereaved may be at heightened 
risk of adverse outcomes. Preventive counselling, which facilitates 
grieving, is attuned to the bereaved persons readiness, vulnerabil- 
ities, and strengths, and helps them to tell the story of their experi- 
ences and the person they have lost, is likely to improve outcomes 
[39]. If assessment indicates that psychiatric disorders have arisen, 
for instance depression, post-traumatic stress disorder, anxiety, or 
substance use disorders, these conditions should be managed ap- 
propriately, alongside counselling for the bereavement, should this 
be required. The use of antidepressants or other medication is not 
indicated for bereavement itself but may be appropriate for such 
complications [49]. Monitoring for suicide risk should accompany 
clinical management. 

Prolonged or complicated grief may benefit from cognitive be- 
havioural therapy interventions, as well as relevant rehabilitation. 
Those who are both traumatized and bereaved may need the trau- 
matic stress issues to be dealt with first, and facilitation of grieving 
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following this [43]. There is a need for more comprehensive research 
and evaluation of prevention, early intervention and treatment mo- 
dalities, and their appropriate provision to individuals, families, or 
groups [50]. Culturally appropriate models of support and inter- 
vention also need to be further developed. Many bereaved people 
present first within primary care settings and to their general practi- 
tioners who will need the skills and knowledge to deal with their dis- 
tress, assess their needs, counsel them as appropriate, and refer when 
necessary. Much support also comes from community and non- 
government agencies, including bereavement focused groups for 
specific losses or for grief generally, and from specialized services in 
public or private mental health sectors. Telling the bereaved person 
how to grieve, that they should ‘forget about the past; and that ‘time 
heals all’ is usually perceived as unhelpful. Treating grief as a disease, 
for example antidepressant medication for normal sadness, is seen 
by many bereaved people as interfering with their capacity to grieve 
for their loved one. Counselling bereaved people requires hopeful, 
compassionate psychotherapeutic intervention, which recognizes 
the human suffering involved, validates the person’s strengths, and 
respects their spiritual needs. Loss is a central issue for all of us, both 
our fears of it and its reality. Counselling requires those involved 
to recognize their own sensitivities in this regard and to assist the 
‘journey’ of those affected in dealing with their loss. Most people 
grieve, remember with love those whom they have lost, and continue 
to love and love anew. 


FURTHER INFORMATION 


Kellehear, A. (2007). A Social History of Dying. Cambridge University 
Press, Cambridge. 

National Centre for Childhood Grief. http://www.childhoodgrief. 
org.au/ 
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The concept of repressed memory has been around for at least 
a century. It is the notion that, as a defence mechanism, horribly 
traumatic experiences can be involuntarily banished into the uncon- 
scious, even for decades, yet at some later point return to awareness, 
unchanged, vivid, and accurate—protected from the ravages of time 
and immune to the normal laws of memory functioning and decay. 

The idea of repression is everywhere. Characters in television 
shows, movies, and books repress memories of their dark pasts and 
recover them with the help from fictional therapists—using memory 
recovery tools such as hypnosis, guided imagery, or age regression. 
Or memories return in fragments until the full story bursts into 
memory in time to solve a crime, to prompt a long awaited reck- 
oning, or to allow recovery from psychological problems. And na- 
tional media discuss claims involving repressed memories of satanic 
abuse, witnessing murder, alien abduction, and other fantastical 
events, some disclosed by celebrities. Psychologists are exposed to 
the concept of repression in introductory psychology classes and 
throughout their careers. Moreover, the police use memory recovery 
techniques with witnesses and suspects. And innocent suspects con- 
fess to crimes they did not commit and did not remember when the 
police convinced them they ‘repressed’ the memory of committing 
the heinous act [1, 2]. 

Though the idea of repression and memory recovery had been 
part of cultural and professional zeitgeists for decades, controversy 
regarding the validity of so-called ‘recovered memories’ broke out in 
force during the late 1980s and early 1990s. Cases in which amemory 
allegedly ‘repressed’ for years, even decades, was ‘recovered’ fully 
intact and in detail formed the basis of widely publicized criminal 
charges ranging from sexual abuse to murder. Families were des- 
troyed; personal careers, reputations, and liberty were lost, and mil- 
lions were spent on litigation of the claims. 

Controversy regarding the reality of these claims was fuelled by 
several forces. For decades after Freud repudiated his initial view 
that many psychological problems were caused by child sexual abuse 
(CSA) by family members and theorized instead that the many 
accounts he heard in therapy were fantasies, CSA was widely con- 
sidered to be rare and lacking in severe consequences. Changes to 
these assumptions began during the 1960s and 1970s among profes- 
sionals and feminist writers. At the same time, there were a number 





of widely publicized cases of alleged sexual abuse involving multiple 
children in day care centres across the country. In this context, new 
data were emerging that suggested that CSA had been occurring 
more frequently than previously assumed. Together, these forces 
significantly increased scholarly research on the topic (for reviews 
of this history, see [3, 4]). 

Feminists adopted the view that CSA was a violent crime of 
rape, not simply an inappropriate sexual act. This encouraged the 
trauma perspective on CSA, emphasizing the severity of short- and 
long-term effects of abuse that took hold during the late 1970s and 
1980s. Dozens of books (many by victims), news articles, movies, 
and television documentaries increased public awareness of the 
frequency of CSA (and specifically incest) and dramatically por- 
trayed the proposed traumatic nature of abuse and the severity of its 
consequences [4]. 

Among such reports was that of actress Roseanne Barr in 1991 [5]. 
She appeared on the cover of People magazine, claiming recent re- 
covery of memories of abuse by her mother in infancy and later her 
father, who continued to abuse her until she was 17. Also in People 
magazine that year was the story of former Miss America Marilyn 
Van Derbur, who claimed to have recovered repressed memories of 
abuse by her father [6]. Ms. Magazine, perhaps the most prominent 
feminist publication of the time, published an issue with the cover 
‘BELIEVE IT! Cult Ritual Abuse Exists (January/February, 1993). 
The author described her mother as a Satanist and alleged activities 
such as witnessing multiple murders, cannibalization of her baby 
sister, and others. 

A number of self-help books focusing on the link between 
sexual abuse and repression were published during this time frame, 
claiming that sexual abuse was more common than recognized, that 
it is traumatic and therefore memories of abuse are often repressed, 
that these repressed memories are the source of many psychological 
problems (including multiple personality disorder), and that re- 
pressed memories can be brought back into consciousness, with 
the result that the psychological problems will resolve. Prominent 
among these were The Courage to Heal [7], Secret Survivors [8], and 
Repressed Memories: A Journey to Recovery From Sexual Abuse [9]. 

Collectively, these influences encouraged many to self-diagnose 
repressed memories and/or to seek the help of therapists in re- 
covering them, and led some therapists to focus on the possibility 
that repressed memories of sexual abuse were responsible for their 


patients’ problems. Some even began to describe themselves as ‘re- 
covered memory’ therapists. But the same reports and writings that, 
in some quarters, encouraged belief in repression and the role of 
sexual abuse in causing it fuelled disbelief and criticism in others. 
In part, disbelief stemmed from incompatibility of claims regarding 
trauma and repression with theoretical and empirical work on 
memory. And, in part, it stemmed from the very nature of many re- 
ported recovered memories. Both children’s reports of day care abuse 
and adults’ reports of their recovered memories were peppered with 
fantastical allegations, not all relating to sexual abuse. Children re- 
ported activities such as riding on flying cows, being forced to eat 
faeces, being flushed down toilets, or killing and burying large ani- 
mals. Adults reported recovered memories of Satanic ritual abuse, 
forced impregnation by Satan, ritual human sacrifice, alien abduc- 
tion, multiple past lives, or witnessing murder. Some claims were 
patently impossible (such as riding flying cows, being flushed down 
toilets). Many were regarded by most as impossible (such as alien 
abduction, forced sex and breeding with aliens, multiple past lives, 
impregnation by Satan, memory for events within the scope of in- 
fantile amnesia, and others) [10]. For critics, the question became 
one of how such clearly false memories had been created. 

The ensuing widespread debate has left us with two clear prac- 
tical problems without clear solutions. Firstly, how should therap- 
ists deal with the issue of sexual abuse and the potential existence of 
repressed memories underlying patients’ problems? Secondly, how 
should claims of recovered memories be treated in legal contexts? 

In sections to follow, we focus on a question informing both 
issues. How can false memories of such dramatic autobiographical 
events as sexual and/or satanic abuse be created? We explicate the 
repression hypothesis and point readers to excellent reviews of the 
positions of supporters and critics and the evidence offered in sup- 
port of each position. We then turn to the false memory perspec- 
tive, focusing on empirical literature demonstrating the processes 
and procedures through which false memories for autobiographical 
events are formed. We end with commentary on implications of this 
literature for therapeutic practices and for the legal system. 


The repression hypothesis 


The repression hypothesis offers the seemingly counterintuitive 
proposition that the more traumatic an event at the time it occurs, 
the more likely the memory of the event is to be repressed, banished 
from consciousness, and unresponsive to attempts to retrieve it until 
it is psychologically ‘safe’ to again become aware of it (for example, 
[11-14]). This view is rooted in Charcot, Janet, and Freud’s theories 
of the role of sexual abuse in prompting repression (for a review of 
such theories, see [15]). Central to these theories is the idea that 
victims may be completely unaware of the traumatic event during 
repression, yet it will nevertheless cause psychological problems, 
acting ‘like a foreign body which long after its entry must continue 
to be regarded as an agent that is still at work [11, p. 6]. 

Though Freud eventually abandoned his initial theory implicating 
repressed memories of CSA as the foundation of psychological 
problems, modern repression theorists have adopted and ex- 
panded them (for example, [13, 16]). One variant of the repression 
perspective—the betrayal trauma theory [14]—entails predictions 
that are even more counterintuitive than the basic proposition that 
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trauma promotes repression. For example, it asserts that sexual 
abuse is more likely to be forgotten ifit involves greater betrayal such 
as when committed by an adult on whom one depends (such as a 
father or other close family member). The betrayal trauma theory 
also asserts that memory for other (particularly emotional) types of 
events is facilitated by repetition. But traumatic amnesia or repres- 
sion is assumed to be more, rather than less, likely with repetition of 
traumatic events such as abuse. 

The original Freudian notion of repression and variants, such as 
the betrayal trauma theory, have directly promoted the recovered 
memory movement among therapists. The assumption was that if 
the repressed memories could be ‘recovered’, processed, and inte- 
grated into the patient’s life narrative, the patient’s problems would 
improve. This led therapists to adopt suggestive techniques to un- 
cover memories of the offending events. Principal among these 
was the explicit suggestion to patients that their symptoms reflect 
sexual abuse and that failure to remember that abuse reflects repres- 
sion of the memories. Secondary to this was the use of a variety of 
suggestive memory recovery techniques such as hypnosis, guided 
imagery, dream interpretation, participation in ‘survivor’ support 
groups, and others (for reviews, see [17, 18]). 


Though the repression hypothesis has enjoyed widespread support 
in the therapeutic community, this has been much less true within 
the scientific community, particularly among memory scientists and 
many clinical scientists. Many criticisms address the incompatibility 
of the repression hypothesis with theory and research on memory. 
Inherent to the notion of repression is the idea that all stages of 
memory operate differently and through different mechanisms for 
traumatic material. While many on both sides of the controversy 
would agree that there are differences between the operation of 
memory for traumatic, highly emotional, and more neutral events, 
controversy centres on the nature of those differences. 

For critics, the view is that more traumatic or emotional mem- 
ories are more likely to be remembered, to be highly accessible, and 
to be remembered through normal processes of retrieval—though 
some argue for greater equivalence between traumatic and non- 
traumatic material. For repression theorists, the view is generally 
that traumatic material is more likely to be forgotten (repressed), to 
be inaccessible, and to be resistant to recall through normal mech- 
anisms such as salient cues or reminders or efforts to recall (thereby 
necessitating the extraordinary efforts to retrieve the memories). 
Moreover, it is assumed that memory for traumatic material does 
not decay or become distorted in the normal manner, such that even 
when retrieved after decades of disuse, such memories will be ac- 
curate and detailed (for explication and reviews, see [19]). 

Secondly, disagreement surrounds whether, and by what methods, 
each side’s claims can be validly tested. Each has offered methodo- 
logical criticisms of the efforts of the other. Though there have been 
some attempts to provide laboratory demonstrations of processes 
related to repression (such as memory suppression or blocking) (for 
reviews, see [20, 21]), repression theorists have largely attempted to 
demonstrate the reality of repression for traumatic events such as 
CSA or to test the conditions under which it is most likely to occur 
with real-life alleged victims. Critics have raised such concerns 
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as: (1) verification of alleged abuse (that is, of the same actions en- 
tailed in the recovered memories); (2) verification of a period of in- 
ability to remember the abuse (vs failure to think about the abuse or 
episodes of remembering that are themselves forgotten); (3) verifi- 
cation of repression as the mechanism of failure to remember (vs 
normal mechanisms of forgetting); or (4) verification that even if 
an episode seemed inaccessible to memory for some period, the 
episode was indeed traumatic in nature. The latter concern is im- 
portant, in that studies showing some report that they had been 
abused, but had not remembered it for long periods of time, are not 
relevant if the unremembered event was not traumatic. Essentially, 
critics argue that these concerns indicate that the hypothesis has not 
been adequately tested and arguably may be untestable (for reviews 
of these concerns, see [4, 18, 22]). 

On the other hand, proponents of the repression hypothesis have 
argued that laboratory demonstrations of techniques and processes 
promoting the development of false memories are not relevant to the 
kinds of traumatic events that might trigger repression. Ethical con- 
straints on laboratory research prevent either exposure to traumatic 
events or efforts to implant memories of such events. Moreover, 
though many real-life patients who have claimed to have recovered 
false memories of abuse through therapy have later recanted and 
claimed the new memories were false, repression proponents claim 
that these recantings are themselves subject to doubt (for example, 
they might be the product of social influence to recant), and not evi- 
dence of false recovered memories (for a discussion of retractor ex- 
periences, see [23]). 

These debates continue. Nevertheless, we believe that the reality of 
false memories of highly traumatic autobiographical events has been 
convincingly demonstrated, if nothing else by the fantastical nature 
of many purportedly recovered memories, such as those for sex, 
childbearing, and infanticide with Satan; alien abduction and im- 
pregnation; impossible memories from beyond the veil of infantile 
amnesia; past lives; and others. Moreover, these false memories were 
developed through normal processes of memory distortion. Next, 
we discuss how this occurs. 





If this memory is not real, where did it come from? 


Of issues relevant to the controversy, questions of whether and how 
false memories can be created are, in some respects, the easiest to 
answer. To understand this, it is important to begin with the concept 
of memory itself. 


The nature of memory 


Many laypersons think of memory as akin to an enhanced movie 
consisting of sensory images replaying the event in question. 
Memory science has revealed that instead memory consists of at 
least sensory images, beliefs (the story we tell ourselves about what 
happened), and information relevant to the plausibility of those 
beliefs. A memory judgement criterion is then applied to these 
narrative cognitions, assessing whether the ‘memory’ is or is not 
a real memory for a real event. Some, for example, may require of 
themselves clear detailed sensory images to decide something is a 
memory. At the other extreme, a person may be convinced that a 
bodily sensation is an embodied memory of sexual abuse (for a re- 
view, see [24]). 


Given that a real event did occur, each memory component can 
change over time. But each can also be fully created independent of 
the occurrence of a real event. We review the evidence of whether 
and how this happens in the sections to follow. We present evidence 
that false autobiographical memories can be fabricated, even for 
relatively extreme events and even for events that would seem rather 
implausible. 


Evidence that extreme false memories do occur 


False autobiographical memories can develop for events ranging 
from the relatively mundane (for example, school performance) to 
the seemingly impossible (for example, having been abducted by 
aliens, having sex with the devil). Many find it implausible that a 
person could falsely remember ‘extreme’ experiences such as alien 
abduction, satanic ritual abuse, sexual abuse by one’s father, or 
having witnessed or committed a horrific crime. 


Real-life examples 


Nevertheless, real-life examples of each of the aforementioned ex- 
treme false memories have been documented in the scientific lit- 
erature: witnessing murder [25, 26], alien abduction [27], enduring 
the painful surgical removal of one’s clitoris, sexual abuse (including 
satanic ritual abuse [18, 25]), witnessing dramatic fatal accidents 
[23, 28], and committing a crime that one did not actually commit 
[1, 2, 29]. 


Laboratory studies 


Research demonstrating that false memories for events that never 
happened can be easily planted is commonplace in psychology. 
Many such demonstrations involve planting false memories for 
relatively trivial events such as hearing sounds or words, seeing 
various (actually unseen) objects, saying things one did not actu- 
ally say, and many more [30]. False memories can be developed 
in people of normal cognitive function and lacking any mental 
illness or abnormality. Even those exceptional people who pur- 
portedly have near-perfect autobiographical memory (who can 
remember every day of their lives) have been led to develop false 
memories [31]. 

Though laboratory procedures for planting such false memories 
may seem inadequate to provoke the relatively extreme false mem- 
ories of concern here, it is important to note that they involve similar 
processes. Countless demonstrations of schema-consistent memory 
intrusions illustrate the tendency to develop false memories based 
on what makes sense, given other relevant knowledge of similar situ- 
ations, such as to falsely remember seeing books in a professor's of- 
fice that had none (for example, [32]). 

Plausibility and inference are also implicated in the develop- 
ment of dramatic false autobiographical memories. But perhaps the 
most relevant research has been experiments demonstrating that 
procedures leading to relatively mundane false memories (such as 
establishing plausibility, active imagination, suggestive interviewing, 
etc.) can also lead to the development of rich (realistic sensory detail) 
false autobiographical memories in a subset of study participants. 

One paradigm for planting false autobiographical memories 
is the ‘familial informant false narrative procedure’ or simply the 
‘lost-in-the-mall’ technique [33, 34]. Researchers first solicit the co- 
operation of a ‘familial informant (such as a parent) to ensure that a 
target event did not actually happen to the participant. Participants 


are told the family member had told researchers about several events 
that happened to the participant as a child. Some are true, but the 
target event is false. The participant may be interviewed several 
times about the target event, using suggestive procedures (some- 
times including guided imagination). In a final session, participants 
report on, and describe, any memories they have developed of the 
target event. 

Across many studies, an average of approximately 30% of subjects 
have developed partial or complete false memories (for reviews, see 
[33, 35]), varying from 0% (when an odd variation was used to at- 
tempt to plant a relatively implausible event memory) to more than 
50% for more mundane events (a ride in a hot air balloon). More 
developed false memories either with imagination procedures in- 
cluded or with additional ‘evidence included (such as a doctored 
photo). Some develop false memories right away, whereas others 
begin with little memory but, after several suggestive interviews, 
begin to recall false events in great detail. While researchers have 
not attempted to plant memories of abuse, they have planted mem- 
ories for highly unpleasant events such as hospitalizations, medical 
procedures, near drowning, or vicious animal attacks. They have 
also planted highly implausible memories for both mundane (for 
example, rubbing chalk on one’s head or kissing a plastic frog) and 
strange or dramatic events (witnessing demonic possession as a 
child) (for reviews, see [17, 36]), and for personally committing spe- 
cific aggressive acts [37] or crimes [38]. 

Notably, such false memories are consequential. Participants led 
to develop false beliefs or memories of being made sick by particular 
foods in the past avoided eating the food in follow-up assessments of 
a week to a month later (for example, [39, 40]). 


The path to false memory 


Memory is essentially a narrative story of the event, combined with 
sensory images. A memory judgement criteria is applied to assess 
the likelihood that the ‘memory’ is a true memory of a real event. In 
part, the question of whether the ‘memory’ is accurate is evaluated 
in light of other knowledge. In part, it is evaluated in light of other 
criteria one may use to decide the reality of the memory, such as the 
clarity of the sensory images or the way it was remembered (for ex- 
ample, in a dream or under hypnosis). Thus, to consider how false 
memories might be constructed, we must consider how the narra- 
tive or images may be constructed or altered and how one’s other 
relevant knowledge might be altered in a way favouring the validity 
of a new narrative, as well as how memory judgement criteria can 
be altered. 


Developing plausibility as the foundation 


for belief in the narrative 





Memory is primarily semantic—the story we tell ourselves about 
what happened, accompanied by sensory images that become in- 
creasingly vague over time. This story is informed by other relevant 
knowledge suggesting that the story does or does not make sense. 
This knowledge can affect our interpretations of events at the time 
they happen and also cause them to change over time. 

It is easy to see how our general knowledge and beliefs can affect 
the immediate interpretation of experiences. For example, shadows 
in one’s bedroom at night might be interpreted as an alien visitation 
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by those who believe that aliens visit the earth and make such per- 
sonal visits. For those who do not possess such beliefs, the shadows 
are more likely to be interpreted as created by wind in the trees out- 
side one’s window. 

But in a related way, they direct the development of false mem- 
ories. The formation of false memories entails the construction of 
a narrative that is plausible in light of one’s other knowledge and 
beliefs, including beliefs about oneself, others involved, and the way 
the world works. Thus, it is important to understand how a narrative 
involving seemingly impossible or improbable events can become 
plausible. Without such an element of plausibility, the person is un- 
likely to believe the false narrative and thus will not consider it a 
memory. 

Regarding sexual abuse, this can happen via multiple pathways. It 
can begin with psychological issues leading the person to search for 
explanations and solutions. In the course of this search, the person 
may be exposed to information that increases the plausibility of: (1) 
repression itself; (2) sexual abuse as a cause of repression; (3) sexual 
abuse as a cause of symptoms such as the person is experiencing; 
and (4) the possibility that one could have personally experienced 
abuse (or that a specific person could have committed it). This can 
happen via exposure to media accounts of repressed and recovered 
memories and through exposure to self-help literature, accounts of 
friends or family, among many other sources. 

Cultural acceptance of the trauma narrative, the idea of repres- 
sion, and the link between sexual abuse and many symptoms can 
lend plausibility to the ideas that one may have personally been sexu- 
ally abused (even in the absence of any memory of it) and that this 
unremembered abuse is causing most psychological/behavioural 
symptoms for which the person is most likely to seek psychotherapy 
or self-help advice. If the person sees a therapist, the therapist may 
further magnify the plausibility of repression and abuse narratives 
by actively discussing trauma and repression and/or alleged links 
between specific symptoms and sexual abuse or by explaining why 
he or she believes the patient was abused. Once beliefs linking sexual 
abuse, trauma, and repression are adopted, this can facilitate the for- 
mation of false memories of abuse via several mechanisms. 


Implications of fuzzy trace theory 


‘Fuzzy trace theory’ (for example, [24]) explicates how general 
knowledge and beliefs can distort memories or create false mem- 
ories. When one thinks about an event, both event-related images 
and beliefs and our general relevant knowledge and beliefs about 
such types of events and people are activated in memory. When clear 
verbatim sensory images (like an enhanced video) are not available 
to distinguish what actually happened from other information ac- 
tivated in memory, the person may misattribute the additional ac- 
tivated information from his or her general knowledge store as part 
of the event in question. Essentially, expectations based on our add- 
itional relevant knowledge and beliefs can result in false memories 
or false details, consistent with those expectations (for a review of 
schematic effects, see [32]). 

Schemas for specific persons (oneself or others involved in the 
event in question) will include expectations of behaviour—how the 
person thinks he or she is likely to behave, as well as expectations 
for others’ behaviour. ‘Negative stereotyping of a person suspected 
of CSA is commonly viewed as contributing to false memories of 
sexual abuse and refers to conveying information about the accused, 








SECTION 13 Trauma- and stress-related and adjustment disorders 


characterizing him as the ‘type of person likely to commit abuse spe- 
cifically or other damaging or aggressive acts (for example, [41]). 
Beliefs that sexual abuse tends to cause specific physical or psycho- 
logical effects will also tend to make the belief that a person experi- 
encing those symptoms was indeed abused more plausible. 

It is easier to create false beliefs and memories if the event it- 
self is viewed as more plausible. False memories for actions have 
been more easily planted in experimental subjects if the person 
found it plausible that he or she would have committed such an 
act (such as when a person who often behaves aggressively finds 
it plausible that he or she would have committed another aggres- 
sive act (for example, [37]). It is also easier to plant memories for 
things others have done if those actions are seen as plausible for the 
other person. Whereas it may be more difficult to plant implausible 
memories in many circumstances, implausibility itself is malleable 
[42]). For example, Mazzoni et al. [43] first exposed some partici- 
pants to material designed to enhance the plausibility of demon 
possession and later attempted to plant memories of having wit- 
nessed such an event. Those exposed to the plausibility-enhancing 
manipulation later reported greater likelihood of having witnessed 
demon possession. 

Many things can happen to increase the plausibility of an event. 
Among these is exposure to information that the type of behav- 
iour or event in question happens relatively frequently (vs rarely) 
(for example, [44]) or repeatedly imagining the event in question 
(for example, [45]). Regarding sexual abuse, many popular books 
claim a high prevalence of sexual abuse and offer lists of symptoms 
purportedly characteristic of abuse survivors (including failure to 
remember the abuse) that are so inclusive as to encourage almost 
anyone with problems to attribute them to abuse. E. Sue Blume, 
in Secret Survivors [8], lists the most common problems for which 
therapy is sought: depression, addiction, phobias, sexual issues, self- 
destructiveness, obsessions, and countless others. It includes many 
opposites such as too much or too little eating, sex, sleeping, etc. It 
even includes failure to remember abuse as a symptom of abuse, as- 
serting that failure to remember abuse is not evidence that one has 
not been abused. Such claims and lists of abuse symptoms in the sur- 
vivor literature at the least make plausible that virtually any problem 
can be caused by sexual abuse. 

Perhaps reflecting the influence of widespread depictions of abuse 
and repression, Rubin and Boals [46] found a relationship between 
self-rated likelihood of entering therapy and beliefs that one had ex- 
perienced (and could not now remember) childhood trauma and 
abuse. Notably, such patients may influence their therapists to focus 
upon abuse, creating a bi-directional cycle of suggestion [17]. 

At some point, the person may become fully convinced that 
sexual abuse did occur and even develop false memories of that 
abuse. Kihlstrom [47] described a condition in which a person's 
identity, lifestyle, and interpersonal relationships are centred around 
an objectively false memory of traumatic experience that the person 
strongly believes is true. This ‘false memory syndrome’—or the def- 
inition of one’s identity as an abuse survivor—becomes a central 
aspect of relevant knowledge that serves as context for interpretation 
and evaluation of new information as plausible or not and as the 
schematic basis of memory intrusions [17]. 

Finally, related to the issue of plausibility, it is important to note 
that it is easier to convince a person that they must have done or 
experienced something that would otherwise seem implausible 


(or to otherwise alter beliefs) if they doubt their own memories—a 
phenomenon Gudjonsson has dubbed ‘memory distrust. This phe- 
nomenon has been implicated in many cases of internalized false 
confession (that is, a confession where the person comes to believe 
falsely that they did commit the crime) [1, 2]. Therapeutic clients 
may, for various reasons, doubt the reliability of their memories and 
therefore be more easily convinced of things they do not initially 
remember. 


The role of belief: the transition from ‘it could have 





happened’ to ʻit did happen’ 


Many false memories develop through persuasion that an event 
happened or happened in a particular way (for example, [17, 
48, 49]). In practice, there are many pathways through which be- 
liefs regarding sexual abuse or other seemingly implausible events 
can be created or altered. These include conversations with other 
witnesses, police, therapists, and other interviewers, exposure to 
media accounts, and exposure to ‘evidence that the event occurred 
in a particular way or was committed by a particular person. 
Internal processes are also important, such as attempts to reason 
about what must have happened or attempts to understand the 
‘evidence’ the person is aware of. Some may interpret dreams as re- 
flecting reality. Essentially, any source of information or argument 
can convince the person that a particular event, or a version of that 
event, is true. 

Loftus and Davis [17] describe influence processes in therapy that 
can convince a client he or she must have been abused. This may 
happen at first in the absence of ‘memory’ of abuse. The therapist 
may tell clients that their symptoms suggest abuse, offer information 
and arguments to support the assertion, refuse to accept denials, and 
persist in asserting they were abused until the clients agree. These as- 
sertions can be supplemented by a variety of therapeutic procedures 
that appear to offer evidence that clients were abused. 

Evidence of many kinds can lead people to falsely believe they ex- 
perienced specific autobiographical events. In the ‘lost-in-the-mall’ 
procedure, family members’ assertions that an event did happen to 
participants as young children provided evidence crucial for the de- 
velopment of the many different false memories elicited through the 
procedure. False evidence against criminal suspects has been impli- 
cated as an important contributor to false confessions, particularly 
of the sort where suspects come to falsely believe (and sometimes 
falsely remember) that they committed the crime (for reviews, see 
[1, 2]). Dream interpretation suggesting that a specific type of event 
happened or exposure to doctored photographs depicting events 
that did not occur have led subjects to falsely believe and/or re- 
member that they experienced events ranging from before the age of 
3 to their recent past [36, 50]. 

Therapeutic techniques and persuasion can enhance belief in an 
event directly, by seeming to provide evidence that the event did 
happen. They can also do so indirectly, by making it more plausible 
that the event happened as believed, such as when information that 
a person is generally aggressive or has a criminal record can make 
it more plausible that he or she committed a specific aggressive act. 
Or information that sexual abuse is often committed by stepfathers 
can make it more plausible that one’s own stepfather would plausibly 
engage in abuse. 


Beliefs regarding sexual abuse can come from many sources of 
suggestion. These can include others who have become concerned 
about the possibility of abuse, others who talk to the person about 
the general issue of abuse or their own (or about the abuser), infor- 
mation acquired in sex education classes, media, etc., forensic inter- 
viewers, popular ‘survivor’ literature, and others. 

Even the simple process of thinking about an event can make it 
seem more likely to have happened in the past or to happen in the 
future. Studies of imagination inflation have demonstrated this with 
a very simple procedure. Participants first rate the likelihood that 
specific events happened to them in the past (or will in the future). 
Then they imagine that event happening. Finally, they rerate the 
likelihood of the event’s occurrence. Imagination increases the per- 
ceived likelihood that the event did (or will) happen (for example, 
[51]). Analogous to the process of imagination inflation, Sharman, 
Manning, and Garry [52] illustrated ‘explanation inflation’ whereby 
the process of explaining how or why childhood events could have 
happened leads persons to become more confident that they did 
happen. Both imagination inflation and explanation inflation have 
been demonstrated for a variety of past and/or future events [52]. 

Another laboratory procedure is analogous to situations where a 
therapist provides feedback to clients that their symptoms suggest 
abuse. Participants first rate the likelihood that certain events oc- 
curred during particular periods of their life. They then answer a 
series of questions (sometimes by computer) and receive false feed- 
back that their scores indicate they experienced a particular kind of 
event. Sometimes this is followed by imagination procedures to pic- 
ture the event in question. Finally, participants again rate the likeli- 
hood that the event actually happened and, in some studies, indicate 
the extent to which they recall the event. Like the imagination or 
explanation inflation studies, some participants in the false feedback 
studies develop stronger beliefs that the target event did happen and 
some also develop memories of the event. 

Generally, procedures targeting belief that an event did occur vary 
in suggestiveness, with imagination/explanation procedures falling 
at the lower end of suggestion, false feedback studies suggesting that 
the person did or probably did experience the event in the middle, 
and those providing ‘evidence’ that the event did occur at the high 
end (such as false claims by family or doctored photos). The more 
suggestive procedures tend to more strongly affect belief in, and/or 
memories of, the suggested event (for a review, see [35]). These pro- 
cesses can make the event more familiar, more accessible in memory, 
and more fluent (easy to picture or understand), all bases upon 
which people judge whether or not something is or is not a memory. 
Moreover, the procedures can cause the person to generate images 
of the event that can become misinterpreted as veridical memory. 


The role of sensory images: developing what 


‘feels’ like a memory 





Most think of event memory in terms of the ability to replay sensory 
images of the event in one’s mind. Some might not think of newly 
developed autobiographical beliefs as memories, until they can pro- 
duce a sensory replay of the event. Sensory images are indeed one 
component of memory. But what is less well known is that over time, 
the images one associates with events can change—and even be cre- 
ated through a number of processes. These may consist of exposure 
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to verbal or written accounts causing the person to generate images 
of the event, sensory images from other sources (such as pictures 
or video depictions), or internal processes such as the person’s at- 
tempts to remember and picture the event or efforts to imagine how 
things must have happened. Some internal processes are triggered 
by therapeutic procedures involving active imagination/imaging 
(for example, guided imagery or hypnosis). Others may occur spon- 
taneously such as dreams or sensory image associations triggered by 
exposure to reminders of the type of event. They may also be created 
through conversations with others or interviews, as described more 
fully in the section on suggestion. 

If the person mistakenly attributes the source of such images to 
the experience of the original event, rather than to the real source, 
the images may be interpreted as a ‘memory. Indeed, a substantial 
body of scientific literature has shown that it is commonplace to con- 
fuse sensory images acquired independently of the original event (or 
any event) as ‘memories’ for the event in question (for reviews, see 
[24, 30, 51]). 

With a foundation of plausibility for abuse, such misattributions 
become more likely. Mazzoni et al. [43], for example, suggested that 
imagination procedures can create false memories in a three-stage 
process by which a person first perceives the event as more plausible, 
then comes to believe the event actually did occur, and finally rein- 
terprets the narrative as actual memories. Imagery is crucial to this 
process, in that Green and Brock [53] have shown that narratives are 
persuasive to the extent they evoke imagery of the event. 


Imagery in the context of suggestion 


Essentially, suggestion involves any statement or action asserting or 
implying a particular version of an event is true or that others are 
not true. It may also state or imply that the speaker strongly believes 
what is suggested and/or the speaker strongly prefers that the listener 
reports a particular version of events. In effect, suggestion plants an 
idea in the target’s mind about either what might have or probably 
did happen or what another person wants the target to say or do. 

Suggestion can create imagery of events that did not occur and 
increase belief that the events did occur. Suggestion can also create 
expectations of more approval/reinforcement for giving specific 
memory reports and expectations of disapproval/punishment for 
continuing to give the initial report. These processes have been 
shown to lead to false memories and false reports of relatively trivial 
events in the laboratory, as well as of dramatic personal experiences 
among both children and adults, including false confessions to 
heinous crimes such as murder of one’s own children, parents, sib- 
lings, and others [30, 54]. 

Some common therapeutic techniques are inherently suggestive 
or are executed in a suggestive manner. They can simultaneously 
encourage imagery of suggested events and increase plausibility 
and belief. Guided imagery, for example, may be practised alone 
or in the context of other suggestive techniques. It involves efforts 
to induce the client to picture an event through multiple sensory 
modalities and can involve suggestions for both focus and content. 
Though sufficient alone to produce false memories, false memories 
become more likely when guided imagery is combined with other 
suggestive techniques. Herndon and colleagues [55] tested com- 
bined effects of guided imagery and an analogue to survivor group 
therapy. Each increased false memories and participants’ confidence 
in the memories they reported. But when exposed to both, fully 75% 
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of participants developed false memories for a suggested early child- 
hood medical procedure (compared to 5% among those exposed to 
neither and 41-50% among those exposed to one source of sugges- 
tion) (for a demonstration of additive effects of guided imagery, peer 
pressure, and repeated interviewing, see [56]). 

One frequently practised memory recovery technique is hyp- 
nosis, often used in conjunction with guided imagery or efforts to- 
wards age or past life regression. Though hypnosis does increase the 
amount recalled, even without suggestion, much new information is 
false. But unless carefully trained otherwise, therapists can be very 
suggestive with their hypnotic subjects—both through suggested 
focus and content and through selective reinforcement of expected 
reports of abuse, satanic ritual abuse, alien abduction, womb, birth, 
infantile, or past life experiences, etc. They may also be suggestive 
through directed imagery such as when therapists instruct clients 
to change the view of the ‘camera’ of memory to focus on a person, 
action, or location not originally noticed. This not only suggests 
memory content but also suggests unrealistic processes and abilities 
of memory. Given widespread public and professional beliefs in the 
ability of hypnosis to enhance recall, it is not surprising that both 
place greater confidence in memories recovered under hypnosis 
(for reviews, see [57-59]). The role of hypnosis in false memories 
appears to be, in part, rooted in the vulnerability of those who are 
hypnotized, in that hypnotizability itself is related to vulnerability to 
development of false memories [60]. 

These are only a few among many techniques that can alter the 
components of memory (plausibility, narrative, images, etc.). 
Others include survivor group participation, journaling and other 
‘homework, and eye movement desensitization and reprocessing 
(EMDR)—and even use of drugs such as sodium pentothal (3, 
17, 18]. 

Finally, related to the adoption of a ‘survivor’ identity, some re- 
search has shown that when led under hypnosis to believe in char- 
acterizations of themselves that were inconsistent with their current 
identities, participants have shown selective attention to, and en- 
hanced memory for, other information consistent with the new 
characterizations. In addition, they tended to perceive information 
consistent with the new characterizations as more self-relevant and 
personally meaningful and to more easily access autobiographical 
information consistent with the new characterizations. These ac- 
cessed memories often entail a reinterpretation of previous experi- 
ences that seemed to verify the new characterizations (for a review, 
see [58]). 


The role of a memory judgement criterion 





An additional step in the development of consequential false mem- 
ories is that the person must apply one or more criteria to assess 
whether a new narrative is a memory of a real event. 

Generally, the stronger the verbatim images or gist traces and 
the weaker the memory judgement criterion, the more likely a par- 
ticular narrative is to be judged a memory. Verbatim images devel- 
oped and strengthened through the many processes described thus 
far can be sufficiently vivid and detailed to pass even the most strin- 
gent memory judgement criterion. Gist traces (an abuse narrative) 
can likewise be strengthened by processes such as imagination, ex- 
posure to plausibility-enhancing information, the development of 


abuse-related personal identities, and others. The more plausible 
the narrative in light of the person’s general knowledge and beliefs, 
or the more information of any kind that appears consistent with 
the narrative in question, the more likely it will be judged an actual 
memory. Moreover, the sense of familiarity with an image or nar- 
rative, which grows stronger the more it is discussed or brought to 
mind, can be interpreted as memory. 

Neither the strength of verbatim and gist traces or their plausi- 
bility nor their familiarity are necessary criteria for a person to con- 
clude the memory is real, however. Loftus and Davis [17] described 
in detail the way in which the criteria one uses to assess whether or 
not a memory is real may be highly variable and malleable. Some 
people may require of themselves that they have very clear sensory 
images of an event in order to interpret or report it as a memory. 
Others may consider beliefs without images or very vague images 
to be memories. 

These and other memory judgement criteria are malleable and 
can exert significant influence on reports and behaviour. Some ther- 
apists and survivor literature encourage beliefs that dreams can be 
verbatim replays of actual events, and therefore that a specific dream 
is a memory of a real event. But given that dreams are most likely to 
reflect daily events and what is on one’s mind, a person who is ex- 
posed to content about abuse or who is thinking a great deal about 
abuse might dream about abuse for that reason. Beaulieu-Prevost 
and Zadra [61] provide a recent review of research showing that 
dreams are a common source of false memories and addressing the 
conditions under which dreams are most likely to be interpreted 
as memories, even in the absence of suggestion (and they further 
demonstrated that false memories for dreams themselves can be 
planted). 

Other therapy-related beliefs promote very liberal memory judge- 
ment criteria that can lead clients to classify a variety of questionable 
experiences as true event memories. For example, Francine Shapiro's 
book [62] on EMDR states that any memory that emerges during 
EMDR is, in fact, true. A therapist may also encourage the belief, 
already prevalent among laypersons, that all memories that emerge 
during hypnosis are true. Some therapists and survivor literature 
have also promoted the belief that bodily sensations are ‘body mem- 
ories’ of abuse (for example, [63]). 

Bass and Davis [7] discussed body memories and promoted liberal 
memory judgement criteria: “You may have forgotten large chunks of 
your childhood ... yet there are things you do remember. When you 
are touched in a certain way, you feel nauseated’ (p. 22). The authors 
suggested that although memories might be hazy, dream-like, or 
fragmentary, this was not evidence of inauthenticity: ‘if you ... have 
a feeling that something happened to you, it probably did’ (p. 21). 
Though the authors recounted stories of those who doubted their 
new memories, or explicitly said they knew they were not real, they 
also suggested these feelings did not justify discounting the mem- 
ories: ‘It’s natural that you have periodic doubts ... But that’s because 
accepting memories is painful, not because you weren't abused’ 
(p. 86). 

Beliefs that bodily sensations can be, or reflect, traumatic event 
memories are widespread among practising therapists, laypersons, 
and students. From one-third to over three-quarters of therapists 
in the United States and Canada report beliefs that body pains and 
other physical symptoms are symptoms of sexual abuse and/or that 
sensory impressions from early life can represent reliable memories 


that can be recovered (for a review, see [64]). Therapists adopting 
such beliefs may tell patients that a particular bodily sensation is, or 
reflects, a memory of abuse and thereby justify a series of suggestive 
therapeutic procedures to unearth a more explicit narrative or sen- 
sory ‘memory. 


Summary 


Influences that can distort or fabricate memories are well established 
within the scientific literature. Remaining controversy concerns the 
nature of events for which false memories can be created. Some 
argue that memories for traumatic events cannot be planted and la- 
boratory studies of less dramatic false memories are not applicable 
to sexual abuse. Nevertheless, many accounts of ‘retractors’ (those 
who reported recovered memories and later recanted them) have 
suggested that indeed false memories for horrific forms of abuse can 
be planted (for reviews, see [17, 18]). 

While laboratory studies are limited by ethical constraints to 
avoid attempts to create potentially very consequential false mem- 
ories, a substantial and growing literature has demonstrated that 
false memories for a variety of relatively dramatic, sometimes scary, 
autobiographical events can be planted in mentally healthy parti- 
cipants across the lifespan. And they can be planted by means of 
procedures used within therapy (such as dream interpretation, hyp- 
nosis, guided imagery, and others) or practised by the person (active 
imagination, reading survivor literature, exposure to abuse accounts 
in media, and others). Together with suggestion (that abuse may 
have, or must have, taken place) and loosening of memory judge- 
ment criteria, such procedures greatly increase the likelihood of the 
development of false memories of sexual abuse, satanic ritual abuse, 
or other seemingly implausible events. 


Conclusions 


Although our view is that evidence for massive repression is lacking, 
by endorsing this conclusion, we in no way deny the reality of sexual 
abuse. All parties in this controversy can appreciate the widespread 
nature of the problem and the severity of the consequences for many 
victims. However, we suggest two important conclusions. 

Firstly, claims of repressed and recovered memories are inher- 
ently suspect. The repression hypothesis has itself proven difficult, 
and some believe virtually impossible, to test. And it is fundamen- 
tally incompatible in many respects with the science of memory. 
Moreover, a very large and growing scientific literature has shown 
that common therapeutic procedures and other influences can dis- 
tort memories or lead to the development of fully fabricated false 
memories of sexual abuse and other dramatic and/or traumatic life 
events. 

When such claims are litigated, at the very least they should be 
given significantly reduced weight. There are two possible explan- 
ations for a subjectively ‘recovered’ memory: (1) that the memory 
is real, that it has indeed been repressed for as long as decades, and 
that it has now been recovered intact; or (2) that the memory was 
at best distorted—or at worst entirely fabricated—through pro- 
cesses shown across many scientific studies to have such effects. 
Even continuous memories, whether reported immediately after 
an event or significantly later, can be distorted. Memories can also 
be totally confabulated, even if no claim of repression or suspect 
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memory recovery procedures are involved. But when there is a 
claim of repression, and particularly when the claimant has been 
subject to influences shown to distort or fabricate memories, re- 
search suggests the claim must be given reduced weight, if not fully 
discounted. A plausible explanation, one that is arguably much 
more plausible than the reality of the memory, exists for where the 
memory came from (for reviews of legal issues entailed in the re- 
covered memory debate, see [65-67]). 

Secondly, research suggests that some therapeutic practices should 
be avoided—prominently direct suggestions and arguments to cli- 
ents claiming that they have likely been sexually abused, particularly 
in the absence of clients’ initial memories of abuse or in the face 
of an absolute denial. Also undesirable are focusing therapy on the 
issue of abuse with such clients, using memory recovery procedures 
within therapy focused upon retrieval of memories of abuse, and 
recommendations for outside activities focused on recovery of such 
memories (such as reading abuse centred self-help literature, par- 
ticipation in survivor groups, and other ‘homework’). These are all 
procedures known to increase the risk of memory distortion and 
fabrication [17, 18, 68]. 

These recommendations may be controversial, since despite the 
highly publicized nature of the controversy, belief in repression re- 
mains widespread within the therapeutic community and among 
some clinical or memory scientists (for surveys assessing such 
beliefs, see [64, 69]). Many popular therapies endorse the idea of 
memory ‘recovery’ and include techniques for that purpose, for ex- 
ample EMDR, sensorimotor psychotherapy, somatic experiencing 
therapy, neurolinguistic programming, alien abduction therapy, 
internal family systems therapy, and others [64]. Many therapists 
incorporate one or more practices that increase the risks of false 
memory development. Beliefs supporting such practices are likely 
to remain widespread for some time to come. 

Our hope and expectation is that researchers will continue the 
rapid development of new research and paradigms to inform the 
disputed issues and to discourage practices with known risks. While 
many false memories may be benign, those regarding sexual abuse 
are highly consequential and toxic for both the accused and the 
accuser. 

Finally, we hope that better education of professionals and the 
public will improve understanding of memory and that this educa- 
tion will provide inoculation against many faulty claims prevalent 
in popular culture. Although some widely held beliefs are correct, 
many respondents across varied populations have endorsed beliefs 
diametrically opposed to the actual operation of memory (for a re- 
view, see [64]). 
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Contemporary clinical approaches to anxiety disorders are based 
on a categorical approach. The Diagnostic and Statistical Manual of 
Mental Disorders, third edition (DSM-III) [1] provided a key im- 
petus to the field by providing operational diagnostic criteria for a 
range of different anxiety disorders and by clearly demarcating these 
conditions from normal anxiety responses. Subsequent decades saw 
progress in delineating the neurobiology of each of the anxiety dis- 
orders, as well as the introduction of specific pharmacotherapies and 
psychotherapies for the treatment of generalized anxiety disorder, 
panic disorder, specific phobia, and social anxiety disorder. 

However, several factors have also suggested the value of a return 
to a more dimensional approach to anxiety. First, there is signifi- 
cant comorbidity among the anxiety disorders, and the severity of 
anxiety symptoms in community and clinical symptoms falls on 
a continuum [2]. Second, psychobiological research has empha- 
sized that multiple genes of small effect and perhaps common en- 
vironmental factors are implicated in anxiety disorders, and that 
overlapping neural circuits and molecular mechanisms underpin 
these conditions [3]. Third, there is overlap in interventions for anx- 
iety disorders, and there is value in a transdiagnostic approach to 
treatment [4]. 

It is important to emphasize that categorical and dimensional 
approaches can readily complement each other [5-7]. Any dimen- 
sional variable can be split to form categories, and categorical con- 
structs can be evaluated with dimensional measures. Categorical 
constructs may have significant clinical utility, facilitating decision- 
making and treatment, while dimensional constructs may be par- 
ticularly important in research settings where fine delineation of 
underlying mechanisms is needed to advance diagnostic validity. 
Still, while categorical and dimensional approaches are complemen- 
tary, it is timely to focus here on dimensions. 

This chapter will provide a theoretical basis for approaching the 
psychobiological dimensions that underpin the anxiety disorders. 
We first begin with a brief description of categorical approaches. We 
then outline, in turn, dimensional approaches to the assessment of 
anxiety, the range of mechanisms underlying anxiety disorders, and 
transdiagnostic approaches to treatment. Subsequent chapters in 
this section of the volume will provide additional detail on each of 
these topics (for example, see Chapters 80, 87, 89, 90, and 92). 





DSM-II [8] was strongly influenced by the psychodynamic theory 
and described a number of categories of neuroses, including hys- 
terical neurosis (modifiable symptoms symbolized underlying con- 
flicts), phobic neurosis (fears are displaced from a focus unknown 
to the patient, to a phobic object), neurasthenic neurosis (charac- 
terized by somatic symptoms such as chronic weakness), obsessive- 
compulsive neurosis, and depressive neurosis. These were viewed as 
predominantly characterized by anxiety, which produced a range 
of symptoms (for example, anxious over-concern, panic, somatic 
manifestations) and which were experienced as subjective distress 
from which relief was desired [9]. The International Classification of 
Diseases, ninth revision (ICD-9) chapter on neurotic disorders simi- 
larly covers a fairly broad range of conditions [10]. 

DSM-III attempted to take a more atheoretical stance to classifica- 
tion, introducing a chapter on anxiety disorders and providing oper- 
ational diagnostic criteria for each of these conditions [1]. DSM-III 
anxiety disorders included agoraphobia, generalized anxiety dis- 
order, obsessive-compulsive disorder (OCD), panic disorder, post- 
traumatic stress disorder, simple phobia, and social phobia. DSM-IV 
also emphasized that panic attacks could occur ina range of different 
anxiety disorders and introduced the constructs of acute stress dis- 
order, anxiety disorders due to a general medical condition, and 
substance-induced anxiety disorder [11]. The ICD-10 chapter on 
‘neurotic, stress-related, and somatoform disorders’ includes these 
various conditions, as well as a number of additional disorders [12]. 

In DSM-5 [13] and ICD-11, the category of anxiety disorders has 
continued to evolve. The development of DSM-5 saw a strong focus 
on diagnostic validity and the inclusion of neuroscientific data, 
while the development of ICD-11 has emphasized the issue of clin- 
ical utility and considerations of public mental health [14, 15]. Both 
classification systems have now removed obsessive-compulsive and 
related disorders and trauma- and stressor-related conditions from 
the anxiety disorders, and placed them in separate chapters. At the 
same time, in both classification systems, the new chapters remain 
adjacent to that on anxiety disorders, and the overlap in their symp- 
tomatology is emphasized. 

Fear may be described as ‘an emotional response to perceived 
imminent threat or danger associated with urges to flee or fight’ 
[13]. This ‘fight-or-flight response’ is characterized by an increased 
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startle reaction and physiological autonomic alterations [16, 17]. In 
contrast, anxiety is ‘the apprehensive anticipation of future danger 
or misfortune accompanied by a feeling of worry, distress, and/or 
somatic symptoms of tension. The focus of anticipated danger may 
be internal or external’ [13]. From an evolutionary perspective, fear 
and anxiety appear to be adaptive responses [18]; and in the clin- 
ical setting, it is important to differentiate normal functional anxiety 
from pathological dysfunctional anxiety. 

Since the fourth edition of the DSM, the differentiation of normal 
and pathological anxiety is crucially dependent on the clinical cri- 
terion; disorders are characterized by clinical distress or by signifi- 
cant impairment [19]. The field is unfortunately unable to rely on 
biomarkers for this differentiation [20]. Instead, clinical judgement 
is key; considerations such as duration and persistence, frequency 
or intensity of symptoms, disproportionality of symptoms (given the 
context), and pervasiveness of symptom expression across contexts 
all point to evidence of underlying dysfunction and so help differen- 
tiate normal and pathological anxiety [21]. 


Dimensional approaches to anxiety assessment 





Although current psychiatric classification systems primarily em- 
ploy categorical constructs, itis notable that during the development 
of DSM-5, significant emphasis was placed on dimensional assess- 
ments of anxiety disorders [2]. Partly as a result, DSM-5 includes 
cross-cutting questions to assess anxiety symptoms, emphasizes that 
panic attacks may occur in a range of anxiety and other disorders, 
and underscores the value of assessing anxiety severity in depres- 
sion [13]. In this section, we focus on a number of non-diagnostic 
constructs that are relevant to the dimensional assessments of 
anxiety: behavioural inhibition (BI), anxiety sensitivity (AS), and 
neuroticism. 


Behavioural inhibition 


BI is a heritable temperamental trait characterized by restraint, cau- 
tion, or fearfulness and withdrawal in response to unfamiliar or 
novel people, stimuli, or situations [22, 23]. Approximately 15-20% 
of young children are born with marked BI, which is usually identi- 
fied early in life when children transition from the normative devel- 
opmental stage of ‘stranger anxiety’ and begin to adapt behaviourally 
to threat stimuli [24, 25, 26]. The temperamental bias categorizing 
BI may be underpinned by varying responses to novel stimuli by 
the amygdala [22]. For example, adults previously categorized as be- 
haviourally inhibited in childhood have been found to demonstrate 
greater amygdala response to novel (vs familiar) faces, when com- 
pared to non-behaviourally inhibited controls [27]. 

Behaviourally inhibited children appear to be at increased risk of 
developing anxiety disorders in adulthood, particularly social anx- 
iety disorder (SAD) [23, 28]. In their recent meta-analysis, Clauss 
and Blackford [26] found that BI was associated with a clear increase 
in the risk of developing SAD (OR = 7.59). This risk remained sig- 
nificant after considering individual study differences in phenotype 
assessment, parental risk, age of anxiety diagnosis, and other poten- 
tial confounders. 

The conceptualization of BI as a variant of normal (i.e. a tem- 
perament) emphasizes the value of a dimensional approach to 
the spectrum of normal and pathological anxiety. In particular 


environments, BI may have adaptive value [29], but increased BI 
may also increase sensitivity to the development of anxiety dis- 
orders. Further work is needed on how particular environments may 
intersect with BI to increase the risk for such conditions [30, 31]. 
Research elucidating the neurobiological underpinning of BI, as well 
as its role in predicting adult anxiety, may ultimately inform primary 
prevention tools to identify BI in children and to pre-empt the devel- 
opment of subsequent SAD. 


Anxiety sensitivity 


AS is the fear of anxiety-related sensations, based on the belief that 
such sensations are harmful [32, 33]. AS may be conceptualized as an 
‘anxiety amplifier’ [33], which increases arousal-related sensations, 
thus worsening anxiety in affected individuals. Individuals with AS 
are at increased risk of developing anxiety symptoms, particularly 
those pertaining to panic [34]. One meta-analysis [35] reported 
greater AS among patients diagnosed with an anxiety disorder vs 
non-clinical control participants. Further, panic disorder was found 
to be associated with greater AS, compared to the other anxiety dis- 
orders (with the exception of PTSD). 

AS is widely believed to comprise three interrelated phenotypic 
concerns: physical, cognitive, and social [33, 36]. The aetiology of 
these dimensions may be influenced by additive genetic and/or cu- 
mulative environmental effects. Preliminary evidence suggests that 
AS is heritable in women [33, 37], which may be attributable to hor- 
monally driven gene expression [38]. 

The Anxiety Sensitivity Index (ASI) [39] is the most widely 
used tool to assess AS. The ASI comprises 16 items to ascertain 
an individual’s beliefs about the sequelae of anxiety-related sensa- 
tions. Again, it is possible that AS and other markers of risk for 
the development of anxiety disorders could be screened for, and in 
those at high risk, preventive interventions could be developed and 
investigated. 


Neuroticism 


Neuroticism—a personality trait predisposing individuals to higher 
negative affect—has been rigorously studied for several decades 
and has been strongly associated with common mental disorders 
(CMDs) in general and internalizing disorders, including anxiety 
disorders, in particular [40, 41]. This association is consistent with 
Clark and Watson’s [42] influential tripartite model which suggested 
that high negative affect is common to both anxiety and depres- 
sion (with physiological hyperarousal specific to anxiety, and anhe- 
donia specific to depression). Indeed, an early proposal for DSM-5 
was for an emotional cluster of disorders characterized by negative 
affectivity [43]. 

A causative model would predict that shared genetic and environ- 
mental factors lead to both neuroticism and internalizing disorders 
[44]. In confirmation, a large population-based twin study found 
that genetic factors common to neuroticism accounted for up to 
half of the genetic risk across internalizing disorders [45]. This gen- 
etic overlap may also help to account for the extensive comorbidity 
among the internalizing disorders. Indeed, in their recent meta- 
analysis of genome-wide linkage scans of anxiety-related pheno- 
types [46], Webb and colleagues reported a high global correlation 
between neuroticism and anxiety disorders. 

A cross-cutting dimensional (rather than categorical) approach to 
assessing neuroticism [2] may be useful for the assessment of both 


anxiety and non-anxiety disorders, and there are well-validated 
scales for the assessment of neuroticism [44, 47]. 





Mechanisms underlying anxiety disorders 


Animal models of fear 


Pathological anxiety is often characterized by more rapid atten- 
tional threat engagement, elevated stress reactivity to potentially 
threatening stimuli, threat-based assessment of ambiguous stimuli, 
and impaired disengagement and avoidance of threat stimuli [48- 
50]. Further, increased amygdala responses to threat stimuli are often 
noted in pathological anxiety [50]. Fear conditioning and extinction 
have long been used to develop translational models of anxiety, as 
discussed in detail in Chapter 87. The amygdala and prefrontal areas 
of the brain are key components of the neurocircuitry underlying 
fear conditioning and extinction [51, 52] (see Chapter 90). 


Research Doman Criteria approach 


The Research Domain Criteria (RDoC) approach [49, 53-55]— 
developed by the NIMH and described in detail in Chapter 8—is 
informed by a translational formulation of anxiety disorders as 
arising from neurobiological, cognitive—affective, and neural circuit 
dysfunction. This model represents anxiety disorders (and indeed 
all mental disorders) on a continuum, and not as discrete entities 
that are separate from mental health and well-being. In contrast to 
the categorical DSM approach (see Chapter 7), the RDoC approach 
presents a matrix of key behavioural dimensions or ‘domains’ (rep- 
resented by rows), as well as a number of analytical tools, as follows 
(Fig. 86.1) [56]: 


e Rows (domains of function): negative valence systems, positive 
valence systems, cognitive systems, social processes, arousal/regu- 
latory systems, sensorimotor systems. 

e Columns (modes of analyses): genes, molecules, cells, circuits, 
physiology, behaviour, self-report, paradigms. 


Thus, psychiatric symptoms represent variation (deviation) in 
normal traits (ranging from normal to abnormal functioning) 
across one or more of the RDoC’s functional domains. Currently, the 
RDoC model holds promise as a research framework for the dimen- 
sional assessment of anxiety disorders, rather than as a diagnostic 
tool. Ultimately, such models—in conjunction with categorical 
diagnoses that remain integral to clinical practice—may assist in 
incorporating advances in neurogenetics, behavioural science, and 
the neurosciences, thus allowing the translation of basic work into 
clinically useful constructs. 

While quite promising as an approach to anxiety disorders, 
RDoC constructs are potentially no less complex than clinical dis- 
orders [57]. Further, the RDoC model has been criticized for an 
overly reductionist approach to psychiatric disorders, emphasizing 
the pathophysiological role of discrete neurocircuits and ignoring 
a range of other possible approaches to the conceptualization of 
mental illness [58]. 


Transdiagnostic neural circuitry 


Human functional imaging studies have built on the basic 
constructs of fear conditioning and extinction to address 
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anxiety-related symptoms and behaviours (see Chapter 90). 
Broadly, such neuroimaging studies have suggested involvement of 
the insula, periaqueductal (PA) grey matter, and hypothalamus— 
which together are responsible for registering negative stimuli, 
processing resultant emotions, and autonomic regulation—in the 
neurocircuitry of fear and anxiety [59]. Further, amygdala hyper- 
activity has been found in response to disorder-specific stimuli in 
social phobia, specific phobia, and PTSD [60-63]. 

There are fewer data on the role of the amygdala in panic disorder 
and generalized anxiety disorder, and OCD and related syndromes 
are likely underpinned by dysfunctional cortico-striato-thalamo- 
cortical (CSTC) circuitry (see Chapters 93-100). Nonetheless, 
hyperactivity of the amygdala in human subjects with certain anx- 
iety disorders is consistent with the neurocircuitry of fear condi- 
tioning proposed in animal studies. 

In summary, a transdiagnostic neural circuit network, informed 
by classical fear conditioning paradigms, may underlie a number of 
the commonly occurring anxiety disorders [64, 65]. Specific func- 
tional and structural abnormalities in this large-scale neural circuit 
may result in biotypes which not only differ from traditional cat- 
egorical DSM diagnoses, but also overlap, intersect, and interact 
[66]. Further delineation and validation of this network may be of 
value in informing novel diagnostic and therapeutic interventions 
for anxiety disorders, thus potentially bridging the gap between 
neuroscientific advances and clinical practice and perhaps even ul- 
timately contributing towards precision medicine in psychiatry. 


Transdiagnostic neurogenetics 


Genetic epidemiological studies have reported anxiety disorders to 
be moderately heritable, with heritability estimates ranging from 30% 
to 50% [67]. Further, genetic factors underlying anxiety disorders 
may be correlated with those underlying related phenotypes such 
as neuroticism [46] and may be transdiagnostic. Thus, ‘overlapping 
genes’ may account partially for the lifetime co-occurrence both of 
anxiety disorders and of related phenotypes. A detailed update on 
the genetics of anxiety and stress disorders is found in Chapters 80 
and 89. However, a brief summary of the major findings will illu- 
minate the dimensional issues described here. 

Candidate gene association studies have suggested a number 
of genes that may be involved in increasing vulnerability both to 
normal anxiety traits and to pathological anxiety, including those 
in the serotonergic (for example, 5-HTT), noradrenergic (for ex- 
ample, COMT), and hypothalamic-pituitary-adrenal (HPA) axis 
(for example, CRHR1, FKBP5) systems [68]. These findings are 
endorsed and complemented by a large body of work on animal 
models of anxiety, including a number of mouse studies mapping 
several genetic loci thought to affect anxiety-like behaviour in ro- 
dents [69]. Human and animal studies have also suggested a role 
for both gene-gene and gene-environment interactions in con- 
tributing to the overall risk for anxiety disorders. Further, there 
is some evidence suggesting a role for epigenetic DNA modifica- 
tions (for example, DNA methylation) and changes in gene ex- 
pression in the effect of early life stress—such as poor maternal 
care—on anxiety-like behaviours in mice. Genetic variants may 
also confer risk via intermediate phenotypes (endophenotypes) 
such as BI, AS, and trait neuroticism [46]. Indeed, it is likely that 
anxiety disorders and phenotypes are polygenic with multifac- 
torial aetiology. 
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Transdiagnostic approaches to treatment 


of anxiety disorders 





The concept of ‘pharmacotherapeutic dissection, with different dis- 
orders responding to different drugs, was key in delineating dif- 
ferent categories of anxiety disorders, including panic disorder 
[70]. However, there is also long-standing evidence that a range 
of different psychotropic agents (for example, benzodiazepines, 
SSRIs, D, blockers) are useful across several different anxiety con- 
ditions (see also Chapter 92). Somewhat analogously, early interest 
in specific cognitive behavioural therapy (CBT) treatments for par- 
ticular anxiety disorders has been replaced by a growing interest in 
transdiagnostic CBT. Indeed, transdiagnostic CBT interventions are 
increasingly viewed as efficacious across anxiety disorders, with a 
large treatment effect size and stable maintenance of improvement 
during follow-up [4, 71]. 

More recently, the Unified Protocol for Transdiagnostic 
Treatment of Emotional Disorders (UP) has emerged as a focus 
of transdiagnostic CBT studies. The UP comprises five treatment 
modules to target emotion processing, regulation, and extinction, 
as follows [72-74]: to increase present-focused emotion awareness; 
to increase cognitive flexibility; to identify/prevent emotion avoid- 
ance and maladaptive behaviours; to increase awareness and toler- 
ance of emotion-related physical sensations; and interoceptive and 
situation-based emotion-focused exposure. 

One small randomized controlled trial of clinical patients with 
heterogenous anxiety disorder diagnoses [72] found that immediate 
UP treatment resulted in significant improvement in outcome meas- 
ures of general anxiety/depressive symptoms, clinical severity, and 
functional impairment, as compared with delayed treatment fol- 
lowing a waitlist period. Similarly, another study of UP treatment in 
two consecutive trials [75] reported decreased AS from pre- to post- 
treatment; and the majority of this change coincided largely with the 
introduction of interoceptive exposure therapy. 

The next step in this area is work on combined transdiagnostic 
pharmacotherapy and psychotherapy. One study has investigated the 
effect of SSRIs/SNRIs and sedatives on the efficacy of transdiagnostic 
5-week group CBT [76]. These authors found that combined anti- 
depressant/CBT resulted in greater improvement in depressive 
symptoms when compared to CBT monotherapy (and that the effi- 
cacy of combined therapy was less for those patients also treated with 
sedatives). Nevertheless, given the evident paucity of empirical data 
on transdiagnostic combined psychotherapy and pharmacotherapy 
for anxiety disorders, further work is required before such interven- 
tions are routinely advised. An even greater transdiagnostic chal- 
lenge is primary prevention of anxiety disorders, described at length 
in Chapter 91. 


Conclusions 


Categorical approaches to anxiety are currently key in the clinical 
diagnosis and management of DSM-5- and ICD-11-defined dis- 
orders. Dimensional approaches may hold promise for future re- 
search and clinical innovation. These approaches are informed by a 
growing body of evidence that dimensional assessments of anxiety 
and related symptoms are informative, that spectrum approaches to 


understanding cognitive—affective mechanisms and the underlying 
neurocircuitry are useful, and that transdiagnostic treatment inter- 
ventions are effective. However, dimensional approaches are, for 
now, a complement to—rather than a substitute for—current cat- 
egorical approaches to anxiety disorders. 
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Existing anxiolytic drugs, such as benzodiazepines (BZs) and se- 
lective serotonin reuptake inhibitors (SSRIs), have been developed 
several decades ago, partly as a result of serendipity (BZs), partly 
because they were primarily targeted at major depressive disorder 
(SSRIs). Extensive search has occurred in the ensuing decades to 
find new targets for anxiolytics, based on the known and anticipated 
neurochemical mechanisms underlying anxiety. However, no real 
breakthroughs have emerged. Currently, SSRIs remain the preferred 
drugs for the treatment of anxiety disorders (and augmented with 
BZs for a limited time interval). Although efficacious, some patients 
are treatment-resistant, and inherent disadvantages are attached to 
SSRI use, including lasting side effects. Therefore, in the last decades, 
preclinical and clinical studies have extensively focused on other 
mechanisms to target anxiety processes in order to ultimately treat 
anxiety disorders. Since then, evidence has emerged pointing to a 
modulating role of corticotrophin-releasing factor 1 (CRF1) recep- 
tors, neurokinin 1 (NK-1) receptors, and glucocorticoid receptors 
(for a review, see [1]). Although the efficacy of such drugs during 
development often looked promising in the preclinical phase, thera- 
peutic anxiolytic effects were often absent or inferior to the existing 
drugs. Thus, so far no ‘superior’ drugs have reached the market that 
will replace the current treatment choices. 

As described in Chapter 86 in the new DSM-5, the former 
DSM-IV category of ‘Anxiety disorders became three separate 
categories: ‘Anxiety disorders; ‘Obsessive compulsive disorders, and 
“Trauma and stress-related disorders. This categorization does not 
create an easier pathway for new drugs or an easier discovery track. 
‘Thus, the creation of preclinical models for each individual anxiety 
disorder is not only impractical, but actually impossible. Therefore, 
it has been proposed that a classification based on the neurobio- 
logical mechanisms underlying anxiety and pathological anxiety 
(intermediate phenotypes) would enhance drug development [2]. 
Another classification approach would be based on dysfunctional 
neurotransmitter systems, for example, abnormal serotonin trans- 
porter function [3]. In the present chapter, we mainly focus on what 
we know on the pharmacology of GABA-ergic and serotonergic 



































drugs, and we try to outline, focusing on genetic and genomic ap- 
proaches and innovative animal models of anxiety disorder, how 
to find putative new targets and some new promising avenues and 
technologies in the anxiety field. 


GABA, receptor a subunits and anxiety 


In the 1950s, BZs were serendipitously found as having therapeutic- 
ally interesting activity with anxiolysis, sedation, anticonvulsive ac- 
tivity, and muscle relaxation. It took more than 20 years to discover 
the molecular target of BZs—the GABA, receptor [4]. It is now 
known that the GABA-ergic system has ionotropic (GABA,) and 
metabotropic (GABA ,) receptors that are ubiquitously expressed 
and distributed in the CNS [5, 6]. The GABA, receptor (GABA,R) 
is the target of clinically important anxiolytics, including BZs [7]. 
A role for GABA,R in anxiety and stress has been found preclinically 
but has not (yet) led to clinical applications [7, 8]. 

GABA,Rs are ligand-gated ion channels that mediate fast inhibi- 
tory effects on many post-synaptic sites and are ubiquitously present 
in the CNS, although there is a specific and differential subunit dis- 
tribution. When the endogenous ligand gamma-aminobutyric acid 
(GABA) binds to its receptor, chloride ions flow into the neuron, 
causing hyperpolarization of the cell membrane and inhibiting cell 
firing. GABA,Rs occur synaptically and extra-synaptically and 
display large molecular heterogeneity made up via a variable sub- 
unit composition. This heterogeneity in composition determines 
the physiological and pharmacological properties and contrib- 
utes to the flexibility in signal transduction and modulation [7]. 
GABA,Rs are always composed of five subunits, a complex that 
forms a heteropentamer creating a channel permeable to chloride 
ions. The subunits can be made up, although not randomly, from 19 
subunits (al-6; 1-3; y1-3; ô, £, 8, and p1-3). Most GABA, recep- 
tors are built from two a, two f, and one y subunit (Fig. 87.1 from 
[9]). The endogenous neurotransmitter GABA binds to the GABA 
site, which is formed by a and p subunits, whereas the binding site 
for BZs is formed by one of the a subunits (al, a2, a3, or a5) and 
a y subunit (almost exclusively the y2 subunit). GABA, receptors 
with a4 or a6 subunits do not bind to classic BZs. The most frequent 
subtype in the CNS is a pentameric complex, consisting of two a, 
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Fig. 87.1 Schematic picture of the GABA, receptor. (a) A global picture 
of the two a, two B, and one y subunits and the direction of the chloride 
influx. (b) A top view with the subunits arranged around a central pore— 
the chloride channel. The GABA and benzodiazepine binding sites are 
indicated. 


Reproduced from The Open Pharmacology Journal, 4, Vinkers CH, Cryan JF, Olivier B, 
et al., Elucidating GABAA and GABAB Receptor Functions in Anxiety Using the Stress- 
Induced Hyperthermia Paradigm: A Review, pp. 1-14, Copyright (2010), Vinkers et al. 
Reproduced under the Creative Commons Attribution License 4.0 (CC BY-NC 3.0). 


two p, and one y subunit. Approximately 60% of all GABA, recep- 
tors are alB2y2 subunits, 15-20% a2B3y2, 10-15% a3ßny2, and 5% 
a4Bny or a4Bnd, and <5% have a662/3y2 subunits [10]. GABA, re- 
ceptors are the main inhibitory neurons in the CNS, and it is es- 
timated that 20-30% of all neurons in the CNS are of the GABA, 
type. BZs do not open the chloride channel in the absence of GABA. 
Different classes of pharmacological agents act at different sites of 
the GABA,R, including the endogenous agonist GABA and various 
GABA,R agonists (for example, muscimol) or antagonists (for ex- 
ample, bicuculline). Classical BZs (for example, chlordiazepoxide, 
diazepam) bind to the GABA,R BZ modulatory site. Other drug 
classes also bind to the GABA,R, including alcohol, barbiturates, 
and neurosteroids. BZs mediate their action via the modulatory 
binding site that is present on most, but not all, GABA, Rs. The allo- 
steric binding site for BZs is always formed by two a subunits (al, 
a2, a3, or a5), two B subunits, and the y2 subunit. Only if the GABA 
receptor site is activated (by GABA or external receptor agonists) 
may activation of the BZ site modulate the opening of the channel 
(frequency and/or time). Ligands at the BZ binding site are called 
allosteric modulators. They modify the efficacy and/or affinity of 
GABA in positive allosteric modulation (PAM) or negative allosteric 


modulation (NAM) or have neutral effects by stabilizing different 
three-dimensional conformations of the GABA,R complex. 
Selectivity of a ligand for a specific receptor subtype can be obtained 
by affinity and/or efficacy changes that determine the potential 
potency of a ligand [11]. Benzodiazepines have anxiolytic, seda- 
tive, hypnotic, muscle relaxant, and anticonvulsive properties. The 
sedative-hypnotic properties are useful for, for example, insomnia 
treatment but are unwanted side effects if anxiolysis is the primary 
indication. Classic BZs (for example, diazepam) are non-selectively 
activating a1, a2, a3, and a5 subunits, but extensive research, mainly 
in genetically modified mice in which individual GABA,Ra sub- 
units (GABRA) were made insensitive to diazepam binding (by 
histidine-to-arginine point mutations at a conserved residue in the 
al, a2, a3, or a5 subunits, whereas the actions of the endogenous 
ligand GABA remained intact) showed that the different subunits 
represent different functions. It has become clear that the broad pro- 
file of therapeutic effects of BZs is dependent on activation of se- 
lective a -subunits. Strong evidence was gathered that a1 subunits 
are involved in sedative and anterograde amnesia effects mediated 
by diazepam (a1(H101R)mice); a2 point mutations (a2(H101R) 
mice) led to absence of the anxiolytic and diminished muscle re- 
laxant action, but intact anxiolytic effects, and in a3 (a3(H126R) 
mice) and a5 point mutations (a5(105R)mice), diazepam did not 
induce myorelaxation, whereas sedation and anxiolysis were intact. 
Such data strongly suggest a functional differentiation in the GABA, 
receptors, depending on the a subunit composition (for reviews, see 
[12, 13]). 

Classic BZs, still frequently prescribed, have therapeutic activity 
but, inherent to the activation of all relevant a subunits, come with 
built-in side effects. If used as an anxiolytic tool, sedation is one of 
the troubling side effects. Furthermore, upon chronic use, BZs can 
lead to dependency and tolerance and induce abuse liability, limiting 
long-term use [14]. Recent efforts have tried to synthesize new drugs 
that have selectivity and potency for specific a subunits [13], al- 
though relatively selective drugs for the a1 subunit are already in use 
for sedation/hypnotic purposes (zolpidem, zopiclone, (S)-zopiclone, 
and zaleplone). Compounds that selectively activate the a2 subunits 
and have no effects on any other a subunit might constitute an ideal 
non-sedative anxiolytic, although activation of a3 subunits might 
contribute to an anxiolytic profile [15-17]. L-838417, a partial PAM 
at a2-, a3-, and a5-containing GABA, receptors and an antagonist 
at al-containing receptors has a non-sedating anxiolytic profile in 
mice [18, 19] and primates [20]. Development of this compound has 
been stopped due to an unfavourable pharmacokinetic profile [21]. 
TPA023, an a2/a3 PAM, has anxiolytic and no sedative effects in ro- 
dents [22]. TPA023 was evaluated in three phase 2 studies in GAD 
and proved preliminary indications of anxiolytic activity without 
sedation [17]. However, this compound had to be withdrawn due to 
severe preclinical toxicity. A comparable story holds for ocinaplon, 
having a non-sedative, anxiolytic profile in humans but which also 
had to be withdrawn due to hepatotoxicity [23, 24]. Several other 
ligands have been synthesized and tested, mostly restricted to pre- 
clinical phases. It appears possible to make compounds with some 
selectivity for specific a subunits, but in vivo efficacy is extremely 
difficult to design; both PAMs and NAMs have been found, and 
sometimes even mixed PAM/NAM effects on different a subunits 
are present or no selectivity is present in vitro, whereas in vivo, some 
efficacy is found (for example, ocinaplon). MRK-409, an extremely 
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low partial agonist (PAM) at al-, a2-, and a5-containing GABA, 
receptors, but with higher intrinsic activity at a3 subunit GABA, 
receptors, appeared to be anxio-selective in animals but sedative in 
humans, already at low (<10%) receptor occupancy [25]. 

Another class of compounds—the neurosteroids (for example, 
allopregnanolone, alphaxolone)—have modulatory effects (PAM) 
on the GABA,R, comparable to BZs. However, due to several differ- 
ential mechanistic differences, the neurosteroids have not emerged 
as potent anxiolytics [26]. 

One of the unresolved issues around subunit selective GABA- 
ergic compounds is the issue of tolerance and abuse potential. 
Does activation of all a subunit-containing GABA, receptors lead 
to addiction or is that caused by specific a subunits [27]? This is an 
important issue because the development of medications with po- 
tential for misuse may not be possible. There is some evidence that 
activation of al subunits is essential in the addictive properties of 
BZs [14, 28]. However, the processes of tolerance development 
are complex and endpoint-dependent [29]. If the therapeutic ef- 
fects of activation of al-containing GABA, receptors cannot be 
separated from potential addictive side effects, no further devel- 
opment of a1 subunit-specific ligands can be expected. However, 
if addictive properties are not entwined in (chronic) activation 
of the other a2, a3, a5 subunits, new developments in the field 
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of anxiety (and others like cognition and analgesia) might be ex- 
pected [9, 29, 30]. 


The 5-HT,, receptor 


The 5-HT,, receptor has been implicated in anxiety because 5-HT,, 
receptor agonists exert anxiolytic activity in rodent models of anx- 
iety [31, 32], whereas buspirone, a partial 5-HT,, receptor agonist, 
exerts mild anxiolytic effects in human anxiety patients, particularly 
in GAD [33]. Although clinically the development of new 5-HT,, 
receptor agonists for anxiety disorders (for example, ipsapirone, 
gepirone, tandospirone, flesinoxan) failed, the 5-HT,, receptor 
has received considerable interest as a critical target implied in 
anxiety and depression [31-37]. 5-HT,, receptors are G-protein- 
coupled inhibitory receptors expressed in serotonergic neurons as 
auto-receptors and in non-serotonergic neurons as heteroreceptors 
(Fig. 87.2). The somatodendritic 5-HT,, auto-receptor controls 
serotonergic tone via feedback inhibition in concert with 5-HTT, the 
target of the most commonly used anxiolytic/antidepressant SSRIs. 
It has been hypothesized that desensitized 5-HT,, auto-receptors 
delay the onset of action of SSRIs that act by enhancing brain sero- 
tonin levels [38, 39]. 5-HT,, receptors are quite abundantly distrib- 
uted and expressed, although restrictedly present in some brain 
areas. Auto-receptors are mainly, if not only, found in the dorsal 
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Fig. 87.2 Schematic representation of serotonin (5-HT) in the terminal and synapse. G-protein-coupled receptors are located presynaptically [5-HT 
auto-receptor (5-HT,4/)] or post-synaptically (5-HT, 24/5/67 receptors). (1) 5-HT is released from the presynaptic neuron and binds to a heterotrimeric 
G-protein post-synaptic receptor. Heterotrimeric G-protein complexes contain an a, a $, and a y subunit, which, in an inactive state, are bound to 
GDP (2) 5-HT acts on post-synaptic receptors and induces a change in the conformation of the post-synaptic receptor. GDP is phosphorylated to GTP 
and binds to the æ subunit, which subsequently becomes active. The B and y subunits are freed. (3) Extracellular 5-HT is taken up by the SERT into the 
presynaptic neuron. (4) Back in the presynaptic neuron, 5-HT is broken down by MAO to 5-HIAA (occurs also extracellularly) or is being stored in the 
vesicles for future release. MAO, monoamine oxidase; 5-HIAA, 5-hydroxyindole acetic acid; SERT, serotonin transporter. 


Reproduced from The Open Pharmacology Journal, 4, Vinkers CH, Cryan JF, Olivier B, et al., Elucidating GABAA and GABAB Receptor Functions in Anxiety Using the Stress- 
Induced Hyperthermia Paradigm: A Review, pp. 1-14, Copyright (2010), Vinkers et al. Reproduced under the Creative Commons Attribution License 4.0 (CC BY-NC 3.0). 
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and median raphe nuclei, whereas post-synaptic heteroreceptors 
are found in high densities in limbic regions (including the hippo- 
campus) and in the frontal, medial prefrontal, and entorhinal 
cortices. 

Genetic and imaging studies in humans suggest that 5-HT,, re- 
ceptor density or regulation are associated with depression and anx- 
iety, but also with the response to antidepressants [40]. Recently, 
an association was found between a C(-1019)G polymorphism 
(rs6295G/C) in the promoter region of the 5-HT,, receptor gene 
(Htrla) and mood-related variables, including amygdala reactivity 
[41] and depression [42]. Enrichment with the G-allele is associ- 
ated with enhanced raphe (presynaptic) 5-HT,, auto-receptor ex- 
pression, whereas post-synaptically, the reverse occurs with the 
5-HT,, heteroreceptor [42]. A clear hypothesis of how such changes 
contribute to an anxious/depressed phenotype has, however, not 
emerged yet. More polymorphisms in the Hirla gene are known, 
but no clear data on their influence on anxiety are present [43]. 

Studies in rodents found that 5-HT,, receptors modulate anx- 
iety and depression. Three independent research groups generated 
5-HT,, receptor knockout (KO) mice [44-46], and all KO strains 
displayed enhanced anxiety in standard anxiety paradigms. The 
unanimous finding that inactivation of the 5-HT,, receptor gene 
(both auto- and heteroreceptor) resulted in an anxious pheno- 
type in all three lines was rather striking. The picture became even 
more blurred when the anxious phenotype appeared dependent 
on the paradigm used [47]. Moreover, in one of the strains (Swiss- 
Webster), the 5-HT,, receptor KO mouse displayed reduced sensi- 
tivity to the anxiolytic and sedative effects of diazepam, an a subunit 
non-selective GABA,-positive allosteric modulator [48, 49], at- 
tributed to changes in some a subunits of the GABA,-BZ receptor 
complex. However, this BZ insensitivity was not present in the KO 
mice in both other genetic backgrounds [49, 50]. Apparently, dys- 
function of the GABA,-BZ system is not a prerequisite for the 
‘anxiogenic phenotype of the 5-HT,, KO mouse model. Along with 
the anxiogenic phenotype in the 5-HT,, receptor null mouse, which 
was unresponsive to SSRIs [51], it appeared that overexpression of 
the 5-HT,, receptor reduced anxiety [52]. Rescue experiments of 
forebrain 5-HT,, receptors showed that post-synaptic 5-HT,, re- 
ceptors are critical in the development of the anxiogenic phenotype 
in the null mutations [53]. The latter authors showed that transgenic 
developmental overexpression of 5-HT,, receptors in the rostral 
brain in the early stages of life (not adulthood) was sufficient to re- 
store normal anxiety levels. Pharmacological blockade of 5-HT, 
receptors in early development, but not in adulthood, appeared 
sufficient to enhance anxious behaviour in normal (wildtype) mice 
[9, 54]. The complex regulation of anxiety (and depression) pro- 
cesses during development and adulthood illustrates the complexity 
of the neural substrate, including genetic regulation of anxiety and 
its pathology, and makes clear that straightforward and simple rela- 
tionships between the function of a certain receptor and anxiety are 
not very likely. 

By genetic modification, Richardson-Jones et al. [55] manipulated 
the level of presynaptic 5-HT,, auto-receptors at adult age, without 
changing post-synaptic 5-HT,, heteroreceptors. Mice with higher 
(1A-high) or lower (1A-low) levels of auto-receptors were tested on 
their vulnerability to stress and response to antidepressants. The in- 
vestigators suggested that 1A-low mice showed an enhanced 5-HT 
tone and decreased depression-like behaviour and still responded 


to an SSRI, whereas 1A-high mice had a decreased 5-HT tone and 
more depressed-like behaviour and were unresponsive to SSRIs. 
In their modelling, they suggested that 1A-lows reflect human C/ 
C, whereas 1A-highs model G/G carriers of the Htrla C(-1019)G 
polymorphism. Such genetic mouse models are extremely useful 
in studying the underlying processes emerging in depression and 
anxiety disorders. Further molecular biological studies into the role 
of 5-HT,, receptors implicate, in particular, 5-HT,, post-synaptic 
receptors in the prefrontal cortex in the expression of anxiety [56]. 
In the prefrontal cortex, those heteroreceptors are expressed on two 
antagonistically acting neuronal populations—excitatory pyramidal 
neurons and inhibitory interneurons. These populations play an im- 
portant role in the expression of anxiety over the lifespan and may be 
generating new targets for future therapeutic approaches. 


The serotonin transporter (5-HTT) 


The 5-HTT has, for a long time, been implicated in the processes 
underlying mood and anxiety and its associate disorders mainly be- 
cause the SSRI antidepressants block the serotonin uptake into the 
neuron thereby increasing the serotonergic output (Fig. 87.2). As 
outlined in the introduction, polymorphisms in the 5-HTT gene and 
its associated transcriptional control region have influence on the 
functioning of the serotonergic system [57]. Although we restrict 
ourselves here to anxiety disorders, it is noteworthy to realize that 
several human behavioural traits and whole-body medical disorders 
(for example, myocardial infarction, pulmonary hypertension, irrit- 
able bowel syndrome, and sudden infant death syndrome) are as- 
sociated with variations in the 5-HTT gene (SLC6A4), indicating 
the importance of serotonin neurotransmission in all aspects of our 
biology [3]. Reduced 5-HTT expression and function associated 
with the S-allele of the 5-HTTLPR is associated with anxiety and de- 
pression personality traits [58, 59]. In the ensuing decade, it became 
clear that 5-HTTPRL and other variations in coding and non-coding 
regions of the 5-HTT gene play a wide role in neuropsychiatric dis- 
orders, including bipolar disorders, depression, anxiety disorders 
[also obsessive-compulsive disorder (OCD)], suicide, eating dis- 
orders, substance abuse disorders, autism, attention-deficit/hyper- 
activity disorder (ADHD), and neurodegenerative disorders (for 
reviews, see [3, 60]). Human 5-HTT maps to chromosome 17q11.2, 
has 14 exons that span around 40 kB, and produces a 630-amino acid 
protein with 12 membrane domains. The expression of the gene in 
humans is modulated by a variation in the length of the 5-HTTLPR, 
together with two SNPs in this region (rs25531 and rs25332), all lo- 
cated upstream of the transcription start. Moreover, variable num- 
bers of tandem repeat (VNTR) polymorphisms are known in intron 
2, as well as several SNPs that influence the structure of the 5-HTT 
protein. In particular, the rs2551 polymorphism is quite frequent in 
the population and interacts with 5-HTTLPR in 5-HTT transcrip- 
tion [3]. This makes the modulation of serotonergic transmission 
via the 5-HTT mechanism highly complex and gives probably an 
important insight in the factors that play a role in the genetic com- 
plexity of any psychiatric disorder. The identification of specific gene 
variants remains a tricky avenue. It becomes increasingly clear that 
such variations influence intermediate biological phenotypes, in 
concert with other (background) genes, epigenetic variation, and 
environmental and developmental factors, and this complex inter- 
action contributes to risk or resilience to develop a psychiatric con- 
dition. One avenue to pursue would be to try to find associations 
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between specific candidate genes (for example, the 5-HTT gene) 
and intermediate phenotypes mediating between a moderating 
allele and a more complex disease phenotype [61]. Studies on the 
5-HTTLPR gene are now under way that explicitly test gene x en- 
vironment and gene x gene interaction. One study [62] found that 
a significant interaction between maternal anxiety during gestation 
and subsequent levels of infant negative emotionality at 6 months 
of age was modulated by 5-HTTLPR of the child. S-allele-carrying 
children showed high levels of negative emotionality, with mothers 
showing high maternal anxiety during pregnancy, whereas no such 
association was found in L-allele-carrying infants. In a BOLD-fMRI 
study, healthy women were threatened with unpredictable electric 
shocks of uncertain intensity, and activation was measured in sev- 
eral cortico-limbic networks upon threat anticipation [63]. During 
stress exposure, neural systems enhancing fear and arousal modu- 
lated attention towards threat and SS-individuals appeared particu- 
larly sensitive towards the stress, suggesting that such a mechanism 
may underlie the risk for psychopathology. In a different study, 
deductive reasoning appeared also dependent on the 5-HTTPRL 
genotype. Apparently, differences in the functioning of the 5-HTT 
renders some individuals more vulnerable to emotional factors, 
thereby generating a deleterious effect on rational reasoning [64]. 
In another study, an interaction between the 5-HTTLPR (measure 
in LL-variants) and an oxytocin receptor variant (TT variant of 
the SNP rs2268498) that influences the number of oxytocin recep- 
tors was found in individual differences in negative emotionality, 
indicating a gene x gene interaction [65]. Such data indicate that 
serotonergic and oxytocinergic neurotransmission processes are 
somewhere entwined and seem to play a role in affective disorders. 
In general, S-alleles of the 5-HTTLPR are associated with increased 
risk for a variety of psychiatric disorders, including anxiety. The S- 
allele is considered a ‘risk or ‘vulnerability’ allele [66], whereas the 
function of the L-allele is far less clear, although this allele has been 
suggested as a potential risk factor for the development of psycho- 
pathic traits too [67]. Because every human has either the L or the 
S allele, or both, and most people do not suffer from psychiatric ab- 
normalities, it must be assumed that the genome includes several 
‘protective’ alleles that make many individuals resilient to stress and 
pathology. Such protective genes have been suggested, for example 
the CRF1-receptor variants that have been associated with protec- 
tion from the extreme stresses of maltreatment during childhood 
[68] and protective emotional resilience-enhancing effects of the 
L-allele in students [69]. Belsky and colleagues [70] suggested that 
S-allele carriers are more vulnerable in general, not only negatively, 
but also positively. Thus, ‘vulnerability genes’ or ‘risk alleles’ seem to 
make individuals more susceptible to environmental influences— 
for better and for worse. Homberg and Lesch [71] postulated a 
similar picture of the functions of the allelic variants in 5-HTTLPR. 
They did not portray the S-allele as a ‘risk allele and the L-allele as a 
‘resilient’ allele but took the hypothesis that S-carriers (both in hu- 
mans and in non-human primates) perform better in cognitive tasks 
than L-carriers. They argue for a switch from a deficit-orientated 
connotation of 5-HTTLPR variants to a cognitive superiority of S- 
allele carriers (which have enhanced reactivity of the cortico-limbic 
neural circuitry) and that environmental conditions will determine 
whether a positive (cognitive) or negative (emotional) response 
will happen. Part of their support comes from studies in genetic- 
ally modified mice and rats (heterozygote and homozygote) 5-HTT 


KOs. 5-HTT^ rodents show brain and behavioural phenotypes ra- 
ther similar to S-allele carriers such as increased emotionality, im- 
proved cognition, and increased sensitivity to psychosocial factors 
(reviewed in [72]). This seems present without environmental in- 
fluence in the homozygote KOs, but in the heterozygotes, early or 
later life stress is additionally needed to show the same phenotypes 
[3, 72, 73]. 


Genetic background of anxiety disorders 


Genes certainly contribute to the risk for developing anxiety dis- 
orders. An update of the current findings is given in Chapters 80 and 
89. No ‘candidate’ genes have emerged that play a straightforward 
role in the expression for vulnerability to anxiety or anxiety disorders 
[74]. This is probably due to the fact that several or more risk fac- 
tors have individually small effects. Among those neurotransmitter 
systems which have provided targets to develop anxiolytic drugs 
are serotonin (5-HT), noradrenaline (NA), glutamate, dopamine, 
GABA, RGS2, and neuropeptides (corticotrophin-releasing factor, 
neuropeptide Y, brain-derived neurotrophic factor). Regulator of G- 
protein signalling proteins (RGS) are key modulators of G-protein- 
coupled receptor signalling. RGS2 modulates anxiety both in mice 
[75] and in humans [76]. In mice, a quantitative trait locus (QTL) on 
chromosome 1 was associated with anxiety-related phenotype [77]; 
the principal quantitative trait gene for this linkage signal appeared 
to be RGS2 [78]. 

In conclusion, it has proved a challenge to find loci involved in 
anxiety disorders, which is likely due to a complex and polygenic 
genetic background. Probably many genes influence the risk for 
developing anxiety disorders, each of them with a small effect. 
Moreover, epistatic processes, having the ability to mask the pheno- 
type derived from other genes, are also very likely to be involved, 
whereas environmental factors induce complex gene-environment 
(G x E) interactions. 

Another strategy has been to study endophenotypes or inter- 
mediate phenotypes in man. Such familial or heritable traits are 
assumed first to underlie relevant disorders and second, more con- 
troversially, to be more susceptible to genetic analysis. The amyg- 
dala shows enhanced activity in phobias and PTSD, compared to 
healthy individuals [79]. The function of the amygdala in processing 
emotion and the mechanisms underlying fear conditioning have 
provided important paradigms for imaging studies in man (see 
Chapter 90). The link with genetics has proved more challenging. 
There was great interest when the 5-HTTLPR polymorphism was 
associated with amygdala reactivity [80, 81], implicating the S-allele 
in the increased amygdala reactivity towards external stimuli [82]. 
This kind of finding in small samples has proved difficult to replicate 
(see Chapters 80 and 89). 


Preclinical genetic approaches to fear and anxiety 





Animal models of anxiety and fear have greatly enhanced our 
knowledge in the neurobiological mechanisms underlying fear 
and anxiety. Anxiety and fear represent the most translatable pro- 
cesses from animal to human. Indeed, animal pathology appears to 
resemble human pathology to a certain (but varying) degree [83]. 
Such models can be powerful in dissecting putative genes in anxiety 
and anxiety-associated traits [84, 85]. Apart from large-scale mouse 
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Fig. 87.3 Traditionally, novelty-induced exploratory behaviour in the open field and the elevated plus maze are assessed to study anxiety-related 
behaviours in rodent species. In these representative data examples, video-tracking summary plots indicate the location of the mouse during a short 
session of open field exploration (a, b) or of exploration of an elevated plus maze (c, d). Different inbred strains of mice showed robust behavioural 
differences in these paradigms, as can be visualized in the data samples given. For example, C57BL/6) mice showed high levels of movement 
throughout the open field (a) and the elevated plus maze (c). In contrast, A/) mice showed lower levels of movement and stay close to the wall of the 
open field (b) and preferred the closed arms of the elevated plus maze (d) [closed arms are indicated by dark black lines (c, d)]. Based on the avoidance 
of the centre of the open field and the open arms of the elevated plus maze, A/J mice are considered to express higher levels of anxiety-related 


behaviour, when compared to C57BL/6) mice. 


crossings for genetic mapping, leading to, for example, localization 
and identification of one gene rgs2, which influences anxious behav- 
iours [76, 78, 86], gene-targeting approaches can be applied in ro- 
dents (Fig. 87.3). 

Apart from classical searching for an animal model of anxiety, one 
can also try to use cross-species trait genetics in combination with 
an endophenotype approach [87, 88]. Using chromosome substitu- 
tion strains, in which a single chromosome from one inbred strain 
(donor) is substituted with that of another inbred strain (host), 
makes it possible to locate a certain parameter or phenotype (such 
as avoidance behaviour) on a locus of a particular chromosome. Via 
further genetic crossings, the location can be refined and this may 
lead to candidate genes [89, 90]. By integrating genetic data from the 
mouse and from a large human GWAS on bipolar disorder, novel 
candidate genes for bipolar disorder were unravelled [91] (Fig. 87.4). 

In the next section, some examples will be given of established tar- 
gets involved in anxiety. A target can be defined as a neurobiological 
or cellular mechanism that, upon systematic manipulation, causes 
correlated effects on circuit function (in this case, anxiety), as well as 
behavioural/cognitive processes [92]. Because genomic technology 
advances rapidly, linkage between targets and neuronal circuitry 


and genetic factors involved in anxiety disorders are becoming in- 
creasingly elucidated. Fundamental research aimed at these targets 
may contribute to unravelling novel insights in anxiety processes 
and consequently engender new opportunities for drug discovery. 

Although the 5-HT,, receptor, 5-HT'T, and GABA, receptor com- 
plex belong to the most known and discussed targets in the field, 
human and animal research still continues to find new mechan- 
isms around these targets that opens new possibilities to apply in 
animal models and human psychopathology. The future needs a 
strict translational approach; data found in human (anxiety) re- 
search, including genetic and environmental factors, should be used 
to formulate scientific approaches in animals, and vice versa. In ani- 
mals, we have the opportunity to apply cell-specific inducible KOs or 
knock-ins. Moreover, new optogenetic technology enables selective 
manipulation of cellular mechanisms and circuit functions linked to 
the gene’s suggested function [93]. 


Integration 


As genetic studies advance, validated animal anxiety models with 
translational validity are increasingly needed to study the biological 
role of genes found and their possible contribution to anxiety. In 
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Fig. 87.4 Behavioural screening of a chromosome substitution strain (CSS) panel revealed a locus for avoidance behaviour in an automated home 


cage environment on mouse chromosome 15. 
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The automated home cage environment is equipped with a home base shelter (in which mice mainly 
, and two feeding platforms—one feeding platform exposed to the environment, and one allowing 


sheltered feeding (c). The PhenoTyper’ top unit contains an infra-red camera and infra-red LED lights, allowing continuous recording, independent 

of lighting conditions in the test room. C57BL/6) and CSS15 females; feeding duration on the two platforms on 3 days of testing. CSS15 females 
showed an increasing preference for the sheltered platform over consecutive days (d).° Exposed vs sheltered feeding platform per day; P <0.05. 
Representative track samples during automated baseline behavioural registration, showing a reduction in visitation of the exposed feeding platform for 
CSS15 (f), compared to C57BL/6 (e). Further genetic fine mapping of the locus on mouse chromosome 15 revealed a homologous human locus on 








chromosome 8 previously associated with bipolar disorders. 
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search for animal paradigms with high translational value, we exten- 
sively explored a readout parameter of autonomic nervous system 
activation—stress-induced hyperthermia (SIH). SIH occurs in 
animals and man and depends on stress intensity, and individual 
differences in response to stress are dependent on genetic and en- 
vironmental factors [94, 95]. In humans, autonomic stress responses 
correlate with perceived stress levels. Although the SIH response 
does not specifically model any psychiatric disease, SIH may be very 
useful as a readout parameter of stress or an anxiety-like response. 
It can be studied at preclinical and clinical levels, using different 
and varied stressful experimental conditions. It is highly suitable 
to detect genetic effects and anxiolytic properties of drugs, for ex- 
ample in 5-HT,, receptor KO mice [50, 94, 96]. The SIH response 





can be reduced using GABA, receptor agonists and 5-HT,, receptor 
agonists [95, 97]. Baseline body temperature and circadian rhythmi- 
city were comparable, but SIH was lower in 5-HTT KO than in the 
corresponding wildtypes. Pharmacological studies revealed that 5- 
HT), receptors modulating SIH belong to a population of receptors 
that differ from those involved in hypothermia [98]. Later research 
found also changed sensitivity in dopamine and NA receptors in 5- 
HTT KOrats [99]. It might be postulated that the 5-HTT KO rat con- 
stitutes an animal model of certain psychiatric disorders, including 
anxiety disorders. A preliminary study [100] tried to find a genetic 
basis for the sensitivity of various chromosome substitution mouse 
strains [101] for diazepam (a non-subunit selective GABA, receptor 
agonist) and flesinoxan, a highly selective 5-HT,, receptor agonist 
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[102], using the SIH paradigm as experimental paradigm. Using 
the SIH paradigm, eight chromosomal substitution strains (CSS) 
on a C57BL6/J (host) x A/J (donor) background were screened to 
localize chromosomes involved in GABA, and 5-HT,, receptor sen- 
sitivity. Preliminary data indicated that some strains showed clear 
differences in either diazepam or flesinoxan sensitivity. Further re- 
search is ongoing to unravel underlying putative genes. 

GABA-ergic and serotonergic mechanisms play an important role 
in SIH, and rodent and human studies into the underlying brain 
mechanisms indicate high translational comparability. Mouse gen- 
etic mapping studies (by using chromosomal substitution strains) 
indicated genetic loci involved in the pharmacological mechanisms 
behind the SIH processes. If such loci and genes can be localized, 
human counterparts have to be sought in the quest for novel candi- 
date genes involved in anxiety and its disorders. 

Besides SIH, other highly valuable translational animal paradigms 
also have been developed. For instance, the startle reflex method- 
ology can translate animal research into human experimentations, 
and vice versa. Especially the combination of the startle reflex meth- 
odology with Pavlovian aversive conditioning makes it a powerful 
procedure to develop cross-species studies [103]. The startle reflex 
is a response to an intense and unexpected stimulus. In animals, 
this reflex can be measured by assessing the whole body reflex. In 
humans, the eyeblink reflex is used to measure the amplitude and 
latency of a startle reflex. Davis and colleagues [104] extensively in- 
vestigated the fear-potentiated startle, which refers to the increase 
(potentiation) of the startle reflex during a state of fear. This state 
of fear is induced by the anticipation of an aversive stimulus (for 
example, a shock). Animals and humans display similar effects, that 
is, the amplitude of the startle reflex is greater in the presence of con- 
ditioned stimuli than in their absence [104-107]. The advantage of 
the fear-potentiated startle procedure is that the startle in animals 
and humans are potentiated by similar procedures. Thus, findings 
in humans can be replicated in animals, and vice versa, increasing 
the translational value of animal models of human anxiety. When 
a new gene is discovered by means of, for example, GWAS, an 
animal model can be genetically manipulated and used to study the 
underlying mechanisms or to investigate new potential anxiolytics. 


Translational studies into anxiety 


Can data on the involvement of serotonin in anxiety and anxiety 
disorders (here illustrated with the 5-HTT and the 5-HT,, receptor) 
be used to design translational research that possibly will generate 
new hypotheses and targets for anxiolytic therapeutics? Recently, 
Jasinska et al. [108] formulated a hypothesis around the involvement 
of the 5-HTT gene, stress, and raphe-raphe interactions in order to 
try to explain the risk of depression as a result of gene-environment 
interactions between the 5-HTT gene and stress. Different popula- 
tions of serotonergic neurons in the dorsal raphe (DR) nucleus exist 
that differentially contribute to the response to stress. Although the 
authors hypothesize this mechanism mainly for depression, there is 
no a priori reason why anxiety disorders would not be mediated by 
this or a similar mechanism. The authors propose that the G x E 
interaction of the 5-HTTPRL and stress depends on the genetically 
produced variability in serotonin reuptake during stressor-induced 
raphe-raphe interactions and alters the balance in the amygdala- 
ventromedial prefrontal cortex—DR circuitry that underlie reactivity 
to stressors and regulation of emotion. In LL-individuals with an 


efficient 5-HT transport, the circuitry is able to normalize, but not so 
in SS-individuals, potentially leading to abnormal activity and path- 
ology. Whether such a mechanism also acts in human pathology is 
as yet unresolved but could lead to specific searches for new mech- 
anisms causing pathological anxiety. Next to different functional 
serotonergic populations in the DR, serotonin transporters appear 
very dynamically regulated [109] and undergo regulated membrane 
trafficking, as well as transitions between low and high activity states, 
with many signalling pathways involved. Moreover, 5-HTT exhibits 
dynamic associations with cytoskeletal binding proteins; actually, 
Chang et al. [110] found two pools of 5-HTT proteins on the sur- 
face of serotonergic cells, with one relatively with free diffusion and 
the other with restricted mobility due to binding to the cytoskel- 
eton. Whether the serotonergic system exerts this kind of extremely 
variability, which might lead to new and better understanding of the 
role of the 5-HTT complex, including its genetic variability, is still a 
matter of the future, but it remains fully possible that new mechan- 
isms involved in anxiety and its disorders might emerge. 


Conclusions 


The discovery and development of novel anxiolytic drugs are 
hampering. The existing anxiolytics have been around for decades, 
and no subsequent breakthrough has come through. Especially, the 
lack of innovative and effective novel drug targets is disappointing. 
The reasons for this lack of progress are not completely clear but 
may be due to the complex regulatory and financial regulations in 
the finding of new ‘druggable’ targets (we do not go into many other 
financial, outsourcing, human, and marketing reasons (see, for ex- 
ample, [111]). 

However, central to the continued paucity of new targets for 
anxiolytic drugs is our limited knowledge of the mechanisms and 
pathology underlying the various anxiety disorders. We simply 
do not know what is wrong in the brain of pathologically anxious 
people and whether the DSM-based anxiety disorders constitute 
independent entities. Therefore, we have to invest in fundamental 
research in the neurobiological mechanisms involved in anxiety 
processes in healthy people and look for what is different in diseased 
brains. Because intrusive investigations of the human brain are often 
not (or never) possible, animal research is inevitable and can con- 
tribute considerably to find the neural substrates for anxiety and its 
pathology, although it is not realistic to think that such knowledge is 
100% translatable to the human situation. Any strategy to find new 
targets to modulate aspects of anxiety (and hopefully its pathology) 
will use genetic and genomic approaches. Our initial hope was that, 
after elucidation of the human genome, we would be able to pinpoint 
certain genes as the perpetrator causing the problem. However, it 
soon became clear that that is an illusion. Notwithstanding a high 
heritability in many of the anxiety disorders, no single gene or a 
small number of genes have emerged from a large number of studies 
on large cohorts of patients thus far. It has become increasingly clear 
that anxiety disorders, similar probably to the fundamental mech- 
anisms involved in anxiety processes, are caused by action at many 
hundreds of genes with very complex interactions, including both 
environmental factors and gene x gene interactions. It is difficult to 
expect that one single gene contributes sufficient influence to modu- 
late anxiety processes to a large degree, and no specific drug will 
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probably be developed for such a target. In the present contribu- 
tion, three (druggable) targets have been presented, with all three 
somewhere involved in the (genetic) pathways of anxiety processes 
(5-HT,, receptor, 5-HTT, and GABA, receptor). Although it is not 
known how these three targets are involved in ‘normal or ‘patho- 
logical’ anxiety in the brain, we know at least that they are involved 
in several aspects of anxiety and may aid in our further under- 
standing of the underlying mechanisms. Whether they represent 
real direct targets that are changed under pathological conditions 
or only as primary entrance to influence underlying mechanisms is 
not clear. In the case of serotonin modulation (via 5-HT,, receptor 
activation or blockade of the serotonin transporter), an indirect 
effect is possibly the most logical explanation, because treatment 
of anxiety disorders with SSRIs or buspirone takes weeks, or even 
months, before anxiolytic activity is seen. This points to an induc- 
tion of mechanisms that slowly change and need time to become 
effective (plasticity changes). Anxiolytic effects after activation of 
GABA, receptors seem acute and might point to a primary mech- 
anism directly involved in anxiety-regulating mechanisms. Close 
collaboration between fundamental academic research into anxiety 
mechanisms in the brain and drug discovery research and develop- 
ment might lead to breakthroughs in our search for new and better 
anxiolytics. Clearly, animal models will play an important role in 
future anxiolytic drug development, but only as a component of a 
broad and integrated multi-disciplinary approach. To be successful, 
preclinical researchers need to ensure that novel clinical insights 
into the aetiology of anxiety disorders are integrated in the design 
and use of animal models and tests. 
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For an understanding of anxiety disorders, it is helpful to distin- 
guish anxiety and fear as different, though overlapping and related, 
constructs. Anxiety is a basic emotion and is typically adaptive, and 
not pathological. It can be described as a future oriented mood state 
associated with the anticipation of future threat and possibly its 
avoidance. Fear is the organism's response to present/imminent real 
or perceived threat and is associated with autonomic arousal and 
escape [1]. Clinically relevant anxiety disorders differ descriptively 
from non-pathological forms of fear and anxiety by being consider- 
ably more persistent and excessive, causing significant distress and 
impairment [2]. It is assumed that various conditioned dysfunc- 
tions in the regulation of the complex neurobiological and psycho- 
logical fear/anxiety circuits are responsible for pathological anxiety 
[3]. In childhood and adolescence, anxiety often manifests as part 
of typical development, which can render identification and diag- 
nosis of pathological anxiety difficult. However, it is particularly this 
sensitive phase of life when initially transient and circumscribed 
neurobiological, cognitive, affective, and behavioural dysfunctions 
may generalize and escalate, leading to psychopathologic forms of 
anxiety associated with distress, impairment, and negative develop- 
mental outcomes [3]. 

Anxiety disorders represent a phenotypically heterogenous class 
of disorders. Although the individual anxiety disorders share sev- 
eral clinical diagnostic features (that is, excessive levels of fear or 
anxiety, physical symptoms, and avoidance behaviour resulting in 
distress and/or impairment [1, 2, 4]), the specific anxiety disorders 
(such as separation anxiety disorder, panic disorder, agoraphobia, 
social anxiety disorder, generalized anxiety disorder, and the spe- 
cific phobia subtypes animal, natural environment, blood injection 
injury, situational, and ‘other’) also differ from each other signifi- 
cantly in terms of their typical focus of fear and anxiety, form, type, 
and severity of the psychophysiological fear response and the asso- 
ciated cognitions, as well their onset and patterns of course and the 
clinical and neurobiological correlates, as well as risk factors [1, 5- 
7]. Moreover, the way fear and anxiety are expressed across different 





developmental periods and stages may differ in the various anxiety 
disorders [8] and may hinder diagnosis [3]. For example, particu- 
larly in children, anxiety disorders may manifest in non-specific 
somatic symptoms or anxiety-related symptoms such as crying, 
irritability, or tantrums, often resulting in misinterpretation by 
adults as oppositional or disobedience [9]. However, no separate 
sets of criteria or classifications exist for children and adults in the 
most recent international diagnostic classification system DSM- 
5 [2] (Table 88.1). Even separation anxiety disorder and selective 
mutism—traditionally regarded as childhood anxiety disorders— 
were moved from the previous DSM-IV category of ‘disorders usu- 
ally first diagnosed in infancy, childhood or adolescence’ to the 
anxiety disorder class. Yet, in order to account for differences in 
the symptom expression of children and adults, the individual diag- 
nostic criteria sets in DSM-5 provide additional notes to guide the 
clinicians in the diagnostic process [2]. Some noteworthy differ- 
ences between children and adults are: different symptom presen- 
tations (for example, clinging or tantrums in children with social 
anxiety disorder), duration (for example, 4 weeks in children and 
6 months in adults in separation anxiety disorder), symptom type 
or count (for example, one physical symptom in children and at 
least three in adults to diagnose generalized anxiety disorder), and 
level of insight into the excessiveness/inadequacy of fear (children 
may lack insight). 

This chapter summarizes current knowledge on the epidemiology 
of anxiety disorders across the lifespan, with a particular focus on 
childhood and adolescence, given the usually early onset of anxiety 
disorders. It describes the prevalence, onset, course, persistence, 
comorbidity, and outcome, as well as correlates and risk factors of 
separation anxiety disorder, specific phobia (and the various sub- 
types), social anxiety disorder, agoraphobia, panic disorder, and 
generalized anxiety disorder. We do not address selective mutism be- 
cause of a lack of data in epidemiological studies, and post-traumatic 
stress disorder and obsessive-compulsive disorder—despite being 
closely associated with pathological anxiety—have now been moved 
from anxiety disorders into separate diagnostic groups in DSM-5 
(see Sections 13 and 15). 
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Table 88.1 Anxiety disorders in DSM-IV and DSM-5, highlighting different symptomatology in youth 


Disorders usually first 


diagnosed in infancy, 
childhood, or adolescence 


309.21 


3323 


Separation anxiety 
disorder 


Selective mutism 


Anxiety disorders 


309.21 


31225 


Separation anxiety 
disorder 


Selective mutism 


Different criteria in children (vs adults) 
Information on childhood anxieties as highlighted in DSM text portion 





A5: Children may be unable to stay in or go to a room by themselves. They may also display 
‘clinging’ behaviour, staying near or ‘shadowing’ the parent around the house, or needing someone 
to be with them when switching to a different room in the house. 

A6: Children often have difficulty during bedtime and may insist that someone stays with them 
till they fall asleep. In addition, they may make their way to their caretakers bed (or that of a 
significant other, ex, sibling). Children may refuse or be reluctant to go to camp, attend sleepovers, 
or run errands. 

A8: Headaches, abdominal complaints, nausea, and vomiting are common in children. 
Cardiovascular symptoms, such as palpitations, dizziness, and feeling faint, are less common in 
younger children. However, they may occur in adolescents and adults. 

B: Duration is at least 4 weeks in children and adolescents and 6 months or longer in adults. 
Younger children are reluctant to go or may avoid going to school altogether. They may not 
express worries or fears about threats to their home, parents, or themselves, because the anxiety 
manifests only when separation takes place. As they age, they often worry about specific dangers 
(kidnappings, accidents, muggings, and death) or will have vague concerns about not being 
reunited with their attachment figures 


In children, when encountering individuals in social interations, they do not initiate speech or 
reciprocally respond when spoken to by others. In addition, they will speak in their homes in the 
presence of immediate family, but not too often in front of extended family or even close friends. 
They often refuse to speak at school and sometimes use non-verbal means (grunting, pointing, 
writing) to communicate. They may be more willing and eager to engage in social encounters 
when speech is not required or necessary. 

C: In children, duration of disturbance is not limited to the first month of school 


Anxiety disorders 


300.01 


300.21 


300.22 


300.23 


300.29 


300.02 


300.00 


Panic disorder 
without 
agoraphobia 


Panic disorder 
with agoraphohia 


Agoraphobia 
without history of 
panic disorder 


Social phobia 


(Specify 
if: generalized) 


Specific phobia 


(Specify 

type: animal, 
natural 
environment, blood 
injection injury, 
situational, other) 
Generalized 
anxiety disorder 


Anxiety disorders 
not otherwise 
specified 


300.01 


300.22 


300.23 


300.29 


300.02 


300.00 


300.09 


Panic disorder 


Agoraphobia 


Social anxiety 
disorder 


(Specify 
if: performance only) 


Specific phobia 


(Specify if: animal, 
natural environment, 
blood injection injury, 
situational, other) 


Generalized anxiety 
disorder 


Unspecified anxiety 
disorder 


Other specfied 
anxiety disorder 


B: Adolescents may be less worried about additional panic attacks than young adults 


B: ‘Other incapacitating or embarrassing symptoms’ in children include a sense of disorientation 
and getting lost 


A: Social anxiety must occur in peer settings, not just in interactions with adults 


C: In children, the anxiety may be expressed by crying, tantrums, freezing, clinging, shrinking, 
or failing to speak in social situations 


F Duration of disturbance is at least 6 months, for that aids in distinguishing the disorder from 
transient social fears that are common, particularly among children. Adolescents display a broader 
pattern of fear and avoidance, compared to younger children. Older adults display lower levels of 
anxiety across a broader range of situations, whereas younger adults express high levels of social 
anxiety for specific situations 


A: In children, the fear or anxiety may be expressed by crying, tantrums, freezing, or clinging 


When a diagnosis of this disorder is being considered, it is important to evaluate the degree of 
impairment and the duration of fear, anxiety, or avoidance, and the typicality of such behaviour, 
relative to the child’ particular developmental stage. Fear and anxiety are often expressed 
differently between children and adults. In children, the C criterion (which recognizes that fear is 
excessive/unreasonable) may be absent. Children are typically unable to understand the concept 
of avoidance 


C: In children, one, instead of three, out of six symptoms is required 


In children and adolescents, the anxieties and worries often concern the quality of their 
performance or competence at school or in sporting events, even when their performance is not 
being evaluated by others. It is possible for children with this disorder to worry about catastrophic 
events (such as earthquakes or nuclear war), in addition to mundane habits such as punctuality. 
Children may be perfectionists, redoing tasks due to dissatisfaction with their own performance. 
They are, more often than not, eager to seek approval and require a substantial amount of 
reassurance about their performance and their worries 


Table 88.1 Continued 


Obsessive-compulsive and related disorders 


300.3 Obsessive- 300.3 Obsessive- 
compulsive compulsive disorder 
disorder 
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Trauma- and stressor-related disorders 


308.3 Acute stress 308.3 Acute stress disorder 
disorder 
309.81 Post-traumatic 309.81 Post-traumatic stress 


stress disorder disorder 


Source: data from DSM-IV-TR: Diagnostic and Statistical Manual of Mental Disorders, 4th Revised edition (DSM-IV-TR), Copyright (1994), American Psychiatric Association; Diagnostic 
and statistical manual of mental disorders, Sth edition (DSM-5), Copyright (2013) American Psychiatric Association. 


Prevalence of anxiety disorders 


Cross-sectional community surveys conducted in many countries 
around the globe [10-14] have shown the prevalence of anxiety dis- 
orders to be the highest among all classes of mental disorders (that is, 
mood, substance use, impulse-control, somatoform, and eating dis- 
orders) in most countries, with estimates of up to 30% for lifetime and 
14% for the past 12 months. The estimated lifetime risk up to age 75 
is even higher (up to 41% in the United States [15]). In addition, pro- 
spective longitudinal studies that examine mental disorders in the same 
study cohort at multiple points in time provide consistent evidence that 
the cumulative rates go beyond 30% when entering early adulthood 
[16-18]. Thus, anxiety disorders are already the most common class of 
mental disorders in childhood and adolescence [8, 19-21]. 

Cross-sectionally, the most frequent anxiety disorders in adulthood 
are specific phobias, followed in frequency by social anxiety disorder, 
with 12-month prevalence estimates of up to 10% and 8%, respect- 
ively (13, 15]. In children, in addition to specific phobias and social 
anxiety disorder, separation anxiety disorder is highly prevalent, with 
quite variable 12-month prevalence estimates across studies, ranging 
from 2.8% and 8% [8]. It is noteworthy that the National Comorbidity 
Survey Adolescent Supplement (NCS-A) found that separation anx- 
iety disorder had a low current period prevalence (12 months: 1.6%; 
30 days: 0.6%) [22] but displayed a high lifetime prevalence (7.6%) 
[19]. This might indicate that this early childhood anxiety condition 
rarely persists into adolescence and adulthood. Consistent with these 
data, adult surveys have found a relatively low 12-month prevalence 
of separation anxiety disorder in adulthood of 1-1.5% [23, 24]. 

The less prevalent anxiety conditions across all ages are general- 
ized anxiety disorder, panic disorder, and agoraphobia. These three 
anxiety disorders reveal low prevalence estimates in childhood (1% 
or lower); somewhat higher estimates are found in adolescence (2- 
3% for panic disorder and 3-4% for agoraphobia and generalized 
anxiety disorder) (for comprehensive reviews, see [6, 8]). 


Onset of anxiety disorders 


There is abundant epidemiological evidence that the developmental 
period of childhood, adolescence, and young adulthood constitutes 
the core high-risk phase for the onset of anxiety disorders [5, 25-27]. 


Noteworthy heterogeneity, however, exists in the specific age of onset 
patterns for the various types of anxiety disorders. The bars in Fig. 88.1 
show the cumulative age of onset distribution for specific anxiety dis- 
orders by gender, as derived from data from the Early Developmental 
Stages of Psychopathology (EDSP) study [18, 28]. Separation anx- 
iety disorder and specific phobias are the earliest forms of anxiety 
disorders, revealing an emergence of 50% of all onsets prior to the 
age of 8 years; almost all affected cases have emerged by the age of 
12. Social anxiety disorder begins to develop later in childhood, 
showing a steep increase in incidence in early adolescence. Although 
agoraphobia, panic disorder, and generalized anxiety disorder reveal 
some onset cases in childhood, their core onset risk period is in later 
adolescence and early adulthood. Despite differences in prevalence 
estimates for females and males (the former being more commonly 
affected than the latter), no remarkable gender differences in the age 
of onset patterns are observed. Slightly earlier onsets are seen for spe- 
cific phobias in males, which are due to an earlier onset of the nat- 
ural environmental subtype of specific phobias in males [8], and for 
generalized anxiety disorder in females. Similar age of onset findings 
have emerged in other epidemiologic studies conducted in both adult 
and adolescent samples [15, 21, 26, 27]. 





Natural course and outcome of anxiety disorders 


Given that current 12-month period prevalence estimates are not 
considerably lower than the lifetime estimates for most anxiety dis- 
orders, we can assume that anxiety disorders typically have a per- 
sistent course, either in terms of chronic symptoms or in terms 
of a waxing and waning or frequent recurrent course [8]. For ex- 
ample, the 12-month/lifetime prevalence ratios in the National 
Comorbidity Surveys in the United States are higher for anxiety dis- 
orders (any: 0.7) than for depressive disorders (0.5), with variations 
across the anxiety disorders (0.5 for generalized anxiety disorder to 
0.8 for specific phobia). The prevalence ratio is highest in adoles- 
cents (0.8), as compared to adults (0.6) and the elderly (0.5) [15]. 
However, this decrease is not uniform across the anxiety disorders, 
with generalized anxiety disorder (0.4-0.5) and specific phobia 
(0.7-0.8) showing similar ratios across all age groups [15]. 
Persistence estimates for anxiety disorders from cross-sectional 
studies may overestimate the chronicity or recurrence of anxiety 
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Fig. 88.1 Cumulative incidence and age of onset distribution of anxiety disorders by gender. The EDSP study is a prospective longitudinal study in a 
representative sample of adolescents and young adults aged 14-24 years from the general population (Munich, Germany; N = 3021). Anxiety and other 
mental disorders were assessed using the standardized and computerized composite international diagnostic interview (DIA-X/M-CIDI), according to 
DSM-IV criteria, on a lifetime basis at baseline and for the interval period since the last assessment at 1-, 4-, and 8-year follow-up, respectively. Bars 

in the figure show the cumulative age of onset distribution; lines show the age-dependent cumulative lifetime incidence; in phobias, impairment was 
required among subjects aged 18 years or older; * Separation anxiety disorder was only assessed once at 1-year follow-up in subjects of the younger 
age cohort (aged 14-17 years at baseline). 

Reproduced from Child Adolesc Psychiatr Clin N Am., 21(3), Beesdo-Baum K, Knappe S, Developmental Epidemiology of Anxiety Disorders, pp 457-78, Copyright (2012), with 
permission from Elsevier Inc. 





problems in the general population, given the potential memory and 
recall bias in lifetime symptom reports and the age of onset/recency 
information [8]. Prospective longitudinal study approaches are able 
to provide a more reliable picture of the natural course and outcome 
of anxiety disorders (Fig. 88.2). However, given the considerable ef- 
forts and financial resources required for such studies, prospective 
course data are still scarce, particularly for longer time scales of 
10 years or more. The data available indicate that the stability or 
persistence is probably less pronounced than expected, based on 
the estimates of cross-sectional studies, at least when focusing on 
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full-threshold diagnosis (diagnostic stability). For example, in the 
15-year prospective Zurich Cohort study [29], a low stability rate 
of 4% was found for pure (non-comorbid) anxiety disorders (gen- 
eralized anxiety or panic disorder). For children and adolescents 
with anxiety disorders, Last and colleagues [30, 31] reported that 
80% remitted from the anxiety shown initially, over a 3- to 4-year 
follow-up. The prospective longitudinal EDSP study among ado- 
lescents and young adults found that 2 years after the baseline in- 
vestigation, 19.7% of adolescents with baseline anxiety disorder 
met the threshold criteria again at follow-up, with the probability 


Prospective longitudinal studies 


Comparison of diagnostic status assessed at two (or more) 
assessment waves 
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Fig. 88.2 Approaches to the natural course assessment of anxiety disorders in cross-sectional and prospective longitudinal epidemiological 
community-based studies. Cross-sectional studies most frequently utilize retrospective age of onset and age of recency reports to estimate the 























persistence of anxiety, as indicated by the duration of a condition in years. This approach assumes the continuous presence of a disorder, neglecting 
waxing and waning in symptom severity, including symptom-free intervals, and may thus overestimate the stability (or persistence/chronicity). Another 
indirect indicator of disorder chronicity is the 12-month to lifetime prevalence ratio. The higher the proportion of 12-month cases among lifetime 
cases, the higher the assumed chronicity of a condition. Because only categorical diagnoses are considered here (no symptomatic improvements 
below the diagnostic threshold), this may lead to underestimation of chronicity. On the other hand, as lifetime occurrence prior to the past 12 months 
may have not been reported or remembered by study participants, this ratio may also overestimate the chronicity. Thus, cross-sectional studies only 
allow for crude estimations of the natural course of anxiety disorders. Longitudinal studies, in contrast, allow for a more realistic and comprehensive 
course description as ‘true persistence’, recurrence, progression, improvement, and remission can be mapped using the diagnostic information on a 
hreshold and subthreshold level available from two (or more) assessment waves months to many years apart. 





Adapted from Psychiatr Clin of North Am., 32(3), Beesdo K, Knappe S, Pine DS, Anxiety and anxiety disorders in children and adolescents: Developmental issues and 











implications for DSM-V, pp 483-524, Copyright (2009), with permission from Elsevier Inc. 
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of a negative outcome increasing as a function of baseline anxiety 
severity [32]. Considerable variability in stability rates by type of 
anxiety diagnosis have been observed. For example, when taking 
threshold and subthreshold anxiety diagnoses from both baseline 
and follow-up of the EDSP study into account, panic disorder (44%) 
and specific phobia (30.1%) were found to be the most stable; other 
anxiety disorders showed lower stability rates, particularly agora- 
phobia (13.4%) and social phobia (15.8%) [32]. Of course, stability 
rates at least slightly increase when following up individuals who 
have been affected for longer periods of time. There is then a higher 
chance for recurrence to have occurred. For social anxiety dis- 
order in the EDSP study, for example, 15.5% of baseline cases met 
threshold criteria again at least once during any of the three follow- 
up assessments conducted over a time period of 10 years, and a fur- 
ther 19.7% had at least subthreshold social anxiety disorder [33]. 

To summarize, these findings seem to indicate that anxiety dis- 
orders in the community are typically not chronic, in the sense that 
the symptoms are consistently stable above the diagnostic threshold. 
Instead, the symptoms typically wax and wane over time around the 
diagnostic threshold, with spontaneous complete remissions being 
the rare exceptions (Fig. 88.3) [32]. 

Itis also a consistent finding from prospective studies on the course 
of anxiety that there is a considerable degree of homotypic continuity. 
This is despite the proportion of youth diagnosed again with the 





Risk Factors and 
Antecedents 

* Genetic Predisposition 
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same anxiety disorder or symptoms thereof being numerically ra- 
ther low to moderate (for comprehensive reviews of studies, see [8, 
34, 35]). Individuals diagnosed with an anxiety disorder, compared 
to those without, are at a statistically higher risk of having the same 
disorder (for example, [36-38]) or symptoms of the same disorder 
[32, 33] at later points in time (homotypic prediction) [39]. Moreover, 
the limited homotypic continuity of anxiety disorders observed in 
prospective longitudinal community studies does not mean that 
anxiety disorders completely remit (Fig. 88.3). In the EDSP study, 
only 10% of children and adolescents with (pure or comorbid) spe- 
cific phobia at baseline had no mental disorder at the 10-year follow- 
up (full diagnostic remission); 41% reported specific phobia again 
(strict homotypic continuity), and overall 73% were diagnosed with 
any anxiety and/or depressive disorder at subsequent assessments 
(heterotypic continuity) [40]. Similarly, only 13% of baseline social 
anxiety disorder cases were free of any mental disorder during the 
10-year follow-up, while the remaining 87% reported having other 
types of anxiety disorders or depressive disorders. All of the base- 
line generalized anxiety disorder cases revealed either homotypic or 
heterotypic continuity. Other multi-wave, prospective longitudinal 
studies revealed similar findings [29, 31, 37, 41]. Thus, after onset 
of the first (that is, pure) anxiety disorder emerges in childhood or 
early adolescence, a pattern with continuous and frequently variable 
anxiety develops by later adolescence or early adulthood [6, 42]. 
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Fig. 88.3 The evolution of anxiety disorders. 
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Adapted from Compr Psychiatry, 41(2, suppl. 1), Wittchen H-U, Lieb R, Pfister H, et al., The waxing and waning of mental disorders: evaluating the stability of syndromes of 
mental disorders in the population, pp. 122-32, Copyright (2000), with permission from Elsevier Inc. 


Complications and development 


of comorbid disorders 





A frequent outcome of primary anxiety disorders is the develop- 
ment of a cascade of other psychopathological complications [4]. 
Anxiety disorders have been consistently shown to contribute not 
only to the development of further anxiety disorders, but also to de- 
pressive disorders, substance use disorders, and suicidality, as well 
as to other adverse developmental outcomes (such as educational 
underachievement, early parenthood, and financial disadvantage 
[25, 43]). Thus, for anxiety cases in which homotypic continuity is 
not observed, there is a substantial degree of heterotypic continuity 
in both cross-sectional and longitudinal epidemiological studies 
[39]. The longitudinal associations between anxiety and sub- 
sequent depression are particularly strong [5, 21, 31, 43-47]. 
Prospective epidemiological studies investigating particular anx- 
iety diagnoses found that children and adolescents with specific 
fears and phobias (especially fear of darkness) [48], social anx- 
iety disorder [49-51], and other types of anxiety disorders, such 
as agoraphobia, panic disorder, and generalized anxiety disorder 
[5, 38, 51], are all at significantly increased risk for subsequent 
depression. 

There is also consistent epidemiologic evidence that certain clin- 
ical characteristics of anxiety disorders, such as greater severity 
(multiple anxiety diagnoses or more severe impairment of indi- 
vidual anxiety disorders), greater impairment or persistence, and 
co-occurring panic attacks, are associated with an increased risk 
of depression [49, 51], suggesting that anxiety disorders may play a 
causal role in depression development [52]. 

Substance abuse or dependence is also a frequent heterotypic 
problem outcome of primary anxiety disorders [43, 45, 53-56], 
providing some evidence that alcohol or other drug use is pos- 
sibly motivated by self-medication for pathological anxiety 
symptoms [57]. 


Correlates and risk factors 


A range of sociodemographic, individual, and environmental fac- 
tors assessed have been found to be associated with anxiety dis- 
orders in epidemiological studies. We will describe these factors as 
correlates or risk factors, not speculating whether they should be 
considered as distal or proximal vulnerability factors or risk factors 
in a strict sense. This is because cross-sectional studies predominate 
and identify correlates; prospective longitudinal studies are required 
to evidence risk factor status [58], either for the first onset of an anx- 
iety disorder or for an adverse course after the anxiety disorder has 
initially developed (chronicity/persistence/progression/recurrence 
vs improvement/remission) (Fig. 88.3). Course predictors have 
been less extensively studied, compared to risk factors for first onset; 
yet both appear crucial, and at least partly different factors could 
emerge relevant for prevention and early intervention. Moreover, 
while the correlates overlap, risk factors and course predictors for 
the individual anxiety disorders or for anxiety vs depressive or other 
comorbid disorders may be different. They may be implicated in the 
heterogeneity of the various disorder types and classes in relation to 
the phenomenology, onset, and course. 
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Sociodemographic factors 


Female gender is the most consistent risk factor for the develop- 
ment of each of the specific anxiety disorders (for a review, see 
[8]). Gender differences are usually small in childhood and grow 
with age, reaching a ratio of approximately 2:1 to 3:1 for females vs 
males in late adolescence. The recent NCS-A found a higher preva- 
lence of anxiety disorders among non-Hispanic black, compared 
to non-Hispanic white, adolescents, as well as among respondents 
with divorced or separated parents, compared to married or co- 
habiting parents [19]. Anxiety disorders are frequently also found 
among individuals with lower, compared to higher, education (for 
example, [59]). In NCS-A, low parental education was also shown 
to be associated with adolescent anxiety disorders. Low household 
income or precarious financial situations have been shown to be 
associated with anxiety disorders (for example, [59]), and there 
is evidence of adverse financial outcomes in young adulthood 
among children and adolescents with anxiety disorders, particu- 
larly generalized anxiety disorder/overanxious disorder [25]. As 
sociodemographic risks co-occur, Copeland and colleagues [60] 
identified a moderate-risk class containing low parental education 
and poverty being associated with offspring anxiety disorders, as 
well as with depressive disorders, but not with externalizing dis- 
orders. After taking into account comorbidity and multiple other 
risk factors, Shanahan et al. [61] found that parental unemploy- 
ment was associated with anxiety and behavioural disorders, but 
not with depressive disorders. 


Familial aggregation 


Anxiety disorders consistently ‘run in families, with strong asso- 
ciations between parental and offspring anxiety diagnoses [5, 52, 
62-65]. A particularly strong risk is present for offspring with two 
affected parents [62, 65] and for offspring of severely affected parents 
as indicated by impairment, number of anxiety disorders, and early 
onset of anxiety disorders [66]. However, not only parental anx- 
iety disorders increase the risk for anxiety in offspring, but parental 
depression was also found to be associated with offspring anxiety 
[67, 68]. Investigations into the specificity of the familial aggrega- 
tion of anxiety and other mental disorders revealed mixed findings, 
with some evidence for specificity [5, 69, 70]. Furthermore, some 
specificity appears to exist in the familial aggregation of certain anx- 
iety disorders [5, 71]. The exact mechanism of ‘familial transmission’ 
of anxiety from parents to offspring is complex and still far from 
being well understood. Biology in terms of genetic or physiological 
(intrauterine) factors, the family environment in terms of living 
with an anxious parent (for example, modelling, anxious rearing, 
insecure attachment), or both (additive risks or gene-environment 
interactions) are implicated [72-74]. Twin studies indicate that the 
genetic liability for specific anxiety disorders, as well as for depres- 
sion and anxiety, may overlap with different environmental risks, re- 
sulting in phenotypically diverse outcomes [74-76]. For more detail 
on genetic factors, see Chapters 80 and 89. 


Temperament/personality 


Personality traits such as neuroticism, trait anxiety, or negative 
affect, as well as temperamental styles often investigated in epi- 
demiological studies in terms of behavioural inhibition are likely 
overlapping constructs that can be viewed as precursor conditions 
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to anxiety disorders, with some indications of specificity between 
anxiety and depression outcomes [5, 77-83]. 


Environmental factors 


Adverse life events and conditions in childhood and adolescence, 
as well as unfavourable parental rearing styles, have been linked in 
many epidemiologic studies to the development of anxiety disorders 
in offspring. Indeed, loss of a parent by death, separation, or divorce, 
as well as physical or sexual abuse or neglect in childhood, have been 
associated with almost all mental disorders, including anxiety dis- 
orders (for example, [70, 84-86]). In the EDSP study data, there was 
nevertheless some indication that childhood separation events were 
specifically associated with pure anxiety, as well as with comorbid 
anxiety and depression, but not with pure depression [5]. Other 
negative life events have also not just been evidenced as risk factors 
for depression [87, 88], but also for anxiety disorders [89]. In par- 
ticular, threat events tend to precede anxiety disorders (or comorbid 
anxiety and depressive disorders), whereas loss events tend to pre- 
cede depression [61, 90-92]. 

Parenting styles have also been linked in epidemiologic studies 
with offspring anxiety disorders. For example, in the EDSP study, 
parental overprotection and parental rejection were associated with 
increased prevalence of social anxiety disorder in offspring [63, 71]. 
Overprotection was found to increase the risk for anxiety disorders, 
but not for pure depressive disorders, while parental rejection was 
linked with depressive disorder [5]. After adjusting for comorbidity 
and other putative risk factors, Shanahan et al. [61] found harsh 
discipline to be specifically associated with generalized anxiety dis- 
order or overanxious disorder (among a range of poor emotional 
and behavioural outcomes); overintrusive parenting was revealed as 
a non-specific risk factor for anxiety and behavioural disorders. 


Course predictors 


Sociodemographic, individual, and environmental factors have 
been less frequently examined as course predictors for anxiety 
disorders, particularly when based on prospective longitudinal 
data [35]. Contrasting to correlates and risk factors for onset, 
sociodemographic factors are not consistently associated with 
an adverse course of anxiety disorders. In a cross-sectional study 
among adolescents in the United States, for example, female gender 
was the only demographic variable found to be associated with a 
higher persistence of most anxiety disorders, while race/ethnicity, 
family income, or family composition were not consistently impli- 
cated [22]. Symptom severity and duration of illness are the best 
prognostic indicators for anxiety in both epidemiologic and clin- 
ical studies [8]. For example, the prospective longitudinal EDSP 
study revealed several clinical features, such as early age of onset, 
the generalized subtype, a high number of catastrophic anxiety cog- 
nitions, severe avoidance, and impairment, as predictors for higher 
persistence, as well as prospectively assessed diagnostic stability for 
social anxiety disorder [33]. Comorbidity with other disorders was 
not found to predict the course of social phobia, with the exception 
of co-occurring panic attacks that were associated with both higher 
persistence and diagnostic stability of social anxiety. Established 
vulnerability characteristics for the onset of anxiety conditions, such 
as parental social anxiety disorder and depression, as well as a be- 
havioural inhibited temperament, were related to poor prognosis. 
Thus, besides gender and clinical diagnostic information, familial 


and temperamental characteristics may inform estimates of prog- 
nosis for anxiety disorders and appear useful to target treatment 
interventions. 





Summary and implications 


Anxiety disorders are frequent clinical conditions with an early 
onset in life, associated with considerable developmental, psycho- 
social, and psychopathological complications. Early anxiety syn- 
dromes in children may remit spontaneously, but most children 
and adolescents with an anxiety disorder continue to suffer from 
anxiety or other psychopathological problems for the major part of 
their subsequent life. This persistent course is causally related to a 
tremendous personal, economical, and societal burden (11, 93, 94]. 
Recent EU-wide calculations rank anxiety disorders as the third 
most burdensome group of disorders [3]. Although effective cog- 
nitive behavioural and pharmacological treatments exist for anx- 
iety disorders and related conditions, the vast majority of anxiety 
disorders typically never receive any appropriate treatment and the 
majority remains undiagnosed and untreated, even in the most ad- 
vantaged countries. Evidence suggests that treatment only occurs 
after many years when additional psychopathological complications 
like depression have developed. The degree of often lifelong non- 
recognition, underdiagnosis, and undertreatment has been related 
to the tremendous burden caused by anxiety in the United States, 
the EU, and worldwide [11]. Given that available treatments are 
highly effective and cost-efficient, timely and earlier treatment has 
the potential of substantially reducing the burden (see Chapter 92). 
Moreover, given the early onset of anxiety disorders, the burden of 
anxiety on the population level might be substantially reduced by 
concerted action, with targeted preventive interventions in children 
and adolescents, before the disorder becomes persistent and before 
comorbid complications arise (see Chapter 91). In order to work 
towards these goals, considerably more basic research is needed to 
identify the most powerful and specific risk factors for the first onset, 
as well as poor course of anxiety disorders, and to delineate the com- 
plex underlying psychological, behavioural, and biological mechan- 
isms and interactions [3, 95-97]. Focusing on ‘optimal outcomes’ of 
anxiety problems experienced by children and adolescence is a core 
societal challenge [98]. 
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The present chapter provides a review of the current state of genetic 
studies across the spectrum of anxiety disorders, including specific 
phobias (SP), social anxiety disorder (SAD, also referred to as social 
phobia), agoraphobia (AP), panic disorder (PD), and generalized 
anxiety disorder (GAD), with a special focus on genome-wide asso- 
ciation studies (GWAS), candidate gene studies, gene-environment 
studies, and recent advances in epigenetics. Given the interrelated- 
ness of mental disorders in general and of anxiety disorders in 
particular, we will expand upon the genetics of intermediate pheno- 
types, crossing classic categorical disorder definitions, and give a 
critical assessment of related upcoming clinical challenges and fu- 
ture perspectives. 











Anxiety disorders show significant familial aggregation, with a 
meta-analysis of epidemiological family studies calculating substan- 
tial summary odds ratios for SP and SAD (4.1), PD (5.0), and GAD 
(6.1) in first-degree relatives [1]. Heritability of PD, as determined 
by a meta-analysis of twin studies, was 0.43 and of GAD 0.32, with 
mainly non-shared environmental influences contributing to the re- 
maining variance in liability for PD [1]. GAD appeared to display a 
significant influence of shared environmental influences in females 
[1] (Table 89.1). Heritability for phobic disorders has been estimated 
to range between 0.43 and 0.63 [1] (Table 89.1), and factor analysis 
suggested one common genetic factor underlying SP and one shared 
factor underlying SAD and AP [2]. However, the influence of gen- 
etic factors on anxiety disorders has been suggested to decrease from 
childhood towards adolescence, while the influence of shared envir- 
onmental factors has been reported to increase [3]. This could be 
explained by an increased direct environmental transmission from 
parents to their adolescent offspring, in line with developmental the- 
ories of fear-anxiety learning [4]. 

Other studies have reached similar heritability estimates, although 
very rarely have they corrected for the unreliability of a single life- 
time assessment that is most commonly used in these investigations. 








When accounting for variability in the recall of unreasonable fears 
and interviewer bias towards which fears constitute phobias, twin 
resemblance due to genetic factors of phobias was determined to be 
0.46-0.59 for SP, 0.51 for SAD, and 0.67 for AP [5]. This hints at a 
universal underestimation of the true genetic contribution at least in 
phobic disorders [5]. Unsurprisingly, both family studies and twin 
studies in the anxiety spectrum have acknowledged that the influ- 
ence of genes respects neither hierarchic diagnostic disease bound- 
aries nor the threshold between rational and irrational pathological 
anxiety or fear. Comparing genetic correlations from twin studies 
among SAD, AG, PD, and GAD, the lowest correlation of pairwise 
comparisons with 0.65 was found to exist between SAD and AP, 
while PD and GAD reached an almost perfect correlation coefficient 
of 0.98 [6]. This position is further supported by the fact that when 
twin data of SP, SAD, PD, AP, and GAD were combined in one single 
analysis, the latent liability to all anxiety disorders was more herit- 
able than for any individual disorder [7]. 

In summary, the classic genetic studies of anxiety disorders sug- 
gest the potential for success in identifying the genes involved. The 
current landscape includes interesting findings from structural and 
rare variant studies, genome-wide association studies (GWAS), and 
candidate gene studies. 


A genome-wide copy number variation (CNV) association study 
in 535 PD cases and 1520 control subjects yielded Bonferroni- 
corrected significant common duplications in the pericentromeric 
region of 16p11.2, including genes like the immunoglobulin heavy 
locus (IGH) and solute carrier family 6 member 8 (SLC6A8, cre- 
atine transporter 1) [8]. Furthermore, a meta-analysis of PD linkage 
studies amounting to over 700 analysed subjects reported significant 
bins on chromosomes 1, 5, 15, 16, and 22 [9]. 

The follow-up with GWAS or next-generation sequencing studies 
appears to be a promising avenue for future rare variant research ef- 
forts. For example, the transmembrane protein 132D (TMEM132D) 
locus, a gene previously associated with PD and anxiety severity (see 
GWAS, candidate gene, gene-environment interaction, and ‘therapy 
genetics’ studies, p. 929), has a 40-kb region spanning all exons 
now fully sequenced in a cohort of 300 anxiety disorder patients, 
including 262 PD, compared with 300 healthy controls [10]. A total 
of 371 variants were identified, most of which (76%) had a minor 


Table 89.1 Heritability of anxiety disorders 


SP NA - - 
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NA 


0.46-0.59 0.41-0.53 
SAD* 0.51 NA = = 0.48 NA 
AP? 0.67 NA = = 0.33 NA 
PD? 0.43 0222053 = = 0.57 0.47-0.68 
GAD? 0.32 0.24-0.39 0.17 0.03-0.29 0.51 0.41-0.64 


Heritability estimates not adding up to 1.0 are derived from rounding up/down effects. 


SP, specific phobias; SAD, social anxiety disorder; AP, agoraphobia; PD, panic disorder; GAD, generalized anxiety disorder; Cl, confidence interval; a’, additive genetic factors; c?, shared 
environmental factors; e?, unique environmental factors; NA, information not available in the respective study. 


a [5]. 
b [1]. 


allele frequency of under 5%, with an overrepresentation of func- 
tional coding variants in control subjects, suggesting that putatively 
protective TMEM132D variants decreased the risk to be affected by 
an anxiety disorder [10]. 


GWAS, candidate gene, gene-environment interaction, 
and ‘therapy genetics’ studies 


Within the anxiety disorder spectrum, PD has received the most 
attention in GWAS. Initially, two single-nucleotide polymorphisms 
(SNPs) within TMEM132D (rs7309727 T allele, rs11060369 A al- 
lele) were discovered as top hits in independent and combined 
GWAS approaches in three samples of patients with PD or PD 
and AP diagnoses (totalling 909 cases and 915 controls) [11]. The 
variants were associated with the severity of anxiety symptoms in 
patients affected by PD or panic attacks, and post-mortem studies 
revealed an increased expression of TMEM132D mRNA in the 
frontal cortex [11]. Subsequently, these results were replicated in a 
meta-analysis combining cases with primary PD (patients without 
major psychiatric comorbidities, 1039 cases and 2411 controls), 
which confirmed the rs7309727 T risk allele and a combined T/A 
rs7309727-rs11060369 risk genotype [12]. An independent meta- 
analysis further supported the PD ‘risk allele status of the rs7309727 
T allele (2630 cases, 3294 controls; and 1867 cases, 2527 controls, 
respectively) and the rs11060369 C allele (2609 cases, 3279 con- 
trols; and 1852 cases, 2516 controls, respectively) in samples of pri- 
mary PD and European PD [13]. 

Based on the assumption that different diagnostic definitions re- 
flect alternative expressions of a common underlying anxiety diath- 
esis, a recent meta-analysis of GWAS totalling over 18,000 unrelated 
individuals followed a dual approach by comparing cases (SP, SAD, 
PD, AP, and GAD, combined into one group of subjects with any 
anxiety disorder) to controls and employing, as a phenotypic score, 
an overall latent anxiety disorder factor [14]. The case-control de- 
sign revealed a so far uncharacterized non-coding RNA (ncRNA) 
locus on chromosome 3q12.3 (LOC152225), with the rs1709393 T 
allele as a ‘protective’ factor against any anxiety disorder diagnosis, 
while analysis of the quantitative factor score highlighted a linkage 
disequilibrium block on chromosome 2p21 spanning three genes 
coding for the neutral and basic amino acid transport protein rBAT 
(SLC3A1), the prolyl endopeptidase-like protein (PREPL), and the 
calmodulin-lysine N-methyltransferase (CAMKMT), exceeding 
genome-wide significance, with the CAMKMT rs1067327 minor G 
allele as the top hit ‘risk’ SNP [14]. 


Candidate gene association studies of anxiety disorders have been 
based on the mechanism of action of anxiolytic drugs, preclinical 
models, or GWAS results for categorical or dimensional anxiety dis- 
order traits. Within anxiety spectrum conditions, studies of common 
genetic variation have mainly investigated the serotonergic system, 
including its transporter and receptors, enzymes related to the syn- 
thesis or degradation of catecholamines and other neurotransmit- 
ters, or components of neuropeptide and hormonal systems. 

The implication of the serotonin transporter (SLC6A4, chromo- 
some 17q11.2) polymorphic region (5-HTTLPR) in animal models 
of anxiety [15] sparked interest in the evaluation of genes related 
to serotonergic neurotransmission in the human anxiety spectrum. 
The more active (higher transcription rate) long allele (1) has been 
associated with lower anxiety-related personality traits and impli- 
cated as a protective factor against the development of a depressive 
episode due to an individual's experienced amount of lifetime stress 
[16]. Conversely the short (s) allele was claimed to be associated 
with increased anxiety-related traits (neuroticism and harm avoid- 
ance) and vulnerability for an affective disorder diagnosis [15, 16]. 
The less active forms of the 5-HTTLPR have been associated with 
a variety of clinical phenotypes in anxiety disorders, for example 
increased blushing propensity in SAD patients [17], increased and 
more persistent activation of the amygdala during an auditory ha- 
bituation paradigm [18], and increased amygdala activation during 
exposure to happy faces [19] (for a review of imaging genetic effects 
on amygdala responsiveness in anxiety disorders, see [20]). 

Unfortunately, several studies have not shown a direct association 
between 5-HTTLPR variation and categorical PD (see, for example, 
[21]), and a meta-analysis of candidate gene studies failed to find 
supporting evidence for a significant effect [22]. Somewhat similarly, 
in a therapy-genetic approach, the 5-HTTLPR genotype was suc- 
cessfully included, alongside other factors like gender and comorbid 
mood and externalizing disorders, in a risk index predicting the 
response of children (384 cases) to cognitive behavioural therapy 
(CBT) for primary anxiety disorders [23]. However, replication of 
the association between the short 5-HTTLPR variant and increased 
response to CBT in a cross-cohort meta-analysis (1044 cases) failed 
for primary anxiety disorders, albeit s/s allele carriers were more 
likely to be free of any anxiety disorder post-CBT treatment [24]. 
Positive results have been reported in pharmacogenetic studies fo- 
cusing on selective serotonin reuptake inhibitors (SSRIs), with the 1 
variant predicting an increased reduction of SAD symptoms [25] and 
a significant reduction of panic attacks in female PD patients [26]. 
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A recent systematic literature review of neurobiological markers of 
treatment response in anxiety disorders supported the role of a func- 
tional polymorphism within the l variant of the polymorphic region 
(5-HTTLPR/rs25531; the minor G allele renders the | variant trans- 
porter expression levels more closely to the ‘risk s variant [27]) in 
the prediction of therapy outcome [28]. The ultimate status of the 
5-HTTLPR gene as a successful candidate remains to be definitively 
established in larger studies or by meta-analysis of smaller studies 
with sufficiently similar designs. 

The G/G genotype of the serotonin receptor 1A gene (HTRIA, 
chromosome 5q12.3) rs6295 polymorphism (also referred to as - 
1019C/G, due to its position in the transcriptional control region) has 
been associated with increased escape behaviour during an avoid- 
ance task prior to CBT in patients suffering from PD with AP, as well 
as increased amygdala activity towards threat and safety cues and re- 
duced self-initiated exposure to aversive stimuli following CBT [29]. 
Correspondingly, the G allele predicted a significant persistence of 
panic attacks in PD patients treated with SSRIs [30]. Nonetheless, 
the pharmacogenetic study was relatively underpowered. 

Monoamine oxidase A (MAOA) is responsible for the oxidative 
deamination of serotonin, noradrenaline, dopamine, and xeno- 
biotic amines; its gene on chromosome Xp11.3 has a functional 
polymorphism in the promoter region. The relevant location of the 
variation is commonly referred to as the upstream variable number 
of tandem repeats (uVNTR); it can take several forms generally 
distinguished in short (low transcriptional activity) or long alleles 
(high transcriptional activity), the latter of which has already been 
shown to occur significantly more frequently in female PD pa- 
tients than in corresponding control samples [31]. Meta-analytical 
evaluation (1115 cases, 1260 controls) of candidate gene studies ex- 
ploring the MAOA uVNTR validated a female-specific allelic associ- 
ation between the long, high-expression gene variants and PD [32]. 
Remarkably, a controlled and randomized multi-centre analysis 
(283 cases) discovered that the long alleles of the MAOA uVNTR 
resulted in significantly worse outcome measurements on dimen- 
sional scales of anxiety, following CBT, in patients suffering from PD 
with AP [33]. Inferior therapy outcome among long allele PD car- 
riers, as compared to short allele carriers, was also accompanied by 
increased heart rate and reported fear during a behavioural avoid- 
ance task involving agoraphobic conditions and impaired habitu- 
ation to the task upon repeated exposure [33]. Also, the T allele of 
the T941G polymorphism in the MAOA gene sequence has been 
possibly linked to GAD [34]. 

The catechol-O-methyltransferase (COMT, chromosome 
22q11.21) is essential for the degradation of catecholamine neuro- 
transmitters via the S-adenosyl methionine-coupled methylation of 
ahydroxyl group. It displays a SNP at base pair position 472 replacing 
a G with an A (rs4680, commonly referred to as the vall158met poly- 
morphism), causing a 3- to 4-fold lower overall COMT activity [35]. 
Initial association analysis revealed the G allele (that is, the valine/ 
high-activity variant) to be overrepresented in female PD patients 
[36]. A recent meta-analysis (1130 cases, 1783 controls), including 
Asian and European-Caucasian samples, confirmed the ‘risk’ status 
of the rs4680 COMT G allele in European PD samples, regardless of 
gender [13]. Interestingly, the 158met/met genotype has been sug- 
gested to profit less from exposure-based CBT for PD, as compared 
to carriers of at least one val allele, who reported higher symptom in- 
tensity prior to CBT [37]. Additionally, while the COMT val variant 


has been linked to SP [38], internalizing disorders, including SAD 
and GAD, have been associated with other genetic variation related 
to the catecholaminergic system, including the sodium-dependent 
dopamine transporter (SLC6A3) [39]. 

Given the pivotal role of corticotropin-releasing hormone (CRH) 
in the hypothalamic-pituitary—adrenal (HPA) axis, and therefore as 
a regulator of cortisol release in the stress response, the CRH re- 
ceptors CRHR1 and CRHR2 have been an additional focus in the 
exploration of anxiety disorder genetics, particularly in PD. A joint 
cohort approach genotyping nine CRHR1 (chromosome 17q21.31) 
SNPs in 531 matched PD case-control pairs delivered evidence of 
four SNPs (rs7209436, rs12936181, rs17689918, and rs17689966) to 
be associated with PD in at least one subsample, with the minor A al- 
lele of rs17689918 remaining statistically significant in PD females 
across all cohorts, even following conservative post hoc correction 
[40]. Multi-level evidence suggests the rs17689918 A allele is associ- 
ated with decreased CRHR1I mRNA expression in the frontal cortex 
and amygdala of post-mortem brains, aberrant differential con- 
ditioning in the prefrontal cortex, and impaired safety signal pro- 
cessing in the amygdala in an imaging paradigm [40]. Furthermore, 
the risk allele caused carriers to show less escape tendencies in a 
behavioural avoidance task, yet increased levels of nervous appre- 
hension and anxiety sensitivity [40]. This suggests reduced CRHR1 
expression to be linked to an increased susceptibility to anxious sen- 
sations accompanied by feelings of sustained fear [40]. Additionally, 
an investigation of several anxiety disorder candidate gene effects 
on serotonin-noradrenaline reuptake inhibitor (SNRI) treatment 
response in GAD supported an association of CRHR1 SNPs with 
therapy outcome [41]. 

Among the neuropeptide class and their respective receptors, gen- 
etic variation of neuropeptide S (NPS) and the NPS receptor (NPSR1, 
chromosome 7p14.3) correlates psychopathological processes 
relating to stress, arousal, and fear, and to their mediation in par- 
ticular, with anxiety-related behaviour and anxiety disorders (Fig. 
89.1) [42]. Upon binding of NPS, the G-protein coupled NPSR1 pro- 
motes the mobilization of intracellular calcium stores and thereby 
affects second messenger signalling [43]. In independent, as well as 
joint, analyses of case-control cohorts (766 cases and 776 controls), 
the NPSR1 1s324981 T allele (replacing an asparagine in amino 
acid position 107 with an isoleucine [Asn lle], leading to receptor 
hyper-responsiveness) has been significantly associated with PD 
with and without AP in females in a categorical analysis approach 
and, dimensionally, with increased levels of anxiety sensitivity [44]. 
Evaluations of the NPSR1 rs324981 T allele in PD females demon- 
strated increased heart rates and other symptom measures during a 
behavioural avoidance task and decreased activation of the anterior 
cingulate cortex top-down control following exposure to emotion- 
ally relevant fear stimuli [44]. The status of NPSR1 as a PD ‘risk 
gene has been highlighted recently by a replication of the NPSR1 
rs324981 T allele association with PD in samples of European an- 
cestry in a meta-analysis of candidate gene studies (914 cases, 1028 
controls) [13]. For a review of the NPSR1 gene in mechanisms of 
anxiety, also see [45]. 

A further valuable candidate gene strategy targets mediating 
models of anxiety proneness, for example increased sensitivity 
to carbon dioxide. Hypercapnia reliably elicits a panicogenic ef- 
fect in PD patients. Following linkage evidence for the chromo- 
somal region 12q13 and PD [46], further studies expanded upon a 
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Fig. 89.1 Multi-level evidence linking the neuropeptide S receptor 1 gene (NPSR7) to anxiety disorders and anxiety-related cross-disorder 
intermediate phenotypes. The NPSR7 rs324981 T allele was chosen as seed node. Arrows represent domain interrelations of systems analysis mediating 
interactions between genetic and environmental factors, potentially leading to observable meta-phenomena crossing classic nosological boundaries 
and connecting different spectra of psychopathology. Refer to the individual sections in this chapter for detailed supporting literature. ACC, anterior 
cingulate cortex; AMY, amygdala; CCK-4, cholecystokinin tetrapeptide; CTQ, Childhood Trauma Questionnaire; DLPFC, dorsolateral prefrontal cortex; 
FPC, frontoparietal cortex; LC, locus coeruleus; LTE, list of threatening experiences/lifetime events; OFC, orbitofrontal cortex; PFC, prefrontal cortex. 


potential role of the acid-sensing ion channel 1 (ACCN1, chromo- 
some 12q13.12) in mediating the detection of brain acidosis/ 
hypercarbia and acting as a chemosensor triggering fear-related 
behaviour. ACCN1 SNPs rs685012 (C allele) and rs10875995 (C al- 
lele) have been associated with PD ina case-control analysis, with a 
particularly strong association in either early-onset PD or PD with 
primarily respiratory symptoms [47]. There was also an increased 
amygdala volume and reactivity to emotional stimuli in a cohort 
of healthy subjects [47]. Interestingly, following the recent interest 
in carbon dioxide-induced acidosis related to ACCN1 in patients 
suffering from PD, an investigation in mice established that ex- 
posure to early-life adversity induced separation anxiety-related 
behaviour and hyperventilation upon carbon dioxide inhalation 
[48]. This phenotype was further associated with epigenetic his- 
tone modifications and a gene expression network in the me- 
dulla oblongata, not only confirming increased expression of the 
murine ACCN1 orthologue, but also affecting biological processes 
related to neurodevelopment and emotionality, as well as actual 
drug targets in the form of the mammalian/mechanistic target of 
rapamycin (mTOR) signalling pathway [48]. 


Epigenetic studies 


Given the substantial impact of gene-environment interactions in 
the aetiology, disease course, and treatment of psychiatric disorders 
[49], the rising field of epigenetics has the potential to expand our 


knowledge of systemic biological alterations underlying anxiety- 
related inter-individual differences. Given the implied intrinsic 
mechanistic capacity for plasticity, and therefore reversibility, the 
potential for treatment innovation is particularly exciting. 
Assessments of epigenetic DNA methylation patterns spanning 
the transcription regulatory site and exon 1/intron 1 region of the 
MAOA gene in PD patients and matched controls revealed sig- 
nificantly reduced methylation levels in female PD patients [50]. 
Notably, across patients and controls, the occurrence of recent 
negative life events resulted in relatively decreased MAOA methy- 
lation levels, while positive life events were associated with in- 
creased methylation levels [50]. The potential clinical relevance 
of this dynamic epigenetic state of MAOA methylation was sup- 
ported by the hypomethylation pattern in female PD patients and 
the correlation of decreased methylation levels with increased 
panic attack frequency and heightened agoraphobic avoidance in 
the absence or presence of a trusted person [51]. Exceptionally, 
following CBT, the average methylation of therapy responders in- 
creased to a level indistinguishable from that of healthy controls, 
while methylation of non-responders remained unchanged or 
even declined [51]. Further evidence for an epigenetic role in the 
serotonergic system was provided bya study assessing methylation 
levels upstream of the SLC6A4 promoter region in children with 
a variety of anxiety disorders pre- and post-CBT, with one CpG 
site differentiating responders with increasing methylation, along 
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with treatment success, from non-responders with decreasing 
methylation [52]. 

Oxytocin has repeatedly been implicated in social functioning 
and potentially SAD (for a review of oxytocin’s role in anxiety dis- 
orders, see [53]). Epigenetic screening of a CpG island just down- 
stream of the oxytocin receptor (OXTR, chromosome 3p25.3) exon 
3 translation start site in a cohort of 110 medication-free SAD 
patients and 110 gender-matched controls revealed a decreased 
methylation pattern in SAD patients, with both overall and CpG3 
methylation negatively correlating with SAD symptom intensity 
in patients, as well as dimensional social anxiety in controls [54]. 
OXTR methylation negatively correlated with the cortisol response 
to the Trier Social Stress Test (TSST) in healthy controls and amyg- 
dala activity levels upon exposure to social phobia-related words in 
SAD patients [54]. 

Finally, decreased CRHR1 promoter methylation was associated 
with PD ina case-control design and with heightened anxiety ratings 
in an independent sample of healthy volunteers [54]. A functional 
luciferase assay linked an unmethylated CRHR1 promoter region to 
increased gene expression, implicating C1 hypomethylation and a 
resulting CRHR1 overabundance in a dysfunctional stress response 
as possibly pathogenetically relevant in PD [55]. 





Genetics of cross-disorder anxiety phenotypes 


Family studies and twin studies support the genetic clustering and co- 
heritability of SP, SAD, AP, PD, and GAD, along with post-traumatic 
stress disorder (PTSD) [56-58]. In line with their epidemiological, 
as well as nosological, overlap and their shared genetic background, 
anxiety disorders and trauma- and stress-related disorders repre- 
sent closely interrelated diagnostic constructs. They exist beyond 
a threshold where subsyndromal anxiety and fear responses grade 
into degrees of psychopathology. Accordingly, genetic investiga- 
tions of cross-disorder anxiety and post-traumatic stress pheno- 
types have explored temperamental traits considered to mediate the 
susceptibility of children, adolescents, and adults to suffer from the 
respective disorders; these include neuroticism/negative affectivity, 
introversion, anxiety sensitivity, behavioural inhibition, insecure 
attachment styles, and harm avoidance. Further relevant cross- 
disorder phenotypes are expanded upon in Chapter 80. 

Several GWAS have reported on genome-wide significant asso- 
ciations for neuroticism. One approach (106,716 samples) impli- 
cated risk genes like the glutamate ionotropic receptor kainate type 
subunit 3 (GRIK3, rs490647) and CRHRI1 (rs111433752), as well 
as a 4-Mb region on chromosome 8 spanning more than 36 genes 
[59]. Another genome-wide analysis of 170,911 subjects identified 
11 other variants associated with neuroticism; a genetic correlation 
based on linkage disequilibrium score regression between five dif- 
ferent neuropsychiatric phenotypes and neuroticism was highest for 
anxiety disorders (0.88) [60]. Furthermore, a GWAS twin study in 
730 monozygotic and dizygotic twin pairs, exploring the genetic basis 
of anxiety sensitivity estimated a heritability of 0.44 and identified a 
top hit within the coding region of the RNA binding protein fox-1 
homolog 1 gene (RBFOX1, rs13334105) [61]. Notably, a genome- 
wide genetic linkage of neuroticism quantitative trait loci (QTLs) 
in extremely discordant and concordant sibling pairs established 
a locus on chromosome 1 spanning the regulator of the G-protein 


signalling gene (RGS2) [62]. This proved to be syntenic with a QTL 
modulating anxiety-related behaviour in mice [63], marking a rare 
occasion of inter-species QTL concordance. Moreover, the RGS2 
rs4606 C allele (hypothesized to lead to increased mRNA stability, 
and therefore higher expression levels) has been associated with in- 
creased incidence rates of PD [64] and GAD [65]. 

Based on a PD- and AP-related dimensional intermediate pheno- 
type (Agoraphobia Cognition Questionnaire [ACQ]), a GWAS in 
healthy German probands (1370 samples) found genome-wide as- 
sociations between increased agoraphobic scores and SNPs in the 
glycine receptor subunit beta (GLRB), one of which (1rs7688285) was 
successfully replicated in an independent dimensional sample and a 
case/control sample [66]. The minor rs7688285 A allele was subse- 
quently demonstrated to be linked to increased GLRB expression in 
the midbrain of post-mortem tissue, reduced startle habituation and 
increased startle potentiation and generalization, as well as stronger 
amygdala and insula activation, during the acquisition phase and 
thalamus, putamen/pallidum and skin conductance responses, fol- 
lowing the acquisition phase in a fear conditioning paradigm [66]. 
A comparable dimensional GWAS approach in a Hispanic com- 
munity sample (11,456 samples free of antidepressant or anxiolytic 
medication) detected a genome-wide significant association between 
a self-constructed GAD symptom score and the thrombospondin 2 
gene (THBS2, rs78602344) [67]. Despite these encouraging advances 
with regard to personality dimensions and psychometric scores, so 
far, no genome-wide significant associations have been reported for 
a continuous measure of phobic anxiety [68], anxiety-related behav- 
iour in childhood [69], or harm avoidance [70]. 


Serotonergic system 


Complementary to GWAS, candidate gene, and gene-environment 
studies outlined previously, the serotonergic system, and therein par- 
ticularly the 5-HTTLPR, has been a focus of attention in the investiga- 
tion of cross-disorder and dimensional anxiety-related phenotypes. 
Anticipation of an unpredictable shock following a long phase of ex- 
pectation resulted in increased neuronal activation of the amygdala, 
hippocampus, anterior insula, thalamus, pulvinar nucleus, caudate 
nucleus, precuneus, anterior cingulate cortex, and medial prefrontal 
cortex and increased positive coupling between the medial pre- 
frontal cortex and physiological anxiety experience based on insular 
activity in 5-HTTLPR s/s homozygotes, while in | allele carriers, a 
negative coupling was observed [71]. Additionally, increased activa- 
tion of the dorsomedial prefrontal cortex in 5-HTTLPR short allele 
carriers mediated increased psychophysiological reactions in antici- 
pation of future aversive events, explicitly by increased skin conduct- 
ance and startle reaction [72]. The low-expression 5-HTTLPR risk 
alleles have also been implicated in an increased amygdala response 
to sad emotional faces [73]. Furthermore, healthy controls homozy- 
gous for the 5-HTTLPR | allele reacted with increased amygdala and 
insula activity towards the presentation of fearful faces, if they were 
treated with citalopram, as compared to placebo treatment, yielding 
interesting insights into the often reported ‘jitteriness’ phenomenon 
accompanying SSRI treatment during the titration phase [74]. 

Moreover, decreased SLC6A4 promoter methylation levels have 
been linked to increased cortisol secretion, following the TSST in 
5-HTTLPR s allele carriers in a dose-dependent fashion, while no 
such association was detectable in s allele carriers with high methy- 
lation levels [75]. 


Catecholaminergic system 


In healthy female subjects, the COMT 158val allele conferred in- 
creased amygdala responses to fearful and angry facial stimuli, as 
well as heightened activation of the ventral visual stream and the lat- 
eral prefrontal cortex, pointing to a gender-specific effect of altered 
catecholaminergic activity on emotional processing [76]. Focusing 
on cross-generational interaction and neuronal development, it has 
also been shown that increased antenatal maternal anxiety was sig- 
nificantly positively correlated with ventrolateral prefrontal cortex 
thickness in neonates when carrying the COMT val/val genotype, 
suggesting a complex genetic interaction between maternal suscep- 
tibility to anxiety disorders and in utero development of anxiety- 
relevant cortical areas [77]. 


Neuropeptide systems 


The multi-level disease framework depicted in Fig. 89.1 reflects 
the converging evidence for a role of the NPS system in categor- 
ical anxiety diagnoses and cross-disorder intermediate phenotypes. 
Additional evidence for a key function of this system in panic- 
related anxiety emerges from a challenge approach, allowing for 
studying an unfolding panic attack by injection of the panicogenic 
pharmacological agent cholecystokinin tetrapeptide (CCK-4) [78]. 
Following CCK-4 injection, healthy NPSR1 rs324981 T allele car- 
riers showed lower levels of excitatory neurotransmitter abundance 
in the anterior cingulate cortex, as measured by magnetic resonance 
spectroscopy, during the course of a panic attack [79]. This sug- 
gests reduced inhibitory top-down control, allowing maladaptive 
behavioural responses [79]. A further pertinent link between the 
NPS system and phenotypes with particular relevance to anxiety- 
related disorders has been established by an imaging genetics ap- 
proach; the NPSR1 rs324981 T risk allele has been associated not 
only with increased amygdala activation in response to fear-relevant 
stimuli, but also with increased harm avoidance traits, potentially 
connecting the NPSR1 risk allele to disorder-specific neuronal pro- 
cesses shaping pathological fear reactions in the limbic system [80]. 
Furthermore, study of event-related potentials uncovered intensi- 
fied response inhibition and error monitoring to be associated with 
the NPSR1 rs324981 T allele, with the latter also mediating increased 
levels of anxiety sensitivity [81]. Similarly, higher activation patterns 
in the prefrontal cortex and the locus coeruleus region during an 
alerting condition and increased activations in fronto-parietal re- 
gions during an executive control condition in T/T genotype car- 
riers correlated with heightened anxiety sensitivity scores [82]. 
This hints at an influence of the NPSR1 rs324981 polymorphism on 
neuronal processes of anxiety-relevant attentional functioning [82]. 
Remarkably, analysis of fronto-limbic connectivity during brain 
development revealed a higher connectivity between the middle 
frontal gyrus and the amygdala in older adolescents carrying the 
NPSR1 rs324981 A allele, while T/T homozygosity was linked to a 
reduced connection between the medial frontal gyrus and both the 
amygdala and insula [83]. Hence, the NPSR1 genotype has a poten- 
tial role in dysfunctional development of cortico-limbic interaction 
during a crucial time window of anxiety disorder onset [83]. The ef- 
fect of the NPSR1 rs324981 genotype on anxiety sensitivity revealed 
that T/T homozygosity and high levels of childhood trauma resulted 
in increased levels of anxiety sensitivity, particularly concerns about 
mental incapacitation and physical sensations, while acute somatic 
concerns interacted with the NPSR1 genotype and the amount of 
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recent life events [84]. Interestingly, from the perspective of innova- 
tive future treatment strategies, intranasal NPS administration has 
been shown to reach the anterior olfactory area, nucleus accumbens, 
thalamus, hypothalamus, and particularly the hippocampus where 
it modulates synaptic transmission and plasticity, next to exerting 
anxiolytic properties in multiple behavioural tests [85]. 

Moreover, the influence of a less secure attachment style on in- 
creased levels of social anxiety may be partially modulated bythe OXTR 
rs53576 A/G polymorphism, with the A allele exerting a significantly 
greater negative influence of insecure attachment on SAD-related di- 
mensional traits, as compared to the G/G genotype [86]. The potential 
for a paradigm shift from ‘risk genes towards a notion of ‘plasticity’ 
genes is exemplified by a neuroimaging study assessing OXTR gene- 
environment interactions for the very same polymorphism previously 
linked to protective, as well as maladaptive, properties. OXTR 1s53576 
G/G homozygotes exhibited a significant reduction of bilateral ventral 
striatum grey matter volume interacting with the extent of childhood 
trauma suffered and increased amygdala reactivity towards emotional 
facial stimuli [87]. In turn, decreased ventral striatum grey matter vol- 
umes were also associated with the prosocial trait of lower reward de- 
pendence in G/G homozygotes [87]. Thus, the G allele may not only 
mediate the vulnerability of limbic structures to adverse childhood 
events but could also confer resilience in the form of increased limbic 
responsiveness to emotional interaction [87]. 


Clinical challenges and future perspectives 


The establishment of the genetic variation influencing anxiety 
disorders is an ongoing process. Recent advances converge on a 
cross-disorder explanation of psychopathology, pointing towards 
the need for a revised understanding of mental disease frame- 
works (Fig. 89.1), which may be translated into novel clinical tools. 
Specifically, the field of psychiatric genetics/epigenetics holds the 
promise to catalyse a paradigm shift in drug discovery, treatment 
response prediction, individual risk evaluation, and nosological 
conceptionalization, by ultimately answering questions regarding 
the molecular plasticity of anxiety and fear, experience and trauma, 
and resilience and susceptibility [45]. 

The inspection of a given gene, or rather a gene set, in relation 
to biological pathways and processes yields the potential to un- 
cover innovative molecular treatment targets and pioneer alterna- 
tive mechanisms of action. This is why especially the follow-up of 
unbiased GWAS by candidate gene studies and functional in silico 
enrichment analyses represents a pivotal step in the identification 
and prioritization of therapeutic objectives. The greatest present 
challenge is the consistent replication of findings. However, as re- 
viewed in detail previously, several genes (for example, NPSR1 or 
ACCN1) and systemic processes (for example, glucocorticoid or 
oxytocinergic signalling) have accumulated considerable evidence 
for their modulation of disease-relevant traits like fear generaliza- 
tion or perseverative thought patterns. 

In addition to the provision of novel therapeutic targets, psychi- 
atric genetics may also contribute to the discovery of clinically valid 
phenotypes, exploring the molecular characterization of patient 
subgroups that would allow for an improved pairing of patient and 
treatment in an individualized precision medicine approach. Future 
steps might comprise the combination of several or even many 
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SNPs, as well as the inclusion of environmental factors to account for 
the phenotypic heterogeneity observed in anxiety disorders. A poly- 
genic risk score of environmental sensitivity based on within-pair 
variability in emotional problems of monozygotic twins has been 
found to significantly moderate the effect of parenting on emotional 
problems in an independent group [88]. It also predicted a better re- 
sponse to CBT for childhood anxiety disorders in individual, rather 
than in group, sessions, suggesting that, in line with the differential 
susceptibility hypothesis, individuals with greater environmental 
vulnerability are at an increased likelihood to develop emotional 
problems in an adverse environment but correspondingly benefit 
more from an intense form of treatment [88]. 

Finally, as summarized, a growing body of evidence suggests 
dynamic epigenetic changes in chromatin structure and methyla- 
tion patterns to underlie anxiety-related phenotypes, their patho- 
physiological signature, their disease course, and perhaps most 
strikingly—in a possibly mechanistic way—their clinical response 
to psychotherapeutic or psychopharmacological interventions. For 


example, increased MAOA methylation levels following successful 
CBT might not only constitute a plastic epigenetic marker of therapy 
response [51] but could also pave the way for personally tailored 
therapy augmentation. This could be provided in the form of mono- 
amine oxidase inhibitors (MAOIs) as an adjunct to psychotherapy 
in order to supplement for the degree of MAOA hypomethylation in 
patients suffering from PD. Although the particular attractiveness 
of epigenetic risk markers lies in their inherent capacity for revers- 
ibility, permanent (for example, SNPs), as well as transient (for ex- 
ample, methylation), changes to the DNA strand might not always 
be reducible to ‘risk and ‘resilience’ markers, but rather be viewed in 
the perspective of differential susceptibility. Thus, individuals most 
vulnerable to environmental adversity due to their genetic loading 
might simultaneously be most receptive to beneficial experiences, 
which would consequently result in their nomenclature being re- 
labelled as ‘plasticity’ genes [89]. Following their consistent valid- 
ation in longitudinal studies of diagnosis and treatment, epigenetic 
signatures like methylation patterns could eventually meet the 


Table 89.2 Overview of genetic variants implicated in anxiety disorders and cross-disorder anxiety phenotypes. The overall degree 
of confidence was ranked from ‘+’ = ‘low’ to ‘++++’ = ‘high’, based on the reviewed evidence (refer to the individual sections in this chapter 


for detailed supporting literature) 


Top risk variant 


epigenetic studies 


Evidence from structural, rare variant, GWA, candidate gene, gene - environment, and 


Overall 
degree of 
confidence 


























MAOA Upstream variable Candidate gene association with PD in females [31] 
number of eta-analytical candidate gene association with PD in females [32] 
tandem Association with decreased PD CBT treatment response [33] 
repeats (UVNTR) Reduced methylation in the transcription-regulatory region in female PD patients [50] eee 
Long allele egative life events resulted in decreased MAOA methylation levels; positive life events were associated 
with increased methylation [50] 
Decreased methylation correlated with increased symptom measures in female PD patients [51] 
Response to PD CBT was linked to increased methylation [51] 
NPSR1 rs324981 Candidate gene association with PD in females [44] 
T allele ncreased symptom measures during behavioural avoidance and decreased anterior cingulate cortex 
top-down control towards fear-relevant stimuli [44 
Gene-environment interaction with childhood trauma associated with increased anxiety sensitivity [84] 
eta-analytical candidate gene association with PD [13] E 
Lower anterior cingulate cortex excitatory neurotransmitter levels during a panic attack [79] 
ncreased amygdala response to fear-relevant stimuli and increased harm avoidance [80] 
ncreased frontal and locus coeruleus activation in the neuronal alerting network [82] 
ncreased fronto-parietal activation in the executive control network correlating with anxiety sensitivity [82] 
Reduced development of neuronal fronto-limbic connectivity [83] 
TMEM132D rs7309727 Rare functional coding variants protective of PD [10 
T allele Top hit in PD/PD+AP GWAS [11] N 
Confirmed in follow-up candidate gene association [12] 
eta-analytical candidate gene association with PD [13] 
COMT rs4680 Candidate gene association with PD in females [36] 
G allele eta-analytical candidate gene association with PD [13] +44 
(vall58met, val variant) Increased amygdala response to negative emotional stimuli [76] 
SLC6A4 Serotonin transporter ncreased neuroticism and harm avoidance [15] 
polymorphic ncreased blushing propensity in SAD [17] 
region ncreased amygdala activation during auditory habituation [18] 
(5-HTTLPR) short eta-analysis of PD candidate gene studies found no association [22] 
allele Contradictory findings of association with anxiety disorder CBT treatment response in children [23, 24] ++ 
Association with decreased SSRI treatment response [25, 26] 
ncreased coupling of insular activity and physiological anxiety [71] 
ncreased amygdala response to sad faces [73] 
Decreased methylation levels linked to increased cortisol response to social stress [75] 
CRHR1 rs17689918 Cross-cohort candidate gene association with PD in females [40] 
A allele ncreased anxiety sensitivity and impaired safety signal processing in the amygdala [40] ie 
Different SNP (rs111433752) implicated in neuroticism GWAS [59] 
Decreased promoter methylation associated with PD [55] 
AP, agoraphobia; SAD, social anxiety disorder; PD, panic disorder; CBT, cognitive behavioural therapy; GWAS, genome-wide association study; SNP, single nucleotide polymorphism. 


clinical need for an easily accessible peripheral biomarker, allowing 
the stratification of patients in susceptibility classes. Subsequently, 
regular quantitative observations could guide the development and 
provision of targeted, and thus the most effective, preventive, as well 
as interventional measures. The next generation of psychothera- 
peutic and psychopharmacological treatments might also be evalu- 
ated by the speed, strength, and permanence with which they alter 
the multilocus impact of environmental stimuli and traumata on the 
epigenetic architecture of anxiety disorders. 


Anxiety disorders display a distinct familial clustering, and herit- 
able factors contribute significantly to their disease burden. Large- 
scale unbiased GWAS and meta-analyses have begun to reach the 
necessary sample sizes to uncover meaningful SNP associations 
and have been complemented and extended by candidate gene 
and gene-environment studies implicating the serotonergic and 
catecholaminergic, as well as neuropeptide and hormonal, sys- 
tems (Table 89.2). The polygenic nature of anxiety disorders sug- 
gests that the still incompletely unravelled molecular signature of 
anxiety might transcend traditional hierarchical diagnostic bound- 
aries. Facing the complexity of potential pleiotropic effects and their 
interactions with each other, systems—biological methods could 
facilitate the mapping of candidate genes and functional pathways 
onto cross-cutting symptom measures and uncover their interface 
with temperamental, developmental, and environmental variables. 
One of the emerging key mechanistic processes potentially unifying 
these components appears to be psychiatric epigenetics. Given the 
dynamic nature of chromatin structure and methylation patterns re- 
lated to environmental influences, as well as psychotherapeutic and 
psychopharmacological interventions, epigenetics may explain part 
of the missing heritability of anxiety disorders, constitute a mechan- 
istic link between stress and biological vulnerability, and finally—in 
synopsis with genetic and other risk factors—inform personalized 
treatment and indicated prevention in anxiety disorders. 
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The development of neuroimaging techniques, such as functional 
magnetic resonance imaging (fMRI), has enabled neuroimaging re- 
searchers to identify structural and functional patterns in the brain 
underlying mental disorders. Thus, neuroimaging has helped to 
shed light on the neurobiological basis of anxiety and the patho- 
physiological mechanisms in anxiety disorders [1]. A large body of 
research has been conducted on experimentally induced anxiety in 
healthy individuals. Fear conditioning and fear generalization para- 
digms serve as models for the development and maintenance of 
pathological anxiety and for the maladaptive overgeneralization of 
fear [2, 3]. Studies investigating pharmacologically induced panic 
in healthy individuals tried to exemplarily resemble symptoms 
of patients with panic disorder (PD) [4]. Neuroimaging studies 
investigating patients with anxiety disorders have attempted to dis- 
entangle the neurobiological characteristics specific to each disorder 
[5]. Furthermore, more and more effort is focused on under- 
standing the neural mechanisms of successful treatment of anxiety 
disorders [6]. 

The present review attempts to introduce a comprehensive over- 
view of the current neuroimaging findings from research on anxiety 
disorders. In the following, we discuss evidence from neuroimaging 
studies on experimentally induced anxiety in healthy subjects, re- 
sults of studies investigating neuronal patterns in patients with anx- 
iety disorders, and finally findings from neuroimaging research on 
treatment effects in anxiety disorders. 
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The functional neuroanatomy of anxiety in healthy humans has been 
investigated using experimentally induced anxiety. Such studies con- 
siderably enhanced the knowledge on pathophysiological aspects of 
anxiety disorders and offer the possibility to develop new treatment 
strategies. Mainly two different approaches of anxiety induction in 
healthy subjects have been applied in the recent past: fear condi- 
tioning paradigms and pharmacologically or respiratory-induced 
panic. In classical fear conditioning studies, a previously neutral 
stimulus becomes ‘conditioned’ after repeatedly pairing it with an 





aversive stimulus which elicits an autonomic fear response. After 
several paired presentations, the previously neutral stimulus elicits 
the autonomic fear response by itself [7]. For instance, this was done 
by conditioning neutral faces with an unpleasantly loud tone [3]. 
Thus, conditioned neutral faces elicited brain activity in a ‘fear net- 
work’ comprising the anterior cingulate cortex (ACC), the anterior 
insula, and the amygdala. Activity within this network evoked by 
aversive conditioned stimuli has repeatedly been shown in several 
fear conditioning studies [8, 9]. Moreover, this ‘fear network is also 
known to be active during symptom provocation in patients with 
anxiety disorders. 

The idea of fear conditioning in healthy subjects has recently been 
used in neuroimaging studies investigating the neurobiological 
mechanisms involved in fear generalization. Healthy organisms 
tend to generalize conditioned fear responses to a certain stimulus 
and transfer it to perceptually close events in order to adapt be- 
havioural responses to complex real-world situations by means of 
fear learning [10]. A maladaptive overgeneralization of fear is con- 
sidered as a core pathophysiological mechanism of anxiety disorders 
[11, 12]. A first {MRI study investigating neurobehavioural mechan- 
isms of human fear generalization revealed brain activity within re- 
gions also involved in the acquisition of conditioned fear in response 
to stimuli resembling an aversive conditioned stimulus. Activity 
within this network comprising the insula, caudate, thalamus, and 
ACC increased with growing similarity of the probe stimulus to the 
aversive conditioned stimulus, whereas the ventromedial prefrontal 
cortex (PFC) showed increased activity in response to a stimulus 
resembling a non-conditioned stimulus [13]. These results were rep- 
licated by studies presented in [2] and [14]. Additionally, Lissek et al. 
showed an increase in the involvement of the hippocampus and its 
connectivity to the ventromedial PFC with increasing similarity of 
the stimulus to a cue representing safety (in contrast to the danger- 
conditioned cue). The functional connectivity between the hippo- 
campus and the amygdala, as well as the insula, increased in response 
to stimuli more and more resembling the danger-conditioned cue 
[14] . On basis of these findings, a fear generalization model has 
been proposed, suggesting that sensory information on stimuli re- 
sembling danger-conditioned cues is directly forwarded from the 
thalamus to the amygdala, bypassing the sensory cortex. ‘Thus, 
conditioned fear is rapidly expressed via connections between the 
amygdala and, for example, the insula. Furthermore and according 


to this model, the hippocampus is suggested to contribute processes 
of pattern completion in case of presentation of stimuli resembling 
danger-conditioned cues, which initiates the conditioned fear re- 
sponse. On the other hand, in case of pattern separation processes 
in the hippocampus due to presentation of a cue not resembling 
the danger-conditioned cue, the ventromedial PFC is activated and 
downregulates the amygdala [15]. However, in view of the results 
of their recently published fMRI study, Onat et al. suggest that fear 
generalization is not only passively dependent from the percep- 
tual similarity of a stimulus to a danger-conditioned cue, but also a 
mechanism which enables the human brain to actively transfer the 
threat classification to perceptually similar stimuli, using additional 
variables besides perceptual similarity [16]. 

In order to expand the understanding of neurobiological mechan- 
isms of psychotherapeutic interventions in the treatment of anxiety 
disorders, fear conditioning paradigms also were used to investigate 
processes of fear extinction in neuroimaging studies [8]. Therefore, 
fear extinction has experimentally been modelled by repeatedly 
presenting a previously neutral, but aversive, conditioned stimulus 
without the aversive stimulus. Thus, the conditioned fear response 
can gradually be eliminated. There is evidence for activation in the 
ventromedial PFC and the hippocampus during fear extinction that 
might reflect a regulating inhibition effect on the amygdalar fear 
reaction [17]. Fear conditioning is differentiated from extinction 
learning by an anti-correlated dynamic competition between the 
large-scale functional networks underlying conditioning (amygdala- 
thalamo-cortical network) and extinction (hippocampal-prefrontal 
network) [18]. Moreover, a contribution of the cerebellar vermis to 
the extinction of conditioned fear has been shown [19]. Eckstein 
et al. recently found that oxytocin facilitates the extinction of condi- 
tioned fear by increasing PFC activity and inhibition of amygdalar 
responses during extinction [20]. 

An experimental model of the relapse of anxiety symptoms after 
successful therapeutic intervention in anxiety disorders is provided 
by a number of studies using return of fear experiments. For this 
purpose, the aversive event is re-presented after extinction [21]. 
Return of fear processes critically involve brain structures within 
the hippocampus and the ventromedial PFC [22]. The return of fear 
could be prevented experimentally by administration of the dopa- 
mine precursor 1-3,4-dihydroxyphenylalanine (1-DOPA) [23], in- 
take of the B-blocker propranolol [24], or REM sleep [25], which, 
in all cases, was accompanied by alterations of activity in return of 
fear-related brain regions. 

The experimental induction of panic attacks with panicogenic 
substances in healthy subjects is another approach of model- 
ling human anxiety. Different substances have been used to elicit 
panic symptoms in healthy individuals. For instance, the respira- 
tory induction of panic symptoms has been conducted using lac- 
tate or carbon dioxide administration simulating life-threatening 
hypoxia. Pharmacological approaches involved noradrenergic 
(for example, yohimbine) and serotonergic (for example, meta- 
chlorophenylpiperazine) panic induction strategies. However, the 
most commonly used panicogenic agent is the synthetic neuropep- 
tide cholecystokinin-tetrapeptide (CCK-4), a synthetic analogue of 
the endogenous neuropeptide cholecystokinin. It is considered as an 
ideal, safe, and valid experimental model of panic, serving as a useful 
approach to study the pathophysiology and neurobiology of PD 
[26]. CCK-4 induces symptoms closely resembling spontaneously 
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occurring panic attacks in patients with PD [27]. Additionally, 
antipanic pharmacological treatment like imipramine reduces CCK- 
4-induced panic symptoms [28]. To this day, several neuroimaging 
studies have used the CCK-4 panic model in healthy participants 
in order to characterize brain regions which might be involved in 
the generation of panic attacks. The first studies investigating the 
functional neuroanatomy of CCK-4-induced panic in healthy par- 
ticipants using positron emission tomography (PET) showed an 
increase in cerebral blood flow (CBF) in the ACC, the claustrum- 
insular-amygdala region, and the cerebellar vermis [29, 30]. These 
results were confirmed by a first {MRI study contrasting brain ac- 
tivity induced by CCK-4 vs placebo administration [31]. However, 
the authors did not report amygdala activation elicited by CCK-4. 
In contrast, Eser et al. showed significantly stronger amygdala acti- 
vation after CCK-4, in comparison to placebo administration, and 
five out of 11 healthy subjects with high amygdala activity reported 
more pronounced fear symptoms, in contrast to subjects with low 
amygdala activity. According to the authors’ discussion, these find- 
ings indicate that the extent of amygdala activation might modu- 
late the experience of CCK-4-induced fear, emphasizing the role of 
this structure in fear and anxiety. Moreover, CCK-4, in comparison 
to placebo, elicited large responses in several other brain regions, 
for example, the ventral ACC. In contrast, anticipatory anxiety in 
expectance of CCK-4 administration led to increased activity in the 
dorsal part of the ACC. Eser et al. interpreted this differential activa- 
tion of the ACC as a correlate of a diverse functional neuroanatomy 
between cognitive anticipation and real autonomic and affective re- 
sponses to CCK-4 in healthy subjects [32]. Using a data-driven fMRI 
analysis approach called tensorial probabilistic independent com- 
ponent analysis, Dieler et al. identified strong increases of activity in 
12 different brain networks being active only after CCK-4, but not 
after placebo, administration or showing a significant increase in ac- 
tivity after CCK-4 administration, compared to placebo. The way of 
analysis in this study additionally allows to explore the time courses 
of each activated network and could enable future studies to quan- 
tify and visualize changes in CCK-4-activated neural networks, for 
example, elicited by pharmacological treatment of CCK-4 induced 
panic [33]. The CCK-4 model of PD was used not only to reveal 
the functional neuroanatomy of panic attacks, but also to investigate 
putative genomic risk factors for anxiety [34]. For instance, using 
magnetic resonance spectroscopy, Ruland et al. showed that the 
brain glutamate/glutamine levels in bilateral ACC during an acutely 
occurring CCK-4-induced panic attack is dependent on the neuro- 
peptide S receptor gene variation [35]. Neuropeptide S is suggested 
to be involved in the pathogenesis of anxiety and PD. Accordingly, 
the CCK-4 panic challenge might be able to help to uncover genetic 
mechanisms influencing anxiety brain networks. 





Imaging neural correlates of anxiety disorders 


Symptom provocation is the most commonly used procedure for the 
investigation of anxiety disorders in neuroimaging studies. For this 
purpose, anxiety-specific brain activity is induced by negative emo- 
tional stimuli such as, for example, angry faces [36], trauma-related 
visual or auditory stimuli [37], or pictures of phobia-related contents 
[38]. Most of the studies revealed increased activity of the ‘fear net- 
work comprising the amygdala, insula, ACC, and medial PFC [7] in 
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patients with anxiety disorders, compared to healthy control groups, 
during symptom provocation. The appearance of anxiety symptoms 
seems to involve a pathologically hyperactivated amygdala under in- 
sufficient frontal top-down control [39]. 

The amygdala is considered to be involved in grading the bio- 
logical significance of a stimulus [40]. Thus, it plays a crucial role 
in the processing of emotionally and socially relevant information 
[41]. The insula is another brain structure which is related to the 
processing of emotional contents [42] and interoceptive awareness 
[43]. Mechanisms of fear learning critically involve the medial PFC 
[44]. Thus, the crucial relevance of these regions for fear and anxiety 
may arise from its engagement in ‘the processing of emotions as they 
relate to the self? [45]. 

Patients with generalized anxiety disorder (GAD) show a deficient 
ability of regulating their emotions. The results of the neuroimaging 
studies investigating this cognitive dysfunction in GAD patients sug- 
gest that the dysregulation of emotions is related to a malfunction of 
the medial PFC and alterations of its connectivity to the amygdala in 
terms of an insufficient top-down control [46]. This frontal control 
may be realized by means of theta oscillations, for example during 
emotion regulation [47]. Reduced mPFC activity was found in GAD 
patients during viewing emotional faces [48], suppressing worries 
[49], and reappraising emotional responses to negative pictures [50]. 
In a fear generalization experiment, the mPFC was not sufficiently 
recruited during fear inhibition in GAD patients [51]. Moreover, the 
connectivity between the mPFC and the amygdala has been shown 
to be altered in GAD patients [52]. 

The first functional neuroimaging studies investigating patients 
with PD have reported alterations of the blood flow in hippocampal 
brain regions [53, 54]. Results of {MRI studies investigating the 
processing of emotional stimuli suggest that hyperactivity of sev- 
eral different brain regions, including the amygdala, dorsolateral 
and medial PFC, ACC, insula, inferior frontal gyrus, orbitofrontal 
cortex, striatum, and hippocampus, are involved in the pathophysi- 
ology of PD [55, 56]. For instance, stronger activations were found in 
the bilateral ventral striatum and the left insula in PD patients during 
the anticipation of agoraphobia-specific pictures [57]. Increased 
anxiety sensitivity, which is known to be an important component 
of PD, has been linked to activity in regions encompassing the PFC, 
ACC, and insula during emotional processing [58]. A neural cor- 
relate of the altered interoceptive processing in patients with PD 


CM amygdala 





might be represented by an increased functional connectivity be- 
tween the somatosensory cortex and thalamus in the resting state 
brain [59]. The hypersensitivity of PD individuals to inhaled carbon 
dioxide goes along with alterations of the acid-sensing ion channel 
ASICla, and ASIC1a risk allele carriers showed increased amygdala 
volume, as well as task-evoked amygdala reactivity to fearful and 
angry faces [60]. Domschke et al. showed a reduction of effective 
connectivity between the middle frontal gyrus and both the amyg- 
dala and the insula, reflecting an impaired frontal top-down con- 
trol of limbic structures in adolescent carriers of a variant in the 
neuropeptide S receptor gene which has been associated with PD 
[61]. Another imaging genetics study investigated the influence of 
the corticotropin-releasing hormone receptor gene CRHR1 as a risk 
factor for PD on brain activity during fear conditioning. Risk allele 
carriers showed aberrant activity predominantly in bilateral pre- 
frontal regions and the amygdala, implicating altered generalization 
of fear processes [62]. 

Post-traumatic stress disorder (PTSD) has been associated re- 
currently with amygdala hyperactivity [63, 64] and hypoactivity in 
the medial PFC [65] in functional neuroimaging studies. Moreover, 
a lack of negative connectivity between the amygdala and the 
subgenual ACC has been observed (Fig. 90.1) [66]. Among others, 
one established model of PTSD assumes that amygdala hyper- 
activity reflects permanently increased fear responses, whereas the 
observed hypoactivity in frontal regions might be a correlate of an 
aberrant ability of top-down regulation and fear extinction [67]. 
Moreover, a reduction of hippocampal activity has been observed 
in PTSD patients [68], which has been interpreted as a disturbed 
ability of the hippocampus to provide context information, espe- 
cially with respect to the identification of safe contexts [69]. This 
exemplary conception is further strengthened by structural changes 
in several brain regions, including the hippocampus, amygdala, and 
medial PFC observed in PTSD patients [65, 70, 71]. Interestingly, 
some studies report increased activity in the dorsal part of the ACC 
and dorsolateral PFC which might be the correlate of an increased 
recruitment of top-down attentional control [72]. Disturbances of 
fear generalization processes in military veterans with PTSD have 
been linked to an increase of amygdala connectivity with sensory 
brain regions and the thalamus and a decrease of connectivity be- 
tween the amygdala and the ventromedial PFC [73]. Spielberg et al. 
reported an association between PTSD re-experiencing symptoms 
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Fig. 90.1 (see Colour Plate section) Absent negative connectivity between the centromedial (CM) amygdala and the subgenual anterior cingulate 
cortex (sgACC) in trauma-exposed youth. (a) Anatomically defined seed region of interest for the connectivity analysis within bilateral CM amygdala. 
(b) Tukey's boxplots depicting connectivity values by group centred on the sgACC peak. The middle line indicates the median, the vertical line the 


range, and the limits of the box represent the upper and lower quartiles. 


Reproduced from Soc Cogn Affect Neurosci., 10(11), Thomason ME, Marusak HA, Tocco MA, et al., Altered amygdala connectivity in urban youth exposed to trauma, pp. 1460- 


8, Copyright (2015), with permission from Oxford University Press. 


and a weakened connectivity in a hippocampus-PFC network in- 
volved in providing contextual information [74]. The fact that 
exposure to a traumatic event does not necessarily lead to the devel- 
opment of PTSD raises the question of whether the structural and 
functional abnormalities predispose to or follow the development 
of PTSD [75]. For instance, a smaller hippocampal volume has been 
shown to predict pathological vulnerability to psychological trauma 
[76]. On the other hand, hippocampal grey matter loss is predicted 
by the duration of PTSD [77]. 

Neuroimaging studies investigating the neural alterations 
underlying specific phobia mainly focused on the animal sub- 
type, particularly spider phobia subjects. These studies consist- 
ently reported hyperactivation in fear-relevant structures such 
as the amygdala, dorsal ACC, PFC, thalamus, and insula [5]. For 
instance, spider-phobic subjects showed stronger responses of 
both amygdalae during exposure to pictures of spiders. The mag- 
nitude of amygdala responses was positively correlated with the 
intensity of disorder-related vigilance for threat [78]. At the same 
time, there are reports on reduced activity in both the dorsolateral 
PFC and the lateral orbitofrontal cortex during conscious percep- 
tion of phobia-related pictures. Taken together, these results have 
been suggested to reflect an increased activation of the fear circuit, 
accompanied by an insufficient top-down control in response to 
phobia-related stimuli [79]. The exaggeration of the expectancy 
to be faced with the phobic threat was shown to be related to de- 
creased activity in the lateral PFC, precuneus, and visual cortex, 
reflecting irrationality in encounter expectancies and deficiencies 
in cognitive control and contextual integration [80]. The overesti- 
mation of the relationship between fear-relevant stimuli and aver- 
sive consequences, the so-called covariation bias, was found to be 
predicted by connectivity between the paracentral lobule (PCL) 
and sensory cortices, whereas reduced covariation bias was pre- 
dicted by connectivity between the PCL and the PFC [81]. In con- 
trast to the phasic fear responses to threatening stimuli associated 
with increased amygdala activity, sustained anxiety in phobic pa- 
tients has been related to increased activity in the bed nucleus of 
the stria terminalis (BNST) and the ACC, as well as to enhanced 
connectivity between the BNST and the amygdala [82]. Increased 
activity of the ACC has also been linked to anticipatory anxiety in 
phobia patients [83]. Moreover, structural brain abnormalities of 
fear-related brain regions in patients with specific phobia have also 
been reported [84]. 


Neuroimaging of treatment effects 





in anxiety disorders 


Neuroimaging techniques offer the opportunity to monitor struc- 
tural and functional neuronal changes as a result of psychotherapy 
in anxiety disorders or to predict the efficacy of psychotherapeutic 
interventions. For example, behavioural changes in patients with 
social anxiety disorder after mindfulness-based stress reduction 
(MBSR) therapy seem to be reflected by distinct patterns of neural 
activity [85]. Hölzel et al. demonstrated that GAD patients with 
higher amygdala activity in response to neutral faces than healthy 
participants showed a decrease in amygdala reactivity following 8 
weeks of MBSR. Moreover, the activity in the ventrolateral PFC and 
the functional connectivity between the amygdala and PFC regions 
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increased after MBSR, in comparison to a stress management edu- 
cation active control group (Fig. 90.2) [86]. 

Cognitive behavioural therapy (CBT) is known to be an ef- 
fective first-line intervention in the treatment of patients with PD. 
Neuroimaging studies have shown that neural activation patterns 
predict and moderate the therapeutic success of CBT in PD [87]. 
For instance, CBT in patients with PD led to a significant greater 
reduction in bilateral amygdala activation during the processing of 
agoraphobia-related pictures than treatment with selective serotonin 
reuptake inhibitors (SSRIs) [88]. Another sample of PD patients re- 
vealed normalization of PFC hypoactivation and hyperactivation 
of the amygdala and hippocampus after CBT, when panic-related 
symptoms had improved [89]. In a fear conditioning study, patients 
with PD showed a CBT-induced reduction of activity in the left in- 
ferior frontal gyrus (IFG) during the conditioned response and an 
increase of connectivity between the IFG and regions of the ‘fear 
network’ such as the amygdalae, insulae, and ACC after CBT. The 
change of neuronal patterns was correlated with a reduction in 
agoraphobic symptoms [90]. An increase of activation of the hippo- 
campus and a decrease of its connectivity with the IFG have also 
been reported as an effect of successful CBT in PD [91]. Moreover, 
neuroimaging data were able to predict the treatment response to 
CBT in PD with high accuracy on single-patient level [92]. For in- 
stance, improved response to brief CBT was predicted by increased 
pretreatment activation in bilateral insulae and the left dorsolateral 
PFC during threat processing [93]. Lueken et al. showed an asso- 
ciation between treatment response to CBT and increased right 
hippocampal activation, as well as inhibitory functional coupling 
between the ACC and the amygdala [94]. 

Studies investigating the functional neuroimaging correlates of 
PTSD treatment have shown a correlation between improvement of 
PTSD symptoms after trauma therapy and alterations of activity in 
different brain regions such as the amygdala, insula, hippocampus, 
ACC, and dorsolateral PFC [95-98]. Treatment with eye movement 
desensitization and reprocessing (EMDR) in PTSD patient has 
been shown to modulate activity in limbic and prefrontal regions 
[99, 100] and to increase the amygdala volume [101], along with 
significant improvements in PTSD symptoms. The functional con- 
nectivity within the fear extinction network seems to be modified by 
PTSD treatment with repeated exposure to the traumatic memory, 
which emphasizes the relationship between human fear extinction 
and trauma memories [102]. Studies applying neuroimaging meas- 
ures in order to predict trauma therapy responses in PTSD patients 
have revealed relationships between the degree of symptom reduc- 
tion and the pretreatment activity of bilateral amygdalae [103], the 
cortical thickness in the subgenual ACC [104], or the hippocampal 
volume [105]. 

There is strong evidence for high efficacy of exposure therapy 
during CBT in the treatment of specific phobia patients [106]. 
The neural mechanisms underlying the reduction of anxiety 
symptoms after CBT have been extensively investigated in recent 
neuroimaging studies showing that processes of fear extinction and 
the corresponding mechanisms in the brain might play a crucial 
role. Different studies demonstrated a decrease in amygdala activity 
and normalization of insular and ACC activity after successful ex- 
posure therapy in specific phobia patients [107]. Reductions in 
amygdala responsiveness have been associated with self-reported 
symptom improvement [108]. An improvement in the effectiveness 
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Fig. 90.2 (see Colour Plate section) Functional connectivity between the seed region in the right amygdala and several regions in the frontal cortex 


increased from pre- to post-intervention with mindfulness-based stress reduction (MBSR) therapy in patients with generalized anxiety disorder, but 
not in control subjects who underwent a stress management education (SME) class. Top row: anatomical location displayed on an inflated surface. 
Middle row: regression coefficients extracted from the clusters from MBSR (black) and SME (blue) participants at pre- and post-interventions. Bottom 
row: scatter plots for correlations between pre- to post-increase in connectivity (y-axis) in the frontal cortex region and Beck Anxiety Inventory (BAI; x- 
axis) for MBSR and SME participants at post-intervention. ACC, anterior cingulate cortex. 


Reproduced from Neuroimage Clin., 2, Hélzel BK, Hoge EA, Greve DN, et al., Neural mechanisms of symptom improvements in generalized anxiety disorder following 
mindfulness training, pp. 448-58, Copyright (2013), Elsevier. Reproduced under the Creative Commons Attribution License (CC BY). 





of exposure therapy via faster fear extinction processes has been 
reported for the application of D-cycloserine, a partial N-methyl- 
D-aspartate (NMDA) receptor agonist, in combination with 
exposure-based therapy [109]. This might be due to enhancement 
of activation in regions involved in cognitive control and intero- 
ceptive integration, such as the PFC, ACC, and insula, during 
symptom provocation [110]. A novel psychotherapeutic approach 
is provided by applying neurofeedback techniques during training 
of anxiety regulation by cognitive reappraisal. fMRI neurofeedback 
has recently been shown to enable spider-phobic patients to 
downregulate insula activation levels by cognitive reappraisal strat- 
egies, accompanied by lower anxiety levels. Feedback had been pro- 
vided, based on activation in the left dorsolateral PFC and the right 
insula. The achieved changes in insula activation levels seemed to 
be based on a learning mechanism and predicted long-term anxiety 
reduction [111]. 

Psychopharmacological treatments of anxiety disorders have also 
been shown to alter abnormal neural processes that were found to be 
key characteristics of fear and anxiety in neuroimaging studies. For 
instance, SSRI treatment attenuated activity in prefrontal regions, 
the striatum, the insula, and paralimbic regions during listening to 
worry sentences in GAD [112], reduced amygdala and enhanced 


ventral medial PFC reactivity to emotional faces in social phobia pa- 
tients (113), and increased activation in the dorsolateral PFC during 
emotion regulation (114), as well as increased mean hippocampal 
volume in PTSD patients (115). The anxiolytic effect of benzodi- 
azepines, such as alprazolam, seems to involve a reduction of rostral 
ACC activity and changes of its connectivity with the dorsolateral 
PFC and the amygdala, as revealed by means of CCK-4 panic chal- 
lenge in healthy subjects (Fig. 90.3) [4]. 

The use of the neuropeptide oxytocin has been suggested as a 
promising approach for the facilitation of psychotherapy effects in 
anxiety disorders. Recent neuroimaging studies have extensively 
illustrated its stress-reducing, anxiolytic, and prosocial properties 
[116]. For instance, oxytocin attenuated the exaggerated amyg- 
dala activity [117] and enhanced the functional connectivity of 
the amygdala with the insula, medial PFC, and ACC [118, 119] in 
pathological social anxiety. Furthermore, intranasal application of 
oxytocin normalized the amygdala functional connectivity to the 
dorsal ACC and the ventromedial PFC in PTSD patients, indicating 
suppression of the anxiety and fear expression of the amygdala via 
increased top-down control of the ventromedial PFC or decreased 
salience processing of the dorsal ACC [120]. In summary, recent 
evidence from neuroimaging findings suggests that oxytocin could 


CHAPTER 90 Neuroimaging of anxiety disorders 


= 
=a 
= 
= 
Ss 
= 
i 
== 
= 


Alprazolam vs Placebo 





Fig. 90.3 (see Colour Plate section) Contrasts of functional magnetic resonance imaging responses compared between different treatment conditions. 
Group analysis (n = 16) of the blood oxygen level-dependent response (functional magnetic resonance imaging) to challenge with cholecystokinin- 
tetrapeptide after alprazolam vs placebo pretreatment (Talairach co-ordinates: x = -1, y = 30). Activation of the rostral anterior cingulate cortex was 
reduced after pretreatment with alprazolam, but not after pretreatment with placebo. SAG, sagittal; COR, coronary; A, anterior; P posterior; L, left; 


R, right. 


Reproduced from Biol Psychiatry, 73(4), Leicht G, Mulert C, Eser D, et al., Benzodiazepines Counteract Rostral Anterior Cingulate Cortex Activation Induced by Cholecystokinin- 
Tetrapeptide in Humans, pp. 337-344, Copyright (2013), with permission from Society of Biological Psychiatry. 


potentially enhance treatment response in PTSD by modulating the 
fear network. 

These lines of research suggest that neuroimaging techniques 
could potentially identify the mode of action of anxiety treatment 
drugs and therefore facilitate the development of new pharmaco- 
logical treatment options for anxiety disorders. Furthermore, there 
is evidence that pretreatment patterns of functional neuronal ac- 
tivity might predict the treatment response to pharmacological 
interventions. For instance, the treatment response to venlafaxine 
was predicted by greater pretreatment ACC activity and lesser re- 
activity in the amygdala in GAD patients [121, 122]. 


Conclusions 


In the present article, we have provided an overview of the results of 
current neuroimaging studies in fear and anxiety. Human models of 
anxiety in healthy individuals serve as an experimental tool for the 
investigation of neural processes underlying the pathophysiology of 
clinically relevant anxiety disorders, for example, with respect to the 
expression of fear during symptom provocation or fear generaliza- 
tion. Moreover, neurobiological aspects of therapeutic interventions 
have been uncovered in fear extinction studies. Studies in human 
models of anxiety, as well as investigations in anxiety disorder pa- 
tients, consistently demonstrated the pathophysiological role of a 
‘fear network’ comprising the amygdala, insula, medial PFC, and 
ACC. Symptoms of anxiety are considered to be due to a patho- 
logically hyperactivated amygdala and insufficient top-down regu- 
lation by frontal brain regions. Moreover, a disturbed function of 
the hippocampus with respect to processes of pattern completion, 
pattern separation, and integration of contextual information seems 
to crucially contribute to the development and maintenance of anx- 
iety disorders. 


Effective psychotherapeutic and pharmacological treatments 
of anxiety occurring along with normalization in patients’ 
perception and behaviour seem to specifically alter patterns 
of brain activation in these structures. Neuroimaging tech- 
niques might enable therapists and researchers to continuously 
monitor treatment success and thus may contribute to the re- 
finement of existing approaches and the development of more 
effective therapeutic options. Furthermore, neuroimaging 
parameters can help to predict the success of certain treat- 
ment strategies. Thus, clinicians might prospectively be able 
to assign the most promising treatment option to individual 
patients. 
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Anxiety disorders comprise the most prevalent class of psychological 
disorders across the lifespan. Onset frequently occurs early in life, 
with a median age of onset at 11 years [1]. The 12-month prevalence 
estimates exceed 10% [2] and affect approximately 8% of youth in a 
12-month period [3]. Anxiety disorders are also highly comorbid 
with other mental disorders such as behavioural disorders, substance 
use disorder, and eating disorders [4], and frequently co-occur with 
depression, even in youth [5]. There has been some suggestion that 
anxiety in childhood acts as a precursor and risk factor for the subse- 
quent development of depression [6, 7]. A more detailed description 
of the epidemiology is given in Chapter 88. 

The adverse consequences of anxiety disorders are well docu- 
mented and cause impairment across social, emotional, and aca- 
demic domains [8, 9]. For example, young people with anxiety 
disorders have lower self-esteem, are more likely to be socially iso- 
lated, and perform worse academically than their non-disordered 
counterparts [10]. The economic impact of anxiety disorders is sub- 
stantial, with current estimates of £9.8 billion per year in the United 
Kingdom in terms of lost productivity and direct costs to the health 
system [11]. This picture is mirrored in the United States, with costs 
reaching $42.3 billion per year during the 1990s [12]. The financial 
and societal burden posed by anxiety disorders has been recognized 
by world health authorities who have identified it as a public health 
priority [13]. 

One way to address this burden is via prevention. While effective 
treatments for anxiety disorders exist, the shift towards prevention 
has been driven by increased recognition of the shortcomings of 
treatment. It has become clear that the demand for mental health 
services far exceeds the availability of such services and that these 
services are costly and often inaccessible to many [8]. Additionally, 
more than half of individuals with mental health disorders do not 
receive clinical treatment [14], and of those who do, many fail to 
respond or they terminate treatment prematurely [8, 15]. Young 
people have particularly low levels of help-seeking, for reasons 
that include stigma, confidentiality concerns, and beliefs about the 
value of treatment [16]. Accordingly, there is a legitimate case for 
developing preventive interventions for anxiety disorders, which 
may avert some of the aforementioned limitations associated with 
treatment. 














Three levels of prevention were first described by public health re- 
searchers in the 1940s: primary, secondary, and tertiary. Primary 
prevention refers to a reduction in the incidence of a disorder by 
intervening in advance of disorder onset. Secondary prevention 
seeks to reduce the prevalence of psychopathology once symp- 
toms have been identified but are not yet severe or do not meet the 
threshold level for the disorder, while tertiary prevention involves 
the treatment of disorders and relapse prevention [17, 18]. Although 
this classification system is pervasive in public health domains, the 
conceptual overlap between prevention and treatment has led to in- 
consistencies in how prevention is discussed and examined in the 
literature. As a result, the Institute of Medicine (IOM) [19] devel- 
oped a definitional system specific to mental health research, which 
suggested that prevention should comprise programmes designed 
at reducing incidence rates, that is, delivered in advance of disorder 
onset. Three types were described: universal, selective, and indi- 
cated prevention. Universal prevention is delivered to all individ- 
uals within an identified population, regardless of risk. For example, 
universal prevention programmes for youth anxiety are typically 
delivered on a large scale in the school environment to every child 
in the grade [20]. Selective prevention refers to delivering interven- 
tions to individuals who have an increased risk profile for a specific 
disorder. Risk factors can be conceptualized as a characteristic, an 
experience, or an event that increases the likelihood of an adverse 
outcome, relative to the general population [21]. A sound know- 
ledge of risk factors is therefore necessary to inform selective pre- 
vention efforts. Several comprehensive reviews document the risk 
and protective factors of anxiety disorders [8, 22], which include 
genetic (for example, family history), individual (for example, tem- 
perament, anxiety sensitivity), and environmental factors (for ex- 
ample, parental divorce, exposure to violence and crime). Finally, 
indicated prevention refers to approaches that are delivered to indi- 
viduals who have subclinical symptoms but do not yet have a clinical 
disorder. 

The present chapter aims to provide a systematic review of the psy- 
chological interventions that aim to prevent anxiety. This review is 
broader than previous reviews that either have focused individually 
on specific prevention settings, such as school environments [20], 
community settings [23], workplaces [24], or primary care settings 


[25], or have examined specific age groups [26]. Although broad, with 
respect to setting, we restrict our focus on studies which specifically 
measure anxiety outcomes, rather than include studies measuring a 
less specific range of psychopathology, health, or well-being. 


Chapter outline 


Firstly, we will review the existing research studies examining pri- 
mary prevention. We believe it is important in a textbook such as 
this one to establish firmly the effectiveness of prevention for anx- 
iety disorders, to consider the size of the effect, and to indicate for 
which population groups prevention may be of most use. Healthy 
scepticism about anxiety prevention is useful, but if these interven- 
tions result in a reduction of anxiety in a percentage of the popula- 
tion, there is an imperative to begin to systematically apply them. 
Secondly, we then consider the key elements common to effective 
prevention programmes, before thirdly considering barriers to the 
implementation of prevention programmes. Fourthly, we conclude 
with a summary of the current challenges facing the field and sug- 
gestions for future research. 


Systematic review of effectiveness 





Adopting Mrazek and Haggerty’s IOM conceptualization, primary 
prevention is defined here as interventions delivered in advance of 
the onset of anxiety symptoms or disorder and thus include uni- 
versal and selective prevention approaches. It excludes indicated ap- 
proaches targeting specific individuals with symptoms, which could 
be considered early intervention. By including universal approaches 
that target the whole population, it is assumed that approximately 
8% of those included will already be experiencing an anxiety dis- 
order [3]. Although a reduction in incidence is the gold standard for 
prevention studies, we have opted to include studies which focus on 
the total population group, including those universal studies that do 
not explicitly exclude individuals who may already have a diagnosis 
of anxiety. In large universal studies, screening for anxiety diagnosis 
or elevated symptoms prior to study entry is often impractical and 
expensive, and prevention interventions delivered in real-world situ- 
ations are likely to include the full population cohort. For these prac- 
tical reasons, studies that relied on continuous symptom measures, 
rather than diagnostic assessment tools, were included, although it is 
acknowledged in the case of some participants already experiencing 
the disorder that this could be considered early intervention. 


A systematic review of prevention 





of anxiety studies 


Methods 

Search and screening procedures 

PsycINFO, PubMed, and the Cochrane Library databases were elec- 
tronically searched for articles with the key search terms ‘prevent*’ 
AND ‘anxiety’ OR ‘anxious’ OR ‘internalising’ OR ‘internalizing’ 
AND “control? OR ‘random” OR ‘trial’ Reference lists from rele- 
vant reviews were hand-searched for additional articles. 
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The criteria for study inclusion were the following: (1) the clearly 
stated goal of the study was to prevent the incidence of anxiety or 
reduce symptoms; (2) the presence of anxiety disorder or symp- 
toms was the primary outcome (or one of two or three outcomes 
in the absence of identified primary outcomes); (3) a universal or 
selective approach was employed; (4) the study was a randomized 
controlled trial (RCT); and (5) the study was published in a peer- 
reviewed English language journal. Studies were excluded if: (1) the 
focus was on medical or dental anxiety or trauma, post-traumatic 
stress disorder, or acute stress disorder; (2) the study targeted gen- 
eral mental health or well-being, operationalized by the inclusion of 
anxiety as one of more than three outcomes when no primary out- 
come was identified; (3) the study focused on transitory changes in 
state anxiety; and (4) the study conducted a diagnostic assessment or 
screening measure at baseline and allowed those with current or past 
disorder or those with high symptom levels to remain in the study 
(for example, indicated prevention/early intervention). Selective 
prevention studies that did not involve a screening procedure were 
permitted. There were no restrictions in terms of the setting, target 
group, intervention type, or control condition type. Studies were 
examined according to inclusion criteria using a checklist and were 
coded by the first author. Decisions about borderline trials (N = 4) 
were made by the first author and were then discussed with the final 
author. All decisions made by the first author were endorsed, and 
complete agreement reached. 


Effect size calculations 


Standardized effect size estimates for continuous outcomes were cal- 
culated using Cohen's d [27] whereby d is calculated by subtracting 
the mean score of the intervention group from the mean score of the 
control group at post-test or follow-up, and dividing by the pooled 
standard deviation. Effect size estimates are reported for between 
group differences at post-test and follow-up intervals. According to 
Cohen, effect sizes of 0.20, 0.50, and 0.80 are considered small, me- 
dium, and large, respectively. Standardized effect size estimates for 
dichotomous outcomes were first transformed into In (the natural 
logarithm of the odds ratio) and then converted into a standardized 
effect size equivalent to Cohen’s d, using the method described by 
Chinn [28]. Where diagnostic and continuous outcomes were both 
included in a study, both effect size estimates are reported. In studies 
where follow-up occurred at very short intervals (for example, every 
month), data were extracted for key follow-up periods (for example, 
6, 12, and 18 months). 


Results 
Study selection 


The search returned a total of 5276 articles after duplicates were re- 
moved (n = 367). The titles and abstracts of the 5276 articles were 
screened for relevance. Completely irrelevant articles that were un- 
related to the topic were excluded at this stage, while relevant articles 
were retained and full texts were examined (n = 128). Of these, 42 
articles describing 38 trials met inclusion criteria. The remaining 86 
articles were excluded due to: not universal or selective prevention 
(n = 36), not anxiety-focused (n = 21), not prevention (n = 16), not 
an RCT (n = 12), and no full text available (n = 1). See Tables 91.1 
and 91.2 for details of included universal and selective prevention 
studies, respectively. 
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SECTION 14 Anxiety disorders 


Study characteristics 


There were 38 unique studies identified in the current review, which 
included a total of 23,772 participants. Sample size of the included 
studies varied considerably from between 40 participants [29] to 
7741 participants [30], with a median of 188 participants. Included 
studies were published between 1997 and 2016, with 50% between 
2010 and 2016 (and only two studies prior to the year 2000). A ma- 
jority of studies were delivered either in Australia (11; 29%) or in 
America (9; 24%), with a further seven across mainland Europe 
(18.5%), four in Canada (10.5%), four in the United Kingdom 
(10.5%), two in Asia (Taiwan and Korea; 5%), and one in Iran (2.5%). 


Prevention type 


Twenty-two of the studies were universal prevention trials (58%), 
and 16 were selective (42%). Of the 16 trials delivered to selective 
groups, risk was defined as: anxiety sensitivity (n = 2), having a nega- 
tive attributional style (n = 1), attending a low socio-economic school 
(n= 1), having a parent with a past or current anxiety disorder (n = 2), 
being an undergraduate student for a high-risk profession (teacher, 
n= 1; nurse, n = 2), having a high-stress job (n = 1), having a phys- 
ical chronic illness such as stroke, cancer, or hypertension ( = 5), or 
being the primary carer of someone with dementia (n = 1). 


Population and setting 


Recipients of more than half ( = 20) of the programmes were chil- 
dren or adolescents (53%); two programmes were delivered to fam- 
ilies (5%); eight were delivered to university students (21%); one 
was delivered to carers (3%); five were delivered to patients with a 
physical illness (13%), and two were delivered to the general adult 
population (5%). Of programmes delivered to children and ado- 
lescents, effect sizes could be calculated for 18 studies, and of these, 
eight (45%) reported significant reductions in symptoms of anxiety 
(ES = 0.15-0.66), while ten found no difference relative to the control 
condition. Of the trials delivered to university and college students, 
four of the five studies for which effect sizes could be calculated (90%) 
reported positive effects on anxiety prevention (ES = 0.51-1.51). 
The two studies delivered to families prevented the onset of an anx- 
iety disorder over the 12-month follow-up period (ES = 1.22-1.42). 
The intervention delivered to carers was not effective in preventing 
anxiety, and of the five programmes delivered to individuals with a 
physical illness (including stroke, hypertension, chronic obstructive 
pulmonary disease, and cancer), effect sizes could be calculated for 
four. Of these, three (75%) were effective in preventing depression 
(ES = 0.58-1.79). Of the two programmes delivered to the general 
adult population, one study reported positive effects (ES = 0.66), while 
the other did not. The most common setting in which prevention was 
delivered was the school environment (n = 20; 53%), of which 90% 
were delivered universally. Eight trials targeted university and college 
students and were delivered in the university setting (22%). A single 
study was conducted in the workplace (3%), and three studies were 
conducted in general community-based settings (8%). Six trials were 
conducted in hospital or clinic settings (16%), and these were pre- 
dominantly delivered to patients with a chronic illness. 


Content 


A majority of trials (n = 26) delivered a prevention intervention 
based on cognitive behavioural therapy (CBT; 68.4%) or incorpor- 
ated at least some element of CBT, blending it with problem-solving 


therapy (n = 2; 5.3%). Of the programmes based on CBT, effect sizes 
could be calculated for 24, and 15 of these (62.5%) were effective 
in preventing anxiety or reducing symptoms (ES = 0.20-1.79). The 
median effect size for all programmes evaluating CBT (regardless 
of outcome) was 0.25. The programme receiving the most attention 
was the FRIENDS programme (evaluated in ten studies), which is 
a manualized face-to-face programme for children and adolescents 
that can be delivered by teachers or mental health professionals. 
Four of these studies found FRIENDS to be an effective interven- 
tion at post-test (ES = 0.20-0.66), while six did not (ES = -0.18 to 
0.20). Two studies evaluated the Aussie Optimism Programme, 
which is another standardized CBT school-based programme de- 
signed to teach young people about the connection between how 
they think, feel, and behave. Neither of these studies found signifi- 
cant effects (ES = -0.19 to 0.17). Other CBT prevention interven- 
tions included MoodGYM, eCouch, Dominique’s Handy Tricks, 
Learn Young Play Young, Cool Kids, Taming Worry Dragons, and 
ThisWayUp Schools. Ten studies investigated approaches that were 
not CBT-based. Two of these tested a mindfulness intervention, 
one derived from mindfulness-based cognitive therapy and the 
other from mindfulness-based stress reduction (5.3%). Two studies 
(n = 2; 5.3%) involved relationship-focused therapy, delivering the 
Prevention and Relationship Enhancement Programme (ePREP). 
The remaining therapeutic approaches included a meditation pro- 
gramme, a tai chi programme, a relaxation programme, support 
and counselling, problem-solving therapy and escitalopram (an 
antidepressant medication), and a standardized anti-bullying pro- 
gramme known as KiVa based on education and skill development 
(each making up 2.6%). Of the ten trials that delivered therapeutic 
techniques that were not CBT-based, eight reported enough data for 
effect size calculation at post-test, of which six studies (75%) [30-35] 
were effective (ES = 0.41-1.10) in preventing anxiety. 


Programme format, length, and mode of delivery 


More than half (n = 23, 60.5%) of the included programmes were 
delivered in a small group format (typically of groups comprising 
6-10 individuals), and three programmes (7.9%) involved a group 
component, combined with individual sessions (n = 2) or com- 
bined with a self-directed computer-delivered programme (n = 1). 
Six programmes (15.8%) were delivered entirely individually face- 
to-face, and six programmes were delivered by computer (15.8%). 
Programme length ranged considerably from a single intervention 
session to 22 sessions, with most programmes (90%) being delivered 
in between 6 and 12 sessions (median = 8 sessions), usually on a 
weekly or a fortnightly basis. There was substantial variability in 
the duration over which these programmes were delivered, ran- 
ging from between a single session to 52 weeks (median = 8 weeks), 
although it needs to be noted that three studies delivered a single 
session, followed by 7 weeks of reminder emails encouraging parti- 
cipants to implement the skills they acquired during the session [35- 
37]. Sessions ranged in duration from 35 to 120 minutes, with the 
exception of one study where day-long workshops were delivered 
[38] and one study that did not report on session length [34]. 

In terms of personnel delivering the programme, ten studies 
were delivered exclusively by psychologists or mental health pro- 
fessionals (26%); five studies were delivered by researchers or 
graduate students (13%), and three programmes were delivered 
by either psychologists or graduate students (7.9%). One study 


was delivered by a t’ai chi instructor (3%), and one by a specialist 
physician (3%). Of these 20 studies delivered by professionals, ef- 
fect sizes could be calculated for 18, and of these, 13 studies (72%) 
reported a significant preventive effect on anxiety (ES = 0.20-1.79). 
Nine of the included studies were delivered exclusively by teachers 
or school staff within the school environment (23.5%), of which two 
(22%) were effective (ES = 0.13-0.66). Two school-based studies 
involved both school and external staff, comparing a school-led 
version of the programme to a health professional-delivered ver- 
sion [39, 40]. In the Barrett and Turner study [39], the personnel 
delivering the intervention did not impact effect size, with both 
health- and school-led delivery producing significant reductions in 
anxiety (health ES = 0.32; teacher ES = 0.31). Conversely, a differ- 
ence between school-led and health-led prevention was detected 
in the Stallard et al. study [40], with health-led programmes pro- 
ducing significant preventive effects at 12 months (ES = 0.20), 
while teacher-led prevention had no effect (ES = 0.03). Of the six 
computer-delivered programmes, two (5%) were delivered in the 
school environment and supported by teachers and one (3%) in- 
volved a comparison between a school-supported version to a 
version supported by a mental health organization [41]. The re- 
maining three (7.9%) computer-based programmes were entirely 
self-directed and delivered to adults. Of the six computer-delivered 
programmes, four of the five (80%) for which effect sizes could be 
calculated were effective (ES = 0.15-1.51). One study (3%) did not 
report on personnel delivering the programme [42]. 


Evaluation control groups 


Seventeen of the included studies (45%) compared the prevention 
programme to either a no-intervention control group or usual care 
(NJ), nine studies involved a wait-list control group (WL; 23.6%), 
seven studies used an attention control group (AC; 18.4%), and six 
studies involved more than one control group (15.7%). Of these 
six studies involving more than one comparison, three included an 
NI and an AC arm, and two involved a WL arm and an AC con- 
dition. Usual care typically involved regular class attendance in 
school-based programmes or standard care (education, check-ups) 
in medical settings. Attention control conditions are used to con- 
trol for extraneous factors (such as attention, exposure to educa- 
tional material, and non-specific prevention factors such as being 
in a group) that could feasibly impact preventive outcomes. Of the 
studies that included an attention control condition, 60% reported 
significant effects on anxiety (ES = 0.20-1.22), while 58% of trials 
employing NI or WL groups found a significant reduction in anxiety 
(ES = 0.15-1.78). 


Outcome measures 


Anxiety symptoms were measured as outcomes on all but two of 
the included studies (95%). Two studies used diagnostic instru- 
ments exclusively, and five studies included both diagnostic and 
symptom outcomes. Of the studies using diagnostic tools, three 
studies (43%) used the Anxiety Disorder Interview Schedule [43], 
three used the Structured Clinical Interview for DSM-IV disorders 
[44], and the final study (14%) employed the MINI International 
Neuropsychiatric Interview [45]. Anxiety symptoms were meas- 
ured most frequently with the Spence Children’s Anxiety Scale 
(22%) [46], followed by the Multidimensional Anxiety Scale for 
Children (14%) [47] and the State-Trait Anxiety Inventory (14%) 
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[48], followed by the Beck Anxiety Inventory (11%) [49] and finally 
the Revised Children’s Manifest Anxiety Scale (8%) [50]. 


Overall outcomes 


From the 38 studies included in this review, eight studies did not 
include enough information for effect sizes to be calculated or did 
not collect data at post-test (for example, some studies included 
follow-up data only). Of the 30 (79%) studies for which effect sizes 
could be calculated, three of these reported change score analyses 
only, and so group differences at post-test could not be ascertained. 
Out of the 27 (71%) remaining studies, 17 (63%) trials reported that 
the prevention programme significantly improved anxiety outcomes 
(ES = 0.15-1.79), while ten (37%) trials did not (ES = -0.19 to 0.21). 
Twenty-one studies (55%) collected follow-up data, for which ef- 
fect sizes and between-group differences could be calculated for 19 
(50% overall). Of these, 12 (63%) studies reported significant effects 
on anxiety (ES = 0.25-1.78), while seven (37%) did not (ES = 0.00- 
0.33). Of the studies that included follow-up periods, 15 studies 
(39%) did not follow up beyond 12 months, two studies included a 
final follow-up at 18 months (5%), and only three studies followed 
up individuals for longer than 2 years (8%). 


Universal programme outcomes 


Effect sizes could be calculated for all but two of the 22 universal 
trials, but a further three did not report between-group outcomes 
at post-test and/or follow-up. Of the 17 (53%) trials for which ef- 
fect sizes and group differences could be ascertained, 11 trials (65%) 
found positive outcomes on anxiety (ES = 0.15-1.51), while six did 
not (ES = -0.19 to 0.32). Eleven studies included follow-up data, for 
which group differences were reported for nine. Six studies (67%) 
reported an impact of anxiety prevention at follow-up (ES = 0.25- 
1.01), while three trials did not (ES = 0.03-0.33). 


Selective programme outcomes 


Effect sizes for the 16 selective prevention programmes could be cal- 
culated for ten of the studies and ranged from 0.08 to 1.78. Of these 
ten studies, six (60%) found a significant preventive effect on anxiety 
(ES = 0.60-1.79), while four did not (ES = 0.08-0.21). Ten selective 
studies reported follow-up information, with six finding a positive 
effect of anxiety prevention (ES = 0.58-1.78) and four reporting no 
difference between groups on anxiety at follow-up (ES = 0.01-0.22). 





Effectiveness of the primary prevention of anxiety 


Overall, this review suggests that there is legitimate value in 
pursuing primary prevention of anxiety disorders using both uni- 
versal and selective approaches, with a median effect size for all 
included studies of 0.21. More than half of the trials (63%) found 
that the prevention programme either prevented onset of an anx- 
iety disorder or reduced anxiety symptoms, both at post-test and at 
follow-up. Small (0.15) to large (1.78) effect sizes were reported for 
both universal and selective interventions. As might be expected, 
80% of the studies included were delivered to children, adolescents, 
or university students, which is appropriate, given that anxiety often 
develops early in life. Preventive interventions delivered to young 
people have the greatest chance of preventing first onset, which can 
have implications for the trajectory of disorder across the lifespan. 
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Studies that involved prevention in adults and the elderly were more 
likely to be selective prevention programmes, delivered to those at 
risk for disorder development (for example, diagnosis of cancer, ex- 
perience of stroke). 

There has been considerable debate in the literature about whether 
universal or targeted prevention approaches are more effective, and 
to date, there does not seem to be a clear advantage of one over the 
other (for example, [18, 20, 51]). Acommon argument against uni- 
versal prevention is that programmes are delivered to large samples 
with relatively low levels of need. However, the counter-argument is 
that unless prevention is universal, individuals who are on the tra- 
jectory to disorder but do not yet meet a given definition of being 
‘at risk’ will not be captured. Furthermore, anxiety disorders can 
be extremely difficult to detect in children by parents and teachers, 
relative to other disorders [52], making universal approaches de- 
livered in the school environment particularly attractive in terms 
of reaching these individuals. Another advantage of school-based 
universal approaches is that they minimize stigma and are usually 
easier for school administrators to schedule by directly integrating 
them into the school curriculum. Although universal programmes 
are associated with advantages for delivery in schools, outside of 
this environment, universal implementation is more difficult—we 
identified only a single community-based universal programme 
[38] and three universal programmes delivered in university set- 
tings [35-37]. 

Control group type did not appear to influence effectiveness 
outcomes. It is encouraging that approximately 32% of the studies 
included an active control group, which is double what has been 
reported previously [20]. This may reflect an increasing awareness 
of the importance of active control conditions to control for non- 
specific factors associated with the prevention programme, some- 
thing which prevention researchers have been calling for [53]. There 
was virtually no difference in the proportion of studies reporting sig- 
nificant preventive effects on anxiety as a function of control group 
type (60% of studies with an active condition found effects, while 
58% of those employing inactive control group types reported posi- 
tive effects). 

There was substantial variability in effect size estimates, and this 
likely reflects the heterogeneity in study variables (for example, 
setting, age, content). The fact that most selective prevention pro- 
grammes screened participants for a history of anxiety or mental 
disorder and excluded participants on this basis, while universal 
prevention studies did not involve a screening process, is likely to 
have impacted effect sizes. Furthermore, the quality of the research 
methodology also varied significantly, which is very likely to have 
influenced effect size estimates. For example, many studies did not 
report attrition rates or whether fidelity measures were used (for ex- 
ample, independent coding of therapy tapes). 

Although our results provide consistent evidence for the effect- 
iveness of anxiety prevention programmes, findings need to be in- 
terpreted in the context of several limitations, including the fact that 
three studies reported only change scores, so between-group effect 
size estimates at post-test and follow-up could not be calculated. 
A further five studies did not report enough information for effect 
sizes to be calculated. Therefore, the effect size estimates from this 
review are based on 30 studies, and not the 38 that met inclusion cri- 
teria. Finally, a single rater (the first author) screened and extracted 
the data, without an independent coder. 


Key elements common to effective 


prevention programmes 





A consistent finding from both the literature and the current review 
is that programmes can be effective in preventing anxiety. What is 
now needed is insight into the elements of consistently effective pre- 
vention programmes. There is not yet enough existing research to 
confidently address this issue, although there are some emerging 
patterns. 

Much of the research that has been carried out in the field of anx- 
iety prevention has tested psychological approaches and, most fre- 
quently, CBT. More than 70% of the included programmes in the 
current review delivered at least some component of CBT, including 
several trials delivering automated, computer-delivered CBT. CBT 
typically involves a range of different cognitive and behavioural 
strategies, meaning that the active ingredient and corresponding 
mechanisms of change cannot be identified when the overall 
package is examined. Dismantling studies will help to address this 
issue, which may inform the development of more effective preven- 
tion programmes in the future. 

Drawing on the existing literature and the results from the current 
review, factors such as age, gender, prevention type, and setting have 
not consistently been associated with effect size outcomes (for ex- 
ample, [20, 26]). Notably, personnel delivering the programme may 
impact outcome. Existing evidence and data from the current review 
suggests that programmes delivered by mental health professionals 
could be more effective than those delivered by teachers [26], al- 
though this finding has not been consistently replicated [20, 51]. In 
our review, programmes that were delivered by mental health pro- 
fessionals (including psychologists, psychiatrists, doctors, graduate 
students, or researchers) or by computers were associated with a 
higher percentage of successful preventive outcomes than those 
delivered by teachers. However, fewer than one-third of included 
studies were delivered by teachers, and so a relatively low number of 
teacher-delivered programmes precludes strong conclusions about 
the effectiveness of these programmes being drawn. Of note, two 
studies directly compared teacher- vs mental health professional- 
led face-to-face prevention programmes, of which one noted su- 
periority in the professional-delivered programme, while the other 
did not [39, 40]. More research is required to resolve this issue, and 
regardless of outcome, future studies should focus on identifying 
ways to improve the effects of programmes delivered by school staff, 
which may involve the development of teacher training and sup- 
port structures to scaffold teacher competence in delivering mental 
health programmes. 


Barriers to the implementation 


of prevention programmes 





Engagement and fidelity 


Engagement is a key barrier to delivering effective prevention pro- 
grammes. These programmes are notoriously challenging to make 
appealing and relevant to the target audience, who might not per- 
ceive the need for the material or appreciate its value, particularly 
when there is no obvious immediate benefit. A discussion about 
prevention necessarily includes a focus on young people, and the 


emergence of programmes that use technology may have greater 
appeal to young people, particularly those involving gamified and 
interactive components (for example, [41, 54]). These programmes 
also lend themselves to being updated and incorporating techno- 
logical innovations that may be particularly appealing to young 
people, such as completing mood ratings on their own digital device, 
rather than in a classroom setting. Systematic evaluation of engage- 
ment factors is likely to inform how prevention programmes might 
best be delivered at a population level. 

Evidence-based anxiety prevention programmes are unlikely to 
be effective if they are not delivered as intended. Poor adherence 
to the protocol could reflect inadequate training or competence 
on the part of the provider or an overall lack of engagement with 
the programme. In the current review, most studies did not re- 
port on how fidelity to the programme was measured (if it was at 
all), and it is important to keep in mind that fidelity is likely to be 
higher in a rigorous research trial than under real-world conditions. 
One way this barrier might be overcome is for implementation 
and effectiveness-implementation hybrid trials to systematically 
examine factors that are associated with higher levels of engagement 
and fidelity to the programme and how these relate to effectiveness, 
uptake, and completion rates. 


Organizational priority and fit 


A related issue is the compatibility of the prevention programme to 
the culture in which it is being delivered. In the example of school 
settings, the fit within the school culture and the willingness of 
school administrators to schedule time in the school curriculum for 
delivery are likely to influence outcomes. There are many demands 
placed on an already crowded school curriculum, and one of the key 
emerging barriers from our work in the field is willingness of school 
executives to prioritize mental health prevention programmes over 
other valid educational programmes. One way to overcome this is 
to align school prevention programmes with curriculum outcomes, 
such that teachers can fulfil teaching requirements by delivering 
mental health prevention programmes without having to find extra 
time in their schedule. Issues of fit and priority against competing 
demands are likely to apply more broadly to general workplace set- 
tings as well where it might be expected that initiatives and pro- 
grammes that fit with the culture of a workplace would be prioritized. 
Mediating factors likely to influence implementation in a particular 
context include the attitude of the organization to the programme, 
existing stigma, and the attitude and skills of the provider. Empirical 
investigation into these factors as a way to promote engagement and 
adherence to the programme is now needed. 


Cost and sustainability 


The delivery of prevention programmes is expensive. It requires in- 
vestment in terms of paying professionals to deliver the programme 
or training non-mental health specialists to deliver the programme. 
Programmes delivered by lay people, including school staff, provide 
information about whether a programme is likely to be effective 
and suitable for sustainable, large-scale implementation because 
these programmes ultimately need to be delivered by individuals 
who will be available to do so over the long term. Technology-based 
approaches show tremendous promise in providing a low-cost and 
potentially sustainable avenue through which large-scale rollout 
might occur. Individuals with minimal training in the community 
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can support a computer-delivered programme at much lower cost 
than undergoing training themselves to deliver complicated psycho- 
logical therapies. 


Current challenges facing the field and suggestions 





for future longer research follow-up 


Most of the research in the field of anxiety prevention has been 
conducted over the past decade. Although growing, the existing 
evidence base is relatively scarce, compared to the treatment lit- 
erature, and even research into the prevention of depression. What 
this means is that we do not yet know what the longer-term impact 
of delivering prevention programmes is, and the field now needs 
studies that assess participants over longer follow-up periods. To de- 
tect a true preventive effect, a difference in the incidence of disorder 
or symptoms needs to be identified as differentially increasing be- 
tween prevention and control groups. The use of latent growth curve 
modelling techniques may help to identify the impact of prevention 
programmes on trajectories towards disorder over time. The impact 
of prevention is hypothesized to be long term, and yet there are not 
enough data available to address this question, with only three of 
the included studies following participants over 2 years or longer. 
Researchers are encouraged to assess participants over significantly 
longer periods, so that genuine long-term effects can be detected. 
The increase in the use of digital technology may assist in the ease 
with which participants may be contacted and followed over longer 
periods. 


More rigorous research methods 


Many of the studies in this review, together with previous prevention 
reviews, were of poor methodological quality. For example, 47% of 
the included studies involved less than 200 participants, suggesting 
inadequate power to detect preventive effects, while fewer than half 
of the studies in the current review did not conduct intent-to-treat 
analyses. These statistics are not unique to the current review and 
are broadly consistent with what has been reported previously (for 
example, (20, 51]). Given some of these methodological weaknesses, 
it is possible that effect size estimates are inflated, and so results need 
to be interpreted cautiously. The increasingly common requirement 
to publish protocol papers in advance of completing an RCT and to 
adhere to Consolidated Standards of Reporting Trials (CONSORT) 
guidelines [55] is likely to enhance the overall quality of RCTs 
moving forward. 


Cost estimates 


An important next step in this line of research is to identify the cost- 
effectiveness of delivering anxiety prevention programmes in these 
environments. We know of one study that has examined the cost 
of delivering the FRIENDS programme universally in schools, with 
researchers reporting no evidence for its cost-effectiveness [56]. 
This is clearly an important issue from a public health perspective 
that future work will need to address. Again, the issue of cost may 
be addressed, at least in part, by the emergence of new, effective 
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digital platforms through which prevention programmes might be 
delivered. 


Final comment 


A common criticism of prevention programmes is that they are as- 
sociated with modest effect sizes. Prevention studies require very 
large sample sizes to detect effects because of relatively low base 
rates of disorder incidence found in unselected samples [3, 57]. The 
picture emerging from the literature is that with enough statistical 
power, genuine small preventive effects may exist. Even small ef- 
fect sizes are likely to be associated with meaningful and clinically 
important improvements at a population level. The current review 
found a median effect size of 0.21 at post-test, which includes all 
studies meeting inclusion criteria (not just those with significant 
between-group differences). This estimate is in line with mean 
weighted effect sizes reported in a previous meta-analysis of youth 
anxiety where Cohen's d at post-test was 0.18 (for example, [26]). 
Specifically, the median effect size for CBT programmes, the most 
widely studied approach, was 0.25 at post-test. Effect sizes of this 
magnitude may be of practical significance, as they apply to a whole 
population. Adopting the Kraemer approach [58] to estimate the 
number needed to treat, approximately 9.2 individuals would need 
to be delivered the prevention programme for a favourable preven- 
tion outcome in one individual. It is not possible to know exactly 
how many people are needed to prevent onset, as the reliance in 
the studies reviewed largely rely on symptom measures, for which 
we cannot be sure what reduction is needed to prevent incidence. 
However, given that comparable effect sizes in studies using diag- 
nostic vs continuous measures (and specifically those included in 
this review), this suggests that 9.2 is a valid estimate. Considering 
that participants included universal, non-selected samples, the po- 
tential impact of prevention is substantial, particularly if prevention 
can be delivered to large samples at low cost, possibly using digital 
technologies. The successful prevention of anxiety disorders has im- 
portant implications at both an individual and societal level, and 
investigation into the factors that promote the successful implemen- 
tation of prevention programmes, together with more attention to 
factors which moderate outcomes and whether they are sustained, 
is now warranted. 
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Anxiety is an understandable response to perceived threat or experi- 
enced stress and is typically fleeting and controllable. It can be con- 
ceptualized as an ‘alarm, allowing a physical and mental response to 
perceived danger (the ‘fight-or-flight response). Anxiety symptoms 
are mostly mild and transient, though many individuals experience 
severe and persistent symptoms that cause considerable personal 
distress and impair social and occupational function. A distinc- 
tion is often made between physical (or ‘somatic’) symptoms, which 
mainly result from autonomic arousal or muscular tension (for ex- 
ample, shortness of breath, palpitations, tremor, and headache) and 
psychological (or ‘psychic’) symptoms, including apprehension, ir- 
ritability, and worrying. 

Anxiety symptoms are common among patients undergoing 
examination, investigation, and medical or surgical treatment but 
are also frequent in community settings in physically well individ- 
uals. If distressing and impairing anxiety exceeds specified severity 
thresholds and persists beyond minimum duration requirements, 
and providing its symptoms are not explicable by another condi- 
tion, an anxiety disorder can be diagnosed. These disorders share 
common psychological and physical symptoms but differ in the 
characteristic features that aid specific diagnosis (Table 92.1), for 
example recurrent unexpected panic attacks in panic disorder or 
the fear of embarrassment and humiliation in social anxiety dis- 
order. Accurate delineation of a suspected anxiety disorder can be 
challenging, but a simple algorithm can aid diagnosis (Fig. 92.1). 
Differential diagnosis from depression and other mental disorders is 
usually simple (Table 92.2). 
































Some people with anxiety receive unnecessary or inappropriate 
treatment, as mild symptoms of recent onset and associated with 
stressful events will often improve spontaneously. However, the 
persistence and associated disability of anxiety disorders means 
that most patients who meet criteria for diagnosis are likely to 
benefit from pharmacological or psychological interventions. 
Unfortunately, many patients who could benefit from treatment are 
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not recognized. The need for treatment should be determined by 
ascertaining the severity and persistence of symptoms, their impact 
on everyday life, the level of coexisting depressive symptoms, and 
other features such as a previous good response to medication or 
psychotherapy [1]. 

The choice of treatment is influenced by clinical characteris- 
tics, patient and doctor preferences, and the local availability of 
potential interventions. There is much overlap across anxiety dis- 
orders for evidence-based effective therapies—such as prescrip- 
tion of a selective serotonin reuptake inhibitor (SSRI) or a course 
of individual cognitive behavioural therapy (CBT)—but there are 


Table 92.1 Characteristic features of anxiety disorders 


Generalized Prolonged excessive worrying that is not restricted to 
anxiety particular circumstances. Worries often centre on possible 
disorder physical ill health affecting themselves or family members, 


and patients can repeatedly present with medically 
unexplained physical symptoms or craving reassurance or 
requesting medical investigations. 


Panic disorder Recurrent unexpected surges of severe anxiety (‘panic 
attacks’), which typically reach a peak within 10 minutes and 
ast around 30-45 minutes. Patients may believe they are 

in imminent danger of death or collapse and seek urgent 


medical attention. 





Agoraphobia Fear and avoidance of public spaces and other situations 
(crowds, transportation) from which immediate escape 
may be difficult; in clinical samples, most patients also have 


expected panic attacks and some have comorbid panic 


disorder. 

Social anxiety arked and persistent fear of being observed or evaluated 

disorder negatively by other people, in social or performance 

(social situations. Many avoid consulting doctors, but some present 

phobia) with physical symptoms (such as excessive perspiration) or 
psychological symptoms (such as fear of vomiting in public). 

Simple Excessive or unreasonable fear of (and restricted to) single 

phobia people, animals, objects, or situations (for example, dentists, 
spiders, lifts, flying, seeing blood), which are either avoided or 
are endured with significant personal distress. 

Separation Fear or anxiety concerning separation from those to whom 

anxiety an individual is attached; common features include excessive 

disorder distress when experiencing or anticipating separation from 
home and persistent excessive worries about potential harms 
o attachment figures or untoward events that might result in 





separation. 
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Fig. 92.1 Suggested scheme for exploring a suspected anxiety disorder. 


differences in treatment response between disorders, and it helps to 
be familiar with the evidence base for each condition (Table 92.3). 
Depressive symptoms often accompany anxiety disorders; around 
one-third of people with anxiety disorders also fulfil diagnostic cri- 
teria for a depressive episode. Treatment of depression will usually 
relieve anxiety symptoms when depression is the primary diag- 
nosis, but if depression is comorbid or follows an anxiety disorder, 
each condition requires separate consideration and often specific 
treatment [1]. 


Pharmacological and psychological treatments 


The overall efficacy of psychological and pharmacological ap- 
proaches in acute treatment of anxiety disorders has been regarded 
as broadly similar, although the findings of a recent meta-analysis 
have questioned this assumption [2]. The strongest evidence for 
acute treatment is for judicious prescription of an SSRI or under- 
taking manualized CBT delivered by trained and supervised staff. It 
is uncertain whether combining these approaches is associated with 
greater improvement than with either treatment given alone, at least 
in some disorders. Sequential steps in patient management, such as 
the ‘stepped care’ approach recommended by the National Institute 
for Health and Care Excellence (NICE), are recommended if a pa- 
tient does not respond to first-line approaches [3]. Continuation 
treatment, following a satisfactory response to acute treatment 
(ideally resulting in remission of symptoms), is needed in all pa- 
tients with anxiety disorders to consolidate the response and reduce 
the risk of relapse. It has been argued that psychological treatments 


may be more effective than pharmacological treatments in keeping 
patients well, though the evidence to support this view is limited and 
has been questioned. 

Many patients worry about starting and continuing pharmaco- 
logical treatment, fearing problems such as unwanted sedation, 
weight gain, or the potential risk of becoming dependent on pre- 
scribed medication. By contrast, others are reluctant to engage 
in a psychological treatment that is often limited in availability, 
emotionally intrusive, and time-consuming. Regardless of the re- 
commended treatment modality, patients should be advised that 
transient worsening of symptoms can occur and that prolonged ef- 
forts are needed to consolidate and maintain an initial response to 
treatment. 


Access to treatment and referral to psychiatric services 


Stigma influences the perception of psychiatric illness and its treat- 
ment. Patients with anxiety disorders may fear being dismissed as 
having minor ‘lifestyle’ complaints and can be discouraged from 
seeking help and undergoing treatment. Many patients may receive 
conflicting messages about the benefits and risks of pharmacological 
treatment. Similarly, access to evidence-based psychological treat- 
ments is often prioritized for those with ‘severe and enduring mental 
illnesses’ such as schizophrenia, though this rationing reflects the 
widespread failure to recognize the impairment and chronicity of 
many anxiety disorders. 

Most patients with anxiety disorders can be managed effectively 
within primary care, but some need the extra expertise of secondary 


Table 92.2 Important common differential diagnoses in anxiety 
disorders 


Differentiation from anxiety disorder 


Depressive Early morning waking, feeling worse in the morning, loss 
illness of capacity for pleasure, constipation, guilty thoughts, 
and suicidal thoughts all suggest depression, rather than 
anxiety. Depressive symptoms in anxiety disorders tend to 
develop after the psychological and somatic symptoms 
hat characterize the anxiety disorder (for example, 
anticipation of embarrassment, anxiety, and avoidance in 
social phobia). 


Psychotic illness Delusions, hallucinations, and thought disorder are not 


seen in patients with primary anxiety disorders. 


Psychostimulant Use of amphetamines, ecstasy, cocaine, and hallucinogens 
use can all result in agitation and severe anxiety, including 
panic attacks. Primacy of drug-seeking behaviour 

and physical signs of intoxication (such as stereotypic 
movements with amphetamine use) support the diagnosis 
of drug dependency. Excess consumption of caffeine- 
containing drugs can result in physical and psychological 
symptoms of anxiety. Many novel psychoactive substances 
can cause anxiety, but their full effects are still not 
established. 











Drug withdrawal Abrupt withdrawal of opiates, alcohol, barbiturates, 
benzodiazepines, or antidepressants can result in agitation, 
tremor, dizziness, gastrointestinal upset, and insomnia. Anxiety 
disorders are not associated with acute confusional states or 
with marked autonomic instability. Characteristic physical 
signs are seen after withdrawal from certain drug classes such 


as pupillary dilatation when withdrawing from opiates. 


Physical ill 
health 


Anxiety symptoms are common in many physical 
health problems and can be the presenting feature (for 
example, in thyrotoxicosis, recurrent hypoglycaemia, 
complex partial seizures, paroxysmal tachycardia, and 
phaeochromocytoma). 


care mental health services. Referral criteria will vary, depending on 
service availability. It seems reasonable to expect that patients should 
be referred by general practitioners to secondary care services when 
there is uncertainty regarding a possible underlying diagnosis (such 
as schizophrenia), after the non-response to two evidence-based 
acute treatment approaches, when there has been a recurrence of 
symptoms despite continuing treatment, when anxiety disorders 
are comorbid with other disorders (for example, depression or sub- 
stance use), when there is a risk of suicide, or when there is aneed for 
specialist intervention (for example, dopamine antagonist augmen- 
tation after a partial response to an SSRI) [1]. 


Table 92.3 Summary of possible treatment options in anxiety disorders 
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Generalized anxiety disorder 


Generalized anxiety disorder (GAD) is characterized by excessive 
worrying, lasting more than 6 months and not restricted to par- 
ticular circumstances (for example, only when attending a social 
gathering). Common features include apprehension, tension, and 
difficulty in concentrating, and autonomic symptoms such as dry 
mouth and abdominal discomfort. GAD is one of the most common 
mental disorders in primary care but is often not recognized, pos- 
sibly because only a minority of patients present with its charac- 
teristic psychological symptoms. The most important differential 
diagnosis is depressive illness (although GAD and major depression 
often occur together), and patients should be asked about key de- 
pressive symptoms such as reduced interest, loss of weight, and sui- 
cidal thoughts, but distinction between the two conditions can be 
difficult. 

Psychological and pharmacological treatments have broadly 
similar efficacy. Based on efficacy, safety, and tolerability, first- 
line pharmacological treatment will usually involve an SSRI, a 
serotonin-noradrenaline reuptake inhibitor (SNRI), or pregabalin 
(a calcium channel modulator). Once a drug is started, patients 
should be treated for a minimum of 4 weeks before deciding whether 
or not that treatment is working. If no response is seen after this 
time, increasing the dosage or switching to an alternative treatment 
may be advisable. The optimal duration of any drug treatment re- 
mains unclear, but a minimum of 12 months’ further treatment is 
recommended in patients who have responded to medication [1]. 

A meta-analysis suggests fluoxetine may be the most likely drug 
for achieving response and symptomatic remission, while sertraline 
is the best tolerated [4]. Neither is currently licensed for treatment 
of GAD in the UK, though NICE recommends sertraline as initial 
treatment. At present, five medications are licensed for GAD in the 
UK: escitalopram and paroxetine (both are SSRIs), duloxetine and 
venlafaxine (both are SNRIs), and pregabalin. Among these drugs, 
the same meta-analysis found duloxetine was most likely to produce 
a beneficial response, escitalopram most likely to establish symptom 
remission, and pregabalin most likely to be best tolerated. Relative 
to SSRIs or SNRIs, pregabalin might have an earlier onset of effect. 
It also reduces the sleep disturbance commonly seen in GAD, both 
directly and indirectly, via the reduction of anxiety. It is generally tol- 
erated well (although dizziness and drowsiness are common during 
acute treatment), is eliminated unchanged in the urine and has few 
drug interactions, and has a low rate of discontinuation symptoms 


Generalized anxiety disorder Panic disorder with/ Specific phobia Social anxiety disorder 
without agoraphobia 


First line 
Psychological CBT CBT Exposure-based therapy CBT 
Pharmacological SSRI SSRI SSRI SSRI 
Second line 
Psychological Applied relaxation Supportive psychotherapy CBT Combination (CBT + SSRI) 
Pharmacological SSRI, SNRI, pregabalin SSRI, SNRI SSRI SSRI, SNRI, SSRI plus 
benzodiazepine 
Third line 
Psychological Uncertain Combination Uncertain Uncertain 
(CBT + SSRI) 

Pharmacological Agomelatine, buspirone, Psychodynamic Uncertain MAOI, pregabalin, 

quetiapine, TCA, benzodiazepine psychotherapy benzodiazepine 





TCA, MAOI, benzodiazepine 


No randomized controlled trials of adult separation anxiety disorder are currently published (August 2017). 
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[5]. However, concern has been raised about the possible risks of de- 
pendence and abuse [6]. 

Dopamine-serotonin antagonists, particularly quetiapine, have 
been investigated in GAD, both as monotherapy and as adjuncts to 
antidepressants. A potential benefit is an earlier onset of action, but 
this must be balanced against potential adverse effects such as meta- 
bolic syndrome. Use of quetiapine is therefore mainly limited to a 
second-line role, but it might be appropriate to use quetiapine in 
primary care in this way as long as the clinician feels s/he has suf- 
ficient experience and competence. Agomelatine, which acts as an 
agonist at melatonin MT1 and MT2 receptors and as an antagonist 
at 5HT,c receptors, is generally well tolerated and has no significant 
discontinuation symptoms and relatively few drug interactions, 
when compared with SSRIs and SNRIs. Agomelatine is also effica- 
cious in GAD, both in acute treatment and in preventing relapse 
[7]. Due to its mechanism of action, it may be particularly beneficial 
in patients with marked sleep disturbance. Preclinical studies with 
the novel ‘multimodal’ antidepressant drug vortioxetine suggested 
potential anxiolytic effects, and it reduces anxiety symptoms in de- 
pressed patients; however, despite its efficacy in preventing relapse, 
randomized placebo-controlled trials of acute treatment of GAD 
have produced inconsistent findings [8]. Vilazodone has both SSRI 
and 5-HT,, partial agonist properties and is effective in acute treat- 
ment of GAD [9]; it may have an advantage, compared with SSRI or 
SNRI treatment, in having a lower incidence of treatment-emergent 
sexual dysfunction. 

There is uncertainty about how best to proceed after a non- 
response to first-line treatment. A pragmatic approach suggests 
employing a further monotherapy before proceeding to augmen- 
tation strategies. Other efficacious drugs (usually reserved for spe- 
cialist psychiatric treatment) include benzodiazepines (alprazolam, 
diazepam), imipramine, and the 5-HT,, partial agonist buspirone. 
Clinicians and patients have concerns about the use of benzodi- 
azepines, mainly the long-term risk of dependence, though this 
risk has to be weighed against the potential relief an individual with 
chronic, severe, distressing, and disabling symptoms might derive 
from intermittent use or more prolonged treatment [10]. When at- 
tempting to relieve acute exacerbations of symptoms, a short-acting 
benzodiazepine (for example, lorazepam) may be preferable to a 
longer-acting alternative due to its more rapid onset of effect and a 
lower risk of sedation. A prescriber should evaluate the likelihood of 
developing problematic use before issuing a first or subsequent pre- 
scription [11] (Box 92.1); for example, individuals who abuse drugs 
and/or alcohol and those with chaotic lifestyles are more likely to 
develop long-term problems with benzodiazepines. 

Many psychological treatments have been investigated in GAD, 
but CBT and applied relaxation have the strongest supporting 
evidence. 

CBT offers an effective alternative to pharmacological treat- 
ments and can help around half of patients with GAD to achieve 
significant symptom reduction and to improve their level of func- 
tioning. Although other psychological approaches may produce 
some benefit, relapse rates over the longer term are probably 
lower with CBT [12]. However, CBT may be less efficacious in 
older adults than in adults of ‘working age’ [13]. Augmentation 
of escitalopram treatment with CBT is more effective in reducing 
worrying than escitalopram alone, but there is at present min- 
imal evidence for an additional benefit when combining other 


Box 92.1 Predictors of the risk of dependence 
upon benzodiazepines 


Participation in self-help group for medication dependence 
Younger age 

Longer duration of use 

igher dose use 

Lower level of education 

on-native cultural origin 

Outpatient treatment for alcohol and/or drug dependence 
Short duration of elimination half-life 

igher levels of depression and anxiety 
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Reproduced from Compr Psychiatry, 45(2), Kan CC, Hilberink SR, Breteler MH, 
Determination of the main risk factors for benzodiazepine dependence using a 
multivariate and multidimensional approach, pp 88-94, Copyright (2004), with per- 
mission from Elsevier Inc. 




















pharmacological and psychological treatments, although this is 
often done in practice. 


Panic disorder with or without agoraphobia 


Accurate diagnosis depends on establishing the presence of recur- 
rent unexpected panic attacks, with relative freedom from anxiety (at 
least initially) between these attacks, together with associated con- 
cern, worry, or change in behaviour. Coexisting depressive symp- 
toms are common, and patients with comorbid depressive episodes 
are typically more impaired by symptoms, make greater use of health 
services, and have a worse prognosis. Panic disorder can occur with 
or without agoraphobia—anxiety in feared situations from which es- 
cape might prove difficult or embarrassing—and agoraphobia can 
either precede or follow the development of panic attacks. 

Treatment is aimed at achieving complete recovery from panic 
attacks, resolution of anticipatory anxiety (fearful expectation of 
attacks), and abolition of agoraphobia [1]. Panic attacks can be 
shortened by rebreathing exhaled air, which corrects the abnormal 
blood chemistry that arises from rapid shallow breathing. In estab- 
lished panic disorder, psychological and pharmacological interven- 
tions are helpful. As with GAD, NICE guidance for panic disorder 
recommends a stepwise approach to patient management. 

SSRIs are the preferred first-line drug treatments in primary care, 
as they cause less drowsiness and carry lower risks of tolerance and 
dependence than benzodiazepines. In secondary care, treatments 
include venlafaxine, the selective noradrenaline reuptake inhibitor 
reboxetine, and the tricyclic antidepressants (TCAs) imipramine 
and clomipramine. Certain SSRIs, venlafaxine, some TCAs, and 
the reversible monoamine oxidase inhibitor (MAOI) moclobemide 
have demonstrated long-term efficacy in clinical trials, though SSRIs 
tend to be better tolerated. Patients must follow a restricted diet 
while taking an MAOI, and these compounds are therefore reserved 
for second- or third-line treatment. 

There are few studies to guide pharmacological management of 
treatment resistance, but several strategies may be beneficial. These 
include augmentation of fluoxetine with the beta-blocker pindolol, 
switching to an agent that acts via different receptors (for example, 
non-responders to citalopram could be switched to reboxetine, and 
vice versa), augmentation of a TCA with fluoxetine or augmentation 
of fluoxetine with a TCA, combination therapy with sodium valproate 
and clonazepam, or augmentation of clomipramine with lithium. 


Psychological and pharmacological interventions have broadly 
similar efficacy and acceptability [14]. A network meta-analysis in- 
dicates that CBT is probably superior to other psychological treat- 
ments, though non-specific supportive psychotherapy also appears 
efficacious and psychodynamic psychotherapy may have fewer as- 
sociated dropouts from treatment [15]. Routinely combining CBT 
with medication as a first-line approach in all patients is not recom- 
mended, but findings from a meta-analysis suggest that combining 
CBT with drug treatment can enhance the efficacy of CBT. CBT 
alone may be more cost-effective than either SSRI alone or SSRI 
when combined with CBT [16]. 


Specific (simple) phobia 


Specific fears of particular objects, animals, or situations are very 
common, but only a minority of affected people present for treat- 
ment, this often being at the time of changes in domestic or career 
responsibilities. The impairment associated with specific phobia 
is usually limited, but the presence of comorbid specific phobia 
in other anxiety disorders or depression can increase their associ- 
ated disability. The effectiveness and acceptability of psychological 
or pharmacological treatments for specific phobia are less well re- 
searched than in other anxiety disorders. 

A meta-analysis of randomized controlled trials indicated that 
exposure-based therapies (particularly those involving in vivo ex- 
posure) are more effective than other psychological interventions— 
effectiveness being seen regardless of the nature of the phobia, and 
greater with multiple, rather than single, sessions [17]. Patients with 
‘blood-injury phobia can develop hypotension and bradycardia 
during exposure sessions, so they may require additional muscular 
tension exercises to benefit fully from treatment. Most patients re- 
spond to a few treatment sessions based on exposure techniques, 
but SSRI treatment (escitalopram, paroxetine) is helpful when pa- 
tients have not responded to exposure therapy. It is unclear if con- 
comitant benzodiazepine use enhances or reduces the efficacy 
of exposure therapy. Efficacy may be increased marginally by d- 
cycloserine administration prior to a session, though not all evi- 
dence is consistent [18]. 


Social anxiety disorder (social phobia) 


Social anxiety disorder is characterized by the intense, persistent 
fear of being scrutinized or evaluated negatively by others. Patients 
anticipate ridicule or humiliation and either avoid many social situ- 
ations or endure them with great distress. Some experience panic 
attacks, but the condition can be distinguished from panic disorder, 
as in social phobia, the attacks occur expectedly in feared situations. 
Furthermore, in panic disorder, the presence ofa friend typically pro- 
vides some reassurance, but in social phobia, this makes little differ- 
ence. Shyness is a core symptom of social phobia, though this should 
be distinguished from healthy individuals who are merely shy and 
typically experience little anxiety, functioning well at home and at 
work. A significant proportion of patients develop depression, and 
differentiating the disorders can sometimes be difficult, but in social 
phobia, the capacity to enjoy oneself remains and energy levels are 
not impaired significantly. Many patients use alcohol or other drugs 
in an attempt to quell anxiety symptoms prior to a social encounter. 
Medications with proven efficacy in acute treatment include 
most SSRIs (escitalopram, fluoxetine, fluvoxamine, paroxetine, and 
sertraline), venlafaxine, the MAOIs phenelzine and moclobemide, 
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some benzodiazepines (bromazepam, clonazepam), some anticon- 
vulsants (gabapentin, pregabalin), and olanzapine. There is little evi- 
dence of a dose-response relationship in acute treatment. Response 
is more likely in patients whose symptoms have reduced in inten- 
sity within 2 weeks of starting treatment. Beta-blockers can reduce 
physical symptoms of anxiety in some discrete performance activ- 
ities (for example, acting or singing), but they do not help in patients 
with generalized social phobia who fear and avoid multiple social 
situations. As with GAD, SSRI treatment is usually regarded as being 
first line, based on efficacy, tolerability, and safety [1]. 

CBT is the most established psychological treatment. A network 
meta-analysis of the comparative efficacy of pharmacological and 
psychological treatments indicated that SSRIs and venlafaxine are 
superior to pill placebo, and CBT superior to ‘psychological placebo’ 
(that is, non-specific psychological intervention) [19]. In one study, 
the combination of phenelzine and group CBT was more effective 
than either treatment given alone, on measures of both response and 
remission, but there is inconsistent evidence regarding whether the 
combination of CBT with medication is associated with greater im- 
provement than with either treatment given alone [20]. 

SAD is typically a long-lasting condition, and successful response 
to acute treatment should be followed by at least 6 months of con- 
tinuation treatment, in order to consolidate recovery and prevent 
an early relapse of symptoms. However, relapse prevention studies 
demonstrating the long-term efficacy of pharmacological treatments 
(escitalopram, paroxetine, sertraline, pregabalin) are limited. Patients 
who make only a limited initial response to CBT may benefit if fur- 
ther CBT is combined with medication, and those who respond only 
partially to pharmacological treatment may benefit if further drug 
treatment is combined with CBT. Combination of an SSRI (sertraline) 
with a benzodiazepine (clonazepam) has been found superior to con- 
tinuing with sertraline alone or switching to venlafaxine [21]. 


Separation anxiety disorder 


Differential diagnosis between separation anxiety disorder and 
panic disorder is difficult, but in patients with the former, the pri- 
mary concern is fear of separation, rather than fear of a panic at- 
tack. Furthermore, patients with panic disorder are preoccupied 
with their personal health and safety, whereas individuals with sep- 
aration anxiety disorder are concerned for the well-being of their 
attachment figures. Differential diagnosis between separation anx- 
iety disorder and GAD can also be troublesome, as worrying that 
something bad might happen to loved ones is a feature common to 
the two conditions. Distinction largely rests on establishing that in 
GAD, the fear and worry of losing loved ones is just one of a wide 
range of worrisome themes, whereas in separation anxiety disorder, 
the central and often only concern is fear of, and worry about, losing 
a major attachment figure. Other important differential diagnoses 
include dependent personality disorder, emotionally unstable per- 
sonality disorder, and morbid jealousy [22]. 

The efficacy of psychological or pharmacological treatment in 
adults with separation anxiety disorder has not been studied exten- 
sively, and treatment studies in children have often involved mixed 
diagnostic groups. The findings of randomized, placebo-controlled 
trials of pharmacological treatment in children with separation anx- 
iety disorder provide no convincing evidence of benefit for any medi- 
cation. However, the SSRIs fluvoxamine and sertraline have been 
found efficacious among the separation anxiety disorder subgroup of 
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patients within mixed diagnostic samples. Psychological treatment 
studies in children find some evidence of benefit with CBT, parent- 
child interaction training, and ‘summer camp’ programmes [23]. 


Room for improvement in the treatment 
of anxiety disorders 


A wide range of options is available for the pharmacological and psy- 
chological treatment of patients with anxiety disorders. Current re- 
commendations included within evidence-based guidelines and 
consensus statements are derived mainly from the findings of random- 
ized, double-blind controlled trials, which suggested robust efficacy 
and generally good tolerability for many interventions, both in acute 
treatment and in preventing relapse. However, the patient groups that 
participate in randomized controlled trials may have a better prognosis 
than patients treated in ‘real-world’ clinical settings. In routine prac- 
tice, response rates to initial treatment can be disappointing; many pa- 
tients will experience unwanted effects during treatment; others will 
relapse despite adhering to treatment, and some will be affected by 
troublesome discontinuation symptoms. Furthermore, at present, it 
is not possible for doctors to predict the likelihood of a treatment re- 
sponse in an individual patient with great accuracy, and there are con- 
tinuing uncertainties about optimal steps in the further management 
of patients after non-response to first-line treatment approaches. 

For the time being, attempts to optimize clinical outcomes rest 
largely on making the best use of currently available treatments. 
However, there is much room for improvement in the pharma- 
cological and psychological treatment of anxiety disorders. 
Improvements in clinical outcomes could result from the develop- 
ment and arrival of approaches with novel mechanisms of action, 
leading to greater efficacy and improved acceptability in all patient 
groups. Alternatively, novel treatments could lead to enhanced ef- 
fectiveness and higher acceptability in certain specific patient sub- 
groups, when combined with the use of reliable biomarkers for 
identifying which individuals are most likely to benefit. 
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Obsessive-compulsive disorder (OCD) was once thought to be a 
relatively rare disorder; however, the lifetime population prevalence 
is now estimated at 2.3% [1]. Sustained research efforts have led to 
a better understanding of the aetiology, presentation, and treatment 
of OCD. However, debate continues regarding the conceptualization 
and classification of OCD in diagnostic systems. Further, there is 
increasing recognition that OCD is a clinically heterogenous dis- 
order, and thus treating OCD as a unitary disorder may obscure 
important aetiological and clinical variables. This has led to an 
interest in identifying subtypes and dimensions of OCD and exam- 
ining dimension-specific associations between OCD and important 
constructs. 





Clas ation of OCD 








Historically, OCD was classified as an anxiety-related disorder due 
to similarities between OCD and anxiety disorders across core 
symptoms of fear and avoidance, cognitive styles, and response to 
pharmacological and psychological therapies [2]. However, a com- 
peting conceptualization of OCD is that it belongs to a separate 
group of disorders characterized by repetitive thoughts and behav- 
iours, often referred to as the obsessive-compulsive spectrum [3]. 
Evidence supporting this approach has emerged from studies of 
genetic risk factors, comorbidity, familial transmission, neuroana- 
tomical correlates, neuropsychological functioning, and treatment 
response [4, 5]. Despite initially proposing the creation of a shared 
anxiety and obsessive-compulsive spectrum [6], the fifth edition of 
the Diagnostic and Statistical Manual of Mental Disorders (DSM-5) 
[4] saw the creation of a separate ‘Obsessive-Compulsive and Related 
Disorders’ (OCRD) chapter, placed after the ‘Anxiety Disorders’ 
chapter. The DSM-5 OCRD chapter includes OCD, body dys- 
morphic disorder, hoarding disorder, trichotillomania (hair pulling 
disorder), excoriation (skin picking disorder), and other specified 
and unspecified OCRDs. A similar OCRD chapter is proposed for 
the forthcoming eleventh edition of the International Classification 





of Diseases and Related Health Problems (ICD-11), which will also 
include hypochondriasis and olfactory reference disorder [5]. 

The creation of a separate OCRD chapter is not without criticism, 
particularly regarding the close relationship between OCD and the 
anxiety disorders, the limited evidence for the association between 
some OCRD disorders, and the distinctive presentation of excori- 
ation and trichotillomania (for example, see [2] for a critique of the 
empirical validity of the DSM-5 OCRD chapter). Further research 
on the classification of OCD is clearly needed, especially given that 
some of the OCRD diagnoses are new or have been substantially 
changed, for example hoarding disorder and excoriation in DSM- 
5 and body dysmorphic disorder in ICD-11. Despite the ongoing 
debate about the classification of OCD, there is a strong consensus 
regarding the diagnostic criteria and clinical features of OCD. 








Core symptoms 


Obsessions are persistent and unwanted thoughts, images, or urges 
that evoke distress or negative affect such as anxiety, disgust, feelings 
of incompleteness, or unease [5, 7]. Individuals typically attempt to 
counteract obsessions, either by ignoring, resisting, or controlling 
them, avoiding things that may trigger the obsession, or by neu- 
tralizing their obsessions with a thought or action. Compulsions 
are repetitive behaviours or mental acts performed in response to 
an obsession, to achieve a feeling of completeness or rightness, or 
following very specific rules. Compulsions are unrealistic or exces- 
sive and function to relieve distress or negative affect or to prevent a 
feared outcome from occurring. Compulsions can include observ- 
able behaviours, such as rearranging items or excessive cleaning, 
or mental acts such as making mental lists, neutralizing unaccept- 
able thoughts with ‘good’ thoughts, or praying in response to sexual 
thoughts [8]. Examples of common types of obsessions and compul- 
sions are presented in Table 93.1. 

Although a diagnosis of OCD only requires the presence of ob- 
sessions or compulsions, they co-occur in 99% of cases of OCD 
[8]. To meet diagnostic criteria, obsessions or compulsions must be 
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Table 93.1 Examples of obsessive-compulsive disorder symptom dimensions 





Symmetry/ordering 
example, household objects are misaligned 


Contamination/cleaning 
pollution 


Harm/aggression may be responsible for a murder 
Sexual/religious 


Hoarding 
uture 





Miscellaneous ntrusive non-sensical images, songs, or words 


Concern about developing cancer from environmental 


Unwanted images of violent sexual behaviour towards others 


Do not throw things away as they may be important in the 


Uncomfortable sensation that things do not ‘feel right’; for 


Touching, tapping, or rubbing things in special ways 


Repeated and excessing cleaning such as cleaning all household 
surfaces with bleach every day 


Checking that nothing bad happened, asking others if they are 
ok, checking the news for reports of murders 


Praying over and over to prevent the sexual behaviour from 
occurring 


Hoarding unusual items or excessive acquisition of objects like 
magazines 


Getting stuck performing everyday tasks such as getting dressed, 
eating, and work tasks 


Source: data from Mol Psychiatr., 11(5), Rosario-Campos MC, Miguel EC, Quatrano S, et al., The Dimensional Yale-Brown Obsessive-Compulsive Scale (DY-BOCS): an instrument for 
assessing obsessive-compulsive symptom dimensions, pp. 495-504, Copyright (2006), Springer Nature. 


associated with significant distress or impairment and usually occur 
for at least 1 hour per day [4]. Obsessions and compulsions should 
be carefully assessed to differentiate them from symptoms of other 
mental disorders, for example from worries as found in generalized 
anxiety disorder, appearance preoccupation in body dysmorphic 
disorder, and ritualized eating in eating disorders. 


Clinical correlates 


Lifetime comorbidity is very common, reported by 90-92% of those 
with OCD [1, 9]. The most common types of comorbid disorders, 
as defined using the fourth edition of the Diagnostic and Statistical 
Manual of Mental Disorders [10], were anxiety disorders (70-76%), 
mood disorders (61-63%), impulse-control disorders (36-56%), 
and tic disorders (28%). The most common individual disorders 
were major depressive disorder (41-56%), social phobia (31-44%), 
specific phobia (31-43%), separation anxiety disorder (28-37%), al- 
cohol use disorders (8-39%), and generalized anxiety disorder (8- 
34%). Further details are given in Chapter 95. 

OCD is associated with high levels of functional impairment in 
both community [1] and clinical populations [11]. Individuals with 
OCD reported poor social functioning, particularly relating to 
social and leisure activities and romantic relationships [12]. Their 
social functioning quality of life was poorer than those with schizo- 
phrenia, panic disorder, or major depressive disorder [11]. This 
may be due to the excessive time spent on obsessions or compul- 
sions and the often secretive nature of their symptoms. Suicidality 
appears to be relatively common in OCD; thoughts that life was not 
worth living were reported by 60% of those with OCD, 38% reported 
suicidal thoughts, 22% had suicidal plans, and 11% had attempted 
suicide [13]. 

Family accommodation of OCD symptoms is also common, 
whereby parents or partners of the affected individual engage in 
OCD-related behaviours such as washing their hands excessively, 
changing family routines to avoid triggering or exacerbating OCD 
symptoms, taking on responsibilities for the individual such as a 
child’s chores, or providing reassurance [14]. 

OCD is typically a chronic disorder, with a 5-year full remission 
rate of 17% and a partial remission rate of 22%, and a relapse rate of 
59% [15]. Chronic OCD was associated with longer illness duration, 
higher baseline OCD severity, and greater burden of illness [15, 16]. 


Insight 


Insight refers to the extent to which the individual recognizes that 
they are experiencing obsessions and compulsions, that these symp- 
toms are excessive, and that their concerns are unrealistic [17]. 
Insight occurs on a spectrum—good or fair insight indicates that 
the individual recognizes that their OCD-related beliefs are untrue 
or that they may be false; poor insight indicates that the individual 
believes these are likely to be true, and absent insight or delusional 
beliefs indicate that the individual is convinced that their beliefs are 
true [4]. Although OCD with absent insight is often referred to as 
delusional OCD, it is not considered to be a psychotic disorder [4]. 
This is clinically important as those with delusional OCD should 
receive treatment recommended for OCD, rather than treatment in- 
dicated for psychotic disorders [18]. 

Absent or poor insight was reported in 9-22% of clinical samples 
and was associated with greater OCD severity and earlier OCD onset, 
a lower probability of achieving remission following treatment, and 
a need for a higher number of medication trials during treatment [8, 
19, 20]. Poor insight can complicate the process of psychotherapy; 
thus, therapists should adapt the therapy content to the level of OCD 
insight, for example, including additional psychoeducation about 
insight, spending extra time on symptom identification, and adding 
additional sessions [17]. 


Subtypes of OCD 


OCD is a highly heterogenous disorder. The heterogeneity of OCD 
has led to concerns that treating OCD as a unitary disorder may 
obscure differential associations between OCD subtypes and aetio- 
logical factors, clinical features, and even treatment response [21]. 
This has led to efforts to identify meaningful subtypes of OCD based 
on specific clinical features. 


Tic-related OCD 


Tics are particularly common in those with OCD, with a history of 
tics reported by 30-55% of youth with OCD [22, 23] and by 24% 
of adults with OCD [8]. Simple tics involve rapid and recurrent 
movements or sounds with no clear purpose such as jerking limbs, 
blinking eyes, grunting, or squeaking [24]. In contrast, complex 
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tics involve repetitive behaviours that appear purposeful, such 
as touching and rubbing in certain ways, imitating peoples man- 
nerisms, or repeating phrases, and tics which are often elicited by 
specific stimuli. Tic-related OCD was characterized by a higher pro- 
portion of males, an earlier onset of OCD, and possibly with greater 
OCD severity and higher rates of some comorbid disorders [8, 22, 
23, 25]. Tic-related OCD may be associated with lower overall effi- 
cacy of SSRIs and greater response to neuroleptic augmentation of 
SSRIs [25], but was not associated with differences in cognitive be- 
havioural therapy (CBT) outcome [22, 23]. Due to the evidence that 
tics may be associated with distinctive clinical features, tic-related 
OCD was added as a specifier to DSM-5, and tic disorders will be 
cross-referenced with OCD in ICD-11 [4, 5]. 


Age of onset 


The mean age of OCD onset was reported at 19-22 years [1, 26], 
though symptoms are often present for several years before full dis- 
order onset. Age of onset has been proposed to delineate subtypes, 
with studies suggesting that early onset may be associated with male 
gender, greater OCD severity, more symptoms of attention-deficit/ 
hyperactivity disorder, lower rates of employment, and a greater 
probability of living alone in adulthood [26-28]. Conversely, later 
disorder onset is relatively rare, and onset after the age of 40 was 
associated with female gender, greater frequency of precipitating 
factors and traumatic events, and lower odds ofa positive family his- 
tory of OCD [26, 29]. Unfortunately, interpretation of age of onset 
findings across studies is complicated due to the use of different cri- 
teria for onset. Further, some studies divide participants based on 
pre-determined developmental groups such as child, adolescent, or 
adult, while others use statistical methods to determine group mem- 
bership. Further research is needed before robust subtypes of OCD 
based on age of onset can be identified; thus, age of onset was not 
included as a specifier in DSM-5 [25]. 


Symptom presentation 


Many studies have attempted to classify participants based on the 
focus of their OCD symptoms, for example washing or checking. 
The most common measure used to assess OCD is the Yale-Brown 
Obsessive-Compulsive Scale (Y-BOCS), which contains checklists 
of obsessions and compulsions grouped into similar symptom types, 
and items assessing the severity of obsessions and compulsions [30]. 
Factor analysis of the Y-BOCS typically identifies four or five fac- 
tors, and a meta-analysis of 21 studies reported a four-factor solu- 
tion: (1) symmetry, repeating, ordering, and counting; (2) cleaning 
and contamination; (3) forbidden thoughts, aggression, sexual, reli- 
gious, and somatic; and (4) hoarding [31]. These factors were similar 
across child, adolescent, and adult samples [32] and were temporally 
stable [33]. 

Another common measure for assessing OCD dimensions is the 
Dimensional Yale-Brown Obsessive-Compulsive Scale (DY-BOCS), 
which identifies six dimensions: (1) symmetry, ‘just right, counting, 
and arranging; (2) contamination and cleaning; (3) harm content, 
including aggression, injury, violence, and accidents; (4) moral, 
sexual, and religious content; (5) hoarding; and (6) miscellaneous, 
including somatic and superstition [21]. 

A review of studies that classified participants according to their 
primary OCD symptoms found differences across treatment re- 
sponse, neuroimaging findings, and neuropsychological deficits 


[34]. However, 81% of those with OCD reported symptoms across 
multiple symptom groups [1], challenging the validity of assigning 
individuals into discreet symptom-specific groups. Instead, a multi- 
dimensional model of OCD symptoms has been proposed [35]. 





A multi-dimensional approach to OCD symptoms 


In the multi-dimensional model of OCD, symptoms are conceptu- 
alized as forming several overlapping dimensions based on the con- 
tent of these symptoms [35]. Although there are some differences in 
the factors identified in the Y-BOCS and DY-BOCS, the following 
factors are the most robustly identified. 


Symmetry/ordering 


Symmetry/ordering symptoms may be the most frequently reported 
OCD symptom type [9, 36] They involve obsessions about things 
being symmetrical or ‘just right, for example the alignment of ob- 
jects or needing school work to be done perfectly. Compulsions can 
involve rewriting over and over, performing actions a special number 
of times, or touching or tapping in specific ways. A review of studies 
found that symmetry/ordering symptoms have been associated with 
perfectionism, ‘not just right’ sensations, magical thinking, and 
compulsive slowness [37]. There may also be associations between 
symmetry/ordering and anger, suicide, and traumatic life events. 


Contamination/cleaning 


Contamination/cleaning symptoms are often considered ‘classic’ 
symptoms of OCD. Contamination obsessions often involve con- 
cerns about contracting an illness or disease, sensations of being 
dirty or unclean, and fears of spreading contaminants to others. 
A wide range of contaminants can be feared such as germs, pol- 
lution, bodily fluids, and household/environmental chemicals. 
Individuals often go to great length to avoid dirty or contaminated 
objects, people, or situations and, when exposed to these, may com- 
pulsively and excessively clean themselves or contaminated objects. 
Contamination/cleaning symptoms may be closely associated with 
feelings of disgust, which can take longer to habituate than feelings 
of anxiety [37]. 


Forbidden thoughts 


Symptoms related to forbidden thoughts are sometimes identified 
as a single dimension [31], but they are often divided into separate 
dimensions [4, 21] of harm/aggression and sexual/religious symp- 
toms. Harm/aggression symptoms include obsessions about causing 
harm to self or others, or other violent or aggressive acts occurring. 
Associated compulsions can include checking that the feared out- 
come did not occur or mental rituals to neutralize or distract from 
the thoughts. Sexual/religious obsessions involve religious taboos 
and blasphemy, morality, and improper sexual thoughts, and asso- 
ciated compulsions can involve ritualized praying and religious be- 
haviour, avoidance of situations that may trigger the obsessions, and 
checking that the feared outcome did not occur. 

Forbidden thoughts may vary from other OCD dimensions in 
their personal significance; for example, obsessions about sexu- 
ally assaulting someone may be associated with greater shame and 
negative self-perceptions, compared to obsessions that are morally 
neutral such as symmetry concerns [38]. The fear of harming others 
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may be associated with particularly severe social withdrawal, due 
to the feared outcomes of the obsessions [12]. Forbidden thought 
obsessions may also face a greater burden of stigma; individuals in 
the community are least likely to recognize forbidden thoughts as a 
symptom of OCD, compared to other symptom types, and commu- 
nity members report a greater level of fear and perceived dangerous- 
ness and a greater desire for social distance in those with forbidden 
thought symptoms, compared to other OCD symptoms [39]. 


Other symptoms 


Hoarding symptoms were initially considered a subtype of OCD and 
thus are included in OCD measures. However, hoarding disorder 
has since been established as a separate diagnosis due to its distinct 
genetic risk factors, phenotype, comorbidity patterns, and poor re- 
sponse to OCD-focused treatment [2, 40]. As hoarding is discussed 
in depth in Chapter 99, it will not be a focus of this chapter. The 
Y-BOCS and DY-BOCS also contain miscellaneous symptoms such 
as appearance obsessions, intrusive thoughts about numbers, and 
excessive list-making. The miscellaneous group of symptoms are 
generally not considered to be a discreet OCD dimension due to the 
heterogeneity of the symptom content and thus are not analysed in 
most studies. 


Evidence supporting the multi-dimensional model 


of OCD symptoms 





The interest in identifying OCD symptom dimensions stemmed not 
only from a desire to better characterize the disorder, but also from 
the possibility that symptom dimensions may have differential as- 
sociations with important aetiological and clinical factors. Evidence 
supporting the multi-dimensional model of OCD has been re- 
ported across a diverse range of research areas. A summary of key 
findings is presented, but note that the identified symptom dimen- 
sions vary somewhat across studies, depending on the measures and 
methodology used. 


Genetics and family history 


The familial transmission of OCD has long been recognized, with 
twin studies and family linkage studies indicating that OCD in- 
volves polygenic influences, likely on serotonergic, glutamatergic, 
and dopaminergic neural systems [41]. For a full discussion of the 
genetics of OCD, see Chapter 96. A small number of studies have ex- 
plored dimension-specific familial and genetic associations. In sib- 
ling pairs with OCD, forbidden thought symptoms had the highest 
sibling concordance, whereas other symptoms were weakly or non- 
significantly associated [42]. Interestingly, sibling associations were 
strongest among female pairs, and weaker in male and cross-sex 
pairs. In OCD patients, those with a family history of OCD were 
more likely to report symptoms of contamination/cleaning and 
symmetry/ordering than those without a family history of OCD 
[43]. A population study of Brazilian children also reported familial 
transmission of OCD symptom dimensions, with the strongest 
familiality found for contamination/cleaning symptoms [36]. 
Another family history study found specific familial transmission 
of contamination/cleaning symptoms, but not of other symptom 


types [44]. 


Studies of the genetic heritability of OCD dimensions indicate 
both common and dimension-specific genetic influences. In one 
family history study, all OCD symptom dimensions were heritable, 
but specific genetic associations were found between forbidden 
thoughts and both contamination/cleaning and doubting symptoms 
[45]. A large twin study indicated that around 50% of the variance 
in OCD symptoms was due to genetic factors, that 30-35% of the 
genetic effect in checking, obsessing, and ordering symptoms was 
due to specific genetic factors, but that washing symptoms showed 
no specific genetic factors [40]. Another twin study found evidence 
of differential associations between OCD symptom dimensions 
and the genetic overlap with OCRD and anxiety disorders [46]. 
Specifically, sexual/religious and contamination/cleaning symp- 
toms had a stronger genetic association with anxiety disorders than 
with OCRDs; symmetry/ordering and harm/aggression symptoms 
had a stronger association with OCRDs than anxiety disorders, 
and symmetry/ordering showed a high proportion of specific gen- 
etic factors. In most cases, common genetic factors were strongest 
when considering both anxiety disorders and OCRDs in the models. 
Single gene studies have also reported dimension-specific asso- 
ciations; symmetry/ordering symptoms were associated with the 
DRD4 dopamine receptor gene [47], and washing symptoms with 
the HTR3E serotonin receptor gene [48]. 


Neuroanatomy and neuropsychology 


Abnormalities in fronto-striatal functioning are robustly associated 
with OCD [41], and there appears to be dimension-specific asso- 
ciations with anatomical and functional differences in these brain 
regions [49]. In studies using whole-brain voxel-based morph- 
ometry, symmetry/ordering symptoms were associated with dif- 
ferences in the volume of the left lateral orbitofrontal cortex and 
insula, aggressive/checking symptoms with differences in insula 
volume, and washing symptoms with the volume of the premotor 
cortex [50]. Patterns of striatal functional connectivity also differ 
across the dimensions; aggressive symptoms were associated with 
decreased connectivity with the amygdala and increased connect- 
ivity with the ventromedial frontal cortex, whereas sexual/religious 
symptoms were associated with greater connectivity between the 
ventral caudate and the mid and anterobasal insular cortex [51]. 
Further, heightened amygdala activation to fearful faces was asso- 
ciated with increased severity of aggressive/checking and sexual/re- 
ligious symptoms, suggesting that these types of symptoms may be 
particularly related to abnormal fear processes [52]. Theories of psy- 
chopathology are now increasingly informed by neuroscience (see 
Chapters 94 and 97). 

There is also evidence of dimension-specific neuropsychological 
deficits. Contamination/washing symptoms were associated with 
deficits in visuo-spatial skills, working memory and attention, and 
impulsive response patterns; checking was associated with deficits in 
association learning, and symmetry with poorer verbal fluency; and 
forbidden thoughts with improved performance on attention and 
working memory and visuo-spatial scanning [53]. Symmetry/or- 
dering symptoms have been associated with specific forms of neuro- 
psychological impairment (for a review, see [54]); for example, in 
youth with OCD, symmetry/ordering symptoms are associated with 
greater deficits in non-verbal fluency, processing speed, and inhib- 
ition and switching [54]. 
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Clinical features 


A range of clinical features have been associated with OCD symptom 
dimensions. In those seeking treatment for OCD, forbidden 
thoughts were associated with higher overall OCD severity, both be- 
fore and after receiving CBT [38]. OCD-related insight also varied 
by dimension; higher scores for contamination symptoms predicted 
poorer insight, whereas higher forbidden thoughts scores predicted 
better insight [19]. There are also differences in the association be- 
tween OCD dimensions and broader obsessive-compulsive cogni- 
tive domains; symmetry/ordering was specifically associated with 
perfectionism and intolerance of uncertainty cognitions, forbidden 
thoughts with importance and control of thoughts cognitions, and 
doubt/checking with cognitions related to increased responsibility 
and threat estimation [55]. Contamination/cleaning symptoms 
were not significantly associated with the examined obsessive- 
compulsive cognitive domains, suggesting either a lack of specific 
cognitive associations or associations with cognitions not assessed 
by the study measure. 

Suicidal ideation and behaviours have been independently as- 
sociated with symmetry/ordering symptoms [56] and with sexual/ 
religious symptoms [57]. Sexual/religious symptoms were also as- 
sociated with poorer social functioning [12]. Another study found 
specific associations between quality of life and OCD symptom 
dimensions; contamination/cleaning symptoms were associated 
with impairment in leisure, social, and health-related quality of life, 
symmetry/ordering symptoms with social quality of life, and for- 
bidden thought symptoms with health-related quality of life [58]. 

The onset and course of OCD may also vary across symptom di- 
mensions. In one study, early disorder onset was associated with re- 
ligious and symmetry obsessions and repeating compulsions [27]. 
Symmetry concerns had the earliest age of onset of the dimensions, 
and the presence of forbidden thoughts was associated with a more 
fluctuating course of illness and reduced chances of reporting a 
deteriorating course [59]. Contamination/cleaning symptoms were 
associated with a chronic OCD course [16]. Primary harm obses- 
sions have been associated with higher odds of OCD remission over 
time [15]. 

Some studies have explored the association of OCD symptom di- 
mensions with other demographic variables. For example, those with 
tic-related OCD report an earlier age of onset for cleaning symptoms 
than those without tics [59]. Symptom dimensions may also vary 
by gender; females report a greater frequency and severity of harm/ 
aggression and contamination/cleaning symptoms, whereas males 
report a greater frequency and severity of sexual/religious symp- 
toms [13]. Although symmetry/ordering frequency did not differ by 
gender, it was reported as more severe by females. 

There is also evidence that symptom dimensions may be as- 
sociated with different comorbidity profiles [9]. Sexual/religious 
symptoms were specifically associated with mood disorders, panic/ 
agoraphobia, social phobia, separation anxiety disorder, non- 
paraphilic sexual disorder, somatoform disorders, body dysmorphic 
disorder, and tic disorders. Harm/aggression symptoms were as- 
sociated with comorbid post-traumatic stress disorder, separation 
anxiety, impulse control disorders, and excoriation. Contamination/ 
cleaning was associated with hypochondriasis. Interestingly, there 
were no specific comorbidity associations for symmetry/ordering 
symptoms. In a different study of siblings with OCD, forbidden 


thoughts were associated with higher odds of comorbid major de- 
pressive and bipolar disorders, hypochondriasis, separation anxiety 
disorder, tic disorder, and body dysmorphic disorder; symmetry/or- 
dering symptoms with higher odds of alcohol dependence, bulimia 
nervosa, and attention-deficit/hyperactivity disorder; and contam- 
ination/cleaning symptoms with separation anxiety disorder [42]. 


Overview of findings 


Studies have replicated a multi-dimensional model of OCD symp- 
toms across individuals from different developmental stages, 
cultural backgrounds, and recruitment settings. Further, there 
is support for dimension-specific associations in genetic, neural, 
and clinical features of OCD. However, future research should 
focus on further characterizing the nature of these dimensions 
to resolve discrepancies across studies. Further development of 
assessment measures and statistical techniques is also required 
to better assess these dimensions. Consistency in identified di- 
mensions will allow clearer comparisons across studies and may 
help to clarify associations with meaningful clinical variables. 
Despite these limitations, there is evidence that OCD symptoms 
comprise several overlapping dimensions that have differential 
associations to important aetiological and clinical variables. 


Important research advances have led to an improved under- 
standing of the core features of OCD and to the reclassification of 
OCD in DSM-5 and ICD-11. A core focus of OCD research has been 
to better understand the heterogenous presentation of the disorder, 
with the aim of identifying meaningful disorder subtypes. Studies 
attempting to classify participants using features like age of onset, 
tic status, or primary symptom types have largely been superseded 
by the multi-dimensional model of OCD symptoms. This notion 
that OCD symptoms can be understood as a number of overlapping 
symptom dimensions has received support across a wide range of 
studies. Further, there is evidence of dimension-specific associations 
across studies of family history and genetics, neuroanatomy and 
neuropsychological functioning, clinical features, and treatment 
response. The multi-dimensional model of OCD thus provides a 
framework for an improved understanding of OCD. 
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As described in Chapter 93, obsessive-compulsive disorder (OCD) 
is a condition featuring obsessions (intrusive ideas) and compul- 
sions (repetitive behaviours such as checking, washing, or iterated 
thought patterns) associated with high levels of anxiety, and con- 
servative estimates suggest it may affect 2.9 million persons across 
the EU (that is, 0.7% prevalence) [1]. The most severe forms lead 
to high costs both for the individual and society as a whole [2]. 
While there are currently many validated pharmacological and 
psychotherapeutical treatments for OCD, 20-30% of patients do not 
respond to them [3]. This is particularly problematic, given that 10% 
of these patients show severe symptoms (for example, compulsions 
taking >4 hours/day), resulting in high social and financial burdens 
to both families and society in general. 

In this chapter, we describe some promising approaches that have 
been developed in recent decades to better understand the func- 
tional and pathophysiological basis of OCD at various different 
levels, including: 


e Improved understanding of functional impairments and behav- 
ioural dimensions that most affect patients (macro-scale level). 

e Characterization of dysfunctional brain activities and their related 
neural circuits (meso-scale level). 

e Identification of specific cell type and neuronal assemblages impli- 
cated in obsessive and/or compulsive disorders (micro-scale level). 


Importantly, we refer to studies carried out both on humans and 
animal models, providing a wider range of tools to investigate the 
dysfunctional neural activity associated with this condition. In par- 
ticular, we consider the extracellular recording of single neurons 
at different brain sites during specific behaviours and also the re- 
cent development of optogenetic approaches that allow the study of 
neural activity with a high degree of spatial, cellular, and temporal 
specificity. Despite the major challenges and limitations in con- 
ducting such a translational approach across species, we believe that 





it offers the best means of unravelling the neural basis of psychi- 
atric diseases [4, 5]. Our description of studies in human is comple- 
mented and extended in Chapter 97. 

We do not aim exhaustively to review all the possible therapeutic 
strategies to treat OCD here but will illustrate the potential of a 
multi-level approach using powerful, recently developed tools to de- 
velop new treatments for OCD, using some relevant examples. 








Numerous studies on OCD patients over the past two decades 
have consistently identified abnormal patterns of activity in one 
neural circuit in particular—the cortico-basal ganglia (CBG) loop 
[6], which is known to serve both motor and cognitive functions. 
Anatomical and functional studies of the CBG provide further evi- 
dence of their role in the parallel processing of sensorimotor, asso- 
ciative, and limbic information arising from topographically and 
functionally distinct cortical areas [7]; sensorimotor, associative, 
and limbic information appears to be segregated along a dorsolat- 
eral to ventromedial axis within the CBG (Fig. 94.1). 

‘These circuits are used to differing extents during the learning of 
a behaviour (either motor or cognitive). While the ventral loop— 
the ‘reward circuit’—is more important during initial behaviour 
acquisition, it is the dorsolateral loop that is crucial once learning 
becomes habitual [7, 8, 9]. Recent studies have supported the view 
that the dorsomedial and the dorsolateral striatum complement 
each other in the elaboration and consolidation of learnt behav- 
ioural sequences. This associative CBG loop is essential in linking 
contextual information with its appropriate motor response. The as- 
sociative loops can therefore modulate the action of sensorimotor 
loops in the control of stereotyped actions [10, 11]. 

This organization within distinct functional loops has been char- 
acterized in the context of the pathophysiology of OCD through 
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Fig. 94.1 Schematic illustration of the convergence of projections from 
the cerebral cortex (CX), caudate nucleus (CD), and globus pallidus (GP) 
in the basal ganglia. Different colours represent the types of information 
processed through the CBG: limbic (light blue), associative (dark grey), 
and sensorimotor (light grey). 

Adapted from Proc Natl Acad Sci, 104(25), Mallet L, Schüpbach M, N'Diaye K, 

et al, Stimulation of subterritories of the subthalamic nucleus reveals its role in the 
integration of the emotional and motor aspects of behaviour, pp. 10661-10666, 
Copyright (2007), with permission from National Academy of Sciences, U.S.A. 





a number of studies on both human and animal models. In recent 
years, novel investigative techniques have emerged while more es- 
tablished methods have improved, allowing more detailed studies of 
dysfunctional brain circuits and the neural basis of OCD. 


Animal models 


Detailed experimental evidence on the role of the different CBG 
loops in OCD is scarce in human subjects, mostly due to ethical 
issues arising from the use of invasive procedures and the time and 
spatial specificity constraints inherent in non-invasive imaging 
techniques. For these reasons, and with the development of genet- 
ically engineered mutant mice, we have witnessed the emergence of 
numerous animal models relevant to OCD-like behaviours within 
the last decade [12]. These animal models allow detailed study of the 
specific mechanistic processes that could be altered in OCD treat- 
ments; the results of recent studies are promising and point to in- 
vestigations of specific targets in human patients. Two recent studies 
have shown that in a mouse model expressing compulsive-like be- 
haviour, compulsions could be drastically reduced by targeting the 
lateral orbitofrontal cortex (OFC) and the medial part of the stri- 
atum (that is, the caudate in humans) [13, 14]. The combination of 
animal models of obsessive-compulsive spectrum symptoms with 
new methods available to neuroscientists should allow to better in- 
vestigate the links between the genes implicated in OCD-related dis- 
orders and the neurophysiological circuits affected and behavioural 
phenotypes [13, 15-17]. 

The main characteristics of validating mutant animals as models 
of OCD are functional (OCD-related behaviour), neurophysiologic 
(the neural circuits affected by the mutation), and genetic (OCD 
candidate genes). The most promising mouse models developed so 
far share common behavioural phenotypes characterized by exces- 
sive grooming, which can be interpreted as a proxy for compulsive 
behaviour in mice, that is, an analogue of abnormal ritualistic and 
stereotyped behaviours that are overexpressed despite their negative 
consequences (for example, deleterious skin lesions). From a neural 


circuits point of view, there is striking evidence that a mutation in 
all these mouse models alters the normal functioning of the CBG 
circuits known to be implicated in OCD (Fig. 94.2). 

The introduction of gene mutations causing symptoms related 
to OCD had anatomical consequences such as a decrease in striatal 
volume [15], but also more consistently pointed towards dysfunc- 
tional synaptic transmission, particularly in the striatum compart- 
ment (for example, equivalent to caudo-putamen areas in humans) 
receiving cortical inputs. These features have been shown following 
the deletion of genes coding for SAPAP3, Slitrk5, and EAAC1— 
proteins located post-synaptically in striatal neurons [13, 15, 18]. 
These mutations had electrophysiological consequences resulting 
in abnormally hyperactive striatal neurons, supporting the idea that 
elevated striatal activity is implicated in OCD pathophysiology, as 
suggested by some imaging studies in human patients [14, 16]. 


Imaging 
Structural connectivity MRI 


Diffusion-weighted imaging (DTI) is now a well-established tech- 
nique to infer brain connectivity based on magnetic resonance 
imaging (MRI) of the random motion of water molecules in tissues 
[19] (see Chapter 12). Because this motion is firmly restricted by the 
sheets of myelin surrounding the axons, the diffusion pattern reveals 
the main direction of these axons locally. The integration of infor- 
mation on the local directionality of white matter fibres can then 
be used to infer the entire network of connections within the brain 
using the tractography techniques now popular. 

The directional dependence of molecular diffusion in white 
matter (‘anisotropy’) can be used to reconstruct tracts through the 
brain in vivo by assuming that the main direction of diffusion in a 
voxel indicates the local orientation of white matter fibres [20, 21]. 
A number of DTI tractography algorithms have been proposed to 
reconstruct fibre tracts [20]. Tractography enables the investigation 
of both tract-specific white matter lesions and the changes in con- 
nection probability between distinct brain regions (Fig. 94.3). 

DTI has been used recently to study the pathophysiology of OCD 
and possible deficits in neural circuit connectivity in particular. 
Most of the studies have confirmed the implication of the CBG and 
its related fibres such as the cingulate bundle, the corpus callosum, 
and the anterior limb of the internal capsule [22]. 

Thanks to the increasing resolution and refinement of ana- 
lytical algorithms in the field of anatomical imaging, it has been 
possible to discover new connections of interest. For example, the 
subthalamic nucleus (STN) has only recently been the subject of 
structural connectivity studies using diffusion MRI. Very few em- 
pirical data have yet been collected on the role of this nucleus in 
OCD, although its direct connectivity to the cortical areas, known as 
the hyperdirect pathway, has been implicated in action selection and 
decision-making [23]. A detailed connectivity map of the cortico- 
STN pathway in OCD patients, compared to healthy subjects, would 
therefore be of great interest in assessing its involvement in this 
condition. 

With the advent of high magnetic field systems (>3 T) capable of 
producing strong gradients, diffusion methods have been extended 
from human to small animal imaging. With such MRI systems, 
high resolution can be achieved (in the order of 100 microns) and 
very small structures in rodent brains can be visualized to study the 
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Fig. 94.2 Central role of the CSTC circuitry in obsessive-compulsive disorder in humans and compulsive-repetitive behaviours in mice. (a) Illustration 
of CBG loops in the human brain. (b) Illustration of equivalent CBG loops in a mouse brain. CTX, cortex; STR, striatum; CAU, caudate; PUT, putamen; 
HIP hippocampus; THAL, thalamus; STN, subthalamic nucleus; SNr, substantia nigra pars reticulata; GPe, globus pallidus externa; GPi, globus pallidus 
interna; SC, superior colliculus; BS, brainstem; CB, cerebellum. (c) Examples of four candidate OCD gene expression patterns that have been targeted in 
mutant models. Note the overlapping implication of the CBG in these different expression patterns. 


Reproduced from Curr Opin Neurobiol., 21(6), Ting JT, Feng G, Neurobiology of obsessive-compulsive disorder: insights into neural circuitry dysfunction through mouse 
genetics, pp 842-848, Copyright (2011), with permission from Elsevier Ltd. 





Fig. 94.3 (see Colour Plate section) CONNECT/Archi atlas of the human brain white matter connectivity developed at NeuroSpin, as part of the 
European CONNECT FP7 project. 


Reproduced courtesy of Cyril Poupon,CEA NeuroSpin. 
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Fig. 94.4 (see Colour Plate section) Representative three-dimensional image of colour-coded fractional anisotropy in a mouse brain based on 


diffusion MRI at 11.7 T. 
Reproduced courtesy of S. B. Sébille and M. D. Santin, ICM-CENIR. 


peripheral nervous system [24], as well as neurodegenerative dis- 
eases such as multiple sclerosis, Alzheimer’s disease, and Parkinson's 
disease (PD) [25]. Recently, feasibility studies of DTI tractography 
have allowed the identification of neural networks in rodent brains 
in target regions relating to homologous limbic and associative re- 
gions [26] (Fig. 94.4). 

With the high definition offered by recent MRI systems (up to 17 
T), one could hope to detect potential structural differences in the 
CBG circuits between wildtype and animal models of OCD. One of 
the main advantages using anatomical connectivity in these models 
is that these data can be compared to the functional connectivities 
which can be finely characterized in animal models by using innova- 
tive investigative tools (for example, dense chronic single neurons 
recording and/or optogenetic neuromodulation). 


Functional connectivity MRI 


Once the anatomical features of a given circuit are established, the 
next challenge is to describe its functional aspects. Such character- 
ization may be achieved by analysing the long-distance synchroniza- 
tion between structures connected by the circuit. 

By using fMRI [for example, echo planar imaging (EPI)] to 
measure the slow changes in blood oxygenation level-dependent 
(BOLD) signal across distant brain regions, one can identify con- 
nected networks. This technique allows the detection of spontan- 
eous patterns of functional connectivity MRI (fcMRI). In recent 
years, the fcMRI approach has identified large-scale brain networks 
interacting, while the subject is at rest, as well as while engaged in 
an active task. Across neurological and psychiatric disorders, ab- 
normal patterns of fcMRI have been interpreted as correlates of 
altered brain-scale network functioning, so providing a potential 
marker for abnormalities in information processing. For example, 
recent studies of fcMRI in OCD have reported changes in func- 
tional coupling between the striatum and frontal regions, compared 
to healthy controls. Nakame et al. [27] found increased functional 
connectivity between the ventral striatum and the OFC and ventral 
medial prefrontal and dorsal lateral prefrontal cortices of subjects 
with OCD using this method. Similarly, Harrison and colleagues 


[28, 29] consistently found that this augmented functional connect- 
ivity between the OFC and the ventral striatum was positively cor- 
related with OCD severity. However, in the particular case of the 
associative-limbic hyperdirect pathway, the small volume of the STN 
(relatively to the typical voxel size used with functional imaging, for 
example 2 mm isotropic) poses special difficulties. 

At the whole-brain scale, functional imaging of OCD points to- 
wards hyperactivity in the limbic and associative cortical regions 
[anterior cingulate cortex (ACC) and OFC] and related basal ganglia 
areas [30]. Additionally, studies from our groups and others iden- 
tified abnormalities in the CBG circuits in OCD. Indeed, in fMRI 
studies of OCD patients, abnormal haemodynamic responses in 
the OFC and ACC may decrease as patients improve either through 
deep brain stimulation (DBS) therapies [31] or psychotherapy [32]. 
These data support the involvement of limbic/associative frontal- 
basal ganglia loops in OCD and suggest that their dysfunction 
might result in pathological doubt, obsessive anxiety, and compul- 
sive behaviours. 


Neural recording and neuromodulation 
Electrophysiology and long-range synchronization 


Unlike metabolic measures, electrophysiological recordings provide 
a unique source of information to assess long-distance synchroniza- 
tion at sub-millimetre scales with sub-millisecond resolution. Two 
main types of signal can be recorded—either using micro-electrodes 
(diameter <50 um), which allows the identification of spike activity, 
and therefore single- or multi-neuron activities; or macro-electrodes 
(diameter >500 um), which collect local field potentials (LFPs) that 
sum the relatively low-frequency (<150 Hz) electrical activity of mil- 
lions of neurons and their afferences in the area surrounding the tip 
of the electrode. The micro-electrodes are typically used in animal 
studies, but they can also be used during DBS neurosurgery to con- 
firm the targeting of the intended structure. In DBS-treated patients, 
LFPs may be directly recorded from the definitive stimulating elec- 
trodes, while their leads are still externalized prior to connection to 
the pulse generator. 
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To date, electrophysiological assessment of long-distance network 
connections has been little used in OCD patients. Most recently, 
simultaneous recording of the STN, LFPs, and cortical activity from 
scalp measurements [such as magneto- or electroencephalography 
(MEG/EEG)] has allowed the identification of long-distance coher- 
ence between the STN and motor cortical areas, in specific frequency 
bands (alpha and beta, 7-12 Hz and 13-35 Hz, respectively), sug- 
gestive of the involvement of the motor hyperdirect pathway [33, 34]. 
Interestingly, functional connectivity measures using electrophysio- 
logical recordings have been conducted on DBS-treated PD patients 
and shown increased values at higher frequencies when levodopa 
was concurrently administered to improve motor symptoms. 

Characterizing the differences in neural activity between OCD pa- 
tients and unaffected subjects would show how neural activity within 
elements of the CBG pathway is altered in DBS-implanted OCD pa- 
tients and provide valuable data on the functional interactions be- 
tween the various brain areas. By providing insight into how limbic 
cortical activity is co-ordinated with subcortical activity, such ana- 
lyses would be crucial in understanding the pathophysiology of OCD 
and begin to dissect the mechanisms by which DBS functions in 
OCD patients. Finally, these data could lead to novel DBS technolo- 
gies to target specific pathways to normalize the information transfer 
along the cortico-subcortical pathway that is disrupted in OCD. 


Deep brain stimulation 


DBS (see Chapter 19) represents a promising type of intervention 
for severe, resistant forms of OCD, based principally on the seren- 
dipitous observation that DBS of the basal ganglia can reduce OCD 
symptoms in those patients suffering from comorbid OCD [35, 36]. 
Independent studies have proposed multiple targets in the CBG 
for DBS in OCD, such as the anterior limb of the internal capsule 
and the STN [37-39] (Fig. 94.5), as an alternative to stereotactic 
lesional neurosurgery. But progress has been limited by the lack of 
understanding of the pathophysiology of OCD at both the neural 





Fig. 94.5 (see Colour Plate section) Three-dimensional image 

of electrodes implantated in a DBS procedure. Depending on the 
stimulating electrode (5) configuration and stereotactic co-ordinates, 
several anatomical sites can be targeted. Two brain targets are 
represented in this example: the caudate caudate nucleus (2) and the 
subthalamic nucleus (2). Other brain structures, such as the substantia 
nigra (3) and the red nucleus (4), can be used as anatomical landmarks. 
Adapted courtesy of Jerome Yelnik, INSERM/ICM. 


and behavioural levels and how DBS operates on these brain circuits 
and their functions. Two meta-analyses have suggested that OCD 
symptoms showed a mean improvement of 45% after 3-36 months 
of DBS, with 60% of patients being considered responders (>35% 
reduction in OCD severity) and found no significant differences be- 
tween the brain structures targeted [40, 41]. Yet according to Kisely 
et al. [40], only one study meets all four of the quality criteria, namely 
the first randomized double-blind study performed on 16 OCD pa- 
tients which showed that 3 months of DBS of the STN is sufficient 
to decrease OCD severity [38]. Only a few studies report long-term 
efficacy (>3 years) and sustained toleration of DBS in OCD patients. 
Stimulation of the nucleus accumbens, the ventral caudate/ventral 
striatum and the anterior limb of the internal capsule, and the bed 
nucleus of the stria terminalis have been associated with a median 
reduction of 46%, 36-64%, and 45% in OCD severity, respectively, 
36-171 months after surgery [42-45, 45a]. 

Understanding the contribution of cortico-subcortical circuits to 
OCD pathophysiology and therapeutic interventions therefore rep- 
resents a major challenge in improving the selection of candidate 
patients and the optimization of treatment strategies. As recognized 
by various authors [46], traditional nosography—even when com- 
bined with more recent techniques such as brain imaging, has made 
limited progress in this direction, and novel approaches combining 
hypothesis-driven experimental studies of endophenotypes, animal 
models, and neuromodulation may be essential to open the way to 
translational research for innovative treatments. 

As an example, therapeutic mechanisms underlying the action 
of STN high-frequency stimulation (HFS) involve the hyperdirect 
pathway. This pathway provides direct cortical inputs to the STN. 
It was first described in the motor/premotor domain and was con- 
sidered to act through motor selection [23]. Recent evidence shows 
a limbic element stemming from the OFC and ACC [47]. Current 
proposals regarding DBS treatment in humans now incorporate 
the hyperdirect pathway not only for motor disorders, but also for 
psychiatric conditions such as OCD [48, 49]. Indeed, the effective 
target for OCD was the associative-limbic part of the STN-HFS [38], 
which is the target of hyperdirect afferents from the limbic and asso- 
ciative prefrontal regions (ACC and OFC). 

Interestingly, per-operative recordings have revealed that popula- 
tions of neurons from the associative-limbic part of the STN exhibit 
electrophysiological correlates of OCD symptoms (for example, re- 
petitive checking [50]). At the cellular level, burst and oscillatory 
patterns of activity from the associative-limbic part of the STN cor- 
relate with OCD symptomatology (across patients) and predict the 
post-operative long-term response of STN-HFS [51]. 

However, the nature of the benefit is not clear and difficult to 
investigate in patients, as the amount of physiological data simul- 
taneously available at each node of the cortico-subcortical loops is 
limited. Moreover, the use of DBS is today empirically driven. The 
exact effects of this technique are not known, especially the physio- 
logical consequences of such stimulations on the CBG loop [52]. 
Finally, electrical currents may trigger antidromic effects in afferent 
neurons, most notably in limbic cortical areas, paralleling the effects 
shown in cases of Parkinsonism [53]. 


Optogenetics 


The functional deconstruction of neuronal circuits can now be 
achieved in animal models using optogenetic techniques [54]. These 
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have already been used in rodents to decipher neural codes of basic 
motor, cognitive, and emotional behaviours [14, 53, 55, 56]. These 
techniques allow the manipulation of neuronal activity using op- 
tical stimulation by expressing, through viral vectors, specific light- 
sensitive membrane proteins—called opsins—in neural cells. This 
approach is of great interest in the selective inhibition or excitation 
of nodes of given targeted circuits and can be used to assess physio- 
logical and functional causal links. Using a combination of genetic 
and stimulation-dependent effects, optogenetics allows a high level 
of temporal, spatial, and cell type specificity. 

It can also be used to overcome certain issues encountered in pre- 
vious classical approaches. For example, the observation of electro- 
physiological effects of DBS in OCD patients is challenging because 
of the electrical artefacts it induces. DBS effects also depend on cell 
type, active fibres, and nearby structures. These issues could be over- 
come by taking advantage of the optogenetic technology, which 
avoids electrical artefacts and allows the stimulation of specific cell 
types only. 

Primary applications of optogenetics have been used to better 
analyse and understand discrete and specific normal or abnormal 
behaviours in rodents. To effectively validate and translate this novel 
knowledge to clinical applications, the next step is to scale up these 
optical experiments to animal models displaying greater anatom- 
ical and physiological similarities with human beings, especially in 
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complex behavioural abnormalities. In particular, rodents allow for 
a wide range of tools to be used in order to manipulate the cortex- 
basal ganglia circuits, yielding models of neurologic and psychiatric 
disorders. 

Using optogenetics in rodents, some recent studies have sug- 
gested new targets that could be considered to be causally linked 
to OCD symptomatology. Optogenetic manipulation of specific 
brain circuits, such as the orbitofrontal-striatal pathway, could in- 
duce or decrease compulsive-like behaviours [14, 57]. Although 
these brain areas have previously been regularly identified as ab- 
normally active in OCD patients, these studies were the first to 
show their manipulation could elicit/decrease compulsive-like 
behaviours. Thus, an optogenetic approach in animal models not 
only allows the testing of circuit-level hypotheses for OCD, but 
also challenges further the micro-circuitry within various brain 
areas. Indeed, genetic studies in human patients have already in- 
dicated that genes affecting the serotonergic, dopaminergic, and 
glutamatergic systems, and the interaction between them, may 
be affected in OCD and many potential therapeutic targets may 
arise from these biological systems [58]. Furthermore, Burguiére 
et al. [14] focused on a new potential cellular subtype—the striatal 
interneurons—which may play an essential role in regulating the 
expression of striatal-dependent behaviours and could be impli- 
cated in OCD (Fig. 94.6). 
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Fig. 94.6 Hypothetical model to illustrate striatal hyperactivity as a consequence of interneuronal network deficiency. Above: in normal conditions, an 
extensive inhibitory network in the striatum preserves a balance of excitation and inhibition driven by the cortex. Below: in the OCD condition, there 

is an imbalance towards excitation in the striatal region due to a lack of interneurons. As a result, there will be hyperactivation of the entire CBG loops. 
ACC, anterior cingulate cortex; MSN, medium spiny neurons; OFC, orbitofrontal cortex; PV, parvalbumin-positive interneurons. 

Reproduced from Curr Opin Neurobiol., 30, Burguière E, Monteiro P, Mallet L, et al, Striatal circuits, habits, and implications for obsessive-compulsive disorder, pp 59-65, 


Copyright (2015), with permission from Elsevier Ltd. 
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In fact, it has been shown that a powerful inhibitory network 
of parvalbumin (PV)-immunoreactive and cholinergic (Chat)- 
immunoreactive interneurons is crucial for the functional regu- 
lation of striatal networks [59]. Moreover, depending on whether 
either the sensorimotor or associative-limbic cortico-striatal path- 
ways are affected by a lack of interneurons, the functional conse- 
quences can be drastically different. According to human imaging 
studies, it has been suggested that cortico-striatal sensorimotor 
pathways (including the dorsolateral part of the striatum equiva- 
lent to the putamen in humans) are more likely to be affected in 
Tourettes patients, while cortico-striatal associative-limbic path- 
ways (including the medio-ventral part of the striatum or caudate 
nucleus in humans) are more likely to be affected in OCD patients, 
although some overlap can be observed in the pathophysiology of 
these behaviours [60-63]. 


Linking behavioural and neurophysiological 


endophenotypes of OCD to strengthen 
target validity 





Experimental psychologists and clinicians have performed nu- 
merous studies on OCD patients to assess their performance in 
the main cognitive domains (memory, attention, verbal fluency, 
etc.) using classical behavioural test batteries. Even if some deficits 
have been observed, no clear magnitudes of effects for each cogni- 
tive domain have been observed using meta-analyses [64-66]. So in 
order to identify the relevant behavioural dimensions that might be 
affected by OCD, investigators must focus on more selective cog- 
nitive processes. To illustrate this approach, we will discuss three 
dimensions that could prove dysfunctional in OCD, as well as the 
underlying brain circuits that play a role in these processes and that 
often overlap with those identified in the pathology. 


Uncertainty monitoring 


Compulsive checking is a core symptom of OCD. It is character- 
ized by the urge to verify repeatedly that an action, usually aimed 
at preventing a possible harm, has been properly completed [67]. 
The clinical phenomenology of compulsive checking illustrates 
two hypotheses regarding its psychopathological mechanisms— 
compulsive checking may proceed from a maladaptive goal-directed 
behaviour towards uncertainty reduction; or, in the context of habit- 
driven behaviours, it may result from a failure to control impulses 
to check. These two alternatives fit nicely into two lines of research 
on the behavioural functions of the basal ganglia in general, and the 
STN in particular. 

A growing number of studies have expanded the role of the STN 
from purely motor functions to include higher-level processes such 
as emotion and decision-making [68, 69]. In PD patients (for whom 
STN-HFS treatment is most widespread), the STN has been associ- 
ated with the regulation of behaviour in the face of conflicting stimuli 
[70], for example postponement when faced with two positively re- 
inforced cues in reward-based motor tasks [70-72]. Conversely, le- 
sion and stimulation studies in animals have also shown that various 
forms of STN intervention result in premature responses and in- 
creased attentional errors [73, 74]. Interestingly, the hyperdirect 
pathway, which connects the cortical frontal areas to the STN [23], as 
previously described, could constitute an anatomical route for such 


decisional signals whereby behaviour is regulated according to the 
strength of the sensory evidence. Indeed, the hyperdirect pathway 
not only connects motor/premotor functions, but also the limbic/ 
associative regions, such as the ACC to the anterior-medial STN, 
precisely the target of DBS in the context of OCD [47]. Considering 
the role of the ACC and other associative/limbic frontal cortices in 
action selection and decision-making, one could surmise that the 
STN receives higher-level decisional signals [69] to regulate a be- 
haviour adaptively. 

Recently, new behavioural paradigms have been developed to as- 
sess uncertainty monitoring in animals, revealing that, when given 
the opportunity, rats can adaptively ‘opt out’ and move on to the 
next, possibly easier, trial of the task [75, 76]. In such tasks, varying 
uncertainty about stimulus identity (hence decision difficulty) leads 
the animal to discard the trials where its response is most uncer- 
tain. Indeed, simultaneous electrophysiological recordings have 
shown that stimulus-related uncertainty is encoded in the spike rate 
of the OFC neurons. However, despite putative connectivity (via the 
hyperdirect pathways), it is still unknown whether these uncertainty 
monitoring signals reach the STN and participate in the regulation 
of goal-directed behaviour, either to opt out from a trial or to check 
back at the stimulus. 

Paradoxically, empirical studies in humans have reported that 
repetitive checking, even in healthy subjects, instead of reducing 
stimulus-related uncertainty, actually decreases subjective con- 
fidence [77]. To investigate repetitive checking in the laboratory, 
we introduced a so-called ‘verification task, based on a working 
memory paradigm in which, on each trial, participants could review 
the stimulus before proceeding to the feedback. This paradigm al- 
lowed us to provoke and assess compulsive checking in OCD pa- 
tients [78, 79] and also to reproduce this pathological behaviour 
during the DBS surgery when the patient is awake and behaving 
[50]. We were therefore able to record the STN from populations of 
neurons, as OCD patients entered bouts of repetitive checking. The 
STN showed a significant increase in the population spike rate when 
the patient was about to perform another checking action, com- 
pared to when he/she was about to proceed to the next trial. These 
results suggest that uncertainty signals anticipating checking actions 
may be detected in STN neurons [50]. 


Behavioural inhibition 


A different and contrasting line of research has focused on action 
control [80, 81], comprising behavioural inhibition and the ces- 
sation of ongoing motor programmes before they have reached 
overt expression. According to the classical model of the basal 
ganglia-thalamo-cortical loops, the major output nuclei of the 
basal ganglia (the STN and the internal globus pallidus) maintain 
sustained inhibition at the cortical level via the thalamic relay. To 
explore behavioural inhibition in humans, clinical [35], functional 
[68], and electrophysiological studies in DBS-treated PD patients 
have proved critical, linking successful behavioural inhibition to 
frequency-specific changes of oscillatory activity within the STN 
[71], as well as between the STN and prefrontal cortices [82]. As a 
consequence, a lesion or inactivation of the STN performed in rats 
leads to impulsive behaviour, characterized by failed inhibition of 
prepotent responses initiated in the striatum-—pallidum circuits 
[83]. In this context, compulsive behaviour may reflect a func- 
tional deficit at the STN level, causing failure to inhibit prepotent 
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or habitual actions that are produced, irrespective of the behav- 
ioural context. 

Indeed, neural representations of habitual sequences of actions 
triggered as encapsulated motor programmes, or ‘chunks; have been 
observed in the striatal circuits of rodents [84-86]. Additionally, 
perturbation of the fronto-striatal circuitry could indeed modulate 
the execution of habitual actions and prevent compulsive behaviour, 
as has been shown using optogenetic manipulation in mice [14] and 
DBS in OCD patients [31]. Therefore, the fronto-striato-pallidal 
pathway may play an important role in the emergence of motor 
programmes and, if dysfunctional, provoke their overexpression, 
leading to compulsive behaviours. Complementary to the cortico- 
striatal pathway, the cortico-STN pathway (that is, the hyperdirect 
pathway) has been shown to stop habitual behaviours before they 
reach completion [82, 87]. In this context, compulsive behaviour 
may reflect a functional deficit which fails to inhibit prepotent or 
habitual actions that are produced, irrespective of the behavioural 
context. 


Behavioural flexibility 


OCD in humans is characterized by repetitive behaviour, performed 
through rigid rituals. This phenomenological observation has led 
to exploration of the idea that OCD patients have diminished be- 
havioural flexibility (that is, the ability to change one’s behaviour 
according to contextual cues). Behavioural flexibility may be chal- 
lenged in experimental tasks such as reversal learning paradigms. 
In these tasks, the participant has to respond to either of two visual 
stimuli. In the deterministic version, one stimulus is positively re- 
warded while the other is not. Unbeknown to the participant, the 
reward contingencies are reversed after a certain number of trials, 
so that the previously neutral stimulus is now rewarded while the 
previously rewarded stimulus is not. In this deterministic version, 
healthy adult humans immediately detect the change in contingen- 
cies and start responding to the other stimulus. In addition, the task 
may be rendered more challenging using a probabilistic version 
of it [88]. Here, the reward is not systematically delivered after the 
subject has chosen the rewarding stimulus, providing uncertainty 
regarding the current state of the reward contingencies. Only after 
having collected enough evidence that the previously reinforced 
stimulus has changed, will the participant shift to the newly (but still 
varyingly) reinforced stimulus. Performance in reversal learning 
is scored according to the number of persistent errors committed 
when participants maintain their response towards previously re- 
inforced stimuli in spite of a negative reward. At the brain level, be- 
havioural flexibility tasks, and reversal learning in particular, appear 
to engage the OFC and ACC regions. In a 2006 study, Remijnse et al. 
[89] showed that OCD was associated with a reduced response in the 
orbitofrontal-striatal network during adaptive switching to the new 
stimulus reinforcement regime. Using a different type of behavioural 
flexibility task, Chamberlain et al. [90] has demonstrated that OCD 
patients, as well as their relatives, show diminished OFC responses 
to behavioural adaptation triggered by the task, pointing towards 
the existence of an OCD endophenotype. Interestingly, in PD, STN 
stimulation has a beneficial effect on mental flexibility and working 
memory [91] and on cognitive flexibility, as expressed by the inhib- 
ition of habitual responses tested in a random number generation 
task [92]. Unsurprisingly, similar behavioural tasks have long been 
used with mice using various response modalities (T-maze, lever 


press, nose-poke), and animal studies have confirmed the role of 
OFC or OFC homologous regions in behavioural flexibility. 


Within the last decade, a multi-dimensional approach has been pro- 
posed to better link clinical observations with the increasing amount 
of data obtained in biological psychiatry studies [93, 94]. 

In fact, the integrative view of complementary research fields, 
including neuroimaging, genetics, experimental psychology, com- 
putational psychiatry, and animal models, has led to a redefinition 
of OCD nosography and its heterogenous expression. In the fu- 
ture, further insights may come from studies combining different 
fields, as illustrated by recent pioneering work with OCD patients 
which has combined computational and neurophysiological ap- 
proaches [95] or experimental psychology with neuroimaging [96]. 
The identification of new targets using such complementary ap- 
proaches could also be used to develop entirely novel animal models 
of psychiatric diseases. It would open the way to developing new 
ideas by manipulating brain circuits, for example using optogenetic 
manipulations that would reproduce relevant aspects of the OCD 
symptomatology. 

To analyse and understand abnormal behavioural functions in OCD 
patients, a promising perspective comes from the recent field of com- 
putational psychiatry. It sets out to investigate the changes in elemen- 
tary cognitive processes in dysfunctional patients, such as behavioural 
flexibility or uncertainty monitoring, as described previously. Using 
the computational approach, researchers try to infer the hidden vari- 
ables and states associated with the computations the brain might use 
when performing a task. Indeed, mathematical models of how choices 
emerge from elementary computations can potentially bridge the gap 
between neural activities and the behavioural choices. 

A more translational and comprehensive approach that integrates 
novel fields of fundamental and clinical research should therefore 
be used to unveil the neurobiology of OCD and possibly lead to the 
development of more effective treatments. 
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Up to the early 1980s, obsessive-compulsive disorder (OCD) was 
considered a rare (less than 0.5%) treatment-refractory, chronic con- 
dition of psychological origin. Dynamic psychotherapy was widely 
used and yet had little benefit; this was also the case for pharmaco- 
logical interventions [1]. 

The observation that clomipramine, a tricyclic drug for depression 
with a serotonergic profile, is effective in treating symptoms of OCD 
[2, 3] has increased clinical interest in this disorder, including its epi- 
demiology; several researchers have since reported a prevalence of 
OCD of about 2% in the general population [4, 5]. However, changes 
to the diagnostic system, especially the addition of OCD-related dis- 
orders (OCRDs) to OCD (and creating the OCRD cluster) [6], have 
dramatically increased the calculated prevalence of this cluster to 
close to 9%. 
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The diagnosis of OCD according to DSM-5 criteria is based on the 
presence of either obsessions or compulsions, which cause marked 
distress, are time-consuming (more than an hour daily), or signifi- 
cantly interfere with a person’s normal routine and social and occu- 
pational activities. Notably, in DSM-5, two specifiers were added: the 
degree of insight (ranging from good or fair to absent) and the pres- 
ence (or absence) of tics. 

According to DSM-5 [6], obsessions are repetitive, intrusive, and 
distressing thoughts, ideas, images, or urges that are often experi- 
enced as meaningless, inappropriate, and irrelevant, and persist 
despite efforts to suppress, resist, or ignore them. Compulsions are 
repetitive, stereotyped behaviours and/or mental acts that a person 
feels driven to perform as a standalone urge or as a perceived reac- 
tion to an obsession. Though such intrusive thoughts and ritualistic 
behaviours are also frequently reported in the general population 
[7-9], those seen in OCD are considered psychopathological as they 
are time-consuming and cause marked distress and significant func- 
tional impairment [10]. OCD can express itself in various ways. The 




















obsession component includes: dirt/germs, aggression/harm, re- 
ligious/moral issues, and sexual obsessions. The compulsion com- 
ponent include: washing, checking, ordering, counting, reassurance 
seeking, and symmetry acts and various mental compulsions [11]. 

In accordance, a common expression of OCD is an obsession 
around dirt or ‘germs, together with washing or avoiding pre- 
sumed contaminated objects (door knobs, electrical switches, 
newspapers, peoples’ hands, telephones, etc.). It contains ‘hard- 
to-avoid’ substances (for example, faeces, urine, dust, or germs), 
to the extent that individuals sometimes avoid leaving home. 
Another common expression of OCD involves pathological doubt, 
which is frequently accompanied by a compulsion to check. For 
example, the person may need to check whether the oven is turned 
off or the front and back doors are closed—the checking may in- 
volve many trips back home to recheck what was already checked. 
In OCD, the checking procedure, instead of resolving uncertainty, 
often contributes to even greater doubt, which leads to further 
checking. Other common expressions of OCD are related to the 
fear of causing harm, for example by being neglectful (for instance, 
an obsession of hurting someone while driving, leading to driving 
back towards the relevant spot repeatedly and checking no one 
was hurt). 

The need for symmetry or precision, which may interfere with 
task completion and result in pathological slowness, is another pres- 
entation of OCD. For those patients, it can take hours to eat a meal or 
shave, in an attempt to do things ‘just right. Other common expres- 
sions include hoarding, involving the urge to accumulate and diffi- 
culty with discarding objects (“What if I will need iť?) and religious 
obsessions (fear of blasphemy). 

At times, patients complain of intrusive thoughts without no- 
ticeable compulsions. However, such patients are often found to 
be engaging in mental compulsions, for example mental checking 
or analysing (for instance, whether the intrusive thought exists). 

Several studies [12-16] have used factor-analytic methods to sys- 
tematically examine the structural characteristics of OCD symp- 
toms, revealing four main obsessive-compulsive (OC) factors: 


1. Aggressive/sexual/religious/somatic obsessions accompanied by 
checking compulsions. 
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2. Symmetry obsessions accompanied by ordering and arranging/ 
counting/repeating compulsions. 

3. Contamination obsession accompanied by cleaning compulsions. 

4. Hoarding obsession accompanied by hoarding and collecting 
compulsions. 


Other theories [17-20] have suggested that the heterogeneity of 
OCD may be better understood with two core dimensions: harm 
avoidance and sense of incompleteness. They suggested that OCD 
is characterized either ‘by anxious apprehension and exaggerated 
avoidance of potential harm (for example, washing, repugnant ob- 
sessions, harm-avoidant checking) or alternatively by attempts to 
reduce feeling of incompleteness (for example, symmetry, counting, 
repeating, slowness). Interestingly, these feelings of incompleteness 
were first described in 1903 [21] as a central phenomenological fea- 
ture of OCD, according to which OCD patients are tormented by 
the feeling that their actions are incompletely achieved or do not 
produce the expected satisfaction. Their inability to achieve ‘closure’ 
concerning actions/perceptions leads to what later was labelled as 
the ‘not just right experience’ (NJRE) [22, 23]. 

However, it should be noted that recent studies [24] documented 
that obsession may not always precede compulsion, as the traditional 
acronym OCD suggests. Alternatively, the pathological loop may 
start with the actual urge to perform the compulsion (not as a pur- 
poseful, goal-driven act, but as a habit), while the obsessional com- 
ponent develops later as a secondary phenomenon. The acronym 
OCD may be used to describe this course of illness. 

Indeed, obsessions and compulsions are not necessarily rationally 
related; hence, a patient might perform a compulsion of clapping 
hands in a specific way, in order to prevent an obsession of being 
contaminated with germs. 

Most patients present with multiple obsessions and compulsions. 
The symptoms may shift over time; for example, a patient who had 
washing rituals during childhood may present with checking rituals 
as an adult [19, 20]. 

As OCD symptoms are usually embedded within the updated 
culture and technology, it often assimilates with the frequent usage 
of the Internet and smartphones in daily life. Harnessing this tech- 
nology into compulsive behaviours (for example, compulsive 
checking of emails/posts, taking photos as another presentation of 
list makings, electronic hoarding, etc.) might not always be diag- 
nosed, since it often camouflages as proper Internet use. 


Diagnosis and differential diagnosis 


As OCD has diverse expressions, the diagnosis, as well as differential 
diagnosis, requires careful and thorough evaluation. 

The NICE guidelines include five simple questions that might help 
identify OCD (Box 95.1) [25]. These questions are recommended 
whenever a mental status examination is carried out; since pa- 
tients tend to hide their symptoms (due to the ego-dystonic nature 
of OCD), asking direct, specific questions is often the only way to 
diagnose OCD. 

Another clue may be past or current tics (including vocal tics) and 
a family history of OCD or OC spectrum disorders (OCSDs) which 
include: hoarding disorder; body dysmorphic disorder (BDD); and 
body-focused repetitive behaviours such as trichotillomania (hair 
pulling), onychophagia (nail biting), and excoriation (skin picking). 
Personal distress and functional impairment, which are required for 


Box 95.1 Quick screen for obsessive-compulsive disorder 





e Do you wash or clean a lot? 

e Do you check things a lot? 

e Is there any thought that keeps bothering you that you would like to 
get rid of but can't? 

e Do your daily activities take a long time to finish? 

e Are you concerned about orderliness or symmetry? 

e Do these problems trouble you? 





Source: data from National Collaborating Centre for Mental Health, Obsessive- 
compulsive disorder: Core interventions in the treatment of obsessive-compulsive disorder 
and body dysmorphic disorder, Copyright (2006), The British Psychological Society and 
The Royal College of Psychiatrists. 














diagnosis, differentiate OCD from ordinary or mildly excessive wor- 
ries, thoughts, and habits. 

Psychiatric diagnoses that show a presentation similar to 
that of OCD include schizophrenia, depressive disorder, post- 
traumatic stress disorder (PTSD), social anxiety, eating dis- 
order, and phobias. The importance of an accurate diagnosis is 
due to the uniqueness that underlines the treatment, as well as 
the course, prognosis, and therapeutic goals for these disorders. 
Severe, bizarre, and incapacitating OCD symptoms, especially 
when accompanied with a lack of insight (delusions), are often 
misdiagnosed as schizophrenia. By introducing the lack of insight 
specifier and by restricting the diagnosis of schizophrenia to only 
those with primary symptoms, DSM-5 might help to reduce such 
misdiagnoses. 

At times, OCD symptoms may appear as a phobia. While the 
classic phobic objects are avoidable (for example, tunnels, dogs, ele- 
vators) and the anxiety is reduced in the absence of the stimulus, 
the ‘phobic’ stimulus in OCD are often unavoidable (for example, 
germs, dirt, radioactive waves, etc.). 

Major depressive disorder (MDD) is frequently associated with 
ruminations that are mood-congruent and, unlike OCD, usually 
involve beliefs of personal failure related to past events. Although 
MDD is common in OCD, in the majority of cases, the OCD had 
started first and depression developed later as a secondary response 
to the substantial difficulties resulting from OCD. 


Epidemiology 


Prevalence 


OCD is a global phenomenon, with substantial similarity across 
cultures in symptom clusters, gender distribution, age of onset, and 
comorbidities [26]. However, regional and historical variations in 
symptom expression exist, and cultural factors may shape the con- 
tent layer of obsessions and compulsions (for example, contam- 
ination by syphilis in the past and by AIDS nowadays; repetition 
of prayers by religious individuals vs personal mantras in secular 
persons). 

Using the restricted DSM-IV definition (that is, not including 
OCD-related disorders), the lifetime prevalence of OCD in the 
general population was calculated at 2-3% in a number of seminal 
studies [4, 27]. Those percentages have been confirmed across cul- 
tures [5] and support the biological basis of OCD. Although the cul- 
tural, economic, and social factors play a role in the presentation of 


OCD (that is, the content of obsessions and the shape of the com- 
pulsions), yet its prevalence is equally distributed across the globe. 

More recent large-scale cross-sectional surveys of US and 
European citizens using diagnostic instruments compatible with 
DSM-IV produced a more conservative estimate of a 1.6% [28, 29] 
and 0.8% [30, 31] lifetime prevalence, respectively. In contrast, a 
detailed longitudinal prospective study of community respondents 
in the Swiss Canton of Zurich, [32], that additionally looked for 
subthreshold forms of OCD, reported a prevalence rate of 3.5% for 
OCD (males: 1.7%; females: 5.3%), 9.7% for obsessive-compulsive 
syndrome (males: 1.7%; females: 5.3%), and 11.2% for obsessive- 
compulsive symptoms (males: 13.7%; females: 8.8%). Thus, in 
this study, taking into account the whole spectrum of obsessive- 
compulsive symptomatology, over 10% of the population were 
affected. 

In DSM-5, OCD was separated from the anxiety disorders and cat- 
egorized within a new diagnostic category (‘Obsessive-compulsive 
and related disorder’), which includes, in addition to OCD, disorders 
such as: hoarding disorder; BDD; and body-focused repetitive be- 
haviours such as trichotillomania (hair pulling), onychophagia (nail 
biting), and excoriation (skin picking). Considering this spectrum- 
oriented view of OCRDs, the cumulative prevalence of OCSDs is 
much higher than that of OCD alone. Summing up the prevalence of 
the individual OCRDs (Table 95.1) and factoring in the comorbidity 
rates, the overall prevalence of OCRDs as a group could be as high 
as 9.5%. 


Gender 


OCD is overrepresented in females (around 1.5:1) [32], and gender 
differences in the pattern of symptom dimensions have been re- 
ported. For example, females are more likely to have contamination 
obsessions and symptoms in the cleaning dimension and males are 
more likely to have symptoms in forbidden thoughts and religious 
and symmetry rituals [41]. Males are more prone to be associated 
with significant interference in life; for example, two-thirds remain 
single vs only one-third of women. Moreover, almost half of male 
patients continued to live with their original families or in assisted 
homes, compared with only 20% of women. These rates might be 
due to the relatively earlier onset of the pathology in males [41] or 
the different course of OCD in males vs females. 


Course of OCD 


The onset of OCD is often spontaneous; however, in some cases, 
OCD symptoms follow a stressful or traumatic event (for example, 
pregnancy, labour, accident, etc.) [42, 43]. Due to the secretive 


Table 95.1 Summary of prevalence of OCRD 
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nature of the disorder, there is often a delay (of up to several years 
[44]) before patients are adequately diagnosed. However, the delay 
may shorten with increased public awareness to the disorder, for 
example through articles, books, and movies, and if specific ques- 
tions regarding OCD are included in the mental status examination 
(Box 95.1). 

The course is usually chronic; yet some patients experience a 
fluctuating course [45]. In a 40-year longitudinal study (1953-1993), 
during which no effective and long-lasting therapies were available, 
the duration of illness was chronic for most patients, with half still 
experiencing clinically relevant symptoms and another third sub- 
clinical symptoms at 40 years’ follow-up [46]. Among those followed 
up for half a century since onset, 37% still had OCD. Although, in 
the study, most patients showed some improvement along the years, 
complete recovery occurred in only about one-fifth of patients. 
A study prospectively examined 113 OCD patients (all of whom had 
a history of at least one other comorbid anxiety disorder) and found 
that the probability of OCD remission was 0.16 at year 1 and reached 
0.42 by year 15 [47]. For those who remitted, the probability of re- 
currence of OCD remained low, approximating 0.25 from year 5 on- 
wards. Those who were married and those without comorbid MDD 
were more likely to enter remission. A relatively high rate of remis- 
sion at a specialist paediatric OCD clinic was also found in a 2010 
study [48] where approximately 60% of 142 young people with OCD 
assessed at baseline did not have a full clinical syndrome of OCD 
after up to 9 years. However, approximately 50% of participants were 
still receiving treatment and about 50% expressed a need for further 
treatment. In the same study, the main predictor for persistent OCD 
was the duration of illness at assessment. Indeed, a meta-analysis 
of 22 studies (n = 521) of child/adolescent-onset OCD found that 
earlier age at onset and increased duration predicted persistence of 
illness [49]. 

Environmental factors like family accommodation or compliance 
with OCD symptoms are also important variables. Those kinds of 
family interactions, while understandable, can be destructive. For 
example, answering reassurance questions or participating in avoid- 
ance behaviours (for example, ‘I will open the door for you so you 
will not need to wash;'I will turn the stove off so you will not need to 
check; etc.) stands as a significant barrier to recovery. 


Age at onset of OCD 


The mean age at onset of OCD has traditionally been reported 
to be late adolescence [50-52], with a range of 10-22 years [32]. 
However, recent studies report an even earlier mean age at onset of 
around 15 years [53], with onset after 30 years being uncommon 


Disorder Prevalence Comorbidity with OCD Net prevalence 
(prevalence - comorbidity) 











Body dysmorphic disorder SEZ 2s 
oarding disorder 2-5% ae 
Trichotillomania (hair pulling disorder, 0.6-2% [37] 
PD) 
Excoriation (skin picking disorder) 1.4-5.4% [39] 
et prevalence equals prevalence (mean) minus comorbidity (mean). 





Comorbidity equals comorbidity (mean) times prevalence (mean). 





Sum of OCRD = 1.6% (OCD) + 1.4% (pure BDD) + 2.4% (pure hoarding) + 1% (pure trichotillomania) + 3% (pure skin picking disorder) = 


30-35% [34] 1.4% 
8-35% [36] 2.4% 
13% [38] 1% 
16% [40] 3% 


9.4% (OCRD). 
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[54]. Females are overrepresented in late-onset OCD (around 1.5:1), 
while in the earlier onset of illness grouping, males are more repre- 
sentative [55]. Nearly 25% of males have an onset before the age of 
10 years [56]. 

In a 40-year longitudinal study [46], early-onset OCD was asso- 
ciated with poorer outcome, especially in men. More recent studies 
have reported a negative correlation between age at onset and dur- 
ation of illness [53]. Additionally, an association between early 
onset and greater symptom severity [57], male gender [58], gen- 
etic loading [59], [60], severity of compulsions [61], poor insight 
[62], co-occurring tic disorders [63], hair pulling [64], multiple 
anxiety disorders [61], and OC personality disorders [65] has been 
identified. 

Late-onset OCD may include cases of OCD that have been de- 
veloped after exposure to traumatic events (post-traumatic OCD) 
[66]. Females may develop OCD in pregnancy or after parturition 
or miscarriage [42]. 


OCD and suicidal risk 


Historically, OCD was considered a psychiatric disorder with rela- 
tively low rates of suicidal ideation (which was attributed to its 
compulsive domain). However, recent studies have challenged this 
perception and concluded that the risk of attempted and completed 
suicide among individuals with OCD is substantial [67]. Some au- 
thors suggested that suicide risk is more strongly predicted by psy- 
chiatric comorbidity, for example rates of comorbid depression and 
anxiety, rather than OCD itself [10, 11]. However, even when con- 
trolling for comorbidity, the risk of suicidality remains significant 
among OCD sufferers [68], and rates of suicidal ideation ranging be- 
tween 20% and 46% have been reported [11-15]. Sixty-five per cent 
of one OCD sample had endorsed at least one of the questions about 
suicidal phenomena as positive. Of these, 32% had made suicidal 
plans and 19.4% had attempted suicide [16]. Nevertheless, it is im- 
portant to note that suicidal thoughts may sometimes be a symptom 
of OCD (for example, ‘I might commit suicide, ‘what is the prefer- 
able way to commit suicide’), and not a genuine suicidal ideation. 


OCD and insight 


Classically, OCD is defined as an ego-dystonic disorder, that is, a dis- 
order in which the patient realizes that the obsessions and/or com- 
pulsions are irrational or excessive. 

However, keeping in mind the diversity of the disorder, DSM-IV 
included the subtype of ‘poor insight. 

DSM-5 extended the diagnostic boundaries of OCD by allowing 
to include absent insight or delusions in OCD [6]. Even if the clin- 
ician spotted a clear-cut delusion, that is, the individual is com- 
pletely convinced that the OCD beliefs are true, a diagnosis of OCD 
may be appropriate. The diagnosis should, however, be based on 
careful history-taking; in OCD, the disorder usually starts as ego- 
dystonic and slowly moves on the continuum of insight to delusional 
symptomatology. Moreover, in the case of OCD diagnosis, other 
compulsive components (now or in the past) are present and help to 
differentiate OCD from delusional disorders. 


Comorbidity 


If another Axis I disorder is present, the diagnosis of OCD requires 
that the content of the obsessions or compulsions is not restricted to 
that disorder (for example, a preoccupation with food or weight in 
eating disorders, or guilt feelings in the presence of a major depres- 
sive episode). Similarly, OCD should not be diagnosed if it is due 
to the direct effects of a substance (for example, a drug abuse or a 
medication) or a general medical condition. 

In a national survey of adults in the United States [44], OCD 
was associated with substantial comorbidity, not only with anxiety 
(75.8%) and mood disorders (63.3%), but also with impulse control 
(55.9%) and substance use disorders (38.6%). 


Anxiety disorders 


In a lifetime comorbidity community study [69], 73% of OCD pa- 
tients experienced an anxiety disorder (specifically: GAD 50%, 
social phobia 40%, agoraphobia 30%, simple phobia 20%, and panic 
disorder 17%). However, a recent international study reported a 
more conservative prevalence of comorbidity, with 14% of patients 
experiencing social anxiety, 13% experiencing GAD, and 12% ex- 
periencing panic disorder with or without agoraphobia [26]. 


Obsessive-compulsive spectrum disorders 


An early study by Du Toit and colleagues [70] showed that 57.6% 
of their OCD sample currently met the criteria for at least one pu- 
tative OCSD and that 67.1% had a lifetime history of at least one 
comorbid OCSD. In that study, the OCSDs with the highest preva- 
lence were compulsive self-injury (22.4%), compulsive buying 
(10.6%), and intermittent explosive disorder (10.6%). More re- 
cently, a cluster analysis of OCSDs in a sample of 210 OCD patients 
identified three separate clusters that were named: (1) ‘reward 
deficiency’ [including hair pulling disorder (trichotillomania or 
HPD), pathological gambling, hypersexual disorder, and Tourette’s 
disorder (TD)]; (2) ‘impulsivity’ (including compulsive shopping, 
kleptomania, eating disorders, self-injury, and intermittent explo- 
sive disorder); and (3) ‘somatic’ [including body dysmorphic dis- 
order (BDD) and hypochondriasis] [9]. Along these lines, a report 
from the International College of Obsessive-Compulsive Spectrum 
Disorders (ICOCS) showed that rates of comorbid OCSDs were 
high, particularly given the relatively low base rate of such disorders 
in the general population [26]. This is consistent with earlier work 
[4], which has emphasized the high prevalence of tic disorders, as 
well as of OCSD, in OCD. 


Depression and bipolar disorder 


The lifetime prevalence for a major depressive episode in individ- 
uals with OCD ranges from 40% to 80% [44]. However, depression is 
usually secondary (that is, the OCD started first). Depression in chil- 
dren and adults with OCD has been related to variables, including 
an early age at onset, more severe obsessive-compulsive symptoms, 
lower levels of perceived competence, and family history of recur- 
rent MDD [71]. This comorbidity is often associated with greater 
illness severity [72] and reduced quality of life [73]. 

Few studies have identified a greater likelihood of bipolar (espe- 
cially bipolar-II) disorder comorbidity in OCD, as compared to its 
prevalence in the general population [44]. 


Addiction 


Up to the 1980s, OCD was regarded asa disorder of ‘overcontrol and 
the consensus was that substance use would be incompatible with 
OCD [74]. This notion has been dramatically changed with later 
studies [75-78], which found a strong association between OCD 
and substance abuse/dependence (including alcohol). Earlier age at 
OCD onset was associated with an increased risk of alcohol use dis- 
orders, but only comorbid bipolar disorder was associated with an 
increased risk of drug use disorders, suggesting that patients with 
early-onset OCD or comorbid BPD may be especially vulnerable to 
substance abuse disorder [79]. 


Schizophrenia 


Up to 30% of patients with chronic schizophrenia experience OC 
symptoms, and about 12% of them may be qualified for the diag- 
nosis of OCD [80]. 

As in OCD in general, the OCD symptoms in these patients do not 
necessarily surface, unless specific questions are asked. Moreover, 
compulsive behaviour may be interpreted as ‘stereotypic behaviours. 
Patients with schizophrenia can distinguish (if properly questioned) 
ego-dystonic OC symptoms (perceived as coming from within) 
from schizophrenic ego-syntonic delusions (perceived as introduced 
from the outside). Follow-up studies demonstrated that a ‘schizo- 
obsessive’ diagnosis has stability over the years and is characterized 
by poor prognosis, especially if appropriate (that is, symptomatic, 
rather than syndromatic) treatment is not administered [81]. 

The poor prognosis of patients with schizophrenia and OCD, 
along with preliminary data regarding their response to a specific 
combination of treatments [specific drugs for psychosis (for ex- 
ample, amisulpride, ziprasidone, aripiprazole) in combination with 
serotonin reuptake inhibitors (SRIs)], has led several researchers to 
suggest that a ‘schizo-obsessive’ category may be of value [82]. 


Post-traumatic stress disorder 


PTSD-OCD comorbidity and post-traumatic obsessions have been 
reported [83, 84]. In one study, 41% out-of-combat PTSD patients 
were diagnosed with OCD, while an additional 6% had subthreshold 
OC symptoms [66]. No difference was found between PTSD and 
PTSD-OCD participant characteristics (including demographics, 
trauma-related factors, and other psychiatric comorbidity). The 
surprisingly high number of OCD found in this study suggests that 
PTSD-OCD (post-traumatic OCD) might be underdiagnosed and 
signifies the importance of direct assessment of OCD in patients 
with PTSD. 


Tic disorder 


The high comorbidity with tic disorders raises interesting patho- 
physiological and therapeutic implications. The rate of tic disorders 
approaches 40% in juvenile OCD [85, 86], and the prevalence of 
Tourettes syndrome is increased among the relatives of OCD pa- 
tients [87]. Due to its high prevalence, tic disorder was introduced 
as a specifier in DSM-5. Namely, each time that OCD is diagnosed, a 
specific and extra question needs to be included in order to identify 
tics. The importance of the specific attention to tics is related to dif- 
ferent pharmacological intervention; augmentation (of serotonergic 
medications) with very small doses of a D2 receptor antagonist (for 
example, Haldol’) or a D2/5-HT2 receptor antagonist (for example, 
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risperidone) or a partial agonist (for example, aripiprazole) in those 
cases was repeatedly found to be clinically effective [88]. 


OCD is traditionally considered to be a relatively chronic disorder 
[46, 89, 90]. However, some studies reported a more favourable 
prognosis and outcome [48, 89]. In a prospective, community-based 
study that followed up respondents for up to 30 years [32], almost 
two-thirds of individuals with OCD entered a sustained period of 
remission. The median duration for OCD was 16 years, suggesting a 
better prognosis for remission for less severe illness. Individuals with 
alonger duration of illness (more than 14 years) anda greater number 
of OC-burdened years (more than 5 years) and those seeking profes- 
sional help (probably due to a more severe condition) experienced 
significantly delayed remission. In addition, these factors, together 
with comorbid anxiety and—to a lesser extent—affective disorders, 
were associated with significantly reduced remission rates. 

With adequate treatment [which includes ERP, family guidance, 
encouragement to return to full activity, and therapeutically ap- 
propriate (that is, medium to high) doses of SRI], a significant per- 
centage of patients show considerable to full improvement in their 
symptoms [91]. Prognosis in children is even better, with up to 80% 
achieving partial or full remission in a 3-year prospective study [92]. 
Good prognosis is associated with compliance to treatment of both 
the family and the patient, good social and occupational adjustment, 
and less avoidance [45]. The obsessional content does not seem to be 
related to the prognosis, except for hoarding, which is usually con- 
sidered to have a less favourable outcome. 


Conclusions 


OCD is a chronic condition, considered by the World Health 
Organization as one of the ten most disabling disorders. Two main 
changes took place in DSM-5 in relation to OCD: the creation of 
the ‘Obsessive-compulsive and related disorders’ cluster and the 
addition of two specifiers to the diagnosis procedure—the degree 
of insight [ranging from good or fair to absent (delusional)] and the 
presence (or absence) of tics. OCD is a worldwide disorder, with 
substantial similarity across cultures in symptom clusters, gender 
distribution, age of onset, and comorbidities. Mood disorders and 
other OCRDs are among the most common comorbidities in OCD. 
While the life prevalence of OCD is about 2% of the population, the 
life prevalence of OCRD is about 9%. 

The mean age at onset of OCD is late adolescence, with a range of 
10-22 years. Females are overrepresented in late-onset OCD, while 
in the earlier onset of illness grouping, males are more representa- 
tive. Early onset and long duration of illness have been reported to 
be associated with poor prognosis. OCD is associated with substan- 
tial comorbidities, among which anxiety, mood disorders, and other 
OCRDs stand with the highest prevalence. 

As OCD symptoms are usually embedded within the related 
culture, Internet-related compulsion is becoming more and more 
frequent. Taking into account its diverse expressions and its ego- 
dystonic manner/presentation (which make it difficult for patients 
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to come forward with their symptoms), the diagnosis, as well as dif- 
ferential diagnosis, requires careful and thorough evaluation. 
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Clinicians have long observed that many of their patients with 
obsessive-compulsive disorder (OCD) have relatives with the dis- 
order, and they have suspected that hereditary transmission under- 
lies this familial aggregation. In an early report in 1936, of 50 cases 
of ‘obsessional neurosis, Lewis reported that 37% of the parents 
and 21% of the siblings of these cases had pronounced obsessional 
traits; moreover, he noted two pairs of identical twins who were con- 
cordant for obsessive-compulsive features [1]. Additional clinical 
case series of OCD patients and their twins and other relatives have 
provided presumptive evidence of the inheritance of OCD. Based 
on these early studies, Slater concluded in 1964 that ‘of all neurotic 
syndromes, the evidence relating to genetic predisposition is best in 
the case of obsessional neurosis’ [2]. 

Since then, abundant evidence has been provided from gen- 
etic epidemiological (twin and family) studies to support a genetic 
contribution to OCD [3]. In contrast, apart from relatively rare in- 
stances of OCD associated with brain trauma, ischaemia, tumours, 
perinatal factors, or B-haemolytic streptococcal infections [4-6], 
there is limited evidence for a major contribution of environmental 
factors to most cases of OCD. Therefore, the discovery of specific 
genes and the functional relevance of proteins encoded by them hold 
promise for elucidating the molecular pathogenesis of OCD and for 
developing rational, mechanism-based approaches to its treatment. 

Molecular genetic studies of OCD and other neuropsychiatric 
conditions have advanced rapidly and sequentially over the past 
20 years, due to developments in genotyping technology and com- 
putational genetics. Initially, efforts were focused on specific candi- 
date genes hypothesized to be functionally related to the disorder 
in question. Then, genetic linkage studies predominated; as more 
genetic markers were identified throughout the genome, genotyping 
became more rapid and less costly, and statistical genetic methods 
developed. During this scientific phase, the thought was that rela- 
tively few genes, that is, those with a ‘major effect, make a major 
contribution to the risk of OCD. Subsequently, with recognition that 
neuropsychiatric conditions are genetically complex, the ‘common 
disorder-common gene’ hypothesis became predominant, which 
posited relatively frequent genetic variants, each with a small effect 
on the risk of disease, ushering in an era of genome-wide association 
studies (GWAS). As sample sizes increased to the tens of thousands, 





relevant findings emerged for several of the complex neuropsychi- 
atric disorders; for example, with a sample of 37,000 cases, 108 gen- 
etic polymorphisms were found to be significantly associated with 
schizophrenia [7]. Nevertheless, these variants, although numerous, 
appeared to explain only a small proportion of the heritability of 
these diseases. With decreasing cost and speed of genotyping, and 
increasing sophistication of computational genetic methods, whole- 
genome and exon-wide sequencing studies of OCD are now in 
progress. 






































At least 15 family studies of OCD have been conducted, and their 
findings support the familial aggregation of OCD. These include 
the most recent family studies, which used clear diagnostic criteria, 
direct assessment of family members, and inclusion of families of 
case as well as non-OCD control probands (that is, the index case 
or the first person identified in the family). For example, Black 
and colleagues found that the risk of ‘broadly defined OCD’ (that 
is, OCD plus subsyndromal obsessive-compulsive symptoms) 
was substantially greater in the parents of OCD probands (that is, 
index cases or the first person identified in the family) than parents 
of controls (16% vs 3%) [8]. Pauls and colleagues found a higher 
risk of OCD in first-degree relatives of OCD probands than con- 
trols (10% vs 2%) [9]. The Johns Hopkins Family Study also found a 
similarly higher prevalence of OCD in first-degree relatives of case 
than control probands (12% vs 2% [10], as did Grabe and colleagues 
in a European family study (10% vs 1%) [11]. Several studies have 
found even greater differences in the prevalence of OCD in the re- 
latives of case vs control paediatric probands—for example, 23% 
vs 3% in the study by Hanna and colleagues [12]—suggesting that 
early-onset OCD may differentiate a strongly inherited subtype 
of the disorder. In addition, several family studies have found that 
additional disorders were more prevalent in relatives of OCD than 
control probands, including generalized anxiety disorder, body dys- 
morphic disorder, trichotillomania, pathologic skin picking, tic dis- 
orders, and obsessive-compulsive personality disorder, suggesting 
that these may be part of the spectrum of conditions aetiologically 
related to OCD in these families [13-15]. 
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A statistical approach—complex segregation analysis—has been 
used to determine if the transmission of OCD in families is consistent 
with a Mendelian mode of inheritance. The five published studies 
support a Mendelian model, with transmission of a gene, or genes, of 
major effect. Three of the studies found that the data were most con- 
sistent with a dominant transmission model; however, the data could 
not reject alternative models such as codominant or recessive [16-18]. 


Twin studies 


Although evidence of familial aggregation is a necessary condi- 
tion for a genetic contribution to OCD, it is not sufficient, since it 
does not rule out aggregation due to common exposures of family 
members to possible environmental factors related to the disease. 
Therefore, twin studies have been used to contrast concordance of 
OCD in monozygotic (identical) twins and dizygotic (fraternal) 
twins, under the assumptions that: (1) monozygotic twins are gen- 
etically identical; (2) dizygotic twins share only 50% of their genes, 
on average; and (3) both types of twins share similar environments. 
‘Thus, greater concordance of OCD symptoms in identical than non- 
identical twins provides further support for a genetic influence. 
Using the Maudsley Twin Registry, concordance of ‘obsessive symp- 
toms or features’ was found in 13 of 15 (87%) monozygotic twin 
pairs, but only seven of 15 (47%) of dizygotic twin pairs [19]. 

Moreover, twin studies have been used to estimate the ‘herit- 
ability’ of OCD, that is, the proportion of variation in dimensional 
obsessive-compulsive symptom scores that is explained by genetic 
effects, rather than by environmental influences. For example, in 
4564 4-year-old twin pairs rated by their parents, the estimated her- 
itability of obsessive-compulsive behaviours was 65%, with the re- 
maining 35% of the variance attributed to unshared environmental 
influences [20]. Similarly, in 4246 child twin pairs, the estimated 
heritability of parent-rated obsessive-compulsive symptoms was 
45-61%, with unique environmental influences of 42-55% [21]. 

Thus, twin studies provide further support for a substantial gen- 
etic contribution to OCD. However, unshared environmental factors 
also appear to contribute to OCD and interaction between genetic 
and environmental factors may be important. Adoption studies 
comparing twins raised together and apart would provide further 
evidence of the importance of genetic and environmental factors in 
OCD, although no such studies have been reported to date. 


Molecular genetic studies 


Linkage studies 


The aim of genetic linkage studies in OCD is to identify chromo- 
somal regions that contain genes for the disorder, by statistically 
testing whether specific genetic marker alleles cosegregate (that is, 
‘travel together’) with OCD in families. Whereas parametric ap- 
proaches analyse genetic recombination events and need to assume 
the mode of inheritance, non-parametric approaches evaluate the 
extent of sharing of marker alleles between affected relatives and do 
not require assumption about the mode of inheritance. If a linkage 
peak is identified, ‘fine mapping’ is often conducted by including 
additional markers to further narrow the linkage region, and family- 
based association analyses may be conducted to evaluate associ- 
ations between specific single-nucleotide polymorphisms (SNPs) 
within these regions and OCD. 


Hanna and colleagues were the first to report a linkage scan in 
OCD, which studied 56 individuals in seven families with a paedi- 
atric proband. Using parametric linkage analysis and assuming a 
dominant transmission model, they identified a linkage peak in the 
9p24 region on chromosome 9. This region harbours the glutamate 
transporter gene SLC1A1, which is involved in neurotransmission 
and is a potential candidate gene for OCD [22]. This finding was 
replicated by Willour and colleagues in a linkage analysis in 50 OCD 
pedigrees, using the same 13 genetic markers in the region used by 
the Hanna group, and pedigree-based association analyses identi- 
fied two markers associated with OCD in this region [23]. These 
linkage findings stimulated further studies of SLC1A1 as a candidate 
gene for OCD, as described in the next section. 

A genome-wide linkage study was conducted in 219 families 
with multiple relatives affected with OCD, mostly affected sibling 
pairs, as part of the OCD Collaborative Genetics Study. Using non- 
parametric linkage analysis, the strongest suggestive linkage signal 
was found on chromosome 3, in the 3q27-28 region; other sug- 
gestive linkage signals were found in regions on chromosomes 1, 
6, 7, and 15 [24]. Further analyses provided evidence for different 
linkage patterns in these families, depending on specific phenotypic 
characteristics of probands or families. For example, after stratifying 
families by proband gender, Wang and colleagues found suggestive 
linkage to the chromosome 11p15 region in families with male 
probands, and the linkage signal became substantially stronger after 
fine mapping in this region [25]. In another analysis, families were 
stratified into those with two or more individuals with compulsive 
hoarding behaviour and those with one or fewer hoarding relatives. 
In the hoarding-loaded families, there was significant linkage on the 
chromosome 14q23-32 region [26]. 

Findings from linkage studies have provided evidence linking 
specific chromosomal regions to OCD and suggest that genetic 
linkage may be stronger for specific phenotypic subtypes of the dis- 
order. However, other than the findings for the hoarding phenotype, 
all of the regions identified so far have not met the threshold for 
‘statistically significant’ linkage. The linkage finding in the 9p24 re- 
gion was particularly intriguing, given that this region harbours a 
functional candidate gene for OCD. However, the 9p24 region spans 
over 7 million base pairs and contains dozens of potential OCD can- 
didate genes; moreover, this region was not identified in the largest 
genome-wide linkage scan, which found several other regions more 
strongly linked to OCD. It may be that a greater number of families 
are required to detect and replicate statistically significant linkage 
findings. Furthermore, as has emerged from genetic studies of other 
psychiatric disorders, including schizophrenia and bipolar disorder, 
OCD may have a complex pattern of inheritance, with multiple 
genes each exerting a small effect, and genes of ‘major effect’ may be 
rare in OCD and may occur in only relatively few families. 


Candidate gene studies 


Genes have been identified as potential candidate genes for OCD, 
based on their location within linkage peaks and/or knowledge of the 
pathophysiology and pharmacology of the disorder. The neuronal 
glutamate transporter gene SLC1A1 was hypothesized as a primary 
candidate gene for OCD, based on its location within a linkage peak, 
as described; the protein it encodes—a neuronal transporter; and 
its high expression in the cerebral cortex, striatum, and thalamus, 
which are regions connected in functional circuits are implicated in 


OCD. Several research groups have reported associations between 
OCD and SNPs in this gene, while another found an association 
with SNPs about 9 kilobases ‘upstream’ of the gene. Moreover, the 
findings were specific to families with male probands or to males 
affected with OCD [27]. These replicated association findings 
implicating the SLC1A1 gene in OCD are intriguing, and further 
investigations of this and possibly other interacting glutamatergic 
genes are warranted. 

Given the therapeutic effect of serotonin reuptake inhibitors 
(SRIs) in the treatment of OCD, the serotonergic system also has 
been a major focus of candidate gene studies. Polymorphisms in 
the promoter region of the serotonin transporter gene (SHTTPLR), 
5-hydroxytryptamine (serotonin) receptor 1D (SHT1-D), and 5- 
hydroxytryptamine (serotonin) receptor 1D (SHT2C) have been 
reported associated with OCD. Of these the serotonin transporter 
polymorphism (SHTTPLR), located on chromosome 17, has re- 
ceived the most attention. Findings from these association analyses 
have unfortunately been conflicting. It is interesting that the short 
(s) allele of this polymorphism is consistently reported associated 
with anxiety, depression, and neuroticism, whereas it is the long 
(1) allele often reported to be associated with OCD, although other 
studies have implicated the S-allele, rather than the L-allele, in OCD. 
Nevertheless, meta-analyses of polymorphisms in the 5HTTPLR 
and 5HT2 genes support an association with OCD [28]. 

Several studies have found an association between OCD and al- 
leles of the dopamine receptor 4 (DRD4) gene on chromosome 11. 
In some studies, the association was specific to a phenotypic sub- 
type, for example tics or early onset of OCD. Other studies, however, 
have been negative, as have studies of other dopaminergic genes, 
including DRD2, DRD3, and the dopamine transporter SLC6A3. 
Association studies of other genes have been reported, including 
the catechol-O-methyltransferase (COMT) gene, the monoamine 
oxidase A (MAO) gene, the brain-derived neurotrophic factor 
(BDNF) gene, and the oligodendrocyte lineage transcription factor 
2 (OLIG2) gene. Some of the positive findings have been gender- 
specific. In general, however, the positive findings have not been 
consistently replicated [28]. 

Thus, although there have been many reports of a positive asso- 
ciation between functional candidate genes and OCD, the lack of 
consistent replication ‘in the majority of OCD psychiatric genetic 
association studies may seem discouraging, as noted by Hemmings 
and Stein [29]. At least part of the inconsistency may be due to dif- 
ferences between study samples, including clinical and underlying 
genetic heterogeneity, as well as methodology differences, including 
sample size and statistical power. Taylor conducted a comprehen- 
sive meta-analysis of 230 polymorphisms that had been investi- 
gated in 113 genetic association studies of OCD. The analysis found 
that OCD was significantly associated with serotonin-related (5- 
HTTLPR and HTR2A) and, in males only, catecholamine-related 
(COMT and MAOA) genes; an additional 18 polymorphisms had 
significant odds ratios. The findings support a polygenic model of 
OCD, in which multiple genes each have a small, incremental effect 
on the risk for development of OCD [28]. 


Genome-wide association studies 


GWAS of OCD involve scanning hundreds of thousands, or even 
millions, of SNPs across the genome to detect relatively common 
genetic variants associated with the disorder. The studies have 
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compared the variants in individuals with and without OCD or have 
focused the analysis on ‘trios, affected individuals and their parents, 
whether affected or unaffected. 

Three GWAS of OCD have been reported to date. The International 
OCD Foundation Genetics Collaborative (IOCDF-GC) analysed 
almost 1500 OCD cases, 5600 controls, and 400 trios with nearly 
500,000 SNPs. In the case-control analysis, the two most significantly 
associated SNPs were located within the DLGAP1 gene, although no 
association had ‘genome-wide significance’ after correcting for the 
large number of comparisons. In the analysis of trios, but not in the 
combined case-control-trio analysis, an SNP near the BTBD3 gene 
was found to be significantly associated with OCD [30]. 

The OCD Collaborative Genetic Association Study (OCGAS) 
analysed over 1000 families and 500,000 SNPs. Although no asso- 
ciation was found with genome-wide significance, the most signifi- 
cant finding was for an SNP on chromosome 9, near the protein 
tyrosine phosphate receptor D (PTPRD) gene. Several additional 
candidate genes emerged from these two studies, including Fas 
apoptotic inhibitory molecule 2 (FAIM2), glutamate ionotropic re- 
ceptor NMDA type subunit 2B (GRIN 2B), and the cadherin genes 
CDH9 and CDH10. Several of these genes appear to be involved in 
glutamatergic neurotransmission [31]. 

The third GWAS analysed almost 7000 individuals and 31,000,000 
SNPs as part of the Netherland National Twin Registry, using a quan- 
titative measure of obsessive-compulsive symptoms. A genome- 
wide significant finding was found for an SNP in the myocyte 
enhancer factor 2B neighbour (MEF2BNB) gene in region 19p13 on 
chromosome 19. Additional gene-based testing found four signifi- 
cantly associated genes in the same region—MEF2BNB, MEF2B and 
MEF2BNB-MEF2B, and RFXANK [32]. 

Several analytic approaches have recently been developed to es- 
timate heritability based on genotype information collected in 
GWAS, that is, SNP-based heritability. Using the genome-wide com- 
plex trait analysis (GCTA) approach, the estimated heritability of 
quantitative obsessive-compulsive symptoms was 34%, based on 
the Netherlands twin genotypes. In another study, the estimated 
heritability of OCD was 37%, based on OCGAS genotypes. Also it 
was found that common alleles contribute more to the heritability of 
OCD than did rarer alleles and that certain chromosomes contribute 
more than others to OCD risk [33]. 

The experience with other disorders, particularly neuropsychi- 
atric conditions, with complex patterns of inheritance, indicates 
that sample sizes for GWAS need to be an order of magnitude larger 
than those in the three studies reported here. This is because the 
magnitude of the association (odds ratio) between common gen- 
etic variants and these disorders is in the range of 1.0-1.4. GWAS 
in schizophrenia, for example, required samples in the tens of thou- 
sands before significant associations were detected. Meta-analyses of 
samples from several studies are currently being conducted, in order 
to increase the power to detect significant associations in OCD. 


In search of rare variants 


Among the genetic complexity of OCD is genetic heterogeneity, that 
is, the pattern of genetic variation may be different in different cases 
of OCD. Although GWAS search for the more common genetic 
variants, the search continues for less common variants associated 
with the disorder, which may have a ‘major effect’ and provide in- 
sight into the pathophysiology of the disorder. Three types of these 
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variants have been reported in OCD: chromosomal rearrangements, 
copy number variants (CNVs), and rare variants identified by ‘deep 
sequencing’ of the genome. 

There are few reports of chromosomal rearrangements in OCD, 
and most are in cases with co-occurring Tourette’s syndrome (TS). 
Bertelsen and colleagues reported a 21-year-old male patient with 
TS, OCD, and attention-deficit/hyperactivity disorder. The patient 
inherited a chromosome 3 to 9 translocation t(3;9) (q25.1;q34.3) 
from his mother, who had a complex motor tic. Sequencing revealed 
that the translocation breakpoints truncated the olfactomedin 1 
(OLFM1) gene, which plays a role in neuronal development and is 
a likely candidate gene for other neuropsychiatric disorders [34]. 
Devor and Magee reported a balanced chromosomal translocation 
t(1;8) (q21.1; q22.1) in siblings multiply affected with TS, OCD, 
complex motor tics, and attention-deficit disorder [35]. 

McGrath and colleagues reported the only GWAS of large 
(>500 kilobases), rare (1%) CNVs in OCD. They found that the 
proportion of individuals with deletions in known pathogenic 
neurodevelopmental loci was four times greater in patients with 
OCD than in controls. Deletions in the 16p13.11 region, which con- 
tributed disproportionately in the OCD cases, have been implicated 
in other neuropsychiatric conditions [36]. Since this was a case- 
control study, it could not distinguish between inherited and de novo 
variants, which would require a study of families. (A de novo variant 
is a new mutation that occurs for the first time in a family member, 
arising from a germ cell mutation in one of the parents or in the fer- 
tilized egg during embryogenesis.) De novo mutations in sporadic 
cases provide more certainty with respect to the causal nature of the 
variant, and such studies are in progress in OCD. 

‘Deep sequencing’ involves sequencing a genomic region mul- 
tiple times, allowing detection of rare variants, including de novo 
variants. To date, only one study has been reported for OCD. Cappi 
and colleagues sequenced all genome-coding regions in 20 sporadic 
OCD cases and their unaffected parents in order to identify rare de 
novo single-nucleotide variants. The study found a higher than ex- 
pected rate of de novo non-synonymous variants (that is, those that 
alter the amino acid sequence of a protein) in OCD cases. Analysis 
of the protein-protein interaction network suggested an enrichment 
of genes involved in immunological and central nervous system 
functioning and development [37]. Other genetic deep sequencing 
studies are in progress. As with CNVs, both inherited and de novo 
variants can be studied. It is clear that much larger sample sizes will 
be necessary to test the significance of rare causal variants. The iden- 
tification of rare genetic variants through sequencing studies may 
be aetiologically relevant for smaller subgroups of cases, which may 
exhibit different treatment response profiles. 


Alternate phenotypes 


An important consideration in investigating the genetic aetiology of 
OCD is the definition of the phenotype. Thus far, we have focused 
on the disorder OCD as the phenotype of interest in genetic studies. 
However, OCD is quite clinically heterogenous, with different cases 
exhibiting different constellations of symptoms, which could result 
from different underlying genetic or environmental aetiologies. 
In addition, OCD might share the same genetic basis with co- 
occurring disorders. Moreover, there may be subclinical dimensions 


that are more proximal than the OCD diagnosis to the ultimate gen- 
etic causes of the disorder. The challenge is to identify more genet- 
ically homogenous clinical subtypes, genetically related disorders, 
and genetically relevant subclinical dimensions that maximize the 
elucidation of the underlying genetic aetiology. 


Clinical subtypes 


In the preceding sections, we discussed several clinical features that 
may be useful for subtyping in genetic studies of OCD. For example, 
several family studies have reported that the relative odds of OCD 
in relatives are substantially greater in families of paediatric prob- 
ands or of adult probands with onset in childhood or adolescence 
[9, 10, 12]. Several candidate gene studies have found stronger as- 
sociations for one gender than the other; for example, some studies 
found an association with the COMT gene in women, but not men, 
while others found an association in men, but not women [38, 39]. 
Association studies of the SLC1A1 gene have found a stronger as- 
sociation in men, but not women [27]. The OCGS study found sig- 
nificant linkage of OCD to a region of chromosome 14 in families 
with two or more hoarding relatives, but not in other families [26]. 
These and other potential phenotypes might be the focus of future 
genome-wide and newer-generation association studies. However, 
clinical subtyping increases the cost of studies, given the larger 
number of affected OCD cases with specific subtypes that must be 
identified and the increased time required for clinical examinations. 


Cross-disorder phenotypes 


Several psychiatric disorders have been proposed to be related to 
OCD. This is an example of pleiotropy, that is, that the same genetic 
background can be expressed as clinically different disorders. The 
evidence for these relationships has typically been based upon the 
co-occurrence of these disorders with OCD, as well as by phenom- 
enological similarities or treatment response profiles. A stronger 
case for a genetic relationship between OCD and these disorders is 
provided for by family studies, in which the relatives of OCD cases 
have greater prevalences of these other psychiatric conditions than 
do the relatives of controls. As discussed, several family studies have 
found that generalized anxiety disorder, body dysmorphic disorder, 
trichotillomania, pathologic skin picking, and obsessive-compulsive 
personality disorder and additional disorders were more prevalent 
in relatives of OCD than control probands [13-15]. 

There has been much interest in the genetic relationship between 
OCD and TS and other tic disorders. Tic disorders and OCD fre- 
quently co-occur, and family studies have found that the prevalence 
of OCD is significantly higher in TS than control relatives, whether 
or not the TS-affected proband had OCD, supporting the hypoth- 
esis that the disorders share genetic risk factors [40]. More recent 
analyses of SNPs collected in GWAS found a genetic correlation 
between OCD and TS of 0.41, comparable to heritability estimates 
based on twin and family studies [33]. However, Yu and colleagues 
found that although genetically related, the two disorders had dis- 
tinct genetic components and OCD with co-occurring tic disorders 
may be genetically different from OCD alone [41]. 


Endophenotypes 


Like many other psychiatric disorders, OCD appears to be aetiologic- 
ally complex, with suspected involvement of multiple genes, epigen- 
etic factors, and environmental influences, which has complicated 


the search for specific genes and gene networks that may ultimately 
underlie the disorder. An endophenotype, or ‘intermediate pheno- 
type; is a biological marker that lies intermediate in the aetiologic 
pathway between genes and the clinical phenotype, that is, the 
signs and symptoms of the disorder. A psychiatric endophenotype 
may be biochemical, neuroanatomical, neurophysiological, neuro- 
psychological, or behavioural. A useful candidate endophenotype is 
associated with the disorder, is heritable, cosegregates with the dis- 
order within families, and occurs in non-affected, as well as affected, 
family members with greater frequency than in controls. It is a ‘trait’ 
marker underlying the disorder and independent of symptoms [42]. 

A large number of studies have found differences between 
OCD cases and non-cases in a variety of neurocognitive domains, 
including response inhibition, planning, decision-making, working 
memory, and visuo-spatial abilities [43]. Several of these domains 
involve functioning of orbitofrontal-striatal-thalamic networks in 
the brain, which imaging studies have implicated in OCD. In one 
of the first studies, Menzies and colleagues measured perform- 
ance on a response inhibition task (Stop-Signal task) in 31 OCD 
patients, 31 of their first-degree relatives, and 31 non-OCD con- 
trols and their first-degree relatives. They found that OCD patients 
and their relatives had significantly longer reaction times than the 
controls, indicating reduced ability to inhibit their response. Using 
magnetic resonance imaging (MRI), they also found that the im- 
pairment in response inhibition was significantly associated with 
reduced grey matter in the orbitofrontal cortex in right inferior 
brain regions and increased grey matter in the cingulate, parietal, 
and striatal regions. Moreover, there were significant familial effects 
for reaction times and brain volumes in these regions [44]. Several 
later studies, using a variety of neurocognitive paradigms, as well 
as structural and functional imaging approaches, have supported 
deficits in inhibitory control, as well as structure, function, and 
connectivity in corticostriatal-thalamic-cortical circuits, as candi- 
date endophenotypes for OCD [45]. 

Another neurocognitive domain of recent focus is performance 
monitoring. Many studies have found that OCD cases show a nega- 
tive deflection of the event-related potential following making an 
error on a flanker task. Recently, error-related hyperactivity of the 
anterior cingulate cortex has been found in OCD cases and their 
relatives, compared to controls [46, 47]. More recently, higher- 
level executive functions have been investigated as candidate 
endophenotypes for OCD. For example, Zhang and colleagues 
found that OCD cases and their unaffected first-degree relatives had 
deficits in decision-making under ambiguous conditions, as well as 
in planning [48]. 

Although these findings are promising, additional family 
studies with greater numbers of cases and relatives are needed to 
determine the heritability and segregation patterns of candidate 
endophenotypes in OCD families. Those candidates and families 
with the strongest presumptive evidence for genetic transmission 
would then be useful for genetic association and sequencing studies 
to elucidate specific genetic aetiologies. 


Pharmacogenetics 


A substantial proportion (40-60%) of OCD patients show poor re- 
sponse to SRIs, and others have significant adverse side effects that 
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make treatment intolerable. Genetic variation appears to be an im- 
portant contributor to antidepressant response in major depression, 
but less is known about the genetic contribution to pharmacologic 
treatment response to SRIs in OCD. Some studies have reported as- 
sociations between treatment response and SNPs in genes known 
to be involved in serotonergic, dopaminergic, or glutamatergic 
neurotransmission (for example, HTR1B, 5-HT2A, HTR2A, BDNF, 
SLCIA1, SLC6A4, COMT). However, findings mostly have been 
based on small samples, have small effect sizes, and have not been 
replicated, and further studies are needed [49, 50]. 

Recently, results from a GWAS of SRI treatment response in nearly 
1600 OCD-affected individuals in over 1000 families were reported. 
The most strongly associated SNP is located near the DISP1 gene, 
and two other moderately associated SNPs are near the PCDH10 
gene; both genes are known to be involved in cell-cell adhesion. An 
additional 35 SNPs with signals of potential significance were in sev- 
eral genes expressed in the brain and implicated in neuronal devel- 
opment, psychiatric disorders, or drug effects, including GRIN2B, 
GPC6, NTM, PARK2, PLCB1, and PKC [51]. 

There has been important work on genes coding for liver enzymes 
involved in drug metabolism, specifically among the polymorphic 
cytochrome P450 enzymes. There is substantial genetic variation in 
the CYP2D6 and CYP2C19 enzymes, for instance. This can affect 
the metabolism of selective SRIs and lead to a range of treatment re- 
sponse in OCD patients with different genetic alleles [52]. Different 
medications, even those within the same class, could therefore have 
different effects, from reduced efficacy to increased liability to tox- 
icity. The clinical utility of testing for these alleles remains controver- 
sial, but the implications need to be borne in mind by practitioners. 


Genetic studies in animals 





Canine compulsive behaviours 


Ethologists have described stereotypical behaviours in non-human 
mammals that appear to be excessive, maladaptive variants of normal 
behaviours involved in grooming, predation, eating and suckling, or 
locomotion. Among the best studied are canine compulsive behav- 
iours, including acral lick dermatitis, tail chasing, flank sucking, and 
pacing and circling. A GWAS of 92 Doberman pinschers with com- 
pulsive flank and/or blanket and sucking behaviours and 68 control 
dogs found a significant genome-wide association in chromosome 
7, with an SNP in the CDH2 gene, which codes for cadherin, a 
neuronal adhesion protein. Dogs having both blanket and sucking 
behaviours had the greatest frequency of the risk allele (60%), com- 
pared to 43% in dogs with only one behaviour and 22% in unaffected 
dogs [53]. Using a more powerful genotype-calling algorithm, 13 
new OCD-associated regions were found, in addition to the CDH2 
locus; these regions included genes involved in catenin-binding and 
regulation of dendrite morphogenesis. Sequencing these regions 
found variants specific to dogs with OCD and were significantly 
more common in dog breeds at high risk for OCD; four genes in 
these regions are involved in synaptic function: neuronal cadherin 
(CDH2), catenin alpha2 (CTNNA2), ataxin-1 (ATXN1), and plasma 
glutamate carboxypeptidase (PGPC). Several of these genes appear 
to be functionally connected to several of the SNPs most strongly 
associated with OCD in a recent genome-wide OCD study in hu- 
mans [30] and involved in glutamatergic signalling pathways [54]. 
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Recently, variants in the CDH2 gene have been reported to be asso- 
ciated with OCD in humans [55]. 


Knockout mice models 


A mouse model of compulsive grooming behaviour has been de- 
veloped by genetic deletion of the SAPAP3 gene, which codes for a 
scaffolding protein at excitatory synapses and is highly expressed in 
the striatum. Knockout mice express increased anxiety and compul- 
sive grooming behaviour, leading to facial skin lesions and hair loss, 
and have reduced activity in cortico-striatal synapses, which com- 
prise the majority of glutamatergic synapses in the striatum. When 
viruses containing the gene were injected into the striatum of these 
mice, excessive grooming behaviour, lesion severity, and anxious 
behaviour were reduced and cortico-striatal synaptic transmission 
increased [56]. In humans, a strong association between SNPs in the 
SAPAP3 gene and grooming behaviours in OCD families has subse- 
quently been reported [57], and mutations in SNPs in the SAPAP3 
gene have been found to be more frequent in individuals with OCD/ 
trichotillomania than controls [58]. Mice with deletions of other 
genes involved in cortico-striatal functioning, including SLITRK5, 
HOX-B8, and SLC1A 1, have been studied as possible models of com- 
pulsive grooming behaviours and OCD [59]. 


Conclusions and future directions 





Evidence from epidemiological and molecular genetic studies 
strongly supports genetic susceptibility to OCD. The working hy- 
pothesis is that OCD, like several other neuropsychiatric disorders, 
has a complex pattern of inheritance that involves: many common 
genetic variants, each of which is expected to have a relatively small 
overall impact (effect size) on the risk of disease; rare genetic vari- 
ants, each with potentially greater impact; and environmental fac- 
tors, for which there are few, if any, supportable candidates thus far. 
However, to date, except for a few candidate genetic variants, such as 
SLCIA1, DLGAP1, and PTPRD, there is no established variant with 
unequivocal evidence for association with OCD. Many of the ap- 
proaches outlined in this chapter, such as GWAS and whole-exome/ 
genome sequencing, are in progress and are likely to be productive 
as sample sizes increase. 

It should be borne in mind that identifying associated genetic 
variants is only the initial step in elucidating the pathogenic pro- 
cess underlying OCD. Understanding the functional basis of these 
findings will rely on additional experimental approaches, sev- 
eral of which are currently available and others being developed, 
for integrating the information obtained from multiple domains, 
including gene sequence, gene expression, gene regulation, and 
gene-gene and protein-protein interaction networks [60]. Genetic 
expression studies in brain tissues from relevant brain regions and 
application of induced pluripotent stem cell (iPSC) technology will 
help identify relevant variants and elucidate their functional rele- 
vance. Epigenetic studies will be important for elucidating how 
modification of gene expression, apart from variation in the gene 
sequence, contributes to the risk of OCD. The Encyclopedia of DNA 
Elements (ENCODE) programme of the US National Institutes of 
Health, the goal of which is to identify all functional elements in the 
human genome, is one example of a programme that will facilitate 
this progress [61]. 


It is likely that this progress will result in a redefinition of the 
phenotype of OCD and related conditions. We anticipate a pheno- 
type that will differentiate clinical subtypes within the existing diag- 
nostic categories, will expand across current diagnostic boundaries, 
and ultimately will be based on genetic criteria. We also envision 
a future in which understanding of the molecular pathophysiology 
will catalyse the development of rational treatments and preventive 
measures for OCD. 
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For the past several decades, neuroimaging techniques, such as 
functional magnetic resonance imaging (fMRI) and positron emis- 
sion tomography (PET), have been used to investigate neurocircuit 
functioning in OCD. While earlier studies examining neural re- 
sponses to symptom provocation have identified altered brain ac- 
tivity in OCD and are clearly relevant for the disorder, they do not 
address whether basic cognitive—affective mechanisms are impaired 
in the absence of direct symptom exacerbation. In this article, we 
discuss findings from cognitive neuroscientific studies probing 
basic cognitive—affective constructs that may underlie the complex 
phenomenology of OCD, including conflict and error monitoring, 
response inhibition, task switching and reversal, decision-making, 
and reward processing. Although several cognitive neuroscientific 
studies that are not discussed here have used task paradigms that 
tap into theoretically driven core mechanisms of OCD (including 
impaired fear extinction [1] or altered habit formation [2, 3]) , we 
focus the main part of this review on those topics where at least four 
studies compared adults with OCD with a healthy control group. 
We conclude by discussing findings in the context of OCD hetero- 
geneity and reviewing the small literature comparing functional 
neuroimaging findings between OCD subtypes. Our review is com- 
plementary to the description of brain circuits in Chapter 94. 






































Much research into the cognitive neuroscience of OCD has taken 
place in the field of conflict monitoring and error detection. This 
approach is based on the proposal that obsessions are caused by an 
overactive conflict or error signal continually telling the patient that 
‘something is wrong’ despite evidence to the contrary [4]. In this 
view, compulsions are behaviours that attempt to reduce this height- 
ened conflict signal or to correct perceived errors. Cognitive conflict 
is typically studied in tasks where there is a mismatch between what 
a subject would automatically do (that is, a prepotent response) and 
what is required in the task. The classic example of a conflict moni- 
toring paradigm is the Stroop task where subjects must make a re- 
sponse according to the font colour of a word, where the word itself 
is the name of a colour that is different from the font colour (that 





is, the word ‘blue’ written in red font). In this case, the prepotent 
response is to read the name of the word, yet the task requires a re- 
sponse according to the colour of the word, which creates conflict 
that significantly increases response times, compared to trials where 
the colour and word name are the same [5]. Conflict monitoring 
in healthy controls fairly consistently implicates the dorsal medial 
frontal cortical (MFC) regions including the dorsal anterior cingu- 
late cortex (dACC) and the supplementary motor area (SMA) and 
pre-SMA [6-9], yet results from the many neuroimaging studies 
investigating this process in OCD do not present a coherent picture. 
While some investigations have indeed found hyperactivity of the 
dorsal MFC regions during conflict in OCD [10, 11], other studies 
have identified reduced activity in this region [6-9, 12, 13] or no dif- 
ferences between patients and controls [14-16]. Many studies have 
reported differences between patients and controls during conflict 
monitoring in several other brain regions outside the dorsal MFC, 
including the parietal cortex [17, 18], inferior frontal gyrus/insula 
[19], ventral/rostral regions of the MFC [11, 20, 21], lateral [13, 17] 
and medial [18] temporal cortex, and striatum including the caudate 
nucleus and putamen [12, 13]. However, these results are often 
contradictory, with some studies finding activations to be greater in 
OCD than controls and other studies reporting the opposite effect. 
Errors reflect a specific instance of conflict where the intended 
or correct response does not match the actual response made by 
the subject. Similar to conflict monitoring, errors elicit activation 
in the dorsal MFC regions, including the dACC and SMA. In add- 
ition, errors tend to elicit an emotional reaction related to frustra- 
tion, disappointment, or fear of punishment and typically activate a 
broad range of brain regions, including the bilateral anterior insula, 
the rostral ACC (rACC), and the ventromedial prefrontal cortex 
(vmPFC), the dorsolateral prefrontal cortex (DLPFC), and the 
orbitofrontal cortex (OFC) [22]. The anterior insula, rACC/vmPFC, 
and OFC have been associated with valuation and emotion [23-26]; 
as such, activation of these brain regions to errors may reflect the 
neural processing of the emotional/motivational significance of 
mistakes [27]. Unlike the conflict processing literature, data some- 
what consistently point to an enhanced error detection mechanism 
in OCD. The error-related negativity (ERN), a negative event-related 
potential (ERP) that peaks approximately 100 ms after an error and 
is thought to be generated in the dorsal MFC, is consistently larger 
in OCD than controls (for a review, see [28]). In fMRI studies, OCD 
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Fig. 97.1 (see Colour Plate section) (a) Results from [15] showing greater error-related activity in the ventromedial prefrontal cortex and anterior insula 


in OCD patients, compared to controls. 


Reproduced from Biol Psychiatry, 69(6), Stern ER, Welsh RC, Fitzgerald KD, et al., Hyperactive error responses and altered connectivity in ventromedial and frontoinsular 


cortices in obsessive-compulsive disorder, pp. 583-91, Copyright (2011), with permission from Society o 


(b) Results from [54] showing regions activated for (successful) reversal (areas in yellow), 


for both OCD patients and their unaffected relatives, compared to controls (areas in blue). 


Biological Psychiatry. 
overlaid with orbitofrontal and parietal areas showing reduced activity during reversal, 


Reproduced from Science, 321(5887), Chamberlain SR, Menzies L, Hampshire A, et al., Orbitofrontal Dysfunction in Patients with Obsessive-Compulsive Disorder and Their 
Unaffected Relatives, pp. 421-2, Copyright (2008), with permission from American Association for the Advancement of Science. 


patients have shown an increased neural response to errors in dACC 
[10, 29], rACC [29], and lateral frontal cortex, including the OFC 
and DLPFC [14, 29]. Fitzgerald et al. [12] found hyperactivity in the 
vmPFC in a small group of patients with OCD, a finding that was 
replicated in a larger study [15], which also identified hyperactivity 
of the anterior insula (Fig. 97.1a). Despite some variation among the 
studies, overall these data suggest that OCD patients respond more 
strongly to errors than healthy individuals, particularly in ventral 
frontal regions potentially involved in processing the value or emo- 
tional importance of the error. However, it is important to note that 
these studies examine OCD patients’ responses to actual errors (and 
conflict), whereas the phenotype of the disorder is more consistent 
with the detection of errors (or conflict) where there are none (or 
at least where their presence is uncertain). Thus, while hyperactive 
error responses in OCD may reflect an important characteristic of 
the disorder related to sensitivity to mistakes, these studies do not 
directly probe the neural mechanisms associated with the feeling 
that something is wrong, even in the absence of overt errors. ERP 
studies that have attempted to address this issue by examining the 
‘correct related negativity’ (CRN)—a small negative deflection that is 
the analogue to the ERN on correct trials—have been somewhat in- 
consistent as to whether OCD patients show an enhanced CRN [28]. 





Response inhibition/motor output suppression 


It has been suggested that compulsions in OCD may result from 
an inability to engage mechanisms of response inhibition [30]. 
Response inhibition is commonly studied using a go/no-go task (or 





a variant thereof such as the stop-signal task) where subjects make 
button-press responses to frequent stimuli (‘go trials) and are re- 
quired to inhibit responses to infrequently presented stimuli (‘no-go 
trials). Even though this task inevitably involves conflict monitoring 
between the frequent ‘go’ and infrequent ‘no-go’ trials, these para- 
digms additionally involve a specific motor suppression component 
not present in conflict studies. Accordingly, while ‘no-go trials are 
associated with activation of some of the same regions as for conflict 
monitoring, including the (ACC and SMA/pre-SMA [31-34], they 
also elicit activation in subcortical structures, including the thalamus 
and basal ganglia, as well as predominantly right hemisphere lateral 
frontal and inferior parietal regions [31-35]. In a meta-analysis, the 
right inferior frontal gyrus (IFG) has been most consistently asso- 
ciated with ‘no-go’ trials [32], which has been supported by lesion 
and brain stimulation studies showing impaired response inhibition 
after inactivation of this region [36, 37]. 

Studies comparing OCD patients and controls have found re- 
duced activity in the SMA, ACC, right IFG, inferior parietal cortex, 
striatum, and thalamus in OCD patients during correct no-go trials 
(that is, successful response inhibition) [17, 38-41], which would 
suggest reduced recruitment of the response suppression network, 
even during inhibition success. However, OCD hyperactivity in the 
caudate nucleus and thalamus [29, 41], as well as the medial and 
lateral frontal regions, insula, premotor cortex, middle temporal 
cortex, posterior cingulate cortex, and cerebellum [17, 38-41], has 
also been reported for successful inhibition, which has been inter- 
preted as compensatory activation [17]. Given the variability of find- 
ings, it is not clear whether abnormal hypoactivity or hyperactivity 
of a response inhibition network is a core feature of OCD. These 


equivocal results may stem partially from the focus on neural dif- 
ferences during successful inhibitions (as opposed to inhibition 
failures, which would be most relevant for the disorder). Further re- 
search linking brain function to behaviour will be needed to address 
this issue. 





Task switching and reversal 


Another approach to investigating the basic mechanisms of OCD 
has focused on how patients switch attention between two or more 
different tasks, stimuli, or rewards. Rather than hypothesizing an 
overactive conflict or error signal or a failure to suppress motor 
output, these studies hypothesize that OCD patients exhibit an in- 
flexibility that prevents them from shifting attention away from 
stimuli or features that are no longer relevant to the task at hand. For 
an in-depth discussion of behavioural measures of cognitive inflex- 
ibility in OCD, the reader is referred to a review article by Gruner 
and Pittenger [42]. 

Many studies of switching have investigated neural activity as- 
sociated with shifting attention between features or dimensions of 
stimuli (‘cognitive switching tasks). Not surprisingly, in a meta- 
analysis examining brain regions involved in cognitive switching 
and motor suppression, overlap between these processes were found 
in the dACC/SMA, IFG, DLPFC, and inferior parietal cortex [32]. 
However, fronto-parietal activations were more widespread and bi- 
lateral for cognitive switching [32, 35], appearing very similar to a 
frontal-parietal network (FPN) often described as being involved 
in executive functions and task control [43, 44]. Studies looking at 
brain activity in OCD during cognitive switching have found re- 
duced activation in patients, compared to controls, in FPN as well 
as in the OFC, caudate nucleus, temporal cortex, and medial parietal 
regions [17, 45, 46]. Two studies reported widespread reductions 
across the cortex and basal ganglia [45, 46], whereas two reported 
hypoactivations localized to only a few regions of the frontal and 
parietal cortices [17]. In addition to hypoactivation of the anterior 
prefrontal cortex extending into the lateral OFC, a recent study also 
found hyperactivation of the dACC and putamen and post-central 
gyrus activity during task switching in OCD [47]. 

Another type of switching that has been of interest in OCD in- 
volves reversing stimulus-reward contingencies (sometimes re- 
ferred to as ‘affective’ switching). In reversal tasks, the reward and 
punishment value of two stimuli switch, so that the currently re- 
warded stimulus is punished and the previously punished stimulus 
is rewarded. There is some evidence to suggest that reversal of 
reward-punishment contingencies relies primarily on the OFC, 
rather than the lateral prefrontal regions involved in cognitive 
switching [35, 48-50]. Studies of reversal in OCD have examined 
neural activation on trials where subjects made a reversal error that 
lead to a successful switch, compared with errors that did not lead 
to a successful switch (thereby isolating activity associated with the 
moment subjects learn that a switch is required) [51-53], or during 
the correct choice after a successful reversal [54]. For these com- 
parisons, OCD patients exhibit reduced activation in the OFC, but 
similar to results from cognitive switching studies, reductions have 
also been found throughout the FPN, including the DLPFC, bi- 
lateral insula, lateral parietal cortex, and putamen [51-54]. In one 
study, unaffected relatives of OCD also showed reduced activity in 
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the lateral OFC, DLPFC, and lateral parietal cortex during reversal 
[54] (Fig. 97.1b), suggesting that impaired frontal recruitment 
during reversal may be an endophenotype of the disorder. With a 
slightly different approach using a fear conditioning reversal task, 
OCD patients showed reduced medial orbitofrontal activity, when 
compared with controls, when viewing a stimulus that was previ- 
ously associated with shock but was now safe [55]. 

Overall, these data generally support the notion that OCD patients 
show hypoactivation in a variety of cortical regions, both during task 
switching and when reversing reward—punishment contingencies. 
Despite the dissociation of lateral and orbital frontal involvement in 
these processes, dysfunctional brain activity in OCD does not ap- 
pear to be localized to one of these systems. Reduced recruitment 
of both the DLPFC and OFC has been found when patients switch 
cognitive set, as well as when they reverse reward contingencies. 
Similar to the concerns discussed for the other approaches, inter- 
pretation of results from switching studies is complicated by the 
fact that neural differences between OCD patients and controls are 
examined during successful switches (either at the time of the cor- 
rect switch response or at the time of the error right before a correct 
switch), rather than for unsuccessful switches. Future work focusing 
on brain activity during switch failures would further elucidate the 
neural mechanisms of cognitive inflexibility in OCD. 


OCD has been characterized as a disorder associated with impaired 
decision-making [56, 57]. Clinically, OCD often manifests as risk 
aversion and intolerance of uncertainty [58, 59], and experimentally, 
OCD patients show reduced risk-taking [60] (but see [61]) and ex- 
cessive information gathering in the face of uncertainty [62-65]. In 
healthy individuals, risk and uncertainty are associated with acti- 
vation of the FPN [66, 67], with effects most consistently found in 
the anterior insula and dorsal MFC [67-72]. In a study where OCD 
patients played an interactive game where they could make risky or 
safe choices and anticipate outcomes of their choices, patients made 
fewer risky choices than controls and showed greater amygdala ac- 
tivation when anticipating outcomes of their risky choices [60]. By 
contrast, a recent study found that OCD patients did not show any 
differences from controls in risky decision-making both behaviour- 
ally and in the brain, although patients did show a stronger correl- 
ation between individual level of risk aversion and activation of the 
insula, DLPFC, and precentral/post-central gyri when making risky 
choices [61]. In an examination of uncertainty and checking, Rotge 
et al. [73] found that OCD patients showed reduced mid-OFC ac- 
tivity, compared to controls, for decisions that lead to subsequent 
checking behaviour. In another task investigating uncertainty, pa- 
tients were required to make decisions based on evidence that was 
either uncertain (that is, the likelihoods for two different outcomes 
being similar) or certain (for example, the likelihood for one out- 
come being 100% and for the other being 0%) [74]. In this study, 
OCD patients reported more subjective uncertainty than controls, 
and showed increased activation of the amygdala, parahippocampus 
and hippocampus, temporal cortex, ventral anterior insula, lateral 
OFC, and vmPFC, when making decisions based on ‘certain’ (that 
is, unequivocal) evidence but showed no differences for decisions 
based on uncertain evidence. Interestingly, in the studies of Admon 
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[60] and Stern [74], hyperactivations in OCD were found in brain 
regions not typically linked to risk and uncertainty in healthy popu- 
lations such as the amygdala, parahippocampus and hippocampus, 
lateral temporal cortex, and vmPFC [69]. One could speculate 
that recruitment of these additional limbic and paralimbic regions 
when processing risk and uncertainty may reflect greater emo- 
tional involvement during decision-making in OCD or an exces- 
sive internal focus associated with ‘default mode network activity 
[75]. This hypothesis is consistent with the finding that OCD pa- 
tients hyperactivate the vmPFC and lateral temporal cortex (both 
areas of default mode network [75]), when making decisions about 
moral dilemmas [76]. Overall, these data suggest that OCD patients 
do engage different brain regions than controls when making de- 
cisions, although differences between paradigms and the relatively 
few studies investigating these behaviours in OCD suggest the need 
for additional study. 


Reward processing 


It has been suggested that the difficulty exhibited by OCD patients 
in terminating inappropriate responses is related to a reduced signal 
of goal attainment or satiety [77]. Within this framework, OCD pa- 
tients continue to engage in compulsive behaviours, suchas checking, 
washing, or repeating/ordering, because the normal ‘reward’ signal 
associated with successfully completing these tasks is not attained. 
In healthy individuals, rewards elicit activation in a network of 
brain regions, including the ventral striatum, thalamus, putamen, 
hippocampus, anterior insula, MFC, and parietal cortex [78]. Prior 
studies in OCD have found reduced activation in patients in some 
of these regions during reward feedback or anticipation, including 
the vmPFC in a reversal paradigm [52, 53] and the putamen in a 
spatial reward learning task [79]. However, using a version of the 
monetary incentive delay (MID) task [80], Jung et al. [81] reported 
increased activity in cortical and subcortical regions, including the 
putamen and dorsal MFC, to monetary reward in OCD patients, 
with no regions showing reduced activity, compared to controls. In 
another study also using a version of the MID, OCD patients showed 
reduced dorsal MFC activation, compared to controls, during re- 
ward feedback, as well as reduced activation in the ventral striatum 
when anticipating an upcoming trial that could potentially provide 
reward [82]. This latter finding contrasts with three other studies 
showing no difference in striatal activation between OCD patients 
and controls during anticipation of trials where reward was possible 
[81, 83, 84]. Overall, it is not yet clear whether dysfunctional brain 
responses to reward contribute to OCD, although impaired recruit- 
ment of the striatum during reward processing has not been con- 
sistently implicated in the disorder. Future work may benefit from 
examining reward processing from the standpoint of goal attain- 
ment or task completion, which may be particularly relevant for the 
symptoms of the disorder. 


Neural correlates of OCD subtypes 


As can be seen, results from fMRI studies are variable, with many 
revealing abnormalities of widespread fronto-parietal and fronto- 
striatal networks and others identifying more circumscribed 


differences between patients and controls. Given the heterogen- 
eity of the disorder, it is possible that the discrepant findings may 
be attributed to differing neural mechanisms mediating the various 
symptoms of OCD. There are several ways that OCD can be segre- 
gated into symptom clusters, including by distinguishing patients 
with sensory phenomena from those without [85] or by focusing 
on severity of harm avoidance symptoms in OCD [86]. While these 
approaches are critical to the future understanding of how dimen- 
sional (and fundamentally trans-diagnostic) symptoms manifest in 
the brain not only in OCD, but also in other psychiatric disorders 
(that is, sensory phenomena are prevalent not only in OCD but 
also in tic disorders; harm avoidance is a feature not only of OCD, 
but also of other anxiety disorders), the majority of work to date on 
OCD heterogeneity has focused on investigating subtypes that have 
been identified through factor analyses of Yale-Brown Obsessive- 
Compulsive Scale (Y-BOCS) responses [87-89]. Although there are 
slight variations, typically these studies identify 3-5 dissociable fac- 
tors of contamination/washing, aggressive/checking, symmetry/or- 
dering, sexual/religious, and hoarding symptoms. This section will 
review the literature on fMRI studies that examine the neural sub- 
strates of these different symptom subtypes. 


Contamination/washing 


OCD patients with contamination/washing symptoms are heavily 
concerned about cleanliness and disease. Washing compulsions 
have been associated with activation in several prefrontal regions, 
such as the OFC, ACC, and DLPFC, as well as the insula, during a 
symptom provocation [90] and continuous performance task [91]. 
Greater insular activation has also been associated with washers’ 
sensitivity to disgusting stimuli in general, including pictures and 
facial expressions of disgust [92, 93]. A recent study [94] found in- 
creased functional connectivity between the insula and the ventral 
striatum also during the provocation of contamination symptoms. 
The findings of insula hyperactivation are particularly interesting 
because the insula is one of the primary regions involved in pro- 
cessing disgusting stimuli [90, 95, 96], potentially due to its role in 
interoception [97]. Of importance, one study looking at the neural 
substrates of washing symptoms in a medication and comorbidity- 
free sample of OCD patients found decreased activation in the an- 
terior prefrontal cortex, DLPFC, OFC, ACC, insula, parietal cortex, 
and caudate [98]. Although the same brain regions were implicated, 
the activity was in the opposite direction of that in previous studies 
on washers. Although these results suggest that some of the prior 
findings of hyperactivation could be related to confounding factors, 
the consistency of the insula hyperactivation finding in washers and 
the link between disgust processing and insula activity suggest a pri- 
mary role for the insula in this OCD subtype. 


Aggressive/checking, sexual/religious, and 





symmetry/ordering 


The aggressive/checking, sexual/religious, and symmetry/ordering 
subtypes of OCD have not been widely studied, and the results 
paint an incomplete picture about the neural substrates of these 
subtypes. In a symptom provocation study, patients with checking 


compulsions showed greater activation than healthy controls in 
various brainstem nuclei, globus pallidus, and right thalamus, as 
well as in the dACC, precentral gyrus, and superior frontal gyrus 
[90]. In a study looking at blood flow during a continuous perform- 
ance task using PET [91], there was a positive correlation between 
severity of aggressive/checking and sexual/religious symptoms and 
blood flow in bilateral striatum, including areas of the caudate nu- 
cleus, putamen, and pallidum. Many of these regions are associated 
with sensorimotor functions that could possibly mediate some of 
the motor components of checking behaviour. Greater severity of 
aggressive obsessions has been correlated with increased resting- 
state functional connectivity between the ventral caudate and the 
vmPFC and decreased connectivity between the ventral caudate and 
the amygdala [99], and amygdala activity has also been correlated 
with the severity of both aggressive/checking and sexual/religious 
symptoms in OCD patients when viewing fearful faces [100]. Given 
the amygdala’ role in fear processing [101], these data suggest that 
these subtypes may be associated with altered amygdala-based fear 
reactivity related to concerns about causing harm and having what 
are deemed to be morally unacceptable thoughts related to sexual or 
religious themes. Interestingly, patients with more severe sexual/re- 
ligious obsessions also demonstrated relatively greater connectivity 
between the ventral caudate and the insula [99]. The involvement 
of the insula in the sexual/religious subtype could reflect a type of 
‘moral disgust’ (which also activates the insula [102]) that is char- 
acteristic of patients with these types of obsessions. Few studies 
have reported relationships between brain function and symmetry/ 
ordering symptoms, although the PET study using the continuous 
performance task described above did find that the severity of 
symmetry/ordering symptoms was negatively correlated with right 
caudate nucleus activity [91]. 


Hoarding is characterized by excessive collecting, or being unable 
to discard, items, even though they might not have objective value. 
Hoarders have emotional attachment to their possessions and face 
great difficulty in deciding what to keep or throw out. Although 
there is debate on whether hoarding is a subtype of OCD or is better 
characterized as its own distinct disorder [103], several studies have 
investigated hoarding symptoms within the context of OCD. One 
PET study found diminished glucose metabolism in the posterior 
cingulate cortex (PCC), occipital cortex, and ACC in hoarders, 
compared to non-hoarders or a control group [104]. Relatedly, anx- 
iety evoked during the viewing of hoarding-relevant pictures was 
associated with reduced activity in the dACC, parieto-occipital re- 
gions, basal ganglia, and temporal cortex in a group of OCD patients 
including both hoarders and non-hoarders [105]. However, other 
work has shown that hoarding severity in OCD patients correlates 
positively with activation and connectivity in the vmPFC and OFC, 
as well as in the precentral and post-central gyri, which may point to 
a sensorimotor component of the hoarding phenotype [90, 99, 105]. 
The vmPFC has been implicated in compulsive hoarding in animal 
and brain lesion studies [106-108] and is also considered to be cru- 
cial for decision-making [109], suggesting that hyperactivation in 
this region may underlie the difficulty in making decisions regarding 
which objects to collect and discard. 
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Conclusions 


The summary provides a selective review of neuroimaging correl- 
ates of psychological processes that could potentially underlie the 
complex symptom presentation of patients with OCD. The most 
consistent findings were hyperactivation in response to errors and 
hypoactivation during switching tasks, with both effects occurring 
predominantly in the prefrontal cortex, but with findings also noted 
in other areas, including the striatum, parietal cortex, and temporal 
cortex. Given the complexity of the brain in general, and OCD in 
particular, no one task paradigm is likely to fully explain the dis- 
order, and it is clear that the task chosen is absolutely critical for 
interpreting findings. Indeed, other approaches investigating fear 
extinction [1], habit formation [2, 3, 110], emotional face and pic- 
ture processing [93, 100, 111-115], planning [116, 117], and working 
memory [118-122] have identified neural alterations in OCD that 
may also contribute to the phenomenology of the disorder. Some 
of the variability in findings may be related to the heterogeneity of 
symptoms in OCD, with some studies identifying distinctions in 
limbic and sensorimotor regions based on subtype. Taken together, 
the many paradigms used in cognitive neuroscientific research 
provide powerful probes that have already shed some light on the 
underlying mechanisms of OCD. Future work would benefit from 
combining the strengths from these many approaches by further 
interrogating core processes and symptom dimensions in OCD in 
order to design novel treatments targeting symptom-specific neural 
markers. 


REFERENCES 


1. Milad MR, Furtak SC, Greenberg JL, et al. Deficits in conditioned 
fear extinction in obsessive-compulsive disorder and neurobio- 
logical changes in the fear circuit. JAMA Psychiatry. 2013;70:608- 
18; quiz 554. 

2. Gillan CM, Apergis-Schoute AM, Morein-Zamir S, et al. 
Functional neuroimaging of avoidance habits in obsessive- 
compulsive disorder. Am J Psychiatry. 2015;172:284-93. 

3. Rauch SL, Wedig MM, Wright CI, et al. Functional magnetic 
resonance imaging study of regional brain activation during im- 
plicit sequence learning in obsessive-compulsive disorder. Biol 
Psychiatry. 2007;61:330-6. 

4. Pitman RK. A cybernetic model of obsessive-compulsive psycho- 
pathology. Compr Psychiatry. 1987;28:334-43. 

5. MacLeod CM. Half a century of research on the Stroop effect: an 
integrative review. Psychol Bull. 1991;109:163-203. 

6. Botvinick MM, Braver TS, Barch DM, Carter CS, Cohen JD. Conflict 
monitoring and cognitive control. Psychol Rev. 2001;108:624-52. 

7. Garavan H, Ross TJ, Kaufman J, Stein EA. A midline dissociation 
between error-processing and response-conflict monitoring. 
Neuroimage. 2003;20:1132-9. 

8. Hester R, Fassbender C, Garavan H. Individual differences 
in error processing: a review and reanalysis of three event- 
related fMRI studies using the GO/NOGO task. Cereb Cortex. 
2004;14:986-94. 

9. Ridderinkhof KR, Ullsperger M, Crone EA, Nieuwenhuis S. The 
role of the medial frontal cortex in cognitive control. Science. 
2004;306:443-7. 

10. Ursu S, Stenger VA, Shear MK, Jones MR, Carter CS. Overactive 
action monitoring in obsessive-compulsive disorder: evidence 





1008 





AE 


We 


13% 


14. 


15: 


16. 


We 


18. 


19; 


20. 


Alls 


22! 


23: 


24. 


25: 


26. 


27: 


28. 


SECTION 15 Obsessive-compulsive and related disorders 


from functional magnetic resonance imaging. Psychol Sci. 
2003;14:347-53. 

Yucel M, Harrison BJ, Wood SJ, et al. Functional and biochemical 
alterations of the medial frontal cortex in obsessive-compulsive 
disorder. Arch Gen Psychiatry. 2007;64:946-55. 

Fitzgerald KD, Welsh RC, Gehring WJ, et al. Error-related hyper- 
activity of the anterior cingulate cortex in obsessive-compulsive 
disorder. Biol Psychiatry. 2005;57:287-94. 

Nakao T, Nakagawa A, Yoshiura T, et al. A functional MRI 
comparison of patients with obsessive-compulsive disorder 

and normal controls during a Chinese character Stroop task. 
Psychiatry Res. 2005;139:101-14. 

Hough CM, Luks TL, Lai K, et al. Comparison of brain activation 
patterns during executive function tasks in hoarding disorder 
and non-hoarding OCD. Psychiatry Res. 2016;255:50-9. 

Stern ER, Welsh RC, Fitzgerald KD, et al. Hyperactive error 
responses and altered connectivity in ventromedial and 
frontoinsular cortices in obsessive-compulsive disorder. Biol 
Psychiatry. 2011;69:583-91. 

Viard A, Flament ME, Artiges E, et al. Cognitive control in 
childhood-onset obsessive-compulsive disorder: a functional 
MRI study. Psychol Med. 2005;35:1007-17. 

Page LA, Rubia K, Deeley Q, et al. A functional magnetic reson- 
ance imaging study of inhibitory control in obsessive-compulsive 
disorder. Psychiatry Res. 2009;174:202-9. 

van den Heuvel OA, Veltman DJ, Groenewegen HJ, et al. 
Disorder-specific neuroanatomical correlates of attentional bias 
in obsessive-compulsive disorder, panic disorder, and hypochon- 
driasis. Arch Gen Psychiatry. 2005;62:922-33. 

Marsh R, Horga G, Parashar N, Wang Z, Peterson BS, Simpson 
HB. Altered activation in fronto-striatal circuits during sequen- 
tial processing of conflict in unmedicated adults with obsessive- 
compulsive disorder. Biol Psychiatry. 2014;75:615-22. 

Harrison BJ, Yucel M, Shaw M, et al. Evaluating brain activity 

in obsessive-compulsive disorder: preliminary insights from a 
multivariate analysis. Psychiatry Res. 2006;147:227-31. 
Nabeyama M, Nakagawa A, Yoshiura T, et al. Functional MRI 
study of brain activation alterations in patients with obsessive- 
compulsive disorder after symptom improvement. Psychiatry Res. 
2008; 163:236-47. 

Taylor SE, Stern ER, Gehring WJ. Neural systems for error 
monitoring: Recent findings and theoretical perspectives. 
Neuroscientist. 2007;13:160-72. 

Harrison NA, Gray MA, Gianaros PJ, Critchley HD. The 
embodiment of emotional feelings in the brain. J Neurosci. 
2010;30:12878-84. 

Kober H, Barrett LF, Joseph J, Bliss-Moreau E, Lindquist K, 
Wager TD. Functional grouping and cortical-subcortical inter- 
actions in emotion: a meta-analysis of neuroimaging studies. 
Neuroimage. 2008;42:998-1031. 

Kringelbach ML, Rolls ET. The functional neuroanatomy of the 
human orbitofrontal cortex: evidence from neuroimaging and 
neuropsychology. Prog Neurobiol. 2004;72:341-72. 

Lebreton M, Jorge S, Michel V, Thirion B, Pessiglione M. An 
automatic valuation system in the human brain: evidence from 
functional neuroimaging. Neuron. 2009;64:431-9. 

Taylor SF, Martis B, Fitzgerald KD, et al. Medial frontal cortex 
activity and loss-related responses to errors. J Neurosci. 
2006;26:4063-70. 

Endrass T, Ullsperger M. Specificity of performance monitoring 


changes in obsessive-compulsive disorder. Neurosci Biobehav Rev. 


2014;46 Pt 1:124-38. 


297 


30. 


ail. 


32: 


33h 


34. 


357 


36. 


oie 


38. 


oy), 


40. 


41. 


42. 


43. 


44. 


45. 


46. 


47. 


Maltby N, Tolin DF, Worhunsky P, O’Keefe TM, Kiehl KA. 
Dysfunctional action monitoring hyperactivates frontal-striatal 
circuits in obsessive-compulsive disorder: an event-related fMRI 
study. Neuroimage. 2005;24:495-503. 

Chamberlain SR, Menzies L. Endophenotypes of obsessive- 
compulsive disorder: rationale, evidence and future potential. 
Expert Rev Neurother. 2009;9:1133-46. 

Aron AR, Poldrack RA. Cortical and subcortical contributions to 
stop signal response inhibition: role of the subthalamic nucleus. J 
Neurosci. 2006;26:2424-33. 

Buchsbaum BR, Greer S, Chang WL, Berman KF. Meta-analysis 
of neuroimaging studies of the Wisconsin card-sorting task and 
component processes. Hum Brain Mapp. 2005;25:35-45. 
Garavan H, Hester R, Murphy K, Fassbender C, Kelly C. 
Individual differences in the functional neuroanatomy of inhibi- 
tory control. Brain Res. 2006;1105:130-42. 

Garavan H, Ross TJ, Stein EA. Right hemispheric dominance of 
inhibitory control: An event-related functional MRI study. Proc 
Natl Acad Sci US A. 1999;96:8301-6. 

Robbins TW. Shifting and stopping: fronto-striatal substrates, 
neurochemical modulation and clinical implications. Philos 
Trans R Soc Lond B Biol Sci. 2007;362:917-32. 

Aron AR, Fletcher PC, Bullmore ET, Sahakian BJ, Robbins TW. 
Stop-signal inhibition disrupted by damage to right inferior 
frontal gyrus in humans. Nat Neurosci. 2003;6:115-16. 
Chambers CD, Bellgrove MA, Stokes MG, et al. Executive ‘brake 
failure’ following deactivation of human frontal lobe. J Cognitive 
Neurosci. 2006;18:444-55. 

Berlin HA, Schulz KP, Zhang S, Turetzky R, Rosenthal D, 
Goodman W. Neural correlates of emotional response inhibition 
in obsessive-compulsive disorder: a preliminary study. Psychiatry 
Res. 20153;234:259-64. 

de Wit SJ, de Vries FE, van der Werf YD, et al. Presupplementary 
motor area hyperactivity during response inhibition: a candi- 
date endophenotype of obsessive-compulsive disorder. Am J 
Psychiatry. 2012;169:1100-8. 

Kang DH, Jang JH, Han JY, et al. Neural correlates of altered 
response inhibition and dysfunctional connectivity at rest in 
obsessive-compulsive disorder. Prog Neuropsychopharmacol Biol 
Psychiatry. 2013;40:340-6. 

Roth RM, Saykin AJ, Flashman LA, Pixley HS, West JD, 
Mamourian AC. Event-related functional magnetic resonance 
imaging of response inhibition in obsessive-compulsive disorder. 
Biol Psychiatry. 2007;62:901-9. 

Gruner P, Pittenger C. Cognitive inflexibility in obsessive- 
compulsive disorder. Neuroscience. 2017;345:243-55. 

Bressler SL, Menon V. Large-scale brain networks in cog- 

nition: emerging methods and principles. Trends Cogn Sci. 
20103;14:277-90. 

Power JD, Cohen AL, Nelson SM, et al. Functional Network 
Organization of the Human Brain. Neuron. 2011;72:665-78. 

Gu BM, Park JY, Kang DH, et al. Neural correlates of cognitive 
inflexibility during task-switching in obsessive-compulsive dis- 
order. Brain. 2008;131:155-64. 

Han JY, Kang DH, Gu BM, et al. Altered Brain Activation 

in Ventral Frontal-Striatal Regions Following a 16-week 
Pharmacotherapy in Unmedicated Obsessive-Compulsive 
Disorder. J Korean Med Sci. 2011;26:665-74. 

Remijnse PL, van den Heuvel OA, Nielen MM, et al. Cognitive 
inflexibility in obsessive-compulsive disorder and major de- 
pression is associated with distinct neural correlates. PLoS One. 
2013;8:e59600. 


48. 


49. 


50. 


Dl. 


Sy, 


53: 


54. 


557 


56. 


SV) 


58. 


By), 


60. 


6l. 


62. 


63. 


64. 


65. 


66. 


67. 


68. 


Dias R, Robbins TW, Roberts AC. Dissociation in prefrontal 
cortex of affective and attentional shifts. Nature. 1996;380:69-72. 
Fellows LK, Farah MJ. Ventromedial frontal cortex mediates 
affective shifting in humans: evidence from a reversal learning 
paradigm. Brain. 2003;126:1830-7. 

Hampshire A, Owen AM. Fractionating attentional control using 
event-related fMRI. Cereb Cortex. 2006;16:1679-89. 

Freyer T, Kloppel S, Tuscher O, et al. Frontostriatal activation in 
patients with obsessive-compulsive disorder before and after cog- 
nitive behavioral therapy. Psychol Med. 2011;41:207-16. 
Remijnse PL, Nielen MMA, van Balkom AJLM, et al. 

Reduced orbitofrontal-striatal activity on a reversal learning 

task in obsessive-compulsive disorder. Arch Gen Psychiatry. 
2006;63:1225-36. 

Remijnse PL, Nielen MMA, van Balkom AJLM, et al. Differential 
frontal-striatal and paralimbic activity during reversal learning 
in major depressive disorder and obsessive-compulsive disorder. 
Psychol Med. 2009;39:1503-18. 

Chamberlain SR, Menzies L, Hampshire A, et al. Orbitofrontal 
dysfunction in patients with obsessive-compulsive disorder and 
their unaffected relatives. Science. 2008;321:421-2. 
Apergis-Schoute AM, Gillan CM, Fineberg NA, Fernandez-Egea 
E, Sahakian BJ, Robbins TW. Neural basis of impaired safety 
signaling in Obsessive Compulsive Disorder. Proc Natl Acad Sci 
USA. 2017;114:3216-21. 

Cavedini P, Gorini A, Bellodi L. Understanding obsessive- 
compulsive disorder: focus on decision making. Neuropsychol 
Rev. 2006;16:3-15. 

Sachdev PS, Malhi GS. Obsessive-compulsive behaviour: a 
disorder of decision-making. Aust N Z J Psychiatry. 2005;39: 
757-63. 

Steketee G, Frost RO. Measurement of Risk-Taking in Obsessive- 
Compulsive Disorder. Behav Cogn Psychother. 1994;22:287-98. 
Tolin DF, Abramowitz JS, Brigidi BD, Foa EB. Intolerance of 
uncertainty in obsessive-compulsive disorder. J Anxiety Disord. 
2003; 17:233-42. 

Admon R, Bleich-Cohen M, Weizmant R, Poyurovsky M, 
Faragian S, Hendler T. Functional and structural neural indices of 
risk aversion in obsessive-compulsive disorder (OCD). Psychiatry 
Res. 2012;203:207-13. 

Luigjes J, Figee M, Tobler PN, et al. Doubt in the Insula: Risk 
Processing in Obsessive-Compulsive Disorder. Front Hum 
Neurosci. 2016;10:283. 

Fear CF, Healy D. Probabilistic reasoning in obsessive-compulsive 
and delusional disorders. Psychol Med. 1997;27:199-208. 

Volans PJ. Styles of decision-making and probability appraisal in 
selected obsessional and phobic patients. Br J Soc Clin Psychol. 
1976;15:305-17. 

Foa EB, Mathews A, Abramowitz JS, et al. Do patients with 
obsessive-compulsive disorder have deficits in decision-making? 
Cogn Ther Res. 2003;27:431-45. 

Milner AD, Beech HR, Walker VJ. Decision processes and obses- 
sional behavior. Br J Soc Clin Psychol. 1971;10:88-9. 

Huettel SA. Behavioral, but not reward, risk modulates activation 
of prefrontal, parietal, and insular cortices. Cogn Affect Behav 
Neurosci. 2006;6:141-51 

Krain AL, Hefton S, Pine DS, et al. An fMRI examination of devel- 
opmental differences in the neural correlates of uncertainty and 
decision-making. J Child Psychol Psychiatry. 2006;47:1023-30. 
Grinband J, Hirsch J, Ferrera VP. A neural representation 

of categorization uncertainty in the human brain. Neuron. 
2006;49:757-63 


69. 


70. 


zl 


Wide 


Wah 


74. 


WS 


76. 


Vile 


78. 


79; 


80. 


81. 


82. 


83. 


84. 


85. 


86. 


87. 


88. 


89. 


CHAPTER 97 Imaging of obsessive-compulsive disorder 


Mohr PN, Biele G, Heekeren HR. Neural processing of risk. J 
Neurosci. 2010;30:6613-19. 

Preuschoff K, Quartz SR, Bossaerts P. Human insula activa- 

tion reflects risk prediction errors as well as risk. J Neurosci. 
20083;28:2745-52. 

Stern ER, Gonzalez R, Welsh RC, Taylor SF. Medial frontal cortex and 
anterior insula are less sensitive to outcome predictability when mon- 
etary stakes are higher. Soc Cogn Affect Neurosci. 2014;9:1625-31. 
Volz KG, Schubotz RI, von Cramon DY. Predicting events 

of varying probability: uncertainty investigated by fMRI. 
Neuroimage. 2003;19(2 Pt 1):271-80. 

Rotge JY, Langbour N, Dilharreguy B, et al. Contextual and be- 
havioral influences on uncertainty in obsessive-compulsive dis- 
order. Cortex. 2015;62:1-10. 

Stern ER, Welsh RC, Gonzalez R, Fitzgerald KD, Abelson JL, Taylor 
SE, Subjective uncertainty and limbic hyperactivation in obsessive- 
compulsive disorder. Hum Brain Mapp. 2013;34:1956-70. 
Andrews-Hanna JR. The brain’s default network and its adaptive 
role in internal mentation. Neuroscientist. 2012;18:251-70. 
Harrison BJ, Pujol J, Soriano-Mas C, et al. Neural correlates of 
moral sensitivity in obsessive-compulsive disorder. Arch Gen 
Psychiatry. 2012;69:741-9. 

Szechtman H, Woody E. Obsessive-compulsive disorder as a dis- 
turbance of security motivation. Psychol Rev. 2004;111:111-27. 
Liu X, Hairston J, Schrier M, Fan J. Common and distinct net- 
works underlying reward valence and processing stages: A meta- 
analysis of functional neuroimaging studies. Neurosci Biobehav 
Rev. 2011;35:1219-36. 

Marsh R, Tau GZ, Wang Z, et al. Reward-based spatial learning 
in unmedicated adults with obsessive-compulsive disorder. Am J 
Psychiatry. 2015;172:383-92. 

Knutson B, Fong GW, Bennett SM, Adams CM, Hommer D. A 
region of mesial prefrontal cortex tracks monetarily rewarding 
outcomes: characterization with rapid event-related fMRI. 
Neuroimage. 2003;18:263-72. 

Jung WH, Kang DH, Han JY, et al. Aberrant ventral striatal responses 
during incentive processing in unmedicated patients with obsessive- 
compulsive disorder. Acta Psychiatr Scand. 2011;123:376-86. 

Figee M, Vink M, de Geus F, et al. Dysfunctional Reward 
Circuitry in Obsessive-Compulsive Disorder. Biol Psychiatry. 
2011;69:867-74. 

Choi JS, Shin YC, Jung WH, et al. Altered brain activity during 
reward anticipation in pathological gambling and obsessive- 
compulsive disorder. PLoS One. 2012;7:e45938. 

Kaufmann C, Beucke JC, Preusse F, et al. Medial prefrontal brain 
activation to anticipated reward and loss in obsessive-compulsive 
disorder. Neuroimage Clin. 2013;2:212-20. 

Subira M, Sato JR, Alonso P, et al. Brain structural correlates of 
sensory phenomena in patients with obsessive-compulsive dis- 
order. J Psychiatry Neurosci. 2015;40:232-40. 

Ecker W, Gonner S. Incompleteness and harm avoidance in OCD 
symptom dimensions. Behav Res Ther. 2008;46:895-904. 

Baer L. Factor analysis of symptom subtypes of obsessive com- 
pulsive disorder and their relation to personality and tic dis- 
orders. J Clin Psychiatry. 1994;55 Suppl:18-23. 

Leckman JE, Grice DE, Boardman J, et al. Symptoms of obsessive- 
compulsive disorder. Am J Psychiatry. 1997;154:911-17. 
Mataix-Cols D, Rauch SL, Manzo PA, Jenike MA, Baer L. 

Use of factor-analyzed symptom dimensions to predict out- 
come with serotonin reuptake inhibitors and placebo in the 
treatment of obsessive-compulsive disorder. Am J Psychiatry. 
1999;156:1409-16. 








SECTION 15 Obsessive-compulsive and related disorders 


90. 


91. 


92. 


93. 


94. 


95: 


96. 


97. 


98. 


99% 


100. 


101. 


102. 


103. 


104. 


105. 


106. 


107. 


Mataix-Cols D, Wooderson S, Lawrence N, Brammer MJ, 
Speckens A, Phillips ML. Distinct neural correlates of washing, 
checking, and hoarding symptom dimensions in obsessive-com- 
pulsive disorder. Arch Gen Psychiatry. 2004;61:564-76. 

Rauch SL, Dougherty D, Shin LM, et al. Neural correlates of 
factor-analyzed OCD symptom dimensions: a PET study. CNS 
Spectrums. 1998;1;3(:37-43. 

Shapira NA, Liu Y, He AG, et al. Brain activation by disgust- 
inducing pictures in obsessive-compulsive disorder. Biol 
Psychiatry. 2003;54:751-6. 

Lawrence NS, An SK, Mataix-Cols D, Ruths F, Speckens A, 
Phillips ML. Neural responses to facial expressions of disgust 
but not fear are modulated by washing symptoms in OCD. Biol 
Psychiatry. 2007;61:1072-80. 

Jhung K, Ku J, Kim SJ, et al. Distinct functional connectivity 

of limbic network in the washing type obsessive-compulsive 
disorder. Prog Neuropsychopharmacol Biol Psychiatry. 
2014;53:149-55. 

Wicker B, Keysers C, Plailly J, Royet JP, Gallese V, Rizzolatti G. 
Both of us disgusted in My insula: the common neural basis of 
seeing and feeling disgust. Neuron. 2003;40:655-64. 

Olatunji BO, Cisler J, McKay D, Phillips ML. Is disgust associ- 
ated with psychopathology? Emerging research in the anxiety 
disorders. Psychiatry Res. 2010;175:1-10. 

Craig AD. How do you feel—now? The anterior insula and 
human awareness. Nat Rev Neurosci. 2009;10:59-70. 

Agarwal SM, Jose D, Baruah U, et al. Neurohemodynamic 
correlates of washing symptoms in obsessive-compulsive dis- 
order: a pilot {MRI study using Symptom provocation para- 
digm. Indian J Psychol Med. 2013;35:67-74. 

Harrison BJ, Pujol J, Cardoner N, et al. Brain corticostriatal sys- 
tems and the major clinical symptom dimensions of obsessive- 
compulsive disorder. Biol Psychiatry. 2013;73:321-8. 

Via E, Cardoner N, Pujol J, et al. Amygdala activation and 
symptom dimensions in obsessive-compulsive disorder. Br J 
Psychiatry. 2014;204:61-8. 

Davis M. The role of the amygdala in fear and anxiety. Annu Rev 
Neurosci. 1992;15:353-75. 

Schaich Borg J, Lieberman D, Kiehl KA. Infection, incest, and 
iniquity: investigating the neural correlates of disgust and mor- 
ality. J Cogn Neurosci. 2008;20:1529-46. 

Mataix-Cols D, Frost RO, Pertusa A, et al. Hoarding disorder: a 
new diagnosis for DSM-V? Depress Anxiety. 2010;27:556-72. 
Saxena S, Brody AL, Maidment KM, et al. Cerebral glucose me- 
tabolism in obsessive-compulsive hoarding. Am J Psychiatry. 
2004;161:1038-48. 

An SK, Mataix-Cols D, Lawrence NS, et al. To discard or not to 
discard: the neural basis of hoarding symptoms in obsessive- 
compulsive disorder. Mol Psychiatry. 2009;14:318-31. 

Cohen L, Angladette L, Benoit N, Pierrot-Deseilligny C. A man 
who borrowed cars. Lancet. 1999;353:34. 

Volle E, Beato R, Levy R, Dubois B. Forced collectionism after 
orbitofrontal damage. Neurology. 2002;58:488-90. 


108. 


109. 


110. 


Ie 


1122 


Ss 


114. 


IANS, 


116. 


NY, 


118. 


119. 


120. 


121. 


122% 


Blundell JE, Herberg LJ. Effectiveness of lateral hypothalamic 
stimulation, arousal, and food deprivation in the initiation of 
hoarding behaviour in naive rats. Physiol Behav. 1973;10:763-7. 
Bechara A, Tranel D, Damasio H. Characterization of the 
decision-making deficit of patients with ventromedial prefrontal 
cortex lesions. Brain. 2000;123 (Pt 11):2189-202. 

Rauch SL, Savage CR, Alpert NM, et al. Probing striatal func- 
tion in obsessive-compulsive disorder: a PET study of im- 
plicit sequence learning. J Neuropsychiatry Clin Neurosci. 
1997;9:568-73. 

Cannistraro PA, Wright CI, Wedig MM, et al. Amygdala re- 
sponses to human faces in obsessive-compulsive disorder. Biol 
Psychiatry. 2004;56:916-20. 

Cardoner N, Harrison BJ, Pujol J, et al. Enhanced brain respon- 
siveness during active emotional face processing in obsessive 
compulsive disorder. World J Biol Psychiatry. 2011;12:349-63. 
Goncalves OF, Soares JM, Carvalho S, et al. Brain activation of 
the defensive and appetitive survival systems in obsessive com- 
pulsive disorder. Brain Imaging Behav. 2015;9:255-63. 
Weygandt M, Blecker CR, Schafer A, et al. fMRI pattern rec- 
ognition in obsessive-compulsive disorder. Neuroimage. 
2012;60:1186-93. 

Simon D, Adler N, Kaufmann C, Kathmann N. Amygdala 
hyperactivation during symptom provocation in obsessive- 
compulsive disorder and its modulation by distraction. 
Neuroimage Clin. 2014;4:549-57. 

van den Heuvel OA, Veltman DJ, Groenewegen Hj, et al. 
Frontal-striatal dysfunction during planning in obsessive- 
compulsive disorder. Arch Gen Psychiatry. 2005;62:301-9. 

van den Heuvel OA, Mataix-Cols D, Zwitser G, et al. Common 
limbic and frontal-striatal disturbances in patients with obses- 
sive compulsive disorder, panic disorder and hypochondriasis. 
Psychol Med. 2011;41:2399-410. 

de Vries FE, de Wit SJ, Cath DC, et al. Compensatory 
frontoparietal activity during working memory: an 
endophenotype of obsessive-compulsive disorder. Biol 
Psychiatry. 2014;76:878-87. 

Henseler I, Gruber O, Kraft S, Krick C, Reith W, Falkai P. 
Compensatory hyperactivations as markers of latent working 
memory dysfunctions in patients with obsessive-compulsive 
disorder: an fMRI study. J Psychiatry Neurosci. 2008;33:209-15. 
Koch K, Wagner G, Schachtzabel C, et al. Aberrant anterior cin- 
gulate activation in obsessive-compulsive disorder is related to 
task complexity. Neuropsychologia. 2012;50:958-64. 

Nakao T, Nakagawa A, Nakatani E, et al. Working memory 
dysfunction in obsessive-compulsive disorder: a neuro- 
psychological and functional MRI study. J Psychiatr Res. 
2009;43:784-91. 

van der Wee NJ, Ramsey NF, van Megen HJ, Denys D, 
Westenberg HG, Kahn RS. Spatial working memory in 
obsessive-compulsive disorder improves with clinical re- 
sponse: a functional MRI study. Eur Neuropsychopharmacol. 
2007;17:16-23. 


anagement and treatment 
of obsessive-compulsive disorder 


Naomi A. Fineberg, Lynne M. Drummond, Jemma Reid, Eduardo Cinosi, 


r Carmi, and Davis N. Mpavaenda 


The earliest reports of the potential efficacy in obsessive-compulsive 
disorder (OCD) of pharmacotherapy with the tricyclic drug 
clomipramine and of behavioural forms of psychotherapy, initi- 
ated in the 1960s and 1970s and subsequently validated in a series 
of seminal studies in the 1980s, produced a dramatic change in the 
psychiatric management of this serious and previously treatment- 
refractory disorder. Since that time, treatments for OCD have been 
subject to considerable scientific scrutiny and treatment ‘standards’ 
have been developed and refined in succeeding evidence-based 
treatment guidelines [1, 2]. 

The disorder usually arises in childhood or early adulthood, 
follows a chronic course (though episodic forms do occur), and 
is associated with substantial psychiatric comorbidity, frequently 
in the form of anxiety disorders, affective disorders, and other 
obsessive-compulsive and related disorders. Whereas substantial 
improvement can be achieved in many patients following standard 
treatment with serotonin reuptake inhibitors (SRIs) [selective sero- 
tonin reuptake inhibitors (SSRIs), clomipramine] and cognitive 
behavioural therapy (CBT), for approximately 50%, the response 
is incomplete and new treatment paradigms are sought. Several 
other conventional drug treatments for depression and anxiety 
have been investigated and found not to be effective in OCD, 
though evidence has accrued that dopamine receptor antagon- 
ists may be effective for individuals with SRI-resistant OCD when 
prescribed as an adjunct to the SRI. In this context, a developing 
role for somatic treatments has also emerged, with implications for 
clinical service development [1]. 

In this chapter, we review the management and treatment of OCD, 
including rational options for those failing to respond to standard 
treatments, based, as far as possible, on randomized controlled trial 
(RCT) data. Where relevant, we have cited review papers of good 
quality to maximize efficiency. 





The introduction of methods for the standardized assessment of OCD 
has contributed substantially to the evidence-based treatment of 
the disorder. Of the several available scales, the 10-item Yale-Brown 
Obsessive Compulsive Scale (Y-BOCS) [3] emerged as the most 
widely accepted instrument for rating the magnitude of symptomatic 
change, based on reliability, acceptability, and sensitivity to change. 
Pallanti et al. [4] were some of the first to advocate the use of stand- 
ardized operational ‘responder’ criteria across OCD treatment trials. 
They proposed that a meaningful clinical response could be conser- 
vatively represented by an improvement of 25-35% in the baseline Y- 
BOGS, or a score of ‘much’ or ‘very much improved’ on the Clinical 
Global Impression of Improvement (CGI-I) Scale [5], whereas ‘remis- 
sion’ necessitated a total Y- BOCS score of less than 16 (out of a total 
score of 40). Relapse, on the other hand, was defined as a worsening 
by 25% of the remission Y-BOCS score. Other suggested relapse cri- 
teria include a worsening by 250% of post-baseline Y-BOCS scores, a 
5-point worsening of the total Y-BOCS score, a total Y-BOCS score of 
219, and CGI-I scores of ‘muck or ‘very much worse’ [6]. 

A disadvantage of so much variability in these criteria is that very 
different claims about efficacy and relapse may ensue. Hollander 
et al. [7] attempted to validate the previously empirical responder 
and relapse criteria using trial data. When the treatment response 
was defined as at least 25% improvement in the Y-BOCS score 
relative to baseline, the data showed both a statistically significant 
and a clinically relevant differentiation in social and occupational 
functioning and health-related quality of life (HR-QoL) measures 
between responders and non-responders. This suggests that a 25% 
improvement in the baseline Y-BOCS does represent a clinically rele- 
vant change equivalent to a minimal partial response. Similarly, the 
analysis validated a definition of relapse represented by a 5-point 
worsening of the remission Y-BOCS [7]. 
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Exposure and response prevention 


Prior to the mid 1960s, there were no known effective treatments 
for OCD. Therefore, the development of new effective psychological 
treatments based on the principles of exposure to the feared object 
or situation, combined with compulsive ritual prevention, has had 
a huge and lasting impact. Victor Meyer, a psychologist working at 
St Luke’s Woodside Hospital and the Middlesex Hospital Medical 
School, developed a treatment for children and adults with OCD, 
based on earlier experimental work with animals. This treatment 
involved exposing the patient to situations which would normally 
provoke compulsive rituals and then actively preventing them from 
performing these rituals [8]. As OCD had previously been con- 
sidered an intractable condition with no effective treatment, this 
was a remarkable breakthrough. Isaac Marks, a psychiatrist working 
with the psychologists Stanley Rachman and Ray Hodgson, took 
this pioneering work further by developing a treatment based on 
exposure but in which the patient themselves took responsibility 
for preventing their compulsive rituals. This treatment proved to be 
effective, to have results which persisted over at least 2 years, and 
had fewer ethical issues than previous treatments when nurses and 
other health care workers had actively prevented patients from per- 
forming rituals [9]. 

Over the years, exposure treatment has been modified so that 
what is practised today is prolonged graded exposure in real life to 
the feared situation, combined with self-imposed response preven- 
tion (ERP). In order to perform this treatment, the patient needs to 
be educated about OCD and how the temporary reduction in anx- 
iety produced by performance of compulsive rituals actually serves 
to strengthen the association between obsessive thought and com- 
pulsive ritual, thus worsening the condition. 

Compulsive rituals and other behaviours, such as repeated 
reassurance seeking, reduce anxiety; because high anxiety is 
unpleasant, this reduction acts like a reward. Thus, repeated per- 
formance of compulsions strengthens the link between obses- 
sions and compulsions, which frequently results in increasing the 
amount of time spent in rituals. In order to overcome the OCD, the 
patient needs to face up to the feared situation without performing 
the compulsions. Initially, the urge to perform the ritual will rise, 
but if the compulsions are successfully resisted, the urge will re- 
duce and habituation will occur. 

It can be too stressful for a patient to initially face their most 
threatening situation. For this reason, the patient and therapist will 
usually construct a hierarchy of fear-provoking situations for ex- 
posure, ranging from those that would invoke minor fear if faced 
without anxiety-reducing compulsions to those that would pro- 
duce panic. The patient also needs to expose themselves to the 
fear-provoking situation for long enough to enable their anxiety to 
habituate. Due to the complexity of compulsive rituals and the fact 
that patients often give themselves reassurance mentally, it will gen- 
erally take up to 2 hours until the anxiety of OCD has fully subsided. 
This is known as ‘within-session habituation. 

There are several full descriptions of the treatment given else- 
where [9]. In summary, the therapist and the patient agree a 
task expected to induce moderate anxiety if performed without 


compulsive rituals and the patient is asked to perform this task. This 
exposure may be performed by the patient on their own or together 
with a therapist or a family member or friend working with the pa- 
tient. Once it has been successfully performed, then the task should 
be repeated in the same way approximately three times a day. This 
repetition of the ERP is usually referred to as ‘ERP Homework. As 
the task is repeated without compulsive rituals, each successive per- 
formance will generally become easier and the anxiety will last for 
a shorter time. This reduction in anxiety on successive repetitions 
is known as ‘between-session habituation. Once this task has been 
successfully performed, the patient tackles more difficult items on 
the hierarchy. 

Standing the test of time, ERP treatment has been proven to be 
a successful treatment for patients with OCD. The theory behind 
it is relatively simple to apply, and it has been used effectively in a 
variety of self-help methods, including using computerized manuals 
[10]. Following on from the initial studies, evidence of the robust 
improvements produced with ERP has accumulated [11]. Recent 
studies have often assessed ERP in OCD together with other inter- 
ventions, including psychopharmacological treatments [12, 13]. 

It is important to remember that one of the most common reasons 
for ERP not being successful is that it is not prescribed or performed 
ina regular, prolonged, predictable manner. For the best results, ERP 
should be practised three times a day and for sufficient time to allow 
the anxiety to reduce consistently by at least 50%. ERP can be applied 
most usefully in the patient’s home where the symptoms are usually 
maximal [13]. 


Cognitive behavioural therapies in OCD 


In recent years, there has been increasing usage of cognitive and 
behavioural methods to treat OCD. There are, of course, a variety 
of different forms of cognitive therapy which work on a variety of 
different principles. Rational emotive therapy was developed by 
Albert Ellis in the 1960s for depressive and anxiety disorders and is 
a confrontative type of therapy where beliefs are actively challenged 
by the therapist [14]. A rationale for using rational emotive therapy 
to treat OCD rests on the observation that OCD patients do not suf- 
ficiently challenge their beliefs before acting on them by performing 
compulsions. 

Early developers of ERP theorized that OCD patients would ex- 
perience a reduction in symptoms if they felt less responsible for any 
feared disastrous or harmful outcome resulting from their failure 
to perform compulsions. In the UK, a cognitive form of treatment, 
based on the theory that an over-inflated sense of responsibility is 
key to the symptomatology of OCD, leading to increased perform- 
ance of compulsions, increased sense of threat, and depressed mood, 
thereby establishing a vicious cycle of excessive responsibility and 
guilt, was developed [15]. 

However, it was observed by others that ideas about danger were 
more highly correlated with compulsive activity than were ideas of 
responsibility, self-efficacy, perfectionism, or anticipated anxiety. 
Danger ideation reduction therapy (DIRT) is another form of psy- 
chotherapy that provides information about harm in a systematized 
way, combined with rational emotive therapy [16]. DIRT was ini- 
tially developed for OCD patients with contamination fears and 
washing rituals. More recently, it has been trialled with patients with 
compulsive-checking rituals [17]. 
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ERP or CBT? 


Despite both the superficial attractiveness of the cognitive method 
of treatment, compared with ERP, and numerous studies designed 
to demonstrate its efficacy, there is little evidence to suggest that 
adding cognitive therapy to ERP offers any distinct advantage [18- 
20]. One of the problems with adding cognitive therapy to ERP is 
that it requires more training and supervision of the therapist and 
may thus be less cost-efficient. Some studies have found that cogni- 
tive methods can beas effective as ERP, but few have shown any clear 
advantage. This is true of rational emotive behaviour therapy [21]. 
Indeed a recent study demonstrated that in patients who initially 
failed to respond to ERP, there was no value in following this with 
a course of cognitive therapy in an attempt to improve outcome. 
These workers advocated using a serotonin reuptake-inhibiting 
drug (SRI) and demonstrated that this was a more effective second- 
line treatment [22]. 

Studies examining specific cognitive interventions aimed at an 
over-inflated sense of responsibility, similarly, have generally failed 
to demonstrate an advantage over ERP. Studies comparing ERP 
and cognitive therapy found that both had equal efficacy [23] or 
failed to demonstrate any improved efficacy of CBT, compared to 
ERP, on core OCD symptoms (although Cottraux et al. did find that 
depressive symptoms may improve more with CBT) [24]. A study 
examining group treatment for OCD found that ERP produced sig- 
nificantly better results than did CBT [25]. 

A major problem with most studies involving CBT or ERP is that 
the numbers of participants are small. Many studies also include 
pharmacological treatment. Inevitably, this leads to a higher chance 
of type 2 error and difficulty demonstrating statistical differences be- 
tween treatments. It is for this reason that meta-analyses may be the 
most useful. 

An early study [26] examining the use of ERP in severe, chronic, 
refractory OCD suggested that whereas ERP should be the treat- 
ment of choice, CBT may be useful for specific difficulties. A recent 
meta-analysis of available studies similarly concluded that “The 
available research indicates that ERP is the first line evidence based 
psychotherapeutic treatment for OCD and that concurrent admin- 
istration of cognitive therapy that targets specific symptom-related 
difficulties characteristic of OCD may improve tolerance of distress, 
symptom-related dysfunctional beliefs, adherence to treatment, and 
reduce drop out’ [27]. 





Evidence-based pharmacotherapies for OCD 


Clomipramine 
Adults 


Evidence from a series of small historical studies dating back to the 
1960s [28] identified clomipramine as the first potentially effective 
treatment for OCD. Subsequent RCTs of clomipramine in adult 
OCD patient samples demonstrated efficacy in the presence of low 
mood or comorbid depression. A seminal placebo-referenced RCT 
in patients with no coexisting depression showed a significant ad- 
vantage for clomipramine using relatively low, fixed daily dosages 
of 75 mg [29]. Two large multi-centre, double-blind RCTs, com- 
prising a total of 520 patients, conclusively demonstrated significant 


placebo-referenced superiority for clomipramine in non-depressed 
adults with OCD, using flexible daily dosages of up to 300 mg [30]. 

Further OCD RCTs comparing clomipramine ‘head to head’ 
with other tricyclic drugs, including imipramine, amitriptyline, 
nortriptyline, and desipramine [28], confirmed the superiority of 
clomipramine. Clomipramine has greater SRI activity than the other 
compounds, and this property was considered relevant in its anti- 
OCD effect. 


Children and adolescents 


Similar results were obtained in clomipramine studies of children 
and adolescents with OCD [28]. Not only was clomipramine found 
to be superior to desipramine, but a significant proportion of pa- 
tients who had received clomipramine as their first active treatment 
also showed evidence of relapse when switched to desipramine, 
highlighting the efficacy of clomipramine, compared to desipramine. 


Summary 


These results suggest that clomipramine is an effective treatment for 
OCD, irrespective of the presence of comorbid depression across 
the young adult age range. However, there is a want of studies in 
the elderly population, who may be more susceptible to the adverse 
effects of tricyclic drugs. Moreover, the robustly successful results 
from these studies apply largely to patients with stable, severe, and 
untreated OCD. There were not many treatment-refractory cases in- 
cluded within the study groups. 


Selective serotonin reuptake inhibitors 


Following the success of clomipramine, the SSRIs fluvoxamine, flu- 
oxetine, sertraline, paroxetine, citalopram, and escitalopram were 
tested in large adult OCD samples, in a series of well-powered 
multi-centre placebo-referenced RCTs, and consistently found to 
be efficacious [31, 32]. Similarly, evidence for the efficacy of the 
SSRIs fluvoxamine, sertraline, fluoxetine, and paroxetine subse- 
quently accrued from RCTs of children and adolescents with OCD 
[33]. Each SSRI is reviewed in the next sections. The decision to 
use SSRIs in children and adolescents with OCD has to be weighed 
against the possible risk of behavioural disinhibition, including sui- 
cidal ideation, as a recognized concomitant of SSRI treatment in 
young people. 


Fluvoxamine 
Adults 


The finding from a number of small randomized placebo-controlled 
trials that fluvoxamine appeared efficacious in adults with OCD 
was substantiated in multi-centre placebo-controlled studies [34]. 
An earlier improvement was seen for obsessions vs compulsions, 
and significant improvement on the total Y-BOCS was evident 
by 6 weeks. Efficacy for fluvoxamine (100-300 mg) delivered in 
controlled-release form was also demonstrated [35]. 


Children and adolescents 


Fluvoxamine was also efficacious and well tolerated in a multi-centre 
study of 120 young patients (8-17 years) with OCD [36]. Insomnia, 
asthenia, and behavioural disinhibition were reported as possible 
adverse events. It has been suggested that paediatric OCD pa- 
tients with comorbid impulsive or tic disorders may be at increased 
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risk of developing behavioural disinhibition after initiation of SRI 
treatment. 


Sertraline 
Adults 


Positive placebo-referenced RCTs, as well as a multi-centre 12-week 
RCT involving 324 OCD patients [37], demonstrated that sertraline, 
flexibly prescribed in daily dosages of 50-200 mg/day, was effective 
in adults with OCD. 


Children and adolescents 


Evidence from large multi-centre RCTs, demonstrated a significant 
advantage for sertraline (2200 mg/day), delivered as monotherapy 
or in combination with CBT, compared to placebo [38]. Prospective 
evaluation did not reveal clinically meaningful drug-related changes 
in vital signs, ECG indices, or cardiovascular adverse events in any 
of the study subjects. 


Fluoxetine 
Adults 


The first placebo-referenced fixed-dose RCT of an SSRI in OCD, 
which allowed the direct comparison of different daily dosages, dem- 
onstrated placebo-referenced superiority for fluoxetine with a mar- 
ginally statistically greater improvement for patients receiving the 
highest fluoxetine dosage (60 mg daily) and a significantly higher re- 
sponder rate in patients receiving fluoxetine 40 mg or 60 mg daily, 
implying the existence ofa dose-response relationship [39]. Ina larger 
13-week multi-centre RCT of 355 OCD patients [40], all fixed daily 
doses of fluoxetine (20 mg, 40 mg, 60 mg) were efficacious, compared 
to placebo, with a non-significant trend towards greater efficacy evi- 
dent for the 60 mg daily dosage. Fluoxetine was adequately tolerated, 
even at the higher dosages. Another study [41] found fluoxetine to be 
superior to phenelzine, which did not differentiate from placebo. 


Children and adolescents 


The first controlled study in young OCD patients found fixed daily 
dosages of 80 mg fluoxetine demonstrated efficacy over placebo, with 
reports of discontinuation on account of suicidal ideation (resolved 
when fluoxetine was discontinued) and adverse events, particularly 
dose-related behavioural activation [42]. Thus, it was advocated that 
treatment should be initiated at low dosages in childhood OCD. More 
recent evidence confirmed fluoxetine to be significantly superior to 
placebo in terms of efficacy and just as well tolerated in terms of ad- 
verse event-related discontinuation. Moreover, after the first 6 weeks, 
the fluoxetine dose could be increased to 80 mg/day [43]. At week 
16, those taking fluoxetine demonstrated a significant improvement 
in Children’s Yale-Brown Obsessive Compulsive Scale (CY-BOCS) 
scores, and no patients discontinued due to adverse events. Taken to- 
gether, these results suggest satisfactory efficacy and tolerability for 
fluoxetine in childhood OCD, imply that the treatment effect may take 
several weeks to develop, and draw attention to behavioural disinhib- 
ition and suicidal ideation as possible dose-related adverse effects. 


Paroxetine 


Adults 


Paroxetine (20-60 mg/day) was superior to placebo and of compar- 
able efficacy with the active comparator clomipramine (50-250 mg/ 


day) in a multi-national 12-week RCT of 406 adults with OCD [44]. 
Of note, paroxetine showed significantly better tolerability than 
clomipramine. In addition, a 12-week placebo-referenced fixed- 
dose RCT in 348 adult OCD patients [45] also found efficacy for 
paroxetine, compared to placebo, at fixed daily dosages of 40 or 60 
mg/day, but no demonstrable efficacy over placebo at fixed dosages 
of 20 mg/day. 


Children and adolescents 


A multi-centre 10-week RCT in 207 children and adolescents [46] 
found paroxetine (10-50 mg/day) to be superior to placebo. Adverse 
events with paroxetine were of mild to moderate intensity, and the 
authors concluded it was generally well tolerated. 


Citalopram 
Adults 


A 12-week multi-centre placebo-referenced fixed-dose RCT of 401 
adults with OCD found that citalopram (20, 40, and 60 mg) was sig- 
nificantly superior to placebo [47]. Whereas no significant differ- 
ence between the individual doses of citalopram was seen on the 
primary outcome, a dose vs time-to-response relationship favoured 
the 60 mg dose. Citalopram was well tolerated and associated with 
improved psychosocial functioning. 


Escitalopram 
Adults 


A multi-centre fixed-dose RCT in 466 adults with OCD compared 
escitalopram (10 or 20 mg) and paroxetine (40 mg) with placebo 
[48] over 24 weeks. Escitalopram and paroxetine were each superior 
to placebo from 6 weeks onwards. Those in the escitalopram 10 mg/ 
day group also exhibited improvement, compared to placebo, on 
the Y-BOCS, though this effect took more than 12 weeks to become 
significant. The higher (20 mg/day) escitalopram dose produced 
additional advantages (vs escitalopram 10 mg/day) on secondary ef- 
ficacy measures, including remission. 


Meta-analysis of placebo-referenced SRI studies 


A meta-analysis provides an objective and quantifiable measure of 
effect size. However, by grouping together findings from disparate 
studies, the analysis is relatively removed from the originating data. 
Several meta-analyses in adults and in children and adolescents have 
supported placebo-referenced efficacy for SSRIs and clomipramine 
in OCD [32]. The meta-analysis by Soomro et al. [49], which included 
17 adult OCD studies (3097 study participants), demonstrated that 
in treatment trials, SSRIs were almost twice as likely as placebo to 
produce a clinical response. Another recent random effects meta- 
analysis of SRI trials involving 789 young people with OCD [50] 
found a moderate effect size for treatment efficacy (g = 0.50), treat- 
ment response [relative risk (RR) = 1.80], and symptom/diagnostic 
remission (RR = 2.06). 


Which SRI is the most effective? 


There is a shortage of studies directly comparing available treatments 
for OCD. A few head-to-head comparator studies of adequate statis- 
tical power indicated equivalent efficacy for SSRIs and clomipramine, 
but superior tolerability for SSRIs [28]. The UK National Institute for 
Health and Care Excellence (NICE) meta-analysed all the available 
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evidence [51]. They concluded there was little difference between 
clomipramine and SSRIs, in terms of efficacy and adverse events. 
However, patients were more likely to discontinue treatment prema- 
turely due to adverse events with clomipramine, compared to SSRIs. 
NICE therefore recommended SSRIs as the preferred first-line treat- 
ment for OCD, with clomipramine reserved for patients for whom 
SSRI treatment is inefficacious or unacceptable [51]. 

Although SSRIs differ in pharmacokinetics, selectivity, and po- 
tency of effect on the serotonin transporter and other neurotrans- 
mitter receptor-binding properties, there is insufficient evidence to 
recommend one SSRI over another. Most recently, Skapinakis et al. 
[32] performed a network meta-analysis that combined direct and 
indirect evidence to rank treatments. The analysis demonstrated ef- 
ficacy for SSRIs as a whole, compared to placebo; individual SSRIs 
produced similar effect sizes, and there was no evidence to suggest 
superiority of any individual compound. Clomipramine appeared to 
have a larger effect size, compared to placebo, than did the SSRIs, but 
this difference was not statistically significant. 


Optimizing SRI dosage 


Cumulative evidence suggests OCD responds preferably to a 
higher daily dosage of SRI, compared to depression or other anx- 
iety disorders. Fluoxetine, sertraline, paroxetine, citalopram, and 
escitalopram were explored in fixed-dose studies (see Selective sero- 
tonin reuptake inhibitors, p. 1013). Of these SSRIs, a positive dose- 
response relationship was demonstrated for paroxetine (dose range 
40-60 mg), fluoxetine (dose range 20-80 mg) [40], and escitalopram 
(dose range 10-20 mg) [48]. Neither clomipramine nor fluvoxamine 
have been subjected to multiple fixed-dose comparisons. However, a 
relatively low, fixed daily dose of 75 mg clomipramine was found to 
be effective vs placebo [29]. 

Another meta-analysis incorporating nine studies [52] demon- 
strated that the highest licensed doses of SSRI were more effica- 
cious than low or mid-range doses. However, this advantage had 
to be balanced against poorer tolerability at higher dose levels. The 
British Association for Psychopharmacology treatment guideline 
[2, 53] suggested starting with the lowest efficacious daily dosage of 
SSRIs, which may subsequently be increased in the face of insuffi- 
cient response. However, slow up-titration could be problematic for 
severely ill patients, for whom a speedy response is needed. The APA 
guidelines [54] recommend up-titration of SSRIs to the maximum 
FDA-approved doses within 4-6 weeks and waiting for another 6 
weeks to evaluate effectiveness. 


Onset of SRI efficacy 


The SRI response in OCD is comparatively slow and gradual. Thus, 
although some well-powered studies were able to demonstrate 
placebo-referenced efficacy for SRIs within 2 weeks of starting treat- 
ment, in most RCTs, the onset of efficacy was delayed by several 
weeks. In the meta-analysis by Soomro et al. [49], SSRIs as a group 
produced a placebo-referenced statistical difference in Y-BOCS after 
between 6 and 13 weeks of treatment. 


Is the efficacy of SRIs sustained long term? 


The limited long-term follow-up data investigating SSRI treatment 
responders from acute-phase OCD studies suggest that treatment- 
related gains are maintained without tolerance developing. For ex- 
ample, in a large 24-week multi-centre study by Stein et al. [48], 


responders continued to accrue over this extended period. Other 
follow-up studies suggested SSRIs remain efficacious and well toler- 
ated for at least 2 years [31]. 


SRI and relapse prevention 


It is of interest that in a recent study of a community-based sample 
of adults with OCD, approximately 50% achieved sustained remis- 
sion by the age of 50 years, suggesting natural remission occurs com- 
monly [55]. This contrasts with OCD as seen in the clinic, which 
usually follows a chronic and relapsing course. Full remission is in- 
frequent, even in expert treatment centres. 

Irrespective of treatment duration, discontinuation of pharmaco- 
therapy is frequently, but not always, associated with relapse [31]. 
A number of relatively short-term studies have investigated the ef- 
fect of stopping clomipramine. In the majority of cases, symptoms 
re-emerged within a few weeks, whereas improvement to a level 
near to that prior to discontinuation was achieved by reinstating 
clomipramine [6]. 

The effect of discontinuing SSRI has been subject to more robust, 
controlled investigation. A meta-analysis of randomized placebo- 
referenced RCTs of stable treatment responders demonstrated that 
continuing treatment with SSRIs as a class reduced relapse [6]. 
Interestingly, individual studies testing sertraline [56] and fluox- 
etine [57], which may have been underpowered, did not fully dif- 
ferentiate between continuation treatment or switching to placebo, 
although patients remaining on the highest (60 mg/day) fluoxetine 
doses showed significantly lower relapse rates than those given 
placebo [57]. 

Together, the results suggest that SSRIs are effective at preventing 
relapse. A pooled analysis by Hollander et al. [7] demonstrated the 
magnitude of the negative effect of relapse on quality of life and 
psychosocial functioning, emphasizing the importance of early 
intervention and relapse prevention, through the continuation of 
pharmacotherapy, as rational goals for those who have responded 
to an SSRI or clomipramine. Regular clinic appointments have been 
shown to enhance treatment adherence [58]. The APA treatment 
guidelines [54] recommend continuation of pharmacotherapy for 
a minimum of 1-2 years in treatment-responsive individuals and 
emphasize the importance of planning long-term treatment from 
the outset. 


Treatment options for patients who do 





not respond 


The clinical management of treatment-resistant OCD has not been 
thoroughly investigated, although studies indicating promising 
treatment strategies are starting to appear. The majority of such 
studies have investigated SRI non-responders, and there remains a 
want of data relating to psychotherapy resistance. Studies have also 
chosen different entry criteria; some included only extremely refrac- 
tory cases who had failed to respond to successive sustained treat- 
ments with more than one SRI, whereas others included those who 
had made a partial response to treatment with a single drug. 


Defining treatment resistance 


Although most individuals with OCD respond well to first-line 
treatment with SRIs or ERP, for most, the treatment response is not 
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complete. In around 40% of cases, residual symptoms remain in 
spite of prolonged treatment [59]. There remains controversy as to 
the length of an adequate treatment trial before judging its success. 
As OCD responds in a slow, gradual way, expert guidelines suggest 
waiting at least 12 weeks using optimized SRI dosages [54]. For prac- 
tical purposes, patients experiencing <25% reduction of their base- 
line Y-BOCS score, after completing 12 weeks of SRI treatment, of 
which at least 6 weeks should be at the maximal tolerated dose, may 
be considered to have failed to respond [54]. 


Extending trial of original SRIs 


The APA [54] recommended continuing with an SSRI for 8-12 
weeks, of which 6 should be at the maximum tolerated dose, be- 
fore considering a change in treatment. However, this may result in 
patients enduring ineffective treatment for longer than necessary. 
A few studies attempted to identify likely treatment non-responders 
within a shorter time frame, based upon their early treatment re- 
sponse. For example, symptomatic improvement on SRIs at 4 
weeks strongly predicted subsequent clinical response at 12 weeks 
[60]. This finding emphasizes the importance of looking for early 
treatment-related improvement and hints that changing SRIs could 
be considered earlier than 12 weeks into the course of treatment. 
However, data from another study of fluoxetine non-responders 
[61] suggested delayed benefit accrues with extended fluoxetine 
treatment. Importantly, the patients in this study were not neces- 
sarily fully SSRI-resistant at baseline. However, it may still be appro- 
priate to persist for >12 weeks with a given SRI in patients showing 
at least some signs of improvement, particularly if the treatment is 
well tolerated. 


Increasing SRI dosage 


The maximum recommended dose (250 mg/day) of clomipramine 
should not usually be exceeded, due to the increased risk of dose- 
related serious adverse events, including cardiotoxicity or seizures. In 
contrast, some evidence suggests SSRIs may show better efficacy and 
are well tolerated by OCD patients at doses exceeding the licensed 
maximum. A double-blind study [62] randomized 66 OCD patients 
who had failed to respond to standard doses to receive sertraline 200 
mg/day or 250-400 mg/day. Improvement (on Y-BOCS and CGI) 
was significantly greater for the high dose group. Both groups dis- 
played similar levels of adverse events, suggesting sertraline is com- 
paratively more effective and well tolerated at doses of >200 mg/day. 
Similarly, open-label studies that tested a ‘higher than usual’ dose of 
escitalopram (20-50 mg/day) [63] showed a significantly greater im- 
provement in Y-BOCS scores and response rate than doses of 20 mg/ 
day. Interestingly, the difference disappeared when baseline depres- 
sion and anxiety scores were analysed as covariates, suggesting that 
OCD patients with comorbid depression or anxiety may specifically 
benefit from a higher SSRI dosage. 

The APA OCD guideline [54] provides a list of upper doses of 
SSRIs that are occasionally prescribed for those who have failed to 
respond to conventional doses or are recognized to be ‘fast metab- 
olizers. As long as the patient is not experiencing undue adverse ef- 
fects, the guideline [54] recommends ‘occasionally prescribed’ doses 
of up to 120 mg/day of fluoxetine, 450 mg/day of fluvoxamine, 100 
mg/day of paroxetine, and 400 mg/day of sertraline. Such doses seem 
most warranted when the patient experienced a partial response to 
a lower dose and is tolerating the medication well. However, such 


an approach is not without risk. In the case of citalopram and 
escitalopram, dose-dependent effects are recognized to extend the 
ECG QT interval. Therefore, caution is required if the maximum 
dose is exceeded. The elderly or those with a cardiac history may be 
at particular risk. However, a recent study did not show an elevated 
cardiac risk on higher doses of citalopram [64]. If exceeding the li- 
censed daily dosage, it is advisable to monitor for adverse effects on 
cardiac conduction (for example, by ECG). The elderly may also be 
susceptible to SSRI-induced electrolyte disturbances and bleeding 
tendencies, and for those on anticoagulant therapy, the international 
normalized ratio (INR) may require more stringent monitoring. 


Switching SRI/SSRI to SNRI 


Assuming good adherence, if response to the first SRI is inadequate 
after 8-12 weeks on the maximum dose or is poorly tolerated, 
switching to another SRI or from SSRI to clomipramine is accept- 
able, though limited trial data support this option. The chances of 
responding to a second SRI were estimated at 40%, and to a third 
SRI at even less [59]. Some evidence supports switching SRIs in the 
case of non-response and suggests that paroxetine (SSRI) is more ef- 
ficacious than venlafaxine (SNRI) for non-responders to a previous 
SRI trial [65]. 


Adjunctive dopamine antagonist (antipsychotic) 


Adjunctive dopamine antagonists have arguably produced the 
strongest evidence of effectiveness in SRI-resistant OCD. Even so, the 
evidence is largely derived from small RCTs and uncontrolled case 
series (reviewed in [59]). There have been no dose-finding studies 
of dopamine antagonists in OCD. Moreover, there are limited data 
to inform the optimal duration of adjunctive treatment, the effect 
on relapse prevention, and the need for metabolic monitoring for 
this patient group. Apart from encouraging preliminary results for 
aripiprazole in young people (see Aripiprazole, p. 1017), the current 
limited evidence does not support the use of dopamine antagonists 
as monotherapy in OCD. 


Early studies with dopamine antagonists 


A case series by McDougle et al. [66] reported benefit from adding 
open-label pimozide in patients unresponsive to fluvoxamine. Those 
with comorbid chronic tic or schizotypal disorder responded best. 
A subsequent double-blind placebo-controlled study [67] demon- 
strated significant improvements when haloperidol was added to flu- 
voxamine. Better response was reported for patients with comorbid 
tic. However, dose-dependent extra-pyramidal side effects, such 
as akathisia, occurred to the extent that in some studies [66], up 
to half the patients required co-administration of B-blockers and/ 
or anticholinergics. Therefore, these agents are usually reserved as 
third-line treatment after a trial of a dopamine antagonist drug with 
serotonergic actions. It is recommended to start treatment at low 
dosages and increase cautiously, subject to tolerability (for example, 
0.25-0.5 mg haloperidol, titrated slowly to 2-4 mg) [67]. 


Newer dopamine antagonist drugs 
Risperidone 


Several small RCTs have demonstrated efficacy for adjunctive 
risperidone (0.5-4 mg/day) in SSRI-resistant OCD [68]. However, a 
RCT [69] that treated 100 SSRI partial or non-responders found no 
benefit for adjunctive risperidone (up to 4 mg) over placebo. These 
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data suggest that the benefits of adding risperidone may be limited 
to patients with clear evidence of SSRI resistance. 


Paliperidone 


A small 8-week double-blind, placebo-referenced RCT of adjunctive 
paliperidone (9-OH-risperidone) (<9 mg/day) in 34 cases of SSRI- 
resistant OCD did not produce statistically significant improvement 
[70]. Better powered studies are needed to unequivocally demon- 
strate efficacy. 


Olanzapine 


There has been one positive and another negative placebo-referenced 
RCT, which may have failed because the study population was not 
truly SSRI-resistant prior to entering the trial (reviewed in [31, 59]). 


Quetiapine 


Studies investigating adjunctive quetiapine (<400 mg daily) in 
treatment-resistant OCD have also yielded mixed results (re- 
viewed in [31, 59]). In one double-blind, placebo-controlled study, 
adjunctive quetiapine was efficacious. However, another smaller 
RCT in more highly treatment-resistant cases found that although 
quetiapine (mean 215 mg/day) was numerically superior to placebo, 
the results did not reach statistical significance. In two further RCTs, 
no significant difference between adjunctive quetiapine and placebo 
emerged. Another study found that quetiapine successfully aug- 
mented the effect of citalopram in 76 cases of non-resistant OCD, 
suggesting its effects may not be limited to resistant patients. 


Aripiprazole 


Two small, double-blind, placebo-controlled trials of adjunctive 
aripiprazole (15 mg/day, 10 mg/day) found evidence of efficacy in 
adults with SRI-resistant OCD [(reviewed in [31, 59]). Adjunctive 
aripiprazole has also been subjected to a preliminary open-label 
study in young people with SRI-resistant OCD, with promising 
results. An open-label study that investigated aripiprazole mono- 
therapy (2-7.5 mg/day) in 16 children with SSRI and CBT-resistant 
OCD also reported positive findings [71]. These results suggest 
promise for aripiprazole in treatment-resistant OCD across the life- 
span and support further exploration of the effect of antipsychotic 
monotherapy in OCD. 


Meta-analysis of adding dopamine antagonist/partial 
agonist drugs 


While studies on the aforementioned drugs have demonstrated ef- 
fectiveness as augmentation agents in OCD, the overall effect size is 
modest and only around one-third of patients are noted to respond. 
Ina pooled analysis [72] of data from placebo-controlled quetiapine 
trials, the best results were seen when quetiapine was used in com- 
bination with clomipramine, fluoxetine, or fluvoxamine and when 
the SRI dose was lower, suggesting quetiapine may be of most benefit 
in patients unable to take maximal doses of SRIs. 

A subsequent meta-analysis [73], which combined results from 
14 randomized, placebo-controlled studies (incorporating 491 pa- 
tients), showed that significantly more participants responded to 
augmentation with dopamine antagonists/partial agonists ‘as a 
group than placebo. Furthermore, aripiprazole, haloperidol, and 
risperidone significantly outperformed placebo control. Another re- 
cent meta-analysis [74] that was limited to newer drugs for psychosis 


and included 14 studies and 493 patients produced a similar result. 
Dopamine antagonists as a class were efficacious, as were aripiprazole 
and risperidone, but there was insufficient evidence to suggest that 
quetiapine or olanzapine were superior to placebo. 

A mixed-model meta-regression analysis of 13 RCTs [75] cor- 
related the efficacy of each antagonist with its neurotransmitter 
receptor-binding affinities. A significant positive correlation was 
found between dopamine receptor (D2R, D3R) affinity and clin- 
ical efficacy (standardized mean difference in Y-BOCS score), sug- 
gesting the drugs may exert their clinical effects in OCD through 
this mechanism. 


Intravenous administration of SRIs 


The intravenous (IV) administration of SRIs is highly dependent 
on the configuration of health systems and has limited practicality 
outside inpatient settings. In a study of 54 oral clomipramine- 
refractory OCD patients randomized to IV clomipramine (25 
mg/day, increasing to 250 mg/day) or IV placebo [76], significant 
advantages were seen for clomipramine on some efficacy meas- 
ures, but not on the Y-BOCS. IV clomipramine was not associ- 
ated with serious adverse events. In a more recent open-label study 
[77] of 30 multiply SRI-resistant inpatients administered with IV 
clomipramine for 1 week, followed by oral clomipramine dosed 
up to 225 mg/day, 76.7% were responders and 60% were still re- 
sponders at 24 weeks. No relevant persistent adverse effects were 
reported. The results suggest IV clomipramine could be of at least 
short-term benefit for severe OCD cases that have not adequately 
responded to several therapies, but the long-term advantages need 
further exploration. 

Another open-label study of IV citalopram in SSRI-resistant 
adults, which was well tolerated even at high doses (up to 80 mg daily) 
[78], produced satisfactory response rates. All 21 IV citalopram re- 
sponders showed significant further improvement on continuation 
treatment with oral citalopram, suggesting that IV citalopram may 
be a useful method of accelerating the response for OCD patients 
and for predicting response to oral citalopram. 


Combining SSRIs and clomipramine 


This combination should be used with caution, as pharmacokin- 
etic interactions on hepatic P450 cytochrome isoenzymes may 
occur, resulting in elevated clomipramine plasma levels and poten- 
tial toxicity. One study found that the combination of fluvoxamine 
and clomipramine was efficacious and generally well tolerated, 
although high plasma levels of clomipramine associated with a 
higher frequency of clinically relevant adverse effects were seen 
[79]. Therefore, ECG, EEG, and clomipramine plasma level moni- 
toring was recommended for co-medicated patients. Compared to 
other SSRIs, citalopram and escitalopram are potentially less likely 
to interact with clomipramine. Promising results from a small 
open-label trial of citalopram (40 mg daily) in combination with 
clomipramine (150 mg daily) contrast with the results from two 
more recent small studies with methodological weaknesses, which 
found no benefit from combining clomipramine with an SSRI and 
highlighted the potential for adverse effects associated with this 
combination [61, 80]. One patient on adjunctive clomipramine 
had to withdraw because of serotonergic syndrome. Another three 
developed ECG QTc prolongation and were withdrawn. Taken to- 
gether, the results suggest limited benefits for such combinations, 
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and if they are contemplated, ECG and plasma level monitoring 
are advisable. 


Novel pharmacological agents 


The following compounds are under investigation for OCD and 
have shown some evidence of efficacy, but as so far they lack con- 
vincing validation in controlled studies, they cannot at present be 
judged to be effective. 

The glutamatergic compound memantine has appeared helpful 
as an adjunct to an SSRI in a few open-label trials and two small 
randomized, placebo-controlled trials (31, 81]. Preliminary results 
from open-label studies suggesting efficacy for riluzole, another 
glutamate-modulating agent [31], have so far not been substanti- 
ated. In a placebo-controlled trial of riluzole in children with refrac- 
tory OCD, no significant difference was noted on any of the primary 
or secondary outcome measures [82]. 

The glutamatergic hypothesis of OCD has been further explored 
through investigation of IV ketamine. Ketamine must be used cau- 
tiously, as it is associated with urinary tract damage. A placebo- 
referenced cross-over RCT in eight resistant patients produced a 
clinical response in 50% of those receiving ketamine [83]. However, 
in another 3-day open-label ketamine trial in ten subjects with 
refractory OCD and depression, there were no responders and, 
although depressive symptoms improved, the improvement in Y- 
BOCS scores amounted to <12% [84]. 

The 5-HT3 receptor antagonist ondansetron, administered 
in combination with fluoxetine, demonstrated a superior effect 
over adjunctive placebo in a randomized controlled pilot study 
in treatment-resistant OCD [85]. However, an as yet unpublished 
multi-centre trial did not demonstrate an improvement in OCD 
symptoms vs placebo. 

Mirtazapine as monotherapy was found to significantly improve 
outcomes in a placebo-controlled discontinuation study of 15 open- 
label mirtazapine responders [86]. 

Clonazepam, as an adjunct to an SRI, may produce symptomatic 
benefit, possibly through improving associated anxiety. It is less suit- 
able in those with a previous history of benzodiazepine or other sub- 
stance abuse or dependence [87]. 

Anti-epileptic mood stabilizers may, in combination with an 
SSRI, have a role in the treatment of OCD, but the supporting evi- 
dence is not strong and further placebo-controlled trials are neces- 
sary. Positive results were obtained in a small RCT of lamotrigine 
and another of topiramate in which the therapeutic effect was seen 
on reducing compulsions only (reviewed in [31]). A second RCT 
of adjunctive topiramate in 38 resistant patients failed to demon- 
strate efficacy [88]. Adjunctive pregabalin has been investigated in 
an open-label case series only, with signs of possible efficacy, as has 
gabapentin [89]. 

A randomized, placebo-controlled trial of single-dose D- 
amphetamine produced short-lived benefits, while another RCT 
directly comparing D-amphetamine and caffeine noted that both 
compounds were associated with rapid improvement of obsessive 
compulsive symptoms within a week, hinting that stimulants, such 
as D-amphetamine, could play a role in treating OCD, possibly in 
the context of comorbid attention-deficit/hyperactivity disorder 
(ADHD) [90]. 


Combined CBT and pharmacotherapy 


Although many specialist centres offer combined treatment, there 
remains uncertainty over the degree to which combining CBT with 
SSRIs improves outcomes, compared to monotherapy. Some studies 
suggested that combining CBT with pharmacotherapy produced 
better results than CBT monotherapy [59]. Other studies found that 
patients who responded only partially to an SRI fared better if CBT 
with ERP was added [12, 91]. Thus, a trial of combined CBT plus an 
SSRI would seem appropriate for those patients failing to respond 
adequately to SSRI monotherapy. 

The APA Practice Guidelines [54] suggest using either an SRI or 
CBT as first-line treatment. According to a cost-effectiveness ana- 
lysis, the UK NICE guidelines [51] recommend combining CBT 
with an SRI only in more severe or treatment-resistant OCD cases. 

The network meta-analysis by Skapinasis et al. [32] highlighted 
that although there are limited data regarding the efficacy of com- 
bination treatment, many trials of psychological therapies can be 
considered as variations of combination trials, as most allowed study 
participants to remain on stable doses of drugs. They concluded that 
the combination of SRIs with psychotherapy is likely to be more 
beneficial for patients with severe OCD than monotherapy. Further 
research is necessary to substantiate this claim. 


Somatic treatments in OCD 


Failure to respond to the aforementioned treatments, including 
combination treatment with intensive inpatient and/or home- 
based or clinic-based therapist-assisted CBT, may indicate refrac- 
toriness to treatment. At this stage, if the symptoms remain severe 
and incapacitating, it may be necessary to liaise with specialist 
services offering somatic treatments such as neurostimulation or 
neurosurgery. 


Non-invasive neurostimulation 
Repetitive transcranial magnetic stimulation 


Repetitive transcranial magnetic stimulation (rTMS) modulates 
neuronal activity by inducing a magnetic field pulse. rTMS directed 
at the prefrontal subcortical circuits is hypothesized to be beneficial 
in OCD [92]. However, there is presently insufficient evidence to 
recommend the use of rTMS as treatment, and it remains an experi- 
mental procedure. A meta-analysis [93] of ten RCTs involving 282 
subjects found a moderate effect size (0.59) and the response rates 
were 35% and 13% for patients receiving active and sham rTMS, re- 
spectively. The most promising target areas for stimulation included 
the orbitofrontal cortex (OFC) and the pre-supplementary motor 
area (SMA). rTMS is generally safe. However, rarely, high-frequency 
rTMS may induce seizures. Other reported adverse effects include 
localized pain, paraesthesiae, hearing change, thyroid-stimulating 
hormone and blood lactate level changes, and hypomania; 

however, these problems are usually transient [92]. 


Transcranial direct current stimulation 


Research on the use of transcranial direct current stimulation 
(tDCS) for treating OCD is still in its infancy. There are no currently 
published RCTs of tDCS in OCD, but results from a small number 
of uncontrolled tDCS treatment trials targeting either the SMA 
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or the OFC, in which tDCS was well tolerated in all cases, appear 
promising [94]. 


Transcranial alternating current stimulation 


A recent case series showed the successful application of transcranial 
alternating current stimulation (tACS) in seven SSRI- and 
psychotherapy-resistant OCD patients [95]. Again, further sham- 
controlled double-blind studies are warranted if transition of this 
therapy from the laboratory to the clinic is to be considered. 


Deep brain stimulation 


Deep brain stimulation (DBS) involves the neurosurgical implant- 
ation of electrodes that interrupt electrical impulses linking spe- 
cific locations in the brain. This approach permits focal, relatively 
low-risk, and relatively reversible modulation of brain circuitry. It is 
hypothesized that DBS brings about therapeutic effects in OCD by 
modulating the cortico-striatal neurocircuitry proposed to mediate 
OCD [96]. At present, DBS remains a highly experimental treatment 
for extremely severe enduring illness, with evidence largely based on 
case series. Studies are small with, at best, partially controlled de- 
signs. Although the procedure is reported to be ‘relatively safe, with 
limited side effects [96], the 3-year outcomes of bilateral stimulation 
of ventral capsule/ventral striatum in ten adult OCD patients in- 
cluded surgical adverse effects, such as asymptomatic haemorrhage, 
seizure, and superficial infection, as well as psychiatric adverse ef- 
fects, including worsening of depression and OCD when DBS was 
interrupted [97]. Other acute adverse effects included transient 
hypomania, sadness, anxiety, euphoria, and giddiness. 

A meta-analysis by Alonso et al. [98], which included 116 OCD 
patients from 31 studies, calculated the global reduction in Y-BOCS 
score as 45.1% and the global number of responders as 60.0%. 
Older age of onset of OCD and the presence of sexual or religious 
obsessions or compulsions were associated with a better response. 
Worsening of anxiety was the most commonly reported adverse 
event, occurring in 21.6% of patients. However, the authors state that 
all side effects were generally mild and transient and that the treat- 
ment appeared to be tolerable for most patients. 


Ablative neurosurgery 


Modern ablative neurosurgical procedures are stereotactically 
guided, resulting in small and accurately placed lesions. This is 
most commonly achieved using thermal stimuli, although there is 
ongoing research into the use of radiosurgical techniques such as 
the gamma knife. 

Anterior cingulotomy, involving lesions placed in the dorsal anterior 
cingulate cortex, and anterior capsulotomy, involving lesions placed 
within the inferior fronto-thalamic connections within the anterior 
limb of the internal capsule, are the most common procedures [99]. 
Both are hypothesized to modulate functioning within the cortico- 
striatal-thalamic circuitry. The available evidence suggests that neuro- 
surgery produces significant therapeutic benefits to 30-60% of patients 
with otherwise highly refractory OCD. Serious adverse effects are un- 
common but have been reported with both procedures (for example, 
intracranial haemorrhage, recurrent seizures). Anterior cingulotomy 
appears to offer a superior safety profile to that of anterior capsulotomy. 
The quality of evidence supporting each procedure is reflective of 
neurosurgery as a whole. A single RCT of gamma capsulotomy has 
been performed in 16 highly refractory patients [100]. There was a 


non-significant benefit on the Y-BOCS for the actively lesioned group. 
The most serious adverse event was a single asymptomatic radiation- 
induced cyst. Surgical intervention is reserved for patients with severe, 
incapacitating OCD who have failed an exhaustive array of expertly 
delivered medication trials and intensive evidence-based CBT. 
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Hoarding disorder (HD) is a mental disorder that was newly in- 
cluded in the Diagnostic and Statistical Manual of Mental Disorders, 
fifth edition (DSM-5) [1]. More recently, it was also included in the 
eleventh revision of the International Classification of Diseases (ICD- 
11) [2]. Individuals meeting the diagnostic criteria for HD experi- 
ence persistent difficulty discarding or parting with possessions due 
to a perceived need to save the items and distress associated with 
discarding them. This results in the accumulation of possessions that 
congest and clutter active living areas and substantially compromise 
their intended use, causing clinically significant distress, interfering 
in daily life, and sometimes posing a risk to self and others. General 
medical conditions and other mental disorders that are known to 
result in excessive accumulation of objects must be ruled out before 
the diagnosis of HD can be made. 

Hoarding as a characterological trait has its origins in the psy- 
choanalytical descriptions of the ‘anal’ character [3]. The ‘inability 
to discard worn-out or worthless objects even when they have no 
sentimental value’ became a core diagnostic criterion for obsessive- 
compulsive personality disorder (OCPD) only with the publication 
of DSM-III-R [4]. In 1994, DSM-IV [5] introduced the idea that 
‘extreme’ hoarding might, in fact, warrant a diagnosis of obsessive- 
compulsive disorder (OCD), leading to the widespread conceptu- 
alization of hoarding as a symptom dimension of OCD. However, 
research accumulated over the last two decades has found that most 
individuals who display hoarding behaviour do not endorse other 
symptoms of OCD and has further delineated the differences be- 
tween the two constructs. This culminated in the inclusion of the 
new disorder in DSM-5, separate from OCD [3]. 

In DSM-5, HD is included under the broad umbrella of obsessive- 
compulsive and related disorders, which—in addition to OCD— 
also includes body dysmorphic disorder, hair pulling disorder 
(trichotillomania), and excoriation (skin picking) disorder. The 
reasons for including HD in this chapter are primarily historical, 
given the previous conceptualization of problematic hoarding as 
being a criterion of OCPD and/or a symptom of OCD. While there 
are similarities between HD and other obsessive-compulsive and re- 
lated disorders [6], it is also apparent that HD shares features with 
other emotional, impulse control, and neurodevelopmental dis- 
orders such as attention-deficit/hyperactivity disorder (ADHD) [7]. 





During the process leading to DSM-5, the OCD and Related 
Disorders Sub-Workgroup also proposed the removal of the 
hoarding criterion of OCPD, a recommendation that was initially 
endorsed by the DSM-5 Personality Disorders Workgroup [8], but 
eventually not followed. Thus, in DSM-5, OCPD retains the ‘in- 
ability to discard objects’ criterion, but it encourages clinicians to 
consider the possibility of HD, rather than OCD, when the hoarding 
is ‘extreme’ 









































The DSM-5 diagnostic criteria for HD [1] are based on an earlier op- 
erational definition of ‘compulsive’ hoarding [9]. The landmark fea- 
ture of HD is a persistent difficulty with discarding or parting with 
possessions (Criterion A). The most commonly saved items include 
newspapers, old clothing, bags, books, and paperwork, but virtually 
any item can be saved. The nature of items is not limited to worth- 
less possessions, because many individuals collect and save valuable 
things too. These discarding difficulties are generally motivated by 
the perceived utility or aesthetic value of the items, a strong senti- 
mental attachment to the possessions, the fear of losing important 
information, a desire to avoid being wasteful, or—typically—a 
combination of these factors [9]. The prospect of discarding or 
parting with possessions causes substantial distress to the individual 
(Criterion B). 

These difficulties result in the disorganized accumulation of pos- 
sessions that congest and clutter active living areas and substantially 
compromise their intended use (Criterion C). Affected individuals 
may not be able to sleep in their beds, cook in the kitchen, or sit 
on the sofas in their living rooms. Frequently, the clutter extends 
beyond the person’s actual home, with accumulation of possessions 
taking place in garages, gardens, vehicles, and even workplaces. 
Some individuals may pay for private storage spaces or ask family 
members or friends to keep items in their homes. Criterion C can 
still be met if the living areas are uncluttered only because of the 
intervention of third parties (for example, family members, cleaners, 
or authorities). Because children and adolescents typically do not 
control their living environment and have limited acquisitive power, 
the possible intervention of third parties (for example, parents 
keeping the spaces useable and thus reducing interference) should 
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be considered when making the diagnosis. Young individuals with 
HD who live at their parent’s home may not have unduly cluttered 
bedrooms but still fulfil all other diagnostic criteria. 

Criterion D requires that the difficulties with discarding and/or 
clutter cause clinically significant distress or impairment in social, 
occupational, or other important areas of functioning, including 
maintaining a safe environment for self and others. In severe cases, 
hoarding can put individuals at risk for fire, falling, poor sanitation, 
and other health risks. Some individuals, particularly those with 
poor insight, may not report distress, and the impairment may only 
be apparent to those around the individual. However, any attempts 
to discard or clear the possessions by third parties result in high 
levels of distress. Quality of life is often severely impaired, and family 
relationships are frequently strained [10]. Legal proceedings ranging 
from forced clearings to evictions and, where relevant, the removal 
of dependents from the home (for example, children, elderly) are 
relatively common [11]. 

A diagnosis of HD can only be given once other general medical 
conditions (Criterion E) and mental disorders (Criterion F) have 
been ruled out (see Differential diagnosis, p. 1024). 

For a diagnosis of HD, all six criteria must be met. For individuals 
fulfilling the diagnostic criteria for HD, clinicians may also assess 
the presence of two specifiers: excessive acquisition and insight. 

The excessive acquisition specifier is endorsed when the individual 
engages in excessive acquisition of free items, excessive buying, 
or—less frequently—stealing items that are not needed or for which 
there is no space available. Approximately 63-90% of persons with 
HD engage in excessive acquisition [12-15]; these individuals typic- 
ally experience distress if they are unable to acquire items or are pre- 
vented from acquiring items. Reducing excessive acquisition is one 
of the main goals in the treatment of HD (see Treatment, p. 1027). 

The insight specifier has three categories (good/fair, poor, and 
absent/delusional) and refers to the degree to which the individual 
recognizes that hoarding-related beliefs and behaviours are prob- 
lematic. A substantial proportion of sufferers lack insight into their 
difficulties and are reluctant to seek help for their problems [16]. 


Increasing motivation for change is an integral part of the current 
psychological treatment for HD (see Treatment, p. 1027). 

Other common features of HD include indecisiveness, perfec- 
tionism, avoidance, procrastination, difficulty with planning and 
organizing tasks, and distractibility [9]. Some individuals with 
HD live in various degrees of unsanitary conditions (squalor) that 
may be a logical consequence of severely cluttered spaces and/or 
related to planning and organizing difficulties [17]. However, the 
majority of individuals living in severe domestic squalor do not 
meet the criteria for HD; rather, squalid homes are typically seen 
in other medical or neurocognitive conditions (see Differential 
diagnosis, p. 1024). 

Some individuals are known to accumulate large number of ani- 
mals. In animal hoarding, there is generally failure to provide min- 
imal standards of nutrition, sanitation, and veterinary care for the 
animals and failure to act on the deteriorating condition of the ani- 
mals and the environment. It is unclear if animal hoarding is a spe- 
cial manifestation of HD, but most individuals who hoard animals 
also hoard inanimate objects [18]. 





Differential diagnosis 


A diagnosis of HD can be made only after other medical condi- 
tions and mental disorders that can lead to excessive accumulation 
of possessions have been ruled out. A careful psychopathological 
interview is necessary to establish the differential diagnosis of HD. 
The presence of cluttered living spaces alone does not necessarily 
signal the presence of HD, because cluttered (and sometimes un- 
hygienic) living spaces may be the consequence of multiple condi- 
tions (Fig. 99.1). 


Other medical conditions 


HD is not diagnosed if the symptoms are judged to be a direct con- 
sequence of another medical condition (Criterion E), such as trau- 
matic brain injury, surgical resection for the treatment of a tumour 
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or seizure control, cerebrovascular disease, infections of the cen- 
tral nervous system (for example, herpes simplex encephalitis), 
or neurogenetic conditions such as Prader-Willi syndrome [3]. 
Damage to the anterior ventromedial prefrontal and cingulate cor- 
tices has been particularly associated with the excessive accumula- 
tion of objects [19, 20]. In these individuals, the hoarding behaviour 
is not present prior to the onset of brain damage and appears shortly 
after brain damage occurs [19]. Some of these individuals appear to 
have little interest in the accumulated items and are able to discard 
them easily or do not care if others discard them, whereas others ap- 
pear to be very reluctant to discard anything [20]. 


Neurodevelopmental disorders 


HD is not diagnosed if the accumulation of objects is judged to be a 
direct consequence ofa neurodevelopmental disorder such as autism 
spectrum disorders (ASD) or intellectual disability. Individuals with 
ASD may, for example, excessively acquire or retain possessions 
that correspond to a particular sensory preoccupation or ‘special 
interest. The relevance of these behaviours should be investigated 
[for example, “Do the objects you save largely share a particular 
physical characteristic (for example, material, texture, shape)?’]. 


Schizophrenia spectrum and other psychotic disorders 


HD is not diagnosed if the accumulation of objects is judged to be 
a direct consequence of delusions or negative symptoms in schizo- 
phrenia spectrum and other psychotic disorders [3, 21]. 


Major depressive episode 


HD is not diagnosed if the accumulation of objects is judged to be 
a direct consequence of psychomotor retardation, fatigue, or loss of 
energy during a major depressive episode. 


Obsessive-compulsive disorder 


OCD warrants particular consideration, given the previous con- 
ceptualization of hoarding as a symptom of OCD [3]. HD is not 
diagnosed if the symptoms are judged to be a direct consequence of 
typical obsessions or compulsions such as fears of contamination, 
harm, or feelings of incompleteness. In such cases, OCD should be 
diagnosed instead [22]. Symmetry obsessions and compulsions are 
most commonly associated with this form of hoarding. The accu- 
mulation of objects can also be the result of persistently avoiding 
onerous rituals (for example, not discarding in order to avoid end- 
less washing or checking rituals) (3, 21]. 

Some additional clinical features are helpful in differentiating HD 
from OCD. In OCD, the behaviour is generally unwanted and highly 
distressing, and the individual experiences no pleasure or reward 
from it. Excessive acquisition is usually not present; if excessive ac- 
quisition is present, items are acquired because of a specific obses- 
sion (for example, the need to buy items that have been accidentally 
touched in order to avoid contaminating other people), not because 
of a genuine desire to possess the items. These individuals are also 
more likely to accumulate bizarre items such as trash, faeces, urine, 
nails, hair, used diapers, or rotten food [23]. Accumulation of such 
items is very unusual in HD. 

Both disorders may be diagnosed when severe hoarding appears 
concurrently with other typical symptoms of OCD but is judged to 
be independent of these symptoms [3]. 
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Neurocognitive disorders 


HD is not diagnosed if the accumulation of objects is judged to be a 
direct consequence ofa degenerative disorder such as neurocognitive 
disorder associated with fronto-temporal lobar degeneration or 
Alzheimer’s disease. Typically, onset of the accumulating behaviour 
is gradual and follows the onset of the neurocognitive disorder. The 
accumulating behaviour may be accompanied with self-neglect and 
severe domestic squalor (where there is trash, rotten food, or ex- 
crement), alongside other neuropsychiatric symptoms such as dis- 
inhibition, gambling, rituals/stereotypies, tics, and self-injurious 
behaviours [17]. 


Normal collecting 


While technically not a differential diagnosis, it is worth delineating 
the differences between normative or healthy collecting and patho- 
logical hoarding [24]. This is particularly relevant because quite a 
few individuals meeting the criteria for HD define themselves as 
‘collectors, as they perceive this term as being somewhat less pejora- 
tive [24]. Normative collecting is a common activity that is both be- 
nign and pleasurable. Most children and up to 30% of adults collect 
items at some point [25]. Collectors report the acquisition of, attach- 
ment to, and reluctance to discard objects, but they do not have the 
disorganized clutter, distress, and impairment that is characteristic 
of HD. In contrast to hoarding, the process of collecting is highly 
structured and planned, very selective (that is, confined to a narrow 
range of items), pleasurable, and often a social activity. Most col- 
lectors, even those who might be considered eccentric, are unlikely 
to meet the diagnostic criteria for HD [24, 25]. 


Although approximately 5-10% of patients with OCD display 
hoarding symptoms, the majority of individuals (>80%) with 
hoarding problems do not display other OCD symptoms [12, 13, 
23, 26]. In fact, hoarding symptoms may be equally prevalent in indi- 
viduals with anxiety disorders other than OCD, although these symp- 
toms often go unnoticed as clinicians do not ask about them [27]. 
The most common psychiatric comorbidities among hoarding 
cases are generalized anxiety disorder (31-37%), major depressive 
disorder (26-31%), OCD (15-20%), panic disorder (17%), social 
anxiety disorder (14%), and post-traumatic stress disorder (14%) [12, 
13, 15, 21, 28]. Symptoms typical of ADHD, particularly inattention, 
are also commonly reported [7, 13]. These comorbidities, rather than 
hoarding, may often be the main reason for clinical consultation [27]. 
General medical conditions may also be highly frequent in pa- 
tients with HD. The presence of HD has been associated with both 
poorer perceived physical health and higher rates of disability [15]. 
These limitations may be particularly evident in elderly individuals, 
who appear to have a disproportionally high rate of medical com- 
plications of HD, compared with age-matched control subjects [29]. 


Course and prognosis 


Retrospective reports indicate that hoarding difficulties begin early 
and span well into the late stages of life. Hoarding symptoms may 











SECTION 15 Obsessive-compulsive and related disorders 


first emerge around the age of 11-15 years, start interfering with the 
individual's everyday functioning by the mid-20s, and cause clinic- 
ally significant impairment by the mid-30s [30-32]. Problems with 
acquisition, when they do emerge, appear to do so (or become more 
evident) earlier than problems with discarding [30]. Participants in 
clinical research studies are usually in their 50s [12, 23]. A steady 
worsening of symptoms is typically reported over each decade of life 
[33]. Once symptoms begin, the course of hoarding is often chronic, 
with a few individuals reporting a waxing and waning course 
[32]. Given the absence of treatment studies with long follow-up 
periods, it is currently unknown what the long-term prognosis of 
treated HD is. 


Epidemiology 


Initial prevalence estimates, primarily from questionnaire studies 
conducted in Europe and the United States, estimated the point 
prevalence of clinically significant hoarding to be approximately 2- 
6% among adults [21] and 2% among adolescents [34]. However, in 
the only epidemiological study to date to employ the current DSM- 
5 criteria, conducted in England, the prevalence was estimated to 
be approximately 1.5% in both adult men and women [15]. This 
estimate, which was obtained principally through diagnostic inter- 
views conducted face-to-face in the participant’s homes, contrasts 
with some prior work in its suggestion that rates of HD are balanced 
across the genders. Help-seeking clinical samples are predominantly 
female, perhaps reflecting better insight and motivation for change. 
Epidemiological studies are more consistent in their finding that the 
prevalence of HD seems to increase with advancing age [15, 21]. 

Individuals with HD are more frequently unemployed and more 
often unmarried, separated, or divorced than individuals from the 
general community [15, 24, 35]. Additionally, they are more likely 
to feel that their hoarding difficulties impair their social and occu- 
pational functioning [10, 15]. The functional impairments caused by 
the disorder typically extend to the relatives or carers of the person 
who hoards [10]. 


Aetiology 


Genetic and environmental factors 


Although controlled family studies have not yet been carried 
out, patient self-report suggests that hoarding runs in families 
[23]. Consistently, a handful of twin studies conducted in large 
population-based adult twin samples suggest that this familiality is 
largely attributable to additive genetic factors (with heritability ran- 
ging between 36% and 50%), with the remaining variance attribut- 
able to non-shared environmental factors and measurement error 
[36-38]. Interestingly, a twin study in adolescents indicated possible 
sex-based variations regarding the role of genes and the environ- 
ment in hoarding symptoms, suggesting that the aetiology of HD 
may be dynamic across the lifespan, with the roles of genes and the 
environment varying over time and between the sexes [34]. 

On the other hand, research in twins suggests substantial, yet 
incomplete, genetic overlap between core symptoms of HD (diffi- 
culties discarding and excessive acquisition), while non-shared en- 
vironmental influences appear to be more specific to each of these 


traits [39]. Together with the fact that not all individuals with HD 
also excessively acquire, these twin modelling results support the 
current placement of excessive acquisition as a specifier in DSM-5, 
rather than as a core diagnostic criterion for HD. 

The mechanisms by which either genetic or environmental fac- 
tors, or their interaction, confer risk to the development of HD 
remain unknown. Specific genes implicated in HD have not yet 
been identified. Although research has suggested some unique en- 
vironmental risk factors (for example, traumatic life events), small 
samples and retrospective self-report methods leave unclear the 
role of such exposures in the onset or exacerbation of HD [31, 36]. 
Anecdotal links between material deprivation (for example, child- 
hood poverty) and hoarding have received no support in the 
literature [31]. 

Future work is warranted to understand how specific genetic 
and environmental risk factors interact to confer risk to individuals 
carefully diagnosed as having HD. It is likely that these genetic and 
environmental risk factors will be, at least in part, shared with re- 
lated conditions such as OCD and other related disorders [40]. The 
potential aetiological overlaps between HD and other neuropsychi- 
atric conditions (for example, anxiety and mood disorders, ADHD) 
remain to be explored. 


Neurobiology 


The neural substrates of hoarding behaviour have been well studied 
in animals that naturally display hoarding behaviours as part of 
their behavioural repertoire (for example, rodents, birds), as well as 
primates. These studies clearly implicate subcortical limbic struc- 
tures (nucleus accumbens, ventral tegmental area, amygdala, hippo- 
campus, thalamus, hypothalamus) and the ventromedial prefrontal 
cortex in the mediation of hoarding behaviour [20]. Research with 
animals also suggests that the dopaminergic system plays a crucial 
role in hoarding behaviour (for a review, see [20]). Whether the 
known involvement of the dopaminergic system in hoarding among 
animals can explain the relatively poor response of hoarding indi- 
viduals to serotonergic drugs [41] is an attractive hypothesis that 
remains to be investigated. 

Much less is known about normal and abnormal hoarding be- 
haviour in humans. Useful clues come from case studies of brain- 
damaged patients and of individuals with dementia, particularly of 
the fronto-temporal type. This research suggests that the ventro- 
medial prefrontal/anterior cingulate cortices, as well as medial 
temporal regions, may be implicated in hoarding behaviour [20]. 
One theory is that the former cortical regions modulate or sup- 
press subcortically driven predispositions to acquire and collect 
and adjust these predispositions to the environmental context [19]. 
Damage to these cortical regions may result in dysregulated col- 
lecting and hoarding behaviour. 

The neuroimaging literature of ‘non-organic hoarding has also 
implicated the ventromedial prefrontal/anterior cingulate cortices 
and subcortical limbic structures (for example, amygdala/hippo- 
campus) in hoarding patients [42-44]. However, the evidence is 
preliminary, obtained from small samples, and confounded in many 
cases by the presence of comorbid OCD symptoms [20]. 

Similarly, it is difficult to draw firm conclusions from early reports 
on the neurocognitive profile of HD [20]. Many of the initial neuro- 
psychological studies recruited OCD patients with hoarding symp- 
toms (with different degrees of severity), whereas others recruited 


individuals with severe hoarding, but predominantly without OCD. 
The neuropsychological tests employed have been heterogenous 
and have tapped into different domains, and few employed psychi- 
atric control groups. Findings to date suggest possible impairments 
in spatial planning, visuo-spatial learning and memory, sustained 
attention/working memory, organization, response inhibition, set 
shifting, probabilistic learning, and reversal [20, 45]. 

Future work in this area should include carefully characterized 
samples of individuals meeting strict criteria for HD. 


The diagnosis of HD is usually made on the basis of a direct interview 
with the person being evaluated to establish whether the diagnostic 
criteria are met. The Structured Interview for Hoarding Disorder 
(SIHD) [46] is available for this purpose. 

Because hoarding may not always be the initial reason for consult- 
ation [27], clinicians often need to ask direct questions such as ‘Do 
you find it difficult to discard or part with possessions?’ or ‘Do you 
have a large number of possessions that congest and clutter the main 
rooms in your home?’ An affirmative answer to these questions can 
initiate a dialogue that may lead to diagnosis. 

A home visit is recommended for the assessment of clutter, im- 
pairment, and associated risks. If a home visit is not feasible, the 
clinician should try to gather additional information from reliable 
informants such as a spouse or relative (with the patient’s consent). 
This is particularly important for affected persons with limited in- 
sight, because they may underestimate the extent and consequences 
of their difficulties. Informants may also help establish whether 
the current presentation is long-standing or transient, whether 
third parties have intervened to clear away some of the clutter, and 
whether there are potential risks that require attention. 

Photographs of the patients home can also be very useful in 
helping to document the presence of clinically significant clutter and 
to track treatment outcomes [47, 48], particularly when home visits 
are not possible or are impractical. However, photographs should 
not be a substitute for a thorough psychopathological interview. 

The diagnostic interview provides an opportunity to carry out a 
thorough risk assessment. Attention should be paid to potential fire 
hazards, the risk of a clutter avalanche, the presence of rodent or 
insect infestation, and unsanitary living conditions that pose a risk 
to health. In addition, it is important to establish whether other vul- 
nerable persons (for example, children and elderly people) live with 
the person who is hoarding. 

Numerous clinician and self-administered measures of hoarding 
severity exist and can be used to quantify the severity of various 
aspects of hoarding such as difficulties discarding, clutter, distress, 
etc. While useful, these measures do not allow a diagnosis of HD, 
primarily because it is not possible to rule out other disorders that 
may cause hoarding. Therefore, they should be restricted to quan- 
tify the severity of hoarding in already diagnosed individuals or to 
screen individuals in population-based studies. 

Some of the most widely used clinician administered measures 
include the Hoarding Rating Scale-Interview (HRS-I) [49] and the 
Clutter Image Rating (CIR) [47]. Among the most widely used 
self-administered measures are the Hoarding Rating Scale-Self 
Report (HRS-SR) [50], the Saving Inventory-Revised (SI-R) [51], 
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the UCLA Hoarding Severity Scale (UHSS) [52], and the Hoarding 
Disorder Dimensional Scale (HD-D) [53]. Typically, these meas- 
ures have empirically derived cut-offs, which give an indication of 
whether the person is likely to have clinically significant hoarding 
problems. Supplementary measures that assess saving cognitions 
(for example, Saving Cognitions Inventory) [54], problematic ac- 
quisition (for example, Compulsive Acquisition Scale) [55], self- 
reported squalor (for example, Home Environment Index) [56], 
and caregiver burden and accommodation (for example, Family 
Impact Scale for Hoarding Disorder) [57] are available and may 
prove useful for clarifying the clinical picture and monitoring 
treatment outcomes. 


Few treatment studies have specifically included individuals ful- 
filling the DSM-5 criteria for HD, and therefore, the evidence to 
guide treatment choice is currently weak. 


Psychological treatment 


Currently, the intervention that has the strongest evidence base for 
HD is a multicomponent psychological treatment that is based on 
a cognitive behavioural model [9]. The intervention includes office 
and in-home sessions; motivational interviewing methods to ad- 
dress ambivalence about therapy; education about hoarding; goal- 
setting; organizing, decision-making, and problem-solving skills 
training; exposure to sorting, discarding, and not acquiring; and 
cognitive strategies to facilitate this work [58]. 

Tolin and colleagues [59] conducted a meta-analysis of all open or 
randomized controlled trials (RCT) of CBT or behavioural therapy 
for hoarding in adults. The eligible studies used hoarding-specific 
outcome measures, and HD was the primary condition being 
studied, as opposed to hoarding measured in the context of another 
psychiatric disorder (for example, OCD). Ten relevant studies— 
comprising 12 samples—were identified, of which three were RCTs, 
including a total of 103 patients. Four samples used individual CBT, 
whereas the remaining eight used a group format. Three samples 
employed a peer-directed, bibliotherapy-based programme of CBT, 
with no professional therapist present. The number of CBT sessions 
ranged from 13 to 35 (mean = 20.2 sessions), and the number of 
visits conducted in patients’ homes ranged from 0 to 33 (mean = 5.6 
sessions). Overall HD symptom severity decreased significantly 
across studies with a large effect size (Hedges’ g = 0.82 for the overall 
HD severity score). Significant effects were evident on all outcome 
domains. The strongest effects were seen for difficulty discarding 
(Hedges’ g = 0.89). Clutter and acquiring showed effects in the mod- 
erate range (Hedges’ g = 0.70 and 0.72, respectively). Functional 
impairment (which was not used in the calculation of overall HD 
symptom severity) showed the smallest effect in the moderate range 
(Hedges’ g = 0.52) [59]. 

Thus, although encouraging, the clinical outcomes of CBT inter- 
ventions for hoarding are relatively modest (many individuals still 
require additional treatment), and there is still significant room for 
improvement. The long-term benefit of such interventions is un- 
clear, but data from a single trial suggest that the treatment gains of 
CBT for HD were largely maintained at 12-month follow-up [60]. 
Studies with even longer follow-up times are needed. 
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Pharmacological treatment 


Pharmacotherapy has also been suggested as a treatment for HD. 
However, data regarding the use of such treatment are limited and 
derived from small, uncontrolled studies. While no RCTs of medi- 
cation for HD have been conducted, some data are available from 
studies in OCD patients with hoarding symptoms. In patients diag- 
nosed with OCD, the presence of hoarding symptoms is associated 
with a worse response to serotonin reuptake inhibitors (SRIs), alone 
or in combination with CBT [41]. Specifically, OCD patients with 
hoarding symptoms are, on average, 50% less likely to respond to 
SRIs than non-hoarding OCD patients [41]. 

In an open-label study of paroxetine that included 79 OCD pa- 
tients (32 patients meeting criteria for ‘compulsive hoarding syn- 
drome’ [9] and 47 patients with non-hoarding OCD), hoarding 
symptoms improved as much as other OCD symptoms. However, 
paroxetine was not well tolerated (with adverse effects including 
sedation, fatigue, constipation, akathisia, headache, and sexual dys- 
function), and the overall response was moderate in both groups. 

To date, only three studies (two open-label trials and one case 
series) have included samples of individuals diagnosed with HD 
according to the DSM-5 criteria. An open study evaluated the ef- 
ficacy of a 12-week trial of venlafaxine and classed 16 of 24 parti- 
cipants as responders (defined as at least 30% reduction in UHSS 
and SI-R scores, as well as a rating of at least ‘much improved’ on 
the Clinical Global Impression-Improvement) [61]. Two other 
studies have used ADHD treatments to treat hoarding symptoms. 
The first of these studies was a case series of four patients treated 
with methylphenidate extended-release for 4 weeks. Of the four pa- 
tients, two had a modest reduction in hoarding symptoms (25% and 
32% decrease in the SI-R) [62]. More recently, another uncontrolled 
trial reported that nine of 12 participants with HD showed some de- 
gree of reduction in their hoarding symptoms in a 12-week trial of 
atomoxetine [63]. Eleven patients completed the trial, and of these, 
six were classified as full responders based on the UHSS (mean re- 
duction of 57%); three other patients were classified as partial re- 
sponders (mean reduction of 27%). 


Future directions 


Further controlled clinical trials are needed to assess the short- and 
longer-term efficacy of psychological and pharmacological treat- 
ments, alone and in combination, in carefully diagnosed individuals 
with primary HD. Additional behavioural interventions that have 
shown promising results and require further research include those 
aimed at improving cognitive functioning (for example, attention, 
organization) in individuals with HD [64, 65] and the development 
and evaluation of interventions for carers of individuals with HD 
[10, 66]. The development of interventions adapted for special popu- 
lations (for example, hoarding symptoms in individuals ASD or in- 
tellectual disabilities) is also warranted [67, 68]. 

While the current psychological treatments emphasize the im- 
portance of motivational interviewing techniques, the patients in- 
cluded in the trials described had sufficient insight to at least seek 
help. It is currently unclear how to best approach individuals with 
absent or delusional insight and who may only come to the attention 
of mental health services against their will. The clinical management 


of these cases may require an unprecedented level of inter-agency 
co-ordination. Several countries have set up local or federal multi- 
agency task forces (often including mental health, fire, pest control, 
housing, legal and social services) to tackle the most severe cases of 
hoarding who do not voluntarily seek or want help [69]. 
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“The dysmorphophobic patient is really miserable; in the middle of his daily routines, talks, while 
reading, during meals, everywhere and at any time, he is caught by the doubt of deformity...’ 
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Body dysmorphic disorder (BDD) (previously called ‘dysmor- 
phophobia’) is a common, often severe psychiatric disorder that has 
been described for more than a century [1-3]. BDD is character- 
ized by distressing or impairing preoccupations with non-existent 
or slight defects in physical appearance [4]. In addition, BDD con- 
sists of repetitive behaviours (that is, compulsions and rituals) that, 
at some point during the course of the illness, have been performed 
in response to the appearance concerns [4]. BDD is classified as an 
obsessive-compulsive and related disorder in DSM-5 [4]. In ICD-10, 
BDD is classified as a type of somatoform disorder, but recommenda- 
tions for ICD-11 are to classify BDD in a new category of obsessive- 
compulsive and related disorders [5]. Psychosocial functioning is 
often markedly impaired in BDD [6]. In addition, BDD is character- 
ized by high rates of suicidality and comorbid substance use [7, 8]. 
Despite its prevalence, severity, and negative effects on functioning, 
BDD is under-recognized in clinical settings [2, 3]. 
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BDD occurs in all age, racial, and ethnic groups [2, 3, 7, 9]. BDD ap- 
pears slightly more prevalent in women than men [2, 3, 7]. Most in- 
dividuals with BDD have never been in a primary relationship [10]. 
Occupational and academic functioning is often substantially im- 
paired, and unemployment rates are elevated, which often appears 
attributable to BDD symptoms [2, 3, 6, 7]. 


Bodily preoccupations 





People with BDD are preoccupied with the idea that some aspect of 
their appearance is ugly, unattractive, deformed, flawed, or defective 
in some way when it actually is not [1-3, 6-7]. Concerns usually 
focus on the face or head but can involve any body area [1-3, 7]. 
Concerns with the skin (for example, perceived acne, scars, lines, 


Enrico Morselli, 1891 [1] 


or pale skin), hair (for example, perceived thinning or excessive 
body or facial hair), and nose (for example, size or shape) are the 
most common. Concerns about perceived asymmetry of body areas 
occur in about a quarter of individuals (for example, concerns about 
perceived unevenness or asymmetry of the hair, breasts/chest, eyes, 
face, jawline, or lips) [11]. 

Most patients with BDD are preoccupied with multiple body 
areas; the number of body areas of concern may range from only 
one to virtually every aspect of one’s appearance. BDD concerns may 
also focus on one’s overall appearance. People with BDD often be- 
lieve that they are unacceptable and will be rejected by others be- 
cause they look abnormal [2, 3]. 

Muscle dysmorphia is an often more severe form of BDD and 
bodily preoccupation, characterized by distress over a perceived 
‘small and insufficiently muscular body build, which occurs pri- 
marily in men. Individuals with muscle dysmorphia appear to have 
even poorer quality of life and higher rates of suicidality and sub- 
stance use disorders—in particular, anabolic steroid use—than 
those with BDD who are preoccupied with other body areas [12]. 
Muscle dysmorphia may also be associated with particularly adverse 
psychological and medical sequelae, including an increased risk of 
violent behaviour, which may, in part, reflect abuse of anabolic- 
androgenic steroids [13]. 
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In usionality 


BDD-related insight is typically poor but can range from good to 
absent insight (delusional beliefs); the latter is characterized by com- 
plete conviction that the person looks abnormal, ugly, unattractive, 
or deformed. Studies have found that 36-60% of individuals with 
BDD currently have delusional beliefs about their appearance 
[14, 15]. Poor insight or delusional BDD beliefs may limit patients 
recognition that their appearance concerns are due to a mental 
illness and make them unwilling to accept mental health treatment 
[2, 3, 16]. 

In addition, a majority have ideas or delusions of reference, be- 
lieving that others take special notice of the supposed appearance 
defects—for example, staring at them or mocking them because of 
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how they look [16]. Referential thinking can fuel feelings of anger 
and rejection, as well as social isolation. These findings are con- 
sistent with evidence that individuals with BDD have an inter- 
pretive bias for threat. Compared with other groups, those with 
BDD more often misinterpret neutral facial expressions as con- 
temptuous and angry, and they similarly more often misinterpret 
ambiguous scenarios (appearance-related, social, and general 
scenarios) as threatening [17, 18]. 


Compulsive and safety behaviours 


Compulsive behaviours, also referred to as repetitive behaviours 
or rituals, occur in response to preoccupations with perceived 
physical defects. Like the compulsive behaviours that occur in 
obsessive-compulsive disorder (OCD), they are usually distressing, 
time-consuming (occurring over an average of 3-8 hours a day), 
and difficult to resist or control [2, 3]. The behaviours usually aim to 
examine, improve, hide, or obtain reassurance about the perceived 
defects. These behaviours typically do not alleviate BDD-triggered 
distress and may even worsen it. 

Common compulsive behaviours and their occurrence in those 
with BDD are described in the American Psychiatric Association’s 
DSM-5 [4]. Individuals with BDD often compare their appearance 
to others, including people in newspapers or magazines or on televi- 
sion or social media. Other compulsive behaviours include repeated 
checking of mirrors and other reflective surfaces (for example, win- 
dows), excessive grooming (for example, hair styling, make-up appli- 
cation), reassurance seeking, and excessive exercise or weightlifting. 

Several compulsive behaviours have potential medical risks. 
Compulsive skin picking, which 27-45% of BDD patients do to try 
to improve the appearance of their skin, can cause considerable skin 
damage [2, 3]. Emergency surgery is sometimes required—for ex- 
ample, when sharp implements are used for picking ruptured major 
blood vessels. Compulsive tanning to darken ‘pale’ skin or minimize 
perceived acne, scarring, or ‘marks’ can cause skin damage and in- 
creases cancer risk [2, 3]. 

Some compulsive behaviours, such as camouflaging disliked body 
areas (for example, hiding perceived balding with a hat or ‘asym- 
metrical’ eyes with sunglasses), may also be conceptualized as safety 
behaviours, as they are motivated by the desire to prevent a feared 
outcome. However, these behaviours may also be repetitive (for ex- 
ample, repeatedly reapplying and fixing make-up), and thus they 
may also sometimes be compulsive in nature. 


Psychosocial functioning and quality of life 


BDD is associated with very poor quality of life and marked func- 
tional impairment [2, 3, 6]. Mental health-related quality of life 
scores in BDD are poorer than for depression, OCD, schizophrenia, 
and bipolar disorder [6]. Most individuals with BDD also have 
marked impairment in activities of daily living; many have been 
housebound due to their BDD symptoms [2, 3, 18]. In addition, 
BDD is associated with high lifetime rates of psychiatric hospital- 
ization [19, 20]. Prospective research on the course of BDD dem- 
onstrates that functional impairment tends to be chronic over time 
[21], although in treatment studies, functioning usually improves 
when BDD symptoms do. 

Social functioning is notably poor in BDD. Individuals with BDD 
often avoid other people because they are afraid of being rejected 
because of their appearance [2, 3, 10]. As a result, many people 


with BDD have difficulty forming and maintaining relationships 
with others. BDD is also associated with high rates of unemploy- 
ment and often marked impairment in occupational and academic 
functioning [18]. In a broadly obtained sample of convenience 
of individuals with BDD, 38% were currently unemployed [18]. 
Unemployment due to BDD is associated with greater current se- 
verity of BDD and depressive symptoms, a more chronic course 
of BDD, and poorer current social functioning and quality of life 
[18]. BDD may not only negatively affect a person's ability to hold a 
job, but it also often impairs one’s concentration, work quality, and 
work productivity [18]. High rates of academic impairment are also 
common in BDD. In one study, 18% had dropped out of elementary 
school or high school primarily because of BDD symptoms, and in 
another study, 22% of youth dropped out of school because of their 
BDD symptoms [22, 23]. 

In population-based epidemiologic studies, compared to individ- 
uals without BDD, those with BDD were more likely to be divorced 
and less likely to be married; they also had a lower educational 
level, lower household income, and greater unemployment and sick 
leave [24]. 


Other associated features 


BDD preoccupations may trigger a broad range of negative emo- 
tions. BDD is associated with an increased frequency of depression, 
anxiety, and social anxiety, as well as high levels of these symp- 
toms: feelings of shame and low self-esteem; hostility and anger; 
rejection sensitivity; and high levels of neuroticism, introversion, 
perfectionism, and perceived stress [11]. Pursuit of cosmetic treat- 
ment, such as surgery and dermatologic and dental treatment, 
is a common feature of BDD that often leads to poor treatment 
outcomes. 

Common core beliefs, which are addressed in cognitive behav- 
ioural therapy (CBT), include: ‘I am inadequate; ‘I am defective, ‘I 
am worthless, and ‘I am unlovable: Negative core beliefs such as 
these are compatible with the high rates of depression and suicidality 
that occur in BDD. 

Suicidality 

Suicidal ideation and attempts are very common in BDD. Reported 
lifetime rates of suicidal ideation and suicide attempts in clinical 
samples and broadly ascertained samples of convenience are 78- 
81% and 24-28%, respectively [8, 19]. In a population-based epi- 
demiologic study, 20% of participants with BDD reported suicidal 
thoughts specifically because of appearance concerns, and 7% had 
attempted suicide because of appearance-related concerns, whereas 
only 1% of the non-BDD group had attempted suicide. In another 
epidemiologic study, 31.0% in the BDD group vs 3.5% in the non- 
BDD group had suicidal ideation due to appearance concerns, and 
22.2% of those with BDD vs only 2.1% of those without BDD had 
attempted suicide specifically because of appearance concerns [24]. 

The rate of completed suicide, while preliminary, appears mark- 
edly high. In a prospective study, the annual suicide rate was 0.35%, 
which is approximately 45 times higher than for the population in 
the United States (adjusted for age, gender, and geographic region) 
and higher than for most other psychiatric disorders [25]. 

Preliminary evidence suggests that adolescents and Veterans with 
BDD, as well as those with muscle dysmorphia, appear to have sig- 
nificantly higher rates of suicidality than other BDD groups [25]. 


Variables that are independently associated with suicidal ideation 
and/or suicide attempts in BDD are greater BDD symptom severity; 
comorbid major depressive disorder, PTSD, a substance use dis- 
order, social anxiety disorder, OCD, and three or more comorbid 
Axis I disorders; onset of BDD before the age of 18; being un- 
employed; childhood maltreatment; and BDD-related restrictive 
food intake [25]. 

BDD is associated with many risk factors that have been shown 
more generally to predict completed suicide. These risk factors in- 
clude high rates of suicidal ideation and attempts, depression and 
substance use disorders, history of perceived abuse, functional 
impairment and social isolation, unemployment, disability, being 
single or divorced, and psychiatric hospitalization, as well as high 
levels of anxiety, depression, poor self-esteem, anger, and feelings 
of humiliation and shame [25]. Whether these risk factors predict 
completed suicide in BDD is unknown and warrants investigation. 


Comorbidity 


Lifetime major depressive disorder is the most frequent comorbid 
disorder, occurring in about 75% of individuals with BDD [26]. The 
other most common co-occurring Axis I psychiatric disorders are 
social anxiety disorder, OCD, and substance use disorders [26]. Axis 
I comorbidity is associated with greater functional impairment and 
morbidity. In particular, the presence of comorbid major depressive 
disorder is associated with poorer quality of life and more functional 
impairment [26]. Comorbid personality disorders occur in 40-100% 
of individuals with BDD, with Cluster C disorders—in particular, 
avoidant personality disorder—occurring most frequently [26]. 


Gender 


BDD is slightly more common in women than in men in both clin- 
ical and non-clinical samples, with the exception of cosmetic and 
dermatological settings [27]. Men and women appear to have largely 
similar clinical features [7, 19, 28, 29]. However, there are some not- 
able gender differences, some of which reflect gender differences 
in the general population. Men are more likely to be single, have a 
substance use disorder, and be preoccupied with thinning hair and 
small body build (muscle dysmorphia) [19, 29]. Women are gen- 
erally more likely to be preoccupied with their weight, hips, and 
excessive body hair and are more likely to pick their skin and use 
their hands or make-up for camouflage [19, 29]. One study (n = 58) 
[28] found that women are more likely to be preoccupied with their 
breasts/chest and legs, check mirrors, and use camouflaging to hide 
disliked body areas. In addition, concerns about genitals are more 
common in men, and a comorbid eating disorder is more common 
in women [19, 28, 29]. 


BDD in children and adolescents 


BDD usually begins during early adolescence and can occur in child- 
hood [30]. While data are limited, BDD’s clinical features in youth ap- 
pear largely similar to those in adults [30]. Youth with BDD typically 
have very poor psychosocial functioning and mental health-related 
quality of life. BDD often causes academic underachievement, social 
avoidance, and other types of psychosocial impairment; it may also 
lead to school refusal and school dropout, which can derail healthy 
adolescent development [30]. Suicidal ideation and attempts, phys- 
ical aggression, and substance use disorders are risk behaviours that 
commonly occur in youth with BDD [30]. 
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Preliminary findings suggest that BDD may be more severe in 
youth than in adults. Of note, children and adolescents appear to 
have lifetime rates of functional impairment similar to those in 
adults, despite having had fewer years over which to have developed 
these problems [30]. In addition, adolescents appear more likely 
than adults to have delusional BDD beliefs (as opposed to better in- 
sight), and they have a significantly higher lifetime suicide attempt 
rate (44% vs 24%, respectively). These findings underscore the im- 
portance of recognizing and treating BDD in this age group [30]. 


Cross-cultural aspects of BDD 


Case reports and case series from around the world suggest that 
BDD’s clinical features are generally similar across cultures, but 
that cultural factors may produce nuances and accents on a basic- 
ally invariant, or universal, expression of BDD [2, 3]. In east Asian 
cultures, a form of social anxiety disorder—taijin kyofusho (shubo- 
kyofu—‘the phobia of a deformed body’) [31]—has many features of 
BDD. Thus, in Asian cultures, BDD is considered more closely akin 
to social anxiety disorder; in Western countries, BDD is considered 
closely related to OCD due to prominent obsessive and compulsive 
features. Much more research has been done on BDD’s relationship 
to OCD than to social anxiety disorder; it is likely that BDD is closely 
related to both of these disorders. 


BDD can be diagnosed using questions at the top of Box 100.1, 
which are consistent with DSM-5’s diagnostic criteria. Clinicians 
should adequately probe for examples of clinically significant dis- 
tress and impairment in social, occupational, academic, and other 
aspects of functioning. BDD is diagnosed if the person is exces- 
sively preoccupied with one or more non-existent or slight physical 
flaws (for example, thinking about it for at least an hour a day) and 
the preoccupation causes clinically significant distress or clinically 
significant impairment in psychosocial functioning. In addition, 
at some point during the course of the illness, the appearance pre- 
occupations must have caused repetitive behaviours such as mirror 
checking, excessive grooming, skin picking, reassurance seeking, 
or comparing one’s appearance with others [4]. Finally, the appear- 
ance concerns should not be better explained by an eating disorder. 
However, BDD and eating disorders may co-occur, in which case 
both disorders should be diagnosed. 

In addition, DSM-5 contains two new specifiers for BDD: muscle 
dysmorphia and level of BDD-related insight (good or fair, poor, or 
absent insight/delusional beliefs) [4]. It should be emphasized that 
delusional BDD beliefs should be diagnosed as BDD, not as a psych- 
otic disorder, as this has significant implications for pharmaco- 
logic treatment. BDD-triggered panic attacks may be designated by 
DSM-5’s panic attack specifier, which can be used for any disorder 
that triggers panic attacks; in BDD, common triggers are feeling 
scrutinized by others, looking at perceived defects in mirrors, and 
being under bright lights [32]. 

The bottom of Box 100.1 includes questions that are not recom- 
mended to screen for, or diagnose, BDD. The word ‘imagined’ is prob- 
lematic, because most patients have poor or absent insight and do not 
think their appearance problem is imagined. Terms such as ‘deformed’ 
or ‘disfigured’ are too extreme for some patients to endorse. Asking if 
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there is something wrong with one’s body is too broad a question, as 
patients may interpret this as referring to bodily functioning. 

ICD-10’s description requires that patients persistently refuse to 
accept the advice and reassurance of doctors or healers that they do 
not have an abnormality. However, many people with BDD do not 
disclose their appearance concerns to doctors or even seek medical 
care because they are housebound, ashamed of their appearance 
concerns, believe they cannot be helped, lack health insurance, or 
do not have access to health care for other reasons. Using these cri- 
teria will underdiagnose BDD. 

BDD usually goes undiagnosed in clinical settings [2, 3]. Sufferers 
often conceal their symptoms due to embarrassment and shame 
[2, 3]. They may volunteer only depression, anxiety, or discomfort in 
social situations. The compulsive and safety behaviours may be clues 
to BDD’s presence. 

BDD may also be misdiagnosed as another disorder [2, 3]. BDD 
may be mistaken as social anxiety disorder because of social anxiety 


and isolation related to appearance concerns and anxiety over being 
rejected by others due to the perceived appearance defects. BDD 
may also be misdiagnosed as OCD due to obsessional preoccu- 
pations and compulsive behaviours. Individuals with delusional 
BDD are sometimes misdiagnosed with schizophrenia, psychotic 
depression, or another psychotic disorder. People with BDD may 
be misdiagnosed with panic disorder because of the occurrence 
of panic attacks that are triggered by appearance concerns, which 
should instead be diagnosed as BDD with the panic attack speci- 
fier. BDD symptoms may also be misdiagnosed as trichotillomania 
(hair pulling disorder) when hair is repeatedly cut or plucked to 
improve perceived flaws in its appearance (for example, ‘bushy’ or 
uneven eyebrows). BDD may be misdiagnosed as excoriation (skin 
picking) disorder when skin is picked in an attempt to diminish 
perceived skin flaws. To diagnose BDD, patients must usually be 
asked directly about BDD symptoms, using questions such as those 
in Box 100.1. 


Box 100.1 Questions to diagnose BDD according to DSM-5 diagnostic criteria 





Questions to evaluate diagnostic criteria 


Criterion A: preoccupation with non-existent or slight appearance defects 

or flaws 

1 ‘Are you very worried about your appearance in any way?’ OR ‘Are you 
unhappy with how you look?’ 

2 Invite the patient to describe their concern by asking, ‘What don't you 
like about how you look?’ OR ‘Can you tell me about your concern?’ 
Listen to the patient's description; do not provide reassurance about, or 
comment on, their appearance. 

3 Ask if there are other disliked body areas—for example, ‘Are you un- 
happy with any other aspects of your appearance, such as your face, 
skin, hair, nose, or the shape or size of any other body area?’ 

4 Ascertain that the patient is preoccupied with these perceived flaws by 
asking, ‘How much time would you estimate that you spend each day 
thinking about your appearance, if you add up all the time you spend?’ 
OR ‘Do these concerns preoccupy you?’ 


Criterion B: repetitive behaviours in response to the appearance 
concerns 


5 Ask ‘Is there anything that you do over and over again in response to 
your appearance concerns?’ ... ‘For example, do you compare your- 
self with others, check your appearance in mirrors or other reflecting 
surfaces, ask other people if you look okay or what they think of your 
appearance, touch the disliked areas, or pick at your skin?’ ... ‘Do you do 
anything else to try to check, fix, hide, or be reassured about your (fill in 
disliked body areas)?’ 


Criterion C: clinically significant distress or impairment in functioning re- 

sulting from appearance concerns 

6 Ask ‘How much distress do these concerns cause you?’ Ask specifically 
about resulting anxiety, social anxiety, depression, panic, shame, hope- 
lessness, guilt, and suicidal thinking. 

7 Ask about effects of the appearance preoccupations on the patient's 
life—for example, ‘Do these concerns interfere with your life or cause 
problems for you in any way?’ Ask specifically about effects on work, 
school, other aspects of role functioning (for example, caring for chil- 
dren), relationships, intimacy, family and social activities, household 
tasks, and other types of interference. Examples of interference include: 





e Decreased focus and concentration. 
e Being late for, or missing, school or work. 





e Interruption of school, work, or household routines by BDD rituals 
for example, leaving class to check the bathroom mirror or reapply 
make-up). 
Dropping or failing grades. 

Dropping out of school. 

Quitting a job or being fired; being unemployed. 
ot dating. 
arital conflict or divorce. 

Sexual difficulties. 

ot seeing friends as often or at all. 

issing family events. 

Turning down or avoiding social gatherings. 

Difficulty caring for children or managing a household, going shop- 
ping, or doing chores. 

e Avoiding activities like going to the gym. 

e Using drugs or alcohol to cope with BDD. 




















Criterion D: the appearance preoccupation is not better explained by 
concerns with body fat or weight if these symptoms meet diagnostic 
criteria for an eating disorder. 

Individuals who have excessive and problematic concerns with the belief 
that they weigh too much or that their overall body or parts of their body 
are too fat should be evaluated for the presence of an eating disorder (an- 
orexia nervosa, bulimia nervosa, and binge eating disorder). If one of these 
disorders explains these body image concerns, then the concerns do not 
count towards a diagnosis of BDD. 


Questions to evaluate specifiers 


With muscle dysmorphia 
Ask ‘Are you preoccupied with the idea that your body build is too small or 
that you're not muscular enough?’ 


Insight 

Elicit a global belief about the perceived defect(s) (rather than asking about 
specific body areas): ‘What word would you use to describe how bad all 
of these areas (fill in all disliked areas) look?’ If the patient has difficulty 
choosing a word (often because of embarrassment), ask ‘Some people use 
words like unattractive, ugly, deformed, hideous’; do you think any of these 
apply to you?’ Then ask ‘How convinced are you that these body areas look 
(fill in the patients global descriptor)?’ 


Source: data from Diagnostic and Statistical Manual of Mental Disorders, Fifth 
Edition, DSM-5, Copyright (2013), American Psychiatric Association. 








Epidemiology 


In large population-based epidemiologic samples (n = 2129 to 
2891), the point prevalence of BDD ranges from 1.7% to 2.9% [24]. 
In military and Veteran samples, BDD’s point prevalence has been 
much higher, ranging from 11% to 15% [33, 34]. BDD also appears 
fairly common in cosmetic treatment settings. A prevalence of 3- 
15% has been reported in cosmetic dermatology outpatient settings 
(weighted prevalence of 9%) [24], 11% in acne dermatology clinics, 
and 4-29% in general outpatient dermatology settings (weighted 
prevalence of 11%) [27]. In general cosmetic surgery settings, the 
prevalence of BDD has been found to be between 6% and 53%, with 
a weighted prevalence of 13% [24]; in orthognathic surgery settings, 
the weighted prevalence is 11%. The prevalence is higher than this in 
rhinoplasty samples (weighted prevalence of 20%), consistent with 
findings that rhinoplasty is the most common surgery received by 
people with BDD [24]. 

In samples from mental health settings, the prevalence of BDD 
has been reported to be 8-37% in patients with OCD, 11-13% in 
social phobia, 26% in trichotillomania, and 14-42% in atypical 
major depression [24]. In psychiatric inpatient samples, the point 
prevalence of BDD ranges from 6% to 13% [24]. 


Pathogenesis 


BDD’s pathogenesis is likely complex and multifactorial, involving 
both ‘proximal’ factors, such as aberrant visual processing, as well 
as ‘distal’ factors, such as genetic predisposition, and perhaps evo- 
lutionary influences—for example, physical appearance judge- 
ments, such as those involving symmetry, are relevant to efforts to 
attract and secure reproductively healthy mates [35]. BDD likely 
involves a complex interplay of genetic and environmental risk 
factors [36]. 

BDD appears to be familial [36]. It also appears to have high 
familiality with OCD. For example, BDD is more common in 
first-degree relatives of individuals with OCD than in first-degree 
relatives of probands without OCD [36]. A family history study de- 
termined that 7% of BDD patients had a first-degree relative with 
OCD, which is elevated, compared to OCD’s prevalence in the gen- 
eral population (about 1-2%) [36]. 

Twin studies, which can determine the relative contribution of 
genetic vs environmental factors, indicated that approximately 
44% of the variance of ‘dysmorphic concerr (a construct similar to 
BDD) is due to genetic factors [37]. BDD appears to have shared 
genetic liability with OCD and other obsessive-compulsive and re- 
lated disorders, as well as unique disorder-specific genetic factors. 
Preliminary data indicate an association of the GABA ,-y2 gene with 
BDD [38, 40]. 

Environmental influences for BDD and other obsessive- 
compulsive and related disorders appear largely disorder-specific 
[37]. Little is known about specific environmental risk factors in 
BDD, but they may include perceived childhood neglect and/or 
abuse, teasing, and low parental warmth [2, 3]. A role is also likely 
for sociocultural pressures [2, 3]. 

Research on BDD’s neurocircuitry is still limited, but increasing, 
and indicates that there is likely a complex interplay of dysfunction in 
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several neural systems [36]. Structural and functional neuroimaging 
studies demonstrate abnormalities within regions of the brain in- 
volved in visual processing, the limbic system (that is, the amyg- 
dala), and the orbitofrontal cortex, in addition to broadly observed 
changes in the white matter [36]. Findings of hyperactivity in the 
orbitofrontal cortex and the caudate are similar to findings in 
OCD [36]. 

Neuropsychological and neuroimaging studies indicated a ten- 
dency to focus on isolated details of visual and verbal stimuli, ra- 
ther than more global, configurational attributes [39]—consistent 
with clinical observations that patients selectively attend to specific 
aspects of their appearance or minor flaws. Cognitive processing 
studies indicate that BDD patients tend to misinterpret ambiguous 
social (and other) situations as threatening and misinterpret self- 
referent facial expressions as contemptuous and angry [16, 17]. It 
can be theorized that these interpretive biases may combine with 
rejection sensitivity, perfectionism, and a focus on aesthetics to con- 
tribute to BDD’s development [2, 3]. High neuroticism and low ex- 
troversion may also play a role [2, 3]. Many potential risk factors 
(for example, neuroticism) are not specific to BDD, but the overall 
combination of risk factors may be. 


Course and prognosis 


Prospective and retrospective studies of individuals ascertained for 
BDD indicated that BDD is usually chronic [19, 40]. Greater se- 
verity of BDD symptoms predicts a lower likelihood of remission 
from BDD and a greater likelihood of relapse following remission. 
A longer duration of BDD also predicts a lower probability of remis- 
sion [40]. However, when BDD is accurately identified and its treat- 
ment optimized, the prognosis is much more favourable. 


BDD’s treatment is described in more detail elsewhere, including 
in a guideline from the UK’s National Institute of Health and Care 
Excellence (NICE) [41-43]. Serotonin reuptake inhibitors (SRIs, or 
SSRIs) and CBT that is specifically tailored to BDD are currently 
recommended as first-line treatments [41-43]. Cosmetic treat- 
ment appears ineffective for BDD [43]. Treatment studies are still 
limited, and more research and treatment development are needed. 
However, available data consistently indicate that a majority of pa- 
tients improve with these treatments [41-43]. 


Essential groundwork for treatment 


It can be challenging to engage and retain patients with BDD in psy- 
chiatric treatment. Most have poor or absent insight, believing that 
they probably or certainly look abnormal. For this reason, many 
prefer cosmetic treatment. Depressive symptoms may cause hope- 
lessness and decrease motivation for treatment. A number of ap- 
proaches can help clinicians engage and retain patients in treatment, 
which are discussed in more detail elsewhere [44]: 


1. First try to engage the patient and establish an alliance, so they 
are willing to try treatment. This can be difficult to accomplish, 
as many patients are delusional, prefer cosmetic treatment, are 
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rejection-sensitive, and do not want other people (including a 
clinician) to see them. 

2. Empathize with the patient's suffering. 

3. Take the patient's appearance concerns seriously, neither 
dismissing their concerns about how they look nor agreeing that 
there is something wrong with their appearance. Avoid debating 
or arguing with the patient about how they look. Trying to con- 
vince patients (especially delusional patients) that their beliefs 
are irrational or that they look normal is usually not helpful. It 
can be helpful to say that people with BDD see themselves differ- 
ently than others see them and to note the findings from visual 
processing studies. 

4. Instead, focus on the potential for psychiatric treatment to di- 
minish their distress and preoccupation and improve their func- 
tioning and quality of life. 

5. Provide psychoeducation about BDD and recommend reading. 

6. For patients who wish to pursue cosmetic treatments, explain 
that such treatment appears ineffective for BDD. 

7. Provide education about recommended treatments. It can be 
helpful to explain, for example, that SRIs are usually well tol- 
erated, are not habit-forming, appear to normalize the brain 
(and do not cause brain damage), and often diminish suicidal 
thinking in people with BDD. CBT is a practical and doable 
‘here-and-now’ treatment, in which patients actively collaborate 
with the therapist and learn helpful skills by attending sessions 
and doing homework. When patients fully adhere to, and par- 
ticipate in, these treatments, a majority will improve. 


Treatment should be initiated with an SRI and/or CBT. SRIs are 
also the first-line medication for delusional BDD. All severely ill 
patients, especially those who are highly suicidal, should, in the 
authors’ opinion, receive an SRI. Patients with severe comorbid de- 
pression also warrant SRI treatment. Other comorbidity may war- 
rant additional medication. 


Surgical, dermatologic, and other cosmetic treatment 


A majority of patients with BDD seek and receive often costly cos- 
metic treatment [3, 8, 9, 12, 26]. Some patients, including those who 
are turned down by physicians or cannot afford treatment, perform 
their own surgery [2, 3, 6]—for example, cutting open their nose 
with a razor blade and trying to replace their nose cartilage with 
chicken cartilage in the desired shape. Dermatologists and surgeons 
are most often consulted, but cosmetic treatment may be requested 
of virtually any type of physician. It appears that most BDD patients 
are dissatisfied with such treatment [2, 3, 26] and that cosmetic 
treatment is not effective for BDD. Occasional dissatisfied patients 
sue, or are violent towards, the physician. Thus, cosmetic treatment 
is not recommended for BDD. 


Pharmacotherapy and other somatic treatments 


Serotonin reuptake inhibitors: first-line 
pharmacotherapy for BDD 


All SRI studies to date indicate that SRIs are often efficacious for BDD 
[2, 3, 41] (Table 100.1). These studies include a placebo-controlled 
fluoxetine study (n = 67), a controlled and blinded crossover study 
comparing the SRI clomipramine to the non-SRI desipramine 
(n = 29), and open-label trials of fluvoxamine, citalopram, and 


escitalopram (n = 15-30). In these studies, 53-73% of patients re- 
sponded to the SRI in intention-to-treat analyses (response rates 
were higher in completer analyses). The crossover trial found greater 
efficacy for clomipramine than desipramine, suggesting that SRIs 
are more efficacious than non-SRIs for BDD. This important finding 
is consistent with clinical series and retrospective data suggesting 
that SRIs are more efficacious than a broad range of non-SRI medi- 
cations for BDD [2, 3, 41]. 

A more recent relapse prevention study treated 100 participants 
for 14 weeks with the SRI escitalopram (mean dose 26.2 + 7.2 mg/ 
day), with an intention-to-treat response rate of 67% and a com- 
pleter response rate of 81% (Table 100.1) [45]. Escitalopram re- 
sponders were randomized to continuing to take escitalopram or 
were switched to placebo treatment for another 6 months. Time to 
relapse was significantly longer among patients who continued to 
receive escitalopram. The proportion who relapsed while continuing 
escitalopram was 18%, whereas 40% relapsed after being switched 
to placebo. Thus, continuation of escitalopram protected against 
relapse. 

No methodologically rigorous prospective study has compared 
different SRIs to one another, although they are likely all about 
equally efficacious. The second author often first uses escitalopram 
or fluoxetine; these are the best studied SRIs for BDD and are usu- 
ally well tolerated, and escitalopram has the advantage of having 
few drug—drug interactions. Clomipramine is often reserved for use 
when selective SRIs (SSRIs) have not been helpful, because SSRIs are 
usually better tolerated. Citalopram is the least desirable choice, be- 
cause the FDA dosing limit is lower than the dose typically needed 
to effectively treat BDD [41]. 

High SRI doses (higher than typically used for depression) ap- 
pear to often be needed to effectively treat BDD [2, 3, 41]. Doses 
may sometimes need to exceed the manufacturer’s maximum dose, 
although this is not recommended for clomipramine or citalopram. 
Response to an SRI usually develops gradually and may require up 
to 12-16 weeks of treatment (while reaching a relatively high dose) 
to be evident [2, 3, 41]. 

Response to medication usually includes a decrease in appear- 
ance preoccupations, distress, and compulsive/safety behaviours, 
as well as improved functioning. Suicidality, depressive symp- 
toms, anxiety, and anger-hostility often improve [2, 3]. Of note, 
delusional patients often improve with SRI monotherapy, whereas 
limited data suggest that dopamine antagonist monotherapy is 
usually ineffective for delusional BDD [2, 3, 41]. Thus, SRIs are 
recommended for patients with delusional (absent insight) BDD 
beliefs. 

If an optimal 12- to 16-week SRI trial—during which a high SRI 
dose is reached if needed and tolerated—improves BDD symp- 
toms by at least 30%, it is recommended that the SRI be continued, 
because BDD symptoms may further improve with additional 
time [41]. 


Approaches if an SRI is not adequately effective 


Before concluding that an SRI is ineffective, it should be tried for 
12-16 weeks, reaching the highest dose recommended by the manu- 
facturer or tolerated by the patient (if necessary) for at least 3-4 of 
those 12-16 weeks. Because medication adherence is often poor, 
the clinician should determine whether medication adherence has 
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Table 100.1 Prospective controlled and open-label medication studies in adults with body dysmorphic disorder’ 


Medication Study design Sample size Trial duration and | Response of BDD symptoms Reference 


endpoint mean 
dose (mg/day) 





Serotonin reuptake inhibitor (SRI, SSRI) monotherapy 



























































Fluoxetine (Prozac’) vs Randomized, double- 74 enrolled; 67 12 weeks: Fluoxetine significantly more effective Phillips KA, et al., 
placebo blind, placebo-controlled, randomized 771.7 +80 than placebo; response rate of 53% vs 2002 [60] 
parallel group trial 18% on BDD-YBOCS; effect size d = 0.70 
Clomipramine Randomized, double- 40 enrolled; 29 16 weeks (8 Clomipramine significantly more Hollander E, 
(Anafranil’) vs blind, controlled crossover randomized weeks on each effective than desipramine; response etal., 1999 [61] 
desipramine trial medication) rate of 65% vs 35% on BDD-YBOCS 
CMI: 138 + 87 
DMI: 147 + 80 
Escitalopram (Lexapro) Open-label trial with 100 in open-label 14-week open-label Open-label trial: BDD symptoms Phillips KA, et al., 
vs placebo escitalopram; responders phase; 58 in double- phase: 26.2 + 7.2 significantly improved; 67% of 2016 [45] 
to open-label treatment blind phase 6-month subjects (intention-to-treat) and 
were randomized to randomized 81% of completers responded on 
double-blind continuation phase: 28.7 + 4.6 BDD-YBOCS 
treatment with Time to relapse significantly longer with 
escitalopram vs placebo escitalopram than placebo 
for 6 months Relapse proportions: 18% for 
escitalopram vs 40% for placebo 
In continuation phase, BDD significantly 
improved in escitalopram-treated 
subjects; 36% of subjects further 
improved 
Fluvoxamine (Luvox’) Open-label trial 1S O weeks: 208.3 + BDD symptoms significantly improved; Perugi G, et al., 
63.4 ten subjects responded on CGI 1996 [62] 
Fluvoxamine (Luvox") Open-label tria 30 6 weeks: 238.3 + BDD symptoms significantly improved; Phillips KA, etal., 
85.8 63% of subjects responded on 1998 
BDD-YBOCS 
Citalopram (Celexa’) Open-label trial 15 12 weeks: 51.3 + BDD symptoms significantly improved; Phillips KA & 
6.9 73% of subjects responded on Najjar F, 2003 
BDD-YBOCS [63 
Escitalopram (Lexapro’) Open-label tria 15 12 weeks: 28.0+ 6.5 BDD symptoms significantly improved; Phillips KA, 2006 
73% of subjects responded on [64 
BDD-YBOCS 
Serotonin-noradrenaline reuptake inhibitor (SNRI) monotherapy 
Venlafaxine (Effexor’) Open-label trial W 12-16 weeks: 163.6 BDD symptoms significantly improved; Allen A, et al., 
+303 45% responded on BDD-YBOCS 2008 [65] 
Anti-epileptics 
Levetiracetam (Keppra’) Open-label trial 17 (monotherapy 12 weeks: 2044 + BDD symptoms significantly improved; Phillips KA & 
in 12 cases; SRI 1065 53% responded on BDD-YBOCS Menard W, 2009 
augmentation in five [66] 
cases) 
SRI augmentation studies 
Pimozide (Orap’) vs Randomized, double- 28 8 weeks: 2 mg Pimozide was not more efficacious than Phillips KA, 2005 
placebo blind, placebo-controlled, placebo; 18% of subjects responded to [67] 
augmentation parallel-group trial pimozide and 18% to placebo 


of fluoxetine (Prozac’) 


“Case reports, case series, cross-sectional, and retrospective studies are not included in the table but are described in the text. 

” Results are reported for an intention-to-treat analysis, except for the clomipramine/desipramine trial, which used a minimum treatment analysis, and the venlafaxine study, which 
used a completer analysis for the primary outcome. Intention-to-treat analyses are more conservative because they include study dropouts; response rates for those who completed 
the full trial (intention-to-treat sample) are higher than reported in this table. 


Most studies defined response as 30% or greater decrease in total score on the Yale-Brown Obsessive Compulsive Scale (YBOCS). 
Modified for body dysmorphic disorder (BDD-YBOCS). Other symptoms, such as depression, usually improved, as did psychosocial functioning and quality of life. 





been adequate; improving adherence may convert medication non- 40 mg/day, and clomipramine 250 mg/day (with blood levels). It is 
response to response. recommended that an ECG be checked at a higher escitalopram 

As a next step, the second author often raises the SRI dose even dose and when using clomipramine. These doses are consistent with 
further until the following maximum doses are reached if toler- those in the American Psychiatric Practice Guideline for OCD [46]. 
ated: escitalopram 60 mg/day, fluoxetine 120 mg/day, sertraline 400 If this approach does not produce the desired response, an- 
mg/day, fluvoxamine 450 mg/day, paroxetine 100 mg/day, citalopram other SRI can be tried, as this may lead to a response [2, 3, 41]. 
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Regarding SRI augmentation, a small double-blind randomized 
controlled trial found that the antipsychotic pimozide was not 
more efficacious than placebo as a fluoxetine augmenter [2, 3, 41]. 
However, clinical observations suggest that other dopaminergic 
drugs, such as aripiprazole, may be helpful as SRI augmenters 
and may be especially appealing for patients who are highly delu- 
sional, agitated, anxious, and/or aggressive. Clinical series suggest 
that augmentation of an SSRI with buspirone or clomipramine 
may be helpful [2, 3, 41]. (SSRIs may increase clomipramine blood 
levels, however, which may cause toxicity; thus, if this approach 
is tried, clomipramine should be started at a very low dose, with 
monitoring of levels.) Clinical observations suggest that SSRI 
augmentation with venlafaxine or bupropion may be helpful for 
some patients [2, 3, 41]. SRIs can also be augmented with CBT. No 
studies have evaluated the relative efficacy of these approaches, 
although clinical experience indicates that all of them may be 
effective. 


Monotherapy with non-SRI medications 


Monotherapy with agents other than SRIs has not been well 
studied [2, 3, 41]. A small open-label trial (n = 11) suggested 
that the serotonin-noradrenaline reuptake inhibitor (SNRI) 
venlafaxine may be efficacious [2, 3]. Another small open- 
label trial (n = 17) suggested efficacy for the anti-epileptic 
levetiracetam [41]. For severe and treatment-refractory cases, 
an MAO inhibitor may be worth trying (but should never be 
combined with an SRI). 

It appears that long-term treatment with effective medication 
may be needed; efficacy appears to usually be sustained over time 
[2, 3, 41]. For patients who appear at high risk of suicide or violence, 
lifelong treatment with an effective SRI is recommended, as suicides 
have been known to occur after SRI discontinuation. 


Other somatic treatments 


Available case series and case reports, while very limited, suggested 
that ECT is generally ineffective for BDD and secondary depressive 
symptoms [2, 3, 41], although it should be considered for severely 
depressed and highly suicidal patients. 

Improvement in BDD symptoms has been noted in case re- 
ports with a modified leucotomy, capsulotomy, bilateral anterior 
cingulotomy plus subcaudate tractotomy, and anterior capsulotomy, 
and with deep brain stimulation that targeted the ventral capsule/ 
ventral striatum. Neurosurgery should be considered only when a 
patient has not responded to many adequate medication trials and 
to intensive CBT using an empirically based treatment manual 
for BDD. 


Cognitive behavioural therapy 
Research findings on CBT for BDD 


In recent years, great strides have been made in understanding the 
cognitive behavioural processes that contribute to BDD’s develop- 
ment and maintenance. CBT for BDD that is based on this theor- 
etical understanding is highly effective in reducing BDD symptom 
severity, as well as associated symptoms [47]. 

In an initial small randomized controlled trial of 19 individuals 
with BDD who were randomized to 12 sessions of CBT or a waitlist 
control, the CBT group showed significantly greater improvement 


in BDD symptoms and depressive symptoms [48]. Another ran- 
domized controlled trial of 12 sessions of CBT, compared to an 
anxiety management treatment, for 46 individuals with BDD 
found that the CBT group showed significantly greater improve- 
ment in BDD symptoms than the anxiety management group [49]. 
A study of a modular 22-week CBT treatment for BDD vs a 12- 
week waitlist condition (followed by crossover to CBT) for 36 in- 
dividuals with BDD showed that 50% of those who participated in 
CBT responded to CBT, compared to 12% in the control group, at 
week 12, a significant difference [50]. After 22 weeks of treatment, 
81% of all participants (immediate treatment group plus waitlisted 
patients subsequently treated with CBT) who completed treatment 
were responders. A meta-analysis also showed that CBT is asso- 
ciated with more improvement in BDD symptoms than a waitlist 
or psychological placebo [51], resulting in a reduction in BDD 
symptoms, depressive symptoms, and improvement in insight 
and delusionality [51]. Improvement appears to be sustained over 
months, and even years [47, 52]. 


Elements of CBT for BDD 


CBT must be tailored to BDD’s unique symptoms; in the authors’ ex- 
perience, CBT for OCD, social anxiety disorder, or other disorders 
will not adequately treat BDD. The authors recommend use of an 
empirically based CBT treatment manual, because BDD is a com- 
plex disorder that can be difficult to treat without a manual. Two 
such therapist manuals are available [53, 54]. 

Many experts recommend weekly hour-long outpatient sessions 
for 6 months, although some patients improve with fewer sessions 
and others require longer or more intensive treatment in an inten- 
sive outpatient or residential treatment setting. 

CBT for BDD typically includes [53, 54]: (1) cognitive restruc- 
turing to identify cognitive errors and develop more accurate and 
helpful BDD-related thoughts and beliefs; (2) exposure to avoided 
situations (for example, leaving the house, attending social gather- 
ings), combined with behavioural experiments to test the accuracy 
of BDD beliefs; (3) ritual (response) prevention to decrease or stop 
compulsive behaviours (for example, stopping excessive mirror 
checking, limiting grooming time); (4) advanced cognitive strat- 
egies to address maladaptive core beliefs; (5) perceptual retraining 
and mindfulness, which involves learning to see one’s entire body in 
a non-judgemental and ‘holistic’ way (rather than focusing on dis- 
liked areas), while refraining from excessive mirror checking; and 
(6) relapse prevention. 

One of the previously noted empirically supported treatment 
manuals includes the following optional modules for patients with 
relevant symptoms [53]: (1) skin picking and hair pulling or plucking; 
(2) muscularity and weight/shape concerns; (3) cosmetic treatment; 
and (4) depression/mood management. Motivational interviewing is 
often needed to engage and keep patients in treatment and may be 
used any time during treatment as needed. 

Patients who are not improving with CBT may need motivational 
interviewing to increase engagement in treatment and homework 
completion, more frequent sessions, longer sessions, or a change 
in the current CBT focus. Maintenance/booster sessions following 
CBT may reduce relapse risk. It may be helpful to add an SRI to CBT 
if CBT alone is insufficient or the patient is too depressed to partici- 
pate in CBT. 


Additional treatment considerations 
Therapy and support 


Only limited research has been conducted on third-wave therapies, 
like acceptance and commitment therapy (ACT), although prelim- 
inary evidence from one small study is promising [55]. Clinical ex- 
perience suggests that concomitant insight-oriented or supportive 
therapy—in addition to an SRI and/or CBT—may help some pa- 
tients cope with their illness or with co-occurring problems or dis- 
orders [2, 3]. Families can be an invaluable support and facilitate 
treatment [2, 3]. Mental health professionals may need to interface 
with dermatologists, plastic surgeons, and other physicians from 
whom patients have requested or are receiving cosmetic treatment. 


Suicidality 


Treatment approaches for more highly suicidal patients are dis- 
cussed in more detail elsewhere [54]. Such patients should always 
receive an SRI, which has been shown to decrease suicidal thinking 
and protect against worsening of suicidal ideation in BDD [56]. 
Concomitant medication may also be helpful, depending on the 
patient’s specific symptoms. The authors also recommend CBT for 
BDD if BDD appears to be the main cause of suicidal thinking. For 
highly suicidal patients, we recommend the use of an evidence- 
based CBT treatment manual that focuses on suicidal thinking 
[57]. Supportive therapy may be additionally helpful, depending 
on the patient’s specific situation, and a higher level of care may 
be needed. 


Substance use 


Substance abuse is common in individuals with BDD and often 
reflects attempts to self-medicate BDD-induced distress [58]. 
In such cases, successful treatment of BDD may reduce or 
eliminate problematic substance use. However, substance abuse 
often needs to be a major focus of treatment, using evidence- 
based medication (for example, naltrexone) and psychosocial 
treatment approaches for substance abuse, in addition to treat- 
ment for BDD. 


Conclusions 


BDD is a common, but under-recognized, disorder that is often se- 
verely impairing and can lead to suicide. Research and knowledge of 
BDD are greatly increasing, but more research is necessary to better 
understand the features of this disorder and increase education, so 
clinicians recognize this disorder and implement recommended 
treatment. SRIs and CBT for BDD often improve BDD symptoms 
and associated symptoms such as depression, anxiety, and poor or 
absent insight. Clinicians should specifically screen patients for BDD 
in order to recognize this common disorder and provide effective 
treatment. 


Further information 


Further information about BDD and its treatment is provided in ref- 
erences [2-4], [43], [53-54], and [59]. 
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Eating disorders may be defined as ‘a persistent disturbance of eating 
behaviour or behaviour intended to control weight, which signifi- 
cantly impairs physical health or psychosocial functioning’ [1]. It 
is conventional to divide them into diagnostic categories, based on 
their clinical presentation. The three main diagnoses are anorexia 
nervosa, bulimia nervosa, and binge eating disorder. In addition, 
there are similar presentations that fall just outside these three diag- 
noses. We will term these the ‘other eating disorders’ or OEDs. 









































The classification of eating disorders and their principal diag- 
nostic criteria are shown in Box 101.1. Anorexia nervosa was 
first named and formally described in 1873 [2, 3]; the concept 
of bulimia nervosa was introduced in 1979 [4], and binge eating 
disorder was formally recognized in 2013, although the phenom- 
enon was described in the 1950s [5]. The OEDs came to attention 
in the 2000s when they were encountered in large numbers with 
the opening of community-based eating disorder clinics [6]. The 
American term for OEDs used to be ‘eating disorders not other- 
wise specified, but in DSM-5, it was changed to ‘other specified 
feeding and eating disorders’. 

Other disorders of eating are recognized. In DSM-5, these are 
pica, rumination disorder, and avoidant/restrictive food intake dis- 
order. These states are quite different from the three main eating 
disorders and are mostly seen in children or those with intellectual 
disability. They are not covered in this chapter. 

















Anorexia nervosa, bulimia nervosa, and OEDs share a common 
and distinctive ‘core psychopathology’ This is the over-evaluation 
of shape and weight and their control. This psychopathology is not 
usually present in binge eating disorder. It is characterized by the 
judging of one’s self-worth largely, or even exclusively, in terms of 
one’s shape and weight and one’s ability to control them. In con- 
trast, most people judge themselves on the basis of their perceived 





performance in a variety domains of life, such as their perform- 
ance at work, at sport, and at parenting, and the quality of their 
relationships. 

Most features of these disorders stem from the over-evaluation 
and its consequences. There are repeated intense feelings of fatness, 
and most people either closely monitor their shape or avoid seeing it 
altogether. The desire to control body weight leads to persistent and 
strict dieting, together with other forms of extreme weight control 
behaviour, including self-induced vomiting, laxative misuse, and 
over-exercising. Strict dieting is highly impairing, as it requires a 
great deal of effort, often provokes anxiety, and can make socializing 
difficult, if not impossible. It may also lead to weight loss or binge 
eating, or both. 

Generally, people with an eating disorder view dieting as an ac- 
complishment, rather than a problem. It is not something that they 
want to change. However, in bulimia nervosa, the strict dieting is 
interrupted by repeated episodes of loss of control over eating, in 
which large amounts of food are eaten (binges). These binges are 
highly aversive, and as a result, they lead people to seek help. Their 
help-seeking is hindered, however, by the shame and secrecy that 
typically accompanies binge eating. 





Anne 
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xia nervosa 


In anorexia nervosa, the pursuit of weight loss is successful in that 
significantly low weight is achieved. In most instances, there is no 
true anorexia as such. The loss of weight is primarily the result of a 
deliberate severe and selective restriction of food intake, with foods 
viewed as fattening being excluded. In some people, restriction 
over food intake is encouraged by additional motives, including as- 
ceticism, a desire to feel ‘special, and a wish to punish themselves. 
Self-induced vomiting and other extreme forms of weight control 
behaviour are practised by some. Many also engage in a driven form 
of over-exercising, which can also contribute to their low weight. 
There is a constellation of clinical features which are a direct re- 
sult of being significantly underweight and is seen in other starva- 
tion states. These features include obsessional features, low mood, 
irritability, impaired concentration, loss of sexual appetite, and poor 
sleep. Typically, these features get worse, as weight is lost, and im- 
prove with weight regain. Interest in the outside world also declines 
as people become underweight, with the result that most become 
socially withdrawn and isolated. This feature is also reversible. Some 
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Box 101.1 Classification and diagnosis of the eating disorders 


Definition of an eating disorder 

There is a definite disturbance of eating habits or weight control be- 
haviour. Either this disturbance or the associated ‘core psychopathology’ 
(see text) results in clinically significant impairment of psychosocial func- 
tioning or physical health. The behavioural disturbance is not secondary 
to any general medical disorder or to any other psychiatric condition (for 
example, major depression). 


Principal diagnostic criteria 

Anorexia nervosa 

1 Over-evaluation of shape and weight (that is, judging self-worth 
largely or exclusively in terms of shape and weight). 


2 Active maintenance of an unduly low body weight (for example, body 
mass index <18-5 kg/m’). 


Bulimia nervosa 

1 Over-evaluation of shape and weight (that is, judging self-worth 
largely or exclusively in terms of shape and weight). 

2 Recurrent binge eating (that is, recurrent episodes of uncontrolled 
eating in which an unusually large amount of food is eaten for the 
circumstances). 

3 Extreme weight control behaviour (for example, strict dietary restric- 
tion, frequent self-induced vomiting, or laxative misuse). 

4 The diagnostic criteria for anorexia nervosa are not met. 








Binge eating disorder 

1 Recurrent binge eating (that is, recurrent episodes of uncontrolled 
eating in which an unusually large amount of food is eaten for the 
circumstances). 

2 The absence of extreme weight control behaviour (for example, strict 
dietary restriction, frequent self-induced vomiting, or laxative misuse). 


Other eating disorders 
Eating disorders of clinical severity that do not conform to the diagnostic 
criteria for anorexia nervosa, bulimia nervosa, or binge eating disorder. 




















people have times when they lose control over eating, although the 
amounts eaten may not be large. 


Bulimia nervosa 


The main feature that distinguishes bulimia nervosa from anorexia 
nervosa is that attempts to restrict food intake are interrupted by 
repeated ‘binges’ (episodes of eating during which there is an ac- 
companying sense of loss of control and an unusually large amount 
of food is eaten for the circumstances). The amount consumed 
varies greatly but is typically between 1000 and 2000 kcal. In most 
instances, binge eating is followed by self-induced vomiting or laxa- 
tive misuse, in an attempt to avoid absorbing what has been eaten. 
The episodes of binge eating, together with strict dieting, result in a 
body weight which is generally unremarkable, providing the other 
obvious difference from anorexia nervosa. Depressive and anxiety 
features are prominent in bulimia nervosa, and as in some cases 
of anorexia nervosa, there is a subgroup who engage in substance 
misuse or self-injury, or both. 


Binge eating disorder 


In binge eating disorder, there is just one primary feature. This is the 
occurrence of repeated episodes of binge eating. Unlike in bulimia 
nervosa, where the binges occur in the context of extreme attempts 
to diet, little or no extreme weight control behaviour is seen in binge 


eating disorder. Indeed, if there is a disturbance of eating outside 
the binges, it is generally overeating, rather than undereating. For 
this reason, binge eating disorder often co-occurs with a raised body 
weight or frank obesity. As in bulimia nervosa, there is distress over 
the binge eating and accompanying shame and self-criticism. 


Other eating disorders 


The OEDs are clinical states which meet the general definition of an 
eating disorder but do not fulfil the diagnostic criteria for any of the 
three specified eating disorders. They include presentations which 
closely resemble anorexia nervosa, bulimia nervosa, or binge eating 
disorder. For example, the patient's weight might be just above the 
diagnostic threshold for anorexia nervosa or the frequency of binge 
eating might not be high enough to allow a diagnosis of bulimia 
nervosa or binge eating disorder. Other presentations are ‘mixed’ in 
form, with features of both anorexia nervosa and bulimia nervosa 
being present. For instance, there may be extreme dietary restraint, 
pronounced over-exercising, occasional binge eating, and a low-to- 
normal weight. Over-evaluation of shape and weight is present in 
most of the OEDs, although in the early stages of the eating disorder, 
the focus of the over-evaluation may be primarily on maintaining 
strict control over eating. 





Development and course of the eating disorders 


Anorexia nervosa typically starts in mid-teenage years with the 
onset of dietary restriction, which then proceeds to get out of con- 
trol. In some instances, the disorder is short-lived and self-limiting, 
or only requires a briefintervention. These instances are most typical 
of young people. In others, the eating disorder becomes entrenched, 
although it is common for binge eating to develop and the weight to 
increase, with the result that the eating disorder evolves into bulimia 
nervosa. In 10-20% of people, the anorexia nervosa state persists 
and proves intractable [7]. This heterogeneity in the course and 
outcome of anorexia nervosa is often neglected in accounts of the 
disorder. Anorexia nervosa has been consistently found to be asso- 
ciated with a raised mortality rate, with the standardized mortality 
ratio being about 6 [8]. Most deaths are either a direct result of med- 
ical complications or due to suicide. 

Bulimia nervosa has a slightly later age of onset than anorexia 
nervosa, although it usually starts in much the same way; in- 
deed, in about a quarter of cases, the diagnostic criteria for anor- 
exia nervosa are met for a time. Eventually, however, episodes of 
binge eating begin to interrupt the dietary self-control and, as a 
result, the body weight rises to normal or near-normal levels. The 
disorder tends to be highly self-perpetuating in the absence of ef- 
fective treatment. The average length of history at presentation is 
about 5 years. 

Little is known about the development and course of the OEDs. In 
many instances, they are preceded by anorexia nervosa or bulimia 
nervosa, in which case the current state is simply the latest expres- 
sion of an enduring eating disorder. 

Binge eating disorder differs from anorexia nervosa and bulimia 
nervosa in its course and outcome. Rather than running a persistent 
course, it tends to be phasic, with there being long periods free from 
binge eating. Indeed, there appears to be a high spontaneous remis- 
sion rate, at least in the short term. 


The transdiagnostic perspective on the 


eating disorders 





An alternative way of viewing the eating disorders is to take a 
‘transdiagnostic’ perspective [9]. This view is based on two obser- 
vations. Firstly, anorexia nervosa, bulimia nervosa, and most of the 
OEDs have many highly distinctive clinical features in common. 
These include the over-evaluation of shape and weight, a character- 
istic and persistent form of dieting, and a tendency to engage in ex- 
treme weight control behaviour. And secondly, patients frequently 
move between these diagnoses. Temporal migration between sup- 
posedly distinct mental disorders is a most unusual phenomenon 
outside the eating disorders, and it casts doubt over the validity of 
the diagnostic distinctions. However, the fact that eating disorders 
do not evolve into other mental disorders supports the distinctive- 
ness of the diagnostic category as a whole. 
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In light of the current difficulties in treating feeding, eating, and 
weight disorders, the overall pragmatic goal is to find novel portals 
of entry for successful pharmacological and non-pharmacological 
interventions that prove superior to those currently pursued in 
terms of both efficacy and safety. The ongoing elucidation of the 
basic mechanisms underlying feeding, eating, and, in particular, 
weight disorders, should ideally help to develop novel treatments. 
However, similar to other mental disorders or complex disorders 
per se, the sheer number of aetiological factors, their proven and as- 
sumed small effect sizes, and their complex interactions render this 
a formidable task. In addition, side effects of weight loss medications 
have led to the withdrawal of anti-obesity drugs, which, in turn, has 
reduced research investment of the pharmaceutical industry. 

Because of the association of specific eating disorders with dif- 
ferent body weight categories and the obvious clinical relevance of 
underweight in both anorexia nervosa (AN) and avoidant restrictive 
food intake disorder (ARFID) and obesity in binge eating disorder 
(BED), one potential avenue is to focus jointly on body weight and 
the specific eating disorder symptomatology. Both obesity and AN 
have been viewed as representing behavioural addictions [1, 2]; 
accordingly, consideration of a treatment strategy in accordance 
with an addiction hypothesis is also warranted. Alterations of the 
microbiome have been detected in patients with AN and obesity; in 
animal models, evidence has accumulated that body weight can be 
influenced by altering the microbiome. 














Health care professionals are keenly aware of the hesitancy of pa- 
tients with AN to readily embark on treatment schemes. In some 
ways, this hesitancy is similar to that observed for a substantial 
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subgroup—if not even the majority—of patients with obesity. To 
gain weight or lose weight requires commitment over an extended 
period of time—and even then, success is by no means guaranteed; 
in both AN and obesity, patients have difficulties in maintaining 
their achieved weights. For eating and weight disorders, dropouts 
and lost-to-follow-ups are frequent in clinical trials [3-5], indirectly 
signalling that patients are unable to adhere to, or are not content 
with, the currently available treatment regimens. 

Indeed, conventional treatment schemes do not result in quick rem- 
edies. Eating disorders, and AN in particular, run a protracted course 
with persistent symptoms. The mean body weight of patients remains 
skewed—albeit to a not as extreme extent—towards the left (under- 
weight) side of the distribution, even after recovery from full-blown 
AN [6, 7]. Recent evidence now links genetic factors involved in AN 
to constitutional underweight [8, 9], as was already suggested over 
20 years ago [10]. Obesity in adolescence or adulthood seems to be 
even more difficult to overcome. Thus, a meta-analysis based on be- 
havioural intervention trials focusing on both diet and physical activity 
and encompassing almost 3000 adults resulted in an average difference 
of -1.56 kg (95% confidence interval -2.27 to -0.86 kg) at 12 months, 
in comparison to controls [11], which is a rather marginal effect size. 

Subgroups of individuals with AN or obesity reject a patient 
status. Instead exceedingly low or high weight is perceived as being 
within the ‘normal’ range of the body weight distribution. Thus, the 
mission of the National Association to Advance Fat Acceptance [12] 
is ‘to eliminate discrimination based on body size and provide fat 
people with the tools for self-empowerment though public educa- 
tion, advocacy, and support’ Similarly, ‘it’s not a diet, it’s a lifestyle’ is 
a slogan referenced in MyProAna [13]. Obviously, health care pro- 
fessionals are also aware of patients with eating disorders or obesity 
who do suffer and seek treatment. In part, this suffering is driven by 
comorbid somatic and/or mood disorders. 

Accordingly, it is difficult to predict the extent to which patients 
with eating and weight disorders would indeed take a pill to over- 
come their disorder. Apart from ego-syntonicity, comorbidity, cul- 
tural influences, and lay beliefs, efficacy and safety represent key 
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determinants. In addition, if the disorder, in itself, in some way has a 
rewarding effect, treatment-induced removal of such an effect could 
also influence acceptability. 





Energy homeostasis and body weight categories 


A stable body weight results from a balance between energy intake 
and expenditure. Energy intake is a complex phenomenon which 
largely depends on internal (including emotional), social, time, and 
environmental cues to eat, feelings of appetite/hunger, food and 
beverage preferences, alcohol, tobacco and drug use, medication, 
and conscious efforts to influence eating behaviour. In sedentary 
individuals, total energy expenditure is determined as approxi- 
mately 70% by resting energy expenditure (REE) largely accounted 
for by muscle and organ mass. The remaining 30% is accounted for 
by the thermogenic effect of food (the energy used to absorb, di- 
gest, and store nutrients) and activity subdivided into regular phys- 
ical activity and non-exercise activity thermogenesis (NEAT) as in 
fidgeting [14]. 

This highly complex regulatory system has evolved to defend the 
individual against loss of body weight (fat and fat-free mass) and 
to allow adaptation to starvation [15-17]. Both intermittent experi- 
mental overfeeding and caloric restriction entail counter-regulatory 
adaptations of energy expenditure; upon cessation of such experi- 
ments, the body weight of participants frequently returns to the pre- 
intervention level. However, the recent obesity pandemic illustrates 
that the human organism is not well protected from the develop- 
ment of high body weight. In contrast, the signalling network to 
allow adaptation to hunger and starvation is extensive, because food 
supplies have repeatedly been scant during evolution. 

Adult body weight categories are sub-classified according to 
body mass index (BMI) (Table 102.1) [18]. In healthy underweight 
(BMI <18.5 kg/m’) subjects, whose BMI is below the weight cri- 
terion threshold for AN, a long-standing history of underweight 
is the rule. Intermittent substantial weight loss is not observed, 
although weight loss may occur during stressful life events; fur- 
thermore, a family history of underweight and the virtual absence 
of obesity in first-degree relatives are common. Subjects with over- 
weight (BMI 225 kg/m’) and obesity (BMI 230 kg/m”), including 
extreme obesity (BMI 240 kg/m’), have usually accumulated 
their excess adiposity over a prolonged time period; however, a 
more rapid weight gain leading to overweight can occur within 


Table 102.1 Weight categories based on BMI 


eM 


Underweight <18.5 
Normal weight 185-249 
Overweight 275 
Pre-obesity 25299 
Obesity grade | 30-34.9 
Obesity grade || 2D 
Obesity grade III >40 


Reproduced from World Health Organization, Obesity: preventing and managing the 
global epidemic, Copyright (2000), with permission from the World Health Organization. 
Available at http://www.who.int/nutrition/publications/obesity/WHO_TRS_894/en/ 


the context of specific time periods such as, for example, puberty, 
pregnancy, menopause, an episode of major depression (for ex- 
ample, atypical depression), cessation of smoking or during treat- 
ment with drugs that induce weight gain, and even binge eating as 
a side effect [19]. 

Complex interactions of environmental factors with individual 
genetic factors result in normal weight, underweight, or over- 
weight/obesity. Whereas it has proven difficult to unequivocally 
attribute the development of obesity to specific environmental 
factors entailing an increased energy intake and/or reduced en- 
ergy expenditure and, as a net effect, an energy surplus, the 
current obesity epidemic is attributed to environmental factors 
(obesogenic environment). Potentially, and similar to the gen- 
etic architecture of BMI [20], the number of environmental fac- 
tors is large; overall, effect sizes of single factors may well prove 
to be rather small; nevertheless, in a given individual genome- 
environment, interactions may render specific environmental fac- 
tors much more important than others. Within any given society, 
the obesogenic environment may not differ substantially, thus 
enabling genetic factors to play a larger role in explaining BMI 
variance of the respective population. Accordingly, BMI herit- 
ability estimates based on twin studies have remained stable over 
the past decades [21]. 

Despite the assumed importance of the environment for the 
obesity epidemic, there is no easy way to turn the clock back; this, 
by itself, points to the multitude of factors involved. In industrial- 
ized countries, sedentary office workplaces, a sedentary lifestyle, and 
even more so overconsumption of foods are perceived as major con- 
tributors to excess weight gain. Changes in the global food system, 
which is producing palatable, more processed, cheap, and effectively 
marketed food, are thought to entail ‘passive’ overconsumption of 
energy. Overweight and obesity can thus be viewed as a predictable 
outcome of market economies predicated on consumption-based 
growth [22]. Importantly, social class is a predictor of body weight 
in (post-)industrial and developing countries, albeit with partially 
opposite effects [23, 24]. 

Behaviour and emotionality undoubtedly contribute to the devel- 
opment of eating and weight disorders. However, among laypersons, 
these factors frequently are perceived as the sole factors. The lay per- 
ception that behaviour is important and modifiable is, to a large 
extent, responsible for the stigma associated with both groups of 
disorders. Such lay beliefs may even have an impact on body weight; 
for example, subjects who believe that obesity results from a lack of 
exercise had a higher BMI than those who considered a poor diet 
at the core [25]; this effect on body weight was not explained by 
other factors with an influence on body weight, co-assessed in the 
respective study. Quite a substantial number of patients with either 
AN or obesity have the perception that surmounting mental prob- 
lems will enable them to achieve a normal body weight; relatives, 
friends, and health care professionals frequently contribute to this 
way of thinking. This mindset can just as well be found in patients, 
in whom the respective condition affects multiple family members, 
demonstrating that the complexity of the interaction of genetic and 
environmental factors in the aetiologies of these disorders is fre- 
quently confusing to those most concerned. Interestingly, the lay 
belief that genetic factors play a major role in obesity has been asso- 
ciated with reports of lower levels of both physical activity and fruit 
and vegetable consumption [26]. 
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Disordered body weight regulation 


Because eating and weight disorders are associated with weight 
loss (AN), weight stunting (AN, ARFID), weight fluctuations 
(AN, bulimia nervosa, BED), and weight gain (obesity), diverse 
regulatory mechanisms that normally serve to defend body 
weight are overridden. This perturbation can stem from, and en- 
tail, both somatic and psychological malfunctioning, resulting 
in a complex intertwinement of primary and secondary factors. 
[27]. Neuroendocrine findings in eating disorders primarily re- 
flect the consequences of these perturbations (Table 102.2); major 
causal neuroendocrine mechanisms have not been identified 
for eating disorders. Knowledge of the induced neuroendocrine 
dysregulations helps partially to understand the respective clinical 
symptomatology. In addition, efforts to gain weight by a patient 
with chronic AN may be hampered by ‘physiological’ counter- 
regulation, in addition to the primary underlying disturbances; 
the long-standing underweight may have led to a reduction of the 
body weight set-point. 


Elucidation of monogenic obesity opened 
the door to the neuroendocrine circuitry 


underlying body weight regulation and 
the physiological adaptation to starvation 





Multifactorial obesity is the most common form of obesity in the 
general population [19]; this type of obesity results from the com- 
plex interaction of hundreds of polymorphisms and environmental 
factors. Currently, genetic variation at 941 independent loci has been 
detected via genome-wide association analyses; the mean allelic 
effects on body weight range from approximately 1.5 kg to <150 g. 
Approximately 6% and 22% of the BMI variance can be attributed to 
the currently identified polygenic loci or have been estimated to, in 
total, explain the polygenic contribution [20, 28]. 

Cloning of the leptin gene in 1994 and the identification of the rare 
mutations entailing the autosomal recessive phenotype of the obese 
mouse and the equivalent human leptin deficiency syndrome repre- 
sented initial breakthroughs in our understanding of the molecular 


Table 102.2 Endocrine alterations in AN and BN’ 


























Thyroid axis J £13, n (Y) £14 n(f 
Gonadal axis IHES pudl 
4 LH pulsatility mi 
{ Oestrogens nil 
{ Androgens nil 
Adrenal axis ft cortisol n(t 
n DHEAS n 
Growth hormone GH resistance n(t 
(t GH/J IGF-1) 
Appetite-regulating hormones { leptin nl 
t ghrelin (fasting) i 
T (n) PYY (fasting) N 
T, elevated; |, reduced; n, normal; fT3, free triiodothyronine; fT4, free thyroxine; LH, 





luteinizing hormone; FSH, follicle-stimulating hormone; GH, growth hormone; IGF-1, 
insulin-like growth factor, type 1. 
“ [90, 91]. 


mechanisms involved in body weight regulation and the behaviours 
associated with both an increased energy intake and a decreased ex- 
penditure [29]. Leptin-deficient humans for the first time revealed 
that extreme obesity without cognitive impairment can result from a 
monogenic disorder [30]; the voracious appetite and ensuing behav- 
iours of these individuals are driven by the genetically based lack of 
this anorexigenic hormone. 

Leptin is mainly synthesized in adipocytes and secreted into 
blood; circulating concentrations are correlated with BMI and fat 
mass in particular. Central leptin signalling is initiated via binding 
of leptin to its receptor at the cell surfaces of two different types of 
first-order neurons within the arcuate nucleus in the hypothalamus 
(Fig. 102.1). These neurons, in turn, convey the leptin signal to 
second-order neurons; brain-derived neurotrophic factor (BDNF) 
and its receptor are important downstream effectors of the leptin- 
melanocortinergic pathway, which projects to different neurotrans- 
mitter systems [15, 16, 28]. 

Prolonged reduced energy intake due to, for example, a famine 
reduces the endocrine function of adipose tissue, which is com- 
pounded by the ensuing loss of fat mass. Hence, circulating leptin 
drops to subnormal levels and, via this process, initiates neuroendo- 
crine adaptation to starvation [15, 16]. Reduced leptin signalling af- 
fects the hypothalamus-pituitary axis, which down- or upregulates 
the functions of downstream end-organs, including the thyroid, 
gonads, and adrenals. In addition, growth hormone is upregulated, 
while others like insulin-like growth factor 1 (IGF-1), a neurotrophic 
polypeptide, are downregulated. 

Other examples of autosomal recessive forms of monogenic 
obesity include those due to mutations in the genes for the leptin 
receptor, pro-opiomelanocortin (POMC) and proconvertase 1 [29]. 
All of the recessively inherited forms of monogenic obesity entail 
additional phenotypic features such as hypothalamic amenorrhoea 
associated with functional leptin deficiency or red hair and adreno- 
corticotrophic hormone (ACTH) deficiency in a subgroup of POMC 
mutation carriers; these associated phenotypes underscore the fact 
that the neuroendocrinology of body weight regulation is inter- 
woven with other functions. 

One of the second-order neurons expresses the melanocortin-4 
receptor (MC4R) gene, which can harbour a vast range of function- 
ally relevant mutations entailing dominantly inherited obesity in 1- 
5% of extremely obese children and adults. The anorexigenic effect 
of the leptin signal is diminished, if binding of the POMC cleavage 
product alpha-melanocyte stimulating hormone (a-MSH) to the 
MCAR is reduced as a consequence of such mutations. The net ef- 
fects of such MC4R mutations result in body weight increments of 
approximately 15 kg and 30 kg in males and females, respectively. 
Obesity due to MC4R mutations is not associated with any readily 
easily recognizable associated phenotype [29]. 

Apart from the hypothalamus, several other subcortical and cortical 
brain regions, including those comprising the reward system [31], are 
involved in body weight regulation and adaptation to starvation. In 
addition to leptin, a number of peripheral hormones and peptides 
arising from the digestive system (gut-brain axis), muscle, and other 
organs provide the link between the periphery and the central nervous 
system via specific central receptors. Furthermore, endocrine and 
neurocrine pathways are assumed to be involved in gut microbiota- 
to-brain signalling; vice versa, the brain can alter microbial compos- 
ition and behaviour via the autonomic nervous system [32]. 
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Fig. 102.1 Leptin has central effects on feeding behaviour, appetite, motor activity, and cognitive function, but also peripheral effects on the insulin- 
glucose axis. In the arcuate nucleus of the hypothalamus, leptin activates two different types of first-order neurons. One co-expresses the orexigenic 
peptides neuropeptide Y (NPY) and agouti-related protein (AgRP), the other the anorexigenic peptides cocaine- and amphetamine-related transcript 
(CART) and pro-opiomelanocortin (POMC). POMC is cleaved into different peptides, including a-melanocyte-stimulating hormone (a-MSH). AgRP 
and a-MSH compete for the MC4 receptor (MC4R). AgRP antagonizes MCAR activity, and a-MSH stimulates MCAR activity. Decreased MCAR activity 
generates an orexigenic signal, whereas increased activity conveys an anorexigenic signal. 


Drug-induced weight loss and its potential for the 


treatment of obesity 





Recent insight into the neuroendocrine regulation of body weight 
has led to the identification of many potential targets to influence 
body weight. As a consequence of the high prevalence of obesity, 
drug-induced weight loss has been the major focus of pharmaco- 
logical research; however, AN, ARFID, sarcopenia, and cachexia 
due to infections, cancer, and other conditions also clearly warrant 
efforts to devise drugs that promote weight gain. The elucidation of 
polygenes in body weight regulation [20, 28] is of a too recent origin 
to have allowed their full exploitation with respect to a selection of 
drug targets. It is also questionable whether the low effect sizes of the 
identified genes will trigger research efforts in the pharmaceutical 
industry. However, the advances into treatment of different types of 
monogenic obesity underscore the potential of making use of the 
neuroendocrine mechanisms and of genetic variation underlying 
obesity in particular. Thus, daily treatment with subcutaneously ap- 
plied recombinant leptin is able to normalize the eating behaviour 
of patients with inborn functional leptin deficiency within days; in 
the longer term, substantial weight loss ensues [30]. Two patients 
with a form of monogenic obesity due to POMC mutations benefited 
from treatment with the MC4R agonist setmelanotide; weight loss 
amounted to 51 kg and 20.5 kg after treatment for 42 and 12 weeks, 
respectively [33]. Functional rescue of stop mutations in MC4R has 
been demonstrated in a rodent model based on aminoglycoside- 
mediated read-through of stop codons [34]. MC4R agonists are 
currently being evaluated for the treatment of both obesity due to 
MCA4R mutations and multifactorial obesity [35]. Case reports in- 
dicate that indirect sympathomimetic stimulants and atomoxetine 
may also induce substantial weight loss in individuals with MC4R 
mutations [36, 37]. 

In general terms, the known anti-obesity drugs can be classified 
as either centrally acting appetite suppressants and satiety enhan- 
cers (with or without additional peripheral actions) or peripherally 
acting agents [38]. As of today, no major clinical breakthrough has 


been achieved in the pharmacological treatment of multifactorial 
obesity or eating disorders. Whereas some drugs or drug combin- 
ations have been shown to acutely induce weight loss, clinically 
significant and regulatory agency-required medium-term (1 year) 
weight loss mostly averaged in the range of 3-5 kg in excess of pla- 
cebo. Weight regain typically sets in upon discontinuation of the re- 
spective medications [38]. 

Apart from these disappointingly low mean weight losses, the 
history of anti-obesity drug development is fraught with signifi- 
cant cardiovascular or central nervous system side effects that have 
frequently led to their market removal (for example, fenfluramine, 
sibutramine, rimonabant [39]) or failed clinical studies prior to any 
new drug approval (NDA) submission; interestingly, the rare emer- 
gence of AN represents one such serious adverse event [40]. As a 
consequence, all large pharmaceutical companies have stopped or 
significantly reduced their obesity drug development programmes; 
in light of the regulatory requirements, the financial risks currently 
appear too high. Currently, only startups or some mid-sized com- 
panies pursue this challenging indication. However, paradoxically, 
patients with grade III (BMI 240 kg/m’) or grade II (35 < BMI < 
40 kg/m”) obesity and obesity-related comorbid disorders are eligible 
for bariatric surgery, which entails a mortality risk of approximately 
0.5% [41]. As such, a more positive and differentiated approach to 
pharmacological treatment appears warranted, according to which 
higher risks of adverse events during pharmacological treatment of 
extreme obesity should not automatically entail withdrawal of the 
respective drug. On the other hand, regulatory agencies are justifi- 
ably concerned with off-label use of drugs to promote weight loss in 
less obese individuals. 

Interestingly, obesity drugs have failed because of side effects due 
to actions in pathways overlapping with the reward and other central 
nervous systems. Of 25 drugs withdrawn worldwide between 1964 
and 2009, 23 acted via central mechanisms, with psychiatric side 
effects explaining seven withdrawals; drug abuse and dependence 
accounted for roughly 50% of these withdrawals [39]. A centrally 
acting drug that leads to weight loss (or gain) frequently also affects 
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mood, cognition, and/or behaviour. The serotonin—noradrenaline 
reuptake inhibitor sibutramine was originally developed as an anti- 
depressant; due to an elevated cardiovascular risk profile and despite 
weight loss, which, in theory, should have reduced cardiovascular 
risks, the drug was taken off the market. The cannabinoid receptor- 
1 antagonist rimonabant (Acomplia) was associated with depres- 
sion and suicidal ideation in a small subgroup of patients, leading 
finally in 2009 to authorization by the European Medicines Agency 
(EMA) for its withdrawal from the market, 3 years after its approval 
in 2006; the US Food and Drug Administration (FDA) had never 
licensed the use of this drug in the United States due to concerns 
for neurologic and psychiatric side effects. The approved anticonvul- 
sant topiramate (Topamax ), which induces weight loss in excess of 
both sibutramine and rimonabant, is associated with paraesthesiae, 
taste impairment, suicidal ideation, and psychomotor disturbances; 
in total, the adverse side effects again were deemed too problem- 
atic to market topiramate as a stand-alone anti-obesity drug [39]. 
However, low-dose combination with phentermine has received 
market approval. 

Of the five drugs currently licensed by the FDA for the treat- 
ment of obesity, two represent combination drugs (phentermine- 
topiramate and naltrexone-bupropion). All five drugs are superior 
to placebo in inducing a weight loss of at least 5% within a clinical 
trial duration of 12 months. The proportions of patients meeting this 
threshold are shown in Table 102.3 [42]. 

Liraglutide and naltrexone-bupropion were associated with the 
highest odds of adverse event-related treatment discontinuation. Of 
these five drugs, the EMA has licensed the use of orlistat, naltrexone- 
bupropion, and liraglutide. 

The ongoing elucidation of the pathways involved in energy 
homeostasis is continuing to turn up novel targets for anti-obesity 
drugs; similarly, the metabolic alterations after bariatric surgery 
have provided additional insight into the complex regulation of 
body weight [43]. Several drugs, including an MCA4R agonist and 
an enhancer of fat oxidation, are in development for subtypes of 
obesity. A small number (in comparison to the era prior to 2007) of 
clinical trials are ongoing (see clinicaltrials.gov). Due to both the 
limited effect of a single drug and dose-dependent increases in side 


effects, combination therapies and, in particular, polypeptide hor- 
mone combinations appear promising [44]. 

Despite the detection of numerous targets and ongoing drug de- 
velopment, the four new anti-obesity drugs approved by the FDA 
between 2012 and 2014 did not fulfil their market expectations. 
Acceptance by physicians in the United States was substantially 
lower than for other novel drugs; furthermore, patient acceptance 
has only picked up slowly [45]. Obviously, the frequent market 
withdrawals of anti-obesity drugs have contributed to a negative 
perception of pharmacological treatment of elevated adiposity. 
This history has reduced investment in drug development for this 
highly relevant disease for the foreseeable future. The expectations 
of both patients and physicians require a realistic discussion as to 
the pros and cons of drug treatment of elevated adiposity; in some 
countries, reimbursement issues by health insurances also need to 
be addressed. A stand-alone drug treatment of obesity is unlikely 
to be successful; for the time being, lifestyle modification will re- 
main the cornerstone of weight management [38]. Central nervous 
system side effects and risks for drug dependence represent serious 
obstacles for the development and marketing of anti-obesity drugs. 


Drug-induced weight gain and its potential for the 


treatment of AN and AFRID 





No drugs have been specifically licensed for the treatment of AFRID 
or AN. As such, all medications currently used to treat these dis- 
orders symptomatically are prescribed off-label, for instance, to sup- 
press severe eating disorder-specific cognitions. However, because 
starvation-associated underweight can figure prominently in both 
disorders, approaches for drug-induced weight gain represents a 
major therapeutic aim. Appetite stimulants/orexigenic drugs clearly 
represent an option, although drugs that reduce energy expend- 
iture and physical activity may have potential too. Additionally, 
drug treatment of sarcopenia [46] and cachexia associated with 
cancer [47, 48], chronic kidney disease, HIV, other infectious dis- 
eases, cancer, and chronic obstructive pulmonary disease also 
warrants scrutiny for potential application in eating disorders. For 


Table 102.3 Obesity agents producing 5% weight loss or more (proportions of patients meeting this threshold shown as percentages) 


Mechanism of action % meeting threshold for 
significant weight loss 


Phentermine 
Topiramate 


Phentermine: non-selective stimulator of monoamine release 75% 
Topiramate acts as multifactorial: enhances GABAergic neurotransmissions, blocks voltage-dependent 


sodium channels, activates L-type calcium channels, enhances inhibitory activity of glutamate at AMPA 
(amino-3-hydroxy-5-methyl-4-isoxazole-proprionate) and kainate receptors, and inhibits several 


carbonic anhydrases isoenzymes [92] 
Liraglutide Glucagon-like peptide 1 analogue 


Naltrexone-bupropion Naltrexone: opiate antagonist 


63% 
55% 


Bupropion: previously seen as weak noradrenaline-dopamine reuptake inhibitor (NDRI), but potentially 
rather a noradrenaline and dopamine releasing agent (NDRA) 


Lorcaserin Serotonin 2C receptor agonist 
Orlistat Inhibitor of gastric and pancreatic lipases 
Placebo 


49% 
44% 
23% 


Source: data from JAMA, 315(22), Khera R, Murad MH, Chandar AK, et al., Association of Pharmacological Treatments or Obesity With Weight Loss and Adverse Events: A systematic 
Review and Meta-analysis, pp 2424-34, Copyright (2016), with permission from American Medical Association. 
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example, given the prominent role of the MC4R as a drug target for 
both induction of weight loss and weight gain, it is interesting that 
cachexia induced by lipopolysaccharide administration and by tu- 
mour growth is ameliorated by central MC4R blockade in rodent 
models [49]. 

Several licensed drugs, particularly psychopharmacological 
medications, show weight gain as an unwanted side effect. Clozapine 
and olanzapine have the strongest effect on body weight [50]. They 
act via an increase of caloric intake without affecting resting energy 
expenditure [51] and can induce binge eating attacks [52]. Weight 
gains exceeding 20 kg can occur in genetically susceptible individ- 
uals [53] who are treated for psychosis or other mental disorders; 
the mean average weight gain after 10 weeks is between 4 kg and 
4.5 kg [50]. However, use of olanzapine and related drugs in AN to 
promote weight gain has proven unsuccessful, as revealed by meta- 
analyses of randomized controlled trials [54]. It is unknown if the 
unresponsiveness is due to cognitive restriction or to starvation- 
dependent dysregulation. Interestingly, our own clinical observa- 
tion is that olanzapine can induce weight gain once patients have 
reached their target weight. 

Systematic studies of dopamine antagonists are not available for 
AFRID and feeding disorders associated with low body weight. The 
young age of the respective patients should imply a restrictive ap- 
proach to treatment with off-label medications. Three case reports 
based on pre-adolescent patients with feeding disorders suggest that 
risperidone may help to increase body weight [55]. A retrospective 
chart review of 127 children between 7 and 80 months with feeding 
difficulties not exclusively fulfilling the DSM-5 criteria for ARFID sug- 
gested that cyproheptadine, an antihistaminergic and antiserotonergic 
agent approved for use in cold urticaria in children and adolescents, 
can lead to weight gain and a positive change in mealtime and feeding 
behaviours within the context of a complex treatment programme 
[56]. Mild and transient side effects, such as drowsiness, irritability, 
and constipation, were reported in 14% of the children. 

Ghrelin represents an example of a hormone which induces food 
intake as an appetite stimulant; the hormone is synthesized in the 
enteroendocrine cells of the stomach and released upon caloric 
restriction—it thus signals hunger. Consistent with the neuroendo- 
crine adaptation to starvation, serum levels are elevated in patients 
with AN. Based on encouraging initial findings in a small number 
of patients with AN, a clinical trial is currently assessing the effects 
of the novel ghrelin receptor agonist RM-131 for treatment of AN 
[57]. Apart from other functions, the hypothalamic neuropeptide 
oxytocin is a regulator of food intake; a clinical trial in patients with 
AN is looking at the effects of intranasal oxytocin on smell and food 
consumption. In a double-blind, single-dose, placebo-controlled 
crossover study, oxytocin had no effect on the 24-hour calorie in- 
take in AN patients, whereas patients with BN consumed less cal- 
ories [58]. Because testosterone increases fat-free mass, a phase 2 
clinical trial is investigating the effect of transdermal testosterone on 
BMI of patients with AN [57]. The cannabinoid agonist dronabinol 
[the international non-proprietary name (INN) for delta-9- 
tetrahydrocannabinol (THC)] induced a small (0.73 kg in excess of 
placebo), but significant, weight gain after 4 weeks of treatment in 
the absence of severe adverse events in an add-on, prospective, ran- 
domized, double-blind, controlled crossover study, which included 
25 adult patients with chronic AN [59]. 


Pharmacological dissection of primary 


phenomena from starvation-induced 
symptoms in AN 





Treatment of patients with AN with recombinant leptin appears to 
be a promising approach to ameliorate starvation-induced symp- 
toms related to hypoleptinaemia [60]. Treatment with recombinant 
leptin has been shown to improve the function of the reproductive 
system, including resumption of menstruation in currently non- 
eating disordered females with hypothalamic amenorrhoea [61]. 
Amenorrhoea in patients with AN is induced by hypoleptinaemia 
[16]. The exact mechanisms linking the metabolic system with 
the reproductive system require further elucidation. Leptin binds 
to receptors on gonadotrophin-releasing hormone (GnRH) 
neurons. The signalling cascade for reproductive function includes 
gonadotrophin-inhibitory hormone (GnIH) and kisspeptin [62]. 

At the behavioural level, hyperactivity is encountered in a sub- 
stantial subgroup of patients with AN. Hypoleptinaemia likely 
also plays a crucial role in the development of this phenomenon, 
which, if severe, substantially impedes treatment. In rats subjected 
to food restriction, running-wheel activity increases by 300-400%. 
This starvation-induced hyperactivity is assumed to facilitate food- 
foraging behaviour. The development of hyperactivity is completely 
suppressed by concomitant treatment with recombinant leptin [63]. 
Because removal of the adrenals also prevents the development of 
anorexia-based activity, it appears that leptin exerts its effects via the 
hypothalamic-pituitary—adrenal axis [64]. In agreement with the rat 
model, low serum leptin levels in patients with AN have been asso- 
ciated with increased activity levels [65], suggesting that the same 
mechanism applies. This may provide a unique example of a pro- 
found physiologically induced and hormonally determined change 
in activity levels. It certainly substantiates the pivotal role of leptin in 
the adaptation to starvation [15, 16]. 

Other findings supporting the role of leptin in the neuroendo- 
crine adaptation to starvation include its suppression of growth 
stunting in rodent starvation models, its antidepressant effect in in- 
born leptin-deficient mice, and its effects on bone growth. Leptin 
has also been shown to have a neurotrophic effect; in leptin-deficient 
humans, treatment with recombinant leptin entails increments of 
the volume of specific brain regions [15, 16, 60]. Recombinant leptin 
has recently been licensed for the treatment of lipodystrophy by the 
FDA [66]; the EMA followed suit in 2018. Adverse effects occurred 
during treatment with metreleptin (Myalept ), but these seemed to 
be linked to the underlying disorder. 

Such trials in patients with AN would enable the dissection of 
primary from secondary starvation-induced symptoms in AN; an 
amelioration of somatic, cognitive, and behavioural symptoms ap- 
pears possible [66]. Patients with BN whose metabolism is in a semi- 
starved state might benefit as well. The risks of metreleptin treatment 
of patients with eating disorders might be manageable. Thus, leptin 
treatment of eight females with hypothalamic amenorrhoea was not 
associated with significant side effects [61]; in addition, long-term 
treatment of individuals with leptin deficiency [30] and a 24-week- 
long treatment of hypoleptinaemic (<5 ng/mL) adult females [67] 
were not associated with serious side effects. The production of anti- 
bodies neutralizing leptin has occurred after longer treatment and 
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was countered by intermittent discontinuation and application of 
higher doses. 

Obviously, patients with AN would have to be closely moni- 
tored during a pilot study; in particular, adequate feeding during 
treatment is crucial, as an elevation of leptin may reduce appetite. 
Preferentially, initial attempts should be directed at adult patients 
with hyperactivity [60, 66]. The high costs of metreleptin treatment 
(currently approximately halfa million dollars annually for the treat- 
ment of a patient with lipodystrophy [66]) represent an important 
obstacle to treatment acceptance. 


Eating disorders and behavioural addictions 





An addiction-like process has been suggested to explain the eating 
behaviour of subjects with AN [2], BED, and obesity [1]. Potentially, 
either overeating or caloric restriction can have a positive effect on 
mood, emotion regulation, and cognition in predisposed individ- 
uals. With respect to obesity, the terms ‘food addictior and ‘eating 
addiction illustrate the current controversy as to whether particular 
foods and substances (for example, sugar) can support an addictive 
behaviour as in substance use disorders or whether overeating is 
best categorized as a behavioural addiction or compulsion. The term 
‘paradoxical liveliness’ has been coined to illustrate that some pa- 
tients with AN appear more lively and become more focused and 
more ambitious when they initially develop their eating disorder 
[65]. The fact that all major religions have propagated short-term or 
intermittent fasting suggests that emotional and spiritual well-being 
may be positively influenced by the metabolic and physiological 
processes associated with the adaptation to starvation. 

The function of leptin within the reward system warrants appreci- 
ation [1, 68]. The rewarding value of nutrients requires activation of 
midbrain dopamine (DA) neurons. Fasting enhances the rewarding 
value, whereas it is diminished by leptin. Via leptin receptors ex- 
pressed on a subpopulation of midbrain DA neurons, leptin exerts 
an inhibitory effect on these neurons, thus potentially explaining a 
decreased nutrient reward and anorexia via downstream galanin- 
mediated inhibition of orexin neurons. Conversely, DA levels in the 
nucleus accumbens (NAc) are decreased in hyperphagic, leptin- 
deficient obese mice and increased by leptin injections. Food intake 
may thus result from DA deficiency in the NAc, entailing normaliza- 
tion of DA levels. Low levels of DA in the NAc have been postulated 
to underlie both the drive to eat in obese subjects and substance use 
in drug addicts [68]. 

Alterations of the brain reward circuitry have also been postu- 
lated to contribute to the development and maintenance of eating 
disorders. At the onset of AN, eating less food may be perceived 
as rewarding; afterwards, conditioning sets in. In the binge eating/ 
purging type of AN, BN, and BED, patients report a strong urge to 
eat, together with a sense of loss of control of their energy intake 
and a transient reduction of negative emotional states after bingeing. 
Patients with eating disorders frequently present with anhedonia, 
meaning that they are less able to experience reward in comparison 
to healthy controls [69]. O'Hara and coworkers [2] hypothesized 
that, rather than a generalized inability to experience reward, a cog- 
nitively driven reluctance to gain weight may contribute to the aver- 
sive appraisal of food- and taste-related stimuli. Hypotheses focusing 
on a reduced experience of reward include inappropriate processing 


of visual perception of food, distinct patterns of brain activation in 
response to gustatory stimuli, and ‘reward contamination’ (normal 
rewarding stimuli are experienced as aversive). Patients with eating 
disorders may activate dopaminergic motivational circuits differ- 
ently, so that normally aversive stimuli become attractive [69]. 

Whereas a reward-centred model of AN, and thus an overlap 
with addiction, appears plausible, substantial differences are also ap- 
parent. Further research is required to assess potential implications 
for psychotherapeutic and psychopharmacological treatment of 
eating disorders [2]. Prevention and treatment of obesity, based on 
the assumption that it (partially) results from food or eating addic- 
tion, warrants further scrutiny [1]. The 5-HT2c agonist lorcaserin, 
which has received FDA approval for the treatment of obesity, has 
further potential in substance use disorders, especially nicotine/ 
smoking. In particular, the lorcaserin dose dependently reduces 
smoking rates and at the same time reduces/prevents the weight gain 
typically associated with cessation of smoking [70]. 





Obesity, eating disorders, and the microbiome 


Microbiota transplantation experiments in mouse models first in- 
dicated that the gut microbiota influences weight regulation and 
modulates psychopathology. Stool extracts transferred from genet- 
ically or nutritionally induced obese to germ-free (GF) mice pro- 
duced a marked increase in body fat within a relatively short period 
of time [71]. Moreover, obesity and obesity-associated metabolic 
phenotypes were transmissible by uncultured microbiota of humans 
to GF rodents. Conversely, cohousing the microbiota-transplanted 
obese mouse with a mouse harbouring the microbiota from a lean 
individual prevented the development of obesity and the associ- 
ated metabolic complications [72]. Several studies, especially those 
following Roux-en-Y gastric bypass, demonstrated that obesity 
is related to a change in the relative abundance of two bacterial 
strains—the Firmicutes-to-Bacteroidetes ratio is increased in obesity 
and reduced with weight loss. However, it is still not clear whether 
these changes in microbial populations directly contribute to pro- 
mote weight loss or result from it [73]. 

The microbiome of obese individuals has an increased poten- 
tial to harvest energy from nourishment, for example by nutrition 
absorption and polysaccharide utilization. It produces bioactive 
metabolites, for example short-chain fatty acids (SCFAs) (specific- 
ally acetic acid, propionic acid, and butyric acid), by fermentation 
of partially digestible and non-digestible carbohydrates of dietary 
fibres. SCFAs affect the physiology of the host organism, for example 
the pH level of the colon, gut transit time, and glucose metabolism, 
thus modifying appetite and energy metabolism [74, 75]. Obese in- 
dividuals, including children and adolescents, have a higher relative 
abundance of microbiota able to ferment polysaccharides. The con- 
sequence is a higher amount of SCFAs providing more energy for 
the host, which can be stored as fat tissue or converted into glucose. 
In adults fed on a Western diet, the extra energy supply by produ- 
cing SCFAs has been estimated to add up to 10% of the overall daily 
caloric intake [76]. Moreover, these metabolites are able to cross the 
brain-blood barrier and impact neural circuits [77]. 

On the other hand, the microbiome also seems to play an im- 
portant role in starvation-induced weight loss. Transplantation of 
frozen bacterial species from children with kwashiorkor, a severe 
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form of malnutrition often observed in developing countries, pro- 
duced significant weight loss in GF mice, accompanied by severe 
metabolic changes. One-week application of an oral antibiotic sig- 
nificantly improved the nutritional status of malnourished Malawian 
children [78]. Significant changes of the gut microbiome were also 
found in the frequently used animal AN model of activity-based an- 
orexia (ABA) [79], which stimulates weight loss by reduction of food 
and access to a running wheel [63, 64]. Because of the starvation- 
and hyperactivity-induced weight loss in the animal model, de- 
creased thickness of the muscularis layer and significantly increased 
permeability of the intestine were observed [79]. 

Unfortunately, studies in patients with AN are still scarce. Two re- 
cent investigations found a significant difference between the abun- 
dance of certain bacterial groups in healthy controls and patients 
with AN, especially an increase in mucin-degraders and a decrease 
in butyrate producers in the latter group, which were only partly al- 
leviated by weight gain [80, 81]. Mucin-degraders feed on mucus 
covering the intestinal wall and probably contribute to the increased 
intestinal permeability identified in the ABA model for AN men- 
tioned previously [75]. 

Both in obesity and AN, there is emerging evidence for an increase 
in intestinal permeability, followed by low-grade inflammation, 
which, in turn, might contribute to the metabolic (and psycho- 
pathological) alterations found in both disorders [73]. The intestinal 
barrier prevents pathogenic microorganisms from spreading into 
the systemic circulation. However, a ‘leaky gut’ might facilitate the 
transfer of harmful microorganisms and toxins from the gut lumen 
to the lamina propria and mesenteric lymph nodes, from which they 
may reach the systemic circulation, especially if the immune system 
is dysfunctional. From this point, such so-called ‘pathobionts’ and 
other harmful substances could stimulate the production of pro- 
inflammatory peripheral and central cytokines, the latter having an 
impact on neuronal function (gut-brain interaction). Indeed, pro- 
inflammatory cytokines are known to be increased during the acute 
state of AN [82]. Similar processes are observed after several weeks 
of a high-fat diet in animals with an altered gut barrier function and 
elevated inflammation markers [83]. 

Implications for treatment of obesity and eating disorders may 
include changes of diet and the administration of pre- or pro- 
biotics or even of antibiotics. The composition of the human 
microbiome rapidly responds to a dietary change. In an inform- 
ative study by David et al. [84], the researchers compared the 
effects of a shift from usual eating habits to either a vegetarian 
or an animal-based diet. In the group on the animal-based diet, 
changes in the gut microbiome occurred already 1 day after the 
new nourishment had reached the distal gut. The microbiome 
changed to an abundance of bacteria with high bile resistance 
(Bilophila wadsworthia), which is associated with the occurrence 
of intestinal bowel disease in mouse models. Interestingly, only 
this group, but not that fed on a vegetarian diet, showed signifi- 
cant weight loss. These findings are of potential importance for 
refeeding strategies in AN. After admission to hospital, the diet 
of the starved patient is often swiftly changed from a more plant- 
derived alimentation to high-calorie food rich in carbohydrates 
and fat to achieve weight gain. This may be doing more harm than 
good. Within a very short time, the microbiome of our patients 
is affected without considering probable consequences such as 
the growth of inflammation-inducing bacteria. Interestingly, an 


increased risk for autoimmune disorders, especially Crohn’s dis- 
ease, has been described in AN [85]. 

Prebiotics are defined as ‘non-digestible food ingredients that 
beneficially affect the host by selectively stimulating the growth/ac- 
tivity of certain bacteria in the gut’ [86]. Probiotics are defined as 
living microbial components delivered to humans or animals, with 
health benefits for the host. To our knowledge, there are not suf- 
ficient data to judge the effect of either prebiotics or probiotics in 
obesity or eating disorders. However, with further understanding of 
the role of the microbiome in weight and eating disorders, there is 
hope that modification of the gut microbes can exert a positive im- 
pact on these disorders, at least in some individuals. This could foster 
a more personalized treatment in these often chronic diseases [73]. 

Changes in the gut microbiome are also associated with symp- 
toms of psychiatric disorders. Infections of the gastrointestinal tract 
are followed by anxiety-like behaviour in a rodent model (for a re- 
view, see [87]). Inflammatory bowel disease is associated with a high 
prevalence of anxiety and depressive disorders, which is much more 
pronounced than in other chronic diseases of childhood or adult- 
hood. Further, a large case-control study in the UK demonstrated 
that repeated use of antibiotics was associated with an increased risk 
for depression and anxiety [88]. In acute AN, levels of depression, 
anxiety, and eating disorder psychopathology were associated with 
the composition and diversity of the intestinal microbiota [80]. 


Conclusions 


The recent years have witnessed a veritable knowledge explosion 
regarding the molecular mechanisms of body weight regulation [8, 
9, 20, 28, 29, 89], which is currently continuing. The genetic pre- 
dispositions to AN [8, 9], and possibly other eating disorders too, 
overlap with the molecular mechanisms involved in this complex 
physiology. In addition, the genetic predispositions to diverse 
mental and somatic disorders and quantitative somatic and mental 
traits overlap [89], suggesting that novel insights may have impli- 
cations for the future treatment of eating disorders. Furthermore, 
substantial advances have been made with respect to the pathways 
involved in the adaptation to starvation and in considering addictive 
processes for both obesity and AN. Finally, microbiome-related re- 
search has come up with important findings. It appears likely that 
some of these novel findings will entail benefits for the treatment of 
patients with eating and weight disorders. 

Unfortunately, pharmacological research into obesity has suffered 
from many setbacks that have entailed withdrawals of specific drugs. 
It is crucial to re-engage the pharmaceutical industry in research re- 
quired to come up with novel treatments. The scientific basis for drug 
development in eating and weight disorders has never been as good 
as of today. The major challenge is to identify targets that will indeed 
allow the development of safe and effective drugs. The overlap of di- 
verse functional pathways suggests that a certain degree of side effects 
must be taken into account in order to achieve this goal. 
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Almost all published incidence and prevalence studies of eating 
disorders are based on data using DSM-IV diagnostic criteria that 
were obtained over at least a decade ago [1]. Most of these studies 
were conducted in specific population settings, such as schools, 
colleges, and primary care clinics, as well as from medical records. 
Information was collected either from standardized interviews or 
via self-report questionnaires. Thus, comparing the incidence 
and prevalence of eating disorders among various countries is 
more likely to be approximate than accurate. Adequate incidence 
studies are restricted by the need for large population samples and 
costs. The incidence of eating disorders, which is the number of 
new cases in a population over a specific period of time, is usu- 
ally expressed as 100,000 persons per year. Prevalence is usually 
expressed either as point prevalence, which is the number of cases 
with a disorder at a specific point in time, or lifetime prevalence, 
which is the proportion of persons that had the disorder at any 
time in their life. There are many more published prevalence, 
compared to incidence, studies of eating disorders. With varying 
sensitivity of assessments used in these studies, comparisons are 
difficult to make. 

The incidence of anorexia nervosa (AN) remained relatively 
stable between 1980 and 2000. In the United Kingdom (UK), the 
number of new cases from 1988 to 1993 was 4.2 per 100,000, and that 
from 1994 to 2000 was 4.7 per 100,000 [2]. From 1985 to 1989, the 
Netherlands had 7.4 new cases of AN per 100,000, and from 1995 to 
1999, there were 7.7 new cases per 100,000. The incidence of eating 
disorders was highest among females aged 15-19 and comprised 
40% of all cases—56.4 per 100,000 in 1985 and 109.2 per 100,000 in 
1995-1999 [3]. The incidence of males was less than 1 per 100,000 
in both countries. Incidence studies in the United States (US) are 
restricted to the state of Minnesota where the overall age-adjusted 
incidence was 15 per 100,000 in females and 1.5 per 100,000 in males 
from 1935 to 1989. Among 15- to 24-year-old females, the incidence 
rates increased from 1950 to 1984 by 1.03 per 100,000 person-years 
per calendar year. The increased rate was especially present in fe- 
males aged 10-14 [4]. 





Studies on the incidence of bulimia nervosa (BN) are sparse. 
The incidence of BN among Finnish females aged 15-18, including 
those who lacked one criteria, was 438 per 100,000 person-years 
[5]. The incidence of BN in a nationwide primary care study in the 
Netherlands was 8.6 per 100,000 person-years in 1985-1989 and de- 
creased to 6.1 per 100,000 person-years in 1995-1999 [3]. Similarly, 
the incidence of BN in the UK decreased from 12.2 per 100,000 
person years in 1993 to 6.6 per 100,000 person years in 2000. The 
incidence remained stable in women aged 10-19 at 40 per 100,000 
person years in 1993 and 2000 [2]. In the US, incidence rates from 
1980 to 1990 in Minnesota were 26.5 per 100,000 per year in females 
and 0.8 per 100,000 per year in males. The mean age of onset in fe- 
males was 23 years. Among 15- to 24-year-old females, it has be- 
come over twice as common as AN [6]. 

There are no published incidence studies for binge eating dis- 
order (BED) or DSM-IV eating disorder not otherwise specified 
(EDNOS), and there are no adequate incidence studies available 
for rumination syndrome (RS) and pica. Avoidant/restrictive food 
intake disorder (ARFID), an eating disorder diagnosis created by 
DSM-5 [7], has only been recently defined, such that there are no 
published epidemiology and prevention studies as yet. 

Two large-sample studies reported a lifetime prevalence of AN in 
the US of 0.3% among 13- to 18-year-old females and males [8], and 
0.9% among adult females and 0.3% among males [9]. An 8-year 
prospective study of adolescent girls in the United States reported 
a lifetime prevalence of AN of 0.6% by age 20 and 2.0% for atypical 
AN using DSM-IV criteria [10]. A population-based survey in six 
European countries reported a lifetime prevalence of AN of 0.9% 
among females and 0% among males [11]. In a Dutch community 
sample, over 2000 children were assessed from age 11 in 2001 to age 
19-20 in 2010. At age 19-20, there was a lifetime prevalence of 1.7% 
of DSM-5 AN among women and 0.1% among men [12]. 

The lifetime prevalence in the US for BN was 1.6% among 20-year- 
old females who were followed up for 8 years [10]. In a study across 
six European countries, the lifetime prevalence of BN was 0.88% 
among females and 0.12% among males [11]. The point prevalence 
of BN among university women in the US decreased from 4.2% in 
1982 to 1.7% in 2002 [13]. Initially, high rates for BN found in the 
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1980s were based on DSM-III criteria [14]. DSM-IV criteria from 
1994 for BN were more restrictive. Since the criteria for BN have 
become again less restrictive in DSM-5, increases in BN prevalence 
may be expected with DSM-5 criteria. 

BED appears to be the most common of the three major eating dis- 
orders, with a lifetime prevalence in the US of 3.5% among women 
and 2.0% among men [9]. In US adolescents aged 13-18 years, 
the lifetime prevalence of BED was 2.3% among females and 0.8% 
among males [8]. In a study involving six European countries, the 
lifetime prevalence of BED was found to be 1.9% among women and 
0.3% among men [11]. 

Prevalence data on RS are scant. One survey of 2163 children aged 
10-16 years in Sri Lanka found a prevalence of RS of 5.1% among 
boys and 5.0% among girls. Only 8.2% had daily symptoms, and 
62.7% had weekly symptoms [15]. Likewise, adequate prevalence 
data on pica are not available. The prevalence of pica in populations 
considered to be at risk was found to be up to 77% among preg- 
nant women, 15% among persons with intellectual disabilities, 74% 
among sub-Saharan African children, and 61% among persons with 
iron deficiency [16]. 


Outcome and mortality 


Of psychiatric disorders, AN has one of the highest mortality rates. 
A meta-analysis of 36 studies showed a standardized mortality rate 
(SMR) of 5.86 for AN, with one in five persons dying by suicide [17]. 
In the US, a 20-year follow-up study reported 6.5% deaths with an 
SMR of 4.37 for lifetime AN. The duration of illness affected the SMR, 
such that those with AN for 0-15 years had an SMR of 3.25, and those 
with AN for 15-30 years had an SMR of 6.6 [18]. An analysis of 119 
outcome studies found less than half of AN patients had full recovery, 
33% improved, and 20% developed a chronic course [19]. A Dutch 
study found that age at the time of detection and treatment predicted 
recovery, with those aged 19 years and below having an odds ratio 
of recovery of 4.3, compared to those aged 20 years and above [20]. 
A US longitudinal analysis of mortality among those aged 8-25 years 
in 906 BN clinic outpatients found a crude mortality rate of 3.9% and 
an SMR of 1.57. There was a significantly elevated risk for suicide at 
0.9%, which accounted for 23% of the deaths [21]. Outcome studies 
of BN reported recovery rates which varied from 48% [22] to 77% 
[23]. One large-sample, 12-year follow-up study of 68 German BED 
patients found a crude mortality rate of 2.9% and a non-significant 
SMR of 2.29 [24]. Most outcome studies of BED comprised EDNOS 
analyses. The overall summary is that BED remission is somewhat 
better than that of BN. Published outcome and mortality studies 
other than a few case reports are not available for RS and pica. 





Ethnicity and sociocultural demographics 


Global social and cultural changes associated with industrialization 
and urbanization have influenced the increasing prevalence of eating 
disorders among minority groups in Western Europe and North 
America, as well as in Asian and Arab countries [25]. The prevalence 
of eating disorders and disturbed eating behaviours is increasing 
in China, Singapore, South Korea, Egypt, Pakistan, United Arab 
Emirates, Oman, Lebanon, Kuwait, Iran, and Jordan [25]. 


In the twenty-first century, the NIMH Collaborative Psychiatric 
Epidemiological Studies (CPES) found no significant differences 
in prevalence for AN and BED in the US across Latinos, Asians, 
African-Americans, and non-Latino whites. BN was more preva- 
lent among Latinos and African-Americans than among non- 
Latino whites [26]. Utilization of mental health services was lower 
in all ethnic minority groups, compared to non-Latino whites [27]. 
Contributing factors include lack of health insurance, lack of afford- 
able and accessible treatment services, and beliefs about seeking 
treatment. African-American women presenting for BED treatment 
had a higher BMI and binge eating frequency than whites, and lower 
depression than white and Hispanic groups [28]. A survey among 
American college students found Arab Americans reported the 
highest prevalence of binge eating of 10% [29]. 

Social factors have also been identified to be associated with the 
development of eating disorders. In Curacao, cases of AN were found 
only in mixed-ethnicity women who reported that thinness allowed 
them greater acceptance in the more affluent white community [30]. 
Higher rates of eating disorders have been reported to be associated 
with activities in which leanness is valued, such as dancing and gym- 
nastics, as well as among jockeys [31]. 


Primary prevention 


Risk factors 


Identification of risk factors is essential for the implementation of 
effective intervention strategies. Risk factors for eating disorders 
and disordered eating behaviour diagnosed according to DSM-IV 
and DSM-5 increase in early adolescence, on which most studies on 
eating disorder risk thus focus. The majority of these studies have 
used self-report questionnaires to assess for disordered eating be- 
haviour, rather than discrete eating disorder diagnoses. 

For specific eating disorders, female adolescents in sixth to ninth 
grades reporting social pressure to be thin and body image preoccu- 
pation showed a higher risk for onset of threshold or subthreshold 
BN or BED over a 3-year follow-up [32]. High-school females re- 
porting weight and shape concerns and negative affectivity had a 
higher risk for onset of threshold and subthreshold BN over a 4- 
year follow-up [33]. High dietary restraint was associated with an 
increased risk for onset of any eating disorder among women aged 
16-23 years [34]. 

A study examining six risk factors at ages 13, 14, 15, and 16 found 
body dissatisfaction was the most consistent and robust predictor 
of a future eating disorder. Five of the risk factors were highly sig- 
nificant predictors at age 14, suggesting this age may be a key de- 
velopmental time point for preventative intervention. The authors 
proposed a combination of physical and cognitive maturational 
processes such as completion of puberty, increased capacity for ab- 
stract thinking, increased peer influence, and the onset of dating and 
sexual experiences at this age [34]. 

Moderating risk factors for AN were identified in a study of chil- 
dren and adolescents from the Danish Psychiatric Central Research 
Register. Risk factors included female sex, having a sibling with AN, 
affective disorders in family members, and five major comorbid 
disorders (affective, anxiety, obsessive-compulsive, substance use, 
and personality disorders) present in the AN proband [35]. Eating 
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disorders in either parent were independently associated with eating 
disorders in their female children in an analysis of the Stockholm 
Youth Cohort [36]. 

An analysis of obstetric records found maternal anaemia, diabetes 
mellitus, pre-eclampsia, placental infarction, neonatal cardiac prob- 
lems, and hypoactivity were significant independent predictors of 
the development of AN. The risk of developing AN increased with 
the total number of obstetric complications. In the same study, ob- 
stetric complications significantly associated with BN were: pla- 
cental infarction, neonatal hypoactivity, early eating difficulties, and 
low birthweight for gestational age [37]. There is some question on 
whether in utero exposure to a viral infection could increase the risk 
of developing AN. Exposure during the sixth month of pregnancy 
to the peak of chickenpox and rubella infections was significantly 
associated with the risk of developing AN [38]. A comparison of risk 
factors across AN, BN, and BED [39] showed that AN and BED were 
distinct from each other, with AN having greater exposure to perfec- 
tionism and BED reporting greater exposure to conduct problems, 
substance abuse, severe childhood obesity, and family overeating. 
BN shared risk factors with both AN and BED. On examining the 
AN subtypes, perfectionism carried a greater risk for restricting 
AN and family overeating while sexual abuse had a greater risk for 
the binge purge subtype of AN. Two-thirds of women with AN re- 
ported their eating disorder started with dieting, whereas two-thirds 
of women with BN and BED reported their eating disorder started 
with binge eating. 

Risk factors have also been assessed for disordered eating and 
weight loss behaviours occurring without a designated eating dis- 
order diagnosis. Depression-related symptoms and body dissatisfac- 
tion were found to be risk factors for disordered eating in overweight 
youths [40]. Dieting, depression, and body image distortion in ado- 
lescence predicted extreme weight loss behaviours and binge eating 
in young adulthood. 


Prevention programmes 


Meta-analytic reviews 


Eating disorder prevention programmes that were controlled or 
those with systematic follow-up began in the 1980s. The outcome 
of these studies was determined by the decrease in related eating 
disorder symptoms, rather that identifying the occurrence of diag- 
nosable eating disorders. Targeted populations have varied from 
at-risk individuals to all persons in specific age groups and in spe- 
cific environments such as schools and colleges. A meta-analysis 
conducted in 2007 of 66 controlled outcome studies that targeted 
persons aged 15 and above with high levels of body dissatisfaction 
and weight concerns, using interactive paradigms administered by 
professionals, found small, but significant, effect sizes of these pre- 
ventative interventions [41]. 

More recently, a study of an Internet-based prevention pro- 
gramme (Student Bodies) [42] and a meta-analytic review of six US 
and four German randomized controlled trials of 990 female high- 
school and college students [43] found that interventions produced 
moderate improvements in reduction of negative body image and 
the desire to be thin. 

Further, a meta-analysis of 20 Internet-based programmes de- 
signed to reduce eating disorder symptoms and representing three 


different interventions found these programmes effective both in 
individuals with a clinical diagnosis of an eating disorder and in 
those with symptoms only [44]. Compared with control conditions, 
these interventions reduced body dissatisfaction, internalization of 
the thin ideal, shape and weight concerns, dietary restriction, drive 
for thinness, bulimic symptoms, purging frequency, and negative 
affect. 

In a systematic review of interventions involving parents in 
the prevention of body dissatisfaction or eating disorders in chil- 
dren, only 20 studies were identified. Of these, only four had 
high-quality data, with two reporting parental involvement sig- 
nificantly improved children’s outcome on body dissatisfaction 
or disordered eating. Two studies were inconclusive on the role 
of parental involvement [45]. However, several considerations 
were noted—a child’s perception of their parent’s behaviour is 
more important than the parent’s perception of their own behav- 
iour; measuring and communicating a child’s at-risk status does 
not improve parental engagement with prevention programmes; 
and parents of a child with the diagnosis of an eating disorder 
were more likely to access prevention programmes. The benefit of 
involving parents in eating disorder treatment is well established. 
The challenge is how to motivate parents to participate in pre- 
ventative research. 


Specific prevention programmes 


The most frequently studied dissonance-based prevention pro- 
gramme is the Body Project. This paradigm is based on the 
proven theory that cognitive dissonance motivates participants 
to reduce their adherence to their original attitude. In the Body 
Project, participants are instructed to critique the thin ideal 
through verbal, written, and behavioural exercises. This is done 
in the context of small groups, with one or two clinicians, over 
four sessions. Home exercises include writing an essay about the 
costs of pursuing the thin ideal and standing in front of a mirror 
with minimal clothing while recording positive attributes about 
one’s body. Group sessions include role plays and discussion as- 
signments [46]. The Body Project was found to reduce caudate 
response to thin models in an fMRI study, providing evidence 
that this intervention reduces the idealization and significance 
attached to media images. 

Another effective prevention programme tested in randomized 
controlled trials is the Student Bodies, which is an 8-week Internet- 
based programme focusing on reducing body weight and shape 
concerns. This programme uses cognitive behavioural therapy tech- 
niques and includes weekly exercises and journal log prompts. It is 
designed to create healthier behaviour patterns with eating, exer- 
cise, sleep, mood, and emotion regulation. After completion of the 
programme, participants are provided continued access to Student 
Bodies for 9 months for reviewing material as needed [47]. 

A mindfulness- and school-based eating disorder prevention pro- 
gramme of three sessions aimed to increase the capacity to refrain 
from automatic responses when confronted with the thin ideal and 
related sociocultural pressures and to reduce the impact of any ex- 
periences with a negative affective component. The sessions include 
common coping strategies, role play, and various guided exercises. 
This programme significantly reduced weight and shape concerns, 
dietary restraint, thin ideal internalization, eating disorder symp- 
toms, and psychosocial impairment [48]. 








SECTION 16 Feeding, eating, and metabolic disorders 


Targeted populations for eating 


disorder prevention 





Several recent controlled studies have demonstrated either efficacy 
or effectiveness of school-based programmes in reducing risk fac- 
tors for eating disorders in adolescents. A Spanish study based on 
the social cognitive model, media literacy education, and cogni- 
tive dissonance found the intervention group had greater reduc- 
tion in beauty ideal internalization, disordered eating attitudes, and 
weight-related teasing, compared to the control group, at 1 year 
follow-up [49]. An Australian study using a similar intervention 
in girls and boys in seventh and eighth grades showed similar re- 
sults [50]. A US web-based intervention—Staying Fit—was ef- 
fective in reducing weight/shape concerns and improving fruit and 
vegetable consumption in ninth-grade students at risk for eating 
disorders [51]. 

College students participating in the online programme Student 
Bodies in their first college year had a reduction in body image and 
eating concerns, continuing into their sophomore year [52]. Body 
dissatisfaction and shape and weight concerns decreased, whereas 
self-esteem increased in college students participating in a brief con- 
ditioning intervention [53]. 

Women with subthreshold eating disorder syndromes responded 
to the Internet-based prevention programme Student Bodies with 
a reduction or abstinence in binge eating, purging, and other dis- 
ordered eating symptoms [54]. 

Female adolescents reporting body image dissatisfaction were 
recruited from two primary care clinics. They responded to a brief 
dissonance-based eating disorder prevention programme with re- 
duced thin ideal internalization, pressure to be thin, dieting, and 
eating disorder symptoms [55]. 

The Parent-Based Prevention (PBP) of eating disorders targeted 
risk factors and facilitated behavioural change in mothers with 
eating disorders and their spouses to mitigate negative outcomes in 
their children. This uncontrolled study suggested PBP produces de- 
creased risk of eating and mental problems in these children [56]. 


Salient issues for primary prevention of 
eating disorders 


The implementation of evidence-based eating disorder preven- 
tion programmes requires attention to issues delineated in the 
Consolidated Framework for Implementation Research (CFIR) 
model [57]. These issues, as applied to the eating disorder preven- 
tion programme Body Project, are well outlined by Shaw and Stice 
[58]. In summary: (1) a programme is more readily adopted if those 
delivering it perceive they had input into developing it; (2) there is 
belief among programme deliverers on the quality and validity of 
evidence for a desired outcome; (3) the programme has an advan- 
tage over alternative interventions; (4) the programme is accepted 
by participants and those delivering it and has low attrition; (5) the 
intervention can be adapted to meet local needs; (6) the complexity 
of the intervention does not inhibit its implementation; (7) design 
quality and packaging are appealing; and (8) costs, such as cost of 
manuals and related items, clinician training, and recruiting are 
reasonable. 

For the Body Project, the most commonly reported barriers 
to including additional groups were limited time and high staff 


turnover. For participants who do not respond to the prevention 
programmes, revisions such as increased intensity or creative ad- 
justments are necessary. 

The problems of recruitment, screening, implementing the 
intervention, and influencing the community culture have been 
addressed in the Healthy Body Image programme [59]. This pro- 
gramme targets intervention across socio-environmental levels that 
impact students’ eating and activity patterns. It aims to improve 
body esteem and eating attitudes and behaviours and to change cul- 
tural norms around nutrition and body image for a healthier campus 
community. 


Future directions 


The changing demographics of eating disorders need to be con- 
tinually monitored, so that all ethnic and socio-economic af- 
fected groups have access to adequate and financially feasible 
treatment programmes. Large population studies are needed to 
better ascertain the strength of the influence various moderating 
and mediating risk factors have on the development of eating 
disorders. 

Specific risk factors that respond to specific treatment strategies 
need to be identified to create more effective preventative interven- 
tions. All interventions need to be assessed for adherence to the pro- 
gramme, attrition rate, and effectiveness over time. 
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Eating disorders are serious psychiatric illnesses that are associ- 
ated with a broad range of somatic and psychiatric comorbidities, 
high disability-adjusted life years, and premature mortality [1-4]. 
The Diagnostic and Statistical Manual of Mental Disorders, fifth edi- 
tion (DSM-5) diagnostic category of feeding and eating disorders 
encompasses anorexia nervosa (AN), bulimia nervosa (BN), binge 
eating disorder (BED), avoidant and restrictive food intake disorder 
(ARFID), rumination disorder, pica, and other specified feeding and 
eating disorders (OSFED) [5]. Genetic research has primarily inves- 
tigated AN, BN, and BED throughout the last 25 years, with very 
little research conducted into the remaining categories. This chapter 
therefore focuses on these three eating disorders. 


























The aetiological understanding of eating disorders is closely linked 
to the understanding of complex disorder genetics, which has 
changed fundamentally in the last two decades. Genetic research has 
been divided into two major fields: quantitative and molecular gen- 
etics. In quantitative genetics, twin, adoption, and family studies are 
used to estimate the influence and relative contribution of genetic 
and environmental factors on human traits. Unlike family studies, 
twin and adoption studies have the advantage of being able to dis- 
tinguish between shared and non-shared environmental factors, 
while generating an estimate of the total heritable contribution to 
risk for developing a disorder, called heritability. Shared environ- 
mental factors are those which siblings have in common, such as 
their shared family home (which increase sibling similarity), while 
non-shared environmental factors are those which they do not have 
in common, such as activities or sports in which only one sibling 
takes part (which decrease sibling similarity) [6]. 

Within this framework of an integrative model of genetic and en- 
vironmental factors, every individual is assumed to have a certain 
genetic liability to developing an eating disorder. When individuals 
with a genetic liability that exceeds a threshold face common envir- 
onmental pressures, this may induce symptoms. If the severity or 
duration of symptoms exceeds a defined clinical threshold, this indi- 
vidual may be diagnosed as mentally or physically ill [7]. Identifying 








genetic and environmental risk factors could enable health care pro- 
fessionals to intervene at any possible stage of a disorder—before 
the onset, at the onset, during progression, and after remission—to 
guarantee the best possible prevention, diagnosis, and treatment. In 
practice, considerations of false positive rates mean that this is best 
done in high-risk populations, rather than the general population. 
However, the identification of genetic variants can also shed light 
on the underlying aetiopathology of a disorder; the identification 
and understanding of these biological changes may lead to the de- 
velopment of new medications. In molecular genetics, linkage, can- 
didate gene, and genome-wide association studies (GWAS) have 
been conducted to detect genetic variants associated with some of 
the eating disorders. These methods differ in their underlying theor- 
etical considerations and assumptions, particularly in how much of 
the genome they cover and the assumptions (or lack thereof) of what 
the underlying genetic architecture of a disorder is. 


Link studies 





Linkage studies are based on the ‘linkage’ (that is, the co-segregation 
of specific genetic regions with disorders in families), using a set 
of a few hundred to several thousand genetic markers and testing 
which appear to be inherited alongside a given trait or disorder. Co- 
segregation is tested in large family pedigrees or in many smaller 
pedigrees simultaneously. Linkage studies of eating disorders have 
yielded inconsistent results and have not been replicated. This may 
be due to a number of reasons, but fundamentally the evidence is 
that there are no genes with large effect size mutations in families 
with eating disorders that are detectable by linkage studies, which 
are unsuitable to detect genetic risk variants that have small effects. 
Linkage studies are thus currently out of favour. 


Association studies 


Association studies, such as candidate gene studies and GWAS, are 
an alternative method to investigate the genetic underpinnings of 
disorders by comparing frequencies of the different alleles of gen- 
etic markers between cases and healthy controls. Candidate gene 
studies were popular in the 1990s and 2000s and are hypothesis- 
driven studies that only included the analysis of one or a few genetic 
markers in single or several genes, with both the genes and geno- 
typed markers selected mostly based on their biological function/ 
impact. Single-nucleotide polymorphisms (SNPs) are used as genetic 
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Fig. 104.1 Genome-wide association study. Single-nucleotide polymorphisms (SNPs) are used as genetic markers for specific genetic loci. 
Genotyping chips can assay up to several millions of these genetic markers, and coverage of the genome can then be increased through the statistical 
method of imputation. Imputation predicts genotypes at unassayed locations on the basis of reference panels of the human genome. After genotyping 
of patients and controls, allele frequencies are compared between them and a logistic regression is performed to calculate B (the standardized 
regression coefficient) and odds ratios (ORs). SNPs are defined as genome-wide significant if they reach a P-value of 5 x 10°, which is P = 0.05 


corrected for all the tests done in a GWAS. 


markers and are the most common variants in the human genome. 
They are characterized by the replacement of a single nucleotide 
with another. For example, the base cytosine might appear at a spe- 
cific base location in the majority of the population but might be 
replaced with guanine in an individual. Variable numbers of tandem 
repeats (VNTRs) and microsatellites are regions of the genome with 
variable numbers of a particular sequence, with a repeating unit of 
between two and 50 or more bases. Candidate gene studies used 
these VNTRs, microsatellites, or SNPs to identify which variant of a 
gene (that is, allele) may be associated with a symptom of disordered 
eating or an eating disorder, but large-scale attempts at replication of 
these findings have not been successful and this approach has fallen 
out of favour in psychiatric genetics in general [8]. 


Genome-wide association studies 


GWAS investigate the genetics of mental disorders using infor- 
mation from, and coverage of, the whole genome in a hypothesis- 
neutral manner (Fig. 104.1). Depending on the genotyping chip used, 
500,000-1,000,000 SNPs are tested for association with a trait under 
study. In GWAS, SNPs are used because they are chemically easy to 
assay and a dense map of SNPs can ‘tag’ associations with a disorder 


at genetic loci across the genome. GWAS are limited by the fact that 
they only detect chromosomal regions which can be multigenic and 
often do not clearly indicate the causal variants. Causal variants can be 
detected in further analyses such as fine mapping, exome sequencing, 
or whole-genome sequencing [9]. Fundamentally, GWAS are highly 
dependent on sample size, as shown by the results on schizophrenia 
[10] with, in general, >15,000 cases needed for multiple robust single 
genetic associations to be detected. 

GWAS for eating disorders have not yet reached this sample size. 
However, smaller GWAS can still be used with methods such as 
linkage disequilibrium score regression, which uses summary stat- 
istics of GWAS to calculate the degree to which genetic variants are 
shared by two traits or disorders. The degree of common genetic 
variants shared between two traits or disorders is called a genetic 
correlation (r,). 


Genetic architecture of mental disorders 





Despite clear evidence from twin studies that eating disorders are 
substantially heritable, their complex genetic architecture (alongside 
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Fig. 104.2 Heritability estimates of eating disorders derived from twin studies. Heritability estimates are expressed in percentages. Genetics (white), 
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a poor understanding of their neurobiology) has stymied both linkage 
and candidate gene studies. A large body of evidence collected over 
the past decade suggests that the genetic risk for psychiatric and other 
complex disorders is—in large part—accounted for by thousands of 
different genetic variations with small effect sizes [11]. For instance, 
GWAS on schizophrenia emphasize that this highly heritable disorder 
is highly polygenic, consistent with the picture in bipolar disorder and 
major depressive disorder (MDD) [10]. The GWAS approach also af- 
fords better estimation and control of type I errors, allowing accurate 
estimation of individual DNA sample quality, as well as fine-scale ad- 
justment for ancestry differences in case-control studies—both of 
which can generate spurious associations and which have been consid- 
erable problems in candidate gene studies. 

A complete review of a largely unsuccessful decade or more of 
candidate gene and linkage studies on eating disorders can be ac- 
cessed elsewhere [12-15]. Our chapter focuses on the period of 
modern genomic approaches and the genetic overlap across the 
various eating disorders, as well as with common comorbid condi- 
tions. We start with the emergence of the first GWAS of AN and the 
formation of the Eating Disorders Working Group of the Psychiatric 
Genomics Consortium (PGC-ED) in 2013. We now review the 
quantitative and molecular (GWAS) genetic findings for AN, BN, 
and BED, contrasting these findings with GWAS results in food ad- 
diction, obesity, and BMI. 


Anorexia nervosa 


Family studies of anorexia nervosa 


Anorexia nervosa runs in families, although sporadic cases also 
do occur. First-degree relatives of affected individuals are 11 times 
more likely to develop AN than relatives of unaffected individuals 
[16]. Reflecting the considerable diagnostic crossover seen during 
the course of eating disorders, relatives of individuals with AN are 
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also at increased risk for a range of eating disorders, including full 
and partial AN syndromes, as well as OSFED [16, 17]. 


Twin studies of anorexia nervosa 


A series of twin studies in European ancestry populations have sug- 
gested a fairly wide range of heritability point estimates (Fig. 104.2); the 
heritability of varying definitions of AN ranges from 0.28 to 0.74. Apart 
from genetic influence, twin studies report a significant contribution of 
non-shared environmental factors of 24-68%, but strikingly negligible 
influence of the shared environment [18-20]. As is the case with other 
psychiatric disorders, definitions of AN that meet the full syndrome 
diagnostic criteria are associated with higher heritability estimates than 
more loosely defined phenotypes [21]. Twin studies have also addressed 
the considerable diagnostic flux across eating disorder presentations, 
revealing that the common transition from AN to BN (and less com- 
monly vice versa) may be due to shared genetic factors, with twin-based 
genetic correlations of between 0.46 and 0.79 being reported [22]. 


Twin studies and comorbidity 


AN is commonly comorbid in both clinical and epidemiological 
samples with major depression, anxiety disorders, and obsessive- 
compulsive disorder (OCD) [23]. Twin studies suggest that these 
comorbid profiles may be, at least in part, due to shared genetic 
factors. Bivariate twin studies of AN have revealed genetic correl- 
ations of 0.34, 0.20, and 0.52 with major depression [24], general- 
ized anxiety disorder [21], and OCD [25], respectively. They also 
showed a genetic overlap between AN and suicidality [26]. The gen- 
etic overlap between AN, MDD, and suicide attempts was robustly 
replicated [27]. OCD is the psychiatric disorder with the strongest 
genetic overlap with AN. 


Developmental twin studies 


Although no twin studies have been adequately powered to explore 
changes in heritability estimates for AN across development, this has 
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been examined in related phenotypes. In girls, the genetic effects on 
eating disorder symptoms increase with age [28]. Whereas environ- 
mental factors are the primary contributors to variance in eating dis- 
order phenotypes during pre-adolescence, the cumulative additive 
genetic contributions significantly increase across adolescence and 
remain constant through mid life. Work by Klump [28] implicates pu- 
berty and its hormonal changes (with consequent frontal temporal and 
other maturation of the brain), rather than a direct effect of age in this 
transition. 


Modern genomic approaches to anorexia nervosa 
Genome-wide association studies 


The first GWAS for AN was conducted by the Children’s Hospital 
of Philadelphia (CHOP) and the Price Foundation Collaborative 
Group [29]. The analysis included 1033 AN cases and 3733 paediatric 
controls. Given the small sample size and as predicted by experience 
in other psychiatric disorders, no SNPs reached genome-wide sig- 
nificance (P <5 x 10°). Also complicating the design was the fact 
that the control group had not yet passed through the age of risk for 
developing AN (mean = 12.75 years; standard deviation = 4.2 years) 
and so were effectively unscreened for eating disorders and other 
genetically related psychiatric problems, resulting in a decrease in 
statistical power to detect association. 

The second AN GWAS was performed by the Genetic Consortium 
for Anorexia Nervosa (GCAN) as part of the Wellcome Trust Case- 
Control Consortium 3 (WTCCC3). This GWAS included 2907 AN 
cases of European ancestry and 14,860 ancestry-matched female 
controls. Although this was a larger study, it is still a modest sample 
size for GWAS and no genome-wide significant loci were detected, 
although when 72 independent markers with the lowest P values 
were selected for replication, sign tests revealed that a highly signifi- 
cant 76% of these markers yielded results in the same direction in 
the discovery and replication samples [30]. These tests encouraged 
the field to continue GWAS efforts, as it suggested that the signifi- 
cant signal did exist in the data, but larger samples were required for 
detection. The controls selected for this GWAS were also not ideal 
as, although they were selected to be ancestrally compatible, they 
had been genotyped on similar (but not identical) platforms and at 
different times and in different laboratories (Fig. 104.3). 

In an effort to unite research groups and consolidate findings, the 
PGC-ED was established in 2013. The first analysis of the combined 
CHOP and WTCCC3 data sets emerged in 2016 and has reported 
the first genome-wide significant locus for AN [31]. The GWAS com- 
prised 3495 cases and 10,982 controls and now has been imputed to 
phase 3 of the 1000 Genomes Project to enable association statistics at 
millions of ungenotyped variants to be calculated [32]. A subsequent 
larger GWAS comprising 16,992 cases and 55,525 controls yielded 
eight significant loci and a pattern of genetic correlations implicating 
both psychiatric and metabolic origins of AN [33]. As has been seen in 
other mental disorders and the rest of common disorder medicine, it 
is reasonable to expect that the addition of samples will lead to further 
loci being identified, with an accelerated trajectory of significant loci. 


Low-frequency and rare variants 


A GWAS focuses on common genetic variants, which occur in 
>1% of the population. By definition, rare variants are present in 
<1% of the population. During the past 3 years, some studies were 
performed on rare variants in the aetiology of AN. A series of 
sequencing and genotyping studies by Scott-Van Zeeland et al. [34] 
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Fig. 104.3 Expected number of cases for anorexia nervosa GWAS. 
Through the joint international endeavours of the Psychiatric Genetic 
Consortium of Eating Disorders (PGD-ED), the depicted numbers of 
cases are expected to have GWAS data over the new few years. The 
increase in cases is accompanied by a considerable increase in power, 
for what other disorders have suggested as plausible effect size of GWAS 
risk variants. The power calculation assumes an OR of 1.1 and an allele 
frequency of 20%. 


on cases from the CHOP/Price Foundation cohort focused on the 
coding regions and upstream sequence of 152 candidate genes in 
1205 AN cases and 1948 controls. They reported associations with 
variants in the oestrogen receptor P (ESR2) and the epoxide hydro- 
lase 2 (EPHX2) genes in AN, identified via targeted gene sequencing. 
Case-control allele frequency differences for variants in these two 
genes were also observed in the CHOP/Price Foundation GWAS 
data set [29]. Somewhat paradoxically, hypercholesterolaemia has 
been reported in a subset of individuals with AN [35, 36], rendering 
EPHX2 an intriguing find, as it has been implicated in cholesterol 
metabolism [37]. 

Subsequently, Cui et al. conducted a high-throughput sequencing 
study with two families who were densely affected for AN and BN 
[38]. They initially conducted a linkage analysis on the first pedigree 
(20 individuals) and identified a region on chromosome 11 from 
44.1 to 64.3 cM with a significant logarithm of odds score. They 
then conducted whole-genome sequencing on two family mem- 
bers who were affected with AN and detected a missense mutation 
in the oestrogen-related receptor a (ESSRA). This pedigree was also 
enriched for OCD. In the second pedigree, enriched for both AN 
and BN cases, exome sequencing identified a rare mutation in his- 
tone deacetylase 4 (HDAC4). HeLa cell and mouse cortical tissue 
analyses demonstrated a potentially functional interaction between 
ESRRA and HDAC4, and using transcriptional activity assays, it was 
reported that HDAC4 may act as an inhibitor of ESRRA activity. 
Given the central role of histone deacetylases in the activation and 
repression of gene expression genome-wide, this interaction might 
not be unexpected. 

In order to rule in or out more common rare and functional vari- 
ation in AN, an exome-array rare variant GWAS was performed in 
2158 cases from nine populations of European origin and 15,327 
ancestrally matched controls [39]. Cases and controls were identi- 
fied from those gathered for the GCAN/WTCCC3 cohort. Sixteen 
independent variants were selected for in silico and de novo rep- 
lication (12 common, four rare). No findings reached genome- 
wide significance. The authors instead highlighted two common 


variants—rs10791286, an intronic variant in OPCML (P = 9.89 x 
10); and rs7700147, an intergenic variant located near ANKRD50 
(P = 2.93 x 10°), reaching near suggestive significance. 


Copy number variation 


Copy number variations (CNVs) are 1kb DNA segments, or 
larger, occurring in an altered quantity, compared with a refer- 
ence genome. The most common CNVs are duplications and de- 
letions. A genome-wide analysis of CNVs was also conducted on 
the CHOP/Price Foundation cohort [29]. No evidence emerged 
supporting enrichment of AN cases for CNVs above controls, 
and rare or large CNVs were not notably overrepresented in AN 
cases. Although a recurrent 13q12 deletion (1.5Mb) disrupting 
the sacsin molecular chaperone SACS was seen twice in cases and 
CNVs disrupting the contactin 6/contactin 4 (CNT'N6/CNTN4) 
region was found in multiple cases, these observations were not 
statistically significant. Yilmaz et al. [40] conducted a case-only 
genome-wide CNV survey in 1983 female AN cases that were part 
of the GCAN/WTCCC3 AN GWAS. Their case-only approach ex- 
plored whether pathogenic CNVs implicated in other psychiatric 
and neurodevelopmental disorders were also observed in AN cases. 
Four of these well-established pathogenic CNVs (deletions or du- 
plications in 1q21.1, 7q36.3, 15q13.3, or 16p11.2) were found in a 
small number of AN cases. One case also had a large deletion in the 
13q12 region [29], and 41 cases had deletions or duplications which 
were 1 Mb or larger. However, at this point, it is not clear whether 
large-effect CNVs play a demonstrable role in AN. Larger sample 
sizes are required before the effect of smaller-effect CNVs can be 
ruled out. 


Genetic correlations with psychiatric phenotypes 


A genetic correlation is an estimate of the genetic overlap between 
two traits. An analytic extension of linkage disequilibrium score re- 
gression [41] enables computation of cross-trait genetic correlations, 
and intriguing insights for AN have emerged. Bulik-Sullivan et al. 
[42] revealed a significant positive genetic correlation between AN 
and schizophrenia (r, = 0.26), with positive, but non-significant, cor- 
relations with major depression and bipolar disorder. A second ana- 
lysis replicated the positive correlation with schizophrenia (r, = 0.15) 
and revealed a strong positive genetic correlation between AN and 
OCD (r, = 0.55) [33, 43]. The high genetic correlation between AN 
and OCD parallels twin-based genetic correlations (r, = 0.52) and re- 
flects common comorbidity profiles seen clinically in AN [25, 44, 45]. 
The positive correlation with schizophrenia is less readily explicable, 
as this is not a commonly observed clinical comorbidity profile, but it 
should be noted that OCD also has a strong genetic correlation with 
schizophrenia and so could be mediating the observed correlation. 
However, comorbidity with schizophrenia has been documented in 
individuals with AN, although it has not been extensively explored 
[46]. The positive genetic correlation with schizophrenia thus does 
raise an intriguing question about whether some of the more per- 
plexing features of AN, such as a distorted body image, as well as 
overvalued ideation about food and weight, may have a psychotic 
component. 


Genetic correlations with metabolic phenotypes 


Additional intriguing genetic correlations emerged between 
AN and an array of anthropometric and metabolic phenotypes. 
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Bulik-Sullivan etal. first reported a significant negative correlation 
between AN and obesity (r, = -0.20) and BMI (r, = -.18), sug- 
gesting that the same genes may influence extreme dysregulation 
of body mass in both directions. Other genetic correlations be- 
tween AN and metabolic measures were with high-density lipo- 
protein cholesterol, fasting insulin, fasting glucose, and type 2 
diabetes. Furthermore, AN was genetically correlated with phys- 
ical activity [33]. 


Bulimia nervosa 


Family studies of bulimia nervosa 


BN has been shown to be familial and genetically overlaps with AN; 
individuals with relatives who suffer from either disorder are also 
more likely to develop either AN or BN [16, 17]. However, relatives 
of cases suffering from the restrictive anorexia subtype do not show 
an increased liability to develop BN [47], suggesting that the overlap 
between the restricting type of AN and BN is, at most, modest [12]. 


Twin studies of bulimia nervosa 


The twin-based heritability for BN ranges between 0.55 and 0.62 [20, 
22, 48, 49], suggesting that about 60% of the variance in liability can 
be explained by genetics (Fig. 104.2). The diagnosis of BN is some- 
what unreliable in population-based samples but can be improved 
by including behavioural components of BN such as self-induced 
vomiting [50]. A twin study on specific BN symptoms showed 
that BN core symptoms are differentially heritable, such that spe- 
cific symptoms, such as vomiting, are more heritable than, for ex- 
ample, the influence of weight on self-evaluation [51]. The genetic 
correlation reported by twin studies between the two key compo- 
nents of BN—binge eating and vomiting—is high, but not absolute 
(r, = 0.74), and self-induced vomiting exhibited higher heritability 
of 72% vs 46% reported for binge eating alone [52]. Various twin 
studies have conclusively demonstrated that most of the variance in 
risk for BN can be explained by additive genetic effects [20, 22, 48, 
49, 51]. One twin study confirmed the findings of family studies and 
reported the genetic correlation between AN and BN to be 0.79 [22]. 
This strong genetic correlation suggests that considerable overlap in 
the underlying biology of the two disorders may contribute to the 
observed diagnostic flux [12]. 


Twin studies and comorbidity 


BN occurs with concurrent mental illnesses. Bivariate twin studies 
of BN revealed a genetic overlap with major depression [53], phobia, 
panic disorder [54], and substance use disorder [55]. BN patients are 
at increased risk of suicide, which is partly influenced by common 
genetics [26]. 


Modern genomic approaches to bulimia nervosa 
Genome-wide association studies 


Although there has been no genome-wide study testing BN per se, 
one major GWAS on eating behaviours investigated the bulimia 
subscale of the Eating Disorder Inventory-2. The study tested six 
eating-related traits, including drive for thinness, body dissatis- 
faction, bulimia, weight fluctuation symptoms, breakfast-skipping 
behaviour, and childhood obsessive-compulsive personality 
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disorder traits. Furthermore, an arbitrary cut-off for case/control 
status determination of the bulimia scale was used, and unfortu- 
nately this was a case definition which does not fully satisfy ICD- 
10 or DSM-IV diagnostic criteria. The study design utilized one 
discovery sample and two replication samples. The study tested 
samples that were relatively small (approximately 2500-3000) for 
a GWAS, with no genetic marker reaching genome-wide signifi- 
cance. However, the effect sizes in the discovery sample and the 
replication samples were significantly in the same direction. In this 
study, the strongest (although non-significant) association with the 
bulimia scale was intergenic, lying between the cyclin L1 (CCNL1) 
and the leucine glutamate, leucine-rich 1 (LEKR1) genes which had 
previously shown multiple associations with fetal growth, birth- 
weight, elevated insulin release, and breast cancer [56]. Effect sizes 
in the discovery and replication samples are concordant. GWAS of 
other psychiatric disorders have resulted in significant findings as 
sample sizes have increased. Together, these strongly suggest that 
increasing the sample size will enable eating disorder GWASs to 
reach sufficient statistical power in future. 


Binge eating disorder 


Genetic research on BED is in its infancy in part due to the fact that 
it only became an independent diagnosis in DSM-5 in 2013. Current 
research focuses on neurotransmitter systems, such as dopamine 
and endogenous opioids, which may underlie the human central 
nervous reward system. Changes in the reward system are associ- 
ated with addictive behaviour. 


Family studies of binge eating disorder 


BED aggregates in families with a heritability of 57% [57]. Family 
studies [58] also showed a genetic overlap between obesity and BED. 
These results were replicated in additional twin studies. 


Twin studies of binge eating disorder 


Twin studies of both the disorder and its cardinal symptom—binge 
eating—revealed consistently moderate to strong heritabilities of 
between 33% and 69% [57, 59-61]. 


Twin studies and comorbidities 


BED is associated with obesity and the metabolic syndrome [62]. 
Twin studies revealed that both obesity and BED are influenced by 
partly shared and partly different genetic factors [63]. The direction 
of causality between obesity and BED is not fully elucidated and 
must be further investigated. Furthermore, BED exhibits a signifi- 
cant genetic overlap with substance use disorder [64]. Similar to the 
observation about other eating disorders, BED has also a significant 
genetic overlap with suicidality [26]. 

All of these observations have emerged from quantitative genetic 
studies, as limited molecular genetic findings and no genome-wide 
findings for BED exist. 


Modern genomic approaches to binge eating disorder 
Genome-wide association studies 


A GWAS on BED per se has not yet been conducted. However, the 
phenotypic overlap with obesity and parallel studies of phenotypes, 
such as food addiction, are relevant. 


Food preference patterns and food addiction 


Neuroimaging studies reported altered dopamine signalling in 
the fronto-striatal neuronal circuitry involved in reward and self- 
regulatory processes [65]. From an evolutionary biology perspec- 
tive, supported by epidemiological and neuroscientific evidence, 
certain foods, such as processed foods high in sugar, fat, and salt, 
may have an addictive potential similar to psychoactive substances 
such as cocaine or alcohol [66, 67]. This provides the foundation 
for the so-called construct of ‘food addiction, which is assessed by 
self-report using the Yale Food Addiction Scale (YFAS), based on 
the DSM-IV diagnostic criteria for substance use disorders. Food 
addiction is separate from BED but is an overlapping construct that 
has evolved from different clinical and research areas. Genetic find- 
ings for food addiction must also be interpreted cautiously because 
most results are inferred from small sample sizes [68]. 

Food addiction is highly prevalent in patients suffering from BED 
[69]. A 2013 study investigated food addiction in individuals with 
BED. Two subgroups of BED patients with or without food addic- 
tion were equivalent in age and BMI, but BED patients also suffering 
from food addiction reported higher rates of emotional eating and 
rated themselves more responsive to rewarding food types [70]. 


Twin studies of food preference patterns 


Twin studies on the consumption of sugar through drinks (48%) [71], 
liking for sweet solutions (49%), liking for sweet foods (54%), use 
frequency of sweet foods (53%) [72], and food preference patterns 
(36-58%) [73] showed that these traits are moderately heritable. 


Twin studies and comorbidities 


One twin study showed that the association between high sugar 
consumption and substance use disorder is 59% genetically driven. 
These findings underpin the hypothesis that food addiction may be 
an independent entity. 


Modern genomic approaches to food addiction 
Genome-wide association studies 


A quantitative trait GWAS on the modified form of the YEAS was 
performed by Cornelis et al. [74] in 9314 women and identified 
genetic loci associated with food addiction but produced limited 
evidence for shared genetics with substance use disorder. Two 
genome-wide significant hits close to protein kinase C a (PRKCA) 
and neurotrimin (NTM) were discovered. PRKCA is involved in in- 
sulin signalling, inflammation, and the mitogen-activated protein 
kinase (MAPK) pathway. Neurotrimin was associated with opioid 
binding protein/cell adhesion molecule-like (OPCML). OPCML was 
also associated with alcohol dependence and body fat distribution 
[74]. However, these results require replication. 


Body mass index, obesity, and body fat genetics 





in relation to eating disorders 


Obesity has been considered as a potential behavioural disorder, and 
the nature of the relation of eating disorders with BMI (body weight) 
and body fat is of considerable scientific and medical interest. 
Obesity, BMI, and body fat percentage are highly heritable, poly- 
genic, and multifactorial human traits. Body weight is influenced 


by a complex interplay of genetic, environmental, and psychosocial 
factors. These three factors affect energy intake and energy con- 
sumption mediated by the central nervous and metabolic systems 
in humans [75]. BMI is commonly used as a proxy measure for both 
weight and body fat. However, its application has been widely criti- 
cized in healthy individuals, as well as in eating disorder patients 
[76, 77]. 


Family and adoption studies of body mass index 


Family studies generally report heritability for BMI ranging from 
20% to 80%, with the observed heritability increasing with age. Data 
from adoption studies replicated these findings, with genetic factors 
accounting for 20-60% of the variation in BMI. Adoption studies 
showed no association between the BMI of adoptees and their adop- 
tive parents, whereas the adoptee’s BMI was associated with the bio- 
logical parents’ BMI [78]. 


Twin studies of body mass index and body fat 
Twin studies 


The (adult) twin-based heritability of BMI ranges between 59% and 
64% [79, 80], with heritability estimates of about 70% for twins who 
were reared apart, affirming the high genetic influence [81]. Twin 
studies also estimated high heritability for body fat and its distri- 
bution. The heritability for total body fat is about 86%, for trunk 
fat about 82%, and for lower body fat about 83% [82, 83]. Parental 
overweight and adiposity are established risk factors for childhood 
obesity that are mainly genetically determined [84]. 


Developmental twin studies 


An extensive developmental twin study performed by Silventoinen 
et al. [85] shed new light on the influence of age, sex, and socio- 
economic background on BMI. The heritability of BMI increases 
from 41% after the age of 5 to 78% at the age of 19 and remains 
stable for the rest of the lifespan. Despite the increasing heritability, 
the variance of BMI also increases after the age of 5. Two possible 
explanations are hypothesized—either children may be able to 
choose their food more independently after the age of 5 or neuronal 
pathways involved in appetite regulation may be influenced by age- 
dependent gene expression. Furthermore, sex differences in BMI 
and its heritability are a well-known phenomenon and were repli- 
cated, which means that different genes have differential effects in 
both sexes. Different sets of genes seem to influence BMI before and 
after puberty which may be influenced by hormonal changes. 
North America and Australia showed regional differences in BMI, 
compared with Europe and East Asia, having a higher mean and a 
greater variance. Including poorer socio-economic regions into the 
heritability analysis of BMI led to lower heritability estimates, with 
an increasing influence of shared environmental factors [85]. This 
observation may be explained by a restricted presentation of the 
phenotype due to limited food resources in poorer areas. 


Modern genomic approaches to weight and body fat 
Genome-wide association studies 

GWAS on human obesity detected 227 independent genetic vari- 
ants that are associated with BMI. These 227 SNPs were narrowed 


down to 97 genetic loci. These genetic loci are enriched in various 
biological pathways such as central nervous and peripheral appetite 
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regulation, adipocyte differentiation, insulin and leptin signalling, 
lipid metabolism, and gut microbiota. In total, common SNPs are 
estimated to explain about 27-30% of the phenotypic variance of 
BMI [86], but the identified genome-wide significant obesity SNPs 
only explain about approximately 3% of this variance, indicating 
that this is a highly polygenic trait [87]. Only two of the 97 genetic 
loci showed sex differences, with stronger effects in women being 
observed in each case. 

GWAS of body fat distribution investigating the waist-to-hip 
ratio found 49 significant genomic loci [88, 89]. The waist-to-hip 
ratio was adjusted for the effect of BMI, to avoid detection of gen- 
omic loci having an effect on BMI. With 19 of the 49 loci showing 
a larger effect size in women, compared with men, considerable 
genetic sex dimorphism was uncovered. A GWAS conducted on 
body fat percentage revealed 12 genetic loci that were genome- 
wide significant. Seven of the loci showed a larger effect on body fat 
percentage, compared with BMI, whereas the rest showed a larger 
effect on BMI, suggesting an association with both fat and fat-free 
mass [90]. 


Genetic correlations with eating disorders 


As discussed earlier, Bulik-Sullivan et al. [41] achieved a major 
breakthrough when they developed the method of linkage disequi- 
librium score regression to calculate genetic correlations between 
different traits. They reported a significant negative correlation be- 
tween AN and obesity (r, = -0.20) and BMI (r, = -.18), suggesting 
that shared genetic variants between AN and body fat may influence 
body mass in opposite directions. This negative genetic correlation 
paved the way for a more detailed exploration of AN as both a meta- 
bolic and a psychiatric disorder. 

Comparing results from two genome-wide meta-analyses of AN 
and BMI, Hinney et al. [91] identified three independent genomic 
loci that were associated with both AN anda lower BMI. These three 
independent loci were marked by nine different SNPs. The analysis 
was stratified by sex and showed that the associated SNPs showed a 
sex-specific effect, such that the SNPs were more strongly associated 
with BMI in females, compared with males. The association with 
the region on chromosome 10 was mainly due to females. The four 
genes located closest to the chromosomal loci are involved in brown 
adipose tissue function and regulation, preadipocyte proliferation, 
and BDNF expression. 


Conclusions 


A large body of evidence shows unequivocally that eating disorders 
are influenced by genetics. However, these findings must be inter- 
preted cautiously. Firstly, although genetics play a significant role 
in the aetiology of eating disorders, it is important to bear in mind 
that the development of an eating disorder is also substantially in- 
fluenced by environmental factors. Secondly, genetics is a highly 
complex, quickly developing area of research that is often difficult to 
comprehend and is even more so for patients and the public. Finally, 
it is important to communicate results correctly and in an under- 
standable manner. It should be explained that genetics underlying 
eating disorders are neither deterministic nor a genetic defect and 
that there is no evidence that one single gene causes any of the 
common eating disorders. 
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Large numbers of eating disorder patients need to be studied more 
systematically to redefine the phenotypes of eating disorders. This so- 
called ‘deep phenotyping’ as part of ‘phenomics’ allows us to clarify 
diagnoses. More specific diagnoses incorporating various quantita- 
tive measures might subsequently increase the power to detect gen- 
etic variants and environmental risk factors. Increasing the sample 
size is a further approach to increase statistical power. Both strategies 
are currently being utilized by the PGC-ED on an international level. 
This gives hope to identify more genetic variants, environmental risk 
factors, and critical developmental stages involved in the aetiology of 
eating disorders. These identified risk factors and critical periods may 
ultimately be translated into more effective preventive and curative 
measures, but this goal can only be reached by joint efforts and ex- 
pertise of clinicians, epidemiologists, neuroscientists, and geneticists. 
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Neuroimaging has helped refocus thinking about the aetiology and 
formulation of the pathophysiology of eating disorders (EDs) in the 
past few decades [1]. Cultural pressures to be thin were once theor- 
ized to underlie or perpetuate anorexia nervosa (AN) and bulimia 
nervosa (BN) almost exclusively. It is now recognized that a bio- 
logical basis exists for these disorders, and neuroimaging studies 
provide evidence that the behavioural symptoms of EDs are asso- 
ciated with altered brain function and structure [2]. New insights 
regarding the brain pathways involved in the processing of general 
and taste reward, cognitive inhibition, and interoception are of par- 
ticular importance. This chapter will focus primarily on functional 
alterations in AN, BN, and binge eating disorder (BED). The feeding 
disorders, including pica, avoidant restrictive food intake disorder 
(ARFID), and rumination disorder, will also be briefly discussed. 

EDs are severe psychiatric diseases with significant physical and 
mental health complications. EDs occur more commonly among fe- 
males than males, with the typical age of onset during early adoles- 
cence [3]. AN has the highest mortality rate of all mental illnesses 
and is characterized by restriction of energy intake, leading to sig- 
nificantly low body weight, intense fear of gaining weight, and a 
marked disturbance in the experience of one’s weight and shape [4]. 
Individuals suffering from the restricting subtype of AN (AN-R) 
limit their caloric intake and sometimes also over-exercise, whereas 
those with the binge eating/purging type eat large amounts of food 
in a short period of time, accompanied by a sense of loss of control, 
and/or engage in compensatory behaviours (that is, self-induced 
vomiting and use of laxatives or diet pills) to prevent weight gain 
[4]. BN is characterized by average or above average body weight 
and regular binge eating and compensatory behaviours. The binge 
eating episodes of individuals with BN tend to include sweet, high- 
fat foods, and dietary restriction between binge eating episodes is 
common [5]. Finally, BED is associated with an increased risk of 
obesity and is characterized by recurrent episodes of binge eating 
without compensatory behaviours and marked distress about binge 
eating [4]. Across diagnoses, EDs are difficult to treat and have a 
high rate of relapse [6]. 

A consistent body of evidence suggests that structural and func- 
tional alterations in circuitry involved in taste and reward pro- 
cessing, inhibitory control, and interoception may underlie eating 





disorder pathology [7-12]. As such, this chapter will detail these 
neural circuits and associated findings in EDs. Directions for future 
research to better understand the aetiology of these complex dis- 
orders and implications for treatment will also be discussed. 











Growing evidence suggests structural brain abnormalities among 
eating-disordered individuals, with the majority of research fo- 
cusing on differences in brain volume. Among participants with 
AN, the most pronounced differences are observed during the acute 
stages of illness [13]. A systematic review of structural MRI studies 
reveals significantly lower grey matter (GM) volume and greater 
cerebrospinal fluid (CSF) volume among ill AN patients, relative 
to controls [14]. Lower GM volume was found in areas including 
the dorsal and rostral anterior cingulate cortex (ACC), right frontal 
operculum, left cerebellum, and right insula. While one study shows 
lower global GM volume among individuals remitted from anorexia 
nervosa (RAN), relative to controls, another reveals no differences 
in GM, white matter (WM), or CSF volume between the two groups 
[14]. No differences were found in WM volume between acute AN, 
RAN, and age-matched healthy controls (HC) [15]. Longitudinal 
research in AN suggests that GM volumetric differences normalize 
upon recovery [14], suggesting state-specific findings are likely due 
to malnutrition. 

Inconsistent findings are also seen among patients with BN, with 
one study finding no differences in GM, WM, or CSF volume, com- 
pared to healthy participants [14]. However, another reveals that 
those with BN show greater GM volume than HC in the medial 
orbitofrontal cortex (OFC) and striatum. Although not significant, 
individuals remitted from bulimia nervosa (RBN) showed larger 
GM volume in the insula, relative to HC [14]. Finally, individuals 
with BED showed increased GM volume of the ACC and medial 
OFC, relative to controls [14]. Alterations in structural volume 
among individuals with EDs may be related to the extreme eating 
patterns observed among these participants. 


Structural connect 





Only one study has examined structural connectivity among indi- 
viduals with EDs. Diffusion-weighted MRI revealed that individuals 
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with RAN showed abnormal structural connectivity in regions 
involving frontal, basal ganglia, and posterior cingulate nodes [16], 
relative to HC. RAN individuals exhibited longer normalized mean 
path lengths in the right caudal anterior cingulate and right pos- 
terior cingulate, which correlated to poor insight. Such findings in 
neural systems involved in habit formation and reward may underlie 
the establishment and maintenance of compulsive behaviours char- 
acteristic of AN [16]. 


Functional imaging 


Several studies have utilized resting-state functional magnetic res- 
onance imaging (rsfMRI), a method that allows for making tem- 
poral correlations between brain areas using task-independent 
changes of blood oxygen level-dependent (BOLD) signals [17] 
to examine functional connectivity in EDs. A review of rsfMRI 
studies in AN suggests differences in functional connectivity be- 
tween AN and HC in the dorsal ACC (dACC), thalamus, and right 
inferior frontal gyrus (IFG) [18]. AN exhibit decreased connect- 
ivity between the sensory-motor and visual networks, relative to 
HC [19]. Additionally, acute AN show decreased connectivity in 
the thalamo-insular subnetwork, relative to HC, reflecting alter- 
ations in neural systems involved in pain, body size perception, and 
hunger [20]. Increased functional connectivity in the default mode 
network (DMN), a region involved in introspective rumination, 
low-level arousal, and self-regulation [21], was found in AN, rela- 
tive to HC [22]. Compared to HC, RAN subjects showed alterations 
in the DMN [23] and stronger functional connectivity between the 
dorsolateral prefrontal cortex (DLPFC) and medial OFC (mOFC) 
during a monetary incentive delay task [24]. The latter findings in- 
dicate greater cognitive control relative to motivationally salient 
cues among RAN. However, another study showed no differences, 
compared to controls [25]. Discrepant findings may be due to meth- 
odological differences, including participant characteristics and 
resting-state approaches [18]. 

Studies comparing AN, BN, and HC showed that AN individuals 
demonstrated stronger synchronous activity between the dACC and 
the retrosplenial cortex, while those with BN showed stronger syn- 
chronous activity between the dACC and the mOFC [26], relative to 
HC. AN and BN individuals showed stronger activity between the 
dACC and the precuneus, correlating to higher rates of concern with 
body shape on the Body Shape Questionnaire [26]. These findings 
indicate that greater synchrony between such regions may be asso- 
ciated with preoccupation with eating and body shape among indi- 
viduals with EDs. Alternatively, one study using functional magnetic 
resonance imaging (fMRI), voxel-mirrored homotopic connectivity 
(VMHC), and regional interhemispheric spectral coherence (IHSC) 
revealed significantly reduced interhemispheric functional con- 
nectivity in patients with acute AN and BN, compared to HC [27]. 
Specifically, lower interhemispheric functional connectivity was 
shown in the precuneus, cerebellum, and posterior insula among 
AN patients vs HC, and in the frontal lobe of BN patients, compared 
to HC. Interhemispheric functional connectivity was higher in the 
Slow-5 band, a frequency range in which the majority of GM oscil- 
lations primarily occur [28], in all regions except the insula. There 
were no group differences in left-right structural asymmetries or in 
WM or GM volume. Functional connectivity anomalies, despite the 


absence of structural differences between groups, indicate that the 
acute phase of EDs are associated with decreased interhemispheric 
connectivity in regions involved in cognitive control and reward 
processing [27]. 





General reward processing in eating disorders 


Neuroimaging evidence supports a role for altered reward pro- 
cessing in the development and maintenance of EDs [29]. Briefly, 
ventral limbic neural circuitry, which includes the amygdala, rostral 
ACC, ventromedial prefrontal cortex (vmPFC), and anterior ven- 
tral striatum, is involved in identifying and processing rewarding or 
emotionally significant information and in generating affective re- 
sponses to this information [30]. Through a series of non-reciprocal 
connections, dopamine (DA)-mediated reward and motor infor- 
mation travels via the limbic circuit to executive and sensorimotor 
areas of the striatum [31]. Together with cognitive control and sen- 
sorimotor circuits, these circuits code the rewarding value of food 
and non-food stimuli and evaluate and integrate reward predic- 
tions to guide behaviours [32]. The majority of research on illness- 
specific reward processing in EDs is based on studies examining 
responses to food and taste stimuli, though some studies have also 
examined general reward processing (for example, money) to avoid 
potentially eliciting symptoms caused by disorder-specific stimuli. 
Determining whether alterations in reward processing are specific 
to food or whether they generalize to secondary reinforcers (for ex- 
ample, money) can help elucidate neural mechanisms underlying 
ED pathology and identify shared and unique alterations across ED 
and non-ED psychiatric conditions. 


Anorexia nervosa 


Individuals with AN often display anhedonia and find little pleasure 
outside of the pursuit of weight loss [33]. They also tend to show an 
enhanced capacity to delay reward, relative to their healthy peers, 
though only when ill [34]. Many of these characteristics typically per- 
sist to some degree following remission [35], suggesting trait-, rather 
than state-, related symptoms of eating disorders [36]. Individuals ill 
with, and remitted from, AN (RAN) show altered DA function [37] 
and altered brain response, particularly in striatal circuits [1, 36, 38], 
suggestive of abnormal reward processing. Moreover, increased DA 
release in the precommissural dorsal caudate has been associated 
with increased anxiety in RAN [39]. Since ingestion of palatable 
foods is associated with an increase in striatal DA [40], food restric- 
tion is thought to be an effective means of reducing the stressful ex- 
perience of endogenous DA release among individuals with AN [7]. 

In one study, RAN adults failed to show a difference in ventral stri- 
atal response to monetary gains vs losses, suggesting an insensitivity 
to the distinction between positive and negative feedback that may 
contribute to anhedonia, difficulty with making decisions, and de- 
creased food intake [8]. Adolescents with AN also show exaggerated 
neural responsivity to punishment, relative to controls, suggesting 
an altered ability to code for both the value and valence of poten- 
tial outcomes [9]. Moreover, in contrast to healthy control women, 
who exhibit a greater reward response when hungry, the reward re- 
sponse in RAN participants does not differ significantly between 
hunger and satiety during a delay discounting task, suggesting re- 
duced sensitivity to the motivational drive of metabolic state when 


determining the value of rewarding stimuli in AN [41]. Findings 
from this study suggest that hunger does not increase reward among 
RAN and that such individuals fail to integrate the physiological 
state and reward motivation into decision-making [41]. Reduced 
capacity to experience reward is thought to serve as a mechanism to 
ignore hunger and engage in prolonged periods of fasting, even in 
the face of physical starvation [41]. 


Bulimia nervosa 


Individuals who engage in binge eating and purging behaviours 
are typically oversensitive to reward and often engage in sensation- 
seeking behaviours [35]. Alterations in reward circuitry are seen in 
individuals with BN. For example, reduced CSF concentrations of 
the DA metabolite homovanillic acid (HVA) are associated with in- 
creased frequency of binge eating behaviours among individuals ill 
with BN [42]. Positron emission tomography (PET) studies reveal 
a trend towards decreased D2/D3 receptor binding potential in the 
posterior putamen and caudate in ill BN participants, with an in- 
verse association between binge eating and vomiting frequencies 
and striatal DA response [43]. Interestingly, individuals remitted 
from BN (RBN) and participants with the binge eating/purging sub- 
type of AN have normal DA D2/D3 receptor binding and CSF HVA 
levels [39]. Similar to RAN women, RBN women also fail to differ- 
entiate between wins and losses in ventral-striatal regions during 
a monetary choice task, suggesting an inability to differentiate be- 
tween positive and negative feedback [11]. These findings may indi- 
cate elevated sensitivity to both reward and punishment among BN 
individuals, even after recovery [44]. 


Binge eating disorder 


Altered reward processing also has been observed among individ- 
uals with BED. Following treatment, obese individuals with BED, 
relative to obese individuals without BED, demonstrated dimin- 
ished activation in reward circuitry during both the anticipation 
of rewards and actual reward processing [45]. Individuals with the 
greatest hypoactivation activation during reward processing en- 
gaged in more frequent binge eating [45]. In sum, research suggests 
that across ED diagnoses, altered reward processing, which may not 
be specific to food reward, likely contributes to disordered eating. 


Taste and food reward processing 


Appetite is a physiological motivational drive that depends on the 
interaction between an individuals homeostatic needs, the re- 
warding aspects of food, and the cognitive ability to choose alter- 
native behaviours to eating [36]. Cortical taste circuitry is relatively 
well defined. Animal studies show that sweet taste perception is 
mediated by tongue receptors that send signals through the brain- 
stem and the thalamus to the primary gustatory cortex in the insula 
[46]. In humans, the anterior insula receives chemosensory taste in- 
formation from the mid insula where it is integrated with homeo- 
static drives and reward value to guide motivated eating behaviour 
[47, 48]. Together with the amygdala, which valuates the emotional 
saliency of food tastes and textures [49], the anterior insula projects 
along the ventral aspects of the striatum to regulate palatable food 
intake [50]. The mOFC assesses the pleasantness of taste or smell 
[51] and projects to the subgenual and rostral ACC. ACC subregions 
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integrate reward with automatic function and motor control [52] 
and contribute to appetite [53]. The OFC also processes how much 
food an individual has eaten and guides decisions about when to 
stop eating one food type, while still remaining interested in other 
types of foods [54]. This ventral (limbic) neurocircuit thus translates 
taste signals into food reward and guides the approach towards, or 
avoidance of, food [36] (Fig. 105.1). 


Anorexia nervosa 


One approach to examining food reward processing in EDs is to 
directly interrogate gustatory processing. Studying brain response 
to administration of actual tastes permits direct examination of 
whether individuals with EDs show alterations specific to encoding 
the rewarding aspects of food. Imaging studies in RAN show de- 
creased ACC, insula, and striatal activation in response to expected 
sweet taste receipt, relative to controls (Fig. 105.2) [55, 56], although 
these studies do not directly compare responses when hungry vs sa- 
tiated. Decreased brain response to palatable taste supports a role 
for altered reward processing of gustatory stimuli in AN. However, 
data from another study suggest striatal and insular activation in 
response to pleasant and aversive taste is increased in RAN when 
taste is less predictable, potentially indicating a hypersensitive sa- 
lience response to unexpected food among this population [57]. 
Discrepant findings may relate to the predictability of the delivered 
taste, rather than to the tastes themselves. In addition, individuals 
with AN showed greater right amygdala and left medial temporal 
gyrus activity in response to chocolate milk than controls [58]. The 
amygdala is involved in the acquisition of conditioned emotional re- 
sponses to neutral stimuli and is activated during fear [59]. Further 
research is needed to clarify the role of altered activation in response 
to tastes in AN. 


Bulimia nervosa 


Imaging studies of BN have been somewhat inconsistent. Some re- 
search suggests reduced reward response to expected taste receipt 
in BN, potentially suggesting reward habituation to food [60]. It is 
possible that a decreased subjective sense of reward while eating 
may contribute to overeating in predisposed individuals [61] or that 
recurrent episodes of binge eating over time contribute to blunted 
reward responses [62]. However, other findings showed reduced 
reward response to unexpected taste receipt among ill BN individ- 
uals, relative to controls [10]. Exaggerated hedonic response to pre- 
dictable receipt of palatable foods also has been observed among 
individuals with BN [5, 55]. Responsivity of reward regions during 
anticipation, but not receipt, of palatable food is associated with in- 
creased negative affect among women with BN [63]. This relation- 
ship has been interpreted to suggest that greater reward anticipation 
activation during states of negative affect in BN may make individ- 
uals with BN more likely to consume high-energy foods, and nega- 
tive affect could increase the rewarding value of food if used as a 
temporary tool to regulate mood [63]. 


Reward-based learning 


Anorexia nervosa 


As learning from feedback shapes behaviour, altered feed- 
back learning could contribute to the onset and maintenance of 
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Fig. 105.1 (see Colour Plate section) Cortical taste circuitry. Chemoreceptors on the tongue detect a sweet taste, which is transmitted through 
brainstem and thalamic centres to the primary gustatory cortex, adjacent to, and interconnected with, the insula. The anterior insula is a part of the 
ventral neurocircuit and is connected with the ACC and OFC. Cortical structures in the ventral neurocircuit send signals to the ventral striatum, while 
structures involved in cognitive strategies send input to the dorsal striatum. Taste and motivation are integrated in a decision to approach or avoid food. 
The figure links cortical structures with arrows, and all cortical structures project to the striatum in a topographic manner. 

Reproduced from Nat Rev Neurosci., 10(8), Kaye W, Fudge J, Paulus M, New insight into symptoms and neurocircuit function of anorexia nervosa, pp. 573-84, Copyright 


(2009), with permission from Springer Nature. 


disordered eating. Reward-based learning can be studied by exam- 
ining brain response to receipt of expected or unexpected reward 
stimuli [64]. DA is released during expectation of a reward, whereas 
omission of an expected reward is followed by a decrease in DA 
[65]. The difference between the value of the reward stimulus re- 
ceived and the stimulus predicted is known as the prediction error 


(Fig. 105.3) [66] and is implicated in EDs [1, 10]. RAN participants 
exhibited heightened anticipatory responses to food images [67], 
and individuals ill with AN showed a higher brain response to un- 
expected taste stimuli than controls, suggesting heightened anx- 
iety and responsivity to anticipated and unanticipated food stimuli 
in AN [68]. Adolescents with AN, relative to controls, showed a 


CHAPTER 105 Imaging of feeding and eating disorders 





CW: r = 0.656, p = 0.006 






















































































Left Insula 4 
1.2 v 
6 34 o 
= 1.0 4 rs) 
© A 24 00 o o 
E 084 È 
£ £ 14 o o o 
3, 0.6 5 2 
S 044 ge 04 
6 0.2 4 8 -14 oo ð 
oS £ 
& 00 < T T T o et T 7 5 -24 0 oo 
n ed a 
xs -024 2 A 6 8 10 12714 16 18 20 È 3J FS 
ital Time-point (seconds) i ; ' i ! ! 
R -0.2 0.0 0.2 0.4 0.6 0.8 1.0 
% Signal change Insula Left 
Anteroyentral 
S r tu ; Insula Anteroventral Striatum 
3.0 4 +... 4 2.0 4 Wo 
7 t=; .? 
AEE 15 a bed oo 6 
204 mile l 
£ 2e 
Q 1.0 4 
Instila a 
10 — 
0.5 ~ Ea 
ER Mn MEANS 3 
AN CW OB AN CW OB 





Fig. 105.2 Functional MRI studies examining taste reward processing in anorexia nervosa (AN) participants and control women (CW). (a) During 
taste processing of sucrose vs water, RAN, compared to CW, showed less activation of the ventral and dorsal striatum and the insula to sucrose and 
water. (b) Corresponding time points of the blood oxygen level-dependent (BOLD) in the left insula showed this decreased response to sucrose in RAN. 
BOLD response is correlated with pleasantness response for sucrose in the left and right insula in CW, but not in RAN. (c) By contrast, a taste reward 
conditioning tasks shows underweight AN have greater brain response in the anteroventral striatum, insula, and PFC compared to control and obese 
women, during taste anticipation. 


Reproduced from Neuropsychopharmacology, 33(3), Wagner A, Aizenstein H, Frank GK, et al., Altered insula response to a taste stimuli in individuals recovered from restricting- 
type anorexia nervosa, pp. 513-23, Copyright (2008), with permission from Springer Nature. 


greater prediction error response within the ventral caudate/nu- response to unexpected reward receipt and to unexpected reward 
cleus accumbens, and the anterior and posterior insula during a omission. Following treatment, prediction error and response to un- 
monetary reward task [69]. They also showed an increased neural expected reward omission normalized, but unexpected reward re- 

ceipt remained heightened. Caudate prediction error response was 









































negatively associated with weight gain during treatment, indicating 

| tha a higher neural prediction error response may be a marker of 

E Receive Keep illness severity or the treatment-resistant nature of adolescent AN 

Use prediction outcome 2 Reward = prediction [70]. Since depression and anxiety are significantly associated with 
prediction unchanged intolerance of uncertainty and harm avoidance among AN individ- 

Reward a uals [68], an elevated prediction error in response to food could be 


prediction related to higher rates of depression among AN participants, rela- 
tive to controls. 
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prediction error responses may play a role in the opposing patterns 
of food consumptions among individuals with AN and BN. 





Reward response to food cues in eating disorders 


Anorexia nervosa 


The majority of research examining neural response to food among 
individuals with EDs uses visual food cues [2]. Functional magnetic 
resonance imaging (fMRI) studies investigating brain response to 
pictures of food revealed reduced reward brain response in AN in 
the inferior parietal lobe [70], OFC [71, 72], lateral PFC [72], hypo- 
thalamus [71], and insula [67, 71], but greater activation of the 
medial PFC and posterior cingulate [72], when viewing food im- 
ages. Decreased neural responses to food images in AN may indi- 
cate innate differences in reward processing [1]. Notably, decreased 
reward response to food pictures in individuals with AN was sig- 
nificantly correlated with illness severity [70], suggesting that de- 
creased reward sensitivity may contribute to an increased ability to 
restrict intake. Unlike healthy controls, both ill and weight-restored 
AN participants failed to demonstrate the expected associations be- 
tween the levels of ghrelin, a ‘hunger hormone; and brain response 
to visual food stimuli in limbic regions [71], suggesting a possible 
disconnect between brain reward response and physiological 
hunger signalling. 

Rather than eliciting reward response, food cues may be associ- 
ated with exaggerated attentional processing among ill and RAN in- 
dividuals [1, 72, 73]. Early fMRI research shows that, unlike healthy 
controls, individuals with AN exhibited increased left amygdala re- 
sponse to high- vs low-calorie foods [73]. Pictures of high-calorie 
foods elicited high self-reported anxiety in AN individuals, along 
with increased left insular and ACC brain response [73], suggesting 
elevated fear of calorically dense foods. Elevated activation in the 
medial temporal gyrus when viewing high-calorie foods was also 
associated with self-reported anxiety [74]. More recent research 
showed that both ill and RAN individuals demonstrated increased 
activation in areas of executive function and error monitoring, rela- 
tive to controls, when viewing food images [72]. Together, these 
findings could suggest heightened arousal, vigilance, or processing 
activity in response to food cues among AN individuals, possibly to 
predict and control the anxiety produced by food stimuli that may 
be experienced as unpleasant [7]. 

Tasks that permit examination of neural response to food images 
and during food-related decision-making have also been used to 
assess food-related processes in AN. For example, AN individuals 
showed greater activation of neural circuits associated with habit 
formation than controls when choosing low-fat vs high-fat foods 
[75]. Although it remains unknown whether reduced energy intake 
among AN individuals is a habit, these findings provide prelim- 
inary evidence that maladaptive behaviours in AN may be mediated 
by dorsal fronto-striatal circuits that underlie habitual behaviours 
[75]. Thus, alterations in these circuits may play a role in food 
choices in AN. 


Bulimia nervosa and binge eating disorder 


Limited research examining the response to visual food cues among 
individuals with BN and BED has thus far yielded inconsistent find- 
ings. Participants with BN showed greater activation in the mOFC, 


ACC, visual cortex, and insula in response to pictures of palatable 
foods, relative to controls [76, 77], and heightened insular and ACC 
activation in response to food images, relative to participants with 
BED, suggesting enhanced reward processing of visual food stimuli 
[77]. An exaggerated response to visual food cues could increase the 
risk for binge eating among individuals with BN. However, other 
studies showed evidence of reduced reward response to food stimuli 
among BN, compared to controls [12, 78]. In addition, patients with 
BN showed decreased neural activity in the temporal lobe, inferior 
parietal lobule, and post-central gyrus in response to visual food 
stimuli [76]. 

Results in BED to date are similarly mixed. One study revealed 
increased medial PFC response, relative to controls [77], and an- 
other showed elevated response patterns of the left ventral striatum, 
compared to BN, in response to food pictures [79]. However, other 
findings revealed a negative correlation between body mass index 
(BMI) and neural reward response to food cues in BED, suggesting 
that chronic excessive caloric intake may contribute to changes in 
brain function or that individuals with a lower reward response are 
at a greater risk for weight gain [2]. It remains unclear whether func- 
tional alternations among BN individuals and those with BED are 
premorbid traits that contribute to binge eating behaviours or if such 
differences result from extreme overeating. 





Inhibitory control in eating disorders 


A growing body of research suggests that EDs are marked by 
an imbalance between reward and inhibitory control circuitry. 
Homeostatic mechanisms regulating appetite interface with reward 
and cognitive control circuitry to guide eating behaviours [51]. 
Cognitive processes involved in planning, inhibitory control, and 
decision-making integrate interoceptive cues (for example, infor- 
mation about the internal physiological state) and reward signals 
to determine food selection, portion sizes, and frequency of eating 
[80]. These systems can override homeostatic signals, giving rise to 
extreme patterns of over- or under-consumption [81]. 

A network consisting of the DLPFC, ventrolateral prefrontal 
cortex (VLPFC), insula, dorsal caudate, and posterior parietal cortex 
is associated with inhibitory control [82]. The dorsal cognitive cir- 
cuit modulates selective attention, planning, decision-making, and 
effortful regulation of affective states [83]. The ventral limbic and 
dorsal cognitive circuits are involved in inhibitory decision-making 
processes, especially involving reward-related behaviours. Through 
its connections with the dorsal ACC, the insula is also involved in 
the cognitive valuation of salient stimuli such as food [84]. 


Anorexia nervosa 


Individuals with AN-R often exhibit temperaments marked by be- 
havioural inhibition, cognitive inflexibility, and emotional constraint 
[85]. Imaging data consistently provide evidence of alterations in 
dorsal inhibitory control circuits among AN and RAN individuals. 
Specifically, several fMRI studies revealed enhanced activity in in- 
hibitory control regions among AN, relative to controls, during 
tasks requiring effortful control, cognitive inhibition, and error de- 
tection [86, 87]. RAN, relative to healthy control, participants also 
demonstrated enhanced activation of inhibitory control regions 
during both hunger and satiety [41] (Fig. 105.4). While behavioural 
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Fig. 105.4 Functional MRI response of cognitive circuitry during a monetary decision task in RAN and CW. Left: within the left middle frontal gyrus, 
CW responded more strongly when hungry than when satiated (z = 2.6; P = 0.04), whereas RAN responded more than CW when satiated (z = 2.7; 

P = 0.03). Middle: within the right ventrolateral PFC, CW responded more when satiated than when hungry (z = 2.0; P = 0.02). Right: within the left 
insula, CW responded more when satiated than when hungry (z = 3.6; P = 0.002). CW, healthy comparison women; RAN, women remitted from 


anorexia nervosa. 


Reproduced from Biol Psychiatry, 77(7), Wierenga C, Bischoff-Grethe A, Melrose A, et al., Hunger does not motivate reward in women remitted from anorexia nervosa, pp. 642, 


Copyright (2015), with permission from Society of Biological Psychiatry. 


findings in regard to set-shifting among adolescents with AN are 
mixed [88, 89], adult AN and RAN participants showed decreased 
neural activity in frontal regions during set-shifting [90, 91]. Such 
findings call into question whether cognitive inflexibility may be as- 
sociated with brain development or a trait marker for AN [85]. AN 
participants also showed a positive association between successful 
inhibition and increased activation of the inferior parietal cortex 
[87]. Increased activation of brain regions involved in planning, 
control, and evaluation of negative outcomes may underlie excessive 
behavioural inhibition and restricted food intake characteristic of 
AN. Furthermore, over-control may enable AN individuals to re- 
strict food intake in order to achieve weight loss. 


Bulimia nervosa 


In addition to demonstrating high reward sensitivity, BN individuals 
are often emotionally dysregulated and behaviourally disinhibited [92]. 
They often engage in more impulsive and sensation-seeking behav- 
iours than individuals with AN and healthy controls [93]. Inhibitory 
control processes are impaired in BN participants, as evidenced by 
poor performance on Stroop, Stop Signal, and Go/No-Go tasks [94]. Ill 
BN adolescents and adults, relative to controls, showed reduced fronto- 
striatal activation during correct responding on incongruent trials of 
the Simon Spatial Incompatibility Task [12, 95]. However, a study con- 
ducted by Lock et al. [87] revealed increased activation of the right 


DLPFC among adolescents with binge eating and purging during a 
Go/No-Go task [87]. Reduced activation during response inhibition is 
associated with illness severity [12, 95]. Individuals with BN also dem- 
onstrated behavioural impairments in inhibitory control in response 
to food stimuli [96], indicating that altered functioning of inhibitory 
control circuitry may underlie eating pathology in BN. 


Binge eating disorder 


Behavioural studies suggested that individuals with BED demon- 
strated significantly poorer performance on measures of decision- 
making and inhibitory control, relative to controls [97, 98]. Poor 
decision-making, coupled with difficulty delaying immediate re- 
wards, may explain the tendency to binge eat despite delayed nega- 
tive consequences (for example, weight gain, feelings of guilt). 
Imaging studies among individuals with BED showed alterations 
in cognitive control circuitry, namely in the prefrontal, insular, and 
orbitofrontal cortex in response to both food-specific and non-food 
stimuli [99, 100]. Obese participants with BED showed less activa- 
tion in the vmPFC, IFG, and insula, compared to obese and lean 
controls, during a Stroop task [100]. Additionally, a negative correl- 
ation was found between activation in brain regions involved in im- 
pulse control and dietary restraint scores in the BED group, whereas 
no such association was found between obese non-BED and lean 
participants [100]. Dysregulation of these regions may contribute to 
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the development and maintenance of binge behaviours seen among 
individuals with BN and BED. 


Interoception in eating disorders 





Interoception refers to the brain’s receipt of, and response to, con- 
tinuous feedback of afferent signals that provide information 
about the representation of the body’s physiological state [48]. 
Interoception includes respiration, heartbeat, pain, hunger, thirst, 
and intestinal tension, all of which are implicated in EDs [36, 101]. 
Key neural structures involved in interoception include the insula, 
midbrain reticular nuclei, ventromedial and ventroposterior thalami, 
and posterior insular cortex [48]. The insula evaluates interocep- 
tive signals and plays a critical role in processing such sensations 
to guide behaviour. Evidence from neuroanatomical and functional 
neuroimaging indicates that the insula also plays an important role 
in regulating hunger, appetite, and thirst [48]. The insula receives 
exteroceptive signals from the spino-thalamo-cortical interoceptive 
pathway [48] and integrates such information with input from the 
hypothalamus and amygdala, creating a representation of one’s in- 
ternal and external environments [47]. This representation is pro- 
jected to the anterior insula where it is integrated with information 
from cortical regions and limbic regions involved in motivation and 
saliency [47], evaluating how the stimulus may affect the body [102]. 
Ventral portions of the middle and anterior insula process social- 
emotional, cognitive, and sensorimotor information [48]. The mid 
insula also codes interoceptive prediction error, signalling a mis- 
match between expected and actual bodily arousal, which can elicit 
anxiety and approach or avoidance behaviour [47, 48, 102]. In fact, 
recent evidence suggests that several mood and anxiety disorders are 
rooted in altered interoception [103]. Altered interoception may be 
particularly relevant to Eds, given that individuals with AN and BN 
often exhibit comorbid depression and anxiety [104]. 


Anorexia nervosa 


Emerging evidence suggests altered interoceptive processing in ED. 
For example, individuals with AN demonstrate less accurate de- 
tection of interoceptive sensations (for example, cardiorespiratory 


changes) during meal anticipation [105]. Other studies suggested 
ill and RAN individuals, relative to controls, were less accurate in 
the perception of body signals and showed altered brain response 
to taste, pain, gastric interoception, and hunger [36, 105-107]. 
For example, despite comparable pain intensity ratings, relative to 
controls, RAN participants exhibited increased anterior insula re- 
sponse to anticipation of pain, but decreased activation to pain re- 
ceipt [106]. Further, this altered right anterior insula functioning is 
associated with the alexithymic temperament characteristic of AN, 
with alexithymia possibly reflecting an inability to utilize intero- 
ceptive cues to evaluate outcomes [106]. The observed discrepancy 
between subjective pain experience and brain response among AN 
participants may indicate a disconnection between reported and 
actual interoceptive state [106]. Weight-restored individuals with 
AN showed reduced left dorsal mid-insula activity during a stomach 
interoceptive attention task and increased activation of the right an- 
terior insula during heart interoceptive attention, relative to controls 
[107]. Altered anterior insula responsivity, as evidenced by abnormal 
anticipation, interpretation, and integration of internal and external 
stimuli, is likely to interfere with eating, social interactions, and 
other reward-related behaviours among individuals with AN [36]. 


Bulimia nervosa 


Altered interoception is also observed in BN. Patients with BN, rela- 
tive to controls, reported more deficits in interoceptive awareness on 
the ED inventory, while also demonstrating a positive correlation 
between anticipatory anxiety, interoceptive awareness, and body 
image distortion [108]. Individuals with BN reported elevated panic 
and anxiety symptoms, compared to controls, following sodium lac- 
tate and isoproterenol infusions, despite exhibiting a similar sym- 
pathetic response [109]. However, RBN individuals demonstrated 
reduced heartbeat counting accuracy [110], and individuals with 
BN showed elevated thresholds for pain [111] and larger gastric cap- 
acity, in comparison to healthy controls and obese individuals [112]. 
Thus, findings regarding interoception among individuals with BN 
remain unclear, as results from these studies suggest both heightened 
and diminished responses to interoceptive signals among BN indi- 
viduals [113]. For example, individuals with BN showed decreased 
baseline sympathetic tone, yet increased adrenergic sensitivity to 


Table 105.1 Summary table of neuroimaging findings of eating disorders in adults 
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(+) indicates increased neural response to stimuli, relative to controls; (-) indicates decreased neural response to stimuli, relative to controls. 
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Table 105.2 Summary table of neuroimaging findings of eating disorders in adolescents 
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(+) indicates increased neural response to stimuli, relative to controls; (-) indicates decreased neural response to stimuli, relative to controls. 


isoproterenol [114]. It is hypothesized that repeated binge eating 
and purging may contribute to habituation or sensitization of in- 
ternal body states, as evidenced by reduced salivation to food pres- 
entation among BN individuals, which normalizes after treatment 
[115]. Further research in individuals across all stages of illness and 
in recovery is needed to clarify interoceptive processes among BN 
individuals [113]. 

No studies to date have examined interoceptive awareness among 
individuals with BED. Research is needed to investigate the role of 
interoception in binge eating. 


Feeding disorders 


Feeding disorders include pica, rumination disorder, and ARFID. 
Pica is characterized by the compulsive and persistent eating of non- 
nutritive substances, while the essential feature of rumination dis- 
order is the repeated regurgitation of food occurring after feeding or 
eating over a period of at least 1 month [4]. Individuals with ARFID 
avoid or restrict food consumption, resulting in a clinically signifi- 
cant failure to meet nutritional requirements through the oral intake 
of food [4]. To date, no published neuroimaging studies of feedings 
disorders exist. Incidence of pica following acquired brain injury 
may suggest neural abnormalities among these individuals [116]. 
Further, some case studies suggest that selective serotonin reuptake 
inhibitors, in particular, are helpful in treating pica, suggesting 
possible serotonin dysfunction among affected individuals [117]. 
Although these findings support hypotheses for future investigation, 
imaging research is needed to better understand the neurobiological 
underpinnings of feeding disorders. 


Conclusions 


A consistent body of research provides evidence that individuals 
with EDs have altered functioning in reward, control, and intero- 
ceptive circuitry (Tables 105.1 and 105.2). Reduced motivational 
reward response and exaggerated cognitive control, coupled with 


an altered ability to integrate taste, hunger, and body signals, are 
likely to facilitate and maintain prolonged periods of fasting in 
AN. By contrast, alterations in both anticipatory and actual re- 
sponses to receipt of food, deficient inhibitory control circuit 
activation, and altered interoception are likely to underlie binge 
eating behaviour in BN and BED. These findings suggest poten- 
tial ED phenotypes based on neural circuit abnormalities, but also 
provide insight for targeting behavioural and pharmacological 
treatment options [2]. 

However, many questions still remain regarding the neurobiology 
of EDs. For example, malnutrition, psychological comorbidities, 
and medication use often confound the findings of research con- 
ducted in ill individuals [1]. Many studies examine individuals in the 
remitted stages of illness, making it difficult to determine whether 
functional neural abnormalities are inherent traits or reflect scars of 
the illness [85]. Longitudinal studies may be one way to address this 
methodological question. Further research is also needed to eluci- 
date whether neurobiological differences exist along a continuum 
or reflect discrete categories of eating pathology. Finally, although 
the hypothalamus is important in regulating food intake and body 
weight, evidence supporting the role of the hypothalamus in the 
aetiology of EDs is limited [36]. Studies combining the assessment 
of hypothalamic neuropeptide levels and neuroimaging may fur- 
ther contribute to the recent advances in understanding and treating 
these complex illnesses. 
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Feeding and eating disorders are characterized by persistently dis- 
turbed eating behaviour that results in abnormal consumption or 
absorption of food that impairs health or functioning. In DSM-5, 
they include pica, rumination disorder, avoidant/restrictive food 
intake disorder (ARFID), anorexia nervosa (AN), bulimia nervosa 
(BN), binge eating disorder (BED), and a residual category (other 
specified feeding or eating disorder) that includes subthreshold AN, 
BN, and BED, purging disorder, and night eating syndrome (NES) 
[1]. The International Classification of Diseases, eleventh revision 
(ICD-11) will include similar disorders [2]. In this chapter, the man- 
agement and treatment of these conditions are reviewed. 
































Pica is the recurrent eating of non-food items or substances such as 
paper, paste, wood, soap, chalk, pebbles, rubber gloves, clay, paint, 
or faeces [1, 2]. Data on the prevalence of pica are inconclusive. Pica 
occurs in people of all ages but may be more common in children, 
those with intellectual disabilities or autism, pregnant women, and 
those receiving dialysis [3-5]. The presentation and medical com- 
plications of pica are highly variable and associated with the specific 
ingested items or substances. Pica may become a focus of clinical 
attention because of medical complications, such as broken teeth, 
choking, bowel obstruction or perforation, infection, or poisoning, 
some of which are potentially fatal. Pica is significantly associated 
with an increased risk for anaemia, low haemoglobin, low haem- 
atocrit, and low plasma zinc, although the direction of causation is 
unknown [6]. 

Management of pica starts with a thorough medical and psycho- 
logical assessment to detect medical complications and any possible 
contributing factors. If present, medical complications of the pica 
must be adequately addressed. If a contributing factor (for example, 
iron deficiency anaemia) is identified, addressing the factor (for 





example, treatment with supplemental iron) may lead to resolution 
of pica, but this does not always occur [3]. 

There have been no adequately sized and designed randomized 
controlled treatment trials in patients with pica, and empirically 
derived treatment guidelines for pica do not exist. Treatment often 
requires a multi-disciplinary team with physicians, dieticians, and 
psychologists. In a review of 26 treatment studies in individuals with 
disabilities, it was concluded that behavioural treatments, especially 
those involving a combination of reinforcement response reduction 
procedures, were effective [7]. In another review, it was concluded 
that applied behaviour analysis was the most effective strategy [8]. 

Regarding pharmacotherapy, there are case reports of patients 
whose pica and attention-deficit/hyperactivity disorder (ADHD) 
both responded to methylphenidate, and of patients whose pica and 
depression or obsessive-compulsive disorder both responded to a 
selective serotonin reuptake inhibitor (SSRI) [9-11]. These cases 
suggest that treatment of comorbid psychiatric disorders might be 
helpful for reducing pica in some patients. 

Nutritional supplements have been evaluated in pica, with 
negative findings. In a two-by-two factorial randomized, placebo- 
controlled trial of iron and multimicronutrient supplementation 
in 406 Zambian schoolchildren, 302 with geophagy, neither treat- 
ment was superior to placebo for decreasing geophagy prevalence 
[12]. Indeed, decline in geophagy prevalence was higher in chil- 
dren receiving placebo than those receiving iron (28.0% vs 22.3%; 
P = 0.044). There were also no interactions between iron and 
multimicronutrient supplementation. This trial was limited by an 
extremely high dropout rate. In a randomized, placebo-controlled 
trial of an intramuscular iron-dextran preparation in 31 African- 
American children with pica, improvement in pica was not signifi- 
cantly different at 2-3 months or at 9-10 months [13]. Patients with 
lower haemoglobin concentrations did not respond to iron more 
favourably than those with higher haemoglobin concentrations, or 
respond more favourably to iron than to placebo. Additionally, there 
was no correlation between changes in haemoglobin concentration 
and changes in pica severity. 
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Rumination disorder 


Rumination disorder is repeated regurgitation of recently ingested 
food (bringing previously swallowed food from the stomach back 
into the mouth), which may be re-chewed, re-swallowed, or spat out, 
that is not better explained by a gastrointestinal or other medical 
syndrome [1, 2, 14-17]. Of note, the Rome VI diagnostic criteria des- 
ignate ‘rumination syndrome’ as a functional gastrointestinal dis- 
order [18]. Though DSM-5 and the Rome criteria are silent on the 
relationship between rumination disorder vs rumination syndrome, 
the definitions are similar. 

The act of regurgitation in rumination disorder is usually effortless 
and may be preceded by urges to regurgitate and/or a belching sen- 
sation (but not by nausea or retching). Regurgitated material con- 
tains recognizable food that may have a pleasant taste. Regurgitation 
tends to stop once the regurgitant becomes acidic. Although preva- 
lence data are inconclusive, rumination disorder occurs in infancy, 
childhood, adolescence, and adulthood. It may be more common 
in certain groups such as infants and individuals with intellectual 
disability. Rumination can be accompanied by halitosis, chronic- 
ally chapped lips, malnutrition, growth delay, or weight loss and 
can be life-threatening, especially in infancy. Other medical com- 
plications include tooth decay and erosion; aspiration leading to re- 
current bronchitis or pneumonia, reflex laryngospasm, or asthma; 
and premalignant changes of the oesophageal epithelium. Patients 
with rumination have also been observed to have high rates of co- 
occurring depression and anxiety [17]. 

Management of rumination disorder begins with a comprehensive 
medical and psychological evaluation. Haematologic and chemistry 
tests are needed to exclude anaemia secondary to gastrointestinal 
bleeding and electrolyte imbalances. Rumination must be differ- 
entiated from other gastrointestinal conditions that involve gastro- 
oesophageal reflux or vomiting, including oesophageal stricture, 
achalasia, hiatal hernia, gastro-oesophageal reflux disease, Sandifer 
syndrome, gastroparesis, and peptic ulcer disease. Some authorities 
recommend studies of manometry, combined with impedance, to 
distinguish rumination from gastro-oesophageal reflux disease [16]. 

There are no randomized controlled treatment trials for rumin- 
ation disorder, and it is currently thought to be best addressed with 
behavioural techniques across all age groups. Strategies for chil- 
dren include changing the child’s posture during and after feeding, 
distracting the child when he or she starts the behaviour, and aversive 
conditioning (placing something bad-tasting on the child’s tongue 
when he or she starts the behaviour). Psychotherapy for parents of 
children with rumination disorder may also be useful. Teaching dia- 
phragmatic breathing to compete with the urge to ruminate may be 
helpful in children, adolescents, and adults [19]. In one study where 
28 patients with rumination syndrome were trained to modulate 
abdomino-thoracic muscle activity under visual control of elec- 
tromyographic recordings, the number of regurgitations decreased 
from 27 to eight after three sessions [20]. This improvement was 
maintained at 6-month follow-up. Patients with rumination asso- 
ciated with supragastric belching may respond well to behaviour 
therapy with a speech therapist [16]. There are also anecdotal reports 
of beneficial effects from gum chewing [16, 18]. 

Protein pump inhibitors may be helpful for associated oesopha- 
geal or gastric damage, but not for rumination itself. Prokinetic 


medications also are not effective for rumination and may worsen 
symptoms [17]. In an open-label trial using high-resolution 
manometry-impedence recordings in 12 patients with rumination 
or supragastric belching, treatment with the gamma-aminobutyric 
acid (GABA) B receptor agonist baclofen substantially reduced the 
number of gastrointestinal flow events, including ruminations [21]. 
Reduction in flow events correlated with an increase in lower oe- 
sophageal sphincter pressure and a reduction in swallowing. For 
treatment-resistant rumination, options include an inpatient multi- 
disciplinary programme [17]. Anti-reflux surgery (for example, 
Nissen fundoplication) has been reported to be effective, but further 
research is needed [16, 17]. 


Avoidant restrictive food intake disorder 





ARFID is characterized by avoidance or restriction of food intake 
accompanied by weight loss or growth delay, malnutrition, depend- 
ence on oral nutritional supplements or enteral feeding, and/or 
marked functional impairment that is not due to lack of food or a 
culturally endorsed tradition [1, 2, 22-25]. Unlike those with AN, 
individuals with ARFID have no disturbance in how they experience 
their body weight or shape and lack fear of weight gain, drive for 
thinness, or preoccupation with body image. Rather, they are more 
likely to report fear of vomiting or choking, concerns with food tex- 
ture, or abdominal pain as reasons for their eating difficulties or 
weight loss. 

ARFID most commonly develops in infancy or early childhood 
and may persist into adulthood. Patients with ARFID have high 
rates of comorbid medical and psychiatric disorders, especially low 
bone mineral density scores and anxiety disorders. Compared to 
those with AN in one study, ARFID patients were younger, more 
likely to be male, more likely to have a comorbid medical condition 
or anxiety disorder, and less likely to have a mood disorder [23]. 

The prevalence of ARFID is unknown. Among paediatric eating 
disorder inpatients, rates of ARFID have ranged from 5% [22] to 
14% [23]. Initial evaluation often begins with the primary care prac- 
titioner who should address the child’s prenatal, birth, and medical 
histories, as well as parental coping and mental health. Potential 
medical causes must be ruled out such as gastrointestinal disease, 
food allergies, or occult malignancy. Potential co-occurring medical 
disorders or complications and psychiatric disorders must also be 
determined. 

No randomized controlled treatment trials have been con- 
ducted in patients with ARFID, and empirically based treatment 
guidelines are not available [26]. Management of ARFID usually 
requires a multi-disciplinary team that provides medical man- 
agement, nutritional rehabilitation, and psychological treatment. 
Family therapy is often needed to help parents manage their child’s 
nutritional intake. In one study, ARFID patients hospitalized for 
acute medical stabilization at an academic medical centre relied 
on more enteral nutrition and had longer hospitalizations than 
AN patients but had similar rates of remission and readmission 
1 year after discharge [24]. In a study of 14 adolescent medicine 
eating disorder programmes, there were no significant differences 
between the types of treatments or programmes in terms of weight 
restoration [27]. Patients with ARFID sometimes also receive psy- 
chotropic medications, such as SSRIs for co-occurring anxiety 


CHAPTER 106 Management and treatment of feeding and eating disorders 


and/or olanzapine to enhance weight gain, but the efficacy of these 
interventions is unknown [25]. 


Anorexia nervosa 


AN is characterized by persistent energy intake restriction, leading 
to significantly low body weight or growth failure, in combination 
with intense fear of gaining weight or of becoming fat or persistent 
behaviour that interferes with weight gain, and distorted self- 
perceived weight or shape [1, 2, 26, 28-30]. Physical hyperactivity, 
obsessive-compulsive symptoms related to eating and food, depres- 
sive and anxiety symptoms, and impaired insight are also common 
features. Amenorrhoea occurs in many females with AN but is no 
longer required for the diagnosis. DSM-5 recognizes two subtypes of 
AN: restrictive (where there has been no binge eating or purging be- 
haviour in the past 3 months) and binge eating/purging (where there 
has been recurrent binge eating or purging in the past 3 months) 
[1]. Severity of AN is often determined by BMI status, with greater 
severity linked with lower BMI. New to DSM-5 is the diagnosis of 
atypical AN, in which there has been significant weight loss, but the 
individual's weight is within or above the normal range. 

The lifetime prevalence of AN by DSM-IV criteria is around 
1% in females and <0.5% in males [30]. Because amenorrhoea is 
no longer a defining component, AN, as defined by DSM-5 cri- 
teria, may be more common. Peak age of onset of AN is in early 
to mid adolescence, but the disorder may occur at any age. AN is 
associated with substantial medical and psychiatric comorbidity 
[26, 28-31]. Complications arising from the effects of starvation, 
binge eating, and purging occur across multiple organ systems and 
include: bradycardia, hypotension, and arrhythmias; reduced core 
body temperature; hypoglycaemia, hypercholesterolaemia, and 
amenorrhoea; fluid, electrolyte, and acid-base imbalances; anaemia 
and neutropenia; acute pancreatitis, salivary gland enlargement, 
Mallory—Weiss tears, elevated liver enzymes, and constipation; im- 
paired bone quality, low bone mineral density, and increased frac- 
ture risk; depressive symptoms and cognitive impairment; and tooth 
decay and gum disease [26, 28-32]. Patients with AN may have an 
increased prevalence of autoimmune disorders, particularly type 
1 diabetes, which may precede the onset of AN. More than half of 
adolescents and adults with AN meet criteria for another psychi- 
atric disorder, especially mood and anxiety disorders [33, 34]. The 
mortality of AN is the highest of all mental disorders, with most 
deaths due to starvation-related medical complications and suicide 
[30]. Of note, AN in children and adolescents, AN in adults, and se- 
vere and enduring AN likely require different treatment approaches 
[26]. Early access to treatment is important, as there may bea critical 
window for effective intervention within the first 3 years of illness 
onset, beyond which full recovery becomes more difficult to achieve 
[30]. For younger and recently ill patients, the goal of treatment is 
full recovery. For patients with severe and enduring AN, treatment 
should focus on improving quality of life and reducing harm more 
than achieving full recovery. 

Management of AN begins with a thorough medical and psy- 
chological evaluation, so that it can be determined where the pa- 
tient should be treated—in a general medical hospital, a psychiatric 
hospital, or as an outpatient [26, 28-30]. There is no evidence that 
inpatient treatment is superior to outpatient treatment, and most 


patients can be treated on an outpatient basis [35]. However, a 
number of guidelines provide parameters for when a patient should 
be treated in a general medical or psychiatric hospital [26] (Table 
106.1). No matter the location of care, most guidelines recommend 
a multi-disciplinary approach involving medical, dietetic, and psy- 
chological treatment. Whenever possible, it is crucial to involve 
family members or significant others in the treatment process. 

Medical stabilization, refeeding, and restoration of body weight, 
nutritional status, and, among females, regular menses are the pri- 
mary aims of initial management. There is some controversy as to 
how best restore body weight and nutritional status in patients with 
AN, as empirical data for this process are lacking and there is con- 
cern about the development of refeeding syndrome, a potentially 
fatal complication characterized by hypophosphataemia, hypomag- 
nesaemia, hypokalaemia, and cardiac failure that may occur with 
refeeding, including with intravenous dextrose, after a sustained 
period of malnutrition [36-38]. The standard of care therefore is to 
commence refeeding at low caloric levels and increase slowly, with 
variable recommendations internationally as to precise caloric pre- 
scriptions. In the United States, refeeding is begun with around 1200 
cal/day and increased by about 200 kcal every other day. The Royal 
Australian and New Zealand College of Psychiatrists (RANZP) 
recommends that adult AN patients should start refeeding with 
6000 kjJ/day, which is then increased by 2000 kJ/day every 3 days 
until adequate intake for weight restoration is achieved [26]. For 
mildly or moderately malnourished patients, there is some evi- 
dence that refeeding should start at >1400 kcal/day so as to avoid 
undernutrition. 

Refeeding methods include using food, high-energy liquid sup- 
plements, nasogastric or enteral feeding, and, on rare occasions, 
parenteral nutrition. The least intrusive method of refeeding that 
can provide adequate nutrition should be used. Routine monitoring 
during refeeding includes regular measurement of weight, electro- 
lytes, liver function tests, electrocardiographic findings, and other 
aspects of physiological functioning. In the early stages of refeeding, 
supplemental phosphate (500 mg twice daily) and thiamine (at 
least 100 mg/day) are often provided. Some guidelines argue that 
refeeding methods should be highly individualized. However, a 
study in adolescents with AN showed that a standardized caloric 
prescription was superior to individual prescription in facilitating 
early weight gain without increasing the risk of refeeding syndrome 
[39]. Standardized caloric prescription was also associated with a 
lower incidence of bed rest. Once adequate weight gain is achieved, 
continued dietary and psychological treatment should be provided 
for weight maintenance. 

A wide range of psychological treatments have been evaluated in 
patients with AN [26, 28-30, 40-43]. For children and adolescents, 
the RANZP guidelines concluded that empirical evidence for effi- 
cacy was strong for family-based treatment and Maudsley family 
therapy; moderate for family system therapy, adolescent focused 
therapy, and enhanced cognitive behavioural treatment (CBT-E); 
and weak to moderate for cognitive behavioural therapy (CBT) [26]. 
The core feature of family-based treatment for adolescents with AN 
is that the parents are guided on how to take control of their child’s 
eating. Of note, a recently conducted randomized controlled trial 
found parent-focused treatment, where the therapist meets only 
with the parents and a nurse monitors the patient, was superior to 
family-based treatment for bringing about remission in adolescents 
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Table 106.1 Indicators for consideration for psychiatric and medical admission for adults with AN or BN 





Weight 
Rapid weight loss 


Body mass index <14 kg/m? 


a | Psychiatric admission indicated Medical admission indicated 


Body mass index <12 kg/m? 


1 kg per week over several weeks or grossly inadequate 


nutritional intake (<100 kcal daily) or continued weight loss 


despite community treatment 














Systolic BP <90 mmHg 

Postural BP >10 mmHg drop with standing 
Heart rate 

Temperature <35.5°C or cold/blue extremities 
12-lead ECG 

Blood sugar Below normal range 

Sodium <130 mmol/L 

Potassium Below normal range 
Magnesium 

Phosphate 

eGFR 

Albumin Below normal range 


Liver enzymes Mildly elevated 


<1.5X 10°/L 


Suicidal ideation 

Active self-harm 

Moderate to high agitation and distress 
Psychosis 


Neutrophils 


Psychiatric symptoms 





<80 mmHg 











>20 mmHg drop with standing 
<40 bpm or >120 bpm or postural tachycardia >20 bpm 
<35°C or cold/blue extremities 


Any arrhythmia, including QTc prolongation or non-specific 
ST or T-wave changes (including inversion or biphasic waves) 


<2.5 mmol/L 
<125 mmol/L 
<3.0 mmol/L 
Below normal range 
Below normal range 


<60 mL/min/1.73m‘’ or rapidly dropping (for example, 25% 
drop within a week) 


<30 g/L 
Markedly elevated (AST or ALT >500) 
<1.0x 10°/L 


ALT, alanine aminotransferase; AST, aspartate aminotransferase; BP, blood pressure; bpm, beats per minute; ECG, electrocardiogram; eGFR, estimated glomerular filtration rate; g, gram; 


kg, kilogram; L, litre; mmHg, millimetres of mercury. 


Adapted from Aust N ZJ Psychiatry, 48(11), Hay P, Chinn D, Forbes D, et al., Royal Australian and New Zealand College of Psychiatrists clinical practice guidelines for the treatment of 


eating disorders, pp. 977-1008, Copyright (2014), with permission from SAGE Publications. 
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with AN, though differences in remission rates were not different at 
6- or 12-month follow-up [42]. 

For adults with AN, the RANZP guidelines concluded that there 
was moderately strong evidence of efficacy for CBT-E, focal dy- 
namic therapy, Maudsley Model of Anorexia Nervosa Treatment for 
Adults (MANTRA), and specialist supportive clinical management 
(SSCM); and weak evidence for CBT, behaviour therapy, interper- 
sonal therapy (IPT), psychodynamic therapy, and cognitive analytic 
therapy [26]. However, reviews of these treatments have concluded 
that no specific psychological therapy was consistently superior to 
any other specific approach [40, 41]. Thus, in a 2-year follow-up of a 
randomized clinical trial comparing MANTRA with SSCM in out- 
patient adults with AN, both treatments were comparable regarding 
improvement in BMI, eating disorder psychopathology, and clinical 
impairment [43]. In the only randomized clinical trial of psycho- 
therapy in outpatients with severe and enduring AN, shifting focus 
from weight gain and recovery to improved quality of life was found 
to be important [44]. There is no empirical evidence that guided self- 
help CBT (CBT-GSH), which may be helpful in BN or BED, is an 
effective treatment for AN [26]. 

No medication has regulatory approval for the treatment of AN. 
The World Federation of Societies of Biological Psychiatry (WFSBP) 
guidelines for the pharmacological treatment of eating disorders 
concluded that there was weak evidence from controlled studies for 
zinc supplementation and olanzapine for enhancing weight gain in 


AN [45]. One study found olanzapine might be helpful for promo- 
tion of weight gain in outpatients with AN when given with medi- 
cation management sessions [46]. Use of both agents, however, is 
controversial [26, 47]. Two meta-analyses of studies of newer drugs 
for psychosis in AN concluded that these agents were not efficacious 
for weight restoration or reduction of eating disorder psychopath- 
ology in patients with AN [48, 49]. Nonetheless, certain subsets 
of AN patients may be responsive to dopamine antagonist drugs, 
including those with prominent anxiety or depression, obsessive- 
compulsive symptoms, hyperactivity, or antipsychotic-responsive 
comorbid conditions such as bipolar disorder [47]. 

There is no evidence of efficacy for tricyclic drugs or SSRIs for 
enhancing weight restoration or reducing psychopathology in the 
refeeding phase of AN treatment [50], while evidence for fluoxetine 
for relapse prevention after weight restoration is mixed [51, 52]. 
Drugs for depression might improve co-occurring depressive or 
obsessive-compulsive symptoms that persist despite successful 
weight restoration. Results from randomized controlled trials of al- 
prazolam, cyproheptadine, D-cycloserine, dehydroepiandrosterone, 
growth hormone, lithium, naltrexone, physiologic oestrogen re- 
placement, oxytocin, and tetrahydrocannabinol in patients with AN 
have been mostly negative [47]. In a small randomized controlled 
trial, the ghrelin agonist retamorelin improved gastric emptying 
(P = 0.03) and body weight (P = 0.12) more than placebo [53]. A ran- 
domized controlled trial of the synthetic cannabinoid dronabinol in 
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patients with severe and enduring AN found small weight gain, with 
no side effects [54]. Different types of neuromodulation treatments 
(for example, deep brain stimulation, repetitive transcranial cur- 
rent stimulation, and transcranial direct current stimulation) have 
shown promise in severe and enduring AN, but systematic studies 
are needed before recommendations can be made [30]. 

Another area that has received some study is the medical treatment 
of AN-associated low bone mineral density. Available evidence sug- 
gests that the safest and most effective strategy for improving bone 
density in AN is to restore weight and, in females, menstrual func- 
tion. However, even with resolution of AN, bone mineral density is 
not always restored. Pharmacotherapies that show promise include 
physiologic oestradiol replacement (that is, with a transdermal oes- 
tradiol patch) and, in adults, bisphosphonates [32]. 

A final important consideration is the not infrequent need for 
coercion in the treatment of AN [28]. Compulsory refeeding has 
short-term benefits, but long-term outcome is unknown [55]. More 
research on the potential long-term benefits and harms of compul- 
sory refeeding in AN is needed. 


Bulimia nervosa 


BN is characterized by recurrent binge eating episodes, recurrent 
inappropriate compensatory behaviours to prevent weight gain, 
and self-evaluation that is unduly influenced by body weight or 
shape [1, 2]. The lifetime prevalence of BN is estimated to be 1.5% 
in women and 0.5% in men. BN occurs in people of all ages, with 
the most common onset occurring in late adolescence to early 
adulthood. BN is associated with substantial medical and psychi- 
atric comorbidity [31, 33, 34, 56]. Co-occurring medical conditions 
include complications of binge eating and inappropriate weight 
loss behaviours, obesity, and obesity-related disorders [26, 56]. 
Psychiatric comorbidity includes mood, anxiety, substance use, and 
impulse control disorders. 

The management of BN begins with a careful medical and psy- 
chiatric evaluation, with special attention to the frequency of 
binge eating episodes, the types and frequencies of inappropriate 
compensatory behaviours, potential medical complications, and 
co-occurring psychiatric disorders. Physical examination and la- 
boratory evaluation should be conducted with attention to height 
and weight (to determine BMI), blood pressure and pulse, electro- 
lytes, acid-base status, and complete blood cell counts. A cardiac 
evaluation with an electrocardiogram may be indicated if there is 
hypokalaemia, postural hypertension, bradycardia, or tachycardia. 
If vomiting has been a prominent symptom, a dental evaluation 
should be done. Most patients with BN can be treated on an out- 
patient basis, but high frequency of binge eating and/or compen- 
satory behaviours, medical complications, psychiatric comorbidity, 
such as mood disorder or suicidal ideation/behaviour, or failure to 
respond to outpatient treatment, may necessitate hospitalization 
(Table 106.1). Particularly important among medical complications 
of purging is pseudo-Bartter syndrome, an upregulation of aldos- 
terone that causes hypokalaemic metabolic acidosis and, upon ces- 
sation of purging, can lead to severe oedema [31]. This syndrome 
may be worsened by vigorous hydration with normal saline; it can be 
treated by gentle rehydration with a potassium-rich fluid and with 
spironolactone. 


Many authorities believe the first-line treatment of BN in adults 
is individual psychotherapy, with the strongest evidence for CBT- 
E [26, 28, 29, 57]. CBT-E has been found superior to both IPT (at 
end of treatment, but not at 1-year follow-up) and psychoanalytic 
psychotherapy [58-60]. A more complex form of CBT-E may be 
preferable for patients with affect intolerance, low self-esteem, per- 
fectionism, or interpersonal difficulties. In a comparison trial in 
adolescents with BN, family-based treatment was superior to CBT 
at end of treatment and 6-month follow-up, but the two therapies 
were not statistically significantly different at 12-month follow-up 
[61]. Dialectical behaviour therapy (DBT) and self-help modalities, 
especially CBT-GSH, also hold promise in BN. Self-help modalities 
are probably less effective than therapist-led psychotherapies but are 
appropriate first-step options when such therapies are not available 
or affordable. This includes self-help treatments administered via 
the Internet. 

Pharmacotherapy also has a role in the treatment of BN. The 
WFSBP guidelines concluded there was evidence of efficacy for tri- 
cyclics, SSRIs, and topiramate for reducing binge eating and purging 
in BN [45]. Indeed, fluoxetine has regulatory approval for the treat- 
ment of BN, and 60 mg/day is more efficacious than 20 mg/day [62]. 
Fluoxetine is also efficacious for maintenance of response, but the 
dropout rate is high [63]. Many other types of drugs for depression 
are efficacious in BN, but the weak dopamine and noradrenaline re- 
uptake inhibitor bupropion is contraindicated due to an increased 
risk of seizures. There is no clear evidence that these drugs enhance 
the effectiveness of psychotherapy for BN. In other randomized 
controlled trials, ondansetron was superior to placebo for redu- 
cing binge eating and purging, methylamphetamine was superior to 
placebo for decreasing the amount of food eaten, lithium and the 
prokinetic erythromycin were not efficacious, and results for opioid 
antagonists were mixed [45, 64, 65]. 


Binge eating disorder 


BED is characterized by recurrent binge eating episodes that are 
accompanied by distress, but not the inappropriate compensatory 
weight loss behaviours of BN [1, 2]. BED is the most common eating 
disorder, occurring as a lifetime diagnosis in 1.9% of adults world- 
wide and 1.3% of adolescents in the United States [33, 34, 56]. It 
is associated with obesity and obesity-related medical conditions 
(for example, diabetes, hypertension, dyslipidaemia, and possibly 
metabolic syndrome) and with mood, anxiety, substance use, and 
impulse control disorders. 

Management of BED involves a thorough psychological and 
medical evaluation to determine the severity of binge eating be- 
haviour, identify comorbid medical and psychiatric disorders, and 
ensure that BN is ruled out (that is, that there are no surreptitious 
inappropriate compensatory weight loss behaviours). At a min- 
imum, physical and laboratory examinations should include height 
and weight (to determine BMI), blood pressure and pulse, and 
fasting glucose and lipids. When surreptitious purging is a concern, 
serum electrolytes and amylase may be checked. Most BED patients 
can be treated on an outpatient basis, but psychiatric hospitaliza- 
tion should be considered for extremely frequent and distressing 
binge eating, a co-occurring mood disorder, or suicidal ideation 
and/or behaviour. 
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Psychological treatments that may be useful in patients with BED 
include psychoeducation, mindfulness techniques, self-help treat- 
ments (including CBT-GSH and those delivered via the Internet), 
behavioural weight loss therapy (BWLT), CBT, IPT, and DBT 
[66, 67]. The strongest evidence of efficacy is for CBT and IPT, which 
can work over the long term and in adolescents as well as adults [66- 
69]. Though CBT and IPT both reduce binge eating behaviour and 
associated psychopathology in BED, they are less effective for weight 
loss in obese patients. Patients who completely stop binge eating 
with these treatments, however, may lose a small amount of weight. 
CBT and IPT are both more effective than BWLT for reducing binge 
eating [69]. Though BWLT may produce greater weight loss initially, 
this weight loss is not maintained after 2 years. 

Physical activity may also be effective for reducing binge eating 
behaviour and excessive body weight in BED. In a review of three 
randomized controlled trials of physical activity in patients with 
BED, aerobic and yoga exercises reduced binge eating and BMI [70]. 
Additionally, aerobic exercise reduced depressive symptoms; CBT 
with aerobic exercise, but not CBT alone, reduced BMI; and CBT 
with aerobic exercise was more effective for reducing depression 
than CBT alone. 

Pharmacotherapy may be helpful for some patients with BED 
[45, 66, 67]. Lisdexamfetamine dimesylate (LDX), a prodrug of d- 
amphetamine marketed for treatment of children and adults with 
ADHD, is approved for the treatment of moderate to severe BED in 
adults in the United States. This approval was based in part on two 
phase 3 studies of LDX in BED, both of which found that LDX, ti- 
trated to 50 mg/day or 70 mg/day, was efficacious for reducing binge 
eating episodes and obsessive-compulsive features of binge eating 
and for inducing 4-week binge eating cessation rates [71]. An earlier 
phase 2 study found that LDX at 50 mg/day and 70 mg/day, but not 
30 mg/day, was efficacious for reducing binge eating [72]. LDX also 
caused significant weight loss in these trials (most participants were 
obese) but is not approved for weight loss or treatment of obesity. 
A long-term randomized withdrawal study of LDX in BED found 
that LDX was superior to placebo for maintenance of response [73]. 
No novel safety or tolerability issues were found. 

There is evidence from randomized controlled trials to support 
the efficacy of the anticonvulsant agents topiramate and zonisamide 
in BED [45, 66]. Both drugs reduce binge eating behaviour and 
obsessive-compulsive features of binge eating and cause clinically 
significant weight loss. Additionally, topiramate has been shown to 
augment the efficacy of CBT in obese patients with BED, enhancing 
weight loss and cessation of binge eating rates. However, the use of 
both drugs is limited by their adverse event profiles, which include 
cognitive dysfunction, paraesthesiae, and renal stones. 

Drugs for depression are modestly effective for reducing binge 
eating in BED, at least over the short term. The WFSBP eating dis- 
order pharmacotherapy guidelines concluded there was evidence 
of efficacy for sertraline in particular [45]. They are also effective 
for depressive symptoms associated with BED. In a study of patients 
with BED and a co-occurring depressive disorder, the serotonin/ 
noradrenaline reuptake inhibitor duloxetine was superior to placebo 
in reducing binge eating behaviour, global severity of BED symp- 
toms, global severity of depressive symptoms, and body weight [74]. 
In another study, bupropion (300 mg/day) was similar to placebo in 
reducing binge eating frequency but produced greater weight loss 
[75]. Also, it was well tolerated and there were no seizures. Data on 


the efficacy of the combination of antidepressant drugs with CBT 
are mixed. 

Other medications have been studied in BED. The weight loss 
agents orlistat (a pancreatic lipase inhibitor) and liraglutide (a 
glucagon-like peptide 1 analogue) may reduce body weight in obese 
patients with BED, but their effects on binge eating behaviour are 
unclear [66, 76]. Limited mixed data also exist for baclofen and 
chromium [66]. 

Treating patients with BED involves educating the patient about 
all treatment options, their degree of efficacy, and potential adverse 
effects. Determination of initial therapy often depends on patient 
choice and availability. As there has been virtually no research into 
managing treatment-resistant BED, specific recommendations 
cannot be made, but there is preliminary evidence that combination 
therapy may be helpful. 


Other eating disorders 


Other eating disorders that come to clinical attention are NES and 
purging disorder [1]. NES is broadly defined as morning anorexia, 
evening hyperphagia, and insomnia. A core feature is nocturnal 
eating episodes, characterized by the sense that, after awakening at 
night, one has to eat something in order to get back to sleep. The 
prevalence of NES has been estimated to be 1.1-1.5% in the general 
population [77]. NES often co-occurs with obesity, mood disorders, 
and BN or BED. If possible, NES should be distinguished from the 
parasomnia sleep-related eating disorder, which is characterized by 
recurrent episodes of involuntary eating or drinking during sleep. 

Study of the treatment of NES is in its infancy, and empiric- 
ally based guidelines are not available. The only two randomized 
controlled trials were with SSRIs and had inconsistent results— 
sertraline was superior to placebo, but citalopram was not [77]. 
Preliminary data suggest CBT, progressive muscle relaxation, and 
topiramate may be helpful, but these treatments need study in ran- 
domized controlled trials. 

Purging disorder is recurrent use of inappropriate weight loss 
behaviours for the purpose of influencing shape or weight, but 
without the binge eating episodes characteristic of BN or BED [1]. 
A newly recognized condition, it may be more common than real- 
ized. Though not accompanied by binge eating episodes, purging 
disorder may be characterized by loss of control of eating where 
small or normal amounts of food are ingested with a sense of loss 
of control. 

No randomized controlled trials have yet been conducted for 
purging disorder, and empirically based treatment guidelines are 
not available. In a tertiary care treatment-seeking sample, patients 
with purging disorder had similar post-treatment remission or com- 
pletion rates, as compared to those with AN or BN [78]. 


Conclusions 


Substantial advances in the management and treatment of feeding 
and eating disorders have been achieved, but this field lags behind 
that for other psychiatric disorders. The cornerstone of managing 
AN is refeeding, nutritional rehabilitation, and weight restoration. 
There is strong evidence of efficacy for family-based psychotherapy 
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for adolescents with AN, while other forms of psychotherapy for ado- 
lescents and adults have less evidence for efficacy. No medication is 
approved for the treatments of AN, and the only agents with possible 
evidence of efficacy are zinc, olanzapine, and dronabinol. For BN, 
there is evidence of efficacy for CBT, IPT, drugs for depression, and 
topiramate. For BED, there is evidence of efficacy for CBT, IPT, LDX, 
topiramate, and drugs for depression. Antidepressant drugs reduce 
comorbid depressive symptoms, and LDX and topiramate produce 
clinically significant weight loss in overweight or obese patients. Less 
is known about the management and treatment of pica, rumination 
disorder, ARFID, and other specified feeding or eating disorders such 
as purging disorder and NES. As feeding and eating disorders often 
co-occur with medical and other psychiatric disorders, the manage- 
ment and treatment of these conditions must also be addressed. 
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The World Health Organization (WHO) estimated that in 2014, more 
than 1.9 billion adults (18+) had overweight, of whom 600 million 
had obesity, representing 39% and 13% of the world’s adult popula- 
tion, respectively. Globally, 41 million children under the age of 5 
were estimated to have overweight or obesity [1]. Obesity has more 
than doubled since 1980 and is projected to increase still; if cur- 
rent rates continue, almost half of the world’s population will have 
overweight or obesity by 2030 [2]. Though once considered only a 
problem in high-income countries, recent years have seen dramatic 
rises in obesity rates in low- and middle-income countries, particu- 
larly in urban areas [1]. 

The prevalence of obesity varies by age, gender, and socio- 
economic status. Globally, obesity is more prevalent in women than 
in men and increases with age until old age when rates again decline. 
In middle- and high-income countries, the socially disadvantaged 
appear to be more vulnerable to overweight and obesity [3, 4]. The 
reasons for this association are not fully explained but may include 
greater exposure to environmental drivers of excess weight gain (for 
example, limited access to healthy food and safe and affordable re- 
creation spaces) [5], lower levels of executive functioning (a theor- 
ized control network regulating behaviours, which has been linked 
with poverty in childhood) [6], and a lack of cultural capital which 
may diminish motivation for health-related behaviour change [7]. 

Obesity has been formally recognized as a global epidemic by the 
WHO since 1997. In 2013, the American Medical Association rec- 
ognized obesity as a disease. 








Overweight and obesity are defined as abnormal or excessive fat 
accumulation that may impair health. There are a number of ways 
to measure and define overweight and obesity, which are described 
briefly here. 

Body mass index (BMI), weight in kilograms divided by the 
square of height in metres, is the most commonly used index for 
defining weight status in adults. The WHO defines adult overweight 
as a BMI 225 kg/m’ and adult obesity as a BMI 230 kg/m’ [1]. Some 
organizations use a lower BMI cut-off point for defining overweight 








and obesity in Asian populations due to increased susceptibility to 
obesity-related illnesses at lower BMI levels than in people of other 
ethnicities. Typically, this cut-off is BMI 223 kg/m? for overweight 
and 227.5 kg/m’ for obesity. 

For children under 5 years of age, the WHO defines overweight 
as weight-for-height greater than two standard deviations above 
the WHO Child Growth Standards median, and obesity as greater 
than three standard deviations above the median. In children aged 
5-19, the WHO defines overweight as BMI-for-age greater than 
one standard deviation above the WHO Growth Reference me- 
dian, and obesity as greater than two standard deviations above the 
median [1]. 

Waist circumference and waist-to-hip circumference are also 
used as indices of excess fatness and specifically of visceral adiposity. 
Although some evidence points to stronger associations with meta- 
bolic health outcomes [8], the measurements are less precise than 
weight and height and subject to considerable inter-observer error. 

Body composition measures, which aim to assess the proportion 
of body constituents, especially fat, are also available. Reference 
methods, such as whole body densitometry, dual-energy X-ray ab- 
sorptiometry (DXA), and magnetic resonance imaging (MRI), are 
not practical in many clinical contexts. Simpler techniques, such 
as measurements of skinfold thickness or bioelectric impedance 
(measuring impedance of the body to a small electric current), are 
sometimes used in large surveys and cohorts, but changes in fat 
mass in an individual are highly correlated with changes in weight 
and these additional measures are unlikely to change management 
decisions. 

Other diagnostic tools may be used to take into account obesity- 
related comorbidity and functional status [9]. 











The fundamental cause of obesity is an imbalance between energy 
intake and expenditure (calories consumed in food and drinks vs 
calories expended in basal metabolism and physical activity), but 
the determinants of intake and expenditure are complex. Rapid in- 
creases in overweight and obesity around the world, even in places 
where levels were historically low, illustrate the role of environ- 
mental changes in the rise in prevalence of obesity, including readily 
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Table 107.1 Drugs used to treat common mental health disorders which may predispose to weight gain 
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Sodium valproate 


Lithium 


Epilepsy, bipolar disorder and mania, migraine prophylaxis 


Bipolar disorder and mania, recurrent depression, aggressive and self-harming 


behaviour 


Clozapine 


Olanzapine, risperidone, and almost all first- and second-generation 
antipsychotics 

Amitriptyline, nortriptyline, and doxepin 

Fluoxetine 


Mirtazapine 


available energy-dense food and fewer imperatives for physical ac- 
tivity. These changes directly impact human behaviour, and urban- 
ization serves as a clear example of this phenomenon. Numerous 
studies and reviews have highlighted the process by which urban- 
ization in developing countries has led to concomitant increases 
in obesity prevalence [10-12]. Urbanization is characterized by 
marked increases in the consumption of animal fat and protein, re- 
fined grains, and added sugar—a phenomenon referred to as ‘the 
nutrition transition [11]. Simultaneously, the need for physical ac- 
tivity is reduced through decreases in active transport and increases 
in mechanized work where previously livelihoods were made in 
more manual occupations [10]. Such environmental and societal 
changes have been occurring worldwide and are outpacing human 
evolution. Humans evolved in conditions of relative food scarcity, in 
which high levels of physical activity were required to maintain live- 
lihoods and lives. We evolved to conserve energy, and as a species, 
this leaves us genetically ill-equipped to deal with the environmental 
changes that have occurred over recent decades [13]. 

Nonetheless, within any given environment, there are marked 
differences in weight status between individuals, reflecting funda- 
mental genetic differences, especially in appetite control systems 
(see Chapter 104) [14]. These interact with environmental drivers 
and behavioural differences, whether based in nature or nurture, 
to convey individual-level susceptibility to excess weight gain. 
Children are subject to the same environmental drivers as adults 
but are especially vulnerable. Young children lack the cognitive cap- 
acity to understand the consequences of poor dietary choices or the 
ability to understand the persuasive intent of food marketing, and 
they have limited control over their opportunities for physical ac- 
tivity [15]. Thus, the home environment and prevailing behaviours 
exert a powerful influence on a child’s weight. 

The systems mapping approach employed in the 2007 UK 
Government Foresight report on obesity concluded that energy im- 
balance was caused by a complex and multi-faceted system of de- 
terminants, in which no single influence dominated [16]. However, 
within a complex systems map, they identified primary appetite 
control in the brain, the force of dietary habits, physical activity 
levels, and ambivalence experienced by individuals in making life- 
style changes as key determinants. Psychological factors, including 
self-efficacy, autonomy, and the ability to cope with stress, have also 
been found to be associated with obesity (for more information, see 
Mental health, p. 1098). 


Schizophrenia and related psychoses 


Schizophrenia and related psychoses, bipolar disorder, mania, agitated and disturbed 
behaviour 


Depression, neuropathic pain, migraine prophylaxis 
Depression, bulimia nervosa, obsessive-compulsive disorder 


Depression 


Additionally, some pharmacotherapies also cause weight gain— 
these include sedatives and a number of medications used to treat 
mental health conditions (Table 107.1). 


Relationships between obesity, morbidity, 


and mortality 





The rising prevalence of obesity presents a challenge to health care 
systems because elevated BMI is a major cause of morbidity and 
mortality [17]. The latest Global Burden of Disease study estimates 
that worldwide overweight and obesity caused 3.9 million deaths in 
2015 and in excess of 120 million disability-adjusted life years lost 
in 2015, making it the fourth largest contributor to ill health [18]. 


Physical health 


The morbidity and premature mortality attributable to raised BMI 
is linearly associated with increased risk factors for a wide range of 
diseases, especially high blood pressure, raised low-density lipo- 
protein (LDL) cholesterol, insulin resistance, and non-alcoholic 
fatty liver disease. Together these explain much of the increased 
risk of coronary artery disease, stroke, and type 2 diabetes [17]. 
Excess weight is the most important cause of preventable cancer 
after smoking. Overweight and obesity are significantly associated 
with at least 13 types of cancer, including common (colorectal and 
post-menopausal breast) and hard-to-treat (pancreatic and renal) 
cancers [19]. The underlying mechanisms are not fully understood 
but include changes in hormone production, particularly oestrogen, 
which increases the risk of breast cancer and uterine cancer. Excess 
fat is also associated with over-production of insulin and other 
growth factors which encourage cells to divide more rapidly. Finally, 
as adipose tissue depots increase, macrophages within the fat secrete 
cytokines as part of a low-grade, chronic inflammatory response, 
which may increase the risk of cancer [20]. 

Obesity is also associated with an increased risk of musculoskel- 
etal disorders (especially osteoarthritis and lower back pain), re- 
productive and urological problems (including stress incontinence, 
infertility, polycystic ovarian syndrome, and erectile dysfunction), 
and respiratory problems (including sleep apnoea and asthma) [21]. 

Children with overweight or obesity exhibit elevated risk fac- 
tors for metabolic diseases, including raised blood pressure and in- 
sulin resistance. Moreover, they are significantly more likely to have 
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overweight as adults, and the longer duration of excess weight fur- 
ther increases their risks for developing heart disease and diabetes 
in later life [22]. 


Mental health 


A high prevalence of psychological comorbidities has been reported 
in patients with obesity, and mood disorders, anxiety, and low self- 
esteem are more common in individuals with obesity than their lean 
counterparts [23]. However, the relationship between obesity and 
psychological comorbidities is complex. In some cases, the rela- 
tionship may be causal (for example, obesity leads to psychological 
comorbidities and/or psychological comorbidities lead to obesity), 
and in others, the two may arise together because of common risk 
factors, which include lower socio-economic class, poor diet, and 
lower levels of physical activity [24]. A recent systematic review re- 
ported that, although obese individuals were at increased risk of 
developing common mental health conditions, the magnitude of 
the association was similar to the risk of individuals with mental 
health conditions developing obesity, suggesting that there is a bi- 
directional relationship [25]. 

Obesity is linked to poor self- image, low self-esteem, and social 
isolation, all contributors to depression [26]. People with obesity 
can also find themselves ostracized, stereotyped, and discriminated 
against, which may exacerbate the development of low mood and 
depressive symptoms [27]. Furthermore, individuals with obesity 
are more likely to develop diseases such as diabetes, osteoarthritis, 
and other conditions which cause disability (see Physical health, 
p. 1097), and many of these comorbidities independently increase 
the risk of depression [28, 29]. 

The relationship between obesity and mental health can be fur- 
ther complicated by the fact that some drugs used to treat mental 
health disorders may predispose to weight gain (Table 107.1). 


Benefits of weight loss 


Many of the health risks of overweight and obesity can be mitigated 
by weight loss. A recent review evaluating the impact of weight loss 
interventions on cardiovascular risk factors found that even losses in 
the region of 0-2.5% of initial body weight led to statistically signifi- 
cant reductions in systolic and diastolic blood pressure, total choles- 
terol, low-density lipoprotein (LDL) cholesterol, HbA1c, and fasting 
plasma glucose [30]. 

The strongest long-term data come from the large diabetes pre- 
vention trials [31, 32]. Here, intensive lifestyle interventions, 
including changes to diet and physical activity to achieve weight loss, 
led to significant reductions in the cumulative incidence of type 2 
diabetes [33]. In the 4-year diabetes prevention programme trial in 
the United States, the incidence of type 2 diabetes in the intensive 
lifestyle intervention 10 years after baseline was 4.8 cases per 100 
person years (95% CI 4.1-5.7), compared to 11.0 (95% CI 9.8-12.3) 
in the control group [34]. Similarly, in the Finnish diabetes preven- 
tion programme trial, at 13-year follow-up, the group randomized to 
the intensive lifestyle intervention sustained a significantly lower in- 
cidence of diabetes, compared to a minimal control group (adjusted 
hazard ratio 0.61, 95% CI 0.48-0.79) [32]. 

Reductions in body weight are associated with a roughly linear 
improvement in other cardiometabolic risk factors, with every 10 kg 


weight loss associated with a reduction of 0.23 mmol/L in total chol- 
esterol [35] and a decrease in blood pressure of 4.6 mmHg diastolic 
and 6.0 mmHg systolic [36]. In clinical practice, it may be more 
useful to consider the degree of weight loss which an individual 
needs to achieve to observe clinically measurable improvements in 
health. These are summarized in Table 107.2 [37]. 

The relationship between weight change and mental health 
outcomes is less well established. In cohort studies, weight loss is 
sometimes associated with adverse effects. However, this may be 
attributable to unintentional weight loss associated with disease, 
which adversely affects mental health, or a common antecedent 
such as bereavement or other life events. In contrast, randomized 
controlled trials of weight loss interventions tend to show improve- 
ments in mental health, in particular reductions in depression and 
improvements in self-esteem and general mood [38, 39]. However, 
it is difficult to disentangle whether these improvements are due to 
the weight loss or to the support provided for weight loss. Most ran- 
domized controlled trials of weight loss interventions exclude par- 
ticipants with serious mental illness, but those which have tested 
weight loss interventions in this group have not reported exacerba- 
tions of mental illness, and in some cases, there has been clinically 
significant weight loss and improvements to overall health [40, 41]. 

Evidence from randomized controlled trials largely suggests 
that the improvements in health outcomes are independent of the 
method of weight loss. However, some special considerations may 
apply in relation to bariatric surgery and mental health outcomes 
(see Bariatric surgery, p. 1100). 


The rise in obesity and evidence of the health benefits of weight loss 
point to the need for effective interventions. Many people seeking to 
lose weight do so without professional support, and there is relatively 
limited evidence on the content or effectiveness of these efforts. 

In recent years, there has been a significant expansion in the pro- 
vision of weight management services within health care systems 
and such interventions are associated with greater weight loss than 
self-guided weight loss attempts. Interventions range from various 
forms of bariatric surgery to behavioural interventions, including 
individual counselling, group support, and total or partial meal re- 
placements. There has been extensive investment by the pharma- 
ceutical industry to develop novel anti-obesity agents, but progress 
has proved to be challenging. 


Behavioural interventions 


Structured behavioural weight management programmes should 
be considered the first port of call and mainstay of treatment for 
people with overweight or obesity. By focusing on improvements 
in diet quality and increases in physical activity, these interventions 
can support weight loss and bring weight-independent health bene- 
fits. They can also include interventions without person-to-person 
contact, that is, ‘self-help’ interventions. A recent systematic review 
found that provision of self-help resources could lead to an add- 
itional 1.85 kg of weight loss at 6 months [42]. However, most be- 
havioural weight management programmes offered in health care 
settings include person-to-person contact to provide support to 
follow a hypoenergetic diet, together with recommendations for 
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Table 107.2 Therapeutic benefits of weight loss in patients with overweight or obesity 


Condition Amount of weight loss Notes on possible lifestyle interventions 
required 


Asthma/reactive airway disease 7-8% 


Depression None defined 


Dyslipidaemia (elevated triglycerides and 


reduced high-density lipoprotein cholesterol) therapeutic targets 


>10% 
>10% 
>5% to 15% 


Female infertility 
Gastro-oesophageal reflux disease 


Hypertension 





Male hypogonadism 5-10% 

Non-alcoholic fatty liver disease 4-10% 

Obstructive sleep apnoea 7-11% 

Osteoarthritis >10% 

Polycystic ovarian syndrome 5-15% 

Type 2 diabetes (established) 5-15%, as needed, to lower 
HbA1c 





Type 2 diabetes (risk of developing) (people with 10% 
‘prediabetes’, metabolic syndrome, or ‘high risk’ 
based on validated paradigms) 


Urinary stress incontinence in women 5-10% 


5-10%, as needed, to achieve 





o specific recommendation 
‘Should be offered a structured lifestyle intervention’ 


Should include a physical activity programme and reduced-calorie healthy 
meal plan that minimizes sugars and refined carbohydrates, avoids trans fats, 
imits alcohol use, and emphasizes fibre 


o specific recommendation 

o specific recommendation 

Should include caloric reduction and physical activity 

o specific recommendation 

Caloric restriction and moderate to vigorous physical activity 


o specific recommendation 











ust include physical activity programme 


o specific recommendation 





Should be considered, regardless of duration or severity of disease 


Should include caloric reduction and physical activity 








o specific recommendation 


Source: data from Endocr Pract., 22(Suppl 3), Garvey WT, Mechanick JI, Brett EM, et al., American Association of Clinical Endocrinologists and American College of Endocrinology 
comprenehsive clinical practice guidelines for medical care of patients with obesity, pp. 1-2013, Copyright (2016), American Association of Clinical Endocrinologists. 


increased physical activity. A comprehensive systematic review con- 
ducted in 2014 to inform British guidelines for weight management 
found that, on average, participants randomized to behavioural 
weight management programmes, typically lasting 12-24 weeks, 
lost 3.7 kg after 1 year, approximately 2.8 kg more than those ran- 
domized to usual care or minimal control conditions. No adverse 
effects were detected. Findings from this review suggested that most, 
but not all, behavioural weight management programmes were ef- 
fective, but the reasons behind variations in programme effective- 
ness remain largely unknown [43]. After many years of intense focus 
on the macronutrient content of the diet (for example, low fat, low 
carbohydrate, high protein), there is little evidence that diet com- 
position leads to differences in weight change over a year or more 
and growing recognition of the importance of adherence to the 
programme as a predictor of weight loss [44, 45]. For example, in 
a 2009 trial published in New England Journal of Medicine, parti- 
cipants were randomly assigned to one of four diets which varied 
based on the recommended percentages of energy derived from fat, 
protein, and carbohydrate. Weight loss was not significantly dif- 
ferent between treatment groups, but adherence-related measures 
were strongly associated with weight loss, regardless of group [45]. 
Accordingly, interest has grown in understanding the techniques 
which may help an individual change his or her behaviour, including 
self-monitoring, goal setting, and planning behaviours. In a 2014 
systematic review to inform British guidelines, programmes which 
encouraged calorie counting and comparison of an individual's 
behaviour with that of others were associated with greater weight 
loss [43]. 

Behavioural weight management programmes may also include 
provision of meal replacement products to substitute for one or two 
usual meals or a programme involving total diet replacement with 


specially formulated shakes, soups, or other basic foods which are 
nutritionally complete with respect to micronutrients but usually 
contain 800 kcal per day or less, sometimes referred to as very low- 
energy diets. Results from a systematic review suggest a mean weight 
loss of 10.3 kg at 1 year after being offered a very low-energy diet 
with behavioural support, 3.9 kg greater weight loss than the control 
groups offered a behavioural programme alone [46]. Results from a 
recent systematic review of partial meal replacements as part of diets 
involving more than 800 kcal per day found weight losses midway 
between those reported from behavioural programmes alone and 
those from very low-energy diets [47]. Together, these data suggest 
that tight control over the type and amount of food through the 
use of a highly structured dietary programme can bring additional 
weight loss, compared to dietary advice alone, when delivered as 
part of behavioural weight management programmes. 


Pharmacological interventions 


The physiological control systems regulating both energy intake and 
expenditure provide a series of logical targets for pharmacological 
intervention to reduce intake or increase expenditure. However, the 
search for safe and effective drugs to treat obesity has been difficult. 
Though numerous drugs have been developed which lead to suc- 
cessful weight loss, their side effects have frequently led to drugs 
being withdrawn during development or soon after coming to the 
market. In other cases, non-severe side effects or insufficient weight 
loss limit adherence and mitigate against long-term use, with 
weight typically regained once the medication is stopped. A new 
generation of treatments are now coming to market, particularly 
linked to the treatment of people with diabetes with overweight or 
obesity, though the cost is likely to be prohibitive for many health 
care systems. 
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Generally, obesity pharmacotherapies fall under three 
categories: those which aim to decrease energy intake; those which 
aim to increase energy expenditure; and those which aim to de- 
crease energy absorption. Each of these is discussed briefly later in 
this chapter. 

By far, the simplest are compounds that reduce absorption. The 
most common of these is orlistat, which is available both over-the- 
counter or by prescription. Orlistat prevents the absorption of a pro- 
portion of dietary fat which is instead excreted. However, unless the 
patient also adheres to a low-fat diet, this malabsorption will lead to 
oily, loose stools, which discourages patients continuing with medi- 
cation. A 2011 review for the US Preventive Services Task Force 
found that orlistat led to an additional 3 kg weight loss at 1 year, 
compared to a behavioural weight management programme alone 
[48]. A longer-term trial in which patients were randomized to re- 
ceive orlistat or placebo for 4 years found 2.8 kg greater weight loss 
and a 37% reduction in the incidence of diabetes [49]. For patients 
who are able to make the necessary dietary changes to minimize the 
side effects (and which also enhance weight loss), orlistat can be a 
useful and safe aid for weight loss. 

Substances such as caffeine are known to act as metabolic stimu- 
lants and are associated with modest increases in weight loss, but are 
not recommended as weight loss aids [50]. Drugs which have greater 
metabolic effects also tend to be associated with a greater incidence 
and severity of adverse effects, especially increases in heart rate or 
blood pressure. The most well-known metabolic stimulant used to 
treat obesity is phentermine, today often combined with topiramate. 
Though clinical trials of this combination have shown greater weight 
loss than previously seen with orlistat, there are concerns about its 
long-term psychiatric and circulatory system effects, which led to its 
rejection by the European Medicines Agency (EMA). It is, however, 
available in the United States after approval by the Food and Drug 
Administration (FDA) in 2012 [51]. 

To date, the overwhelming bulk of work relating to pharma- 
cotherapies for obesity has been focused on drugs to reduce energy 
intake. These include centrally acting treatments which act on neural 
pathways or those that work on gut mechanisms to boost gut sig- 
nals and enhance appetite control. The former includes lorcaserin, 
which works via serotonergic mechanisms. Though phase 3 trials 
have demonstrated effectiveness for weight loss, use is again limited 
by concerns over adverse events; similar to phentermine, lorcaserin 
is licensed for use in the United States, but not in Europe [51]. 
Liraglutide, an injectable long-acting glucagon-like peptide-1 re- 
ceptor agonist initially developed for the treatment of type 2 dia- 
betes, is an example of the latter. A recent systematic review found 
liraglutide consistently resulted in a 4-6 kg weight loss (studies 
ranged from 20 weeks to 2 years), with comparative data suggesting 
weight loss was greater than that seen with orlistat or phentermine/ 
topiramate [52]. Some studies also observed improvements in blood 
pressure and glucose. Both the FDA and EMA have licensed the use 
of liraglutide for weight loss, but long-term data are lacking and the 
high cost means this treatment is likely to be reserved for patients in 
whom other treatment options have failed. 


Bariatric surgery 


Surgical interventions for the primary purpose of weight loss have 
become increasingly popular in recent years, with an estimated 
10-fold increase in the number of procedures since 2000 [53]. In 


most countries, bariatric surgery is only recommended for people 
with severe obesity (BMI >35 kg/m”) or for those with significant 
obesity-related comorbidities. Unlike behavioural interventions that 
rely on an individual's motivation to adhere to a hypoenergetic diet, 
bariatric surgery leads to weight loss either by physically restricting 
food intake (restriction) and/or by limiting nutrient absorption 
from the gastrointestinal tract (malabsorption), leading to reduced 
energy availability and weight loss. 

There are numerous surgical procedures available, the most 
common being the laparoscopic adjustable gastric band (Lap-band) 
and Roux-En-Y gastric bypass (RYGB), the latter being associated with 
larger weight losses. One of the largest and longest studies of outcomes 
following bariatric surgery is the Swedish Obesity Study in which a co- 
hort of 2010 patients receiving bariatric surgery were matched to acon- 
trol population and followed up over 20 years [54]. The mean changes 
in body weight after 2, 10, 15, and 20 years were -23%, -17%, -16%, 
and -18% in the surgery group. Compared with control, bariatric sur- 
gery was associated with a long-term reduction in overall mortality, as 
well as significantly decreased incidences of diabetes, myocardial in- 
farction, stroke, and cancer. The remission rate of type 2 diabetes was 
also significantly greater in the surgery group at 2 and 10 years. Though 
bariatric surgery clearly represents an invasive procedure, risks are 
low when conducted by experienced surgeons; however, patients do 
require long-term follow-up, especially after bypass operations where 
nutrient absorption is permanently impaired [55]. 

It is estimated that at least 40% of patients presenting for surgery 
have at least one psychiatric diagnosis [56], with depression, anxiety, 
and eating disorders being the most common conditions [57, 58]. 
Overall, bariatric surgery is associated with post-operative improve- 
ment in depressive symptoms, self-esteem, health-related quality of 
life, and body image in patients with obesity [59, 60]. However, it is of 
concern that there are reports that surgery may predispose to more 
serious psychological side effects. A systematic review reported that 
the rate of suicide among those who have undergone bariatric pro- 
cedures is more than five times that of the general population [61]. 
Additionally, patients who undergo bariatric surgery may be more 
likely to develop alcohol and drug dependencies [62]. In light of this, 
careful consideration should be made as to the suitability of those 
presenting for surgery and ongoing provision of appropriate support 
to enhance physical and mental health outcomes. 


Other therapies 


Though the use of other therapies for self-management of weight 
loss—including liposuction, food supplements, hypnotherapy, and 
acupuncture—is thought to be relatively common, there is very 
limited evidence to support their use. Where these therapies have 
been tested in randomized controlled trials, trials have been typic- 
ally been small, short term, and limited by methodological issues, 
hampering the ability to draw conclusions about the effects of these 
therapies [63, 64]. 


Treating obesity in children and adolescents 


Although there has been considerable progress in developing 
weight management programmes for adults, there has been less 
success in developing effective programmes to treat obesity in 
children. A 2009 Cochrane review found limited quality data to 
recommend that one treatment programme be favoured over an- 
other but found that behavioural interventions aiming to change 


diet and physical activity could reduce the level of obesity in chil- 
dren and adolescents at 6 and 12 months after programme start. 
Some studies in adolescents with moderate to severe obesity 
found a reduction in weight with the use of medication (orlistat 
or sibutramine), in addition to a behavioural programme, but side 
effects were frequent [65]. 

As well as uncertainty over which types of programmes might be 
most effective, the provision of treatment for obesity in children and 
adolescents is also limited by a paucity of services in many coun- 
tries. A systematic review of interventions for obesity in children 
and adolescents suitable for use in generalist primary care found 
only marginal reductions in BMI Z-score [66]. Children with se- 
vere obesity should be referred to a specialist centre able to con- 
duct a thorough investigation of the genetic and metabolic status 
of the child and to offer appropriate interventions, with access to 
ongoing multi-disciplinary support for the child and their family 
[67]. On occasions, and for older children, this may include the use 
of pharmacotherapy or surgery; however, behavioural interventions 
are the mainstay of obesity treatment for children [65]. 

Early intervention is preferable because young children are rap- 
idly growing and less weight change is needed to achieve a healthy 
weight [68]. A US task force has suggested that interventions should 
be of moderate to high intensity (>75 hours over 6-12 months) 
and include dietary, physical activity, and behavioural counselling 
components [69]. Some programmes focus the intervention on 
parents, especially for very young children, teaching an authoritative 
parenting style and emphasizing the importance of the family en- 
vironment as a key determinant of a child’s weight [70, 71]. Others, 
modelled on adult community weight management services, aim 
to equip children with practical skills relating to diet and physical 
activity. These interventions have shown some short-term success 
[72]. However, family-based programmes which take a compre- 
hensive approach to family behaviour change appear to be more ef- 
fective for older children than treating parent or child alone [73]. 


Weight loss maintenance 


Behavioural, pharmacological, and surgical interventions have all 
been shown to lead to clinically significant weight loss. However, 
post-intervention weight regain is very common, particularly in 
the case of behavioural and pharmacological treatments, as many 
people find it hard to maintain the cognitive control necessary to 
restrain their appetite and adhere to a hypoenergetic diet. 

Research is ongoing into effective weight loss maintenance treat- 
ments, but to date, there is no specific intervention which has 
shown convincing and reproducible improvements in long-term 
weight control. A systematic review of non-surgical interventions 
for weight loss maintenance found that behavioural interventions 
focusing on both diet and physical activity resulted in an average 
difference of -1.56 kg (95% confidence interval -2.27 to -0.86 kg) 
in weight regain, compared with controls, at 12 months. In the same 
review, orlistat combined with behavioural interventions resulted in 
a -1.80 kg (-2.54 to -1.06) difference in regain, compared with pla- 
cebo, at 12 months [74]. Data from cohort studies have identified 
a number of characteristics associated with long-term weight loss 
maintenance, including dietary restraint, frequent self-weighing, 
and engagement in leisure time physical activity [75]. Increases in 
physical activity also appear to be important in randomized con- 
trolled trials, perhaps in offsetting small excesses in energy intake; 
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systematic reviews have shown better long-term weight loss from 
interventions combining diet and physical activity than those 
aiming to change diet alone [76]. 

However, weight regain is common, and at present, many patients 
may need to accept the need for repeated intermittent weight loss 
interventions for long-term weight control. Evidence from large 
trials with long-term follow-up suggest some residual health bene- 
fits from initial weight loss (32, 34, 77]. 


Preventing weight gain 


The high and increasing prevalence of obesity makes interventions 
to prevent weight gain a public health priority. Such interventions 
offer the potential for benefit to the whole population, regardless of 
weight status, by preventing primary weight gain among lean indi- 
viduals, reducing the risk of further weight gain among those with 
overweight or obesity, and reducing weight regain post-weight loss. 

A 2014 systematic review found that in adults, weight gain pre- 
vention interventions such as individual counselling and exercise 
programmes were associated with reductions in weight and im- 
provements in health outcomes, but it was uncertain as to whether 
benefits were clinically meaningful and could be maintained over 
time [78]. In children, a recent systematic review found moderate- 
quality evidence for school-based obesity prevention interventions 
but concluded more research was needed to evaluate programmes 
in other settings and of other designs [79]. Overall, progress to 
constrain weight gain has been limited. Though isolated areas and 
interventions have shown improvements, no country to date has 
managed to reverse the rising prevalence of obesity, suggesting that 
individual-level interventions or actions in specific settings may be 
insufficient to offset wider environmental drivers and emphasizing 
the need for concerted actions by the whole of society to prevent 
obesity across the life course [80]. 


Conclusions 


Obesity is a chronic, relapsing condition that is associated with sig- 
nificant morbidity and leads to premature death. The health risks of 
obesity can be offset by effective interventions, which may need to be 
offered on repeated occasions. As the range of effective interventions 
increases, attention is focusing on the personalization of care and the 
optimal combination of interventions to achieve weight loss and to 
potentially improve specific clinical conditions. Though drugs and 
surgery can modify an individual's internal physical environment, 
these treatments may be enhanced when combined with behav- 
ioural strategies to help individuals manage the external, obesogenic 
environment. Moreover, stronger public health interventions to pre- 
vent obesity have the potential to enhance individuals’ weight loss 
efforts and to disproportionately benefit people with the greatest 
susceptibility to weight gain. 
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Elimination disorders (EDs) comprise the three broad categories 
of faecal incontinence (FI), daytime urinary incontinence 
(DUI), and nocturnal enuresis (NE). They are very common 
and distressing disorders in children. Thus, 1-3% of typically 
developing 7-year-old children are affected by FI, 2-3% by DUI, 
and up to 10% by NE [1]. These rates are even higher in children 
with special needs, for example those with intellectual disability 
[2]. Untreated, they can persist into adolescence and even adult- 
hood. In addition, the rate of comorbid child psychiatric disturb- 
ances is markedly increased in children with EDs—30-50% of 
children with FI, 20-40% of those with DUI, and 20-30% of those 
with NE are affected by clinically relevant behavioural or emo- 
tional disorders [3]. 

EDs are a heterogenous group of disorders with specific signs, 
symptoms, aetiologies, and treatment approaches. Although the 
ICD-10 [4] and DSM-5 [5] classification systems provide a gen- 
eral framework, they do not reflect the current state of research [6]. 
Most of the expanding research on EDs in the past decades has been 
conducted by paediatricians (especially nephrologists and gastro- 
enterologists), as well as by paediatric urologists, epidemiologists, 
and basic scientists. Unfortunately, few contributions have come 
from the field of child psychiatry, despite the high comorbidity rate 
of psychiatric disorders and the preponderance and effectivity of 
cognitive behavioural treatment approaches. This output has re- 
sulted in two important classification systems, which have super- 
seded those of the ICD-10 [4] and the DSM-5 [5] in clinical practice 
and research: 


e The International Children’s Continence Society (ICCS) for DUI 
and NE [7]. 
e The Rome IV classification system for FI and constipation [8]. 


The aim of this chapter is to provide an overview of the current 
state of knowledge on EDs, as well as to outline practical approaches 
in assessment and treatment. As EDs can coexist, the chapter will 
follow the treatment sequence recommended for these combined 
forms of incontinence—starting with FI, followed by DUI and NE. 





FI (or encopresis) is a common disorder, which is associated with 
stigmatization, distress, and associated comorbid disorders despite 
good treatment approaches. Two main different subtypes can be dif- 
ferentiated: FI with and without constipation. 

Classification 

According to the DSM-5 classification system, encopresis occurs 
either voluntarily or involuntarily and is defined by defecation in 
inappropriate places [5]. A duration of at least 3 months, a fre- 
quency of once per month, and a minimum age of 4 years are also 
required for diagnosis. This is important, because soiling is still so 
common in 3-year-old children that it is considered to be part of 
typical development. Finally, organic causes have to be ruled out. 
Two subytpes are differentiated, that is, encopresis ‘with consti- 
pation and overflow incontinence’ and ‘without constipation and 
overflow incontinence. As primary and secondary forms (defined 
by continent intervals) are identical, this differentiation is not im- 
portant in encopresis. 

The international Rome IV classification system avoids the term 
encopresis altogether and recommends the neutral term of faecal 
incontinence [8]. Two basic conditions are differentiated: functional 
constipation and non-retentive FI. The second change is that consti- 
pation is defined as the main disorder, which can coincide, but not 
necessarily, with soiling. Constipation is more common than FI, and 
the Rome IV stresses that it is aetiologically the overriding disorder. 
The second condition is non-retentive FI, denoting soiling without 
any signs of constipation. The Rome IV diagnostic criteria are listed 
in Box 108.1 [8]. The criteria are stricter than those of DSM-5, more 
specific, and based on empirical research, and the duration has been 
reduced to 1 month. 


Prevalence 


Encopresis is a common disorder, affecting 1-3% of children from 
4 years of age (the definitional age) throughout school years and can 
persist into adolescence and even adulthood [9, 10]. The prevalence 
depends on the definitions used; thus, 5.4% of 7-year-old children 
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Box 108.1 Rome IV diagnostic criteria for functional 


constipation and non-retentive stool incontinence 





| Diagnostic criteria for functional constipation 
Must include two or more of the following, occurring at least once 
per week for a minimum of 1 month, with insufficient criteria for a 
diagnosis of irritable bowel syndrome: 
1 Two or fewer defecations in the toilet per week in a child of a devel- 
opmental age of at least 4 years. 
2 Atleast one episode of faecal incontinence per week. 
3 History of retentive posturing or excessive volitional stool retention. 
4 History of painful or hard bowel movements. 
5 Presence of a large faecal mass in the rectum. 
6 History of large-diameter stools that may obstruct the toilet. 
After appropriate evaluation, the symptoms cannot be fully explained 
by another medical condition. 
Il Diagnostic criteria for non-retentive faecal incontinence 
At least a 1-month history of the following symptoms in a child with a 
developmental age older than 4 years: 























1 Defecation into places inappropriate to the sociocultural context. 

2 No evidence of faecal retention. 

3 After appropriate medical evaluation, the faecal incontinence 
cannot be explained by another medical condition. 





Reproduced from Gastroenterology, 150(6), Hyams JS, Di Lorenzo C, Saps M, et al., 
Childhood functional gastrointestinal disorders: child/adolescent, pp. 1456-1468, 
Copyright (2016), with permission from the AGA Institute. 














soiled in total, but only 1.4% once or more per week [11]. Boys are 
more commonly affected by encopresis than girls (rates: 3-4 times 
higher in boys). Encopresis occurs almost exclusively during the day. 
The rare occurrence of nocturnal encopresis is often associated with 


Table 108.1 Differences between encopresis with constipation and 
non-retentive faecal incontinence 


Functional Non-retentive faecal 
constipation incontinence 

















Bowel movements Seldom Daily 
Large amounts of stools Yes No 
Normal stools (consistency) alf Nearly all 
Pain during defecation alf Seldom 
Abdominal pain Often Seldom 
Appetite Reduced Good 
Colon transit time Long Normal 
Palpable abdominal mass Often None 
Palpable rectal mass Often Never 
Rectal diameter (sonography) Increased Normal 
Daytime urinary incontinence Tenth Seldom 
Nocturnal enuresis Third Tenth 
Comorbidity of behaviouraland 30-50% 30-50% 
emotional disorders 

Laxative therapy Helpful Not helpful, even 


worsening 


Source: data from Arch Dis Child., 71(3), Benninga MA, Buller HA, Heymans HS, et al., 
Is encopresis always the result of constipation?, pp. 186-193, Copyright (1994), BMJ 
Publishing Group Ltd and the Royal College of Paediatrics and Child Health; Arch Dis 
Child, 89(1), Bennigna MA, Voskuijl WP, Akkerhuis GW, et al., Colonic transit times and 
behaviour profiles in children with defecation disorders, pp. 13-16, Copyright (2004), 
BMJ Publishing Group Ltd and the Royal College of Paediatrics and Child Health. 


organic causes. Constipation is more common than encopresis, with 
a median prevalence of 9% worldwide [12]. 


Clinical signs 


The two subtypes do present with different spectrum of signs and 
symptoms (Table 108.1) [13, 14]. Children with functional con- 
stipation have a reduced number of bowel movements with large 
stools of altered consistency (too soft or too hard). Defecation is 
often painful. Abdominal pains and reduced appetite are typical. The 
colon transit time is increased, and abdominal and rectal masses are 
palpable. The rectal diameter is increased on ultrasound (>30 mm). 
DUI, and even NE, can coexist. Thirty to 50% have additional emo- 
tional and behavioural disorders. Finally, laxative therapy is helpful. 
Children with non-retentive FI have daily bowel movements 
of normal size and consistency. Pains are not typical, and appe- 
tite is good. Colon transit time is normal, and no stool masses can 
be palpated. Enuresis and urinary incontinence are less common, 
while comorbidity with psychological disturbances is also 30-50%. 
Finally, laxatives have no effect—and can even worsen the soiling. 


Aetiology 


Genetic factors do play a role in constipation, as shown in a large 
twin study—less in FI [15]. Constipation often develops from 
acute constipation, which can be triggered by both somatic (such 
as painful defecation due to anal fissures) or psychological factors, 
including stressful life events (such as separation of parents) [16]. 
As a consequence, chronic stool retention can follow, characterized 
by painful defecation, avoidance of defecation, and contraction of 
the external anal sphincter. Faeces are accumulated in the colon and 
rectum and colon transit times increase, while peristalsis and sen- 
sation decrease. Fluid withdrawal induces large, hard faecal masses. 
Soiling occurs because of interference with rectal function—and by 
fresh stools bypassing these faecal masses. 

The aetiology of non-retentive FI is not known [9]. It is not due 
to psychological factors alone, as the comorbidity rate of clinically 
relevant psychological symptoms is comparable to that of functional 
constipation [3, 13, 14]. 

Overall, 30-50% of all children with FI are affected by a comorbid 
emotional or behavioural disorder. Ina large population-based study 
of children aged 7.5 years, children with encopresis had significantly 
increased rates of separation anxiety (4.3%), specific phobias (4.3%), 
generalized anxiety (3.4%), attention-deficit/hyperactivity disorder 
(ADHD) (9.2%), and oppositional defiant disorder (ODD) (11.9%) 
[11]. This means that children with FI show a heterogenous pattern 
of both internalizing and externalizing disorders. 


Assessment 


The basic steps of assessment are shown in Table 108.2. The standard 
diagnostic procedures are non-invasive, are sufficient for most chil- 
dren, and can be performed in an outpatient setting. 

The history is the most important aspect of assessment. 
Questionnaires can be very helpful to gain additional information. 
A very useful chart is the Bristol Stool Form, which depicts seven 
types of stool, depending on consistency and form. 

A physical paediatric and neurological examination is recom- 
mended to rule out organic causes. Medical causes account for 5% 
in constipation and for 1% in non-retentive FI. These include anal 
fissures, abscesses, skin tags, dermatitis, anal stenosis, anorectal 
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Table 108.2 Assessment of elimination disorders 


constipation nocturnal enuresis 

Standard Assessment 
Sufficient for most cases 
History History 
Questionnaires and charts 48-hour frequency/volume chart 
Bristol Stool Form Scale Questionnaires and charts 
Physical examination Physical examination 
Sonography Sonography 


Screening for behavioural 
disorders or full child psychiatric 
assessment 


Screening for behavioural disorders or 
full child psychiatric assessment 


Urinalysis 
Extended Assessment 
Only if indicated 

Uroflowmetry and pelvic floor EMG 
Stool bacteriology Urine bacteriology 


Other diagnostic procedures: plain 
abdominal X-ray, colon contrast 
X-ray, MRI of colon, manometry, 
endoscopy and biopsy 


Other diagnostic procedures: radiological 
examinations, invasive urodynamics, 
cystoscopy, etc. 


malformations, cystic fibrosis, Hirschsprung’s disease, coeliac dis- 
ease, cow milk intolerance/allergy, cerebral palsy, spina bifida, 
myelomeningocele, and side effects of medication. 

Ultrasound is very helpful, as an enlarged rectal diameter of 
>30 mm is indicative of constipation [17]. Also, by sonography, a 
rectal examination can be avoided. Due to high comorbidity rates, a 
clinical child psychiatric assessment, or at least screening with valid- 
ated parental behavioural questionnaires, is recommended. If many 
problem scores are checked, assessment and counselling or, if neces- 
sary, treatment are recommended [3]. All other examinations are 
not indicated routinely, but only ifan organic type of FI is suspected 
(such as in Hirschsprung’s disease). 


Treatment 


The first step comprises counselling, provision of information, and 
psychoeducation. Unspecific measures, such as enhancing motiv- 
ation and alleviating guilt feelings, are also helpful. Only if the child’s 
food intake is restricted to low-fibre foods is a change in diet recom- 
mended. Much more important is to increase fluid intake, as many 
children do not drink sufficient amounts of liquids. 

For both types, toilet training is the basic treatment approach. 
Children are asked to go to the bathroom three times a day after 
mealtimes and to sit on the toilet 5-10 minutes in a relaxed way. 
Children should be given a footstool if their feet do not reach the 
ground and are allowed to play or read, as they like. They do not need 
to pass urine or stools every time. These toilet sessions are docu- 
mented in charts. The toiling training procedures have the effect of 
regulating defecation habits over the day, as the postprandial defeca- 
tion reflexes are most active after mealtimes. 

In non-retentive FI, this toilet training is the main treatment. 
Laxatives are not indicated. In children with constipation, toilet 
training is combined with laxative treatment—first disimpaction, 
then maintenance treatment. 


Disimpaction is necessary to evacuate faecal masses at the be- 
ginning of treatment. The preferred way is orally with high doses 
of polyethyleneglycol (PEG—macrogol), for example up to 1.5 g/ 
kg body weight per day, until disimpaction is successful. If not suf- 
ficient, rectal disimpaction with enemas are a good alternative— 
containing either phosphate or sorbite. Recommended doses are 30 
mL per 10 kg body weight or half an enema for preschool children, 
and 3/4 to one enema in schoolchildren. Often, these have to be re- 
peated several times. Both forms of disimpaction are effective [18]. 

After successful disimpaction, long-term maintenance treat- 
ment over a minimum of 6-24 months is recommended to avoid 
reaccumulation of stool masses. In addition to toilet training three 
times a day after mealtimes, oral laxatives are given. The preferred 
and most effective laxative is PEG (macrogol), a long, linear osmotic 
polymer that binds water [19, 20]. Side effects, such as abdominal 
pain, are rare. The initial dose is 0.4 g/kg body weight per day in two 
doses. If stools are too hard, the dose is increased—if too soft, re- 
duced. The therapeutic range varies from 0.2 to 1.4 g/kg body weight 
per day [21]. Lactulose, a disaccharide, is less effective and has more 
side effects. The dosage of liquid lactulose ranges from 1 to 3 mL/kg 
body weight per day in 1-3 doses. 

The long-term course of both types of FI is not favourable. 
Constipation and non-retentive FI can persist into adolescence and 
even young adulthood [9, 10]. Therefore, constipation and non- 
retentive FI needs to be treated actively, with regular follow-up visits. 

In children with a combination of EDs, treatment of functional 
constipation and non-retentive FI should always be the first step for 
two reasons; on the one hand, both DUI and NE can remit by suc- 
cessful treatment; on the other hand, the outcomes of DUI and NE 
are much worse if constipation and FI are not addressed. 





Daytime urinary incontinence 


DUI and NE are even more common disorders than FI. They are 
lumped together in the ICD-10 [4] and DSM-5 [5] classification sys- 
tems under the term enuresis. DSM-5 defines enuresis as an invol- 
untary or intentional wetting of children 5 years of age or older after 
organic causes have been ruled out. The wetting must have persisted 
for 3 months or longer and must occur at least twice per week or else 
must cause clinically significant distress or impairment. Only noc- 
turnal, diurnal, or combined enuresis are differentiated—no other 
subtypes. As DSM-5 is not up-to-date with the current state of re- 
search, newer classification systems are needed [6]. 

The standard classification is provided by the ICCS [7]. It is based 
on international, multiprofessional consensus, including paediatri- 
cians, paediatric urologists, and child psychiatrists. The criteria have 
become mandatory at international meetings and for publication in 
journals. 

According to the ICCS terminology, urinary incontinence is a gen- 
eral umbrella term for the symptom of any ‘uncontrollable leakage of 
urine, which can be continuous (which is rare and usually caused by 
organic conditions) or intermittent (which is common and usually 
functional). An overview is provided in Table 108.3. 

Any daytime wetting is termed daytime urinary incontinence, 
which can be organic (structural, neurogenic, or due to other paedi- 
atric causes) or, in most cases, functional. The term ‘diurnal enuresis’ 
is obsolete and should be avoided. If a child wets during sleep and 
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during the day, he or she would receive two diagnoses: one for the 
NE and one for the DUI. 

Further specifications of the ICCS criteria are shown in Table 
108.3. The lower chronological age limit is 5 years, and a dur- 
ation of 3 months and a frequency of once per month are required. 
Frequent and infrequent wetting can be differentiated. Rare wetting 
of less than once a month is considered to be a symptom, but not a 
condition. Organic causes have to be ruled out. These encompass 
structural causes (such as epi- and hypospadias, urethral valves and 
stenosis, other malformations of the urogenital tract), neurological 
causes (such as spina bifida, tethered cord syndrome and others), 
and medical causes (diabetes, urinary tract infections). Finally, 
many different subtypes of DUI can be differentiated. 

The three main syndromes of DUI are urge incontinence, 
voiding postponement, and dysfunctional voiding. Each types 
of DUI has typical symptoms, which are summarized in Table 
108.4. Urge incontinence (or overactive bladder) is characterized 
by urge symptoms, increased micturition frequency, and small 
voided volumes. In contrast, in voiding postponement, low mic- 
turition frequency and postponement of micturition are typical. 
In both cases, children employ holding manoeuvres to avoid wet- 
ting. Dysfunctional voiding is a disorder of the emptying phase; 
instead of relaxing the sphincter muscle, it is contracted paradox- 
ically. Straining and an interrupted urine stream are indicative of 
this disorder. Stress incontinence is rare in children—in contrast to 
adults. Wetting during coughing, sneezing (that is, any increase 
of intra-abdominal pressure), and small volumes are typical. 
Giggle incontinence is characterized by wetting during laughing 
and large volumes with apparently complete emptying. Detrusor 
underactivity, a decompensation of the detrusor muscle, is marked 
by an interrupted stream; emptying of the bladder is possible only 
by straining. Wetting exclusively 5-10 minutes after normal mic- 
turition is typical of a vaginal reflux. Functional obstruction can 
impede urine flow, and diurnal urinary frequency of childhood is a 


Table 108.3 Basic definitions of the ICCS for daytime urinary 
incontinence (DUI) and nocturnal enuresis (NE) 


Intermittent incontinence ntermittent incontinence 
Wetting during day (wake state) Any wetting during sleep 


Minimum chronological age of 
5 years 


inimum chronological age of 
5 years 


Duration of 3 months Duration of 3 months 


Minimum frequency of once 
per month 

Symptom, not disorder: less than 
once per month: 

Frequent incontinence: 24 times/ 


inimum frequency of once 

per month 

Symptom, not disorder: less than 
once per month: 

Frequent incontinence: 24 times/ 


week week 
Infrequent incontinence: <4 times/ nfrequent incontinence: <4 times/ 
week week 


Exclusion of organic causes Exclusion of organic causes 





Many different subtypes Four different subtypes 


Source: data from Neurourol Urodyn, 35(4), Austin PF, Bauer S, Bower W, et al., 

The Standardization of Terminology of Bladder Function in Children and 
Adolescents: Update Report from the Standardization Committee of the International 
Children’s Continence Society (ICCS), pp. 471-81, Copyright (2016), John Wiley 

and Sons. 


benign, self-limiting condition with typical signs of frequency and 
urgency (often without incontinence). 


Prevalence and clinical presentation 


One and a half times more girls than boys are affected by DUI. Two 
to 3% of 7-year olds and <1% of adolescents wet during the day. Urge 
incontinence and voiding postponement are the two most prevalent 
types. The clinical signs of the three common disorders will be pre- 
sented in more detail. 

Children with urge incontinence are characterized by a high mic- 
turition frequency of >7 times per day, with short intervals in be- 
tween. Children typically have urge symptoms—sometimes with 
sudden, intensive (imperative) urge. They wet small volumes, espe- 
cially during the afternoons, increasing with tiredness. They try to 
stop the urge by initiating holding manoeuvres such as contracting 
pelvic floor muscles, pressing the thighs together, holding the ab- 
domen, jumping from one leg to the next, sitting on the heels, and 
squatting with the heels pressed against the perineum (curtsey sign). 
Vulvovaginitis, perigenital dermatitis, and urinary tract infections 
are common. 

Typical signs of voiding postponement are a low micturition fre- 
quency (<5 times per day) and the habitual postponement of mic- 
turition in certain situations (school, play, reading, television). With 
increasing deferral and fullness of the bladder, holding manoeuvres 
are instituted (as in urge incontinence) until wetting cannot be 
avoided. Constipation and encopresis are common. Children have a 
high rate of psychological disorders such as ODD [22]. 

Dysfunctional voiding is characterized by repeated straining at the 
beginning and during micturition, intermittent and fractioned urine 
flow, incomplete bladder emptying with residual urine, urinary tract 
infections, stool retention, constipation, FI, and vesico-ureteral 
refluxes. 

The comorbidity rate of psychological symptoms and disorders 
among children with DUI (30-40%) is higher, compared to children 
with enuresis (20-40%) [3]. In a population-based study of 8242 
children at 7.5 years of age, externalizing disorders were prominent, 
with significantly increased rates of ADHD (24.8%), ODD (10.9%), 
and conduct disorders (11.8%) [23]. 


Aetiology 


Genetic factors play a role in DUI as well. Ina large population-based 
study, the odds ratio for daytime incontinence was increased by 3.28 
if the mother, and by 10.1 if the father, was affected [24]. Linkage 
studies demonstrated a positive linkage to chromosome 17 in urge 
incontinence [25]. Urge incontinence is caused by spontaneous 
contractions of the detrusor during the filling phase of the bladder, 
which are not sufficiently inhibited by the central nervous system 
[26]. Genetic factors also play a major role in giggle incontinence— 
but not in the other types of incontinence. 

Psychosocial factors do, however, play a major role in the aeti- 
ology of subtypes of daytime incontinence. Voiding postponement 
can be due to an acquired habit—or as one of many oppositional 
symptoms as part of ODD [22]. 


Assessment 


Assessment and a detailed diagnosis is the core of successful 
treatment—as each subtype of DUI responds best to their spe- 
cific treatment. The steps are outlined in Table 108.1 and are quite 
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Table 108.4 Classification of daytime urinary incontinence (DUI) 
with main distinguishing symptoms according to the ICCS 


Type of daytime wetting Main symptoms and features 


Common types 


Overactive bladder (OAB) and Urge symptoms, increased daytime voiding 
urge incontinence frequency, and small voided volumes 


Voiding postponement Infrequent micturitions <5 times per day, 


postponement 


Dysfunctional voiding Straining to initiate and during micturition, 


interrupted stream of urine 
Rare types 


Giggle incontinence Wetting during laughing, large volumes with 


apparently complete emptying 


Stress incontinence Wetting during coughing and sneezing, small 


volumes 





Underactive bladder Interrupted stream, emptying of bladder 


possible only by straining 


Vaginal reflux Wetting exclusively 5-10 minutes after 


normal micturition due to vaginal reflux 


Obstruction Impediment (can be functional) to urine 


outflow, decreased urine flow 


Diurnal urinary frequency of 
childhood 


Benign, self-limiting condition with typical 
signs of frequency and urgency 


Source: data from Neurourol Urodyn, 35(4), Austin PF, Bauer S, Bower W, et al., 

The Standardization of Terminology of Bladder Function in Children and 
Adolescents: Update Report from the Standardization Committee of the International 
Children’s Continence Society (ICCS), pp. 471-81, Copyright (2016), John Wiley 

and Sons. 


comparable to those of FI. Very important is the registration of mic- 
turition volumes and times, fluid intake, and other observations in 
a 48-hour micturition chart. Sonography of the bladder and kidney 
is a valuable instrument to detect malformations and measure the 
bladder wall thickness (which is often increased in DUI) and re- 
sidual urine after micturition. Urinalysis is recommended, and a 
dipstick is usually sufficient. Uroflowmetry is also non-invasive and 
is performed routinely in many centres, but is mandatory only in 
dysfunctional voiding. 


Treatment 


The basic approaches include provision of information, increasing 
fluid intake, counselling regarding micturition habits, and regis- 
tering incontinence episodes [27]. This non-pharmacological, 
non-surgical treatment of incontinence has been named 
‘urotherapy and contains many elements of cognitive behavioural 
therapy (CBT). Treatment is always symptom-oriented, with the 
aim of achieving complete dryness. Comorbid psychiatric dis- 
orders are addressed separately and will increase motivation and 
outcomes. 

The main focus in the treatment of urge incontinence isasymptom- 
oriented cognitive behavioural approach aimed at conscious con- 
trol of the urge without the use of the pelvic floor muscles, that is, 
holding manoeuvres. Children are instructed to register when they 
feel an urge, to go to the toilet immediately without using holding 
manoeuvres, and to register in a chart if their pants were dry or 
wet. If these simple techniques are not sufficient, pharmacotherapy 
with an anticholinergic medication is recommended in addition. 
First-line drugs are oxybutinin or propiverin. Oxybutinin is dosed 


initially at 0.3 mg per kg body weight/day in three doses and can 
be increased to 0.6 mg per kg body weight/day (maximum of 15 
mg/day). The side effects are dose-dependent and reversible. They 
include typical anticholinergic effects such as flushing, accommo- 
dation problems, tachycardia, hyperactivity, dryness in the mouth, 
residual urine, and constipation. Propiverin has fewer side effects 
but is not available in many countries. The dosage is a maximum of 
0.8 mg per kg body weight/day in two doses (total maximum of 15 
mg/day). Recently, the effects of percutaneous electrostimulation 
[transcutaneous electrical nerve stimulation (TENS)] have been 
very promising. 

Timed voiding is indicated in all cases of voiding postponement. 
Children and parents are instructed to increase the micturition fre- 
quency up to seven times per day in regular intervals and to register 
the course in a chart. Medication is not indicated. 

Treatment of dysfunctional voiding includes increasing mo- 
tivation, cognitive behavioural elements, relaxation and general 
drinking, and toileting advice [28]. The most specific and ef- 
fective treatment is biofeedback—either with uroflowmetry or 
pelvic floor electromyography (EMG). Medication is again not 
indicated [28]. 

In therapy-resistant cases, structured group therapies have been 
successful [29]. These combine counselling, provision of informa- 
tion, relaxation techniques, and cognitive behavioural, play, and 
group therapy approaches. 


Outlook 


The outlook of DUI is quite favourable in children with DUI, pro- 
vided that a specific treatment based on an exact diagnosis is pro- 
vided. In children with combined EDs, the treatment of DUI should 
follow after that of FI and/or constipation—but before that of NE. 
In many children with combined DUI and NE, the NE will resolve 
upon successful management of the daytime problems. 


Nocturnal enuresis 


NE is 2-3 times more common than DUI. As presented earlier, NE 
and DUI are lumped together in the ICD-10 [4] and DSM-5 [5] clas- 
sification systems, although they are completely different disorders. 
Therefore, the ICCS classification is again important for the correct 
diagnosis of NE and its subtypes [7]. NE denotes any wetting during 
sleep (at night or daytime naps). In fact, the term enuresis can be 
used without the addition of the adjective nocturnal, as the term 
of so-called ‘diurnal enuresis’ is discouraged. If a child wets while 
sleeping and while awake, two diagnoses are given: one for the type 
of NE and one for the type of DUI (Table 108.5). 

Many diagnostic criteria are the same in DUI as in NE (Table 
108.2). The classification of NE (or enuresis for short) is simple 
(Table 108.1), based on the longest previous dry period and on con- 
comitant lower urinary tract symptoms (LUTS). Primary enuresis 
means that the child has been dry for <6 months; secondary enuresis 
indicates that a relapse after a dry period of at least 6 months has oc- 
curred. Children with secondary enuresis have experienced stressful 
life events and have higher rates of comorbid psychiatric disorders. 
Otherwise, assessment and treatment of primary and secondary en- 
uresis are exactly the same. 
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Table 108.5 Subtypes of nocturnal enuresis according to ICCS terminology 





BEDS SSS Maximal dry interval of <6 months Maximal dry interval of >6 months 


Primary nocturnal enuresis (PNE) 


Secondary nocturnal enuresis (SNE) 


No signs of bladder Monosymptomatic nocturnal Primary monosymptomatic nocturnal Secondary monosymptomatic nocturnal 
dysfunction’ during daytime enuresis (MNE) enuresis (PMNE) enuresis (SMNE) 

Signs of bladder dysfunction’ Non-monosymptomatic Primary non-monosymptomatic nocturnal Secondary non-monosymptomatic nocturnal 
during daytime present nocturnal enuresis (NMNE) enuresis (PNMNE) enuresis (SNMNE) 


* Daytime incontinence, urgency, frequency, holding manoeuvres, interrupted flow, etc. 
Source: data from Neurourol Urodyn, 35(4), Austin PF, Bauer S, Bower W, et al., TI 
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The second division is far more important—children with NE, but 
without daytime LUTS, have monosymptomatic enuresis. NEand LUTS 
is called non-monosymptomatic enuresis. In non-monosymptomatic 
NE, daytime symptoms have to be treated first before addressing the 
night-time wetting to achieve treatment success [30]. 


Prevalence 


One and a half to two times more boys are affected by NE than girls. 
The prevalence is 10% among 7-year-old children, 1-2% among 
adolescents, and 0.3-1.7% among adults. The mean spontaneous 
remission rate is 15% per year. Primary enuresis is more common 
than secondary enuresis, and monosymptomatic enuresis is twice as 
common as non-monosymptomatic enuresis. 


Signs and symptoms 


Typical symptoms of monosymptomatic NE are deep sleep and dif- 
ficult arousal and increased urine volumes at night (polyuria) with 
large wetted volumes. Bladder function during the day is completely 
normal. 

Children with non-monosymptomatic NE have the same symp- 
toms, but in addition, they have similar signs as those with DUI 
(except for wetting during the day). Also, urinary tract infections, 
constipation, and FI are possible. 

Children with primary and secondary NE have the same symp- 
toms, except for a higher rate of comorbid psychiatric disorders in 
the latter group. 

Overall, 20-30% of children with enuresis, 20-40% of those with 
urinary incontinence, and 30-50% of those with encopresis have 
clinically relevant disorders [3]. 

Externalizing disorders predominate, but internalizing disturb- 
ances can occur as well. In a British population-based study of 
8242 children at the age of 7.5 years, children with enuresis were 
affected by separation anxiety (8.0%), social anxiety (70%), specific 
phobia (14.1%), generalized anxiety (10.5%), depression (14.2%), 
ODD (8.8%), conduct disorders (8.5%), and ADHD (17.6%) 
[31]. ADHD is the most common comorbid disorder in enuresis. 
Enuresis in combination with ADHD is more difficult to treat due 
to non-compliance. 


Aetiology 


Enuresis is a genetically determined maturational disorder of the 
central nervous system [32]. Seventy to 80% of all children with 
enuresis have affected relatives. The concordance rates are higher 
among mono- than dizygotic twins. In linkage studies, several loci 
on chromosomes 12, 13, and 22 could be identified—irrespective of 
the subtypes of enuresis [32]. 


Three main mechanisms are responsible for the development of 
enuresis. Firstly, increased urine volumes (polyuria) affects some, 
but not all, children. This is associated with a circadian variation 
(but not lack) of antidiuretic hormone (ADH). Secondly, impaired 
arousal is another important factor, as children do not respond to 
signals of their full bladder to wake up. Thirdly, children have an 
inhibition deficit of the brainstem and do not adequately suppress 
the emptying reflexes of the bladder while sleeping. The same fac- 
tors are responsible for the development of non-monosymptomatic 
enuresis—in addition to local bladder dysfunction. Psychosocial fac- 
tors can modulate these genetic and neurobiological risks and lead 
to relapses or a continuation of NE. 


Assessment and treatment 


Assessment of NE is the same as that of DUI. Also, basic urotherapy, 
provision of information, and counselling are the first steps, com- 
bined with recording of dry and wet nights on a calendar over a 
baseline period of 4 weeks. Children are asked to draw a symbol for 
wet and dry nights (clouds and suns, stars, etc.) on a chart. Any day- 
time symptoms in non-monosymptomatic NE should be treated first 
[30]. These simple measures will achieve dryness in 15-20% without 
any further intervention. 

If children continue to wet, the first-line treatment is alarm 
therapy, which requires active co-operation of the child and family 
[33]. If alarm treatment is not feasible, then it is advisable to start 
with desmopressin medication first. 

Approximately 70% achieve dryness, with long-term cure rates of 
50%. Two different types of alarms exist, the body-worn and bedside 
alarms, and both are equally effective. In case of wetting, the alarm is 
triggered and the child should wake up completely, go to the toilet, and 
void any remaining urine. Parents are asked to document the course of 
the treatment in a chart. To be successful, the alarm must be used con- 
sequently every night for a maximum of 16 weeks. After 14 consecu- 
tive dry nights, the alarm is discontinued and the child is considered 
to be dry. Parents are advised to restart alarm treatment if a relapse 
occurs. If not sufficient, alarm treatment can be combined with add- 
itional CBT components. If a child does not respond to the alarm, a 
switch to desmopressin, the second-line treatment, is recommended. 

Indications for medication are therapy resistance towards alarm 
treatment, lack of motivation in the children, stress in the family, 
lack of co-operation, and the requirement of short-term dryness, 
that is, for school outings, etc. Desmopressin is the medication of 
choice in these cases due to positive effects and few side effects. 
Desmopressin (or DDAVP) is a synthetic analogue of ADH. In 70%, 
a reduction of wet nights or even dryness can be achieved, but after 
discontinuing medication, most children have a relapse, so that only 
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18-38% of children remain dry in the long run. It has a distinctly 
lower curative effect than the alarm. 

Desmopressin is given 30-60 minutes before going to sleep in the 
evenings only. The oral dosage is 0.2-0.4 mg—or 120-240 ug as a melt 
tablet. It is advisable to titrate the required dose over 4 weeks, starting 
with the low dose first and increasing the dose only if necessary. This 
should be documented on a chart. If a child does not respond within 
4 weeks, he or she should be considered to be a non-responder and 
desmopressin is discontinued. Otherwise, one can continue with the 
required dosage for a maximum of 12 weeks, after which desmopressin 
should be stopped—to check if the child can remain dry without 
medication. Ifa relapse occurs, desmopressin can be given in 3-month 
blocks. Adverse effects are rare and not pronounced, like headache, 
stomach ache, lack of appetite, etc. The most dramatic, though rare, 
side effect is hyponatraemia and water intoxication, which can require 
intensive care. Therefore, it is important not to overdose—and not to 
drink <8 oz (250 mL) after taking the medication. 

Tricyclic drugs like imipramine have a proven antienuretic 
effect—and similar relapse rates as desmopressin. Due to the risk of 
cardiac arrhythmias and intoxication in high doses, it has become 
a third-line treatment in therapy-resistant cases. Also, imipramine 
needs close clinical and laboratory surveillance. 


Outlook 


NE has the best treatment results of all EDs, again provided that an 
exact diagnosis is made and the recommended guidelines are fol- 
lowed. In children with combined EDs, NE is the last condition in 
the treatment sequence. 


The aim of this chapter was to provide an overview on EDs in chil- 
dren and adolescents. The clinical aspects of the three major groups 
of FI, DUI, and NE were presented. These are heterogenous disorders 
with divergent aetiologies, symptoms, and treatment approaches. 
Most EDs are functional, that is, non-organic. The guidelines for as- 
sessment and treatment are logical, and if followed, the outcome is 
quite promising. The treatment is symptom-oriented, with the goal 
of complete continence, and comorbid psychiatric disorders, which 
are common but do not affect most children with EDs, need to be 
addressed. A multi-disciplinary approach can be very helpful. 
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i 
Circadian rhythms are endogenous 24-hour oscillations in physi- 
ology and behaviour that enable an organism to anticipate and 
adapt to the changing temporal demands of the environment [1]. 
The term circadian is derived from the Latin for ‘around a day’. The 
sleep/wake cycle is the most obvious example of a circadian rhythm. 
Other examples include daily fluctuations in locomotor activity, core 
body temperature, heart rate, blood pressure, renal activity, liver me- 
tabolism, and secretion of glucocorticoids, melatonin, and testos- 
terone [2, 3]. Circadian rhythms in physiological parameters are not 
unique to man; they are observed in organisms as diverse as cyano- 
bacteria, fungi, algae, plants, Drosophila, and rodents [4]. Circadian 
rhythms confer a survival advantage to organisms that possess them, 
increasing their reproductive fitness. For example, ground squirrels 
lacking endogenous rhythms are more vulnerable to predation and 
show a 20% reduction in lifespan [5]. 

Any circadian rhythm must meet three criteria [1]. Firstly, it must 
persist under constant conditions (that is, in the absence of changes 
in temperature or light), with an endogenous (or ‘free-running’) 
period of approximately 24 hours. This distinguishes internally 
driven rhythms from responses to external cues. Secondly, it must 
be entrainable. To anticipate daily environmental events (for ex- 
ample, sunrise), the rhythm must be able to adjust to changes in the 
timing of these events. For example, the timing of sunrise varies be- 
tween geographical locations and also changes across the seasons. 
This process of synchronization is known as entrainment, and an 
external stimulus that can synchronize a circadian rhythm is called a 
zeitgeber, German for ‘time-giver. Finally, a circadian rhythm must 
be temperature-compensated, meaning that its periodicity should 
remain constant over a range of physiological temperatures. 















































Mammals possess a circadian ‘master clock’ which resides in the 
suprachiasmatic nucleus (SCN), a small brain structure com- 
posed of approximately 20,000 neurons, located in the anterior 





hypothalamus, just above the optic chiasm [6]. The role of the SCN 
was first shown by rodent lesion studies; in rats, circadian rhythms 
in water consumption, locomotor activity, and corticosterone secre- 
tion were all abolished by SCN ablation [7, 8]. In hamsters, trans- 
plantation of neonatal hypothalamic tissue (including the SCN) can 
restore circadian rhythms following SCN ablation [9]. Moreover, 
transplanting SCN tissue from tau mutant hamsters, which have a 
20-hour circadian period, imposes a 20-hour period on the recipient 
[10]. In humans, compression of the SCN by expanding pituitary tu- 
mours causes progressive circadian disintegration [11]. In the 1970s 
and 1980s, in vivo oscillations in glucose uptake and electrical ac- 
tivity were first reported in the rat SCN; both were upregulated in 
the day [12, 13]. Analogous results were subsequently reported with 
in vitro slice preparations [14]. Rhythmic firing was later observed 
in isolated cultured neurons, demonstrating that circadian oscil- 
lations result from subcellular mechanisms, rather than cell-cell 
interactions [6]. 


The molecular circadian clock 


Today, the molecular basis of mammalian circadian rhythms is 
well characterized. The basic mechanism is an autoregulatory 
transcriptional-translational feedback loop (TTFL), involving four 
core ‘clock genes’: Period (Perl and Per2), Cryptochrome (Cry1 
and Cry2), Brain and Muscle ARNT-Like (Bmal1), and Circadian 
Locomotor Output Cycles Kaput (Clock) [15]. These genes encode 
the proteins PER1-2, CRY1-2, BMAL1, and CLOCK, respectively 
[15]. The basic clock mechanism involves the binding of CLOCK 
and BMALI to E-box enhancers in the promoters of Per1-2 and 
Cry1-2, driving their expression. In turn, PER and CRY form homo- 
and heterodimers and inhibit CLOCK/BMAL]I binding, providing a 
negative feedback loop. In addition, several other genes are known to 
regulate the mammalian TTFL. For example, casein kinase 1 (Ck1) 
plays an important role in setting the core clock speed by phosphor- 
ylating PER proteins, which targets them for ubiquitylation and 
proteasomal degradation [16]. 

The TTFL in the SCN is synchronized with the environment by 
light input from the retina. Light information is transmitted from 
the eye to the SCN via photosensitive retinal ganglion cells (pRGCs). 
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Although they also receive input from rod and cone photoreceptors, 
pRGCs are intrinsically photosensitive due to their expression of 
the photopigment melanopsin. Ablation of pRGCs impairs photic 
entrainment in mice but spares pattern-forming visual function 
[17]. The axons of pRGCs, which form the retinohypothalamic tract 
(RHT), synapse directly with the SCN [18, 19]. Glutamate and pi- 
tuitary adenylyl cyclase-activating polypeptide (PACAP) are the 
principal neurotransmitters at these synapses [20]. Although light 
information is the primary zeitgeber for the entrainment of the cir- 
cadian clock, the timing of other events such as feeding and exercise 
can also be considered zeitgebers, which affect the SCN via non- 
photic pathways [1]. 


Clock outputs 


How does the action of the TTFL translate into circadian rhythms 
in physiology and behaviour? The immediate output of the TTFL 
is the rhythmic expression of ‘clock-controlled genes’ within the 
SCN itself [11, 21]. These genes are not part of the TTFL but show 
rhythmic expression because they possess E-boxes. In turn, these 
clock-controlled genes regulate the expression of their own down- 
stream target genes. The net result is that roughly 10% of the SCN 
transcriptome is rhythmically expressed. It is unclear how rhythmic 
patterns of electrical activity are generated in the SCN [20], but these 
oscillations are conveyed to other brain regions by monosynaptic 
or multisynaptic pathways, or via the rhythmic synthesis and secre- 
tion of peptide hormones such as vasoactive intestinal peptide (VIP) 
[11, 21]. The SCN’s target regions are ultimately responsible for the 
co-ordination of circadian physiology and behaviour. Examples in- 
clude the rhythmic secretion of the hormone melatonin from the 
pineal gland and the rhythmic secretion of corticotropin-releasing 
hormone (CRH) from the hypothalamus, which leads to the release 
of glucocorticoids from the adrenal glands [11, 21]. 

Multiple body tissues show self-sustained rhythmic clock gene 
expression in vitro [21, 22]. Initially observed in rodent fibroblasts 
[23], these peripheral rhythms are particularly well characterized in 
the liver [24]. As in the SCN, around 10% of the transcriptome is 
rhythmically expressed in most peripheral tissues [25]. In the brain, 
rhythmic electrical activity, hormone output, and clock gene ex- 
pression has been reported in the olfactory bulb, amygdala, lateral 
habenula, hippocampus, motor cortex, and cerebellum, together 
with a variety of nuclei in the hypothalamus [21, 22]. Taken together, 
it appears that the brain and body are full of localized autonomous 
circadian clocks. The SCN synchronizes these peripheral clocks with 
each other and the external environment [11, 21]. Glucocorticoids 
may be one synchronizing signal; the administration of the gluco- 
corticoid dexamethasone is known to synchronize 60% of the liver’s 
circadian transcriptome [11, 21]. 


Sleep is a recurring physiological state characterized by altered con- 
sciousness, reduced perception of environmental stimuli, and inhib- 
ition of most voluntary muscles [26]. Although we spend around 
a third of our lives asleep, the exact functions of sleep are not well 
understood. Popular theories include rejuvenation of the immune 
system [27] and repair and regeneration of nervous tissue, bone, and 
muscle [26]. From a cognitive perspective, sleep may be important 


for memory consolidation [28], while according to the synaptic 
homeostasis hypothesis, sleep is required to reverse the net increase 
in synaptic strength which occurs in many brain circuits during 
wakefulness [29]. Whatever the functional roles of sleep, its import- 
ance is underscored by the myriad health problems that are associ- 
ated with sleep disruption. 

In humans, sleep consists of both rapid eye movement (REM) 
sleep and non-rapid eye movement (NREM) sleep, which has three 
stages [30-32]. During sleep, the body repeatedly cycles through 
these stages in a predictable and sequential manner, progressing 
through NREM stages 1, 2, and 3 and ending with REM sleep. 
Typically, the first cycle lasts 70-100 minutes, and subsequent cycles 
last 90-120 minutes. Four to five cycles are completed each night. 
REM sleep accounts for 20-25% of total sleep time, and the propor- 
tion of REM sleep increases across cycles. Conversely, the propor- 
tion of stage 3 NREM sleep (also known as slow-wave sleep) tends to 
decrease across cycles. Arousal threshold increases with each NREM 
stage, while muscle activity and eye movements decrease [30-32]. 
The three NREM stages are characterized by oscillatory brain ac- 
tivity of different frequencies. Sleep spindles and K-complexes occur 
at the onset of stage 2 NREM sleep. During REM sleep, there is vig- 
orous eye movement, but most voluntary muscles are paralysed. 
Brain activity in REM sleep closely resembles that observed during 
wakefulness. REM sleep is also associated with dreaming, an associ- 
ation first noted in the early 1950s [33]. 


Mechanistic basis of sleep and wakefulness 


Wakefulness is a state of alertness characterized by high forebrain 
and cortical activity, which is promoted by the ascending arousal 
system (AAS) [34]. The AAS originates in the upper brainstem [35] 
and can be broadly divided into two separate pathways, each innerv- 
ating distinct brain regions and involving different neurotransmit- 
ters [36]. The first pathway consists of cholinergic projections from 
the pedunculopontine (PPT) and laterodorsal tegmental (LDT) 
nuclei of the brainstem, which are more active during wakeful- 
ness and REM sleep and less active during NREM sleep [37]. One 
of their key targets is the thalamic reticular nucleus (TRN), which 
inhibits thalamocortical rhythms to promote wakefulness [38]. The 
second pathway innervates multiple targets in the hypothalamus, 
forebrain, and cerebral cortex [36]. It consists of noradrenergic pro- 
jections from the locus caeruleus (LC), serotonergic projections 
from the dorsal and medial raphe nuclei (DRN and MRN), dopa- 
minergic projections from the ventral periaqueductal grey (VPAG), 
and histaminergic projections from the tuberomammillary nucleus 
(TMN) [34]. It also includes cholinergic and GABAergic projections 
from basal forebrain (BF) nuclei and peptidergic projections from 
lateral hypothalamic (LH) nuclei [34]. These LH neurons can be 
subdivided into those expressing orexin (also known as hypocretin) 
and those expressing melatonin-concentrating hormone (MCH). 
Monoaminergic and orexinergic neurons are most active during 
wakefulness, less active during NREM sleep, and least active during 
REM sleep [35]. In contrast, the MCH-releasing neurons are most 
active during REM sleep [39], and the cholinergic neurons are most 
active during REM sleep and wakefulness [40]. 

The AAS is inhibited by the ventrolateral preoptic nucleus, or 
VLPO, a small cluster of neurons in the anterior hypothalamus [34]. 
During sleep, a subpopulation of VLPO neurons is highly active 
[41] and suppresses the AAS through the release of the inhibitory 
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neurotransmitters GABA and galanin [42, 43]. Central VLPO le- 
sions are known to reduce NREM sleep, while lesions to the extended 
VLPO decrease REM sleep [44]. In addition to the suppression of 
the AAS by the VLPO, neurons in the VLPO can be inhibited by 
the AAS. This inhibition is driven by noradrenergic projections to 
the VLPO from the LC, serotonergic projections from the MRN, 
and GABAergic and galinergic projections from the TMN [45]. As 
such, there is reciprocal inhibition between the AAS and the VLPO. 
The AAS inhibits the VLPO during wake, and the VLPO inhibits the 
AAS during sleep. It has been proposed that this reciprocal inhib- 
ition creates a bi-stable ‘flip-flop’ circuit—whereby sleep and wake 
cannot be promoted at the same time. This putative mechanism, also 
known as the ‘sleep switch, may explain why sleep-wake transitions 
are swift [36, 37]. 


Sleep timing: the two-process model of sleep 


Sleep timing is thought to be dependent on two hypothetical 
mechanisms [46, 47]. The first, known as ‘Process S, describes a 
homeostatic drive for sleep, which increases progressively during 
wakefulness and dissipates during sleep. The biological substrate of 
this mechanism is unproven, although adenosine is a likely candi- 
date. Adenosine accumulates in the brain during wakefulness [48], 
while the administration of adenosine or adenosine receptor agon- 
ists promotes sleep in rats and cats [49, 50]. These pharmacological 
effects are likely mediated by the excitation of sleep-promoting brain 
regions and the inhibition of wake-promoting regions [34]. Recent 
evidence suggests that adenosine accumulation is also responsible 
for the antidepressant effect of sleep deprivation [51]. 

The second component of the two-process model—‘Process C’— 
describes the contribution of the circadian clock to sleep regulation 
[46, 47]. More specifically, it describes a circadian drive for wake- 
fulness, which rises during the day and falls during the evening and 
night. According to the two-process model, Process C counteracts 
Process S to keep us awake during daytime when homeostatic sleep 
pressure is steadily increasing. In the evening, sleep onset coincides 
with the opening of the ‘sleep gate; the point at which the discrep- 
ancy between Processes S and C reaches a critical threshold. SCN le- 
sions in primates result in increased sleep time, as well as arrhythmia 
[52]. This suggests that under normal conditions, signals from the 
SCN oppose the homeostatic drive for sleep, providing a plausible 
biological substrate for Process C. These signals may be carried by 
projections from the SCN to the ventral subparaventricular zone 
(SPZ) and the dorsomedial nucleus of the hypothalamus (DMH). 
In rats, lesions of both structures impair the circadian rhythmicity 
of the sleep/wake cycle [44, 53]. The DMH may be a critical relay be- 
tween Processes C and S, as it receives inputs from the SCN and SPZ 
and innervates both the VLPO and the LH nuclei of the AAS [53]. 


Sleep and circadian rhythm disruption 


in psychiatric disorders 





The relationship between severe mental illness and abnormal sleep 
was first described in the late nineteenth century by the German 
psychiatrist Emil Kraepelin [54]. Today, sleep and circadian rhythm 
disruption (SCRD) is reported in 30-80% of patients with schizo- 
phrenia and is increasingly recognized as one of the most common 
features of the disorder [55]. Moreover, recent evidence suggests that 


the degree of SCRD is correlated with the severity of positive, nega- 
tive, and cognitive symptoms [56-59]. Sleep disturbances in schizo- 
phrenia include increases in sleep latency and reductions in total 
sleep time, sleep efficiency, REM sleep latency, REM sleep density, 
and slow-wave sleep duration [54, 55, 60-62]. Schizophrenia is also 
associated with significant circadian disruption such as abnormal 
phasing, instability, and fragmentation of rest-activity rhythms 
[63-66]. Crucially, schizophrenia patients with SCRD score badly 
on many quality-of-life clinical subscales, highlighting the human 
cost of SCRD in schizophrenia [55, 67, 68]. Furthermore, patients 
often comment that an improvement in sleep is one of their highest 
therapeutic priorities [69]. 

SCRD is not limited to schizophrenia. Up to 90% of patients 
suffering from an acute depressive episode report simultaneous 
changes in their sleep profile that are usually described as difficul- 
ties initiating and maintaining sleep during the night. Significantly, 
persistent insomnia increases the risk of relapse into a new de- 
pressive episode [70]. Increased sleep disruption, as in mothers 
after childbirth, raises the risk of post-partum depression, with 
poorer sleep quality correlated with more severe depression [71]. 
Seasonal affective disorder (SAD), or ‘winter depression, is clin- 
ically classified as a subtype of major depression that describes a 
condition of serious mood alteration associated with a change in 
the season towards autumn and winter. Individuals tend to show 
excessive daytime sleepiness, have little energy, and crave sweets 
and carbohydrates. SAD individuals may also feel profoundly de- 
pressed. While there is a trend suggesting an overall increase in the 
prevalence of SAD with increasing latitude and with the shortened 
daily light period during the winter season, there is considerable 
country-to-country variation [72, 73]. These inconsistencies in the 
prevalence of SAD might be because this condition is a more severe 
expression of the naturally occurring seasonal variation in mood 
experienced by a large percentage of the general population. The 
best evidence that there is a recurrent, seasonally influenced winter 
depressive condition that is a function of an interaction between 
light and internal circadian and/or circannual rhythms is the very 
high success rate of phototherapy [74]. In individuals with a diag- 
nosis of bipolar disorder, disruption of their 24-hour sleep/wake 
cycle, shortened sleep, or travel across multiple time zones seems 
to act as an important trigger for a relapse (77% of patients) into 
mania [75]. By contrast, during the depressed phase, patients may 
become hypersomnolent, sleeping up to 12-14 hours per day [76]. 
Sleep in alcoholics is disturbed both on drinking nights and on 
nights during withdrawal. During periods of heavy drinking and 
up to 2 years after discontinuation of alcohol intake, sleep consists 
mainly of reduced NREM states, while REM sleep is suppressed and 
sleep duration is short, with sleep of the second half of a normal 
8-hour bedtime being fragmented [77]. Rates of insomnia in this 
population are reported to be between 40% and 70%. Interestingly, 
those alcohol users who have a good prognosis for recovery are 
those individuals who tend to return to normal ratios of NREM/ 
REM sleep and who overall sleep well [77]. Excessive alcohol con- 
sumption can cause depressive-like states—indeed, ethanol is clas- 
sified as a depressant drug, mainly through its sedative effects via 
GABA facilitation and inhibition of glutamatergic NMDA function 
[77]. Finally, the relationship between disrupted sleep and various 
anxiety-related disorders is well recognized, but the cause and ef- 
fect can be difficult to untangle. For example, insomnia predisposes 
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individuals to anxiety which, in turn, precipitates sleep disruption 
that then increases the likelihood of panic. Conversely, studies have 
shown that sleep problems may actually precede conditions such as 
anxiety and depression [71, 78]. 

It is perhaps significant that behavioural abnormalities similar 
to those seen in psychiatric disorders can be induced in healthy 
individuals by sleep deprivation [79-84] or transitory circadian 
desynchronization [85]. For example, SCRD may exacerbate or 
even contribute to the cognitive impairments of schizophrenia 
patients. Consistent with this, associations between cognitive per- 
formance and specific sleep or circadian parameters are present in 
medication-naive [86], medicated [57, 87-92], and unmedicated 
schizophrenia patients [93]. For example, the severity of patients’ 
cognitive symptoms is inversely related to slow-wave sleep duration 
and REM sleep density [93], and the amplitude of rest-activity 
rhythms is strongly associated with performance in cognitive tasks 
that assess distractibility, reaction time, and verbal fluency [57]. 
A correlation is also present between attentional set-shifting per- 
formance and REM sleep latency [94]. Similar associations are evi- 
dent in healthy individuals; performance in the go/no-go task (a 
test of response inhibition) is associated with multiple slow-wave 
sleep parameters [95]. 

Sleep makes a crucial contribution to memory consolidation [28]. 
Reduced overnight consolidation of procedural learning has been 
demonstrated in schizophrenia patients [96], and these deficits were 
accompanied by reduced slow-wave sleep duration [89]. Another 
study with schizophrenia patients showed an association between 
sleep spindle activity during stage 2 NREM sleep and overnight im- 
provements in a finger-tapping, sequence-learning task [90, 91]. 
Sleep spindle activity is significantly reduced in schizophrenia 
patients [97]. 

In schizophrenia, where again we have the most detailed studies, 
positive symptoms appear to be aggravated or even triggered by 
SCRD. In healthy human subjects, sleep deprivation results in 
increased perceptual distortions [82, 98], greater paranoia [99], 
and reduced prepulse inhibition [82]. Clinically, sleep disturb- 
ance is a significant risk factor for the first episode of psychosis 
[100], as well as for relapse in recovering patients [101]. Severe 
sleep disturbance often precedes the onset of psychotic episodes 
in schizophrenia patients [59, 102, 103], while it is also predictive 
of the maintenance of these symptoms [103]. In addition, the se- 
verity of sleep disruption is positively correlated with the severity 
of psychotic symptoms [102]. Finally, the positive symptoms of 
schizophrenia are associated with high-frequency EEG episodes in 
sleep [104]. It is also interesting to note that circadian desynchron- 
ization increases negative mood, irritability, and affective volatility 
in healthy volunteers [85, 105]. A total of 24 hours of sleep de- 
privation in healthy human subjects also leads to anhedonia [82], 
one of the core negative symptoms of schizophrenia. Consistent 
with this, improvements in sleep quality are frequently correlated 
with amelioration of negative symptoms in schizophrenia patients 
[68, 106]. Moreover, the negative symptoms of schizophrenia are 
associated with reduced REM sleep latency and various slow-wave 
sleep deficits [94, 107-109]. Likewise, in adolescents at ultra-high 
risk for psychosis, sleep disruption is associated with greater nega- 
tive symptom severity [110]. Collectively, such associations hint at 
causal mechanisms, and therefore possible new therapeutic targets 
for improved health in psychiatric disorders. 


Shared neuropathophysiology 


What therefore could be the cause of SCRD in psychiatric disorders? 
One possibility is a shared dysfunction in common brain mech- 
anisms (for example, specific enzymes, neuroreceptors, or neuro- 
transmitter pathways) that contribute to the observed abnormalities 
[111]. According to this hypothesis, SCRD is not a consequence of 
the psychiatric disorder but is a core symptom intrinsically rooted 
in its underlying neuropathophysiology. This is plausible, given 
that psychiatric disorder implicates distributed brain circuits, af- 
fecting multiple neurotransmitter systems [112], many of which 
overlap with those involved in sleep regulation [113]. Dopamine, 
glutamate, and serotonin are all implicated in psychiatric disorder, 
while all three neurotransmitters are involved in sleep and circa- 
dian function. Moreover, multiple candidate genes for psychiatric 
disorder have a credible biological connection with sleep and cir- 
cadian function within the brain. If these genes are the substrates of 
the comorbidity, their manipulation should result in both SCRD and 
disorder-relevant behavioural abnormalities (for example, impaired 
memory or reduced prepulse inhibition). Although this prediction 
is very difficult to address in humans, it can be tested using animal 
models. Indeed, there is a new, but rapidly growing, literature which 
speaks to this question [111]. 


Studies in rodents 


Just as sleep deprivation can trigger psychiatric symptoms in healthy 
humans, the same is true in wildtype rodents. NMDA receptor 
(NMDAR) antagonist-induced hyperlocomotion, a schizophrenia- 
relevant behaviour, is heightened by sleep deprivation in male 
Wistar rats [114]. Sleep deprivation is also known to cause cogni- 
tive deficits; it impairs object recognition memory [115, 116], spatial 
recognition memory [117, 118], long-term spatial memory in the 
Morris water maze [119-123] and radial arm maze [124], and atten- 
tional performance in the 5-choice serial reaction time task [125]. 
These findings appear to reflect genuine memory deficits, as partial 
sleep deprivation (by gentle handling) has little effect on exploratory 
locomotor activity or on indices of stress such as plasma cortico- 
sterone or adrenocorticotrophic hormone levels [116]. In rats, sleep 
deprivation impairs prepulse inhibition [126-128], which may be 
relevant to the cognitive symptoms of psychiatric disorder. 
Disrupted circadian rhythms and complete arrhythmia can be in- 
duced in healthy hamsters by a single light treatment—a combin- 
ation of a nocturnal light pulse and a phase delay in the light/dark 
(LD) cycle. Crucially, this also impairs object recognition memory 
performance [129, 130]. In mice, circadian rhythms can be dis- 
rupted with T7 cycles, ultradian LD cycles consisting of a 3.5-hour 
light phase followed by a 3.5-hour dark phase [131]. This leads to 
lengthening of the body temperature and rest-activity rhythms, 
but no change in total sleep time, REM sleep time, or the rhythmic 
expression of Per2 in the SCN [132]. T7 cycles impair object rec- 
ognition memory performance, although it is unclear whether this 
impairment reflects a genuine memory deficit or a reduction in ob- 
ject exploration, as locomotor activity data were not reported [131]. 


Clock gene transgenic mice 


Genetic manipulation of core clock genes can induce SCRD. 
Under 12:12 LD, circadian rhythmicity is unaffected in Cry single- 
knockout (Cryl and Cry2”) and Cry double-knockout (Cry1/ 
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2") mice [133, 134]. However, Cry1™ mice have a shortened en- 
dogenous period and Cry2“” have a lengthened endogenous period, 
while Cry1/2 mice become completely arrhythmic in constant 
darkness [133, 134]. Significantly, object recognition memory per- 
formance is impaired in Cry1”, Cry2^, and Cry1/2 mice. These 
deficits cannot be explained in terms of altered activity, since total 
object exploration does not differ between knockouts and wildtypes. 
They may be related to altered anxiety, however, as all three models 
exhibit heightened anxiety in the elevated plus maze [135]. 


Moving from correlation to causality and new 


treatment targets 





SCRD is rarely targeted for treatment in psychiatric patients, but 
when it is, individuals report improvements in both sleep quality and 
psychiatric symptoms. Treating insomnia with the GABA, receptor 
agonist sodium oxybate leads to an improvement in both the sleep 
quality and the negative symptoms of schizophrenia patients [136]. 
In another study, insomnia was treated with cognitive behavioural 
therapy (CBT) in 15 patients with persistent persecutory delusions 
and schizophrenia [58]. At least two-thirds of participants showed a 
substantial (>25%) improvement in insomnia, while approximately 
half showed a substantial (>25%) reduction in persecutory delusions. 
There were also reductions in hallucinations, anxiety, and depres- 
sion. These studies represent an important advance, as they are the 
first to demonstrate that the targeted treatment of sleep disruption 
in schizophrenia patients can yield improvements in their ‘classical’ 
behavioural symptoms. This suggests that SCRD can make a causal 
contribution to the behavioural symptoms of psychiatric patients. 
However, both studies should be interpreted with caution, due to a 
number of methodological limitations; the sample sizes were small, 
the studies did not include control groups, and patients received 
medication throughout the studies. However, such issues were ad- 
dressed in a very recent study where 3755 university students across 
the UK were randomized into two groups. One group received online 
CBT for insomnia; the other group did not but had access to standard 
treatments. The aims of the study were to explore whether sleep dis- 
ruption is a driving factor in the occurrence of paranoia, hallucin- 
atory experiences, and other mental health problems in young adults 
(average age of 25 years). Individuals who received the CBT inter- 
ventions showed large reductions in insomnia, as well as significant 
reductions in paranoia and hallucinatory experiences. The treatment 
also led to improvements in depression, anxiety, nightmares, psycho- 
logical well-being, and daytime work and home functioning [137]. 
These studies support the proposal that treating disrupted sleep 
could provide a key route for improving mental health. 

SCRD in psychiatric illness such as schizophrenia has often 
been attributed to the side effects of medication. Although intui- 
tively plausible, drug effects cannot be the only cause of SCRD in 
schizophrenia, since SCRD affects both medication-ndaive [70] and 
medicated patients [71]. Indeed, SCRD often precedes the diag- 
nosis of schizophrenia, prior to any medication [70, 72]. Recent 
and potentially exciting results have shown that dopamine antag- 
onist drugs might actually improve sleep quality in schizophrenia 
[55, 71, 73], possibly due to their strong sedative effects [74]. So- 
called typical antipsychotics improve sleep efficiency and total sleep 
time [55], while atypical drugs for psychosis are even more effective 


at improving sleep quality in schizophrenia patients [75] (also see 
Chapter 64). It is also possible that antipsychotic medication might 
also act directly upon clock gene expression. Such actions are cur- 
rently being explored by researchers at the University of Oxford. 


More evidence from rodent models 


SCRD can also be targeted in rodent models that demonstrate sim- 
ultaneous SCRD and psychiatric-relevant behavioural abnormalities. 
If SCRD makes a causal contribution to the behavioural symptoms 
of psychiatric disorders, then such interventions should ameliorate 
their behavioural impairments. Proof-of-principle for this approach 
is provided by the targeted ‘treatment’ of SCRD in the R6/2 trans- 
genic mouse model of Huntingtons disease, which displays a pro- 
gressive disintegration of sleep and circadian rhythmicity across its 
lifespan [138]. Interventions employed include bright light therapy 
[139], temporally scheduled feeding [140, 141], temporally sched- 
uled exercise [139], and pharmacological imposition of a daily sleep/ 
wake cycle with the sedative drug alprazolam [142, 143] and/or the 
vigilance-promoting drug modafinil [143]. Bright light therapy and 
scheduled exercise delayed the disintegration of rest-activity rhythms 
[139], while scheduled feeding also restored behavioural rhythmi- 
city [140, 141] but had no impact on cognitive performance [141]. 
Strikingly, however, alprazolam administration improved behavioural 
rhythmicity and slowed cognitive decline in the R6/2 model [140, 141]. 
This suggests that SCRD causally contributes to cognitive dysfunc- 
tion in Huntington's disease. The same experimental approach could 
easily be applied to a psychiatric-relevant mouse model, or indeed any 
neuropsychiatrically relevant animal model. Temporally scheduled 
exercise has already been shown to improve behavioural rhythmicity 
in the schizophrenia-relevant Vipr2— mouse model [144], although 
cognitive performance was not assessed. 


Treatments on the horizon? 


Despite our growing knowledge of the molecular mechanisms 
underlying the 24-hour circadian clock and its role in the develop- 
ment of psychiatric disease, there are very limited therapeutic op- 
tions that have been developed to act on the clock directly to reduce 
SCRD. As light is the primary entraining agent for the SCN, bright 
light therapies and CBTs that strengthen natural zeitgebers such as 
scheduled outdoor exercise [145, 146] have been shown to have 
some success. However, potent pharmacological interventions are 
still lacking. Melatonin has long been characterized as an output of 
the circadian clock and can be used to modify the phase of the clock, 
presumably by acting via melatonin receptors that are expressed 
within the neurons of the SCN and widely across the multiple other 
cell populations throughout the body. Melatonin has therefore been 
studied as a possible chronotherapeutic drug and shows promise in 
certain circadian-related conditions [147, 148]. Prolonged-release 
melatonin (tradename Circadin ) is used to treat primary insomnia 
[149] in the aged, and the agonist agomelatine in the treatment of 
major depressive disorder [150]. Most recently, tasimelteon was ap- 
proved in the United States for the orphan circadian disorder non- 
24-hour sleep-wake disorder in the totally blind [151]. Targeting 
the melatonin system, however, has limited efficacy; for example, 
tasimelteon showed a beneficial effect on stabilizing sleep-wake 
in 20% of the patient population after 1 month of treatment [151]. 
As a consequence, recent efforts have focused on developing alter- 
natives, mainly targeting the core clock. Solt et al. reported a novel 
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REV-ERBa receptor agonist was effective at regulating both sleep as 
well as metabolism in mice [152, 153], and Hirota et al. have devel- 
oped a small-molecule cryptochrome activator [154]. An alternative 
strategy that is yet to be employed is the development of molecules 
that act on the light input pathway to the clock, providing a pharma- 
cological replacement for light for the treatment of SCRD. Such an 
approach could yield an exciting alternative. 


Future perspectives 


In conclusion, there are clear links between SCRD and the behav- 
ioural symptoms of psychiatric illness, although the mechanistic 
bases of these associations remain to be clearly defined. Rather than 
considering SCRD to be a by-product of medication or social iso- 
lation in psychiatric disease, it might be better conceptualized as 
a central feature of the condition. While SCRD is unlikely to be a 
primary cause of psychiatric disease, the treatment of sleep disturb- 
ances can yield improvements in the disorder’s core symptoms and 
has the potential to significantly improve the quality of life of pa- 
tients and their carers. 
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In 400 Bc, Hippocrates wrote ‘sleep and watchfulness, both of them 
when immoderate, constitute disease. Unfortunately modern doc- 
tors in training receive little or no training in sleep medicine [1]. 
Disrupted sleep has immediate and long-term consequences on 
physical and mental health. Sleep fulfils many functions, but ar- 
guably the most important is regulation of normal affect and cog- 
nition, including a vital role in memory consolidation and new 
learning [2-4]. 

As explained at the molecular level in Chapter 109 and the systems 
level in Chapter 111, sleep is controlled by two neuronal circuits. The 
homeostat drives an increasing pressure to sleep after every hour 
awake and the circadian rhythm drives alertness in the day and sleep 
at night, with light intensity as the strongest external timekeeper 
[5]. Both total sleep time and the circadian rhythm change over the 
course of our lives [6]. As teenagers and young adults, typical sleep 
need is 8-9 hours for the majority, with a tendency for a delay in 
the sleep phase such that many comfortably fall asleep after, rather 
than before, midnight [7]. There is a tendency to phase-advance 
(that is, fall asleep earlier) with ageing. Adults fall asleep by 30 min- 
utes earlier every decade from the third decade onwards [8]. There 
is greater sleep fragmentation and increased time to fall asleep in 
healthy older adults with and without sleep complaints [9]. At least 
30% of adults in the United States regularly sleep <6 hours a night 
[10]. Over 7 hours of sleep at night is the adult norm, and over 9 
hours is necessary for many teenagers, as well as those repaying sleep 
debt [11]. 

The physiological definition of sleep shows us to be either awake 
or alternating between non-rapid eye movement (NREM) or rapid 
eye movement (REM) sleep at approximately 90- to 120-minute 
cycles (Fig. 110.1). 








Sleep disorders fall into one or more of four categories: hypersomnia, 
insomnia, circadian rhythm disorder, and parasomnia. Common 





night-time disorders also include restless legs syndrome and asso- 
ciated periodic limb movements of sleep. The 2014 International 
Classification of Sleep Disorder, third edition (ICSD-3) has been 
used for the diagnostic criteria [13]. In addition, the most recent 
diagnostic criteria for restless legs syndrome (http://www.irlssg.org) 
improved specificity of the diagnosis and highlighted both episodic 
and chronic symptoms [14]. 














A sleep history should be a short, but routine, part of all psychiatry 
consultations. Many sleep disorders can be accurately diagnosed by 
clinical history alone. A limited number of things go wrong with 
sleep—patients complain of sleeping too much or too little, of 
sleeping at the wrong time of the day, and less commonly of ‘things 
that go bump in the night’ (or the complaint will come from the bed 
partner). 

Simple screening questions are suggested in Box 110.1. A collat- 
eral history from the bed partner, where possible, is helpful to deter- 
mine the presence of any snoring, periodic limb movements of sleep, 
sleepwalking, or other parasomnia. It is important to ask about the 
duration of symptoms (typically young onset for narcolepsy and 
NREM parasomnia) and whether they are progressive or relapse 
and remit. Some conditions disturb the bed partner far more than 
the patient, and establishing who is more concerned is important. 

Discussing the patient's concerns about sleep and expectations 
of normal sleep is not only worthwhile, but also part of treatment 
in insomnia disorder. For those with significant daytime sleepiness, 
driving safety should be discussed when relevant. The American 
Academy of Sleep Medicine recommends, “Have you had crashes, 
near misses, or claims on insurance in the last year?’ Those who are 
aware of nodding at the wheel or have modified driving are at most 
risk [15]. 

One of the most simple and cost-effective tests is a sleep diary kept 
for at least 2 weeks. Alongside a typical history, this may be the only 
diagnostic test needed for insomnia disorder, circadian rhythm dis- 
order, or daytime sleepiness caused by sleep restriction or shift work. 
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Hypnogram of normal sleep 
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Fig. 110.1 The sleep hypnogram. Sleep stages over a typical night as determined by polysomnography. During adult life, at night, we cycle between 
non-rapid eye movement (NREM) sleep and rapid eye movement (REM) sleep approximately every 90-120 minutes. REM sleep typically occurs >90 
minutes into the night, and NREM sleep is divided into three stages: N1, N2, and N3. N3 is often known as slow-wave sleep (SWS). Young adults will 
wake 2-4 times during the night, although they may not be aware of this. Most adults will have 4-5 cycles of REM sleep, although dream recall is highly 


variable and few patients recall 4-5 distinct cycles of dreaming. 


Box 110.1 Recommended questions for a sleep history 


Key sleep questions 
1 Do you snore heavily? Has anyone witnessed prolonged pauses in 
breathing (apnoeas)? 


For typical obstructive sleep apnoea, snoring is very loud, heard outside the 
bedroom door; the pauses are typically 20-30 seconds, rather than brief 
(<5-10 seconds); and a snort or a start at the end of apnoea is characteristic. 


2 Do you have unpleasant tingling or discomfort in the legs which 
makes you need to kick or to move? Is it worse in the evenings? Is it 
helped by moving (restless in the body, rather than a racing mind)? 


Everybody needs to be asked specifically about restless legs; people can 
find it hard to describe the dysaesthesia—‘wriggly’, ‘itchy’, ‘worms under 
the skin’. Bed partners can be asked about the infuriatingly regular kicks 
that may wake them, but not the patient—every 20-30 seconds is typical. 


3 What drugs do you take, and when do you take them, including the 
dose and timing of caffeine/alcohol and nicotine? 


Consider over-the-counter medication that might affect sleep, so short- 
acting painkillers, sedative opiates, and adrenergic drugs such as inhalers 
or some nasal sprays; counting the cups of coffee or tea is important. 


4 ‘Take me through your typical 24 hours’—describing the sleep/wake 
pattern over a day. 


Particularly helpful for those with sleep restriction, circadian rhythm dis- 
order, and insomnia. 


5 Doyou nap during the day—if so when and for how long? Can you get 
through the day without sleeping? 


Examples of when they might sleep can help to distinguish fatigue from 
true hypersomnia; few of us would sleep in the middle of a conversation 
or with food in our mouths. 


6 Do you have any history of nightmares, acting out of dreams, sleep- 
walking out of the bedroom? If so, what time of night do things tend 
to happen? 


Typically, the first cycle of REM sleep starts 90 minutes into the night, so 
REM parasomnias are more likely in the second half of the night. NREM 
parasomnias can occur within the first hour of sleep. 








This should include: lights out, time in bed, estimated time asleep, 
any daytime napping, and ideally daily intake of caffeine, alcohol, 
meals, and exercise. There is debate about the time needed for diag- 
nosis, but ICSD-3 highlights that certain circadian rhythm disorders 
will be missed with a single week of diaries. Weekends are different to 
weekdays, even in those who are unemployed or on work shifts. Some 
diaries rate satisfaction with sleep on a nightly basis, but this does not 
have diagnostic sensitivity and the author does not do this—generally, 
if they were satisfied with sleep, they would not be completing a 
sleep diary. Patients should complete the diary once a day, but not 
at night. Commercial accelerometers within smart phones and other 
gadgets tend to overestimate movement as wake and should not be 
used [16]; they can, in themselves, exacerbate insomnia. If patients 
awake refreshed and do not perceive a problem falling asleep, then 
they should largely avoid looking at movement-based data. Typical 
diaries are shown from normal and abnormal sleepers (Fig. 110.2). 


Screening questionnaires—which scales to use? 


The Epworth Sleepiness Scale (ESS) is the most widely used ques- 
tionnaire to assess daytime sleepiness as self-rated over the prior 
month [17] (Box 110.2). Advantages include speed and simpli- 
city for the patient and validation within psychiatry populations. 
However, it correlates poorly with more objective measures of sleep 
in the laboratory [18]. Many typical insomniacs score 0-1; >10 is 
taken as daytime sleepiness, and >17 as very sleepy. Change in ESS 
score is often used to assess interventions for sleepiness such as 
stimulants [19, 20]. Added value is often obtained from asking a 
spouse or partner to also complete the questionnaire. 

The other widely used questionnaire to separate good and bad 
sleepers is the Pittsburgh Sleep Quality Inventory [21]. Seven ques- 
tions give a single score, with >5 indicating some form of sleep 
disturbance and those scoring >10 having more severe sleep disturb- 
ance. This is widely used in research but is more complex to complete. 

The insomnia severity index is validated for insomnia severity and 
improvement after therapy, with scores ranging from 0 to 24 [22]. It 
is quick and easy to complete but will not distinguish between pri- 
mary or secondary causes of insomnia. 





























































































































































































































































































































































































































































































































pm midnight am noon pm 
6 8 10 12 2 6 10 12 2 4 6 
ACTIVITIES i 1 M M 
LIGHTS OUT__ 1 Dom TOTAL SLEEP TIME__ 6 . Dars 
pm midnight am noon pm 
6 8 10 12 2 6 10 12 2 4 6 
ACTIVITIES c M M 
LIGHTS OUT__ 1 1 pm TOTAL SLEEP TIME__ 5 . mn 
pm midnight am noon pm 
6 8 10 12 2 6 10 12 2 4 6 
Activities} M | X i M M 
LIGHTS OUT_ 1 1 š 3 O om TOTAL SLEEP TIME_ 6 _ tvs 
pm midnight am noon pm 
6 8 10 12 2 6 10 12 2 4 6 
ACTIVITIES M M 
LIGHTS OUT__ 1 om TOTAL SLEEP TIME__ Z __hrs 
pm midnight am noon pm 
6 8 10 12 2 6 10 12 2 4 6 
ACTIVITIES ; M 
LIGHTS OUT 1 2 . 1 O pm TOTAL SLEEP TIME_ Z nrs 
pm midnight am noon pm 
6 8 10 12 2 6 10 12 2 4 6 
ACTIVITIES MJAJA C/M 
LIGHTS OUT__ 1 1 7 3 Opm TOTAL SLEEP TIME __ 6 ns 
pm midnight am noon pm 
6 8 10 12 2 6 10 12 2 4 6 
ACTIVITIES M M 








[== 























=i 





















































Fig. 110.2 Sleep dia 
a) Diary 1. Anormal ( 
asleep is normal, and 


LIGHTS our OO. 1 O pm 


jes. 





patient with insomnia 





normal sleep and advi 


TOTAL SLEEP TIME_ 8 nes 


albeit slightly introspective) 45-year old with occasional brief wakenings and rapid returns to sleep. Taking up to 30 minutes to fall 
increased sleep time at weekends and non-work days are within current societal norms. Treatment was a clear explanation about 
sing him to avoid using his phone to record his sleep. 

b) Diary 2. Insomnia disorder where time in bed is far longer than time perceived asleep, and both difficulty initiating and maintaining sleep are seen 
without daytime naps. Variability among nights is seen, with occasional good nights of prolonged sleep. 

c) Diary 3. Delayed sleep phase syndrome in a 23-year-old male who has had this sleep pattern since the age of 15. The pattern is of falling asleep 
ypically between 2.30 a.m. and 4 a.m., but waking well after 11 a.m. with a few night wakenings. The estimated total sleep time is far greater than in the 
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Fig. 110.2 Continued 
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CHAPTER 110 Diagnosis of sleep and circadian rhythm disorder 


Box 110.2 Epworth Sleepiness Scale 


ame: Today's date: 





Your age (Yrs): Your sex (Male = M, Female = F): 
How likely are you to doze off or fall asleep in the following situations, 
in contrast to feeling just tired? This refers to your usual way of life in 


recent times. 
Even if you haven't done some of these things recently try to work out 
how they would have affected you. 


Use the following scale to choose the most appropriate number for each 
situation: 





0 = would never doze 
1 = slight chance of dozing 
2 = moderate chance of dozing 





3 = high chance of dozing 

It is important that you answer each question as best you can. 
Situation Chance of dozing (0-3) 
Sitting and reading 

Watching TV 

Sitting, inactive in a public place (e.g. a theatre or a meeting) 

As a passenger in a car for an hour without a break 

Lying down to rest in the afternoon when circumstances permit 


Sitting and talking to someone 








Sitting quietly after a lunch without alcohol 

n a car, while stopped for a few minutes in the traffic 
THANK YOU FOR YOUR COOPERATION 

© M.W. Johns 1990-1997. Used under License 


Reproduced from Sleep, 14(6), Johns MW, A new method for measuring daytime 
sleepiness: the Epworth sleepiness scale, pp. 540-5, Copyright (1991), with permis- 
sion from Oxford University Press. 

















Hypersomnia affects up to 15% of the adult population, with 5% se- 
verely affected [23]. Differential diagnoses include obstructive sleep 
apnoea (OSA), sleep restriction with insufficient hours in bed for 
the needs of the individual, sedative medication, narcolepsy with or 
without associated cataplexy (transient loss of skeletal muscle tone 
typically triggered by strong emotion), and idiopathic hypersomnia. 
Excessive daytime sleepiness can be caused by head trauma (usually 
transient for the first 3-6 months; if it persists, other causes should 
be screened for) [24]. Troublesome daytime sleepiness is common 
in both early and more advanced neurodegenerative conditions 
such as Parkinsons disease where it is typically multifactorial. 
Louter and colleagues produced a useful diagnostic algorithm [25]. 

Circadian rhythm disorders can masquerade as hypersomnia, in 
particular in shift workers over 40 or young adults with delayed sleep 
phase syndrome. 


Behaviourally induced insufficient sleep syndrome 


(sleep restriction) 





Those who are sleep-restricted for sleep will have daytime sleepiness. 
There are few true short sleepers, and most of the adult population 


sleeping <5 hours will have symptoms of poor concentration and 
sleepiness. ‘Social jet lag’ describes increased sleep over weekends 
to repay sleep debt. Variability over days and completion of a sleep 
diary, alongside a careful history, may be sufficient to make the diag- 
nosis. Occasionally, wrist actigraphy can record sleep-wake patterns 
where there is diagnostic doubt. 

Napoleon, Margaret Thatcher, and Thomas Edison are often ac- 
claimed as short sleepers of only 4 hours a night. Further analysis re- 
veals them all to be efficient power nappers. Edison, who dismissed 
sleep as a ‘waste of time and throwback to our cave days, was once 
furious to be caught napping in a cupboard during the day. 


Obstructive sleep apnoea 


The most common cause of significant daytime sleepiness in those 
over the age of 50 is OSA. It remains underdiagnosed despite having 
an excellent, cost-effective, symptomatic therapy—continuous positive 
airways pressure [26]. Risk factors include male sex, increasing age (in 
particular over 50), obesity, increased neck circumference (>17 inches), 
and retrognathic jaw. Increasing obesity means that at least 10% of men 
over 40 and 5% of women now have moderate or severe sleep apnoea 
[27]. In children, OSA is seen with adenotonsillar hypertrophy and cer- 
tain dysmorphic conditions such as Downs syndrome. 

Psychiatry patients have particular risk factors for sleep apnoea, 
and studies across a range of psychiatric disorders have reported 
rates of up to 66% [28, 29]. There is most evidence for increased 
rates in major depressive disorder and in those with elevated BMI 
[30]. This is a particular problem with those established on drugs for 
psychosis with metabolic syndrome [31, 32]. 

The history includes very loud snoring, progressively trouble- 
some sleepiness, awaking unrefreshed, and often progressive over 
months and years. Spouses often reposition the patient onto their 
side, and the pauses in breathing are often long enough (>30 sec- 
onds) to cause concern. It is rare for the patient themselves to 
have a perception of choking, although people can wake them- 
selves with their own snoring. Morning headache is common, 
often due to exacerbation of primary headache such as migraine, 
rather than associated respiratory failure. Clues for those who live 
alone include a dry, sore throat in the morning, nocturia, awaking 
unrefreshed, vivid dreams, and night-time sweating, alongside 
daytime sleepiness. Untreated OSA can cause treatment-resistant 
hypertension. 

At least 20% of patients are not overweight, and those over 
65 years of age can be slim with atypical presentations, including 
cognitive impairment. The STOP-Bang is a validated screening tool 
for OSA to assess those who may need further investigations [33] 
(http://www.stopbang.ca). 

Those with suspected OSA should be referred for a respira- 
tory sleep study. This detects pauses in breathing—either partial 
(hypopnea) or complete (apnoea)—alongside oximetry and gives an 
apnoea—hypopnea index (AHI). AHI of >5/hour is abnormal, but 
AHI of >15 (moderate) and >30 (severe) is far more likely to cause 
symptoms and to respond to treatment. Those with untreated OSA 
report more symptoms of depression and anxiety than the back- 
ground population [34, 35]. 

Symptomatic OSA is notifiable to the driving licence authorities, 
but treated patients can restart driving. 








SECTION 17 Sleep-wake disorders 


It is also worth mentioning chronic fatigue syndrome. Sleep dis- 
orders must be excluded, and OSA is a common mimic. Recent 
polysomnography studies in those with an established diagnosis 
of chronic fatigue syndrome showed 21-49% had an undiagnosed 
sleep disorder, most commonly OSA [36, 37]. 


Central sleep apnoea 


Those taking sedative medication and, in particular, opiates, as well 
as those with heart failure, can have prolonged apnoeas without 
snoring as a central phenomenon. Increased opiate prescribing over 
the past 20 years has led to a belated recognition of their harmful 
effects, including sleep apnoea [38]. Those with chronic pain often 
complain of bad sleep, but this is mostly attributed to the pain it- 
self. Within pain clinics, rates as high as 25-50% are reported, often 
with secondary respiratory failure [39, 40]. Those at highest risk 
are those taking >200 mg of equivalent dose of morphine, in com- 
bination with benzodiazepines. Recent changes in the Center for 
Disease Control prescribing guidelines in the United States have 
recognized the dangers and may influence prescribing in other 
countries [41]. 


Narcolepsy 


Narcolepsy, meaning ‘seized by somnolence, was described by 
Gelineau in 1880 to include sleep attacks and falls triggered by 
strong emotion (cataplexy). The most recent diagnostic criteria 
(Box 110.3) reflect an increasing understanding of the neuropath- 
ology [42]. Narcolepsy is now characterized as type 1 or type 2, 
with the diagnosis made on a typical history plus supportive in- 
vestigations [either polysomnography plus daytime multiple sleep 
latency tests or, more recently, reduced cerebrospinal fluid (CSF) 
hypocretin]. Its prevalence is 0.05-0.1%, but often with significant 
delay to diagnosis [43]. 

It typically develops in adolescence with excessive daytime 
sleepiness evolving over weeks to months and then remaining 
relatively fixed and unremitting over time. It is best thought of as 
a sleep switch problem where patients transition between sleep/ 
wake states more frequently than normal. The classical tetrad of 
excessive daytime sleepiness, hypnogogic and hypnapompic hal- 
lucinations, cataplexy, and sleep paralysis is only seen in 20% of 
patients at first presentation. One-third of patients will never de- 
velop cataplexy or have only very subtle symptoms. The majority 
will have disabling sleepiness and will not be able to get through 
the day without napping. Many describe naps as difficult to resist 
at times, but classical sleep attacks without any recollection of the 
event are, in fact, uncommon. The description from a relative or 
witness is often of a conversation stopped halfway and restarted as 
if no time has passed. 

Key diagnostic features include the need for brief and typically 
refreshing naps, of sometimes just a few minutes. Adult patients will 
typically have a fragmented night, and some feel that this should be 
part of the diagnostic criteria. Certainly if an adult is very sleepy, but 
with a long, deep, and unbroken night, then another cause of sleepi- 
ness is more likely. Dream recall is strikingly vivid, often after even 
short daytime naps. The description of hypnagogic hallucinations 


Box 110.3 Diagnostic criteria for narcolepsy with and 





without cataplexy 


Narcolepsy type 1—narcolepsy with cataplexy 

Criteria A and B must be met: 

A The patient has daily periods of an irrepressible need to sleep or day- 
time lapses into sleep occurring for 23 months. 

B The presence of one or both of the following: 

1 Cataplexy and a mean sleep latency of <8 minutes and 22 sleep- 
onset REM periods (SOREMPs) on a multiple sleep latency test 
(MSLT) performed according to standard techniques. A SOREMP 
(within 15 minutes of sleep onset) on the preceding nocturnal 
polysomnogram may replace one of the SOREMPs on the MSLT. 

2 CSF hypocretin-1 concentration, measured by immunoreactivity, 
is either <110 pg/mL or <1/3 of mean values obtained in normal 
subjects with the same standardized assay. 








Narcolepsy type 2—narcolepsy without cataplexy 

Criteria A-E must be met: 

A The patient has daily periods of an irrepressible need to sleep or day- 
time lapses into sleep occurring for =3 months. 

B A mean sleep latency of <8 minutes and >2 SOREMPs are found on 
an MSLT performed according to standard techniques. A SOREMP 
(within 15 minutes of sleep onset) on the preceding nocturnal 
polysomnogram may replace one of the SOREMPs on the MSLT. 

C Cataplexy is absent.” 

D Either CSF hypocretin-1 concentration has not been measured or 

CSF hypocretin-1 concentration measured by immunoreactivity is ei- 

ther >110 pg/mL or >1/3 of mean values obtained in normal subjects 

with the same standardized assay. 

E Hypersomnolence and/or MSLT findings are not better explained by 

other causes such as insufficient sleep, obstructive sleep apnoea, de- 

layed sleep phase disorder, or the effect of medication or substances 
or their withdrawal. 





“If cataplexy develops later, reclassify as type 1. 











can vary, but unlike those with psychotic disorders, the hallucin- 
ations occur within sleep and are typically visual. 

Cataplexy is typically an axial loss of posture affecting the head, 
face, and neck, and patients can describe their face becoming loose 
or their jaw dropping. Bigger attacks cause falls, but with retained 
consciousness; eyes close, and patients look as if they are asleep. 
Recovery is typically quick within a minute or so. Cataplexy is worse 
at times of emotional arousal. Younger children can find it harder to 
describe emotional triggers and may have a near permanent decrease 
in muscle tone, giving a tongue that lolls and a pseudomyopathic ap- 
pearance with a clumsy, waddling gait [44].Those presenting later in 
life have a milder phenotype, but sleepiness that is definitely new in 
onset over the age of 40 is rarely due to narcolepsy. 

Narcolepsy is now known to be caused by a loss of hypocretin 
neurons within the hypothalamus [45]. Since hypocretin also has a 
key role in metabolism, patients typically gain weight as the condi- 
tion evolves with a craving for carbohydrate-rich foods. Children, in 
particular, can have significant weight gain and precocious puberty 
as a consequence [46]. Many have parasomnia and a mild REM sleep 
behaviour disorder (RBD); the dream enactment rarely causes in- 
jury but can lead to diagnostic confusion and can occasionally be the 
presenting complaint [47]. 

Specific diagnostic tests include haplotyping for the commonly as- 
sociated HLA-DQB1*0602 allele. This is seen in 95% of those with 
narcolepsy and cataplexy, but also in 25% of the background popula- 
tion and only in 60% of those without cataplexy, so it is of limited use. 
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It is strongly associated with low CSF hypocretin. The author does not 
use it, unless planning CSF testing. Polysomnography plus multiple 
sleep latency tests have for years been the gold standard objective test 
of both sleepiness and abnormally early-onset REM sleep (4-5 timed 
nap opportunities with patients given a 20-minute period and the 
instruction to ‘try to fall asleep’). Those with a mean sleep latency of 
<8 minutes are said to be pathologically sleepy; many patients with 
narcolepsy have far shorter sleep latencies of <5 minutes [48], and the 
presence of REM sleep within 15 minutes on two or more occasions 
has 70% sensitivity and 97% specificity [49, 50]. There is no value for 
routine EEG to assess for the presence of sleepiness. 

Differential diagnosis includes sleep restriction, and careful as- 
sessment of the average time in bed should be measured, ideally with 
sleep diaries. Comorbid sleep apnoea should be excluded in those 
with increased BMI who also snore. 


Idiopathic hypersomnia 


A less well-defined condition, patients typically complain of pro- 
longed and unrefreshing sleep, typically answering the question of 
‘how long do you nap?’ with the answer ‘until I am woken. Naps 
are much more likely to last over an hour, and for these patients, 
there is little dream recall and no cataplexy [51, 52]. In one-third 
of patients, symptoms have been present since childhood. Unlike 
narcolepsy, there is a family history in at least a third. Typically, pa- 
tients describe the phenomenon of a prolonged night sleep of often 
9 or 10 hours and then marked difficulty waking (sleep drunken- 
ness), with repeated returns to sleep. Being late into work or school 
is common. Diagnostic criteria again require polysomnography plus 
multiple sleep latency tests confirming a mean sleep latency of <8 
minutes, but without sudden-onset REM sleep. Polysomnography 
often shows prolonged slow-wave sleep and high sleep efficiency. 
Unlike narcolepsy, about 10% spontaneously remit. The differential 
diagnosis includes hypersomnia associated with depression, but this 
more commonly causes more variable symptoms over weeks and 
months, with periods of remission [53]. 





Parasomnias—‘things that go bump in the night’ 


Described as ‘undesirable physical and/or experiential phenomena 
accompanying sleep, parasomnias can involve complex motor acts, 
including walking, eating, texting, and even driving. Rarely, this 
can be misdiagnosed as a primary psychiatric problem or, more 
commonly, as possible nocturnal seizures. At least 20% of all chil- 
dren have some form of parasomnia, although most do not pre- 
sent to doctors. At least 2-3% of the adult population have some 
form of parasomnia, and this is increased to 4-9% in psychiatric 
patients [54]. 

Parasomnias are divided into those occurring in NREM or 
REM sleep. 


Non-rapid eye movement parasomnias 


Confusional arousals, sleepwalking (somnambulism), sleep-related 
eating disorder, and sexsomnia are all NREM parasomnias. They 
are disorders of arousal [55], with incomplete wakening from 
slow-wave sleep (N3). These disorders are common in childhood, 


and there is often a family history. Events can occur within the first 
30-60 minutes of sleep, which is less common for REM-related 
events. Occasional sleep talking is also an NREM parasomnia, but so 
common as to be near universal. 

Typically, there is a lack of awareness for at least some, if not all, 
of the events or if there is recall, it is patchy. Partners almost always 
describe more night-time events than the patients, and events last 
minutes, rather than seconds (5-15 minutes would be the average). 
There is variability to events, but clustering is common, with sev- 
eral events over a week and then periods of relative remission. 
Patients who are out of the bed and out of the bedroom are more 
likely to have NREM, rather than REM, parasomnia. Patients can 
be both argumentative and combative, but directed violence is rare. 
Hallucinatory phenomena can occur within NREM sleep. 

NREM parasomnia typically decreases moving into adulthood; 
however, those who are predisposed can then have further symp- 
toms with acute psychiatric illness, shift work, or irregular sleep 
patterns [56]. The other well-recognized trigger is another sleep dis- 
order such as OSA or restless legs [57, 58]. 

Specific additional investigations include sleep diaries to look 
for triggers and scheduling issues and a witness history, wherever 
possible. The value of inpatient video polysomnography is debated, 
as it rarely changes the diagnosis but may detect additional sleep 
apnoea or restless legs [59]. If events are captured, they arise from 
slow-wave sleep (N3 sleep stage). If the differential diagnosis does 
include nocturnal seizures, then MRI brain and EEG should also be 
performed. However, the main value of a sleep study is to look for 
other sleep disorders in adults that might be triggering the event or 
an alternative diagnosis ifno witness history is available. The current 
ICSD-3 criteria do not require polysomnography to make the diag- 
nosis with a typical history. 


Rapid eye movement parasomnias 


These disorders are typically brief, memorable, and dream-filled, 
and unlike NREM parasomnias, patients are far more likely to have 
recall of events. REM parasomnias include nightmare disorder, iso- 
lated sleep paralysis, and RBD. 

In normal sleep, a powerful, descending glutaminergic signal 
inhibits the spinal motor neurons during REM sleep with reduced 
skeletal muscle activity or REM atonia [60]. Only the diaphragm and 
the extraocular muscles are active (giving the rapid, jerky eye move- 
ments on the electro-oculogram within polysomnography). In RBD, 
an insidious and progressive degeneration of this brainstem control 
causes increasingly troublesome dream enactment, alongside vivid 
and often violent dream content. Affected patients shout and act out 
dreams, and up to 70% will injure themselves or their bed partner 
and this is often the prompt for referral. This is typically in the second 
half of the night. The condition predominantly affects older men, 
with a reported prevalence of at least 1%. RBD was first described by 
Carlos Schenck in 1986 in a small series of elderly men [61]. In fact, 
selective brainstem lesions in animals were shown to cause dream- 
like behaviours many years earlier [62]. Over recent years, there 
has been increasing recognition of the strong association between 
RBD and other neurodegenerations, and in particular Parkinson's 
disease, dementia with Lewy bodies, and multiple system atrophy. 
Over 80% of those with RBD will develop one of these conditions 
over 15 years of follow-up [63]. Studies show subtle cognitive, ol- 
factory, and gait disturbance in RBD patients, compared to controls 
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Fig. 110.3 Inpatient video polysomnography in a patient with REM sleep behaviour disorder. A 30-second epoch of polysomnography showing EEG, 
EOG, and EMG, with loss of normal REM atonia (bottom line) highlighted with black arrows. This patient had a clinical history typical for REM sleep 


behaviour disorder (RBD). 


[64]. In the Parkinson's disease clinic, at least 50% of patients will 
have some symptomatic RBD with a range of validated screening 
questionnaires primarily developed for research [65]. 

The ICSD-3 criteria specify the need for polysomnography in the 
context of a typical history, unlike other parasomnias where history 
alone is sufficient. This shows loss of REM atonia and occasionally 
abnormal behaviour on video (Fig. 110.3). 

RBD is rare in children, but there is a recognized association with 
narcolepsy. Loss of REM atonia is seen with SSRI and SNRI anti- 
depressants, with occasional dream enactment [66, 67]. 

Screening for RBD should therefore be a routine part of history 
taking in the memory clinic, and RBD can mimic some of the fea- 
tures of post-traumatic stress disorder. Key differences include a 
variety of different dreams, with not all of them being violent, and 
a lack of daytime distress. Occasionally, severe OSA can cause strik- 
ingly vivid dreams and mimics RBD. Screening for sleep apnoea 
should be considered in those who are sleepy or have other features 
to suggest OSA. 


Other REM parasomnias 


Isolated sleep paralysis where someone awakes but for a minute or so 
cannot move. This occurs with associated dream-like imagery and 
occasionally with a feeling of heaviness in the chest or shortness of 
breath, and it is more common in the second half of the night or 
on waking from sleep. Touch relieves symptoms, and patients can 
describe trying to shout or to move. The diagnosis is made on his- 
tory alone, and there are no specific tests routinely used. This is a 
common parasomnia experienced by at least 20% of the population 


on occasion and up to 2-3% regularly [68], but it can be distressing 
when it first occurs. It is due to a dissociation between reactivation 
of skeletal muscle tone and return to wakefulness and is often seen 
with abrupt wakenings from REM sleep. For those with other psy- 
chopathology, the dreams can be more intrusive and involve themes 
of abuse. Typical triggers include irregular sleep/wake cycles for any 
reason. It can also be seen as a rebound phenomenon when stopping 
REM-suppressing antidepressants such as venlafaxine. It is seen fre- 
quently in those with narcolepsy, but alongside other features such 
as daytime sleepiness [69]. 

Nightmares are a universal phenomenon, but those with fre- 
quent and distressing nightmares have nightmare disorder. There 
is an association with depressive symptomatology [70]. An almost 
endless list of medications are said to be associated with night- 
mares or vivid dreams, but in practice, the common culprits are 
beta blockers, opiates, or cessation of REM-suppressing anti- 
depressant medications. 


Benign hypnagogic hallucinations 


There is debate about which sleep state these strikingly real, but 
exclusively nocturnal, and typically visual phenomena arise from. 
Typically occurring in the early part of the night, insects are com- 
monly described themes, with rapid resolution of symptoms when 
lights come on. Insight is retained or rapidly returns with wake- 
fulness. They seem more frequent in young women, but there are 
limited prevalence data. 
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Box 110.4 Circadian rhythm disorders 





e Advance sleep phase syndrome 

e Delayed sleep phase syndrome 

e Shift work disorder 

e Jet lag circadian rhythm disorder 

e Irregular sleep/wake syndrome 

e Non-24-hour sleep/wake syndrome 











Circadian rhythm disorders 


There are six distinct circadian rhythm disorders currently defined 
by ICSD-3 (Box 110.4) and increasing recognition of the relatively 
high frequency of these conditions in psychiatric patients [71]. 
All are caused by misalignment of the period of sleep, and many 
are characterized by the length of sleep time and often the quality 
of sleep remaining relatively normal, but simply out of sync with 
societal norms. 


Delayed sleep phase syndrome 


This presents in teenage years with patients who typically fall asleep 
and wake far later than the societal norm, causing associated daytime 
dysfunction. The diagnosis can be made on history alone, alongside 
well-completed sleep diaries (Fig. 110.2). For many, the key question 
is ‘if you were allowed to sleep when you wanted, would you sleep 
well? The quality of sleep is often good if uninterrupted, but patients 
simply do not feel sleepy in the first half of the night, falling asleep 
between 3 and 4 a.m. and then struggling to wake in the morning. 
Those who have sleep restriction when woken early may have day- 
time sleepiness; there is also evidence that the total sleep period (tau) 
is longer in this group. Males are more commonly affected [72]. 

The diagnostic criteria include either sleep diaries or actigraphy 
for at least 2 weeks, with a typical sleep diary shown from a teen- 
ager subsequently well treated with melatonin and light therapy 
(Fig. 110.2c). There are two groups: those with a lifelong tendency, 
often with a family history; and those that drift during teenage years. 
Many report symptoms persisting over years, once established. It 
can be misdiagnosed as insomnia or chronic fatigue syndrome. 


Shift work disorder 


At least 20% of the western world do shift work, and for at least 20%, 
this significantly disrupts sleep. This decreases total sleep time and 
disrupts the circadian rhythm, with substantial evidence for in- 
creased mortality, obesity, cancer rates, and affective disorders in 
those who do shift work, compared to non-shift workers [73-75]. 
Shift work disorder occurs at greater rates in women, those over 
40, and those who work rotating or back-to-back night to day shifts. 
Patients typically present with both insomnia when trying to sleep 
during the day and sleepiness during their shift, although many do 
not attribute either sleepiness or fatigue to their work patterns. This 
is often because sleep debt can accumulate over time, and shifts that 
may have been tolerable in the third or fourth decade become harder 
if there is added sleep disturbance with a young family, a longer 


commute, or an additional sleep disorder such as sleep apnoea. ESS 
may well be increased, but the key test remains sleep diaries. 


Restless legs syndrome and periodic limb 


movements of sleep 





This common sleep-related movement disorder is seen in at least 5% of 
the population at all ages [76], and diagnosis can be made on a typical 
clinical history for many. However, it is often missed by both patient 
and doctor as a cause of a disturbed night and a sleepy day. The diag- 
nostic features, as described by the most recent International Restless 
Legs Syndrome Study Group (IRLSS) criteria, are an unpleasant dys- 
aesthesia deep within the legs that causes the desire to move and is 
at least partially relieved by movement [14]. Twenty-five per cent of 
patients have daytime symptoms, but there is a clear circadian pattern, 
with symptoms worst in the evening and first half of the night. Patients 
can find it surprisingly difficult to describe the sensations; ‘creeping, 
‘itchy; ‘fidgety, and ‘crawling’ are all words used. Thighs and calves 
are affected more than the soles of the feet, and patients are restless in 
bed, seeking cool surfaces upon which to rest their feet. Pacing and 
walking during the night is seen in more severe cases. Symptoms can 
be asymmetric and include the low back, and less commonly the arms 
can be affected. Unlike arthritis pain or peripheral neuropathy, people 
are better or symptom-free when walking. 

Symptoms can be episodic, and not all have sleep disturbance, but 
for those with moderate or severe symptoms, they can have consid- 
erable sleep fragmentation and subsequently a sleepy day. At least 
50% will have a family history, and those with other affected family 
members typically present at a younger age. About 5% have had 
symptoms starting under the age of 16, often mislabelled as growing 
pains. There is an association with restless legs and high caffeine in- 
take, smoking, and excess alcohol. Exercise close to bedtime exacer- 
bates symptoms. 

Approximately 80% of those with restless legs can describe (or a 
frustrated bed partner can) strikingly periodic limb movements of 
sleep and occasionally of wakefulness. There is a brief judder of the 
toe, foot, and ankle, and 20-30 seconds pass before the next. The 
impressive response to dopamine agonists is often both diagnostic 
and therapeutic. 

There is an association with low ferritin levels for some and blood 
levels should be checked, and many psychotropic medications ex- 
acerbate symptoms, particularly those that are dopa-depleting, as 
well as some antidepressants and melatonin. 

Polysomnography can record the periodic limb movements with 
both limb EMG and video. This also helps to exclude other/add- 
itional causes of sleep disturbance such as OSA. Lower limb/foot 
actigraphy can be used over 2-3 nights at home to measure the peri- 
odic limb movement index. 

The differential diagnosis includes drug-related akathisia and 
hypnic jerks, but both lack the striking circadian pattern to symp- 
toms and periodicity to night-time myoclonus. 


Insomnia disorder 


Insomnia remains the most common sleep disorder in primary care, 
with at least 10% of the population affected [77]. Now simplified as a 
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diagnostic entity in its own right, it is defined as difficulty initiating 
sleep, difficulty maintaining sleep, and associated daytime distress 
persisting for >3 months. It is commonly comorbid with other 
mental and physical health problems [78, 79]. At least 40% will have 
anxiety disorder. Typically, ESS will be low at 0-2, and those with 
high ESS should be screened for secondary causes of insomnia. 

At least 10% have restless legs syndrome as the cause of their in- 
somnia; 20% of those with sleep apnoea will perceive an unrefreshing 
night, although the other clues of snoring and daytime sleepiness 
should prompt further investigations. Gastro-oesophageal reflux 
can be surprisingly toxic to sleep as another secondary cause. It can 
be surprisingly hard to distinguish insomnia alone from insomnia 
with comorbid OSA in those over 65 [80]. 

The other common mimic is circadian rhythm disorder where pa- 
tients perceive difficulty initiating sleep with delayed sleep phase, 
but in fact, they then stay asleep well, and sleep diaries and an ac- 
curate history should distinguish the two. There are a small number 
of truly short sleepers who, in fact, awake refreshed after just 5 hours 
but feel this is not enough. For insomnia disorder, there has to be 
associated daytime functional impairment. 

The cornerstone of both diagnosis and therapy is sleep diaries 
where a typical pattern of more time in bed than being asleep is usually 
seen. Sleep efficiency (total sleep time/time in bed) can be calculated. 
Normal sleep is typically >85% sleep efficiency, and those with severe 
insomnia often have sleep efficiency of <50%. A typical diary is shown 
in Fig. 110.2. Most will have occasional nights of more normal sleep. 
Fatigue, rather than daytime sleepiness, is described, and documented 
daytime naps should be infrequent. A hypervigilant phenotype is typ- 
ical, with ‘My mind is racing’ and ‘As soon as I climb the stairs, the 
lights go on’ as typical statements heard in clinic. Women are more 
affected than men, and increasing age is also a risk factor. A widely 
accepted model is that of predisposing factors, a precipitating trigger, 
and then perpetuating maladaptive behaviours that disrupt both the 
normal homeostatic and circadian drivers to sleep [81]. 

The history should include questions about the typical 24-hour 
pattern and the bedroom itself. Review of the prescription list for 
any stimulant medications is required, alongside an assessment of 
mood. There is often a mismatch between perceived daytime per- 
formance and more objective assessments. This is very different to 
the sleep apnoea or narcolepsy story where daytime function has 
usually been objectively affected. Use of sleeping tablets, including 
over-the-counter preparations, needs to be documented, as these 
may cause some of the daytime fatigue. 

A sleep study is not indicated unless another sleep disorder 
is suspected, and actigraphy is rarely used outside of research. 
Polysomnography when performed typically shows reduced sleep 
efficiency, prolonged sleep latency, and early REM latency, but these 
changes are also seen in depressive disorder. There is often sleep mis- 
perception, with patients estimating less time asleep in the labora- 
tory, but explaining this rarely helps the patient. 


Miscellaneous sleep disorders 


Hypnic jerks 


A single shock-like jerk at sleep/wake transition is a near- 
universal phenomenon diagnosed by history alone, but it can be 


non-specifically increased by any condition that delays sleep onset 
or increases night wakenings. 


Exploding head syndrome 


This dramatically named, but benign, phenomenon is diagnosed by 
history alone and is thought to be the sensory equivalent of hypnic 
jerks. At sleep/wake transitions, a patient will hear a single loud 
bang or an explosion, or occasionally there is a flash of light that can 
briefly seem entirely external; symptoms can cluster and be wors- 
ened by anxiety, and this tends to increase sleep latency and increase 
night wakenings. 


Bruxism 


Those who grind their teeth at night, such that bed partners can 
hear, risk damage to the enamel and often disturb their partners, 
and some, but not all, have jaw pain. Night-time grinders tend to be 
daytime clenchers, with tenderness around the temporomandibular 
region, and there is some evidence for daytime jaw relaxation. There 
is some limited evidence for an association with increased anxiety 
and low mood [82]. 


Conclusions 


Unfortunately, sleep disorders remain underdiagnosed and 
undertreated [1, 12]. In addition, sleep disturbance in those with 
mental health problems is often attributed to the psychiatric diag- 
nosis or psychotropic medication, rather than considering a distinct 
and treatable set of disorders. This chapter has provided the tools 
to identify specific sleep problems, how to take a sleep history, and 
how to then investigate and diagnose common sleep disorders. Their 
treatment is described in Chapter 114—it should assume a greater 
priority. 
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A recent study [1] estimated the economic costs of insufficient sleep 
to be roughly between 1% and 3% of the gross domestic product 
(GDP) in the industrialized world. To understand the mechan- 
isms and causes behind this expensive modern sleep ‘experiment, 
we need to strengthen sleep research both in the laboratory and in 
the field. 

Sleep research is almost 100 years old. By studying sleep predom- 
inantly in laboratories, the physiology underlying sleep was system- 
atically elucidated, describing, for example, the neuronal circuits 
and brain regions that initiate and maintain sleep [2] and how the 
timing of sleep and wakefulness are controlled by an internal 24- 
hour clock—the circadian clock [3]. Sleep research outside the la- 
boratory in the real world, that is, the epidemiology of sleep, is even 
younger, looking back about only 30 years [4]. 

Sleep and its functions can only be understood in the context 
of the whole 24-hour day, which includes wake. We therefore base 
this review on formal considerations of the four main (albeit inter- 
related) aspects of sleep (and wake): duration, timing, structure, and 
quality (Fig. 111.1). The molecular underpinnings of the system 
have been described in Chapter 109. 

Duration 

Wake and sleep are not identical with, but closely related to, activity 
and rest, which together make up the circadian day. Internal days 
can deviate from 24 hours, but in the long term, the duration of 
activity-rest (sleep-wake) averages up to 24 hours. If, on average, 
longer sleep was not followed by shorter wake, and vice versa, we 
would gradually advance or delay our sleep timing and other daily 
behaviours. 


Timing 

The timing of sleep and wake, is similarly interdependent as the dur- 
ation of sleep and wake. The timing is often defined by the onset 
and offset of the respective episodes; however, this measure is highly 


dependent on the episodes durations. The timing of sleep or wake 
is therefore more reliably defined by the mid-points of either sleep 





or wake, rather than by their transitions. As will be described later, 
the mid-point of sleep on work-free days is used as a surrogate for 
chronotype. The concept of chronotype refers to how the circadian 
clock of an individual embeds itself into the 24-hour light-dark 
cycle—earlier or later. 

Both sleep timing and duration are regulated by at least two pro- 
cesses [5, 6]—one homeostatic, and the other circadian. The former 
works like an hourglass; sleep pressure builds up during wake and 
dissipates during sleep (with non-linear characteristics). Sleep pres- 
sure can be estimated from the EEG, specifically the spectral power 
of slow-wave sleep (SWS) [7]. Yet, if sleep was solely controlled by 
the homeostat, sleep timing would be quite labile. Let us presume 
someone regularly slept 8 hours from 10 p.m. to 6 a.m. and was 
awake for 16 hours from 6 a.m. to 10 p.m. If this person stayed up 
one day until 6 a.m., he/she might sleep as long as 12 hours due to 
the prior sleep loss and would wake up at 6 p.m. with minimal sleep 
pressure. From that moment on, the homeostat would let this person 
sleep from 6 p.m. to 2 a.m. and be awake from 2 a.m. to 6 p.m. 

But sleep—like most functions in our body—is also regulated 
by the circadian clock, which makes being asleep and being awake 
more likely to occur at certain times in the 24-hour cycle than at 
others. The model proposed by Borbély and Daan [5, 6] implements 
the circadian involvement in sleep regulation in making the thresh- 
olds that trigger falling asleep and waking up oscillate in a circadian 
manner. The circadian clock’s influence on sleep becomes evident 
in jet lag situations. When we travel fast over many time zones, the 
resynchronization of our circadian clock lags behind (as a rule of 
thumb, the circadian clock adapts by no more than about 1 hour per 
day). As a result, we have difficulties adjusting to the new light-dark 
regime at our destination, cannot sleep for several days during the 
local night, and can hardly be awake during the local day. 

Our lifestyle has drastically changed with industrialization. While 
we used to expose ourselves to bright natural light outdoors during 
the day and actual darkness during the night, we now spend most 
of the day inside (with light up to a 1000-fold dimmer than outside) 
and simply switch on artificial light from sunset to bedtime. We have 
thereby drastically weakened the synchronizing signals (zeitgeber) 
[8] that our circadian clocks use to synchronize (i.e. entrain) to the 
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Fig. 111.1 Key aspects of sleep-wake behaviour can be assessed 
separately but are highly interrelated. 


24-hour cycle. In adapting to this new situation, most people's cir- 
cadian clocks have become later [9]. This delayed circadian phase is 
quickly reversible, as Ken Wright has shown in elegant experiments 
where he took urban students camping for 1 week and exposed them 
to the strong zeitgeber of natural light and dark [10]. Especially the 
late chronotypes advanced, and the chronotype differences between 
individuals became much smaller. 


Structure 


Both wake and sleep—albeit in very different ways—are also charac- 
terized by temporal structures that depend on many parameters—in 
our society, most importantly, on whether they occur on work-free 
days or workdays. This is illustrated in Fig. 111.2 where the phys- 
ical activity averaged over several days rarely reaches zero in any of 
the subjects; this has at least two reasons: (1) sleep times can vary 
greatly between days (thereby ‘spilling’ activity into the average rest- 
or sleep-window); and (2) some activity can still be recorded during 


Activity profiles on workdays 


vey 


Activity profiles on free days 





sleep. Workday sleep timing is usually more regular, so that activity 
troughs are often more pronounced than on weekends. Although 
activity levels during sleep are by many factors smaller than during 
wake, the residual levels can still be analysed. The resulting activity 
structure during sleep shows a clear rhythmicity with a period of 
around 90-150 minutes, a property it shares with the sleep stages 
recorded by polysomnography (PSG) in sleep laboratories (see 
Activity monitoring for sleep-wake epidemiology, p. 1140). 


Quality 

The fourth aspect is quality (Fig. 111.1). Although we intuitively 
know that sleep and wake quality are strongly interdependent, we 
need an objective measure for sleep quality before we can quantify 
the respective qualities and their interdependence scientifically. 
Since sleep fulfils many biological functions, which we still need 
to fully understand, a universal definition of sleep quality may be 
difficult. 

Sleep science commonly distinguishes between subjective and ob- 
jective sleep quality. Subjective sleep quality refers to how well the 
sleeper reports to have slept after the sleep episode or to sleep in gen- 
eral. This self-report is likely influenced by the sleeper’s individual 
and cultural preconception about how sleep should be or how one 
should feel after waking up. It is also influenced by the sleeper’s in- 
dividual psychological state such as their affect [11]. Objective sleep 
quality, by contrast, compares certain objective sleep measures of 
an individual with normative values. These normative values may, 
in themselves, harbour a bias just by how they were obtained and 
who served as reference population—often young, healthy univer- 
sity students in controlled sleep laboratory environments. The most 
common measures for objective sleep quality are: how long people 
take to fall asleep (sleep latency); how often they wake and how long 
they are awake (wake after sleep onset); how much of their time in 
bed people actually sleep (sleep efficiency); the sleep architecture 
(shuttling between sleep stages); time in certain sleep stages; and the 
intensity of the slow brain waves during deep sleep (delta power). 
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Fig. 111.2 Activity profiles can vary in their timing, shape, and amplitude. In general, they differ greatly between workdays (top row) and work-free 
days (bottom row). Profiles were taken from our large actimetry database and represent averages over 23-60 days (work or work-free, respectively). 
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Some of these sleep measures and objective quality indicators can be 
subjectively assessed by questionnaires such as the Pittsburgh Sleep 
Quality Index (PSQI) [12], but most must be measured via PSG in 
the sleep laboratory. 

Importantly, both objective and subjective sleep quality are biased 
by the modern notion of what sleep should ‘look like, namely ef- 
ficient, consolidated, and, if possible, initiated 6-8 hours before 
one has to get up (for work). The more we find out about sleep in 
non-industrialized societies, that is, before the use of electrical light 
[13-17] or in people who have to live according to the natural photo- 
period [18], the more we may have to challenge our notions and cur- 
rent methods of measuring sleep quality. The fact that subjective and 
objective sleep quality are only poorly associated with each other 
[11, 19, 20] indicates how difficult and highly flawed our current 
definition of sleep quality is. 


Epidemiology of sleep in the real world 


Since 2000, we have been building a database on daily sleep behav- 
iour using the Munich ChronoType Questionnaire (MCTQ) [21]. 
By now, it contains more than a quarter million entries. The MCTQ 
asks participants to give the times when they go to bed, prepare for 
sleep, fall asleep, wake up, and get up [21]. Most importantly, these 
questions are asked separately for workdays and work-free days. The 
growing database allowed us to quantify the key aspects of human 
sleep-wake behaviour. We assessed the amount of sleep people get 
and when they get it, and how different aspects of sleep change with 
age [22], season [23, 24], and geographical location [25], even with 
daylight-saving time [23]. Our results suggest that, although people 
sleep just as long on work-free days as they did 10 years ago, they 
get continuously less sleep on workdays [26]. Also, throughout 
their school and work life, most people alternate between under- 
sleeping on workdays and over-sleeping on work-free days. One 
factor leading to this difference in sleep duration is a change in sleep 
timing from workdays to work-free days. This so-called social jetlag 
[27] is quantified as the difference between the social and the bio- 
logical ‘time zone’ The fact that >80% of the working population 
in our database needs an alarm clock to wake up on workdays [26] 
indicates that almost the entire working population experiences a 
strain on their circadian clock and sleep. 


Sleep timing 


Sleep times on workdays are predictably more synchronized among 
individuals than on work-free days, that is, they have a narrower 
distribution [mid-sleep on workdays (MSW); see Fig. 111.3] than 
on free days [mid-sleep on free day (MSF); see Fig. 111.3] (for day- 
specific differences in sleep timing, see also Fig. 111.5a). 

The internal circadian clocks of most of us have continuously 
delayed during industrialization (due to decreasing zeitgeber 
strengths), while social times, for example work or school start 
times, have remained relatively stable. We fall asleep much later than 
our rural ancestors and use alarm clocks to wake up, thereby accu- 
mulating a sleep debt over the work week, for which we try to com- 
pensate on weekends. 

For chronotype assessment, that is, how late or early one’s circadian 
clock is embedded into the light-dark cycle, sleep timing on work- 
free days is a better basis than sleep timing on workdays. However, 
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Fig. 111.3 Distribution of mid-sleep times in the MCTQ database. 

The narrower distribution shows the mid-sleep times on workdays 
(MSW: n = 197,829; mean = 3:19 + 1.13), and the later wider distribution 
represents mid-sleep times on free days (MSF: n = 200,260; mean = 4:47 
+ 1.34; excluding people who use alarm clocks on work-free days). The 
assessment of chronotype involves correcting MSF for oversleep (see text 
for details), resulting in the distribution in the middle (MSFsc: n = 181,600; 
mean = 4:19 + 1.27). 


since work-free days are still influenced by workdays in the form of 
catch-up-sleep, we correct MSF for ‘over-sleep’ (MSF,.; see https:// 
www.thewep.org/documentations/mctq for a comprehensive list of 
variables assessed or computed from the MCTQ). Note that MSF,.- 
based chronotype should only be calculated for people who do not 
use alarm clocks on work-free days. 

The MCTQ-assessed chronotype is not a scaled prefer- 
ence for doing things at specific times of day (as is the aim of the 
morningness-eveningness questionnaire developed in the 1970s) 
[28]. The MSF,.-time should be regarded as a surrogate for the en- 
trained phase of an individual's circadian clock. Validations with 
more objective measures of phase of entrainment, such as sleep-logs, 
actimetry, or measurements of the rise of melatonin in the evening, 
show good accordance [29-31] with the one-time assessment of 
chronotype (MSF,.). 


Sleep duration 


Most people accumulate a sleep debt on workdays. As outlined 
in the introduction, this is most probably due to the effects of our 
modern light environments on the circadian clock and fixed work 
schedules. Fig. 111.4 shows the distributions of sleep duration on 
work- and work-free days, with the former being over 1 hour shorter 
and just slightly narrower than its counterpart on work-free days. 
Fig. 111.5b shows that the day-specific sleep durations can, in some 
individuals, differ by >5 hours. 


Sleep quality 


As discussed, we still need an objective measure for sleep quality. 
However, apart from pathologies that are not directly sleep-related 
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Fig. 111.4 Distributions of sleep duration on workdays (n = 197,829; 
mean = 7.06 hours + 1.08 hours) and on work-free days (n = 200,260; 
mean = 8.12 hours + 1.22 hours). 


(apnoea, restless legs, chronic pain, etc.), chronic accumulation of 
sleep debt, the habitual use of alarm clocks, and constant attempts to 
sleep at the wrong circadian times must greatly reduce sleep quality. 
All these scenarios are aspects of social jet lag (Fig. 111.5a). This 
modern syndrome could turn out to be the most prevalent and 
costly high-risk behaviour of our times. With every hour of social jet 
lag, the chances of being overweight or obese increase by 33% [26], 
substantiating an association found already much earlier between 





sleep duration and metabolic problems [32]. Social jet lag is also 
associated with higher odds of being a smoker and correlates with 
alcohol and caffeine consumption [27], as well as with signs of de- 
pression such as appetite loss and feelings of sadness [33]. 


Sleep structure 


So far, there are no epidemiological data on the structure of sleep, 
since it is currently only accessible via measuring brain waves and 
other parameters in sleep laboratories using PSG. PSG recordings 
are subsequently analysed and ‘scored’ into different sleep stages. 
Sleep’s ‘architecture’ cycles through these different sleep stages ap- 
proximately every 90-150 minutes. This sleep-inherent ‘ultradian’ 
rhythmicity was first described by Dement and Kleitman [34]. To 
obtain epidemiological data on sleep structure, we recently explored 
how to extract this ultradian structure from activity. 





Activity monitoring for sleep-wake epidemiology 


The methods used in sleep laboratories (for example, PSG) usually 
obtain detailed information on individuals in clinical settings and 
are less suited for large-scale studies (too costly, time-consuming, 
and labour-intensive). Epidemiological sleep studies require simple 
and cost-effective methods to assess and monitor the desired vari- 
ables. They therefore often rely on self-reporting—either as one-off 
questionnaires (for example, the MCTQ; see Epidemiology of sleep 
in the real world, p. 1139) or by keeping sleep-logs. 

Another method for sleep monitoring that is becoming increas- 
ingly popular is the continuous measurement of physical activity or 
body movement via actimetry (or ‘actigraphy’). Body movements 
are recorded mostly with watch-like, wrist-worn devices. Since 
people move much more during their wake-time than during their 
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Fig. 111.5 Differences in sleep-wake behaviour between workdays and work-free days. (a) Social jetlag is defined by the differences in sleep timing 
between the two day types [27]; frequencies on the right of the stippled zero line concern people who sleep later on work-free days than on workdays, 
while those on the left of this line sleep later on workdays. (b) Similar differences exist between sleep duration on workdays and work-free days, with 
the majority sleeping longer on work-free days (right of the stippled zero line). 
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Fig. 111.6 Activity records can be used to infer on sleep-wake times. The top row shows five 40-day activity recordings from different individuals, 
and the bottom row shows the respective sleep times as bars [27]. The pale backgrounds indicate photoperiods, which were calculated based on the 
individuals’ locations (latitude and longitude) and recording dates. We selected these five examples to demonstrate how different the sleep-wake 
behaviour can be. The example in (a) shows a female with strict regularity in her relatively early daily sleep-wake schedules. The next pair of graphs 


in (b) is again a female and early chronotype whose schedules are far less regular (possibly because her partner is a late chronotype) 


[he central pair 


of graphs in (c) shows a male whose nocturnal sleep episodes are late, short, and irregular and who takes naps in the early evening. The next two 
examples in (d) and (e) show a male and a female, respectively, whose clocks—despite being visually unimpaired individuals—have difficulties to stably 





synchronize to the 24-hour rhythm of the environment. 


sleep (Fig. 111.2), one can estimate when and how long they sleep by 
the episodes of relatively low activity [35, 36], that is, rest-activity is 
used to assess sleep-wake (Fig. 111.6). 


Sleep timing and duration from actimetry 


The many different methods to assess sleep via actimetry are either 
published or remain black boxes built into proprietary software. 
Overall, their concordance with the gold standard in sleep monitoring 
(PSG) is reasonably high—at least in healthy sleepers [35, 36]—but 
they have also been validated in sleepers suffering from common 
sleep pathologies such as sleep apnoea (for example, [37-39]). Most 
methods perform well but tend to underestimate short awakenings 
within a sleep episode [wake after sleep onset (WASO)]. Detection 
generally becomes less reliable when the difference between wake- 
time and sleep-time activity is low, that is, when people are bedridden 
or suffer from motor disorders or highly interrupted sleep. 

For studies outside of laboratories and clinics, sleep detection via 
actimetry has many advantages despite providing much less infor- 
mation than PSG. Unlike self-reports, it is an objective measure of 
sleep timing and duration and produces—without burden—a de- 
tailed longitudinal record (over days and weeks) of an individual's 
rest-activity or sleep-wake behaviour. Long-term actimetry puts 
sleep into the context of the previous and following wake and sleep 
episode and thus allows, for example, teasing apart homeostatic 
and circadian influences on sleep (see Introduction, p. 1137). As 
such, long-term actimetry is superior to a single night in the sleep 
laboratory; it allows analysing the regularity of the sleep-wake 
pattern, identifying the daily phase and amplitude of general ac- 
tivity [for example, by a 24-hour cosine fit providing the centre of 
gravity (COG)], calculating the mid-points of sleep, or character- 
izing weekly patterns (for example, differences between work- and 
work-free days). 





Long-term actimetry also allows detecting more complex sleep- 
wake problems, disorders or pathologies that are only apparent in 
long-term recordings. A good example is the N24 disorder [40] 
where sighted individuals cannot synchronize (i.e. entrain) to the 
normal light-dark cycle and therefore drift through societal time 
(Fig. 111.6d and e). Shift-workers have a similar problem, but the 
influences are inversed; societal time (for example, work schedules) 
make these individuals drift through their own circadian time (Fig. 
111.7). Note how similar the rest-activity/sleep-wake patterns are 
in the individuals shown in Fig. 111.6d and e to those shown in 
Fig. 111.7d and e. 

To make use of this strength of actimetry, its recordings need to be 
long enough to reveal the important patterns and be representative 
of an individual's sleeping habits. Since the pattern and variability in 
people's sleeping habits are often not known beforehand, any initial 
actimetry recording should cover at least 14 days (including three 
weekends) and ideally aim for 6 weeks. With complex disorders and 
sleep schedules, these initial recordings need to be complemented by 
even longer recordings over months. 


Sleep structure from actimetry 


So far, research has usually used actimetry only to obtain informa- 
tion about the timing and duration of sleep episodes. However, new 
efforts are under way that allow extracting more information about 
the sleep episode itself, that is, its structure and ultradian cyclicity. 
This effort is based on the observation that body movements are part 
of sleep physiology and vary with sleep stages or sleep stage transi- 
tions (for example, [34, 41-43]). Many current commercial actimeters 
(notably not for use in clinical or research settings!) claim to inform 
consumers about their sleep depth based on their movements during 
sleep. Whereas these consumer products are currently not scientif- 
ically validated and their output needs to be treated with absolute 
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Fig. 111.7 Actimetry and the derived sleep times in five examples of shift-workers. As in Fig. 111.6, the top row shows the activity, and the bottom row 


the calculated sleep times. The grey squares indicate work times. 


caution [44-46], the principle on which they are based may well work. 
We ourselves have developed a method that reveals the rhythms in 
body movement during sleep in actimetry recordings—rhythms that 
are highly reflective of sleep cycles, as determined by clinical PSG [47]. 
These extended applications of actimetry in large epidemiological 
studies will provide valuable new insights into the structure of sleep 
and its relation to sleep timing, sleep duration, and sleep quality. 





Sleep-wake in psychiatric patients 


The association between sleep-wake behaviour and mental dis- 
orders is apparent in the standard criteria for the diagnosis of psy- 
chiatric disorders in DSM-5 and ICD-10 [48]. Sleep problems are 
commonly seen as symptoms or comorbidities of the psychiatric 
disorder itself. For example, one of the diagnostic criteria for a de- 
pressive episode are changes in sleep (hypersomnia or insomnia); 
for a manic episode, a decreased need for sleep; and the criteria 
for post-traumatic stress disorder include nightmares and difficulty 
sleeping. However, the association between sleep and psychiatric 
pathologies is highly complex with numerous feedbacks. The causal 
links in this network are still not well understood but are certainly 
not unidirectional. While psychiatric patients frequently develop 
sleep problems [49], sleep problems may trigger psychiatric dis- 
orders in the general population [50]. Furthermore, psychophar- 
macological treatments influence sleep both by improving [51] and 
by creating/increasing sleep problems [51, 52]. Sleep—or rather its 
deprivation—is even used as therapy in major depression [53]. But 
again, the role of sleep deprivation in depression is complex; on the 
one hand, isolated sleep deprivation may acutely reduce depression 
[54], possibly via changing the synaptic rearrangements that occur 
during sleep [55]; on the other hand, chronic sleep deprivation 
may increase the risk for developing a variety of psychiatric (and 
somatic) disorders [56]. 

Similar to the concept of sleep quality, the term sleep problems 
is used for many different symptoms. Here, we try to put some 
order into the different aspects of sleep in the context of psychiatry 


by separating the individual—though interdependent—aspects 
of sleep, namely timing, duration, structure, and quality (see also 
Introduction, p. 1137 and Fig. 111.1). 


Sleep timing 


Sleep timing, or chronotype, has often been associated with psychi- 
atric syndromes (for review, see [57]). Notably, many studies ask for 
sleep preferences, for example for morningness (MEQ; [28]), instead 
of actual sleep timing (for a discussion about different chronotyping 
methods, see [58]). Two studies using actual sleep timing found that 
the later people sleep, the more depressed they are [33, 59]. In add- 
ition, Levandovski and co-workers [33] showed an association with 
social jet lag, indicating that the clash between chronotype and social 
schedules, rather than chronotype/sleep timing per se, can lead to 
depressive symptoms. Whether a later chronotype predisposes to 
depression or depression goes along with a change in chronotype 
still has to be worked out (for an in-depth discussion, see [60]). The 
fact that advancing circadian phase (chronotype) is being used as a 
therapeutic approach in patients suffering from depression indicates 
that sleep timing is being regarded as a cause—at least in part—for 
developing depressive symptoms [61]. 

Other psychiatric syndromes, such as seasonal affective disorder 
(SAD) [62], schizophrenia [63, 64], obsessive-compulsive disorder 
[65, 66], and attention-deficit/hyperactivity disorder (ADHD) [67], 
have also been found to be associated with delayed sleep timing. It 
seems that most psychiatric, as well as neurodegenerative, disorders 
go along with changes in sleep timing [68-71]. 


Sleep duration 


The most obvious example of the relationship between psychiatric 
syndromes and sleep duration is bipolar disorder; during depres- 
sive episodes, most patients sleep longer, whereas during manic 
episodes, sleep is usually shortened [49]. Notably, shortened sleep 
during mania is not to be confused with insomnia, the inability to 
initiate or maintain sleep, but is due to an apparent reduction in 
sleep need, that is, patients feel awake and refreshed after as little as 
3 hours of sleep [49]. 
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Other mood disorders have also been associated with changes in 
sleep duration; patients suffering from SAD show a prolonged sleep 
duration during winter [72]—as, to some extent, is also true for the 
general population [23]; patients with non-seasonal unipolar depres- 
sion either sleep significantly longer (hypersomnia [73]) or shorter 
(hyposomnia [74]) than healthy controls. These changes in sleep 
duration are usually caused by the inability to initiate sleep and/or 
early morning awakenings. Another explanation for this condition 
is that patients adhere to habitual sleep times, even when their circa- 
dian timing changes. Some studies failed to show hypersomnia (or 
hyposomnia) in depressed patients in the sleep laboratory and con- 
cluded that these symptoms must be rather a subjective sleep com- 
plaint [75], but sleep measured in a one-night PSG in an artificial 
environment does not capture the actual sleep of patients at home. 

Our own research showed that adolescent patients with re- 
mitted depressive disorder still sleep longer, compared to healthy 
controls [60]. Schizophrenia is also associated with hyper- [64] or 
hyposomnia [76]. The latter was also found in post-traumatic stress 
disorder [77], alcoholism [78], autism [79], and generalized anxiety 
disorder [80]. 

The relationships between psychiatric syndromes and sleep dur- 
ation are all but straightforward. On the one hand, studies in the 
general population found sleep durations of under 5 hours increased 
the odds for a variety of psychiatric conditions: mood disorders such 
as depression [56, 81, 82]; anxiety disorders [56, 81]; obsessive- 
compulsive disorder [81]; and nicotine and alcohol dependence 
[56, 81]. On the other hand, sleep over 9 hours was associated with 
depression [82], anxiety disorders, social phobia, and alcohol de- 
pendence [81]. In children, both shortened and prolonged sleep in- 
creased the likelihood for ADHD [83, 84]. Although sleep duration 
differs significantly between workdays and work-free days [85], Park 
and colleagues [81] analysed the relationship of sleep duration and 
various psychiatric disorders only on weekdays, which predomin- 
antly reflects external (that is, social) time constraints. 


Sleep quality 


Reduced subjective sleep quality can include a variety of symptoms— 
from trouble initiating and maintaining sleep (insomnia) to early 
morning awakenings, excessive daytime tiredness, and the feeling 
of not being refreshed despite a sufficient sleep duration. All of 
these symptoms have been associated with a variety of psychiatric 
disorders [86], both as preceding symptoms and as comorbidities. 
Reduced subjective sleep quality is, for example, often associated 
with acute depression [87, 88] and is even found in patients with re- 
mitted depression [60] either as a residual symptom or as a predictor 
of relapse. Sleep disturbances can be predictors for the onset of major 
depression [89], and subjects complaining about insomnia have a 
4-fold higher chance of developing major depressive disorder [50]. 


Sleep structure 


The concept of sleep structure has been described (see Structure, 
p.1138). Due to the limited number of publications, we focus here 
on the main characteristics of PSG in psychiatric patients. The most 
obvious changes in PSG could be found in depressed patients. Many 
studies show a disturbed sleep consolidation, as well as changes in 
sleep depth and REM sleep pressure [90, 91]. Additionally, two meta- 
analyses found a reduced duration of SWS [90, 91], which remained 


unconfirmed in another study [92], which also found none of these 
symptoms in SAD. 

A recent meta-analysis on PSG and psychiatric disorders by 
Baglioni et al. [92] showed that beyond ADHD and SAD, almost 
every psychiatric disorder (including personality disorders, schizo- 
phrenia, and eating disorders) is associated with sleep alterations. 
Specific psychiatric disorders were not characterized by single PSG 
parameters, but rather a specific combination or profile of altered 
parameters. 





Primary prevention of sleep-wake disorders 


Sleep disorders, according to ICD-10, refer to hyposomnia, 
hypersomnia, delayed or advanced sleep timing, irregular sleep, and 
non-24-hour sleep. The aetiology of these may be as complex as sleep 
itself. This complexity includes the neuronal circuits and switches 
that initiate, maintain, and terminate sleep [2], as well as the cir- 
cadian [3] and homeostatic control [6, 93]; it also pertains to the 
many different functions that our brain performs during sleep such 
as synaptic reorganization [94], clearing of accumulated metabolites 
[95], consolidating memory [96], or immune functions [97]. This is 
a far from complete list of sleeps mechanistic gearbox that consti- 
tutes a breeding ground for sleep-wake disorders. But sleep can only 
be understood in context, which again constitutes a complex net- 
work of different levels. Many everyday life factors influence when, 
for how long, and how we sleep. Many of them are related to how we 
expose ourselves to light or what genetic chronotype we are: light 
exposure during the day—office worker vs farmer, commuting by 
bike or by tube; light exposure during the night—using intense, 
blue-rich light after sunset, sleeping in a darkened room; light ex- 
posure across the year— affected by season, longitude, timezone and 
‘time-meddlings’ like daylight saving time; being a victim of societal 
time—late-chronotype teenagers learning at 8 in the morning, late 
chronotypes working early shifts or early chronotypes working at 
night. Again, this is an incomplete list of context factors, which are 
fertile soil for developing sleep-wake disorders. While the mechan- 
istic complexity of sleep control does not readily lend itself to pri- 
mary prevention, the contextual influences on sleep do. Here, we 
keep to the structure of this chapter and address this issue separately 
for timing, duration, structure, and quality. 


Sleep timing 


Sleep timing is by far the most important and amenable toe- 
hold for preventive and therapeutic sleep medicine. Most sleep 
disturbances—apart from the comorbidities of psychiatric or other 
pathologies (pain, apnoea, etc.)—are due to living against one’s in- 
ternal circadian clock. This modern, highly prevalent syndrome [26] 
has many names: circadian disruption, circadian misalignment, cir- 
cadian stress, circadian strain, or social jet lag (for details, see [98]). 
Its most extreme version is called shift-work. Sleeping and waking at 
the wrong circadian times has numerous health consequences, many 
of which are probably directly or indirectly related to sleep disturb- 
ances. Realigning sleep times with the temporal window provided 
by the circadian clock will therefore ameliorate sleep disturbances 
and thereby decrease health deficits. A recent epidemiological study 
has shown that late chronotypes have higher odds for developing 
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type 2 diabetes if they work ‘normal day jobs, compared to working 
permanent night shifts [99]. Other studies have shown how sleep 
and health in shift-workers profit by realigning sleep and circadian 
time [100] or by assigning shifts according to chronotype [101]. 

In a majority of the population in urban/industrialized regions, 
circadian clocks are delayed while work schedules have hardly 
changed. Most people are therefore, figuratively speaking, ‘per- 
manent shift-workers. Thus, any measure that decreases the re- 
sulting social jet lag is sleep problem-preventive. Such measures can 
change either the internal or the external time. The internal phase 
can be advanced by strengthening the zeitgeber [more light during 
the day and less (blue-containing) light during the night] or by en- 
suring an asymmetry in light exposure (more light before internal 
midday and less thereafter). Another point of action is delaying 
work and school start times or making them flexible. 


Sleep duration 


Circadian misalignment and social jet lag always also affect sleep 
duration. Using an alarm clock and being unable to fall asleep early 
enough (due to the wake maintenance zone [102]) lead to chronic 
sleep deprivation during the work week and to over-sleep on week- 
ends. This is also, though of a very different nature, a problem in 
shift-workers; shift-specific sleep problems are commonly due to 
forced awakenings. While late chronotypes are awoken by the ex- 
ternal alarm clock before early shifts, early chronotypes are awoken 
too early by their internal (circadian) clocks when they try to get 
an extended sleep after night shifts [103]. Thus, changing sleep 
timing—as described—can, in most cases, also prevent insufficient 
sleep in terms of sleep duration. 


Sleep quality 


Although we argued that fully consolidated sleep episodes are a 
modern concept, frequent interruptions are certainly a sign of poor 
sleep. Sleep quality and sleep interruptions change with age [104]. 
Alzheimer’s patients often wake up during the night and are sleepy 
during the day. This is partly due to drastically reduced light-dark 
cycles in these patients, and again this syndrome is treatable with 
increased zeitgeber strength [105]. 

Low subjective sleep quality often is due to a wrong conception of 
what sleep should look like. For example, early chronotypes living in 
a predominantly late society often perceive their early tiredness and 
awakening as signs of a sleep disturbance, but only because they are 
not aware of their chronotype and the large inter-individual vari- 
ability in circadian sleep timing. Once educated, society still chal- 
lenges compliance with their own rhythms, but this then becomes 
a matter of choice, rather than a perceived illness. In other cases, 
simply by advising ‘patients’ to keep sleep diaries, which they can 
visualize, rectifies their misconceptions about sleep. 


Sleep structure 


Abnormalities in sleep structure can be used in diagnosis and treat- 
ment of sleep disorders arising from both the mechanistic and the 
contextual levels. So far, sleep structure is best detected by PSG in 
the sleep laboratory or at home, but we are confident that actimetry 
can be used in future to derive the ultradian structure of sleep and 
thereby contribute to detection and prevention of sleep disorders. 
Sleep stages and structure depend on the time of day [3], so that 
attempts to sleep within the temporal window provided by the 


circadian programme should be part of sleep hygiene and preventive 
sleep medicine. 


Conclusions 


While we begin to understand the link between sleep timing and 
somatic problems (for example, metabolic syndrome), we need 
much more research to understand the association between sleep 
and psychiatric disorders. We will have to resolve several chicken- 
and-egg problems, and we have to look into the contexts of light, 
activity, food, and sleep within our weekly structures. We are opti- 
mistic that a combination of long-term activity recordings and the 
increasing possibilities to extrac t sleep information from activity 
will be a rich source for elucidating the different aetiologies of sleep 
pathologies, as well as the relationships between sleep and patholo- 
gies per se. 
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As with the vast majority of biological traits in humans, sleep has 
an important genetic component. Sleep-related traits and sleep dis- 
orders can be considered complex or multifactorial conditions, that 
is, the aetiology and the biological basis are a combined effect of 
genetic factors, environmental non-genetic components, and their 
interaction. Since the stimulus of the Human Genome Project, bio- 
medical research has made real progress in characterizing the genetic 
architecture and molecular pathways underlying human complex 
diseases [1]. Establishing that a given trait is heritable implies that 
underlying genetic factors play a role in determining the phenotype. 
Despite the established genetic heritability of many sleep-related 
phenotypes, only a small number of validated genetic variants have 
been discovered so far. Thus, compared to some other fields, under- 
standing the genetic architecture of sleep-related phenotypes through 
genome-wide association studies (GWAS) is in a relatively nascent 
stage. As methodologies for analysing these data continue to develop, 
research in these emerging genetic areas will improve our current 
knowledge. Given the known heritability of many traits of sleep and 
circadian rhythm in humans and also of specific sleep disorders, this 
is currently a major opportunity for the sleep research community. 
In addition to the characterization of genetic factors underlying 
sleep disorders, a number of neuropsychiatric conditions have been 
extensively studied in regard to their genetic architecture [2]. It is also 
well described how sleep disorders are associated with many neuro- 
psychiatric illnesses, and this relationship suggests that there are 
shared genetic components explaining this association. Only recently 
have advances in genomic technologies supported the development 
of knowledge to interpret these relationships biologically. Therefore, it 
is expected that a better understanding of the genetic contributions to 
sleep disorders and psychiatric conditions will reveal both common 
shared and unique molecular pathways to these traits that will help 
to inform the aetiology of these morbidities, thereby supporting the 
development of new diagnostic and therapeutic approaches. 

















The duration of sleep varies in individuals in the general popula- 
tion, and a number of studies have shown that the prevalence of 





self-reported short sleep duration (<6 hours per night) ranges from 
14% to 35% worldwide [3]. Given the known physiological, be- 
havioural, metabolic, and molecular consequences of lack of sleep, 
understanding the biological basis of short sleep duration is funda- 
mental. Although behaviourally induced short sleep also occurs as a 
consequence of commitments of modern life, an important genetic 
contribution has been identified. Studies analysing the differences in 
sleep duration between monozygotic and dizygotic twins estimated 
the heritability of sleep duration in the order of 31-44% [4, 5]. 

Common and rare genetic variants have also been described as 
associated with this phenotype. Although an initial GWAS based on 
the Framingham cohort did not find any genome-wide significant 
single-nucleotide polymorphisms (SNPs) associated with sleep dur- 
ation [6], another larger GWAS in a population with European an- 
cestry found a significant association in an intronic variant of ABCC9 
in the discovery phase, but not on replication [7]. This gene encodes 
one of the 17 transmembrane domains of the pore-forming subunit 
of an adenosine triphosphate (ATP)-sensitive potassium channel 
(Karp) that serves as a sensor of cellular metabolism. However, the 
association between this variant and sleep duration was also not 
significant in other studies [8-10]. On the other hand, significant 
associations with other variants in ABCC9 and sleep duration [11] 
and depressive symptoms have been found [10]. Knocking down the 
expression of the homologue of this gene in Drosophila was associ- 
ated with reduced sleep amounts at night [7]. Whether variations in 
ABCC9 are associated with sleep duration or with a shared mech- 
anism between sleep duration and depression symptoms, this gene 
likely plays a role in the control of sleep/wake, suggesting that fur- 
ther investigation is needed. 

In a GWAS in a Finnish cohort, no genome-wide significant as- 
sociations with sleep duration were found [9]. However, some of 
the suggestive associations were replicated in a follow-up sample. 
Among the most interesting findings, the authors found an associ- 
ation in the KLF6 locus. This gene encodes Kruppel-like factor 6, a 
transcription factor that functions as a tumour suppressor. Higher 
expression of KLF6 in circulating mononuclear lymphocytes was re- 
lated to shorter sleep duration, and its expression in these cells was 
increased with experimental sleep restriction, suggesting this gene 
as a modulator of the sleep/wake regulation. 

A study from the CHARGE consortium used self-reported sleep 
duration and genotyping data from 18 community-based cohorts 


and found two genome-wide significant associations in European 
individuals. One of the significant associations was in the PAX8 
locus, which replicated in an independent African-American 
sample [8]. This gene encodes a thyroid-specific transcription factor 
involved in thyroid development and function [12], which could be 
potentially involved in the relationship between sleep regulation and 
metabolism. 

Sequencing studies in individuals with extreme phenotypes in re- 
gard to sleep duration were also conducted. He et al. investigated 
two individuals who slept for 6 hours with no evidence of daytime 
impairments and identified a mutation in exon 5 of the BHLEH41 
(class E basic helix-loop-helix protein 41) gene, also known as 
DEC2. This mutation results in a proline-to-arginine substitution in 
the protein and had its sleep-affecting properties validated experi- 
mentally in Drosophila and mice [13]. A further study sequenced 
DEC2 in two other individuals with particular sleep phenotypes and 
identified two new mutations in this gene [14]. In one subject, who 
was a member of a dizygotic twin pair, the twin with the mutation 
slept 2 hours less per day and had substantially less performance 
lapses during prolonged sleep deprivation than the other twin. The 
other mutation was found in three unrelated individuals, but no ob- 
vious effect of this variant on sleep duration. Interestingly, the muta- 
tions that were linked to differences in sleep showed clear functional 
consequences in model organisms, by affecting key mechanisms of 
the molecular circadian clock [14]. Based on these findings, it is ex- 
pected that other variants of DEC2 and other genes that affect the 
molecular circadian clock can result in short sleep. 

Due to the difficulty in defining a reliable and objective pheno- 
type that represents the habitual sleep duration, most of the studies 
rely only on noisy self-report measures, and conclusive studies are 
still not available. More objective assessment of sleep duration in- 
cludes the use of actigraphy-based rest-activity patterns and GWAS, 
with these objective parameters only conducted recently [15], as de- 
scribed in more detail later in this chapter. Nevertheless, the progress 
so far indicates that some candidate genes deserve further functional 
characterization of their roles in sleep/wake regulation. 


Genetics of insomnia 


Similar to sleep duration, there is a lack of comprehensive studies 
on the genetics of insomnia, partly because of the uncertainty in 
defining the appropriate phenotype. Although insomnia is a preva- 
lent condition with defined clinical diagnostic criteria, it is difficult 
to establish a common biological mechanism, potentially due to the 
diverse underlying causes. Nevertheless, family and twin studies 
have been conducted, utilizing self-report measures of insomnia 
traits, showing that a moderate degree of the variability of these 
phenotypes can be explained by genetic factors. 

Patients whose insomnia began in childhood reported a positive 
family history of sleep complaints at a higher rate than those with 
adult-onset insomnia [16]. In a large Australian twin study, addi- 
tive genetic influences were found for sleep quality, initial insomnia, 
sleep latency, ‘anxious insomnia, and ‘depressed insomnia, with 
heritability ranging from 32% to 44% [4]. In a twin study from the 
Vietnam Era Twin Registry [17], estimates ranged from 21% to 42% 
for trouble falling or staying asleep, waking up several times per night, 
and waking up feeling tired and worn out. In a recent longitudinal 
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study in a representative sample of twins, there was moderate herit- 
ability in the univariate analysis (22-25%), but when accounting for 
the longitudinal measurement, heritability increased substantially, 
especially in females (59%), compared to males (38%) [18]. 

A GWAS for insomnia, defined using a series of questions about 
sleep patterns, was conducted in Korea, and an association was 
found with a variant in the ROR1 locus that was not genome-wide 
significant [19]. This gene encodes the receptor tyrosine kinase-like 
orphan receptor 1, a protein that modulates synapse formation and 
was indicated as a potential candidate explaining genetic variability 
in this insomnia phenotype. Another GWAS in an Australian twin 
cohort did not find any genome-wide significant associations [20]. 
‘The strongest associations were between self-reported sleep latency 
and the CACNAIC locus, a gene that encodes the calcium voltage- 
gated channel subunit alpha 1 C and was repeatedly associated with 
bipolar disorder and schizophrenia [19] (see Chapters 59 and 60). 
This finding did not replicate in a second cohort but was then subse- 
quently replicated in a different study [10]. More recently, genome- 
wide associated variants in the RBFOX3 locus were reported with 
self-reported sleep latency [21]. This gene encodes the RNA- 
binding protein—Fox-1 homolog 3—and is believed to have a role 
in neuron-specific alternative splicing and to be co-expressed with 
genes involved with calcium channel activity and signalling of the 
neurotransmitter gamma-aminobutyric acid (GABA) [21]. Thus, it 
is of potential interest as a candidate in the aetiology of insomnia. 

The fact that insomnia is mainly based on self-reported symptoms 
and clinical evaluation could partially explain the lack of reprodu- 
cibility of studies aiming to find a strong biological basis of this 
disease. In this sense, objective measurements of insomnia-related 
traits, such as actigraphic recordings, can provide a more robust 
characterization of some of the phenotypes that compose the con- 
dition. Recently, the first GWAS on actigraphic sleep phenotypes 
found a genome-wide significant association between a variant in 
the UFL1 locus and actigraphy-derived sleep efficiency on week 
days [15]. This gene encodes the ubiquitin-fold modifier 1 specific 
ligase 1, a protein that participates in the regulation of apoptosis and 
vesicle trafficking in the endoplasmic reticulum. Variants in other 
genes (DMRT1 and CSNK2A1) were also associated with sleep la- 
tency and deserve further investigation. DMRT1 encodes the double 
sex and Mab-3-related transcription factor 1, a protein involved in 
sexual differentiation and embryonic development, and CSNK2A1 
encodes the alpha subunit of casein kinase II, involved in the regu- 
lation of circadian rhythms. Even though studies can achieve more 
biologically meaningful phenotypes by using objective assessment, 
it is important to note that there is often a discrepancy between these 
measures and self-reports of sleep in patients with insomnia [22]. 





Genetics of narcolepsy and hypersomnia 


Narcolepsy is one of the most successful examples for identifying 
genetic factors in sleep disorders. Based on evidence from studies 
in mice and dogs, narcolepsy is caused by a disruption in the orexin 
signalling system, including selective loss of orexin-producing 
neurons. Although only a single patient with a mutation in the orexin 
receptor gene was described [23], it is established that patients with 
narcolepsy present with very low levels of this neuromodulator in 
the cerebrospinal fluid [24]. 








SECTION 17 Sleep-wake disorders 


Early studies showed a 10-40 times increase in the risk of narco- 
lepsy for someone with a first-degree relative with narcolepsy when 
compared to the general population [25]. In accordance with the evi- 
dence from familial aggregation, there is an established relationship 
between specific human leucocyte antigen (HLA) alleles and narco- 
lepsy [26]. HLA encodes the proteins that compose the major histo- 
compatibility complex class II expressed in immune cells, which are 
involved in presenting foreign peptides to receptors on T cells. In 
many populations, the DQB1*0602 allele is strongly associated with 
narcolepsy (odds ratio approximately 250), particularly in cases that 
also present with cataplexy, also known as type 1 narcolepsy [27]. 

Regardless of its strong association with type 1 narcolepsy, the 
DQB1*0602 allele is frequent in the general population, with its 
prevalence ranging in different ethnic groups from 12% in Japanese 
to 38% in African-Americans [28]. Thus, it is commonly found in 
many individuals without narcolepsy. In this sense, it could be that 
the presence of DQB1*0602 could make individuals more suscep- 
tible to environmental insults, in a gene-environment interaction 
fashion. In addition, multiple gene variants other than DQB1*0602 
may be involved in narcolepsy. 

In order to identify additional variants, a series of GWAS where 
both cases and controls were positive for the DQB1*0602 allele were 
conducted. Three variants within the T cell receptor alpha (TRCA) 
locus were significantly associated with narcolepsy and were repli- 
cated in independent Caucasian and Asian samples [29], as well as in 
a study in Chinese patients [30]. This association within TRCA was 
also replicated in a study aimed to evaluate variants relevant to the 
immune system, which also identified significantly associated vari- 
ants in other genes such as cathepsin H (CTSH) and tumour necrosis 
factor (ligand) super-family member 4 (TNFSF4) [31]. 

Further studies investigating DQB1*0602-positive individuals 
found additional loci associated with narcolepsy. A significant asso- 
ciation with a variant in the promoter region of CCRI, a gene that 
encodes the chemokine (C-C motif) receptor 1, was reported and 
replicated in an independent cohort, as well as found to be asso- 
ciated with CCR1 gene expression and lower migration indexes in 
monocytes with a CCR1 ligand [32]. Moreover, a single nucleotide 
variant in the purinergic receptor subtype P2Y11 (P2RY11) gene 
locus was also associated with narcolepsy and presented functional 
evidence of decrease in gene expression, as well as increased sensi- 
tivity of CD8+ T cells to ATP-induced cell death [33]. Interestingly, 
the P2RY11 gene locus is within a region where other genes, such as 
PPAN, EIF3G, and DNMT1, are present, so the signal detected in this 
GWAS could be related to variants of the other genes in this region. 
Further studies replicated these signals and identified a strong signal 
in a haplotype involving P2RY11/EIF3G in European and Chinese 
cohorts [34]. Mutations in the DNA methyltransferase 1 (DNMT1) 
gene in individuals with autosomal dominant cerebellar ataxia, 
deafness, and narcolepsy supported the role of this region [35]. This 
gene encodes an enzyme responsible for maintaining methylation 
patterns in development and is required for the differentiation of 
CD4+ cells into T regulatory cells. Additionally, a genome-wide 
copy number variation (CNV) analysis in a Japanese cohort iden- 
tified an enrichment of rare and large CNVs in different immune 
response-related genes, as well as duplications in the Parkinson's 
disease protein 2 (PARK2) gene in patients, compared to controls 
[36]. The protein encoded by this gene also participates in antigen 


processing and presentation, supporting the existence of an auto- 
immune component against orexin-producing neurons [37]. 

In regard to environmental influences and supporting the role of 
the immune system in the physiopathology of narcolepsy, a strong 
relationship between this sleep disorder and upper airway winter in- 
fections was described. There was a significant increase in the onset 
of narcolepsy after the 2009 H1N1 influenza pandemic [38], and a 
follow-up GWAS revealed additional loci in the HLA region in cases 
with narcolepsy with onset after the 2009 pandemic [39]. 

Despite efforts in the characterization of the genetic basis of type 
1 narcolepsy (narcolepsy with cataplexy), a growing number of 
studies have been investigating the role of genetics in other forms 
of excessive somnolence such as essential hypersomnia syndrome 
(EHS). This condition differs from narcolepsy mainly because of the 
absence of cataplexy, but the sleep-related symptoms are indistin- 
guishable [40]. The DQB1*0602 allele is also associated with EHS, 
although not as strongly as narcolepsy with cataplexy [41]. Similarly, 
variants in the TCRA locus were described as associated with EHS 
in DQB1*0602-positive individuals [42]. A more recent GWAS in 
DQB1*0602-negative individuals with EHS reported three new loci 
associated with the condition (NCKAP5, SPRED1, and CRAT), all 
of which contain genes previously related to other neuropsychiatric 
diseases [43]. NCKAP5 encodes NCK-associated protein 5 that still 
has unknown function. SPRED1 encodes sprouty-related EVH1 
domain-containing 1, a protein phosphorylated in response to 
growth factors. CRAT encodes carnitine O-acetyltransferase, a key 
enzyme in -oxidation of fatty acids. These genes were described as 
candidates in EHS but still need replication and functional follow-up. 


Genetics of chronotype and circadian 





rhythm disorders 


One of the mechanisms through with an organism responds to tem- 
poral changes in the environment as a consequence of the light-dark 
cycle is through regulation of the ‘biological clock. This system, also 
known as the molecular circadian clock, consists of autoregulatory 
feedback loops involving the transcription, translation, and degrad- 
ation of proteins encoded by the clock genes. This set of genes in- 
cludes PER1, PER2, and PER3, and CRY1 and CRY2, as well as CK16, 
CK1le, CLOCK, NPAS1, NPAS2, DEC2, BMAL1, and BMAL2 [44]. 
Given the existence of a comprehensive molecular pathway regu- 
lating the generation of circadian rhythms, it is expected that vari- 
ation in genes in this system could affect circadian behaviours and 
disorders. 

Chronotype is known as the behavioural outcome that results 
from the generation of endogenous biological rhythms and inter- 
action with the environment. Individuals can be classified as an 
evening type (that is, ‘night owls’), morning type (that is, ‘larks’), or 
intermediate. Several twin and family studies have estimated that 
chronotype is a moderately heritable trait, with genetic factors po- 
tentially explaining as high as 50% of the variability in this phenotype 
[45, 46]. This estimate can also be affected by the type of environ- 
ment in which the individual lives, that is, urban or rural [47]. 

Linkage studies of extended families with circadian rhythm sleep 
disorders, primarily advanced sleep phase syndrome (ASPS), al- 
lowed the identification of specific rare variants that segregate 


with the disorder. A serine-to-glycine mutation in the clock gene 
PER2 was reported in a family [48], and different mutations in the 
CK16 gene were identified as the causal variant in other two studies 
[49, 50], supporting the role of the molecular circadian clock in 
ASPS. Interestingly, mouse knockouts for these genes also replicate 
the phenotypes seen in humans. 

Common genetic variants in other genes of the molecular circa- 
dian clock, such as CLOCK, PER1, PER2, and PER3, are also associ- 
ated with chronotype and other circadian rhythm disorders [51-53], 
but not replicated in other studies [54, 55]. One of the most studied 
variants is a variable number of tandem repeats polymorphism in 
PER3 resulting in four or five repeats of a 54-bp motif in exon 18 
[56]. This polymorphism was associated with delayed sleep phase 
syndrome and diurnal preference [57, 58]. The association was rep- 
licated in Brazil [59] and South Africa [60], but not in Colombia [61] 
or in Norway [62]. 

Given the evidence provided by genetic associations studies, as 
well as the functional nature of this polymorphism, a more in-depth 
characterization of sleep and circadian phenotypes was performed 
to evaluate the role of these variants of PER3. Although no signifi- 
cant differences were found in circadian phenotypes, differences 
in sleep and wake behaviour in response to sleep deprivation were 
found between individuals carrying PER3“* and PER3°*” genotypes 
[63]. PER3°” carriers showed increased slow-wave sleep, as well as 
electroencephalographic markers of slow wave activity and greater 
decrement in cognitive performance, in response to sleep depriv- 
ation. These findings was complemented by another study suggesting 
that PER3“° is associated with greater behavioural resiliency to 
sleep restriction when compared to PER3“* [64], and PER”® carriers 
showed a more pronounced impact of sleep deprivation on sustained 
attention, with increased sleepiness, than PER3“” carriers [65]. 

Genome-wide approaches were also used to potentially identify 
genetic loci related to circadian phenotypes. In an early family- 
based study in the Framingham cohort, a linkage peak close to the 
CSNK2A2 gene was found with usual bedtime. This gene encodes 
the catalytic unit of casein kinase 2 that was described as an im- 
portant component in the molecular circadian clock [6]. More re- 
cently, a large GWAS in >89,000 costumers of European ancestry of 
the personal genetics company 23andMe, Inc. identified 15 genome- 
wide significant loci associated with self-reported morningness. 
This study also found an enrichment of associated variants of 
genes in circadian and phototransduction pathways [66]. In add- 
ition, another large GWAS in the UK Biobank cohort in >100,000 
individuals of European ancestry validated eight loci and reported 
a consistent effect of all the 15 loci reported by Hu et al. in 2016. 
A meta-analysis combining results of both studies reinforced the 
role of three loci: PER3, VIP and TOX3 [67]. VIP encodes vasoactive 
intestinal peptide, a key molecule in the central nervous system. 
Intracerebroventricular administration of this peptide was found to 
increase the rapid eye movement sleep time in rabbit. TOX3 encodes 
TOX high mobility group box family member 3 and was previously 
associated with restless legs syndrome (RLS) [68]. 


Genetics of restless legs syndrome 


RLS is another successful example of how genetics has helped to iden- 
tify the pathophysiology of a sleep-related condition. This disorder 
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has been diagnosed primarily using solid symptom-based criteria 
[69] that allowed the identification and validation of a number of 
genetic risk factors, establishing it as a relevant disorder. Positive 
family history [70, 71], twin studies [72, 73], and complex segrega- 
tion analyses [74, 75] supported the existence of an important gen- 
etic component in RLS. Most studies reported the heritability of RLS 
to vary between 40% and 65% [76]. 

Linkage analyses within large-family pedigrees were able to 
identify significant genetic loci linked with RLS on chromosomes 
12q22-23.3, 14q13-22, 9p24-22, 2q33, 20p13, 16p12.1, 4q25-26, 
17p11-13, 19p1 [76], and 13q32.3-33.2 [77]. Follow-up candidate 
region case-control association studies in these regions were able to 
find significant variants associated with RLS and potential candi- 
date genes. Significant associations with single-nucleotide variants 
in NOS1 [78] and PTPRD [79] were described and replicated in in- 
dependent cohorts. NOS1 encodes nitric oxide synthase 1 and has 
been associated with pain perception and sleep-wake control [80]. 
PTPRD encodes protein tyrosine phosphatase receptor type delta, 
involved in long-term potentiation in memory formation and ab- 
normal axon targeting to motoneurons during development in mice 
[81, 82]. 

GWAS were also conducted, and the first two reports were pub- 
lished simultaneously and independently in Caucasian popula- 
tions. One study in German and French Canadian cases, who were 
diagnosed by face-to-face interview with expert clinicians, identi- 
fied and replicated variants associated with RLS in three different 
loci: MEIS1, BTBD9, and MAP2K5/SKORK1 [83]. The other study of 
cases from Iceland and the United States used self-assessment ques- 
tionnaires that contained the essential diagnostic criteria, as well as 
leg actigraphy to assess periodic limb movements during sleep, also 
identified variants in BTBD9 associated with RLS. Interestingly, the 
association was only found in cases that also showed periodic limb 
movements during sleep, and this variant was also associated with 
ferritin levels [84]. The associations between variants in MEIS1, 
BTBD9, and MAP2K5/SKORK1 have been confirmed in further in- 
dependent investigations [76]. Also, an excess of loss-of-function al- 
leles in MEIS1 was found in RLS cases, compared to controls [85], 
supporting the role of these genetic loci in RLS. The functional 
roles of MEIS1 (limb axis formation and neuronal differentiation), 
SKORI (neuronal differentiation), MAP2K5 (muscle cell differenti- 
ation), and BTBD9 (iron and ferritin metabolism and sleep fragmen- 
tation) corroborate the associations found in variants within these 
genes and the pathophysiology of RLS and might suggest the role of 
developmental abnormality. However, better characterization of the 
molecular mechanisms that integrate these genes in the syndrome in 
humans is still warranted. 

An exome sequencing study identified a rare variant in PCDHA3 
segregating in a family with multiple cases of RLS, and two add- 
itional rare missense variants in unrelated cases [86]. PCDHA3 is 
expressed in neurons, and its product is present at synaptic junctions 
in neural cell-cell interaction [87]. This is another plausible candi- 
date for the genetics of RLS. 


Genetics of obstructive sleep apnoea 


Obstructive sleep apnoea (OSA) is a highly prevalent complex con- 
dition characterized by partial or complete blockage of the airway 
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during sleep that leads to a repetitive reduction in blood oxygen sat- 
uration [88]. Some of the major risk factors include older age, male 
gender, and obesity [89], although not all cases of the disease can 
be explained by these factors. This suggests a high degree of hetero- 
geneity and indicates an interesting phenotype to conduct genetic 
investigations. 

An initial study of a single family with a high prevalence of OSA 
suggested for the first time the existence of an important genetic 
component [90]. Further investigation reported that some of the 
most common symptoms of OSA also aggregate in families [91], 
along with more refined measurements of apnoeas and hypopneas 
during sleep [92, 93]. Addressing the confounding effect of obesity, 
which is a major risk factor for OSA, the Cleveland Family Study 
found an increased relative risk of OSA in first-degree family mem- 
bers, even after controlling for body mass index [94]. This approach 
was supported by another investigation that identified familial ag- 
gregation of specific craniofacial features in less obese cases of OSA 
[95], suggesting that factors other than obesity also play an im- 
portant role in the genetics of this disorder. In addition to familial 
aggregation, the heritability of OSA ranges between 21% and 84%, 
depending on the phenotype used to define the disorder and the 
population studied [96, 97]. 

There has been, however, little progress in identifying relevant 
and causal genetic variants accounting for the variability in OSA. 
Due to the heterogeneity of the phenotype, as well as technical and 
statistical limitations, linkage and association studies failed to report 
consistent results [98-100]. This led to meta-analyses of candidate 
genes failing to find evidence that the majority of the reported asso- 
ciations were replicable [98, 101], except for a polymorphism in the 
promoter of the TNFA gene. Given the participation of inflamma- 
tory response in the physiopathology of OSA [102], tumour necrosis 
factor alpha is an important candidate contributing to key conse- 
quences of OSA. A study in paediatric OSA patients showed higher 
tumour necrosis factor plasma levels, particularly in carriers of the 
promoter polymorphism in the TNFA gene [103]. In addition, these 
patients also presented significantly increased symptoms of exces- 
sive daytime sleepiness, suggesting that this variant might explain, 
in part, this particular phenotype in paediatric OSA. 

A genetic association study with variants across 2000 candi- 
date genes relevant to heart, lung, blood, and sleep disorders in the 
Cleveland Family Study and the Sleep Heart Health Study found a 
genome-wide significant association between the apnoea—hypopnea 
index and a variant in the LPARI gene and another in the PTGER3 
gene [104]. The LPARI gene encodes lysophosphatidic acid re- 
ceptor I, a pro-inflammatory protein expressed in the developing 
cerebral cortex [105] and associated with changes in behaviour and 
craniofacial abnormalities in mice knockouts [106]. The product of 
the PTGER3 gene—the prostaglandin E2 receptor—was described 
as a modulator of neurotransmitter release in central and peripheral 
tissues. Both genes seem to participate in important molecular path- 
ways that could partly explain the aetiology of OSA. 

Only recently has a large-scale GWAS on OSA quantitative traits 
been performed, in Hispanic/Latino cohorts. The authors identi- 
fied two genome-wide significant candidate regions associated with 
the apnoea—hypopnea index and respiratory event duration in the 
GPR83 and C6ORF183/CCDC162P loci, respectively. GPR83 en- 
codes a G-protein receptor expressed in different brain regions ofim- 
portance to OSA, and linked to body temperature, metabolism, and 


inflammatory response regulation. C6ORF183 and CCDC162P are 
pseudogenes, and this locus was associated with red blood cell traits. 
Additional loci involving other plausible genes were also identified, 
supporting the involvement of inflammatory and hypoxia signalling 
pathways, as well as sleep-related gene sets [107]. However, these as- 
sociations have still not been independently replicated. 

Given the complexity and different proposed mechanisms and 
pathways contributing to OSA, the genetic study of the major risk or 
protective factors of this disorder can be an interesting approach and 
has not been explored so far. For example, there are multiple obesity 
and body mass index genetic loci identified by GWAS [108], but 
the genetic correlation between obesity and OSA is yet to be deter- 
mined. Other opportunities for investigation involve the genetics of 
major components of OSA pathophysiology such as fat distribution 
[109], soft tissue volume [110], craniofacial structures [111], and 
hypoxic response [112], as well as other respiratory physiological 
parameters without genetic characterization to date. Additionally, 
understanding the role of ancestry and how it affects the identifica- 
tion of genetic risk factors of OSA has already been proposed [113], 
but not explored at its full potential. Although it seems a challenging 
task trying to dissect the complex genetic architecture of OSA, it cer- 
tainly offers an exciting opportunity for future research. 


Conclusions 


The genetics of sleep disorders is an emerging and rapidly evolving 
field in sleep medicine. Some disorders, particularly narcolepsy 
and RLS, present more consistent results; however, the majority of 
sleep-related traits still lack a robust characterization of their gen- 
etic basis. Sleep-related traits are heterogenous phenotypes, and a 
complex genetic architecture is expected, requiring more robust 
and objective ways to characterize intermediate phenotypes, as well 
as larger sample sizes to detect the expected low effect of variants 
explaining the variability in these traits. In addition, most of the 
identified genetic regions lack a comprehensive functional charac- 
terization of the gene and sometimes rely only on the effect of the 
variant on the closest gene, and not within a genomic context. Given 
the relationships between sleep and neuropsychiatric disorders, and 
the promising opportunity of more refined genetic and phenotypic 
characterizations, understanding the genetic basis of sleep and wake 
regulation could partially inform the aetiology of other relevant 
mental and neurological conditions for public health awareness. 
While understanding the genetic aspects of sleep/wake disorders 
imposes a challenge, it is an example of the genetic study of complex 
and multifactorial traits in humans. 
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The understanding of sleep and its disorders has vastly improved 
in recent years, aided significantly by technological and methodo- 
logical advancements in neuroimaging techniques, which allowed 
for more precise neuroanatomical insight into the activity of the 
sleeping brain. Indeed, the brain does not ‘turn off’ during sleep and, 
in fact, remains active throughout the sleep cycle. While it is difficult 
to obtain a direct measure of neuronal activity, neuroimaging tech- 
niques have allowed sleep researchers to safely detect its correlates in 
human subjects during sleep and sleep/wake transitions. 

The widely available research methodology include magnetic 
resonance imaging (MRI) and functional MRI (fMRI), magnetic 
resonance spectroscopy (MRS), positron emission tomography 
(PET), and single-photon emission computed tomography (SPET or 
SPECT), described in more detail in Chapter 12. When used in sleep 
research, these neuroimaging techniques can be powerful tools for 
illuminating the complex neuronal activities that shape brain func- 
tion throughout the different stages of sleep. 

Neuroimaging is particularly useful for studying sleep disorders. 
For example, insomnia has been associated with an excess of ac- 
tivity in brain regions responsible for processing emotion, as well as 
with a marked lack of activation in networks normally responsible 
for the inhibitory regulation of these regions, and these differences 
tend to be most pronounced during transitions from wakefulness 
to sleep. Although less consistent than functional imaging studies, 
neuroimaging of brain anatomy in insomnia suggests alterations in 
the compositional integrity of discrete brain structures such as the 
hippocampus and prefrontal cortex, as well as the rostral anterior 
cingulate cortex. Taken together, neuroimaging techniques have 
proven to be a powerful tool in understanding the neural mechan- 
isms underlying sleep and pathologies leading to sleep disorders. 
This chapter will provide an overview of these imaging methods and 
discuss findings they have provided pertaining to sleep, insomnia, 
and other sleep-wake disorders. While this chapter focuses select- 
ively on neurological disorders of sleep, we encourage the reader 
to consult dedicated works for a comprehensive overview of sleep 
disorders [1]. 


As described in detail in Chapter 109, the current predominant 
model of human sleep divides this physiological state into four 
stages, based on the characteristics of brain activity, muscle tone, 
and eye movements, with the main differentiation being between 
rapid eye movement (REM) sleep from non-REM (NREM) sleep, 
which is composed of stage N1, N2, and N3 sleep. Stage N3 sleep 
was previously further divided into stages 3 and 4 of sleep, but these 
were consolidated due to a lack of evidence for a neurophysiological 
difference between them. During a night of sleep, the brain con- 
tinuously cycles through these stages in periods of approximately 
90 minutes. 

N1 sleep marks the transition from wakefulness to sleep. This 
stage of sleep, which normally constitutes 5-10% of total sleep time, 
shows a slower pattern of brainwave activity, compared to the awake 
state, when measured by electroencephalography (EEG). Stage N1 
is followed by stage N2 where two distinct microarchitectural EEG 
patterns—K-complexes and spindles—are visible. Slow waves, also 
referred to as K-complexes during stage N2, are typified by a large 
(>75 uV), low-frequency (0.5-2 Hz) negative peak, followed imme- 
diately by a lower-frequency positive inflection. Spindles are short 
bursts of approximately 11-15 Hz activity lasting approximately 0.5 
seconds. Spindles are thought to reflect brain processes involved in 
gating out of sensory stimuli that might otherwise wake the sleeper, 
as well as other cognitive functions such as memory consolidation. 
N2 sleep generally comprises 45-55% of total sleep. The next stage 
of NREM sleep N3 is also known as slow-wave sleep (SWS) due 
to an increased prevalence of slow waves on the EEG. Commonly 
known as ‘deep sleep, N3 occurs mostly during the first half of a full 
night’s sleep and constitutes 15-25% of total sleep time. REM sleep, 
characterized by rapid eye movements, is the stage of sleep most 
associated with dreaming. Hallmarks of REM sleep include muscle 
atonia, as well as an EEG profile more similar to the awake state 
than that of NREM sleep. For this reason, REM sleep is also known 
as ‘paradoxical sleep. REM sleep usually accounts for 20-25% of 
total sleep and occurs mostly during the second half of a normal 
night’s sleep. 


Imaging in NREM sleep 


During NREM sleep, cortical neuron activity transitions into a pat- 
tern of slow oscillations, alternating between brief bursts of neuronal 
firing (‘up’ states), followed by long intervals of hyperpolarization 
(dowr states). This oscillation forms the basis of cortical synchron- 
ization in NREM sleep, which is observed on EEG as spindles and 
slow waves. As EEG-indexed synchronization increases, down-state 
hyperpolarization is also associated with decreases in regional cere- 
bral blood flow (rCBF) using PET, with gradual decreases during 
the transition from wakefulness to N1 and N2, reaching a minimum 
during N3 [2]. 

The induction of NREM sleep seems to be facilitated by a de- 
creased activity in the brainstem tegmentum and its ascending 
pathways, as evidenced by human PET studies showing decreased 
activity in the pontine tegmentum during light NREM extending to 
the mesencephalic tegmentum during SWS [3]. Sleep spindles are 
thalamocortical in origin, generated by the activity of GABAergic 
thalamic reticular neurons and are reflected by rCBF decreases in 
thalamic areas during NREM sleep [3]. 

Moreover, the cerebral cortex also shows SWS deactivation, al- 
beit with larger rCBF decreases in frontal and parietal cortices, 
with less deactivation observed in the primary cortices [3]. The ob- 
served decreases in rCBF are likely due to a larger net contribution 
of the down state to the overall PET signal, which is averaged across 
periods of several minutes. However, recent studies utilizing com- 
bined EEG and fMRI have confirmed a transient increase of brain 
responses in cortical and subcortical brain areas during slow waves 
and spindles [4, 5]. 

Studies of functional connectivity in NREM sleep have shown 
attenuated connectivity of the default mode network (DMN). The 
DMN, a functional neural network highly active in the absence of 
any stimuli or task, shows uncoupling, particularly between its an- 
terior (medial prefrontal) and posterior (precuneus, posterior cin- 
gulate) nodes, during deep sleep [6]. In general, there is evidence 
that transitioning from light to deep NREM sleep is associated with 
uncoupling of functional neural networks in favour of an increase in 
local connectivity [7]. This is in line with recent consciousness the- 
ories, according to which decreasing levels of functional connect- 
ivity are the physical correlates of decreasing levels of consciousness. 


Imaging in REM sleep 


In contrast with NREM sleep, REM sleep is defined by a marked de- 
synchronization of brain waves, more closely resembling the awake 
state in general, with some areas showing increased activity, com- 
pared to wakefulness, and decreased activity in others. 

During REM sleep, rCBF is increased specifically in limbic and 
paralimbic areas, the pontine tegmentum, thalamic nuclei, amyg- 
daloid complex, anterior cingulate, orbitofrontal and insular cor- 
tices, and hippocampal formation [3, 8]. In contrast, the middle and 
inferior prefrontal gyri, posterior cingulate, and precuneus tend to 
show decreases in rCBF during REM [3, 8]. 

Induction of REM sleep is also believed to originate in the brain- 
stem, as evidenced by increased activation of the mesopontine, 
which is thought to activate the thalamus and, in turn, the cortex 
[9]. Neurons in the pons generate ponto-geniculo-occipital waves, 
which can be recorded at different levels of this pathway: the pons, 
the lateral geniculate bodies, the occipital cortex, and limbic areas 
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[10, 11]. A time course analysis of BOLD responses during REM 
sleep showed activation of the pons, the thalamus, and the visual 
cortex, in association with REM [12]. PET-indexed rCBF analysis 
showed coupling of the lateral geniculate bodies, the occipital cortex, 
and REM during REM sleep, but not in wakefulness [13]. 

REM sleep has also been observed to exhibit a unique pattern of 
functional connectivity, exemplified by a curious dissociation be- 
tween striate and extrastriate cortical areas, which is opposite to 
that seen in wakefulness [14]. The DMN, previously mentioned to 
show uncoupling in NREM sleep, demonstrates recoupling again 
during REM sleep between the anterior and posterior nodes of the 
DMN, similar to the awake state [15]. However, other changes in 
DMN functional connectivity occur during REM sleep such as an 
anti-correlation between the DMN and the thalamus, as well as sen- 
sorimotor areas [15]. 

Dreams, a hallmark of REM sleep, may also be related to distinct 
neurophysiological activity. Increased activation of the amygdala 
during REM is likely related to the strong emotional content experi- 
enced in dreams, and deactivation of areas associated with working 
memory, such as the dorsolateral prefrontal cortex, might explain the 
discontinuity commonly reported by dreamers [8]. The vivid visual 
and auditory content of dreams has been linked to hyperperfusion 
within occipital and temporal lobes, while the feeling of movement 
may be explained by increased activity within premotor and motor 
cortices [3, 14]. These movements are not acted out by the sleeper 
in normal conditions due to muscle atonia, another characteristic 
of REM sleep. 


Imaging in insomnia 


Insomnia, defined by difficulty in sleep initiation, continuity, or 
quality, which leads to significant daytime impairments, has only 
relatively recently been considered from a neurological point of view. 
Current theories ascribe the aetiology of insomnia to alterations in 
neural control over sleep-wake regulation systems. An imbalance in 
these systems may be due to increased activity in networks contrib- 
uting to arousal (hyperarousal), or hypofunction of those respon- 
sible for initiating or maintaining sleep [16]. 

A summary table of functional neuroimaging studies of insomnia 
can be found in Table 113.1. Compared to normal sleepers, indi- 
viduals suffering from insomnia show a decrease in rCBF in the 
basal ganglia, medial frontal, parietal, and occipital cortices, and 
this decrease is most prominent in NREM sleep [17]. Interestingly, 
basal ganglia activity seems most sensitive to cognitive behavioural 
therapy for insomnia (CBTi), with one study showing a 24% increase 
in basal ganglia rCBF, coupled with a 43% improvement in average 
sleep latency, following CBTi [18]. A landmark PET imaging study 
in primary insomnia (that is, in the absence of major comorbidities) 
showed a lack of reduction in brain glucose metabolism from wake- 
fulness to sleep, lending credence to the hyperarousal theory of in- 
somnia where brain metabolism maintains a profile more similar to 
the awake state in insomnia, compared to good sleepers [19]. Specific 
brain areas that were affected by this failure to reduce glucose me- 
tabolism during wake-sleep transition included the ascending re- 
ticular activating system, thalamus, hypothalamus, hippocampus, 
amygdala, and anterior cingulate, insular, and medial prefrontal cor- 
tices. Conversely, during wakefulness, these areas—particularly the 
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Table 113.1 Functional neuroimaging studies of insomnia 






Smith et al. 
2002 [17] 


Nofzinger 
et al. 2004 
[19] 


Smith et al. 
2005 [18] 


Nofzinger 
et al. 2006 
[21] 


Altena et al. 
2008 [22] 


Huang et 
al. 2012 
[45] 


Drummond 
et al. 2013 
[23] 


Baglioni 
et al. 2014 
[26] 


Li et al. 
2014 [44] 


Nie etal. 
2015 [43] 


Kay et al. 
2016 [20] 


Neuroimaging | Sample size 
(number of 
females) 


%™Tc-HMPAO 5 (5) 4 (4) 

SPECT 

EFDGPET Te 20(13) 

*”™Tc-HMPAO 4(4) | None 

SPECT 

8E-FDGPET  15(7) None 

fMRI(15T) 21 12(9) 
(17) 

fMRI (30T) 10(5) 10(5) 

fMRI(3.0T) 25 25(12) 
(12) 

fMRI (30T) 22 38(21) 
(15) 

fMRI (30T)  15(8) 15 

(8) 

fMRI (BOT) 42  42(24) 
(27) 

8E-FDGPET 40 44(25) 


(24) 


Mean age in years + SD 


rics rcs 


B78 WLI | 4S VY) 
Bd Oo S26 384 
ASE None 
36.9+10.5 None 
61462 60+ 8.2 
Se A S52 
Pasa |Z24a7 l 
AO a eS) 
PBE 41.3489 
49.24 + 4914+ 
12.26 10.20 
37/10) 38411 





PI diagnosis | Pl 
and 


duration 
assessment 


ICSD, PSG 26 mo 
DSM-IV,PSG 21 mo 
See [17] See [17] 
DSM-IV, PSG 21 mo 
See [88]. PSG 22.5 years 


DSM-IV, PSG imr 





DSISD, >3 mo 
actigraphy, 

PSG 

Research >1 year 
diagnostic (10.3 + 
criteriaforID 10.9 years) 
(not Pl) [88]. 

PSG, ISI, PSQI 

DSM-IV, PSQI nr. 


[eSB >2mo 


DSM-IV, PSG m 


History of 
pharmacological 
treatment 


Off-sleep aids for 
24 weeks, off SSRIs 
for 21 year 


PI using med. were 
excluded 


See [17] 


PI using med. were 
excluded 


Off med. for 22 mo 


Medication-naive 


PI using med. were 
excluded 


Off psychoactive 
med. for >2 weeks 


Medication-naive 


Off psychoactive 
med. for =2 weeks 


Off psychoactive 
med. for >2 weeks 


Main findings in PI, compared 
to GS (significance level) 





Hypoperfusion of basal ganglia 
and other regions during NREM 
sleep (P <0.05 uncorr,) 


Smaller reduction in glucose 
metabolism during transition 
to NREM sleep; prefrontal 
hypoactivation during wake (P 
<0.001, corr.) 


Partial re-establishment of 
activation in basal ganglia after BT 
(P <0.05, uncorr.) 


Correlation between WASO 

and thalamocortical activation, 
including pontine tegmentum (P 
<0.05, corr.) 


Prefrontal hypoactivation during 
verbal fluency task, partially 
restored after CBT (P <0.05, 
uncorr. 








Altered connectivity between 
amygdala and other regions, 
particularly the premotor cortex; 
amygdala-premotor connectivity 
was correlated with PSQI (P <0.05, 
uncorr. 





During cognitive task, reduced 
activation in task-relevant areas 
and reduced deactivation of 

default mode regions (P <0.05, 
corr.) 


D patients, compared to GS, 
have heightened amygdala 
responses to insomnia-related 
stimuli (P <0.001, uncorr.). 
Habituation of amygdala 
responses was observed only in 
GS, but not in patients with ID 











Resting state connectivity of 
superior parietal lobe was 
decreased with superior frontal 
gyrus, and increased with bilateral 
anterior and posterior cingulate 
(P <0.05 uncorr.) 


Decreased functional connectivity 
between medial prefrontal and 
right medial temporal lobes, and 
between left medial temporal 
lobe and left inferior parietal 
cortex (P <0.05, corr.) 


Group-by-state interactions in 
relative glucose metabolism 
showing impaired disengagement 
of left fronto-parietal, precuneus/ 
posterior cingulate and fusiform/ 
lingual gyri during NREM sleep, or 
alternatively impaired engagement 
of these regions during 
wakefulness (P <0.05, corr.) 





Note. corr: corrected; uncorr.: uncorrected; n.r.: not reported; SD: standard deviation; Pl: primary insomnia; GS: good sleeper controls; ID: insomnia disorder; NREM: non-rapid-eye 
movement; BT: behaviour therapy; CBT: cognitive behavioural therapy; PSQI: Pittsburgh Sleep Quality Index; PSG: polysomnography; WASO: wake after sleep onset; med.: medication; 
SSRI: selective serotonin reuptake inhibitor; DSM-IV: Diagnostic and Statistical Manual of Mental Disorders, fourth edition; ICSD: International Classification of Sleep Disorders; 

DSISD: Duke Structured Interview for Sleep Disorders; mo: month. 
Updated from [89]. 
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Fig. 113.1 NREM sleep-wake differences in brain glucose metabolism—primary insomnia sufferers compared to good sleepers. Insomniacs showed 
smaller differences in PET-indexed glucose metabolism, compared to good sleepers, in (a) the right precuneus/posterior cingulate cortex, (b) the left 
middle frontal gyrus, (c) the left inferior/superior parietal lobules, (d) the left lingual/fusiform/occipital gyri, and (e) the right lingual gyrus. 


Adapted from Sleep, 39(10), Kay DB, Karim HT, Soehner AM, et al., Sleep-Wake Differences in Relative Regional Cerebral Metabolic Rate for Glucose among Patients with 
Insomnia Compared with Good Sleepers, pp. 1779-1794, Copyright (2016), with permission from Oxford University Press. 


prefrontal cortex—showed relatively reduced glucose metabolism 
in insomnia sufferers, compared to good sleepers. While this initial 
study utilized a small sample size, a replication study comparing 44 
primary insomnia sufferers with 40 good sleepers showed insomnia 
to exhibit smaller sleep-wake differences in glucose metabolism in 
brain regions involved in cognition (prefrontal), self-referential pro- 
cesses (precuneus/posterior cingulate), and emotion (prefrontal, fu- 
siform/lingual gyri) during NREM sleep. These differences can be 
seen in Fig. 113.1. Moreover, consistent with previous findings, acti- 
vation of these areas was impaired during wakefulness in insomnia, 
compared to good sleepers [20]. Another PET study of insomnia 
sufferers found that measures of WASO (wake after sleep onset) 
positively correlated with metabolism in the pontine tegmentum 
and in thalamocortical networks in a frontal, anterior temporal, and 
anterior cingulate distribution [21]. 

Neuroimaging studies investigating the waking state and cogni- 
tion in insomnia have generally focused on f{MRI-indexed activa- 
tion of specific brain areas or networks, compared to healthy good 
sleepers. Insomnia was associated with lower prefrontal cortex acti- 
vation during a verbal fluency task, compared to controls, and this 
decreased activation was partially restored after CBTi [22]. A dif- 
ferent study showed that insomnia sufferers failed to decrease DMN 
activation during a working memory task, whereas controls showed 
attenuation of the DMN in favour of the activation of more task- 
specific brain areas [23]. This inability to switch neural activation 
patterns to those required for specific tasks may explain subjective 
complaints of diminished cognitive performance in insomnia. 
Additionally, DMN activation has also been linked to rumination 
in other disorders, and thus failure to attenuate this network may 
also provide a biological basis for cognitive rumination associated 
with insomnia [24, 25]. An {MRI-based investigation into amygdala 
activation in insomnia found heightened activation in response to 
insomnia-related stimuli, compared to controls, but this difference 
was not observed when insomnia-unrelated stimuli were tested, 
supporting the use of insomnia-specific cognitive restructuring that 
is routinely performed during CBTi [26]. 

MRS-based investigations in insomnia have been less con- 
sistent. Some studies have shown lower relative concentrations 
of brain GABA in insomnia, compared to controls, both globally 
[27] and localized to the occipital and anterior cingulate [28]; 
however, a later study showed an increase in occipital GABA 


concentrations [29]. A more recent study failed to replicate these 
results, showing no difference in GABA levels between insomnia 
sufferers and controls in the areas of interest (anterior cingulate 
cortex and dorsolateral prefrontal cortex) [30]. While the direction 
of effect is inconsistent, possibly due to different times of MRS data 
acquisition (morning vs bedtime), dysregulation of GABA neuro- 
transmission may be a potential target for both identification and 
treatment of insomnia and thus warrants further study. An MRS 
study of phosphocreatine revealed lower levels in grey matter of 
insomniacs in comparison to controls. Given the fact that lower 
levels of this metabolite are markers of an increased energy de- 
mand, this finding provides further evidence for the hyperarousal 
theory of insomnia [31]. 

Structural MRI investigations (with volumetry and VBM ana- 
lyses) in insomnia have also been inconsistent, with some ob- 
servations failing replication and others not surviving statistical 
corrections. A summary table of structural neuroimaging studies of 
insomnia can be found in Table 113.2. There is some evidence for 
reduced hippocampal volume in insomnia, but these results are not 
universal [32-35]. In one study, hippocampal volume was shown 
to be negatively correlated with insomnia duration and arousal in- 
dices; however, no significant differences in hippocampal volume 
were observed between insomnia sufferers and controls [34]. A dif- 
ferent small-sample study did report significant deficits in bilateral 
hippocampal volume in insomnia [32]. This was later corroborated 
by a different group using a larger sample size that further showed 
a negative correlation between CA1 volume and sleep quality [36]; 
the combined volume of CA3, CA4, and the dentate gyrus was also 
negatively correlated with cognitive performances in verbal fluency, 
verbal memory, and verbal information processing tasks [36]. Other 
areas that seem to be affected in insomniacs are the rostral anterior 
cingulate cortex, found to be increased in volume [37], the prefrontal 
(dorsolateral, orbitofrontal, medial frontal), precentral, superior, 
and middle temporal cortices, the precuneus, and the cerebellum, 
found to display decreased grey matter volume in insomniacs [38- 
40]. The pineal gland has also been a focus of neuroanatomical re- 
search due to its involvement in regulation of the circadian rhythm 
via melatonin production and release. One study revealed volume 
reduction of the pineal gland in insomniacs, compared to controls, 
with pineal gland volume also negatively correlating with age in in- 
somnia, but not in controls [41]. 
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Fig. 113.2 Structural covariance of cortical thickness—good sleepers (GS), compared to individuals with persistent insomnia symptoms (PIS). Insomnia 
sufferers showed a marked reduction in the spatial extent of cortical thickness correlations with regions of the default-mode network, that is, (a) the 

medial frontal cortex (mFC) and (b) the precuneus (PC). 
Adapted from Sleep, 39(1), Suh S, Kim H, Dang-Vu TT, et al., Cortical Thinning and Altered Cortico-Cortical Structural Covariance of the Default Mode Network in Patients with 
Persistent Insomnia Symptoms, pp. 161-171, Copyright (2016), with permission from Oxford University Press. 


Table 113.2 Structural neuroimaging studies of insomnia 


Riemann et 
al. 2007 [32] 


Winkelman 
et al. 2008 
[27] 


Winkelman 
et al. 2010 
[33] 


Altena et al. 
2010 [39] 


Plante et al. 
2012 [28 


Morgan et al. 


2012 [29 


Noh et al. 
2012 [34 





Neuroimaging | Sample 


technique 


MRI (1.5 T) 


1H-MRS (4T) 


MRI (3 T) 


MRI (1.5 T) 


1H-MRS (4T) 


1H-MRS (4T) 


MRI (1.5 T) 


size 


(number of 


24 
(17) 


20 
(12) 


16 
(10) 


20 
(18) 


IG) 


15 (6) 


13 (9) 


20 
(12) 


17(9) 


20 
(18) 


Mean age in years + SD 


8 (5) 


48.4 +163 4634143 
S EE] 37.6445 
39.3 (8.7) 38.8 (5.3) 
60.3 (6.0) 60.2 + 8.4 
343+83 341499 
39) ae ©) 216 Y 
50.8+108  504+411.7 


PI diagnosis and 
assessment 


DSM-IV, PSQI 


DSM-IV, PSQI, PSG 


DSM-IV, ISI, PSQI, 
actigraphy 


DSM-IV 


DSM-IV, PSQI, ISI, 
actigraphy 


DSM-IV, PSQI, ISI, 
PSG 





CSD, PSG 


PI duration 


11.6+ 
8.9 years 


26 mo 


26 months 


EE 
15.8 years 


21 year 


21 year 


7.6 + 6.1 years 


History of 
pharmacological 
treatment 


Off medications 
>2 weeks 


Off sleep aids 
>1 month 


Off psychoactive 
medications 
>3 months 


Off sleeping 
medications 
>2 months 


Off psychoactive 
medications >2 
weeks 


Off psychoactive 
medications 
>3 months 


No hypnotic med. 


for >1 month 


Main findings in 
PI, compared to GS 
(significance level) 





Reduced hippocampal 
volumes bilaterally (P 
<0.05, uncorr.) 


Average brain GABA levels 
were nearly 30% lower in 
PI. GABA levels negatively 
correlated with WASO (P 
<0.05) 


No differences in 
hippocampal volume 
between groups 


Smaller volume of 

grey matter in the left 
orbitofrontal cortex, 
bilateral precuneus (P <0.05 
corr.). Insomnia severity 
negatively correlated with 
eft orbitofrontal cortex grey 
matter volume 


Lower GABA levels in 
he occipital cortex and 
anterior cingulate (P <0.05) 


ean occipital GABA level 

was 12% higher in Pl. GABA 
evels correlated negatively 
with WASO (P <0.05) 


No diff. in hippocampal 
volume. Hippocampal 
volume negatively 
correlated with arousal 
index (P <0.05 uncorr.) 
and insomnia duration (P 
<0.001 uncorr.) 





Table 113.2 Continued 


Study 


Spiegelhalder MRI (3 T) 
et al. 2013 


[35] 


Winkelman MRI (3T) 
et al. 2013 


(Study 1) [37] 


Winkelman 
et al. 2013 
(Study 2) [37] 


MRI (3 T) 


Harper etal. *'P-MRS (4 T) 


2013 [31] 


Joo etal, MRI (1.5 T) 


2013 [38] 


Bumb etal. MRI (3T) 


2014 [41] 


Joo etal. MRI (1.5 T) 


2014 [36 


Zhao et al. MRI (3 T) 


2015 [42 


MRI (1.5 T) 











Spiegelhalder MRS (3 T) 
et al. 2016 
[30] 


size 


(number of 


Neuroimaging | Sample 


females) 


28 
(18) 


20 
(10) 


(25) 


(25) 


(30) 


(22) 


20 
(12) 


38 
(21) 


15 (6) 


20 
(12) 


27 
(23) 


27 
(16) 


30 
(28) 


35 
(26) 


40 
(16) 


20 
(12) 


Mean age in years + SD 


43.7 + 14.2 


SIET 


SUS 3 95) 


RE SA 


Bye) as Tes) 


43+7A4 


OEG 


39.3 + 8.6 


Il 23) ae 3 (07/ 


42.7 +13.4 


Ca PSS 


39.6 + 8.9 


Je D3 


34.1+9.9 


37.6447 


51.7454 


le 121 


50.4 + 7.1 


34.9 + 10.7 


47.93 + 6.54 


44.1 + 10.6 


CHAPTER 113 Multimodal imaging of sleep-wake disorders 


PI diagnosis and 
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B' sleep 
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insomnia 
symptoms, 
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difficulty to initiate 
or maintain sleep, 
early morning 
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non-restorative 
sleep. PSQI, PSG 
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PI duration 
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26 months 


>6 months 


7646.1 years 
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21 month 
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Main findings in 
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(significance level) 
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med. for 22 weeks volume, no diff. in grey and 


Off psychoactive 
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=| © 





Off psychoactive 
med. for 
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No history of 
antidepressants, 
hypnotic agents, 


white matter concentration 
P <0.05 corr. and P <0.001 
uncorr., respectively) 


ncreased rostral ACC 
volume, correlated 
positively with sleep 

onset latency and WASO, 
negatively with sleep 
efficiency (P <0.05 uncorr.) 





ncreased rostral ACC 


ed. for >2 weeks volume. Right ACC volume 


correlated with sleep onset 
atency (P <0.05 uncorr.) 


Lower phosphocreatine in 
grey matter (P <0.05, corr.) 


Reduced grey matter 
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dorsolateral and medial 
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Off psychoactive 
medications >2 
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Drug-naive 


Not on sleep or 
antidepressant 

medications at 
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study 


Off psychoactive 
medications >2 
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superior and middle 
emporal gyri, and 
cerebellum (P <0.001 
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Decrease in pineal gland 
volume (P <0.001) 


PI showed bilateral atrophy 
across all hippocampal 
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ncreased structural 
covariance between 
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Decreased structural 
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No difference in GABA 
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Note. corr: corrected; uncorr.: uncorrected; n.r.: not reported; SD: standard deviation; Pl: primary insomnia; GS: good sleeper controls; PSQI: Pittsburgh Sleep Quality Index; 
PSG: polysomnography; WASO: wake after sleep onset; ACC: anterior cingulate cortex; med.: medication; SSRI: selective serotonin reuptake inhibitor; DSM-IV: Diagnostic and 
Statistical Manual of Mental Disorders, fourth edition; ICSD: International Classification of Sleep Disorders. 


Updated from [89]. 





SECTION 17 Sleep-wake disorders 


Finally, a few studies investigated brain connectivity in insomnia. 
A recent structural MRI study found decreased cortical thickness 
correlation (that is, structural covariance) between anterior and 
posterior regions of the DMN in individuals with persistent in- 
somnia symptoms (see Fig. 113.2), suggesting decreased connect- 
ivity within the DMN, and this decrease was negatively correlated 
with sleep quality [40]. This contrasts with other data showing in- 
creased structural covariance between sensory and motor cortices 
in insomnia [42]. Resting-state fMRI studies showed that insomnia 
was associated with disruption of medial prefrontal cortex func- 
tional connectivity with the right medial temporal cortex and left 
medial temporal lobe functional connectivity with the left inferior 
parietal cortex [43], as well as superior parietal lobe functional con- 
nectivity with the dorsolateral prefrontal cortex [44]. In line with 
the concept of emotional dysregulation in insomnia, resting-state 
fMRI data also showed increased functional connectivity between 
the amygdala and other regions such as sensorimotor cortices [45]. 

Overall, neuroimaging studies of insomnia have been mainly con- 
ducted in individuals with primary insomnia and gave further sup- 
port to the hyperarousal theory—reduced deactivation during the 
sleep-wake transition [19], difficulty deactivating irrelevant cogni- 
tive processes [23], enhanced activity and connectivity of threat and 
emotional neuronal circuitry [26, 45], and reduced concentration 
of inhibitory neurotransmitters [28] all contribute to the incapacity 
to modulate levels of cortical arousal across the sleep-wake cycle. 
Recent data also suggest complex patterns of altered neural con- 
nectivity in insomnia. The significance of these abnormalities in 
structural and functional connectivity remains unclear, and further 
studies are warranted to confirm and clarify the relevance of these 
findings for the pathophysiology of insomnia. 





Imaging in other sleep disorders 


Narcolepsy 


Narcolepsy is a disorder characterized by irresistible bouts of sleep 
throughout the day, often associated with REM-related features such 
as sleep paralysis, cataplexy, and hallucinations. Type 1 narcolepsy is 
characterized by cataplectic episodes and decreased cerebrospinal 
fluid levels of hypocretin. In the absence of these criteria, narco- 
lepsy is classified as type 2. Significant DTI-indexed differences were 
observed between these two subtypes of narcolepsy, whereas no 
differences were observed between type 2 and healthy controls, sug- 
gesting the two types may result from different pathological mech- 
anisms, with type 2 being a milder form with no apparent structural 
brain pathology [46]. Therefore, the bulk of neuroimaging research 
in narcoleptic patients focuses on type 1. 

A meta-analysis of VBM in MRI studies of narcolepsy found se- 
lective grey matter loss in multiple brain areas, thought to be related 
to a damaged orexin/hypocretin pathway and emotional regulation 
system [47]. Unsurprisingly, multiple studies found alterations in 
the hypothalamus where orexinergic neurons are located [48], as 
well as the fronto-temporal cortex [47]. Further MRI analysis found 
a reduction in amygdalar volume, possibly serving as a mechanism 
for dysfunctional emotional regulation in the disorder [49]. 

Two fMRI studies sought to elucidate the nature of altered re- 
sponse to affective stimuli in narcolepsy via the presentation of 
humourous stimuli. Interestingly, while both studies observed 


increased amygdalar responses in patients, compared to healthy 
controls, observations in the hypothalamus were inconsistent. One 
study showed a decrease in hypothalamic response in response to 
affective stimuli [50], whereas the other study showed an increase in 
hypothalamic response, with the exception of one participant who 
experienced a cataplectic attack during the study and showed dra- 
matic reductions in hypothalamic activity [51]. While these results 
seem to suggest an overdriven emotional network in narcolepsy, the 
role of the hypothalamus is not clear. The inconsistency in these re- 
sults may be due, in part, to the relatively small sample sizes utilized 
in these studies. 


Restless legs syndrome 


Restless legs syndrome (RLS) is characterized by an unpleasant and 
uncomfortable feeling in the legs, accompanied by the urge to move 
them in order to relieve this sensation. It occurs during the evening 
and at night, when sitting or lying down, and sometimes interferes 
with sleep onset. 

The mechanism underlying RLS pathogenesis is hypothesized 
to involve inhibition of adrenergic, opiate, and dopaminergic 
descending inhibitory pathways [52]. This is supported by the ob- 
servation that dopamine antagonists exacerbate RLS and dopamine 
agonists relieve symptoms [53]. 

Striatal dopamine transporter (DAT) and D2 receptor 
dysregulation in RLS has been extensively investigated using 
SPECT, but the results have been contradictory, pointing towards 
dysregulation, rather than a net increase or decrease of receptors 
[54, 55]. 

Similarly, PET studies using C-raclopride have found conflicting 
results on striatal D2 receptor binding, with differences across 
studies attributed to drug-naive patients used in some studies, while 
others assessed patients on medication which can downregulate D2 
receptors [56]. Opioid PET studies using C-diprenorphine showed 
no difference between RLS patients and controls; however, a rela- 
tionship between RLS severity and decreased opioid binding was 
found, possibly due to competitive endogenous opioid release in re- 
sponse to pain [57]. 

Structural MRI studies have shown evidence of alterations in 
pulvinar grey matter in RLS, compared to controls, though it is un- 
clear whether these alterations are a cause or a consequence of the 
syndrome [58]. A study using VBM found grey matter decreases 
in the primary sensorimotor cortex [59], and a DTI study revealed 
white matter alterations in this same area, as well as in the thal- 
amus [60]; however, these anatomical deviations, compared to con- 
trols, were not significant when drug-naive subjects were included 
(61, 62]. 

Functional MRI studies during resting state showed reduced 
thalamic connectivity with the right parahippocampal gyrus, right 
precuneus, right precentral gyrus, and bilateral lingual gyrus, as 
well as increased connectivity with the temporal and other areas. 
Interestingly, there was a negative correlation between RLS severity 
and right parahippocampal gyrus connectivity with the thalamus, 
suggesting dysfunction at the level of somatosensory information 
processing [63]. 

MRS studies of RLS point towards thalamic dysfunction, with one 
study showing decreased N-acetyl aspartate (NAA) concentrations 
and NAA/creatine (Cr) ratios in the medial thalamus of RLS parti- 
cipants, compared to controls, indicative of neuronal damage, but 


without showing fMRI or DTI-indexed alterations [64]. Another 
study found increased glutamate—glutamine/creatine ratios in the 
thalamic region of RLS patients, compared to controls, and inter- 
estingly, this ratio correlated with WASO [65], supporting the in- 
volvement of a glutamatergic arousal system in RLS. GABA was also 
investigated, with no difference in levels between RLS patients and 
controls. However, a positive correlation was found between thal- 
amic GABA levels and RLS severity, as well as a negative correlation 
between cerebellar GABA and this same measure [66]. This suggests 
a role for the thalamus and the cerebellum in the modulation of RLS 
intensity. 


REM sleep behaviour disorder 


REM sleep behaviour disorder (RBD) is a parasomnia character- 
ized by movements or vocalizations which occur during REM sleep 
and are consistent with enactment of dream content. This disorder 
is often idiopathic but can also be comorbid with other neurological 
disorders or related to pharmacological effects (for example, anti- 
depressants). The majority of idiopathic RBD patients will even- 
tually develop neurodegenerative alpha-synucleinopathies such as 
Parkinson's disease (PD), multiple system atrophy, and Lewy body 
dementia [67]. 

A study attempting to elucidate the pathways involved in human 
RBD has employed SPECT, combined with video polysomnography, 
with injection ofa radiotracer during an RBD episode. Increased me- 
tabolism was shown in premotor areas, the interhemispheric cleft, 
the periaqueductal area, the dorsal and ventral pons, and the an- 
terior lobe of the cerebellum. Interestingly, no activation of the basal 
ganglia was observed, as is observed during wakeful movements. 
However, this study used a small sample size and no control group 
[68]. Other SPECT studies have reported differences in brain acti- 
vation, compared to healthy controls, during resting wakefulness. 
A study in awake SPECT patients found decreased activity in the 
temporo-parietal and frontal cortices, as well as increased activity in 
the putamen, pons, and right hippocampus [69], and these findings 
have been replicated using larger populations [70, 71]. Decreased 
rCBF in the parietal, occipital, limbic, and cerebellar regions was ob- 
served in a group of idiopathic RBD patients, and a 2-year follow- 
up with the same individuals revealed further reduction of rCBF in 
the medial portions of the parieto-occipital lobe, with a significant 
decrease in rCBF in the right posterior cingulate [72]. A 3-year lon- 
gitudinal SPECT study of 20 idiopathic RBD patients reported that 
half of these participants developed a neurodegenerative disease and 
hippocampal hyperperfusion was predictive of the development of 
neurodegeneration [71]. 

Due to a high degree of comorbidity with nigrostriatal dopa- 
mine pathway disorders, a number of studies have focused on un- 
veiling a link between RBD and dopaminergic abnormalities. RBD 
patients showed a significant reduction in striatal DAT receptor 
densities, compared to controls, but not compared to PD patients 
[73]. Attempts to re-create these findings have been mixed, with 
some studies showing DAT density decrease in only a minority of 
RBD patients [74, 75]. Longitudinal studies of dopaminergic func- 
tion have also shown a link between RBD and neurodegeneration, 
with presynaptic DAT densities in the substantia nigra being pre- 
dictive of later onset of alpha-synucleinopathies in RBD [76]. No dif- 
ferences were found between RBD, PD, and healthy controls when 
post-synaptic D2 receptors were probed [73]. SPECT investigations 
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of serotonergic pathways also failed to show any involvement of 
these pathways in RBD [77]. 

Anatomical analyses using MRI confirmed that degeneration in 
RBD is not limited to the substantia nigra but is also observed in 
neighbouring pathways, as evidenced by reduced neuromelanin 
signal intensity in the nearby caeruleus/subcaeruleus complex; this 
reduction was a better predictor of alpha-synucleinopathies, com- 
pared to clinical measurements [78]. Structural studies of RBD 
using VBM and DTI have also revealed bilateral putamen volume 
decrease [79], increased hippocampal grey matter density [80], and 
white matter alterations in brain areas involved in REM sleep regu- 
lation [80, 81]. Decreased cortical thickness in the frontal cortex, 
the lingual gyrus, and the fusiform gyrus was also recently reported 
[82]. Using a large sample of PD patients, a recent study observed 
smaller volumes in the pontomesencephalic tegmentum in PD pa- 
tients with probable RBD, compared to PD patients without RBD 
and healthy controls [83]. This area contains neurons involved in 
the promotion of REM sleep and muscle atonia, and it is interesting 
to note that animals which have lesions in this area will show RBD- 
like symptoms, including dream enactment and loss of REM atonia, 
supporting the view that the pons is a key region for RBD [84]. 
Furthermore, PD with probable RBD had decreased volumes ex- 
tending to other subcortical and cortical regions whose loss may 
contribute to the dysregulation of sleep-wake states and motor ac- 
tivity underlying RBD in PD patients [83]. Moreover, PD patients 
with comorbid RBD showed reduced thalamic volume, compared to 
PD patients without RBD, ina VBM study [85]. 

MRS studies failed to reveal any differences between metabolic 
peaks of NAA/Cr, choline/Cr, and myoinositol/Cr ratios in the pon- 
tine tegmentum and the midbrain between idiopathic RBD patients 
and healthy controls [86], and between PD and RBD patients in the 
pontine region [87]. 

Taken together, these findings indicate an involvement of brain 
structures including (but not limited to) the pontine nuclei and the 
nigrostriatal dopaminergic system in the pathophysiology of RBD, 
although more work is required to elucidate the precise pathways 
involved in giving rise to this condition. The often bizarre dream en- 
actments associated with RBD are a powerful reminder of the very 
active and often emotional state of the brain during REM sleep. 


Conclusions 


There is still much to discover. New technological advancements 
in neuroimaging will allow researchers to run studies at higher 
and faster resolutions, which will be essential to truly elucidate 
the precise neural networks involved in the deceptively complex 
physiological state we call sleep. Furthermore, larger data sets, 
standardization of methodological procedures, and multimodality 
of imaging techniques will aid the field of sleep neuroimaging, as 
it expands and incorporates new results. As we further our under- 
standing of pathologies in sleep-related neural pathways, innova- 
tive treatment interventions may emerge from better knowledge of 
the neural mechanisms of sleep disorders. As the physical, psycho- 
logical, and cognitive costs of dysfunctional sleep become more ap- 
parent, the impetus for finding solutions to disorders of sleep will 
surely increase. With hope, future research in sleep neuroimaging 
will enable novel treatments for the many disorders of sleep. 
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While sleep-wake disorders are common and impairing, they are 
underdiagnosed and undertreated. This notwithstanding, recent de- 
velopments in the science of sleep and sleep disorders, described in 
Chapters 109 to 113, have given rise to the burgeoning field of sleep 
medicine. There has been a surge in the number of accredited sleep 
centres, training opportunities for health care professionals, and 
sleep-specific societies, all with the aim of improving sleep disorder 
recognition, diagnosis, and treatment. 

In this chapter, we will focus on the evidence-based manage- 
ment of insomnia and circadian rhythm sleep-wake disorders 
(CRSWDs), the two classes of sleep disorder most commonly 
encountered in psychiatric practice. Other sleep disorders, like 
non-rapid eye movement (NREM) and rapid eye movement (REM)- 
related parasomnias, obstructive sleep apnoea (OSA), and restless 
legs syndrome (RLS) and periodic limb movements of sleep also co- 
vary with psychiatric disorder and its treatment; their recognition 
and diagnosis are covered in Chapter 110, but their management is 
beyond the scope of this chapter (for further in-depth reading, see 
Oxford Textbook of Sleep Disorders [1]). 


ke 





























Insomnia disorder (ID) is characterized by persistent problems 
(23 days per week for >3 months) with sleep initiation and/or main- 
tenance, resulting in significant impairment to quality of life (QoL) 
[2-4]. ID is the most common sleep disorder and the second most 
prevalent mental health complaint in Europe, affecting 10-12% of 
the adult population [5, 6]. Historically viewed as a symptom of a 
so-called ‘primary illness, ID is now recognized as: (1) a disabling, 
non-remitting condition in its own right [3]; and (2) a causal factor 
in the evolution and maintenance of physical and mental ill-health, 
particularly depression and cardiometabolic disease [7]. This change 
in understanding is reflected in the recent reclassification of primary 
and secondary insomnia into one overarching category (‘Insomnia 
disorder’) in both DSM-5 and the International Classification of 





Sleep Disorders, third edition (ICSD-3) [8]. Prospective data also 
suggest that persistent insomnia is a robust risk factor for all-cause 
mortality, after adjustment for potential confounding factors [9]. 
Although UK data are limited, extrapolation from per person cost 
data calculated in Canada [10] suggests that direct and indirect costs 
of insomnia are likely to exceed £14 billion per year. Associated costs 
reflect increased health care utilization, higher rates of workplace 
absenteeism, reduced productivity (‘presenteeism’), and increased 
accident risk [11, 12]. 

While insomnia is a heterogenous condition, most likely com- 
prising multiple phenotypes [7, 13], contemporary theoretical 
models posit that key cognitive and behavioural processes serve 
to disrupt sleep-wake regulation [14-18]. In Spielman’s stress- 
diathesis conceptualization, genetic vulnerability (for example, 
[19]) is assumed to interact with precipitating factors (for example, 
stress, illness), stimulating an acute episode of insomnia. Cognitive 
and behavioural responses to this acute sleep loss (for example, ex- 
tension of time in bed, napping, altered light exposure), driven, in 
part, by dysfunctional beliefs and attitudes about sleep, create sleep 
preoccupation and effort, conditioned arousal, sleep fragmenta- 
tion, and increased night-to-night sleep variability. The end-state 
is persistent hyperarousal across cognitive, autonomic, and cortical 
domains, eroding sleep continuity and architecture and negatively 
affecting daytime functioning [20]. This dysfunctional arousal may 
also manifest in the report of poor sleep, despite an absence of gross 
sleep impairment on polysomnographic recordings [21-23]. 

It extends logically that treatment of insomnia should address 
cognitive and behavioural maintenance factors. There are two main 
evidence-based treatment modalities for the management of in- 
somnia. The first comprises cognitive behavioural therapies (CBTs), 
reflecting a ‘high-level’ approach aimed at addressing sleep-related 
behaviours and cognitions, effectively clearing the path to normal 
sleep. The second, more ‘low-level’ approach involves direct pharma- 
cological induction of sleep through, for example, the adminis- 
tration of hypnotics. The National Institute for Health and Care 
Excellence (UK) and prominent organizations in the United States 
(American College of Physicians) [24] endorse CBT as the first-line 
treatment for chronic insomnia, while hypnotics are recommended 
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only for short-term use due to the risk of tolerance, withdrawal ef- 
fects, and next-day side effects. We will now summarize each treat- 
ment approach. 





Cognitive behavioural therapy 


CBT is a multicomponent psychological therapy, usually delivered 
over 4-8 sessions by a trained health care professional and after a 
thorough clinical history. Techniques typically have a lifestyle focus 
(for example, sleep hygiene), behavioural focus [for example, sleep 
restriction therapy (SRT), stimulus control, relaxation techniques], 
and cognitive focus (for example, addressing unrealistic expect- 
ations about sleep, paradoxical intention). Table 114.1 summar- 
izes the main ingredients of CBT and proposed treatment targets. 
It should be noted that sleep hygiene education has no evidence as 
a standalone intervention in the management of insomnia but is 
typically incorporated to ensure that patients have basic knowledge 
about factors that inhibit or facilitate sleep. 

Compared to CBT for other psychological disorders, insomnia- 
focused CBT places greater emphasis on behavioural vs cognitive 
modification (for an example of CBT session coverage, see Fig. 
114.1). Indeed, behavioural components have the largest evidence 
base as standalone treatments [25, 26], and their level of implemen- 
tation within multicomponent CBT is reliably associated with sleep 
improvement [27]. This is, in part, because sleep can be improved 
through attention to its fundamental biological regulation. For ex- 
ample, SRT, a core component of CBT, is considered to be a psycho- 
biological treatment because it takes advantage of sleep homeostasis 
to reduce arousal and consolidate nocturnal sleep. Thus, patients are 
asked to keep a sleep diary for 1-2 weeks to enable calculation of the 
average time in bed and the average sleep duration (Table 114.2). 
SRT restricts time in bed to match self-reported sleep time—with 
the rationale that time in bed awake can maintain fragmented and 
poor-quality sleep. Restriction of time in bed, coupled with stand- 
ardization of bed and rise-times each day, addresses both homeo- 
static and circadian regulation of the sleep-wake cycle, increasing 


the probability that patients will sleep through their allocated 
window each night. The successful pairing of sleepiness/sleep and 
the bedroom environment may also help to address conditioned 
(hyper)arousal that is presumed to play a role in insomnia mainten- 
ance. Titration of the sleep window each week helps to establish a ro- 
bust new sleep-wake pattern for the patient and identify core sleep 
need. SRT has been associated with reductions in cognitive arousal, 
sleep effort, core body temperature, and night-to-night sleep vari- 
ability [26, 28, 29]. 

It should be noted that any treatment that has potential to do good 
also has the potential to do harm—and psychological interventions 
are no exception [30]. SRT should be avoided or carefully adapted 
for populations where acute sleep loss may stimulate episodes, re- 
lapse, or symptom exacerbation (for example, epilepsy, bipolar dis- 
order, parasomnia). Patients should also be educated about driving 
risks during acute implementation since vigilance is known to be 
compromised and daytime sleepiness enhanced [28, 31]. 

CBT is effective across a range of contexts, for example when 
delivered by different professional groups (for example, nurses, 
masters-level psychologists), through different delivery formats 
(groups, face-to-face, over the phone, Internet), and to diverse 
populations (for example, those with both medical and psychiatric 
comorbidity). The most comprehensive meta-analysis of CBT trials 
to date showed a large effect size (Hedges g = 0.98) for insomnia 
severity and medium to large effects for sleep continuity meas- 
ures [32]. Treatment effects do not vary for patients with/without 
comorbid disease, by age, or whether or not patients use sleep medi- 
cation. Importantly, recent studies also showed generalized benefits 
of CBT on other parameters of health and well-being, including in- 
flammatory markers [33] and depressive symptoms [34]. Despite a 
large evidence base and endorsement in clinical guidelines, provi- 
sion of CBT for insomnia within health care settings remains very 
limited [35]. Those in primary care are likely to receive sleep hy- 
giene advice or be prescribed hypnotic and/or sedative antidepres- 
sant medication (off-label). Digital interventions therefore provide a 
unique opportunity to facilitate access at scale. Espie and colleagues 
[36, 37] have shown that automated CBT delivered by an animated 


Table 114.1 Principal components of cognitive behavioural therapy (CBT) for insomnia [106] 


Cognitive therapy 


Identify, challenge, and change dysfunctional beliefs and attitudes about sleep and the consequences of poor sleep that 


may contribute to sleep-related arousal. Use paradoxical intention techniques to reduce attempts to control sleep 


Sleep hygiene 


General recommendations that promote 


healthy sleep habits and practices that are conducive to healthy sleep. These 


can include altering the bedroom environment, scheduling a consistent pre-sleep routine, avoiding arousal-promoting 
activities, and limiting alcohol, caffeine, and nicotine before bed 


Stimulus control 


Behavioural instructions aimed at eliminating the association between the bedroom and arousal; instead, strengthening 


the bedroom environment as a stimulus for sleep and sleepiness. Examples include: only using the bedroom for sleep- 


related activities; only going to bed when 
15 minutes, and returning only when slee 


Sleep restriction 


sleepy/tired; leaving the bedroom when unable to sleep within approximately 
py (‘quarter of an hour rule’) 





A behavioural intervention which involves restricting and standardizing a patient's time in bed, with the aim of increasing 


homeostatic sleep pressure, overriding cognitive and physiological arousal, and strengthening circadian control of sleep. 
Tailored prescription of bedtime and rise-time over several weeks leads to improved sleep consolidation and quality 


Relaxation techniques 


Techniques that provide cognitive and somatic relaxation for the patient that may facilitate sleep onset/reinitiation, 


including progressive muscle relaxation, mindfulness, breathing meditation, and guided imagery 


Reproduced from BrJ Health Psychol., 22(4), Kyle S, Henry A, Sleep is a modifiable determinant of health: implications and opportunities for health psychology, pp. 661-70, Copyright 


(2017), with permission from John Wiley and Sons. 
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Session 1 Sleep Information 


Aim: To learn about normal sleep processes and about sleep 
disorders 


to understand the need for sleep and its functions 
to understand sleep pattern and how it varies during the lifetime 
to understand sleep as a process with stages and phases 


to understand factors which adversely affect sleep pattern and 
sleep quality 


e to understand the effects of sleep loss 

e to understand the concept of insomnia and how it can be 
measured 

e to understand personal sleep histories and patterns in the above 
context 

e to begin to correct previous misunderstandings about sleep and 
sleeplessness 


Session2 Sleep Hygiene & 
Relaxation 


Aim: To introduce practical steps towards developing a healthy sleep 
pattern without recourse to drugs 


e to review progress so far and maintain treatment goals 
to create a bedroom environment that is comfortable for sleep 


to take regular exercise which promotes fitness and enhances 
sleep 


to develop a stable and appropriate diet 

to reduce the undesirable effects of caffeine upon sleep 

to moderate alcohol consumption and eliminate ‘night-caps’ 
to support people reduce and stop taking sleeping pills 

to design individualised reduction programmes 

to learn relaxation skills to apply in bed 


Session3 Sleep Scheduling 


Aim: To re-shape sleep patterns to correspond with individual sleep 
needs and to strengthen sleep rhythms 


to review progress so far and maintain treatment goals 

to develop a good pre-sleep routine 

to distance waking activities from the bedroom environment 

to establish a strong bed-sleep connection 

to eliminate wakefulness from bed 

to define restricted parameters for the individual’ s sleep period 


to increase sleep efficiency through scheduling sleep in relation to 
needs 


to eliminate daytime napping 
to establish a stable night to night sleep pattern 
to encourage and support people in changing their sleep routines 


Fig. 114.1 Session-by-session (a-e) aims for a 5-week multicomponent CBT programme. 
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(a) 


Session 4 Cognitive approaches 
(EN) 


Aim: To learn ways of reducing the mental alertness, repetitive 
thoughts and anxiety which interferes with sleep 


to review progress so far and maintain treatment goals 

to identify thought patterns which interfere with sleep 

to develop accurate beliefs and attitudes about sleep 

to prepare mentally for bed by putting the day to rest 

to learn thought distraction and imagery techniques 

to reduce efforts to control sleep and allow it to happen naturally 
to utilise these techniques to combat intrusive thoughts 


to encourage and support people in changing their mental 
approach 


Session 5 Developing a strong & 
natural sleep pattern 


Aim: To integrate advice from previous sessions and to maintain 
implementation at home 


to review progress so far and maintain treatment goals 
to systematically rehearse the elements of programme 
to address implementation problems experienced 


to plan further adjustments to the sleep period to maintain sleep 
efficiency 


e to encourage and support people in maintaining their new sleep 
routines and their mental approach 


e to learn relapse prevention approaches if a sleep problem recurs 


Fig. 114.1 Continued 


Table 114.2 Sleep restriction therapy guidelines 


Sleep restriction therapy Instructions 
guidelines 


1 Patient instructed to record a sleep diary for 1-2 weeks to obtain average nightly sleep duration, for example 5 
hours 30 minutes (rounded to nearest 15-minute interval). This acts as the designated sleep window 





2 Therapist, in collaboration with patient, sets a morning ‘rising time’, for example 6.30 a.m. 

3 A ‘threshold time’ is then calculated by subtracting the average nightly sleep duration from the specified morning 
rising time, for example 6.30 a.m.—5 hours 30 minutes = threshold time of 1 a.m. 

4 Patient is instructed not to enter bed prior to their designated ‘threshold time’ and to exit bed on, or prior to, set 
‘rising time’ 

5 Patient follows this prescribed schedule every night, including weekends. Avoid driving or operating heavy 
machinery if experiencing excessive daytime sleepiness 

6 Weekly modifications to the sleep window are based on sleep efficiency values (total time asleep/total time in 

bed x 100): 


If sleep efficiency >90%: increase sleep window by 15 minutes 
If sleep efficiency <85%: decrease sleep window by 15 minutes 
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therapist outperformed two control conditions. This was an im- 
portant trial because it incorporated an active comparator therapy 
(so-called imagery relief therapy, or IRT), which deliberately ex- 
cluded the presumed active ingredients of CBT but was plausible 
because it focused on routines preparatory to sleep. Trials that so 
convincingly demonstrate superiority over the non-specific effects 
of attention and therapy involvement are rare. In addition to scale, 
digital technology can facilitate better matching between insomnia 
phenotype and specific CBT techniques [38], which may further po- 
tentiate treatment gains and reduce non-response [39]. 


Pharmacotherapies 


Hypnotics 


Short-term use (3-4 weeks) of sleep-prompting medication is re- 
commended for acute insomnia (for example, insomnia caused by 
temporary stressor, jet lag, etc.) [40] or for chronic insomnia that 
fails to respond to CBT as the first-line treatment [24]. Both benzo- 
diazepines (for example, lormetazepam) and non-benzodiazepine 
positive allosteric GABA-A receptor modulators (for example, 
zolpidem, zopiclone—the so-called ‘z-drugs’) are approved for the 
short-term management of insomnia (for dose and half-life of com- 
monly prescribed insomnia medications, see Table 114.3). Careful 
consideration of pharmacological properties and risk—benefit pro- 
file, as well as any specific individual contraindications (for example, 
comorbidities, medications), is necessary to optimize and tailor 
pharmacological treatment for each patient. 

These medications modify sleep and arousal via allosteric 
modulation of the GABA-A receptor complex, potentiating the 
inhibitory effects of GABA on the central nervous system. It 
should be noted, however, that, in general, they do not re-create 
normal sleep. Indeed, they may suppress power density in the 
low-frequency range during NREM sleep [41] and impair sleep- 
dependent brain plasticity [42]. Meta-analyses showed hypnotics 
to reliably improve insomnia symptoms, relative to placebo, 
when administered over a short period of time [43]. However, 
more recent analyses showed smaller effect size differences when 


Table 114.3 Benzodiazepines and non-benzodiazepine GABA, 
receptor positive allosteric modulators frequently used for treatment 
of insomnia [107] 


Drugs Usual dose (mg) Half-life (hours) 


Benzodiazepines 





Flunitrazepam 0.5-2 16-35 
Flurazepam 15-30 48-120 
Lormetazepam OS=| 8-15 
Nitrazepam Sme 2S 
Temazepam 10-30 10-20 
Triazolam 0.125-0.250 1.4-4.6 
GABA, receptor modulators 

Zolpidem 5-10 2-4 
Zopiclone INIMA 56 


Reproduced from Handb Exp Pharmacol., Spiegelhalder K, Nissen C, Riemann D, Clinical 
Sleep-Wake Disorders Il: Focus on Insomnia and Circadian Rhythm Sleep Disorders, 
pp. 1-16, Copyright (2017), with permission from Springer International Publishing AG. 


specifically investigating elderly patients [44] or when consid- 
ering both published and unpublished trials of z-drugs [45]. 
Importantly, there is no evidence that treatment gains from hyp- 
notics last beyond their discontinuation. Only a handful of trials 
have compared hypnotics directly with CBT, finding that treat- 
ment effects are comparable in the short term but that CBT out- 
performs medication at follow-up [46, 47]. 

Owing to the longer half-life of traditional benzodiazepines, 
physicians increasingly prefer to prescribe shorter-acting z-drugs 
[48]. Z-drugs are also considered more effective due to their 
greater selectivity for the alpha 1 subunit, thus enhancing activity 
in terms of sedation and limiting more generic effects involved in 
the interaction with other subunits [49]. Despite this assumption, 
meta-analyses suggested there are limited differences in efficacy 
or adverse effect profiles [44, 50]. Both are associated with an in- 
creased risk of adverse effects, including falls, driving accidents, 
and psychomotor impairment. Epidemiological studies also sug- 
gested that long-term use of hypnotics is associated with an in- 
creased risk of infection, depression, cancer, and mortality [51], 
although the direction of the causal relationship may be open to 
question. 


Melatonin receptor agonists, orexin antagonists, and 
off-label medication use 


Melatonin receptor agonists, which act on MT, and MT, receptor 
sites in the suprachiasmatic nucleus (SCN) of the hypothalamus, are 
also licensed for the management of insomnia. Ramelteon (8 mg) 
is the only approved melatonin agonist for insomnia on the market 
(United States) and has been investigated in more than a dozen trials. 
Although demonstrating low potential for abuse and limited side 
effects, in comparison with GABA-mediated hypnotics, ramelteon 
appears to have only modest effects on sleep latency and little/ 
no impact on wake-time after sleep onset or total sleep time [52]. 
Kuriyama et al. [53] meta-analysed 13 placebo randomized con- 
trolled trials of ramelteon, finding statistically significant, but small, 
effects for subjective sleep latency (mean difference = 4 minutes, 
compared to placebo) and self-reported sleep quality. Circadin’ (2 
mg), an extended-release melatonin agonist, is approved in Europe 
for those over 55 with insomnia. Trials have documented small to 
medium effects for sleep continuity parameters, sleep quality, and 
daytime functioning, and the safety profile appears encouraging 
[54, 55]. 

The first orexin receptor antagonist (suvorexant) has been ap- 
proved by the US Food and Drug Administration (FDA) for the 
management of insomnia (it is not yet licensed in Europe). Orexin- 
producing neurons in the lateral hypothalamus innervate ascending 
arousal systems, helping to generate and sustain wakefulness. Hence, 
antagonism of orexin A and B receptors may promote sleep through 
blockade of the wake drive and insomnia-related hyperarousal. 
Prolonged treatment with suvorexant (30-40 mg) for between 3 and 
12 months has been found to effectively treat insomnia symptoms, 
and benefits persist even after a prolonged (2-month) discontinu- 
ation phase [56, 57]. While reports of somnolence are increased, 
relative to placebo, the adverse effect profile appears less impairing 
than that of hypnotics [58]. It must be noted that the FDA recom- 
mends a maximum dose of 20 mg suvorexant—due to concerns 
about somnolence—but clinical trial data are less robust for this 
dose, relative to 30-40 mg. 
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A number of other drugs are also used ‘off-label’ to treat insomnia, 
including sedative antidepressants, anti-neuroleptics, and atypical 
(second-generation) antipsychotics. This is not evidence-based prac- 
tice [24, 39]. For example, the most frequently prescribed sleep aid in 
the United States is the sedative antidepressant trazodone; yet there is 
a paucity of trials to inform efficacy or risk—benefit profile [59]. 


Circadian rhythm sleep-wake disorders 


CRSWDs manifest because of alterations to the endogenous cir- 
cadian clock (intrinsic CRSWDs) or when there is gross mis- 
alignment between the sleep-wake cycle and the 24-hour social 
environment (extrinsic CRSWDs). Patients typically present 
with difficulty initiating sleep or maintaining sleep, or excessive 
sleepiness, with corresponding negative effects on daytime func- 
tioning. Because the circadian clock runs at slightly longer than 
24 hours (approximately 24.2 hours), alignment between our in- 
ternal rhythms and the 24-hour social environment requires 
daily adjustment. Light, physical activity, and melatonin are the 
main synchronizers of the clock and therefore form the back- 
bone of contemporary CRSWD management. We will now pro- 
vide an overview of evidence-based treatments for the following 
intrinsic CRSWDs: delayed sleep-wake phase disorder (DSWPD)/ 
advanced sleep-wake phase disorder (ASWPD), irregular sleep- 
wake schedule disorder, and non-24-hour sleep-wake disorder. 
While we emphasize treatments that are endorsed by the American 
Academy of Sleep Medicine (AASM) task force [60], the highly 
specific patterning of sleep-wake timing in individual patients ne- 
cessitates a tailored approach when managing CRSWDs. For the 
correct administration of light therapy and exogenous melatonin, 
it is important that the clinician is able to estimate the circadian 
phase. Core body temperature nadir and rise in endogenous mela- 
tonin [dim light melatonin onset (DLMO)] are the two most ro- 
bust indicators of phase position. While procedures are currently 


being developed to reliably assess the circadian phase at home [61], 
clinicians will typically have to rely on questionnaire measures 
(sleep diary logs or psychometrics) to infer the circadian phase. In 
general, as a rule of thumb, the nadir of the core body temperature 
occurs 2 hours before habitual sleep offset, while DLMO initiates 
about 2-3 hours prior to habitual sleep onset (Table 114.4). 


Delayed sleep-wake phase disorder 


DSWPD is one of the most common CRSWDs, affecting approxi- 
mately 2% of the population [62]. It is characterized by a sleep 
schedule that occurs significantly later than desired or what would 
be considered ‘normal’ by society. Patients with DSWPD present 
with habitually delayed sleep onset and wake-times, or difficulties 
with sleep onset and rising when attempting to adhere to a more typ- 
ical sleep schedule. When able to sleep on their preferred schedule, 
sleep duration and quality are normal [8]. While the aetiology of 
DSWPD remains unclear, longer-than-average circadian periods 
[63], hypersensitivity to light in the evening [64], and sleeping 
through the advanced period of the phase response curve in the 
morning [65] are all thought to play a role in perpetuating phase 
delay. DSWPD is overrepresented in adolescents, most likely due to 
puberty-related changes in sleep homeostasis and circadian timing, 
compounded by inflexible social schedules. Several treatments have 
been proposed to advance the circadian phase in DSWPD, including 
chronotherapy, melatonin, and phototherapy. 


Chronotherapy 


Chronotherapy, or prescribed sleep scheduling, is a behavioural 
intervention aimed at creating a desired sleep-wake pattern by 
systematically delaying bedtime and rise-time by 3 hours each 
day [66]. Because the clock runs at >24 hours in most humans, it 
is easier to phase-delay than phase-advance the sleep-wake cycle. 
Upon achieving the desired time, individuals are instructed to ri- 
gidly adhere to strict rules around sleep and wake activity, avoiding 
activities that may encourage phase drift. Chronotherapy appears 


Table 114.4 Summary of clinical presentation, typical sleep-wake schedule, and evidence-based treatments for CRSWDs, based 


upon current AASM guidelines 


CRSWD Clinical presentation Typical sleep-wake pattern Recommended treatments 


DSWPD Difficulty falling asleep and waking at 


times considered normal (10 a.m.-1 p.m.) 


ASWPD Difficulty maintaining wakefulness and 
sleepiness in early evening, combined 
with early morning awakening or sleep 


maintenance difficulties 


N24SWD Bouts of insomnia symptoms and excessive 
daytime sleepiness, interspersed with 


periods of normal alertness 


ISWRD Insomnia symptoms or excessive daytime 


sleepiness 


Delayed sleep onset (2-6 a.m.) and wake-time 


Early sleep onset (6-9 p.m.) and wake (2-5 a.m.) 


Bedtime and wake-time are consistently 
delayed (by, for example, 1 hour) each day 


Inconsistent and irregular patterning 


Melatonin (0.5-3 mg) 1.5-6.5 hours prior to 
bedtime/DLMO 


Bright light therapy (=2500 lux) for 4 hours, 
starting at 8 p.m., or 4000 lux between 9 and 
11 p.m. 


Melatonin 2-3 hours before desired 

bedtime (0.5-5 mg) when circadian phase is 
approaching normal clock time, or melatonin 
agonist (tasimelteon) 1 hour prior to desired 
bedtime 


Morning bright light (=2500 lux) for 2 hours 
(between 8 and 11 a.m.) in elderly individuals 
with dementia 





ASWPD, advanced sleep-wake phase disorder; CRSWD, circadian rhythm sleep-wake disorder; DLMO, dim light melatonin onset; DSWPD, delayed sleep-wake phase disorder; 


ISWRD, irregular sleep-wake rhythm disorder; N24SWD, non-24-hour sleep-wake disorder. 


Source: data from J Clin Sleep Med, 11(10), Auger RR, Burgess HJ, Emens JS, et al., Clinical practice guideline for the treatment of intrinsic circadian rhythm sleep-wake 
disorders: advanced sleep-wake phase disorder (ASWPD), delayed sleep-wake phase disorder (DSWPD), non-24-hour sleep-wake rhythm disorder (N24SWD), and irregular sleep- 
wake rhythm disorder (ISWRD). An update for 2015: an American Academy of Sleep Medicine Clinical Practice Guideline, pp. 1199-236, Copyright (2015), American Academy of 


Sleep Medicine. 
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effective in laboratory demonstrations (for example, [66]), and there 
is some supporting evidence from observational studies (for ex- 
ample, [67, 68]); however, there is a dearth of robust randomized 
controlled trials in the home environment. Potential for creating 
free-running in some patients has also been observed [69]. Given 
the lack of controlled outcome studies, the AASM states that there 
is insufficient evidence to warrant recommendation of prescribed 
sleep scheduling for individuals with DSWPD [60]. 


Morning phototherapy 


Timed light therapy, also known as phototherapy, aims to take ad- 
vantage of the potent role of light as a synchronizer of circadian 
physiology. Exposure to bright light in the morning is intended to 
bring about a phase advance in those with a delayed phase (while 
evening light can phase-delay). Phototherapy is considered more 
practical than chronotherapy and can harness both artificial and 
natural light sources. Light is typically delivered using light boxes or 
other specifically designed devices that emit bright broad-spectrum 
light (for example, re-timer glasses [70]). Light boxes usually deliver 
between 2500 lux and 10,000 lux, although it is important to bear in 
mind that these measurements are recorded at the level of the box, 
and not the eye [71]. 

A number of studies have examined the efficacy of timed light 
exposure in DSWPD. Rosenthal et al. administered 2500-lux light 
to patients for 2 hours between 6 a.m. and 9 a.m., in addition to 
light restriction in the evening. Findings revealed a phase advance 
in core body temperature and increased alertness in the morning 
[72]. Another study found that bright light (2700 lux) administered 
via a mask phase-advanced markers of melatonin metabolism fol- 
lowing extended treatment for 26 days [73]. Despite these positive 
findings and a rich literature on the phase-shifting properties of 
light in healthy controls [71], there is a dearth of high-quality clin- 
ical studies. The field lacks trials with adequate sample sizes, clinical 
outcome measures, and appropriate control groups. Based on the 
low quality of evidence to date, the AASM was recently unable to 
recommend light therapy for the management of DSWPD, either as 
a monotherapy or in combination with other treatments [60]. They 
did, however, note that evidence is higher for the use of light therapy 
for children and adolescent DSWPD populations, in combination 
with behavioural intervention. Given the robust phase-shifting ef- 
fects of light in experimental studies, there is an urgent need to de- 
velop a strong evidence base in clinical populations. 


Melatonin 


Exogenous melatonin administration can shift the circadian clock 
to earlier or later times. Melatonin phase response curves, based on 
average responses from healthy participants, demonstrate phase ad- 
vances when administered in late afternoon or early evening, be- 
tween 5 and 7 hours before habitual bedtime [71]. Mundey et al. 
randomized DSWPD patients (n = 13) to one of three arms: 4 weeks 
of placebo, melatonin 0.3 mg or 3 mg, administered 1.5-6.5 hours 
prior to habitual sleep onset. Both melatonin groups displayed phase 
advance in DLMO, the magnitude of which was strongly correlated 
with the time of melatonin administration (earlier times being the 
most effective) [74]. A meta-analysis of nine randomized, double- 
blind, placebo-controlled studies confirmed that administration 
1.5-6.5 hours prior to DLMO was effective at phase-advancing the 
clock and reducing sleep onset latency by a mean of 23 minutes [75]. 


At present, the AASM recommends timed melatonin for the treat- 
ment of DSWPD in adults, children, and adolescents. However, fur- 
ther research is required to establish optimum parameters (timing 
and dose), as well as long-term safety [60]. 


Advanced sleep-wake phase disorder 


ASWPD is characterized by persistent advancement of the major 
sleep period (for example, sleep onset at approximately 6-9 p.m. and 
sleep offset at approximately 2-5 a.m.). This advancement manifests 
as habitual difficulty maintaining wakefulness in early evening and 
wake-up times that are several hours earlier than normal. When at- 
tempting to delay sleep to a more conventional clock time—for ex- 
ample, to engage in social activities—patients typically experience 
significantly curtailed sleep duration and next-day sleepiness. The 
prevalence of ASWPD increases with age [76, 77], which may be a 
function of an age-related phase advance [78]. Possible explanatory 
mechanisms include a shortened endogenous circadian period and 
increased retinal sensitivity to light in the morning [79]. Treatment 
in this population relies on timed light therapy during the evening 
or delaying the portion of the phase response curve to bring about a 
phase delay of the sleep window. 


Evening phototherapy 


One study in older adults showed that administration of bright 
light (4000 lux) between 7 and 9 p.m. for a 12-day period resulted 
in a 2-hour delay in core body temperature nadir and a reduction 
in nocturnal awakenings [80]. A more recent study by Lack and 
colleagues randomized 24 adults with early morning awaken- 
ings to either a bright light condition (2500 lux from 8 p.m. to 1 
a.m. for two consecutive nights) or a control arm (dim red light). 
Participants in the bright light group evidenced a 2-hour delay in 
both melatonin and core body temperature rhythms and reported 
reductions in wake-time during the night, as well as increased total 
sleep duration [81]. 

Other studies have shown more variable results. In a study 
involving older adults (n = 47) with ASWPD, exposure to evening 
light (265 lux) for 2-3 hours had no effect on circadian or actigraphy 
outcomes. Despite this, participants reported significantly delayed 
sleep onset and perceived the treatment to be effective [82]. The low 
light intensity (just 265 lux) may have attenuated phase-shifting ef- 
fects. However, a further study comparing bright light (10,000 lux) 
with dim light placebo similarly failed to find significant effects on 
sleep or circadian variables in older adults [83]. Key limitations of 
studies to date include gross variability in light therapy intensity 
and limited recruitment of well-defined ASWPD patients. This may 
account for the heterogeneity in trial evidence. Despite these limi- 
tations, there is some evidence to support the use of bright light 
therapy in early evening in those with ASWPD, which is supported 
by the AASM task force. 


Melatonin 


Based on melatonin phase response curves, it is theoretically plaus- 
ible that morning administration of melatonin will phase-delay the 
timing of the sleep-wake cycle. At present, however, there is little 
to no robust evidence regarding the effectiveness of melatonin in 
ASWPD [60]. Concerns about the potential for sleepiness during 
the day may have limited progress in this area. Morning administra- 
tion of melatonin should be examined in carefully designed studies. 
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Non-24-hour sleep-wake disorder 


Non-24-hour sleep-wake disorder (N24SWD) is the result of the 
SCN failing to synchronize with the 24-hour light-dark cycle; thus, 
sleep-wake schedules drift progressively later each day. Patients typ- 
ically experience bouts of nocturnal insomnia and daytime somno- 
lence, the severity of which will depend on when the patient tries 
to sleep in relation to the current phase of the endogenous rhythm. 
The condition is relatively rare in sighted individuals and instead 
primarily affects those with total blindness due to an inability to per- 
ceive photic stimuli, impairing entrainment to the 24-hour light- 
dark cycle [84]. Disease mechanisms in sighted individuals may 
include decreased responsiveness to light entrainment or an un- 
usually long free-running circadian period that is outside the range 
of entrainment [79]. Across individuals with N24SWD, treatments 
aim to synchronize patients’ endogenous body clock with the 24- 
hour light-dark cycle and are initiated when the sleep-wake pattern 
is approaching the normal/desired clock time. 


Phototherapy 


In sighted individuals, timed light exposure has been suggested as an 
appropriate intervention, but controlled studies are limited. Correct 
timing of light is important since inappropriate timing may worsen 
the current rhythm or induce an undesired shift in position. In a 
handful of case studies, administration of bright light was found to 
facilitate entrainment [85-88]. Nevertheless, long-term administra- 
tion of this treatment appears to be difficult for patients to manage in 
the home environment, relative to melatonin [85]. Given the small 
evidence base and the absence of controlled trials, light therapy 
cannot yet be recommended as an effective treatment for N24SWD 
in sighted individuals. 


Melatonin and melatonin receptor agonists 


Evidence in support of melatonin for N24SWD comes from obser- 
vational studies in non-sighted individuals [89-92] and sighted in- 
dividuals [85, 93, 94]. Successful entrainment depends on correct 
timing of melatonin since administration when the individual’s 
free-running phase is not approaching normal can have adverse 
consequences and result in unwanted phase shifts. Using a placebo- 
controlled crossover design, Lockley and colleagues showed that 
administration of 5 mg of melatonin at 9 p.m. for a full circadian 
cycle resulted in entrainment in non-sighted individuals with 
N24SWD [95]. 

In recent years, the development of melatonin agonists has in- 
creased treatment options for those with N24SWD. Tasimelteon is 
an MT,/MT, agonist which has recently been approved for treat- 
ment of N24SWD by both the FDA and the European Medicines 
Agency. Two consecutive placebo-controlled trials examined 
the effect of tasimelteon in non-sighted patients diagnosed with 
N24SWD. In the first of these (SET trial), participants were ran- 
domly assigned to either tasimelteon (20 mg) or placebo, 1 hour 
before the desired bedtime and once per day for 26 weeks. It was 
found that 24% of individuals receiving tasimelteon became en- 
trained (vs 0% in the placebo arm) and patients in this group had 
increased night-time sleep and reduced daytime sleep [96]. The 
subsequent RESET trial sought to examine the effect of withdrawal 
from tasimelteon on entrainment. Following initial entrainment, 
individuals were randomized to either placebo (withdrawn group) 


or continued tasimelteon. The group maintained on tasimelteon 
remained entrained, while those allocated to placebo withdrawal 
reverted back to non-24-hour rhythms [96]. Although some side ef- 
fects were reported (headache, elevated liver enzymes, nightmares), 
tasimelteon has clear benefits for those with N24SWD. The AASM 
recommends melatonin for the treatment of N24SWD in blind in- 
dividuals, but not in sighted individuals, given the limited evidence 
base for this latter population. 


Irregular sleep-wake rhythm disorder 


Irregular sleep-wake rhythm disorder (ISWRD) is characterized by 
the absence ofa clearly defined pattern in the sleep-wake cycle. While 
total sleep time may be normal, sleep bouts are fragmented and usu- 
ally distributed in naps across a 24-hour period. There is no primary 
night-time sleep episode. Thus, the amplitude of the rest-activity 
rhythm is low and patients often report insomnia symptoms and 
excessive daytime sleepiness. The condition primarily affects older 
adults, especially those with dementia, other neurodegenerative dis- 
orders, and neurodevelopmental disorders and individuals in care 
home facilities. 

Loss of SCN neurons, driven by dementia pathology, may be one 
contributor to circadian disorganization, impairing the ability to 
generate and sustain consolidated bouts of sleep and wakefulness 
[97, 98]. For patients who are institutionalized (for example, in care 
homes), lack of a structured schedule, reduced physical activity, and 
reduced exposure to light may all contribute to ISWRD [99]. The 
primary goal of treatment in such patients is to establish consoli- 
dated nocturnal sleep and eliminate/reduce daytime sleep episodes. 


Phototherapy 


A number of studies have examined the effectiveness of timed bright 
light exposure in dementia patients, but findings have been incon- 
sistent. In one study, nursing home patients with dementia were 
exposed to morning bright light (8-11 a.m.) for 2 hours each day 
for 2 weeks. Following the treatment period, patients displayed in- 
creased actigraphy-defined sleep efficiency and decreased nocturnal 
wake-time. Although these improvements were maintained for 4 
weeks post-treatment, by 16 weeks, all improvements returned to 
near baseline levels [100]. Mixed effects have been shown across a 
number of other studies [101-103], and a recent Cochrane review 
found no overall effect of light therapy on cognitive function, sleep, 
challenging behaviour, or psychiatric symptoms in patients with 
dementia [104]. Taken together, there appears to be some support 
for the use of bright light in patients with ISWRD, but heterogen- 
eity in design limits solid conclusions. One clear issue is the poor 
delineation of patient groups with ISWRD on study entry, with 
many studies recruiting patients with dementia and sleep difficulties 
(broadly defined), rather than with diagnosed ISWRD. The AASM 
recommends bright light as a possible treatment in ISWRD patients 
with dementia. Nevertheless, it seems clear that further research is 
required to optimize treatment parameters and obtain a definitive 
understanding of treatment benefits. 


Pharmacotherapies 


Melatonin is not currently recommended for patients with ISWRD 
and dementia. A recent Cochrane review of four randomized con- 
trolled trials of melatonin concluded that there was no evidence of 
superiority, relative to placebo [105]. Similarly, current evidence 
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and practice parameters argue against the use of sleep-promoting 
hypnotics in demented patients with ISWRD, owing to the po- 
tential for side effects, including falls, confusion, and physiologic 
dependence [60]. 


Insomnia and CRSWDs are common and impairing sleep dis- 
orders. CBT is the first-line treatment for chronic insomnia, while 
hypnotics should only be used sparingly and for a short period 
of time. Digital technology is helping to disseminate evidence- 
based CBT to insomnia patients, while improved understanding 
of sleep-wake neurobiology is helping to refine pharmacological 
approaches beyond conventional hypnotics. Human circadian sci- 
ence clearly shows that melatonin and light therapy can effectively 
reset the clock, but these insights have only partially translated into 
therapeutics for patients with CRSWDs. High-quality randomized 
controlled trials are required to establish standardized treatment 
guidelines and inform on optimal timing of circadian therapeutics 
in the management of CRSWDs. 
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Sexual relationships are central to the lives of most of us. The sexual 
component of those relationships can go wrong in various ways. This 
may be secondary to other difficulties in the relationship, mental 
health problems, specific sexual vulnerabilities of the individual, or 
the impact of disease or medication on sexual response. This chapter 
will describe the more common sexual and paraphilic disorders. 
Evidence related to prevalence, aetiology, and treatment will be 
briefly reviewed. Practical guidance on how to carry out assessment 
and plan a treatment programme for individuals and couples with 
sexual dysfunction will be provided. 








Clinical features 
Sexual problems in men 


The most common sexual problems presented by men are erectile 
disorder (ED), premature or early ejaculation (PE), and low sexual 
desire. Delayed or absent ejaculation is a relatively infrequent 
complaint. 


Erectile problems 


Penile erection is a tangible and fundamental component of a man’s 
experience of sexual arousal, and a lack of erection in a sexual situ- 
ation often has significant negative effects on self-esteem and sexual 
confidence. Irrespective of whether or not there are peripheral ex- 
planations for impaired erections (for example, vascular disease), 
the reactions of the man and his partner have a major influence on 
how problematic the erectile difficulty becomes. Sexual problems in 
partners of men with ED have also been reported as common [1]. 


Low sexual desire 


Low sexual desire is sometimes linked with erectile and/or ejacu- 
latory concerns [2]. Men with low sexual desire often report a de- 
cline in their initiation of sexual activity and in their responses to a 
partner's attempts to initiate. Many men with low sexual desire also 
report a reduction in ‘spontaneous’ erections. However, a man can 
experience low sexual desire without having any erectile difficulties, 





although he may require more direct tactile stimulation to achieve 
erections. 


Premature ejaculation 


The key feature of PE is that ejaculation occurs prior to, or shortly 
after, vaginal penetration [2]. Many men also report feeling a lack of 
control over ejaculation and anxiety about PE occurring in future 
sexual encounters [3]. 

To meet the Diagnostic and Statistical Manual of Mental Disorders, 
fifth edition (DSM-5) diagnostic criteria for PE, ejaculation must 
occur within approximately 1 minute of vaginal penetration and be- 
fore the man wishes it [2]. PE can be lifelong or acquired; occurring 
after a period of normal ejaculatory latency, acquired PE usually has 
a later onset. 


Delayed ejaculation 


Delayed, infrequent, or absent ejaculation occurs in men, although 
it is much less common than PE. A man might have difficulty 
ejaculating only during sexual activity with his partner and in some 
cases only during penetrative intercourse, or the problem may be 
evident even during masturbation. Delayed or absent ejaculation is a 
common side effect of selective serotonin reuptake inhibitor (SSRI) 
medications, suggesting that the primary effect of such drugs is on 
the triggering of orgasm [4]. 


Pain during sexual response 


While there is no male sexual pain disorder listed in either the 
International Classification of Diseases (ICD) or DSM, pain during 
sexual response in men can occur [5]. 


Sexual problems in women 
Loss of sexual arousal and/or desire 


Most surveys have suggested that low sexual desire is the most 
common sexual problem reported by women. However, low sexual 
desire is a heterogenous problem category, and the relationship be- 
tween sexual arousal and sexual desire in women is particularly 
complex. Many women do not differentiate between ‘arousal’ and 
‘desire’ [6], and awareness of ‘desire’ is usually accompanied by some 
degree of central arousal, whether or not any genital response is 
perceived [7]. There is also considerable overlap or comorbidity be- 
tween problems related to sexual arousal and desire in women [8]. 
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Although traditionally seen as the counterpart to penile erection 
in men, vaginal response is not central to the experience of sexual 
arousal in women. Vaginal dryness does not necessarily indicate 
lack of arousal, and conversely, a woman may experience lack of 
sexual arousal and yet have vaginal lubrication. An increase in va- 
ginal blood flow has been consistently demonstrated in women re- 
acting to sexual stimuli, whether or not they find the sexual stimulus 
appealing; this led Laan and Everaerd [9] to call this an ‘automatic 
response. The relevance of vaginal response to sexual arousal in 
women therefore remains unclear. Tumescence of the clitoris, on the 
other hand, may be more directly comparable to male genital re- 
sponse, but this is less easily assessed and less clearly perceived by 
women, compared with penile erection in men. 


Problems with genito-pelvic pain 


Pain during attempted or complete vaginal entry is acommon sexual 
problem in women with a wide range of possible causes. Fear of pain 
or vaginal intercourse and tension of the pelvic floor muscles are 
common diagnostic features [2]. Sexual pain is also frequently asso- 
ciated with a lack of sexual desire and/or arousal and with avoidance 
of sexual situations. Sometimes pain only occurs when provoked; for 
example, vulvar vestibulitis syndrome is a condition associated with 
pain on touching the labia or vaginal introitus. 


Problems with orgasm 


Difficulty experiencing orgasm is not uncommon in women. Often 
this is situational in that orgasm is possible with masturbation, but 
not during sexual interaction with the partner. The capacity to ex- 
perience orgasm varies considerably across women. Some women 
reach orgasm easily if sufficient arousal occurs; others may require 
more specific or more intense stimulation, and an estimated 10- 
15% are unable to experience orgasm throughout their lives [10]. In 
identifying a problem as primarily orgasmic, one needs to first estab- 
lish that appropriate sexual arousal has occurred. Orgasmic difficul- 
ties can be lifelong or can be acquired, that is, problems developed 
after a period of normal orgasmic functioning. 


Persistent genital arousal disorder 


Persistent genital arousal disorder (PGAD) is a recently recognized, 
but fairly uncommon, sexual problem in women. It is characterized 
by genital and breast vasocongestion and sensitivity which persists 
for hours or days and is only temporarily relieved by orgasm; genital 
sensations are unaccompanied by any subjective sense of sexual de- 
sire and excitement but instead are perceived as intrusive [11]. 


Classification 


The two major classification systems for sexual dysfunction are the 
American Psychiatric Associations DSM and the World Health 
Organization’s ICD. ‘The fifth edition of the DSM was published in 
2013 [2] and ICD-11 was published in 2018 [12]. Efforts were made 
to harmonize the groups of disorders in ICD-11 with those included 
in DSM-5 [13]. 

DSM-5 comprises major changes in the classification of sexual 
disorders [14]. Specific duration and severity criteria were added to 
all of the sexual disorders—a requirement that the symptoms must 
have persisted for a minimum duration of approximately 6 months 


and have been experienced on all or almost all (approximately 75- 
100%) of sexual encounters. There is also the requirement that the 
symptoms cause ‘clinically significant distress in the individual’ [2]. 

Regarding female sexual disorders, there were other significant 
changes. Major criticisms of DSM-IV were the high comorbidity be- 
tween diagnoses of sexual disorders, the ‘genital’ focus of the diagnostic 
criteria, and the neglect of psychological and relationship factors [15- 
17]. In response to these criticisms and to growing evidence on the 
overlap between arousal and desire in women, in DSM-5, “Hypoactive 
sexual desire disorder’ and ‘Female sexual arousal disorder’ were de- 
leted and a new disorder ‘Female sexual interest/arousal disorder’ 
(FSIAD)—was added. The criteria for FSIAD include subjective, behav- 
ioural, and physical aspects of sexual interest/arousal. There were also 
significant changes made to female sexual pain disorders, with both 
vaginismus and dyspareunia deleted from DSM-5, and a new diagnosis 
‘Genito-pelvic pain/penetration disorder’ (GPPPD) introduced (for the 
rationale for these changes, see [18)]). 

For women, the DSM-5 diagnostic categories are FSIAD, female 
orgasmic disorder, and GPPPD. For men, the categories are ED, PE, 
male hypoactive sexual desire disorder, and delayed ejaculation. 
Substance/medication-induced sexual dysfunction, other ‘specified 
sexual dysfunction, and ‘unspecified sexual dysfunctions’ are diag- 
noses that can be applied to both men and women. 

An important aspect of the DSM definition of sexual dysfunction 
is how we define a ‘clinically significant disturbance, with connota- 
tions of abnormal or impaired function, and how this is distinguished 
from a ‘sexual problem in a more general sense. Relevant to the ques- 
tion of when a sexual problem becomes a ‘dysfunction’ is a theoretical 
approach called the dual control model [19, 20]. This postulates that 
sexual response results from an interaction between excitation and 
inhibition, involving relatively discrete neurophysiological systems in 
the brain. A central assumption of the model is that individuals vary 
in their propensity for both sexual excitation and sexual inhibition and 
that ‘normal levels of inhibition are adaptive, reducing sexual respon- 
siveness in circumstances where sexual activity is best avoided. It is pre- 
dicted that high levels of inhibition will be associated with vulnerability 
to sexual dysfunction, and low levels with an increased likelihood of 
engaging in high-risk sexual behaviour. 

This faces us with the seemingly obvious, but fundamental, chal- 
lenge of deciding whether a loss of sexual interest or responsiveness 
is an understandable, or even adaptive, reaction to current circum- 
stances or is a result of ‘malfunction’ of the sexual response system, 
which can be appropriately called a ‘sexual dysfunction. This chal- 
lenge is also central to assessment that identifies the key factors 
causing the sexual problem and how they should best be treated. 
A strategy for carrying out such assessment, which we have called 
the ‘three-windows approach, will be outlined further. 


Epidemiology 


Many early epidemiological surveys assessed sexual problems that were 
relatively short term and did not ask respondents about whether they 
experienced any distress about symptoms. An example of this was the 
US National Health and Social Life survey, which asked if symptoms 
had occurred ‘for several months or more’ during the past year, but not 
whether the symptoms were distressing [8]. Forty-three per cent of 
women and 31% of men were identified as having a ‘sexual dysfunction. 
‘These figures were widely cited as evidence that sexual dysfunction is ‘a 
significant public health problem [8, p. 544]. 


More recent surveys, including the UK’s third National Survey 
of Sexual Attitudes and Lifestyles (Natsal-3), have asked about 
the duration of symptoms and about associated distress [21]. 
These findings have demonstrated that transient sexual difficul- 
ties are very common, but more persistent problems and distress 
about one’s sex life much less so. For example, 51.2% of women 
and 41.6% of men reported experience of one or more sexual 
problem during the previous year [21], but distress was reported 
by only 10.9% of women and 9.9% of men. Prevalence estimates 
for sexual problems also show marked reductions when dur- 
ation criteria are applied; in Natsal-3, of those reporting lacking 
interest in sex, 35.1% of men and 55.8% of women said that the 
problem lasted 6 months or more [22]. 

Prevalence rates for specific problems vary considerably. This, in 
part, can be attributed to variations in how sexual problems are de- 
fined and whether distress about symptoms is assessed. Most epi- 
demiologic research assessing sexual dysfunction was conducted 
in Western countries, but the limited evidence we have from other 
countries shows that estimated prevalence rates vary widely across 
cultures [23-25]. 

There has been more consistency across studies in the associations 
found between factors of possible aetiological relevance and sexual 
functioning. In women, sexual problems are more frequent in those 
with mental health problems and relationship difficulties [26, 27]. In 
a survey of American heterosexual women aged 20-65 years, 24.4% 
reported marked distress about their sexual relationship and/or 
their own sexuality [28]. The best predictors of distress were markers 
of mental health and the quality of the emotional relationship with 
the partner. Physical aspects of sexual response in women, such as 
arousal and orgasm, were relatively poor predictors. 

In men, age has a predictable negative effect on erectile function 
[29]. Most studies have also found an association between age and 
loss of sexual desire [30]. There is some evidence that PE is more 
commonly reported by younger men than older men [21]. 

The association between age and sexual problems in women is 
more complex. Whereas the level of sexual interest typically de- 
creases with age, older women are less likely to regard this as a 
problem [28]. An important predictor of sexual problems is whether 
a woman hasa current sexual partner and, among partnered women, 
whether the partner has sexual difficulties [31]. Although the post- 
menopausal decline in oestrogens is relevant to vaginal lubrication, 
other factors such as mental health and the quality of the sexual rela- 
tionship are more important determinants of sexual well-being than 
menopausal status [32]. 


Aetiology 


Before considering the factors that can cause sexual problems, it is 
worth underlining the important way that sexual function differs 
from most other physiological response systems. Although involving 
physiological mechanisms, sexual responses are most often experi- 
enced in the context of a relationship. This highlights the importance 
of keeping the interactive relationship components in mind when 
trying to assess and treat sexual problems. Sociocultural factors are 
also crucial to understanding how sexual problems are experienced 
[33]. Much of the focus in medical treatments of sexual problems 
has been on the individual patient, with relationship and sociocul- 
tural aspects largely ignored. The more specific aetiological factors 
can now be considered using the ‘three-windows approach [4]. 
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The first window-the current situation 


Through the first window, a variety of factors in the individual’s cur- 
rent relationship and situation may be relevant. Relationship prob- 
lems, particularly resentment and insecurity within a relationship, 
are of particular importance. For many individuals, feeling secure 
and being able to ‘let go’ are necessary for them to enjoy sex. Other 
factors that may be important include: poor communication be- 
tween partners about their sexual feelings and needs; misunder- 
standings and lack of information; unsuitable circumstances and 
lack of time, for example fatigue, lack of privacy; concerns about 
pregnancy or sexually transmitted infections; and low self-esteem 
and poor body image. 


The second window-vulnerability of the individual 


Although a wide range of factors can impact on our sexuality, it is 
also clear that individuals vary substantially in the extent to which 
they are affected by such factors, particularly in terms of an associ- 
ated inhibition of sexual response. Such vulnerabilities are likely to 
have been evident in earlier episodes in the current relationship or 
in earlier relationships. 


1. Negative attitudes. Long-standing attitudes, usually stemming 
from childhood, that sex is inherently ‘bad’ or immoral are likely 
to interfere with an individual’s ability to become involved in, 
and enjoy, a sexual relationship. 

2. Need to maintain self-control. In some individuals, difficulty in 
‘letting go’ sexually reflects a more general need to maintain self- 
control, particularly in the presence of another person. 

3. Earlier experience of sexual abuse or trauma. There is now an 
extensive literature on the impact of sexual abuse on subsequent 
sexual adjustment. Whereas the mediating mechanisms are not 
well understood and are likely to be complex, a history of such 
experience should be regarded as potentially relevant to current 
sexual difficulties. 

4. Propensity to sexual inhibition. The dual control model, dis- 
cussed earlier, has led to psychometrically validated measures 
of propensity to sexual excitation and inhibition in men [34] 
and women [35]. In men, as predicted, there is a clear as- 
sociation between low sexual excitation and/or high sexual 
inhibition propensity and erectile problems, but no associ- 
ation with PE [36]. In women, there is a strong relationship 
between sexual inhibition propensity and reports of sexual 
problems [37, 38]. Particularly important is one inhibition 
subscale (labelled ‘arousal contingency’) that reflects suscep- 
tibility for sexual arousal to be easily affected by situational 
factors, for example if the circumstances are not ‘just right. 
Although further research is needed, these measures of sexual 
inhibition and excitation may prove valuable in explaining 
patterns of impaired sexual response and helping in the selec- 
tion of appropriate treatment. 


The third window-—health-related factors that alter 
sexual function 


Mental health and sexuality 


Psychiatric problems are commonly associated with sexual prob- 
lems (for a review, see [4]). Reduction in sexual interest and, to 
some extent, sexual arousability is generally accepted as a common 
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symptom of depressive illness [39]. In contrast, sexual interest tends 
to be increased in states of elevated mood such as hypomania [40]. 

With anxiety, the clinical evidence is much more limited. Higher 
rates of sexual dysfunction in patients with anxiety disorders have 
been reported [41]. Individuals with panic disorder may be par- 
ticularly likely to report sexual problems. PE is a common sexual 
problem in men with social phobias [42]. 

Although most studies have focused on negative effects of mood 
disorders on sexual interest and response, there is evidence that a 
minority of individuals experience increased sexual interest during 
negative mood states. This paradoxical pattern has been reported 
in non-clinical samples of men [43] and women. [44] Although the 
origins of this pattern are not yet understood, it may be problematic 
in various ways, for example associated with sexual risk-taking or 
leading to sex being used as a mood regulator. 

The impact of schizophrenia on sexuality is complex. Sexual 
thoughts and behaviours are common in schizophrenia, and there 
may be a relative increase in sexual activity [45]. 


Physical health and sexuality 


Poor physical health is associated with decreased sexual activity and 
reduced sexual satisfaction at all ages [46]. The impact of poor phys- 
ical health on sexuality may be relatively non-specific. For example, 
loss of well-being and energy associated with chronic illness is likely 
to cause reduced sexual interest and arousability. Psychological re- 
actions to the illness or condition (for example, the effects of breast 
cancer on a woman's body image, and hence her sexual enjoyment) 
may also be important. In addition, there are a variety of ways in 
which health problems can directly affect sexual interest and/or re- 
sponse (for a review, see [4]): 


1. Damage to the neural control of genital response. This can in- 
volve peripheral mechanisms (for example, autonomic and per- 
ipheral neuropathy) or disease in the spinal cord (for example, 
multiple sclerosis). Injury or surgery causing nerve damage may 
be involved (for example, spinal cord injury, prostatectomy, or 
hysterectomy). The most likely consequences are ED in men 
and impaired vaginal response in women. Brain abnormalities, 
such as epilepsy or cerebral tumour, can affect central control of 
sexual response, the precise effect depending on the site of the 
abnormality or tumour. In some cases, the result is loss of sexual 
interest and arousability; rarely, there is disinhibition of sexual 
behaviour. 

2. Impairment of vascular supply of the genitalia. Genital response 
is dependent on increased arterial inflow, as well as alteration in 
venous outflow. 

3. Alteration of endocrine mechanisms affecting sexual interest, 
arousal, and response. In the male, any cause of lowered testos- 
terone (T) levels is likely to produce loss of sexual interest and, 
to a varying extent, impairment of erectile response. 


In women, lack of oestrogen is associated with impaired vaginal 
lubrication. The effects of sex steroids, either oestrogens or andro- 
gens, on sexual interest and arousability are much less predictable. 
The evidence is consistent with there being a proportion of women 
who depend on T for their normal level of sexual interest, but there 
are many women who can experience substantial reductions in T 
without obvious adverse sexual effects. 


Hyperprolactinaemia, usually resulting from pituitary adenomas 
and hypersecretion of prolactin, may be associated with loss of sexual 
interest and, in men, with ED. However, this is not always the case. 
The precise role of prolactin in human sexuality is not understood. 

Some diseases affect sexuality through more than one of the 
mechanisms described. Diabetes mellitus is a good example. In dia- 
betic men, ED can result from small-vessel vascular disease and also 
autonomic and peripheral neuropathology. Lowered sexual interest 
and impairment of genital response have also been reported in 
diabetic women. 


Side effects of medication 


We can now consider the main adverse sexual effects resulting from 
medication (for a review, see [47]). 


Antidepressants 


SSRIs, by inhibiting the 5-HT1A receptor, increase serotonergic 
transmission. The most predictable side effect, in both women and 
men, is inhibition of orgasm and ejaculation [48]. Other negative 
effects include reduced sexual interest and arousability, though 
they are less predictable and not always easy to distinguish from the 
sexual effects of the affective disorder being treated. Tricyclic anti- 
depressants also commonly produce sexual side effects. 


Antipsychotic medication 


These drugs, used for treatment of schizophrenia and other psych- 
otic disorders, involve a balance of dopamine antagonist and 5-HT 
agonist effects. The most common side effects are ED and ejacula- 
tory difficulties in men, and orgasmic dysfunction in women. 


Antihypertensive medication 


Many drugs used to treat hypertension interfere with male sexual 
response. Beta-blockers (for example, propranolol), by blocking 
smooth muscle relaxation and leaving alpha-1-induced contraction 
unopposed, commonly cause ED. Centrally acting antihypertensives 
(for example, guanethidine) also interfere with sexual response, 
impairing erection and blocking ejaculation. Clonidine, an alpha- 
2 agonist, can also cause erectile problems. In this case, the prin- 
cipal effect is likely to be reduced central arousal. For a review of 
the effects of antihypertensive medication on male sexual response, 
see [49]. 

The effects of drugs on women’s sexuality have received far less 
attention. Research has mostly focused on difficulties in achieving 
orgasm. As orgasm only occurs after sufficient sexual arousal, these 
effects may reflect impairment of arousal, but this has not been ad- 
equately assessed. Steroidal contraceptives, although associated with 
markedly reduced levels of free T, decrease sexual interest only in a 
minority of women [50]. 


In this section, we will first describe the principles of sex therapy and 
the main forms of psychological and pharmacological treatment, 
followed by an outline of the process of assessment and selection of 
a suitable treatment plan. There is growing recognition that integra- 
tion of psychological and pharmacological methods, with emphasis 


on the couple, may be the most appropriate treatment model for 
most couples [3, 51]. 


Sex therapy 


Although the approach first introduced by Masters and Johnson [52] 
has been adapted in various ways, the core treatment techniques re- 
main. Originally developed for helping couples, the techniques can 
be modified for use with individuals and with same-sex, as well as 
heterosexual, couples. 

The key elements of the therapeutic process are: 


1. Clearly defined tasks that the couple are asked to attempt before 
the next therapy session. 

2. Those attempts, and any difficulties encountered, are examined 
in detail. 

3. Attitudes, feelings, and conflicts that make the tasks difficult to 
carry out are identified. 

4. These are modified or resolved, so that subsequent achievement 
of the tasks becomes possible. 

5. The next tasks are set, and so on. 


The tasks are mostly behavioural in nature. They are chosen to 
facilitate the identification of relevant issues but, in some cases, are 
sufficient in themselves to produce change. The behavioural pro- 
gramme is in three parts. In the first part, the couple are asked to 
avoid any direct genital touching or stimulation and to focus on 
non-genital contact, alternating who initiates and who does the 
touching. These first non-genital steps are effective in identifying 
important issues in the relationship such as lack of trust or coun- 
terproductive stereotypical attitudes (for example, once a man is 
aroused, he cannot be expected not to have intercourse). Once this 
stage can be carried out satisfactorily, and related problematic issues 
dealt with, the programme moves on to the second part, which al- 
lows genital touching to be combined with non-genital touching, 
with penile-vaginal intercourse still ‘out of bounds: In this second 
part, more intra-personal problems, such as long-standing negative 
attitudes about sex or the sequelae of earlier sexual trauma, are likely 
to emerge. In the third part, a gradual approach to vaginal-penile 
contact and insertion is undertaken. Here the most relevant issues 
are ‘performance anxiety’ and fear of pain. 

As the behavioural tasks reveal key issues that need to be resolved 
before moving on to the next stage, a variety of psychotherapeutic 
approaches, including cognitive behavioural techniques, can be 
utilized. Although the stage at which particular issues emerge does 
vary from case to case, there is a tendency for problems identified 
through the ‘first window; particularly those related to lack of trust 
and unresolved resentment, to appear during the first stage of the 
programme. Intra-personal issues (for example, as seen through 
the ‘second window’) are more likely to be recognized during the 
second stage. 

The goals of therapy include helping the individual to accept and 
feel comfortable with his or her sexuality and helping the couple to 
establish trust and emotional security and to enhance their sexual en- 
joyment and intimacy. An important point is that these goals do not 
necessarily include reversal of specific sexual dysfunctions. There are 
exceptions; for example, there are specific behavioural techniques to 
deal with PE and orgasmic disorder (for details of these techniques, 
see [4]). However, the overriding principle is that, assuming there is 
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no abnormality of the basic physiological mechanisms involved in 
sexual response, normal sexual function will return once these goals 
are achieved. In cases where impairment of physiological mechan- 
isms does exist, the goals of sex therapy are still helpful and integrate 
well with the use of pharmacological treatment. 


Practical aspects 


Although sex therapy varies in duration, 12 sessions over 4- 
5 months is typical. The therapist adjusts to the particular needs of 
the individual or couple. Treatment begins weekly, with the interval 
between sessions extended once major issues like unexpressed re- 
sentment or communication problems have been dealt with. The last 
two or three sessions are spaced out over a few months, so that the 
couple have an opportunity to consolidate their progress and cope 
with any setbacks before termination. 


Other psychological treatments 


Apart from sex therapy, a range of other psychological treatments 
have been used to treat sexual dysfunction, either on their own or 
as a supplement to sex therapy. Two particular approaches that have 
generated the most interest and evaluation are cognitive behavioural 
therapy (CBT) and mindfulness-based intervention. There is a large 
literature supporting the important role for cognitive factors in the 
aetiology and maintenance of sexual disorders (for a review, see 
[53]), which suggests that CBT approaches could be effective. 


Pharmacological treatments for men 
Erectile disorders 


The introduction of phosphodiesterase type 5 (PDE-5) inhibitors 
had a major impact on the treatment of ED. After the approval of 
sildenafil in 1998, other PDE-5 inhibitors, for example tadalafil and 
vardenafil, became available. Although these drugs were effective for 
approximately 75% of men, there was evidence that many men did 
not continue with the medication [54]. In one study done in eight 
countries, 2912 men identified with ED were followed up; 58% of 
them had sought medical help for ED, but only 16% of men main- 
tained their use of PDE-5 inhibitors [54]. In addition, although these 
medications have a generally good safety and side effect profile, there 
are some contraindications to their use (for example, use of nitrates). 

Other pharmacological treatments that were widely used be- 
fore PDE-5 inhibitors became available, such as apomorphine and 
intracavernosal injections of phentolamine, used in combination 
with papaverine, are now primarily recommended for men with ED 
for whom PDE-5 inhibitors are either contraindicated or have been 
ineffective [51, 55]. 

Other drugs that are still in development to treat ED in men 
include bremelanotide, a melanocortin agonist and a synthetic 
peptide analogue of a naturally occurring hormone called alpha- 
melanocyte-stimulating hormone (MSH) [56]. 


Premature ejaculation 


Pharmacological treatments used for PE include SSRIs and 
clomipramine, a tricyclic antidepressant, and topical administra- 
tion of prilocaine/lidocaine. One drug—dapoxetine, a novel, short- 
acting SSRI—has received approval for the treatment of PE in over 
30 countries [3]. All of these medications can be taken either daily 
or on an as-needed basis. 
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Delayed or absent orgasm/ejaculation 


At the present time, there is no accepted pharmacological treatment 
for delayed or absent ejaculation or orgasm [57]. 


Low sexual desire 


The most treatable, but relatively infrequent, cause of loss of sexual 
desire in men is hypogonadism. Where androgen deficiency is evi- 
dent, T replacement is indicated. Ifloss of sexual desire is associated 
with hyperprolactinaemia, treatment with dopamine agonists, such 
as bromocriptine, is recommended. 


Pharmacological treatments for women 


Following the success of PDE-5 inhibitors in treating ED, there were 
attempts to use sildenafil to treat sexual arousal disorder in women, 
but the results were disappointing [58]. Attention shifted to treat- 
ment of low sexual desire in women and the use of T. Although 
Intrinsa, a T patch, was approved in 2006 by the European Medicines 
Agency, this was only for treatment of low sexual desire in women 
with surgically induced menopause. 

The first medication to receive approval by the US Food and 
Drug Administration for the treatment of hypoactive sexual desire 
disorder in premenopausal women was flibanserin, a medication 
with mixed effects on serotonergic and dopaminergic transmitter 
systems that was originally evaluated as an antidepressant but was 
ineffective [59]. Flibanserin was approved amid a great deal of con- 
troversy about its safety and efficacy. Two systematic reviews on the 
efficacy and safety of flibanserin reached inconsistent conclusions 
regarding its efficacy [60, 61]. 

Thereare other drugs in developmentto treat women’s sexual desire 
problems, including subcutaneously administered bremelanotide 
and medications that combine sildenafil and T or T and buspirone. 


Evaluation of treatments 


There have been many more studies testing the effectiveness of 
pharmacological treatments for sexual dysfunction than of psy- 
chological or combination therapies [62]. With the exception of 
research on female sexual pain disorders, there has been a serious 
shortage of outcome research on psychological treatments. This lack 
of research does not reflect a lack of growth in sex therapy or of new 
approaches [63]. 

Three systematic reviews of the efficacy of psychological inter- 
ventions to treat sexual dysfunction in men and women have 
been published [64-66]. For women, the strongest evidence 
exists for the use of CBT and mindfulness-based interventions 
in the treatment of low sexual desire [53]. For female orgasmic 
disorder, multimodal treatment programmes involving tech- 
niques such as directed masturbation, sexual skills training, and 
sex education can be effective approaches [53]. As mentioned, 
there have been several controlled outcome studies on psycho- 
logical interventions for sexual pain disorders, and these suggest 
that psychological therapies are effective treatments, supporting 
a biopsychosocial approach [67]. 

For men, there have been few outcome studies on psychological 
treatments for ED, but the limited evidence suggests that group or 
couple therapy is superior to individual therapy and that combined 


treatments (PDE-5 inhibitors and psychological therapy) are ef- 
fective [53]. For the treatment of PE, there is limited and incon- 
sistent evidence for the effectiveness of psychological interventions. 
For low sexual desire and delayed ejaculation, there are very few 
controlled outcome studies. 


Planning a treatment programme 


Assessment 


When couples present with sexual difficulties, one of them is usu- 
ally regarded as having the problem, but both partners should be 
carefully assessed, whenever possible. There are three stages to as- 
sessment: (1) to facilitate the decision about whether sex therapy is 
appropriate; (2) to identify issues relevant to the sexual problem that 
need to be resolved; and (3) to determine whether medication or 
other treatments are required. 

Keeping in mind the distinction between a sexual problem 
that is adaptive or appropriate, given an individual's current cir- 
cumstances, and one that is maladaptive (and can perhaps be 
considered a ‘dysfunction’), we can assess each individual’s case 
through the three conceptual ‘windows’ described earlier. Are 
there problems in the couple’s current relationship or situation 
which would make inhibition of sexual responsiveness in either 
partner understandable or adaptive? Does either individual give 
a history that suggests vulnerability to sexual problems? Are 
there are any mental or physical health issues or medication use 
in either partner that could be having a negative effect on sexual 
interest or response? 

Although not all of the details can be obtained during the ini- 
tial interview, assessment of the following topics should be carried 
out: the nature of the problem, including an assessment of the level 
of sexual interest and response in each partner; identification of 
other assessments that may be needed (for example, physical exam- 
ination, blood tests); commitment and motivation of each partner 
to improving the relationship; and mental state assessment of each 
partner. If both partners are present, each should be interviewed 
separately, following a conjoint interview. As far as possible, ques- 
tions should be asked about each individual’s sexual history, the na- 
ture of the current relationship, contraceptive use and reproductive 
history, and alcohol or recreational drug use. 

At the end of the initial interview, the clinician should provide a 
preliminary formulation of the nature of the problem and the types 
of intervention that may be helpful. Whatever treatment methods 
are used, it is important to continue to see the couple together to 
monitor progress and provide counselling, as needed. 

If there is no evidence of causal factors of the kind viewed 
through the ‘first’ or ‘second’ window, then a trial of pharmaco- 
logical treatment may be appropriate (although, as discussed, few 
pharmacological options exist for women). In such cases, a physical 
examination and, where relevant, laboratory investigations would 
normally be arranged before starting treatment. It is important to 
have a good ‘clinical baseline’ before embarking on pharmacological 
interventions. 

If there are any indications of issues, particularly of the kind that 
invoke inhibition of sexual response, which need to be resolved, 
then pharmacological interventions should not be considered as a 


first step. In many such cases, more assessment is required before 
these factors can be adequately assessed. There are then two op- 
tions: (1) further interview(s) to explore such issues; or (2) starting 
on a programme of sex therapy. The rationale for the second option 
is as follows. The initial two stages of sex therapy (that is, involving 
non-genital and then genital touching, with no attempts at vaginal 
intercourse) are particularly effective at identifying relevant issues 
underlying the problem. Furthermore, the process involved in those 
early stages is likely to benefit any sexual relationship, even those 
without obvious problems. After three or four sex therapy sessions, 
a reappraisal would be made and a decision taken about whether to 
continue with sex therapy alone or to combine it with an appropriate 
pharmacological method. For example, with a couple where the man 
has erectile problems, it is easier to assess the indications for the use 
of a PDE-5 inhibitor after the couple have gone through the first two 
stages of the programme where there is no ‘performance pressure’ 
and no need for an erection to occur. Similarly, it is often informative 
to see what impact sex therapy has on the individual who has com- 
plained of reduced sexual desire, before attempting to deal with the 
problem pharmacologically or hormonally. 

Ideally, the use of the pharmacological method should then pro- 
ceed in combination with the continuation of the sex therapy pro- 
gramme. In that way, a gradual transition to a satisfying sexual 
relationship can be achieved without renewed ‘performance pres- 
sures. If, however, progress continues to be made with sex therapy, 
then the addition of pharmacological interventions may not be ne- 
cessary. It should be explained to the couple that, whereas pharma- 
cological treatments often have beneficial effects on sexual response, 
they do not ‘cure’ the problem, which is likely to recur once the 
medication is stopped. 


Paraphilic disorders 


Clinical features 


The essential feature of a paraphilia is that it involves an intense, 
recurrent pattern of sexual interest, manifested in sexual fantasies, 
urges, or behaviour, to atypical objects or activities (typically those 
that do not involve sexual interaction with a consenting, phenotyp- 
ically normal, physically mature human partner) [2]. A paraphilia 
can be considered a disorder if it: (1) causes distress or other nega- 
tive impact to the individual or is harmful, or potentially harmful, 
to other people; and (2) is manifest over at least a 6-month period. 
There is high comorbidity between paraphilias and mood disorders 
and also consistent evidence that different paraphilias often co- 
occur [68, 69]. Substance use disorder, neurodevelopmental condi- 
tions, and mood disorders have been reported as highly prevalent 
among paraphilic offenders [70]. 

A large number of paraphilias have been mentioned in the litera- 
ture, but most of them are rare. We will focus here on the paraphilic 
disorders that are most common and listed in DSM-5. 


Voyeuristic disorder 


The key feature is a recurring pattern of intense sexual arousal 
from observing an unsuspecting person or couple in the process of 
undressing or interacting sexually. The voyeur usually masturbates 
while ‘peeping. 
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Exhibitionistic disorder 


This involves a pattern of recurrent and intense sexual arousal from 
exposing one’s genitalia to an unsuspecting person, experienced in 
fantasies, urges, or explicit behaviour. In some exhibitionists, the 
urge to expose themselves occurs occasionally, possibly at times of 
crisis or emotional distress. For others, the idea of exposing is always 
sexually stimulating and may feature in their masturbation fantasies. 


Frotteuristic disorder 


This involves recurrent and intense sexual arousal from touching or 
rubbing against a non-consenting person, which is experienced in 
fantasies, urges, or explicit behaviour. Such individuals are also more 
likely to experience exhibitionist and voyeuristic disorders, and to 
show mood and antisocial personality disorders. 


Sexual sadism disorder 


This is defined as recurrent and intense sexual arousal, as manifested 
by fantasies, urges, or behaviours, from the physical or psychological 
suffering of a non-consenting person. 


Sexual masochism disorder 


This involves recurrent and intense sexual arousal from the act of 
being humiliated, beaten, or bound. Individuals who engage in 
BDSM (bondage, dominance and submission, sadism, and maso- 
chism) which is consensual and not associated with distress would 
not be diagnosed with either sexual sadism disorder or sexual maso- 
chism disorder. 


Paedophilic disorder 


This term relates to an adult (16 years or older) being sexually at- 
tracted to a prepubertal child, engaging in sexual fantasies involving 
a child of either sex, and/or engaging in sexual activity with such 
children. In DSM-5, there is also the requirement that there must be 
at least a 5-year age difference between the individual and the child. 


Fetishistic disorder 


This refers to recurrent and intense sexual arousal in response to 
a specific part of the body or to the use of non-living objects, for 
example rubber or leather. 


Transvestic disorder 


This is a particular form of fetishism in which recurrent and intense 
sexual arousal arises from cross-dressing. There is evidence that 
some fetishistic transvestites end up transitioning to transgendered 
women. 


Classification 


In DSM-5, there were several changes in the classification of 
paraphilias. Firstly, a distinction was made between paraphilias and 
paraphilic disorders, the latter involving behaviour that is currently 
either causing distress or impairment to the individual with the 
paraphilia or a risk of harm to others [2]. There are eight paraphilic 
disorders listed in DSM-5: voyeuristic disorder, exhibitionistic dis- 
order, frotteuristic disorder, sexual masochism disorder, sexual 
sadism disorder, paedophilic disorder, fetishistic disorder, and 
transvestic disorder; there are also categories for ‘other specified 
paraphilic disorder’ and ‘unspecified paraphilic disorder. In the 
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revision of ICD (ICD-11), several disorders listed in ICD-10 (fet- 
ishistic disorder, fetishistic transvestism, and sadomasochism) were 
removed from the list of paraphilic disorders [71]. 

Leading up to DSM-5, there were controversial proposals to intro- 
duce new disorders such as hypersexual disorder [72] and paraphilic 
coercive disorder [73], but these were not approved by the American 
Psychiatric Association. There has also been a long running and 
highly complex debate about whether paraphilic disorders should 
be removed from the DSM altogether [74]. 


Epidemiology 


The prevalence of paraphilic disorders is largely unknown. A con- 
sistent finding, however, has been that most paraphilias and 
paraphilic disorders are much less common in women than in men. 

In a representative Swedish sample, 7.7% of respondents reported 
at least one incident of being sexually aroused by watching others 
having sex and 3.1% reported at least one occasion of being sexually 
aroused by exposing their genitals to a stranger [68]. 

In a representative Australian sample of 19,307 men and women 
aged 16-59 years, 1.8% of sexually active respondents (2.2% of men 
and 1.3% of women) had been involved in BDSM over the past year 
[75]. It is important to note that these prevalence estimates relate to 
paraphilic interests, and not to paraphilic disorders. 

In a recent online survey of 8718 German men, 4.1% reported 
sexual fantasies involving prepubescent children, 3.2% reported 
sexual offending against prepubescent children, and 0.1% reported 
a paedophilic sexual preference [76]. Reviewing the literature, Seto 
et al. [69] concluded that the upper-limit prevalence of paedophilia 
in community settings ranged from 1% to 3% in men. 

Langstrom and Zucker [77], in a representative sample of 2450 
Swedish men and women, asked “Have you ever dressed in clothes 
pertaining to the opposite sex and become sexually aroused by this?’ 
Positive answers were given by 2.8% of men and 0.4% of women. 
However, we have no idea how many of these individuals repeated 
this behaviour and, in particular, how many had a transvestic 
disorder. 


Aetiology 


‘There are many theories for the aetiology of paraphilias, with the most 
support for those involving conditioning or neurodevelopmental 
factors (for a review, see [78]). Because of the strong comorbidity 
between different paraphilic disorders, there may be common causal 
factors across paraphilias [69]. 

Although individuals with paraphilic disorders are most often 
seen in forensic and prison settings, they can also present in gen- 
eral psychiatric settings [69]. Regarding assessment, there are 
problems with reliance on self-report alone, and the diagnosis of 
a paraphilic disorder typically requires integrating many different 
sources of information, including a mental status examination to 
establish whether there are any concurrent psychiatric conditions 
and a careful sexual history [69]. It is important to assess whether 
the paraphilic interest or behaviour is stronger or weaker than 
normophilic interests or behaviour. Psychophysiological assessment 
of sexual arousal patterns, using penile plethysmography or viewing 
time of sexual stimuli, can be useful in this regard, particularly for 
individuals with a paedophilic disorder. For a review of the available 
self-report (through clinical interview or questionnaires) measures 
used in the assessment of paraphilias, see [69]. 


Management 


While there is no evidence that paraphilic disorders can be ‘cured’ 
[69], there is some support for pharmacological treatments such as 
anti-androgens and antidepressants to reduce sexual drive, in par- 
ticular for men with paedophilia [78]. Cognitive behavioural ap- 
proaches are often used, usually as one component in a multimodal 
treatment programme [78]. 


Evaluation of treatments 


There has been limited outcome research on treatments for the 
various paraphilic disorders. Although used for many years, based 
on a review of outcome studies, Zucker and Seto [78] concluded that 
there was no strong empirical evidence that pharmacological treat- 
ments to reduce sexual drive had any effect on rates of reoffending. 
To date, there have been very few large, controlled outcome studies 
of either pharmacological or psychological interventions for the 
treatment of paraphilic disorders. 


Conclusions 


Overall, sexual dysfunctions in one or other partner of a relationship 
are potentially treatable in many cases. The basic sex therapy approach 
described in this chapter can be effective on its own or in combination 
with other psychological interventions such as mindfulness-based 
therapy or with pharmacotherapy. Paraphilias, which are of most sig- 
nificance when they lead to illegal behaviour, are more difficult to treat. 
Further research is needed on these conditions. 
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In the past few years, the media have covered the lives of transgenders 
as never before. Caitlin Jenner was on the cover of Vanity Fair in July 
2015, and the Netflix series Orange is the New Black featured a trans 
character—Sophia Burset—embodied by a trans woman Laverne 
Cox. At the same time, multi-disciplinary gender clinics worldwide 
are reporting on rapidly growing numbers of individuals voicing a 
desire for gender-confirming treatment (GCT) [1-2]. 











The terminology used in this chapter is summarized in Table 116.1. 
The term ‘gender dysphoria (GD) was first mentioned by Norman 
Fisk (1974) [3] to describe discomfort with the assigned gender 
(assignment based on biological sex), leading to a desire for sex re- 
assignment surgery (SRS). Until recently, gender clinics were mostly 
consulted by trans women (individuals who were assigned male at 
birth but who identify as female) and trans men (individuals who 
were assigned female at birth but who identify as male) requesting 
counselling concerning the desired social transition from one gender 
role to the other and requesting medical interventions (hormone 
treatment and genital surgery) to transition from one sex to an- 
other. Evidence shows that GCT in individuals with GD is effective 
[4] and leads to an improvement in psychological symptoms, im- 
proved quality of life, and diminished GD [5]. Next to the symptom, 
and now an official DSM-5 diagnosis, of GD today, the clinician will 
note a spectrum of words used to describe individuals whose gender 
identity, gender role behaviour, and/or gender expression do not 
match (or ‘conform to’) current local expectations and stereotypes 
within the binary system of male-female in society (for example, 
genderqueer, gender variant, genderfluid, a-gender, non-binary, 
transgender, trans, transsexual). Today, not all individuals with GD 
presenting in gender clinics desire the full range of GCTs [6]. This 
confronts clinicians with the challenge of providing care in the ab- 
sence of evidence-based treatment for these individuals with non- 
binary identities [7]. This evolution of an increased awareness for a 
broad gender variance has led to the use of the term ‘transgender; a 
term coined by activist Virginia Prince, referring to a diverse group 
of individuals who cross or transcend culturally defined categories 





of gender, including transsexual people, cross-dressers, drag queens 
and kings, non-binary people, and gender variant people [8]. 

Two taxonomies are dominant in describing the phenomenology 
of GD—based on sexual orientation [9] and on age of GD onset [10]. 
Blanchard proposed two possibilities in GD development (with per- 
haps different aetiologies), based on sexual orientation [9]: a ‘homo- 
sexual’ and a ‘non-homosexual (relative to birth sex) subtype (Table 
116.2). Also, the non-homosexual group develops differently in trans 
women and men; for example, the age of first clinical contact only 
differed between homosexual and non-homosexual trans women, 
and not trans men [11]. Pioneers in transgender health care were 
under pressure to identify prognostic factors and select the right 
candidate for treatment, leading to a search for the ‘true transsexual, 
in whom minimal post-treatment regret would be expected. In the 
past, clinicians have been reluctant to offer treatment to the non- 
homosexual group, based on (limited) evidence that regret would 
be more prevalent in older individuals requesting treatment [11]. 
This categorization based on sexual orientation hence stirs contro- 
versy until today. Individuals with GD are sometimes cautious in 
disclosing the history of their GD or their sexual orientation, due 
to fear of being denied GCT, a concern historically not unfounded. 

Recently, categorization based on the age of GD onset has been 
used increasingly. Researchers define adults with GD who recall 
childhood cross-gender behaviour and identification as ‘early onset’, 
while adults in whom the indicators of GD arise during puberty, or 
much later, are considered ‘late onset’ [12]. This categorization re- 
frains from the older terms of homosexual and non-homosexual 
but uses the terms ‘androphilic’ (sexually attracted to men) and 
‘gynephilic (sexual attracted to women) to refer to sexual attraction 
without making assumptions about sex or gender. A recent review 
shows that categorization based on the anticipated post-treatment 
heterosexual behaviour of trans women and men (as proposed by 
Blanchard’s typology) might not be very predictive of a good out- 
come after GCT [10]. Contrary to early treatment protocols, sexual 
orientation is currently no longer a decisive factor in treatment de- 
cisions. Considering the potential psychosocial consequences of 
becoming part of not one minority group (the transgender com- 
munity), but two (the gay and lesbian and bisexual communities), 
and the need to assess the individuals skills in dealing with minority 
distress, sexual orientation might still be a topic in need of clinical 
attention. 


LEEY) 





SECTION 18 Gender dysphoria and sexual dysfunction 


Table 116.1 Summary of the terminology used in this chapter 


Sex The biological aspect of gender, mostly based on primary and secondary sex characteristics 


Gender 
Assigned gender 


The psychosocial aspect of sex, subjective experience 


Apart from assigning a legal sex (in most countries, male or female, based on a newborn's primary sex characteristics), children 


are also assigned a certain gender (role and identity), for example expectations on crying behaviour, having a calm or more 
aggressive temper, showing an interest in stereotypical clothing, toys, hairstyle, etc. 


Experienced gender 





Or gender identity; the subjective experience of being male, female, genderqueer, or otherwise 


Gender role Behaviours that are—within a certain culture and time—associated with being male or female and hence are expected from men 
and women 

Transgender An umbrella term referring to a group of individuals who cross or transcend culturally defined categories of gender, including 
transsexual people, cross-dressers, drag queens and kings, non-binary people, and gender variant people 

Cisgender Opposite of transgender; people whose gender identity matches the sex and gender assigned at birth 

Trans woman An individual assigned the male gender at birth who identifies as a woman 

Trans man An individual assigned the female gender at birth who identifies as a man 

Non-binary An umbrella term for all not exclusively male or female gender identities; also genderqueer, genderfluid, etc. 


Cross-dressing 


The act of assuming a gender expression commonly associated with another gender; does not necessarily imply a direct 


correlation with gender dysphoria, a transgender identity, or fetishist arousal 


Androphilia and gynephilia 


Terms used to describe sexual orientation without attributing a sex or gender assignment (in contrast to the terms gay, lesbian, 


or bisexual); androphilia describes sexual attraction to men or masculinity; gynephilia refers to sexual attraction to women or 


femininity 


During the past decades, gender identity and gender role diag- 
noses underwent category migration and renaming, both in 
the American Psychiatric Associations (APA) Diagnostic and 
Statistical Manual of Mental Disorders (DSM) and in the World 
Health Organizations (WHO) International Classification of 
Diseases (ICD) [13]. Transsexualism appeared for the first time in 
the DSM in 1980 (‘Gender identity disorder of childhood (GIDC) 
and transsexualism’) and was listed among the psychosexual dis- 
orders. In 1994, GIDC and transsexualism were merged into one 
diagnosis—‘Gender identity disorder’ (GID) —and clustered with 
paraphilias and sexual dysfunctions. In the preparation phase of 
DSM-5, a fundamental question was raised by some transgender 
activists and clinicians of whether the diagnosis of GID should 
be retained at all. Their main argument was that GID was not a 
mental disorder, a similar argument that led to the removal of 
homosexuality from DSM-II in 1973. The decision to keep the 
diagnosis within the DSM was based on two considerations: ac- 
cess to care (health care reimbursement) and reconceptualization 
of the diagnosis. The new concept of GD was introduced to em- 
phasize the fact that incongruence between one’s felt gender and 
assigned sex/gender (usually at birth) leading to distress and/or 
impairment was the core feature of the diagnosis [14]. 


Also in the eleventh revision of the ICD, GID will be replaced 
with the label of “Gender incongruence’ (GI) to a new section pro- 
visionally entitled ‘Conditions related to sexual health’ [15]. The 
main argument for shifting and renaming diagnoses is further 
destigmatization of trans people who are already at risk for discrim- 
ination and associated mental health problems. 


Prevalence 


While studies on the prevalence of both childhood GD and GD in 
adults are severely hampered by the uncertain methodology and 
diagnostic classification, a recent review calculated a meta-analytical 
general prevalence for GD in adults of 4.6 in 100,000 individuals, 
6.8 for trans women and 2.6 for trans men. The review also found 
an increase in reported prevalence over the last 50 years [1]. This 
might still represent an underestimation of the true prevalence, as 
most studies are based on the number of adult individuals seen at 
specialized gender clinics or on the number of legal sex changes. 
Recently, population-based data have become increasingly available 
[16-18]. Conron et al. estimated 0.5% of adults consider themselves 
as ‘transgender’ (for example, ‘a person born into a male body but 
who feels female or lives as a womar), based on a probability sample 
in the United States of 28,176 adults aged between 18 and 64 years 


Table 116.2 Differences between homosexual and non-homosexual subtype 


Homosexual Non-homosexual 


Stronger cross-gender identity in childhood Less cross-gender identity in childhood 
Younger age at first clinical contact Older age at first clinical contact 
Lower prevalence of being (or having been) married Higher prevalence of being (or having been) married 


Less sexual arousal while cross-dressing in adolescence Greater sexual arousal when cross-dressing in adolescence 


Reproduced from Psychiatry Res., 137(3), Smith Y, van Goozen S, Kuiper A, et al., Transsexual subtypes: clinical and theoretical significance, pp. 151-160, Copyright (2005), with 
permission from Elsevier Ireland Ltd. 


[16]. Kuyper and Wijsen reported on a prevalence of ‘gender incon- 
gruence’ (‘stronger identification with other sex as with sex assigned 
at birth’) of 1.1% in men and 0.8% in women. When also assessing 
a dislike of the body and a wish to obtain hormones/surgery, the 
percentages drop to 0.6% and 0.2%, respectively [17]. Obtaining a 
psychiatric diagnosis and the level of distress in population-based 
surveys and self-report instruments remains difficult, and hence, 
these numbers cannot unequivocally be interpreted as a true epi- 
demiological estimate of GD. These numbers of GI might well 
consist of a theoretical potential of individuals situated in the trans- 
gender spectrum that might eventually seek medical treatment. 
Valid information on the prevalence of childhood GD is currently 
lacking entirely. An estimate of gender non-conforming/gender 
variant behaviour can be made, based on Child Behavior Checklist 
(CBCL) studies, pointing at 2.6% of boys and 5.0% of girls who ‘be- 
have like the opposite sex’ and 1.4% of boys and 2.0% of girls who 
‘wish to be of the other sex’ [19]. Therefore, gender non-conformity/ 
gender variance appears to be more prevalent in girls than in boys. 
Interestingly, based on referral rates to gender clinics, the sex ratios 
for prepubescent children, adolescents, and adults have always been 
in favour of trans women. This might be a reflection of a historically 
lower societal acceptance of femininity in boys, compared to tol- 
erance for masculinity in girls. In fact, gender clinics are currently 
reporting a change in both prepubescent children and adolescents 
with GD, such that more trans boys are presenting for care [20-21]. 


Associated psychopathology 


The association between GD and psychopathology has been chan- 
ging ever since the first publications on transsexualism in the middle 
of the twentieth century. Transsexualism was previously considered 
as a severe symptom of a psychiatric disorder (for example, schizo- 
phrenia), and it was assumed that both axis 1 and axis 2 disorders 
were highly prevalent in transsexualism. The viewpoint now is that 
transsexualism/transgenderism should no longer be included in 
psychiatric manuals (see Diagnostics, p. 1192) [22]. Previous atti- 
tudes were based on anecdotal, rather than systematic, evidence. 
Gender non-conformity is no longer judged as inherently patho- 
logical or negative. Nevertheless, more recently, methodologically 
sound research has shown a high prevalence of depression and anx- 
iety symptoms in individuals with GD who are referred to gender 
clinics [23]. A review on the co-occurrence of mental health prob- 
lems and GD clearly showed that severe psychiatric problems, such 
as schizophrenia and bipolar disorder, are not more prevalent in in- 
dividuals with GD, compared to the general population. The preva- 
lence of affective disorders, on the other hand, is considered to be 
2-3 times as high as in the general population. Trans women are 
found to be more comparable to cisgender women, then to cisgender 
men, with regard to the prevalence and nature of associated psy- 
chopathology [24]. With regard to suicidality and self-harm, Zucker 
et al. reported on the results of 13 studies, showing that one in three 
adults with GD has experienced suicidal ideation or attempted sui- 
cide or engaged in suicidal or non-suicidal self-harm [14]. Results 
on the co-occurrence of axis 2 disorders and GD have been incon- 
sistent with regard to prevalence and the nature of the personality 
disorder [24]. Recently, there has been increasing interest in the link 
between GD and autism spectrum disorder (ASD). Van Der Miesen 
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et al. concluded that there is evidence for an overrepresentation of 
co-occurring GD and ASD, compared to the general population, 
and that possible aetiological factors that could account for this co- 
occurrence are speculative [25]. 

Still, there are some limitations with regard to research in this 
field. Firstly, the sample size is still limited and only individuals at- 
tending a gender clinic and who are willing to participate are in- 
cluded. Secondly, no randomized controlled trials are available, 
mainly due to ethical reasons. 

Several possible explanations for the overrepresentation of mental 
health problems in GD are formulated—experiencing an incongru- 
ence between one’s assigned gender at birth and perceived gender 
leads to dysphoria, the negative psychological effects of belonging 
to a minority group, with the risk for prejudice, discrimination, and 
victimization [14]. 


Aetiology 


As for studies on other psychological characteristics in humans, the 
role of nature (biological factors) vs nurture (psychosocial factors) 
in the development of GD has been a subject of debate. GD should 
be considered as an extreme variant of a gender-atypical develop- 
ment. This development starts from early childhood and is charac- 
terized by cross-gender behaviour and/or the stated wish to be of the 
other gender. Only a small proportion of children showing gender- 
atypical development fulfil the criteria for childhood GD, and with 
the onset of puberty (with the inherent physical changes, changing 
social interactions, and first romantic feelings), gender dysphoric 
feelings desist in the majority of children with GD [26]. Historically, 
psychosocial factors have accounted for the development of a cross- 
gender identity or GD. However, there is a lack of methodologically 
sound studies on the causal status of these factors in the develop- 
ment of GD. Furthermore, causal factors for the development of 
GD should already be present at a very early stage of development, 
that is, in the prenatal and/or early postnatal period [27]. Therefore, 
psychosocial processes are conceptualized as having a perpetuating 
role, rather than a causal role [14]. 

With regard to biological factors, most information is gained from 
genetic and brain imaging studies. Twin studies suggested a herit- 
ability of GD that varied between 62% [28] and 77% [29]. Heylens 
et al. found a significant difference between monozygotic twins and 
dizygotic twins with GD, suggesting a role for genetic factors [30]. 
In this study, almost 40% of the 23 monozygotic twins included were 
concordant for GD, in contrast to none of the 21 dizygotic twins. At 
present, no candidate genes have been found that can account for the 
development of GD. Studies have focused on genes involved in sex 
steroid biosynthesis, but results have been inconsistent. Genome- 
wide studies on GD patients could give more insight into the genetic 
basis for GD but are presently lacking. 

The effects of (prenatal) hormones on the brain can be visual- 
ized using brain imaging techniques such as magnetic resonance 
imaging (MRI) and diffusion tensor imaging (DTI). An overview 
of neuroimaging studies in patients with GD is given by Kreukels 
et al. [31] and Guillamon et al. [32]. Both groups concluded that the 
brain phenotypes of trans women and trans men differed in various 
ways from those of control men and women with feminine, mascu- 
line, demasculinized, and defeminized features. Evidence for these 
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phenotypes is strongest for early-onset gynephilic trans men and 
androphilic trans women, and absent for androphilic trans men and 
gynephilic trans women, suggesting that there are different develop- 
mental pathways for different subgroups (based on sexual orientation 
and age of GD onset). Guillamon et al. concluded that the origin of 
these phenotypes might be caused by atypical effects of sex hormones 
in specific cortical regions of trans men and trans women [32]. They 
hypothesized that these differences are due to differently timed cortical 
thinning in different regions. Functional alterations in the brain of in- 
dividuals with GD have also been described, using task-related imaging 
studies (for example, visuo-spatial tasks, verbal fluency tasks) [31]. 

Finally, post-mortem studies have found that in trans women, 
specific brain structures (the interstitial nucleus, the bed nucleus of 
the stria terminalis, and the intermediate nucleus of the hypothal- 
amus) have some sex-specific characteristics that are more similar to 
cisgender females than males [33-35]. 

In summary, gender identity development in general, and GD in 
particular, is the result of a complex interaction between multiple 
genetic and environmental factors. Genes encode hormones that 
exert an effect on sex-dimorphic neural structures during the pre- 
natal period. Environmental factors then could have a pivotal effect 
on gender-atypical development during childhood or adolescence. 
Since time windows for the prenatal development of the genitals and 
the brain are believed to differ, individuals with GD experience an in- 
congruence between their perceived gender identity and their body. 


Clinical management of gender-dysphoric adults 





Diagnostic assessment 


Most individuals attending a gender clinic arrive with a self-diagnosis. 
During the diagnostic phase, a detailed clinical interview is most useful 
to confirm the diagnosis of GD. Standardized measures can be helpful 
to capture multiple indicators of gender identity and GD. These meas- 
ures include the Utrecht Gender Dysphoria Scale (UGDS), and the 
Gender Identity/Gender Dysphoria Questionnaire for Adolescents 
and Adults (GIDYQ-AA) [48-49]. The latter uses a specific time frame 
and has similar items for males and females, in contrast to the UGDS. 
When associated psychopathology (see Role of the mental health pro- 
fessional, p. 1195) is likely, further diagnostic investigations can be im- 
portant, in order to initiate appropriate treatment. This can be crucial 
since (severe) psychiatric comorbidity is a predictor of regret and a 
worse outcome after GCT [5, 22]. In recent years, the diagnostic phase 
has shortened and the approach to gender-dysphoric patients has be- 
come more individualized. Living in the preferred/experienced gender 
[the so-called real life experience (RLE)] is no longer a requirement for 
hormonal treatment. A study by Lawrence et al. showed no association 
between the duration of preoperative RLE and any outcome measure 
after GCT, except for happiness with the result [50]. 

GD should be distinguished from simple non-conformity to stereo- 
typical gender role behaviour. Given the increased openness of atypical 
gender expressions by individuals, it is important that the clinical diag- 
nosis is limited to those whose distress and impairment meet the DSM- 
5 criteria. Another diagnosis that should be accounted for is transvestic 
disorder (characterized by the association between cross-dressing 
and sexual excitement, leading to distress and/or impairment). 
Occasionally, transvestic disorder is accompanied by GD, and in many 


cases of late-onset GD in gynephilic assigned males, transvestic behav- 
iour is a precursor. Individuals suffering from body dysmorphic dis- 
order, obsessive-compulsive disorder, and psychotic disorders can also 
present with symptoms that are similar to those typical for GD. Most 
often, these conditions can be differentiated from GD by the presence 
of other symptoms that are not related to the diagnosis of GD [22]. 


Role of the mental health professional 


Recently, there has been a debate with regard to the role of the mental 
health professional (MHP) in diagnosing GD and associated psy- 
chopathology. With the introduction of the Standards of Care for the 
Health of Transsexual, Transgender, and Gender-Non-conforming 
People, Version 7 (SOC-7), any health professional who is appropri- 
ately trained in behavioural health and competent in the assessment 
of GD is authorized to make diagnoses and treatment recommenda- 
tions [51]. This more liberalized approach is in contrast with the 
traditional position of MHPs as gatekeepers for adults with GD. It is 
too early to estimate which approach leads to the best outcome after 
GCT with regard to satisfaction with treatment and quality of life 
(for prognostic factors associated with outcome after GCT, see [8]). 
The SOC-7 states that GCT can be initiated if physical and/or mental 
health problems that co-occur with GD are sufficiently stabilized. 
The use of psychotherapy merely to change a person’s gender iden- 
tity has been considered unethical (SOC-7). Still, psychotherapy 
or counselling could be useful to treat concomitant mental health 
problems and to deal with other issues such as grief and loss, sexual 
concerns, or problems related to partner relationship. If a specific 
treatment is needed, for example more specific psychotherapy for 
persons with personality disorders or pharmacotherapy for mood 
disorders, patients can be referred to a specialized MHP [52]. 


Endocrinological interventions 


Based on a thorough assessment, a qualified MHP can write a re- 
ferral letter to start hormone replacement therapy (HRT). The 
SOC-7 outlines several criteria for commencing HRT: persistent, 
well-documented GD; the capacity to make a fully informed decision 
and to consent for treatment, having reached the age of majority in a 
given country; and if significant medical or mental health problems 
are present, they should be reasonably controlled [51]. The adminis- 
tration of exogenous endocrine agents to induce feminizing or mas- 
culinizing changes is a medically necessary intervention, for which 
the Endocrine Society has issued very specific guidelines [53]. A safe 
and effective hormone regimen should: (1) suppress endogenous 
hormone secretion as determined by the persons genetic/biological 
sex; and (2) maintain sex hormone levels within the normal range for 
the person's desired gender [53]. HRT in trans women may consist of 
blocking androgen action (with anti-androgens, for example GnRHa, 
cyproterone acetate, spironolactone, finasteride) to decrease mascu- 
linization (diminishing body hair, frequency and firmness of erec- 
tions, and muscle volume and strength) and oestrogens to achieve 
a more feminine presentation (breast development, softer skin, fe- 
male fat distribution). Unfortunately, elimination of the hormonally 
induced natal sex characteristics is rarely complete—HRT does not 
alter voice, hand, feet, or shoulder dimensions [54]. In trans men, 
HRT may consist of synthetic progestins (for example, lynestrenol, 
medroxyprogesterone acetate) to interrupt menstrual bleeding, and 
testosterone to obtain more masculine features (deepening of the 
voice, increasing muscle mass and strength, beard and body hair 


growth, decreased fat mass, male pattern baldness, etc.). Although 
generally, a good aesthetic result is achieved in trans men, the short 
stature and often broader hip configuration do not change with HRT. 


Surgical interventions 


Gender-confirming surgeries in trans women may include breast/ 
chest surgery (augmentation mammoplasty), genital surgery (pen- 
ectomy, orchidectomy, vaginoplasty, clitoroplasty, vulvoplasty), and/ 
or other interventions such as facial feminization surgery, liposuc- 
tion, lipofilling, voice surgery, thyroid cartilage reduction, gluteal 
augmentation, hair reconstruction, and various aesthetic proced- 
ures. It is recommended that trans women undergo HRT (min- 
imum of 12 months) prior to breast augmentation, to maximize 
breast growth and surgical results. For genital surgery, two MHP re- 
ferrals are recommended. Specifically for orchidectomy, the SOC-7 
recommends 12 continuous months of HRT to introduce reversible 
testosterone suppression before undergoing irreversible surgery. For 
vaginoplasty, it is additionally recommended that the GD individual 
lives for 12 months in the congruent gender role. Typically, it is ad- 
vised to stop all HRT at least 2 weeks prior to any surgical procedure, 
as oestrogens in particular cause an increased risk of thrombo- 
embolic complications [51, 55]. 

In trans men, gender-confirming surgeries may include subcuta- 
neous mastectomy, hysterectomy/ovariectomy, vaginectomy, recon- 
struction of the fixed part of the urethra (if isolated, metoidioplasty), 
scrotoplasty, phalloplasty, and implantation of erection and/or tes- 
ticular prostheses. It is recommended that trans men undergo HRT 
(minimum of 12 months) prior to ‘internal’ genital surgery (hyster- 
ectomy/ovariectomy), to introduce a period of reversible oestrogen 
suppression before irreversible surgery is performed. For ‘external’ 
genital surgery (metoidioplasty), it is additionally recommended 
that the GD individual lives for 12 months in the congruent gender 
role [51, 56]. 


Fertility 


Although neglected for a long time, fertility issues have become 
an important topic in the care of transgender persons. The SOC-7 
clearly emphasizes the need to discuss fertility options prior to any 
treatment or medical intervention, since GCT (especially genital 
reconstructive surgery) often has irreversible effects with regard to 
fertility. The development of new reproductive medicine techniques 
creates opportunities for preserving fertility in transgender persons. 
Before, losing fertility was accepted as the price to pay for transi- 
tioning [57]. For trans women, sperm cryopreservation is the sim- 
plest and most reliable method. Success rates regarding pregnancy 
are similar to preservation in cisgender patients. Fertility options in 
trans men include embryo cryopreservation, oocyte cryopreserva- 
tion, and ovarian tissue cryopreservation. Each option requires the 
use of partner sperm or donor sperm and a recipient uterus. Success 
rates are lower, compared to trans women, and depend, besides spe- 
cific expertise of a specialized fertility centre, also on the age of the 
patient at the moment of cryopreservation. An interruption of cross- 
sex hormones for 3 months is necessary to restore fertility, both in 
trans women and trans men [58]. 


Sexual health 


Sexuality and relationships are vital parts of almost everyone's 
quality of life, including GD individuals. In the past, GD was often 
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considered to be a ‘hyposexual’ state. Most individuals consulting 
a gender clinic (80%) have had sexual experiences with a partner. 
About halfalso involved their genitals in partnered contact, although 
only a minority (10-15%) indicated pleasurable genital sensations. 
Also, most have experience with masturbation [59]. Thus, the idea 
of GD individuals as not being sexual creatures is an oversimplifica- 
tion. Sexual activity is there before, during, and after GCT. Hence, 
both health care providers and individuals considering whether to 
undergo GCT should have clear information on the potential sexual 
effects of treatment (for a more extensive review, see [60-61]). 

While one might reduce the potential sexual effects of GCT to 
the effects of sex steroids (oestrogens in trans women inhibiting 
sexuality; testosterone in trans men stimulating sexuality), other 
factors such as better self-esteem, feeling more comfortable with 
the new genitalia, and being able to allow more sexual explor- 
ation should not be disregarded. With regard to sexual desire, 
when trans women and men are asked—in retrospect—whether 
treatment has decreased, increased, or not affected their sexual 
desire, remarkable results emerge. The majority of trans women 
(70%) report decreased sexual desire after treatment (hormones 
and surgery), while a minority report an increase (10%) or no 
effect (20%). For trans men, the picture is reversed; the majority 
report an increase (70%), and a minority a decrease (10%) or no 
effect (20%) [60]. Most trans women and men welcome these 
changes, as they value sexual functioning more closely resem- 
bling the ‘typical’ functioning of their identified gender [62]. 
A subgroup of trans women (one in five) experience distress 
from decreased sexual desire. It is unclear to what extent this sub- 
group struggle with developing sexual desire which is now less 
testosterone-driven and more driven by mental processes. In any 
case, GCT always demands sexual (re)development by the indi- 
vidual and within a partnership. Distress about both decreased 
(5%) and too much sexual desire (3%) is much less of a concern 
in trans men. Sexual arousal is found to increase after GCT [63], 
although it should be noted that trans women with a skin-lined 
vagina always need a lubricant for penetration. Trans men also 
cannot obtain an erection spontaneously, even with an erection 
prosthesis. Percentages for ‘orgasm capacity’ in trans women, the 
best studied aspect of sexuality in GD individuals, ranges from 
27% to 100% and differs between countries and used techniques 
[61]. It should, of course, be noted that obtaining an orgasm is 
not only about functional nerves and adequate hormone therapy, 
but as much about knowledge, skill, and experience. Most trans 
men are able to reach orgasm after having a phalloplasty [63-64). 
So both trans men and women appear to experience increased 
orgasmic functioning after GCT. 





Post-treatment adjustment and prognostic factors 


As GCT is, at least partially, an irreversible process, it is of major 
importance to look at outcome with regard to feelings of GD, 
quality of life, and psychological functioning post-treatment. 
A systematic review by Murad et al. showed that, after GCT, 80% 
of individuals diagnosed with GD reported significant improve- 
ment in gender-dysphoric feelings and 78% reported significant 
improvement in psychological symptoms [5]. Also with regard 
to quality of life and sexual function, 80% and 72%, respectively, 
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reported an improvement. Djehne et al. reported on 11 longitu- 
dinal studies investigating outcome of psychiatric disorders and 
psychopathology post-treatment and found that in the majority 
of the studies, scores on questionnaires measuring psychopath- 
ology and GD were similar to normative data [24]. In general, 
however, longitudinal studies on the effects of GCT are meth- 
odologically weak, and follow-up duration is limited. Only one 
study has a duration time of >10 years, showing an overall posi- 
tive evaluation [65]. More discouragingly, a Swedish cohort study 
(average follow-up duration of 10.4-11.4 years) showed that 
overall mortality for sex-reassigned persons was higher during 
follow-up than for cisgender controls, particularly death from 
suicide. Sex-reassigned persons also had an increased risk for 
suicide attempts and psychiatric inpatient care [66]. The majority 
of the patients included in this study were diagnosed in the 70s 
and 80s and, along with altered societal attitudes towards persons 
with different gender expressions, received less qualitative care 
both for their GD and for co-occurring mental health problems. 

Positive prognostic factors are early-onset GD and a homo- 
sexual subtype [5], being a trans man [5, 67], and a younger age at 
assessment [68], although results are inconsistent. Pre-existing psy- 
chopathology [5] and poor results of SRS [4] tend to have worse 
prognosis. Furthermore, inadequate social functioning, indicated by 
periodical or full dependence on social assistance, and poor support 
from the individual's family are considered to be negative predictive 
factors [69]. 

Regret after GCT, defined as GD in the new gender role and after 
GCT, and the explicit wish to revert to his/her original gender role is 
estimated to occur in <1% in trans men and <1-1.5% in trans women 
[70]. Risk factors for regret are: inadequate diagnosis of GD and/or 
major psychiatric comorbidity, absent or disappointing RLE, and 
disappointing surgical results [22, 70]. De Cuypere and Vercruysse 
concluded that inadequate diagnosis of GD and major psychiatric 
comorbidity are the predominant indicators for regret [69]. 

The question remains as to which part of GCT is essential with re- 
gard to improvement of psychological functioning: RLE in the pre- 
ferred gender role, hormonal therapy, or SRS. Heylens et al. showed 
that after initiation of hormonal treatment, the level of psychological 
stress, as measured by Symptom Checklist-90 (SCL-90), became 
comparable to a general population control. SRS did not further 
change the level of psychological distress [71]. 

In general, GCT has proven to be the best method for treating 
GD. Clinicians should stay mindful of actual or potential psychiatric 
problems before the start of GCT and after its completion. 
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For no realm of psychopathology has a categorical approach to 
classification and diagnosis proved more wanting than for per- 
sonality pathology. A meta-analytic review of 177 studies with 
a combined sample of over 500,000 participants on the latent 
structure of various types of psychopathology [1] found little 
persuasive evidence for the existence of taxa (categories) for any 
of the personality disorders (PDs) classified by the DSM, with the 
possible exception of schizotypal personality disorder (STPD). 
Further, in a 2007 survey of PD experts, 87% believed that per- 
sonality pathology was dimensional in nature [2]. Although the 
definition of PD categories by explicit diagnostic criteria since 
DSM-III has had beneficial effects in documenting substantial 
prevalence in community and clinical populations, high rates 
of associated social and occupational impairment, and a more 
chronic course for other mental disorders co-occurring with 
PDs, the approach embodied by the DSMs has also been fraught 
with problems. These include extensive co-occurrence of PDs, 
considerable heterogeneity within PD categories, arbitrary diag- 
nostic thresholds without empirical bases, temporal instability, 
limited validity and clinical utility, and incomplete coverage of 
the full range of personality pathology [3]. Dimensional models 
of personality pathology have been proposed since the publica- 
tion of DSM-III as alternative representations of PDs [4], with 
the goal of rectifying some of these problems. Debate and con- 
troversy over the best dimensional representations of personality 
pathology have played out over the past three or more decades, 
but considerable consensus exists now about fundamental core 
dimensions. 

In this chapter, we will review the core personality trait domains 
underlying psychopathology in general, and personality pathology 
specifically. We will describe the core dimensions of personality 
functioning that can distinguish between personality styles and 
PDs and between PDs and other types of psychopathology. We will 
then review the development and longitudinal course, the impact 
on health and psychosocial functioning, and the clinical utility of 
core personality dimensions. Finally, we will illustrate how trad- 
itional PD subtypes can be rendered in terms of domains and fa- 
cets of personality functioning and of personality traits in a ‘hybrid’ 
dimensional-categorical model. 











Co-occurrence of categorical mental disorders is very common in 
clinical populations and in the community, because certain types 
of signs, symptoms, and personality traits tend to vary together. 
This co-variation has led to attempts to understand the organiza- 
tion of psychopathology by identifying fundamental dimensions of 
psychopathology that undergird groups of disorders using factor 
analyses of disorders and their manifestations. Two broad dimen- 
sions, or ‘spectra, of psychopathology, called internalizing and ex- 
ternalizing, have been identified that encompass a large number of 
non-psychotic mental disorders [5]. The process of internalizing in- 
volves the expression of mental problems by negative feelings and 
behaviours directed at oneself and includes symptoms of depression 
and anxiety, for example; externalizing involves the expression of 
such problems by negative feelings and behaviours directed at other 
people or things in one’s environment and includes symptoms of ag- 
gression, defiance, and violence. A third broad spectrum of psycho- 
pathology has been identified that includes schizophrenia and other 
psychotic disorders and bipolar I disorder, and has been referred to 
as a thought disorder spectrum [6]. These spectra are fundamentally 
dimensional in nature, because a person can exhibit more or less 
internalizing, externalizing, or disordered thinking, rather than an 
‘all or nothing; ‘present or absent’ (that is, categorical) expression. 
Spectra also represent general, consistent tendencies or proclivities 
to think, feel, and act in certain ways, and thus reflect personality 
dispositions, that is, ‘enduring patterns of perceiving, relating to, and 
thinking about the environment and oneself that are exhibited in a 
wide range of social and personal contexts’ [7, p. 647]. 

Disorders within these three broad domains have been shown 
to not only co-occur, but also to share genetic, environmental, and 
temperamental risk factors, exhibit common cognitive and emo- 
tional processing abnormalities, and respond to similar treatments, 
even though they are assigned to different diagnostic classes in 
DSM-5. A classification of mental problems based on empirical as- 
sociations can include both symptoms—relatively transient forms 
of psychopathology—and maladaptive personality traits that are 
thought to form the stable core of many mental disorders. Thus, per- 
sonality trait domains can be viewed as the dimensional underpin- 
nings of the meta-structure of psychopathology in general. 
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An extensive literature shows that personality constructs are 
organized empirically into 3-5 broad domains [8]. According to 
the most widely studied five-factor model (FFM) of personality, 
these domains are labelled as neuroticism (a tendency to be de- 
pressed, anxious, and stress-reactive), agreeableness (oriented 
towards empathy and getting along with other people), extra- 
version (a disposition to be outgoing, friendly, and emotionally 
positive), openness (a tendency to be curious and imaginative 
and to try new things), and conscientiousness (a tendency to 
be orderly and achievement-oriented). These domains organize 
both normal and abnormal personality [9], because normal- and 
abnormal-range personality variations are continuous with each 
other and there is no compelling evidence that they differ in kind, 
as opposed to degree [10]. In working to systematically introduce 
an empirically based trait assessment into DSM-5, the DSM-5 
Personality and Personality Disorders (P&PD) Work Group de- 
veloped a model that was consistent with the well-established 
FEM but was an extension that specifically delineated and encom- 
passed the more extreme and maladaptive personality variants 
necessary to capture the maladaptive personality dispositions of 
people with PDs [11]. Although the Work Group’s model was not 
placed in Section II ‘Diagnostic criteria and codes’ of DSM-5, it 
appears as an ‘alternative’ dimensional-categorical model of PDs 
in Section III ‘Emerging measures and models. 

The alternative DSM-5 model of personality disorders (AMPD) 
describes personality pathology in terms of five broad trait domains, 
within the internalizing, externalizing, and thought disorder spectra. 
The domains and their definitions are as follows [7, pp. 779-81]: 


1. Negative affectivity: frequent and intense experiences of high 
levels of a wide range of negative emotions (for example, anx- 
iety, depression, guilt/shame, worry, anger, etc.) and their behav- 
ioural (for example, self-harm) and interpersonal (for example, 
dependency) manifestations. 

2. Detachment: avoidance of socio-emotional experience, including 
both withdrawal from interpersonal interactions (ranging from 
casual, daily interactions to friendships to intimate relation- 
ships) and restricted affective experience and expression, par- 
ticularly limited hedonic capacity. 

3. Antagonism: behaviours that put the individual at odds with 
other people, including an exaggerated sense of self-importance 
and a concomitant expectation of special treatment, as well as a 
callous antipathy towards others, encompassing both unaware- 
ness of others’ needs and feelings and a readiness to use others in 
the service of self-enhancement. 

4. Disinhibition: orientation towards immediate gratification, 
leading to impulsive behaviour driven by current thoughts, feel- 
ings, and external stimuli, without regard for past learning or 
consideration of future consequences. 

5. Psychoticism: exhibiting a wide range of culturally incongruent, 
odd, eccentric, or unusual behaviours and cognitions, including 
both thought process (for example, perception, dissociation) and 
content (for example, beliefs). 


The personality trait domain of negative affectivity is the per- 
sonality component of the internalizing spectrum of psychopath- 
ology. The trait domains of disinhibition and antagonism are the 
components of the externalizing spectrum. The trait domain of 


psychoticism is the component of the thought disorder spectrum. 
A trait domain of detachment corresponding to a spectrum of 
pathological introversion has been found in many studies of the 
structure of mental disorders that have included PDs and/or patho- 
logical personality traits [12-14]. 

The most recent description of the PD proposal for ICD-11 also 
incorporates a personality trait domain schema for describing the 
variation in PD presentations [15]. The five trait domains included 
are negative affectivity, detachment, dissocial, disinhibition, and 
anankastic. Three of these domains are identical to the AMPD model, 
and dissocial corresponds closely with the AMPD antagonism do- 
main, leaving the only difference between AMPD psychoticism and 
ICD-11 anankastic trait domains. The AMPD includes psychoticism 
to represent STPD traits in the model, and other structural studies 
have found that DSM-IV cluster A PDs load positively onto the 
thought disorder spectrum. Since STPD is not classified as a PD 
in ICD-11, psychoticism may be less necessary to capture the full 
range of PD in the ICD. Unlike the ICD-11 proposal, the AMPD 
conceptualizes anankastic (obsessive-compulsive) personality traits 
as the opposite pole of disinhibition (that is, low levels), although 
some empirical studies of the AMPD have suggested that it might 
potentially be better characterized as an independent dimension as 
in the ICD proposal [16]. Nonetheless, the clear similarity in these 
two independently developed approaches to the classification of PD 
speaks further to the consensus around the dimensional structure of 
pathological personality. 

A number of researchers (for example, [8, 17]) have suggested that 
PD traits might be best represented as an integrative hierarchy, with 
broad dimensions of personality situated at the top of this hierarchy 
and specific trait constructs described at lower levels—with these 
lower-level constructs providing important bridges between per- 
sonality pathology and syndromal disorders. Thus, each of the five 
broad trait domains of the AMPD includes from three to six more 
specific component trait facets. The trait facets and their definitions, 
used for the formulation of specific PDs according to the AMPD, can 
be found in DSM-5 [7, pp. 779-81]. 

PDs vary in their manifestations and complexity. Some PDs are re- 
lated to internalizing mental disorders, such as depressive or anxiety 
disorders, because they are characterized primarily by traits of nega- 
tive affectivity, for example depressivity or anxiousness. Dependent 
personality disorder (DPD) is an example of a PD within the intern- 
alizing spectrum of psychopathology. Some PDs are related to exter- 
nalizing disorders, such as disruptive, impulse control, and conduct 
disorders or substance use disorders, because they are characterized 
by traits of disinhibition, for example impulsivity or risk-taking, 
and/or of antagonism, for example callousness, deceitfulness, or 
grandiosity. Antisocial personality disorder (ASPD) is an example of 
a PD within the externalizing spectrum of psychopathology. Some 
PDs are related to psychotic disorders and are characterized by traits 
of psychoticism, for example cognitive/perceptual dysregulation or 
unusual beliefs and experiences, with STPD as an example. Finally, 
some PDs represent constellations of broad domains; borderline 
personality disorder (BPD) has both internalizing and externalizing 
characteristics, and avoidant personality disorder (AVPD) is charac- 
terized by a combination of internalizing and detachment. 

Appreciation of the relationship of pathological personality trait 
domains with the meta-structure of psychopathology in general 
makes the ubiquitous phenomenon of mental disorder comorbidity 


understandable. Because symptom disorders and PDs share 
underlying predispositions, certain patterns of co-occurrence of cat- 
egorical disorders can be expected to be observed more often than 
others. To represent psychopathology as co-occurring disorders 
when some more fundamental process is operative, however, ob- 
scures the search for aetiology and pathophysiology and complicates 
treatment selection [18]. 


Core dimensions of personality functioning 





Since the same personality trait domains underlie both symptom 
disorders—such as depressive, anxiety, disruptive behaviour, and 
psychotic disorders—and PDs, core aspects of personality path- 
ology that would distinguish PDs from other types of pathology, as 
well as from non-pathological personality ‘styles, need to be iden- 
tified. A PD is defined by general criteria in Section II of DSM-5 as 
an ‘enduring pattern of inner experience and behaviour that devi- 
ates markedly from the expectations of the individual’s culture, is 
pervasive and inflexible, has an onset in adolescence or early adult- 
hood, is stable over time, and leads to distress or impairment’ [7, 
p. 645]. The patterns are said to manifest in two or more of the fol- 
lowing areas: cognition, affectivity, interpersonal functioning, and im- 
pulse control. Because these features are not specific to PDs and may 
characterize other chronic mental disorders, the DSM-5 Section III 
AMPD includes a more specific set of general criteria with which 
to define and identify PDs. According to this empirically derived 
model, PDs are characterized by impairments in personality func- 
tioning, including core functions of identity, self-direction, empathy, 
and intimacy, in combination with the presence of pathological per- 
sonality traits [7, p. 761]. Impairment in personality functioning 
occurs on a continuum of severity, and greater severity of impair- 
ment in these core functions is proposed to both distinguish PDs 
from other types of psychopathology and from normal personality 
functioning. Given the presence of such impairment, various con- 
figurations of pathological personality traits can describe the myriad 
manifestations in the presentations of personality pathology. 

The history of the study of personality is replete with classification 
models that are unitary, dimensional severity models [19]. From 
Sir Francis Galton’s model of ‘good and bad temper’, through James 
Cowles Prichard’s concept of ‘moral insanity’ and theorists such as 
Piaget, Erikson, and Loevinger, many personality-oriented writers 
emphasized a single developmental continuum over which individ- 
uals could vary, based on a principle of maturation that reflected 
greater (or lesser) degrees of self-control and prosocial behaviour. 
Along similar lines, Kernberg’s construct of ‘personality organiza- 
tion’ represents a classification of character pathology arrayed along 
a severity continuum, with a realistic and stable sense of identity and 
of the experience of others reflecting the healthy end of this con- 
tinuum. In fact, the significance of a severity gradient in evaluating 
personality problems has been described for far longer than PD 
categories themselves. Even according to the FFM, which posits 
that these personality factors are largely independent of each other, 
different DSM PDs consistently have been shown to exhibit quite 
similar profiles, consisting of high neuroticism, low agreeableness, 
and low conscientiousness [20]. 

In a literature review of clinician-administered measures for as- 
sessing personality functioning performed for the DSM-5 P&PD 
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Work Group, Bender et al. [21] found that global measures of 
personality pathology all referenced fundamental impairments 
in self and interpersonal functioning—impairments that exist 
on a continuum of severity, can be measured reliably, and have 
considerable clinical utility in determining the type and degree 
of personality pathology, planning treatment interventions, and 
anticipating treatment course and outcome. To capture this se- 
verity continuum, a 5-point scale of impairment in personality 
functioning, ranging from little or none (0) to some (1), moderate 
(2), severe (3), and extreme (4) impairment and consisting of a 
global rating of self (identity and self-direction) and interpersonal 
(empathy and intimacy) functioning—the Level of Personality 
Functioning Scale (LPFS)—was developed. It was tested in sec- 
ondary data analyses in over 2000 patients and community mem- 
bers, and the construct was found to relate to the probability of 
receiving any PD diagnosis and the total number of DSM-IV PD 
features manifested, as well as the probability of receiving two or 
more PD diagnoses as determined by semi-structured diagnostic 
interviews [22]. A subsequent study [23] determined that a clin- 
ician rating of ‘2 (moderate)’ impairment in personality func- 
tioning on the LPFS identified clinician-diagnosed DSM-IV PDs 
with solid sensitivity (0.846) and specificity (0.727), a cut-point 
which was incorporated into the DSM-5 AMPD’s general criteria 
for a PD. This severity continuum is designed to capture the vari- 
ation between PDs (which may, on average, vary in the degree of 
impairment typically present), but also the heterogeneity within 
PD categories (where some manifestations of a specific PD may be 
more severe than others), and to be sensitive to change. The DSM-5 
AMPD definitions of the core self and interpersonal functions are 
as follows [7, p. 762]. 


Self 


1. Identity: experience of oneself as unique, with clear boundaries 
between self and others; stability of self-esteem and accuracy 
of self-appraisal; capacity for, and ability to regulate, a range of 
emotional experience. 

2. Self-direction: pursuit of coherent and meaningful short-term 
and life goals; utilization of constructive and prosocial internal 
standards of behaviour; ability to self-reflect productively. 


Interpersonal 


3. Empathy: comprehension and appreciation of others’ experi- 
ences and motivations; tolerance of differing perspectives; 
understanding the effects of own behaviour on others. 

4. Intimacy: depth and duration of connection with others; desire 
and capacity for closeness; mutuality of regard reflected in inter- 
personal behaviour. 


The ICD-11 PD proposal also includes a dimensional personality 
pathology severity measure as its centrepiece. The scale would dis- 
tinguish personality difficulty (not a disorder) from mild, moderate, 
and severe PD. The ICD-11 scale is to be based on interpersonal 
functioning only, without a self functioning component. However, 
recent research has demonstrated that self pathology adds incre- 
mental validity over interpersonal pathology in predicting overall 
severity of personality pathology [24], suggesting that the AMPD 
approach to characterizing severity may ultimately prove to be 
more valid. 
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In addition to clinician-rated measures (reviewed in [21]), self- 
report instruments to assess personality functioning also empha- 
size a self-other perspective. For example, the Severity Indices of 
Personality Problems (SIPP) [25] measures five domains of per- 
sonality functioning: identity integration, self-control, relational 
functioning, social concordance, and responsibility. The General 
Assessment of Personality Disorder (GAPD) [24] measures self 
pathology linked to failures in the development of an integrated 
self system or structure, and interpersonal pathology linked to 
failures in the capacity for intimacy, attachment, and co-operative 
behaviour. 





Development and longitudinal course 


Temperament can be viewed as personality traits that are present 
very early in life and appear to have biological origins. One widely 
studied model of childhood temperament is a three-factor model 
that includes negative affectivity, extraversion/surgency, and ef- 
fortful control [26]. These personality trait domains are structurally, 
hierarchically, and developmentally related to the FFM domains de- 
scribed previously, in that neuroticism and negative affectivity and 
extraversion and extraversion/surgency are analogous constructs, 
conscientiousness and agreeableness develop from effortful control, 
and openness develops from extraversion [5]. So some continuity 
between basic temperament and personality trait domains seems 
likely, though relationships remain to be established through more 
longitudinal research. 

Traditionally, personality pathology has not been assessed 
or diagnosed in children or young adolescents for a variety of 
reasons—personality was considered to be in flux; some immature 
attitudes and behaviours are developmentally appropriate, and diag- 
nosis could be stigmatizing. DSM-5 Section II describes the onset of 
PDs as ‘traced back to at least adolescence or early adulthood’ and 
the course to be ‘stable and of long duration’ [7, p. 647]. However, 
other than ASPD, which has a minimum age of 18 years, PDs ac- 
cording to DSM-5 can be diagnosed in children or adolescents if an 
‘individual’s particular maladaptive personality traits [italics added] 
appear to be pervasive, persistent, and unlikely to be limited to a par- 
ticular developmental stage’ [7, p. 647]. Most mental disorders have 
an onset in some form in childhood or adolescence, and homotypic 
and heterotypic continuity characterize the course of psychopath- 
ology across the lifespan [27]. 

Dimensional approaches to conceptualization and measurement 
of pathological personality in children are being developed. A model 
including emotional instability, introversion, compulsivity, and dis- 
agreeableness has been proposed [28]. Child clinical researchers 
often advocate for assessing adolescents for personality pathology, 
and some have specifically endorsed the DSM-5 AMPD as an ap- 
proach preferable to the standard categorical approach [29, 30]. The 
structure of four of the five AMPD trait domains—negative affect- 
ivity, detachment, antagonism, and disinhibition—has been repli- 
cated many times in adolescents. In addition to a trait dimensional 
approach, incorporation of a self-other dimension to distinguish 
personality from PD has been endorsed, in order to better under- 
stand the processes involved in the development of personality path- 
ology [30]. Social adaptation and emotional regulation depend on 
the presence of a realistic and stable sense of self, and gratifying, 


supportive, and maturation-promoting interpersonal relationships 
depend on accurate perceptions of others. 

The context in which a young person’s personality develops is crit- 
ical. Children and young adults are at increased risk for developing 
personality pathology if they have experienced physical, sexual, or 
verbal abuse or neglect in childhood. The personality profiles of mis- 
treated children are characterized by high neuroticism, low agree- 
ableness, low conscientiousness, and low openness to experience—a 
profile of maladaptive personality very similar to that reflecting PD 
in adulthood—which tend to persist [31]. Child and adolescent psy- 
chopathology of other types, such as depressive, anxiety, and disrup- 
tive behaviour disorders, have been shown in longitudinal studies to 
predispose to the development of personality pathology in adoles- 
cents and young adults. Even according to the categorical approach 
to the diagnosis of personality pathology in children, research has 
shown that although rates of PD features decrease over time, chil- 
dren who report higher rates are more likely to have PDs in young 
adulthood and to suffer from a host of other mental disorders and 
psychosocial problems. Thus, the identification of personality path- 
ology in children and adolescents presents an opportunity for early 
intervention to prevent future adverse consequences. 

Certain critical developmental periods are implicated in the 
genesis of personality pathology. Abnormal attachment to a pri- 
mary caregiver, due to either separation or poor parenting, has 
been observed. Disrupted attachment early in life is likely to lead 
to impairments in emotional regulation and self-control [32]. 
Temperamentally high stress-reactivity in a child may itself con- 
tribute to problematic attachment. In adolescence, the development 
of a stable identity or sense of self is a major task and, when delayed 
or impeded, may lead to the development of personality pathology. 
In early adulthood, transitions such as leaving home, becoming eco- 
nomically self-sufficient, and being intimately involved with people 
outside of the family of origin are important developmental tasks. As 
a consequence, personality pathology often becomes evident when 
young people attempt these transitions. 

Traditional PD diagnostic criteria have poor face validity in later 
life, because they often refer to occupational or interpersonal activ- 
ities that may no longer be relevant to older adults. The few lon- 
gitudinal studies of PDs over the lifespan suggest a drop in PD 
prevalence in older adults, but it is not known how much this finding 
is dependent on inapplicable criteria. Some argue that personality 
traits are generally stable across age, with slight decreases in (FFM) 
neuroticism, extraversion, and openness, and slight increases in 
agreeableness and conscientiousness. Trait expression in later life 
might, however, be a function not only of underlying neurobiology, 
but also of different (from youth) contextual factors varying over the 
lifespan. Thus, the degree of social, physical, occupational, or eco- 
nomic stress experienced by an individual over time may determine 
how stable or unstable personality appears [30]. 

Dimensional approaches to the conceptualization of personality 
pathology are appropriate for the study of development and longi- 
tudinal course, since change is a matter of degree and behaviours, 
such as excessive dependency or obstinacy, that are developmentally 
appropriate at certain ages but become inappropriate at other ages. 
Viewed developmentally, personality pathology may be considered 
a manifestation of delayed or obstructed development [33]. Recent 
rigorous prospective longitudinal studies have shown that PDs are 
much less stable over time than are implied in the traditional DSM 


conceptualization [34, 35]. However, certain features of PD are 
more stable than others [36], perhaps because the severity of core 
personality pathology can fluctuate over time, related to situational 
or contextual factors, while the basic personality trait structure for 
any given individual may be more stable. Most studies of person- 
ality traits have shown that they change gradually over the lifespan 
until individuals reach the age of 50 [37]. In a clinical population, 
the stability of personality traits, as measured by the FFM (normal) 
or Schedule for Nonadaptive and Adaptive Personality (SNAP) 
(normal and pathological) models, was significantly higher than the 
stability of DSM-IV PD symptoms (that is, criteria) over 10 years 
of follow-up, suggesting that, from a longitudinal perspective, ‘traits 
reflect basic tendencies that are stable and pervasive across situ- 
ations, whereas ... [PD] symptoms reflect characteristic maladapta- 
tions that are a function of both basic tendencies and environmental 
dynamics’ [38]. DSM-5 AMPD personality traits have been shown 
to be highly stable, to be prospectively predictive of psychosocial 
functioning, and to be dynamically associated with functioning 
over time [39]. In the Collaborative Longitudinal Personality Study 
(CLPS) sample, general PD features representing ‘disorder’ severity 
exhibited less stability than specific features representing ‘style’ 
consistent with the notion that personality functioning is the more 
dynamic and changeable aspect of personality pathology, while per- 
sonality traits are stable [40]. 





Impact on health and psychosocial functioning 


Physical health 


PDs are known to have associations with a wide range of physical 
health disorders. In the National Epidemiological Survey on Alcohol 
and Related Conditions (NESARC), PDs within clusters A, B, and C 
were associated with multiple physical conditions, including cardio- 
vascular disease, arthritis, diabetes, and gastrointestinal conditions 
[41]. The occurrence of physical disorders with a wide range of PDs 
suggests common underlying factors (that is, core dimensions) at 
work. From a review of population-based studies in England, Wales, 
Scotland, Western Europe, Norway, Australia, and the United States, 
Quirk and colleagues [42] concluded that the bulk of evidence 
supports associations between PDs from clusters A and B and car- 
diovascular disease and arthritis. As a result, PDs increase health 
care utilization, particularly in primary care. Individuals with PD 
have reduced life expectancy [43], with all-cause mortality pre- 
dicted by alcohol and drug abuse, physical illness, and functional 
impairment [44]. 

Considerable evidence indicates that broad personality trait do- 
mains, such as neuroticism (that is, negative affectivity), adversely 
affect physical health and the quality and longevity of life [45]. In re- 
cent years, neuroticism has been found to be associated with asthma 
[46], obesity [47], Alzheimer’s disease [48], and coronary heart dis- 
ease [49]. Low conscientiousness (that is, disinhibition) has been 
found to increase the risk of diabetes [50] and risk of death [51]. 

The trait domain of conscientiousness appears to be protective 
against a number of physical conditions, possibly related to a more 
favourable inflammatory profile [52]. A meta-personality factor of 
‘stability, consisting of high conscientiousness, high agreeableness, 
and low neuroticism is associated with reduced cardiometabolic 
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risk, an association that was mediated by inflammation, autonomic 
function, and physical activity [53]. Lower-order FFM personality 
trait facets of straightforwardness, self-discipline, altruism, compli- 
ance, tender-mindedness, and openness to fantasy are associated 
with increases in life survival time [54]. 


Psychosocial functioning 


All PDs, by definition, are maladaptive and accompanied by func- 
tional problems in school or at work, in social relationships, or at 
leisure. The requirement for impairment in psychosocial functioning 
is codified in DSM-5 Section II in its criterion C of the general diag- 
nostic criteria for a PD, which states that ‘the enduring pattern [of 
inner experience and behaviour, that is, personality] leads to clin- 
ically significant distress or impairment in social, occupational, or 
other important areas of functioning’ [7, p. 646]. 

A number of studies have compared patients with PDs to those 
with no PD or with DSM-IV axis I disorders and have found that 
patients with PDs were more likely to be functionally impaired [55]. 
Specifically, they are more likely to be separated, divorced, or never 
married and to have had more unemployment, frequent job changes, 
or periods of disability. Fewer studies have examined quality of func- 
tioning, but in those that have, poorer social functioning or inter- 
personal relationships and poorer work functioning or occupational 
achievement and satisfaction have been found among patients with 
PDs than with other disorders. When patients with different PDs were 
compared with each other on levels of functional impairment, those 
with severe PDs, such as STPD and BPD, were found to have signifi- 
cantly more impairment at work, in social relationships, and at leisure 
than patients with less severe PDs, such as obsessive-compulsive per- 
sonality disorder (OCPD), or with an impairing other mental dis- 
order such as major depressive disorder (MDD) without PD. Patients 
with AVPD had intermediate levels of impairment. 

Another important aspect of the impairment in functioning in 
patients with PDs is that it tends to be persistent, even beyond ap- 
parent improvement in PD psychopathology itself [34, 56]. The per- 
sistence of impairment is understandable if one considers that PD 
psychopathology has usually been long-standing and therefore has 
disrupted a person's work and social development over a period of 
time [57]. The ‘scars’ or residua of personality pathology take time to 
heal or be overcome. With time (and treatment), however, improve- 
ments in functioning can occur [35]. 

DSM-5 Section III criteria for PDs in the AMPD do not include 
a requirement for impairment in psychosocial functioning. This 
change is in keeping with some other disorders in DSM-5, which 
attempted to separate the manifestations of a disorder (that is, signs, 
symptoms, traits) from their consequences (that is, impact on occu- 
pational, social, and leisure functioning). Furthermore, Section II 
PDs all include specific impairments in personality functioning at a 
moderate level or greater. This change is consistent with the distinc- 
tion between mental functions (for example, emotional regulation, 
reward dependence, reality testing) that lead to symptoms and the 
disabilities that accompany disturbances in these functions [58]. 

Interpersonal problems are probably most characteristic of people 
with PDs [59, 60], which, coupled with problems in the sense of self, 
are captured by the Section III LPFS and the A criteria for the six spe- 
cific PDs and personality disorder-trait specified (PD-TS). Each of 
the six PDs has characteristic problems with empathy and intimacy, 
reflecting fundamental abnormalities in social cognition [61]. 
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People with ASPD deceive and intimidate others for personal 
gain. Lacking in ‘emotional empathy, they have no concern for the 
feelings of others and fail to express remorse if they hurt someone. 
In the area of intimacy, they are incapable of having mutually in- 
timate relationships, as they exploit others or control them. Patients 
with histrionic personality disorder (HPD) and narcissistic person- 
ality disorder (NPD) need to be the centre of attention and require 
excessive admiration. Intimate relationships are generally shallow, 
and people are sought out primarily to bolster self-esteem. Empathic 
concerns centre on issues that have direct implications for the person 
with the PD. 

Patients with OCPD have difficulties appreciating others’ perspec- 
tives and need to control them and to have them submit to their ways 
of doing things. Intimacy is circumscribed by stubbornness and ri- 
gidity and a preference for engaging in tasks, rather than pursuing 
close relationships. 

The interpersonal relationships of patients with AVPD and DPD 
are impoverished as a result of fear and submissiveness, respectively. 
Patients with AVPD are inhibited in interpersonal relationships be- 
cause they are afraid of being shamed or ridiculed. Empathy is im- 
paired because of a distorted sense of others’ appraisal and an acute 
sensitivity to rejection. Patients with DPD will not disagree with im- 
portant others for fear of losing their support or approval and will 
actually do things that are unpleasant, demeaning, or self-defeating 
in order to receive nurturance from them. With the self-sacrificing 
approach to relationships, real intimacy and empathy are elusive. 

The empathy of patients with BPD is distorted and typically 
biased towards the negative tendencies and vulnerabilities of others. 
Intimate relationships are extremely challenging, with a pattern of 
becoming ‘deeply’ involved and dependent only to turn manipula- 
tive and demanding when their needs are not met. They have inter- 
personal relationships that are unstable and conflicted, and they 
alternate between over-involvement with others and withdrawal 
from them. 

The degree of detachment associated with patients with paranoid 
PD, schizoid PD, and STPD serves as a pronounced impediment to 
empathy and intimacy in interpersonal relationships. Patients with 
schizoid PD manifest an apparent lack of need for closeness with 
others; people with paranoid PD do not trust others enough to be- 
come deeply involved; and patients with STPD have few friends or 
confidants, in part due to a lack of trust and in part as a result of poor 
communication and inadequate relatedness. 

In addition to these core personality functions, personality traits 
have been found to predict concurrent and prospective function 
in psychiatric patients [62], with some specificity across different 
traits. Within the FFM traits, neuroticism was broadly related to 
impairment across the domains of social, occupational, and rec- 
reational functioning. Extraversion (low) was primarily related to 
social and recreational dysfunction, openness (low) to recreational 
dysfunction, agreeableness (low) to social dysfunction, and con- 
scientiousness (low) to work dysfunction. Ro and Clark [63] com- 
pared adaptive-range traits to non-adaptive range traits in patient 
and non-patient samples and found that psychosocial functioning 
and personality traits were closely linked, particularly in patients. 
General well-being was negatively associated with neuroticism/ 
negative affectivity and positively associated with extraversion/ 
positive affectivity; social/interpersonal functioning was associated 
with dis(agreeableness) and with conscientiousness/disinhibition, 


and basic functioning was associated with conscientiousness/ 
disinhibition. 

Simms and Calabrese [64] found that Section II pathological 
personality traits incrementally predicted psychosocial impair- 
ment over normal-range personality traits, PD criteria counts, and 
common psychiatric symptoms. In contrast, the incremental ef- 
fects of normal traits, PD criteria counts, and common symptoms 
were substantially smaller than for pathological personality traits. 
In the CLPS follow-along [65], PD criteria counts were strongly 
associated with impairment in psychosocial functioning at intake 
(more strongly than categorical PD diagnoses), but this relation- 
ship diminished over time. The best predictors longitudinally of 
impairment were models that combined normative traits and mal- 
adaptive variables (that is, the SNAP model or a model composed 
of a DSM-IV PD criteria count and FFM domains). At 6-, 8-, and 
10-year follow-ups, the SNAP continued to be the most predictive 
and DSM PD criteria and FFM domains tended to provide substan- 
tial incremental validity to one another, supporting a hybrid model 
[66]. Also, in the CLPS sample, general PD features representing 
‘disorder’ severity were more strongly related to psychosocial func- 
tioning concurrently and prospectively than specific features repre- 
senting personality ‘style’ [40]. 


Clinical utility 


In the official DSM-5 Field Trials, clinicians were asked to rate the 
usefulness of tested diagnostic criteria for all disorders. In both the 
Academic Centers and the Routine Clinical Practice Field Trials 
[67], the Section III PD model was rated as ‘moderately; ‘very’, 
or ‘extremely’ useful by over 80% of clinicians. In the Academic 
Centers trial, more clinicians rated the Section II] model as ‘very’ 
or ‘extremely’ useful, compared to DSM-IV than all disorders, ex- 
cept somatic symptom disorders and feeding and eating disorders. 
In the Routine Clinical Practice trial, the Section III model was 
more often rated as ‘very’ or ‘extremely’ useful, compared to DSM- 
IV than all disorders, except neurocognitive disorders and substance 
use and addictive disorders. In a separate investigation, Morey and 
colleagues [68] asked clinicians to rate the perceived utility of the 
proposed DSM-5 rendering of personality pathology, compared to 
DSM-IV. Questions addressed ease of use and usefulness for com- 
munication, patient description, and treatment planning. Although 
the clinicians were much more familiar with DSM-IV PDs, they 
rated all DSM-5 components generally as ‘useful’ or ‘more useful’ 
than DSM-IV for clinical description and treatment planning. Other 
than in ease of use and communication with other professionals, the 
LPFS was rated by clinicians as more useful than DSM-IV for patient 
description, communicating with patients about their problems, 
and treatment planning. Furthermore, the DSM-5 pathological trait 
system was rated by both psychiatrists and psychologists as easier to 
use and more useful for communication with other clinicians and 
with patients, for patient description, and for planning treatment 
than the DSM-IV conceptualization. 

In addition to perceived utility, self interpersonal problems, such 
as insecure attachment and maladaptive schemas, have been shown 
to be associated significantly with PD psychopathology and im- 
pairments in psychosocial functioning, as well as to affect clinical 
outcome [3]. Self-other dimensions have discriminated different 


types of PD pathology, predicted various areas of psychosocial func- 
tioning, and have shown to be moderators of treatment alliance and 
outcome. In the Morey and colleagues’ survey [23], the single-item 
LPFS rating predicted variance in clinician ratings of psychosocial 
functioning, prognosis, and treatment needs over and above that 
predicted by all ten DSM-IV PD diagnoses combined (that is, diag- 
noses based on 79 total criteria). 

In addition to the independent utility of pathological personality 
traits and of personality functioning in identifying and describing 
personality pathology and in planning and predicting the outcome 
of treatment, a number of recent studies support a model of person- 
ality psychopathology that specifically combines ratings of disorder 
and trait constructs. Each has been shown to add incremental value 
to the other in predicting important antecedent (for example, family 
history, history of child abuse), concurrent (for example, functional 
impairment, medication use), and predictive (for example, func- 
tioning, hospitalization, suicide attempts) variables [65, 66, 69, 70]. 





Dimensions and personality disorder subtypes 


Each of the ‘official’ DSM-5 PDs can be characterized by spe- 
cific impairments in personality functioning and by pathological 
personality traits in one or more of the DSM-5 trait domains of 
negative affectivity, detachment, antagonism, disinhibition, and 
psychoticism. Some of the simpler PDs have traits in only one do- 
main. DPD can be described by three traits in the negative affectivity 
domain: submissiveness, separation insecurity, and anxiousness. 
Schizoid PD can be described by four traits in the detachment do- 
main: withdrawal, intimacy avoidance, anhedonia, and restricted 
affectivity. Some of the more complex PDs have traits in more than 
one domain. Complexity in clinical presentation is also reflected by 
the common co-occurrence of categorical PDs with each other and 
with other mental disorders. Co-occurring disorders frequently, 
though not always, come from the same psychopathological spec- 
trum or trait domain. 

The DSM-5 alternative model includes six specific PDs—ASPD, 
AVPD, BPD, NPD, OCPD, and STPD—selected because of their 
empirical bases or their utility for clinicians [71, 72]. All PDs are 
required to have at least moderate impairment in personality func- 
tioning, according to the DSM-5 AMPD, because a moderate level of 
impairment has been empirically shown to identify PDs with max- 
imal combined sensitivity and specificity. 

PD trait domain and facet assignments were made on the basis 
of existing meta-analyses of FFM/PD relationships [73-75], with 
corroboration from data provided by clinicians that were gathered 
to inform the process [76]. In analyses of 33 data sets, O'Connor 
[73] found that DPD, AVPD, and BPD had high loadings on a factor 
resembling FFM neuroticism; antisocial, NPD, paranoid PD, and 
HPD loaded on a factor of low agreeableness; schizoid PD, STPD, 
AVPD, paranoid PD, and HPD (a negative loading) loaded on low 
extraversion, and OCPD loaded on conscientiousness. In terms of 
the DSM-5 trait model, these four factors would be represented by 
negative affectivity, antagonism, detachment, and low disinhibition, 
respectively. 

In Morey and colleagues’ study [76], ASPD was associated with 
traits from the antagonism and disinhibition domains. All seven 
traits assigned for the diagnosis of ASPD (four from the antagonism 
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domain and three from the disinhibition domain) had higher cor- 
relations with ASPD than any of the remaining traits in the AMPD 
model. STPD was associated with traits almost exclusively in the 
psychoticism and detachment domains, and again all six assigned 
traits had higher correlations with STPD than any of the other 
traits. BPD showed generally elevated trait associations across the 
negative affectivity domain, with the highest correlation being with 
emotional lability. Significant correlations were also found for risk- 
taking and impulsivity from the disinhibition domain. The two traits 
assigned to NPD—grandiosity and attention seeking—both had 
high correlations with the diagnosis. In addition, there were signifi- 
cant correlations with other traits from the antagonism domain such 
as callousness, manipulativeness, and deceitfulness. These traits of 
so-called ‘malignant narcissisn’ were not assigned to NPD because 
to do so resulted in extensive overlap between the diagnoses of NPD 
and ASPD. Since reducing PD comorbidity was a goal of the AMPD, 
these traits were included as diagnostic ‘specifiers’ to be listed when 
warranted. OCPD had significant correlations with traits from dif- 
ferent domains. The four highest correlated traits were assigned. 
AVPD had significant correlations with anxiousness from negative 
affectivity and several traits from the detachment domain, which 
were assigned. For each AMPD-specific disorder, except for STPD, 
it was shown that clinician ratings on the criterion A impairments 
in personality functioning significantly incremented the assigned 
traits in predicting a DSM-IV PD diagnosis, supporting the validity 
of the AMPD personality functioning and personality trait hybrid 
dimensional-categorical model. 

Each PD also had a diagnostic algorithm for the trait (‘B’ criterion) 
developed from the Morey survey [77], consisting of a minimum 
number of assigned traits and, in some cases, a particular configur- 
ation of traits. The algorithm was developed to simultaneously maxi- 
mize correspondence with DSM-IV PD diagnoses (to be minimally 
disruptive to clinical practice and research), to minimize overlap 
with other PD diagnoses (to reduce comorbidity), and to maximize 
relationships with functional impairment (to increase validity). The 
algorithms were successful in rendering DSM-IV PD diagnoses ac- 
cording to personality functioning impairments and pathological 
personality traits, with very good fidelity (correlations between 
DSM-IV and AMPD dimensional criteria counts for ASPD = 0.80; 
BPD = 0.80; AVPD = 0.77; NPD = 0.74; STPD = 0.60; OCPD = 0.57). 
The algorithm for the A criterion for specific PDs according to the al- 
ternative model of ‘characteristic difficulties in any two of four areas’ 
(that is, identity, self-direction, empathy, or intimacy) was found to 
be associated with the greatest combined sensitivity and specificity 
across the six specific PDs in the AMPD [77]. 

According to the AMPD, the other four official PDs in DSM-5, 
as well as any other PD presentations that would be diagnosed as 
‘other specified personality disorder’ in DSM-5 Section II, may be 
diagnosed as PD-TS. Both the specific level of impairment (that is, 
moderate, severe, or extreme) and the specific pathological person- 
ality traits that describe the person would be noted. 


Conclusions 


The classification and diagnosis of personality pathology has been 
slowly, but inexorably, moving from a categorical to a dimensional 
model. A dimensional model better reflects the phenomena of 
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personality pathology, which are continuous in nature, and better 
describes the degrees of difference from normal personality and the 
degrees of severity. Core personality dimensions can represent the 
features that distinguish PDs both from normal personality func- 
tioning and from other forms of psychopathology, as well as the 
myriad manifestations of PD. A general consensus is that the person- 
ality trait structure of PD converges on a five-factor model of broad 
trait domains, with each factor composed of more narrow trait facets. 
The DSM-5 alternative model for PDs comprises an LPFS to iden- 
tify PD and its severity and five trait domains of negative affectivity, 
detachment, antagonism, disinhibition, and psychoticism—the first 
four of which represent the pathological poles of the five factors of the 
five-factor model of normal personality—and their component trait 
facets. Personality traits and personality functioning variables will un- 
doubtedly be more closely related than traditional categories to basic 
physiological and psychological processes underlying personality 
pathology. Personality traits and personality functioning have been 
shown to be associated with physical health problems and impaired 
psychosocial functioning, to guide treatment planning, and to predict 
a host of important outcomes. Clinicians find dimensional models 
more clinically useful than categorical models, despite their newness 
and unfamiliarity. The future for research and treatment of personality 
pathology lies in its dimensional conceptualization. 
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Personality disorders are a group of disorders characterized by a 
pervasive set of feelings and behaviours that impair an individual's 
ability to function. Even when standardized approaches are em- 
ployed by highly trained clinicians, the consistency of diagnosis 
between different clinicians is highly variable [1]. Many com- 
mentators argue that this is inevitable because current diagnostic 
classifications are fundamentally flawed; their explanation is that 
personality dysfunction is a continuum and comorbidity between 
the different classes is ubiquitous [2]. This debate is not unique to 
personality disorder, and there is increasingly a move away from 
observational approaches to a more integrated approach where 
multiple sources of information are assimilated to better under- 
stand the basic dimensions of functioning underlying the full range 
of human behaviour, from normal to abnormal [3]. For person- 
ality disorder, this tension between dimensional and categorical 
approaches to diagnosis, in combination with the dominant role of 
psychoanalytic and behavioural models in treatment development, 
has resulted in a relative paucity of research into underlying neuro- 
biology, cognitive psychology, or physiology. Asa result, evidence- 
based treatment targets are few and based upon a small number 
of studies. The interpretation of the existing literature is limited 
by the internal heterogeneity of each specific personality disorder, 
a lack of consistency in the approach to diagnosis, axis I and II 
comorbidity, female gender bias, small sample sizes, and cross- 
sectional approaches. Borderline personality disorder (BPD) has 
been the focus of the majority of research in personality disorder, 
although there is some literature on antisocial personality disorder 
(ASPD) and schizotypal personality disorder (SPD). Studies of 
other personality disorders are too few in number to draw any firm 
conclusions about the mechanisms or treatment targets. In this 
chapter, we will review the current evidence for the basic mechan- 
isms for personality disorder, relevant to treatment development, 
as well as review what current treatments reveal about potential 
treatment targets. 


The neurobiology of personality disorder is poorly understood, and 
it seems unlikely that any of the disorders relate to the impairment 
of a single neurotransmitter system. This may, in part, relate to the 
inherent heterogeneity of personality disorder diagnoses, but also 
to the fact that the disorders emerge as a result of biological vulner- 
ability in combination with environmental influences. However, the 
underlying neurobiology does provide useful insights into how/why 
personality disorders develop, and targeting specific neurotrans- 
mitter systems may lead to symptomatic relief and enhance the effi- 
cacy of existing psychological treatments. Biological approaches are 
widely used in the treatment of personality disorder, and there is evi- 
dence to support the role of a number of neurotransmitter systems. 





There is considerable evidence implicating serotonin in person- 
ality disorder. Serotonin regulates amygdala hyperreactivity in 
BPD, thought to be a central neurobiological correlate of affective 
instability [4]. It has been proposed that the imbalance between pre- 
frontal and limbic responsivity may relate to impaired serotonergic 
facilitation of top-down control [5]. Polymorphisms in the genes 
for the 5-HT2 receptor, tryptophan hydroxylase, and the serotonin 
transport promoter are all found to be associated with BPD [6, 7], 
although these findings have not been replicated. However, there is 
more convincing evidence that gene-environment interactions are 
of greater relevance, with some studies suggesting that these genes 
mediate the impact of exposure to childhood trauma in BPD [8, 9]. 
The serotonin transporter gene is also associated with ASPD [10], 
and methylation of this gene has been shown to mediate the impact 
of childhood sexual abuse in ASPD [11]. Variants of monoamine 
oxidase (MAO), which metabolizes dopamine and serotonin, are 
also thought to have a role in the development of antisocial traits 
and again are implicated in outcomes following adverse events in 
childhood [12]. Positron emission tomography (PET) studies in- 
dicate that there is reduced serotonin transporter availability in 
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the prefrontal cortex in BPD, compared with healthy controls 
[13]. Blunted hormonal responses to serotonergic agonists, such 
as fenfluramine and meta-chlorpiperazine, have been observed in 
BPD [14, 15], and this blunting is associated with impulsivity, anger, 
and self-harm, but not the social or affective components of the dis- 
order. Increased impulse aggression has also been shown to be as- 
sociated with reduced prolactin release in response to fenfluramine 
in obsessive-compulsive personality disorder (OCPD) [16]. These 
findings are consistent with preclinical studies where reductions 
in serotonergic activity are found to be associated with aggression 
[17]. Fluoxetine (a selective serotonin reuptake inhibitor) has been 
shown to increase the metabolic rate in the orbitofrontal cortex and 
to significantly improve symptoms in individuals with aggressive- 
impulsive personalities [18]. 

Despite there being considerable evidence for a role of the 
serotonergic system in personality disorder, there is no convin- 
cing evidence that treatment with serotonergic drugs is associated 
with significant or sustained improvement in symptom profiles in 
clinical trials [19]. This may relate to a lack of serotonin receptor 
specificity (impulsivity is associated with 5-HT2a, not 5-HT2c, an- 
tagonism), the complexity of the interactions between different re- 
ceptor systems associated with behavioural outcomes, and the need 
for targeting specific brain regions [20]. 


Dopaminergic system 


The dopamine system has been a target of interest in SPD be- 
cause of its similarity to psychotic disorders. SPD has been found 
to be associated with the catechol-O-methyltransferase enzyme 
(COMT), the D4 receptor, and dopamine beta-hydroxylase genes. 
Decreased dopaminergic activity in the prefrontal cortex has been 
observed, but there appears to be evidence of compensatory ac- 
tivation in other regions, such as the striatum, suggesting that 
individuals with SPD may have greater buffering of subcortical 
dopaminergic activity than that found in those with schizophrenia 
[21, 22]. Homovanillic acid, a proxy for dopamine levels in the 
brain, has been found to be elevated in both plasma [23] and the 
cerebrospinal fluid (CSF) [24] in SPD. SPD is associated with en- 
hanced dopamine release in response to amphetamine [25], and 
both amphetamine and pergolide have been shown to improve 
cognitive performance [26, 27]. 

Use of drugs for psychosis in personality disorder is widespread 
[28, 29]. In a recent systematic review of their use in SPD, there 
was little evidence found to support it [30]. In BPD, one meta- 
analysis concluded that dopamine antagonists were associated with 
worsening of symptom severity [19]. However, there is some evi- 
dence supporting reductions in symptom severity in BPD associ- 
ated with olanzapine [31, 32], quetiapine [33], and aripiprazole [34]. 
Ziprasidone has not proved to be efficacious in BPD, despite having 
a similar pharmacological profile to aripiprazole [35]. 


Oxytocin 


Given the centrality of interpersonal dysfunction to all personality 
disorders, and particularly BPD, there has been an increasing focus 
on oxytocin, a prosocial neuropeptide. The precise mechanism by 
which oxytocin exerts this effect is unknown, but there is sub- 
stantial evidence in non-clinical groups that it promotes empathy, 
trust, and social reward [36]. There is some evidence to support the 
moderating role of alleles of the oxytocin receptor (OXTR) coding 


gene in the development of BPD symptoms, following childhood 
adversity [37]. There is also evidence that methylation of OXTR 
genes occurs rapidly after exposure to stress [38], suggesting that 
oxytocin may play a role in enhanced stress sensitivity found in 
BPD. Reduced CSF and serum oxytocin levels have been found 
to be associated with BPD symptoms [39]; however, these find- 
ings are not specific to BPD. It may therefore be oversimplistic to 
consider BPD as a disorder uniquely characterized by low oxy- 
tocin levels. Studies of the impact of oxytocin in BPD indicate that 
while it downregulates stress responses to social threat [40], it is 
not associated with increased no-verbal affiliative behaviour ob- 
served in healthy controls [41]. In another study, BPD subjects 
given intranasal oxytocin were found to show less co-operative be- 
haviour while playing the assurance game (a variant of the classic 
prisoners dilemma) and described higher levels of attachment 
anxiety [42]. Striepens et al. have proposed a paradoxical effect 
of oxytocin in BPD; while oxytocin may downregulate amygdala 
reactivity to aversive stimuli [43], it may also increase insular re- 
sponses to negative stimuli, enhancing the defensive impact of 
negative social cues [44]. It could be that individuals with BPD 
experience the prosocial effects of oxytocin as aversive, given the 
association of the insula with the processing of complex negative 
emotions. 

Given the conflicting evidence about the effects of oxytocin in 
BPD, its role in treatment remains unclear. However, all studies to 
date have used single doses of oxytocin and explored the possibility 
that BPD may be associated with dysfunctional oxytocin regulation 
or that the prosocial effects of oxytocin may heighten entrenched 
anxieties and fears about trust and attachment. Despite these uncer- 
tainties, oxytocin represents a potential treatment for the core dys- 
function in BPD. Some authors have suggested that oxytocin may 
facilitate the learning of new relational strategies in combination 
with psychological interventions [42], while others have proposed 
that it may allow the intergenerational cycle of dysfunction to be 
broken [36]. 

Opioids 

Brain opioids play an important role in mediating the social impact 
of isolation and exclusion [45]. There is evidence to support the dys- 
function of the endogenous opioid system (EOS) in both BPD [46] 
and ASPD [47], and it has been proposed that reduced EOS activity 
may underpin the chronic dysphoria associated with BPD [48]. It 
has also been suggested that self-harm represents an unconscious 
drive to increase EOS activity [48]. PET studies showed evidence 
of mu-opioid receptor binding dysfunction [46], and lower levels of 
endogenous opioids have been reported in the CSF of individuals 
with BPD [49]. To date, there have been no randomized controlled 
trials of opioid antagonists, but a small number of open-label studies 
[50, 51] in BPD suggested that targeting the EOS may be a potential 
treatment strategy. 

The glutamatergic system has been proposed as a potential treat- 
ment target, given the role of N-methyl-D-aspartate (NMDA) in 
mediating a range of psychiatric symptoms and on the basis that ex- 
cessive glutamatergic activity contributes to hypersensitivity of the 
limbic system associated with BPD [52]. There is little direct evi- 
dence directly linking personality disorders with glutamatergic dys- 
function, but a randomized trial of an NMDA receptor antagonist in 
BPD is ongoing [53]. 
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Symptom-based targets 


Factor analysis of the diagnostic criteria for borderline person- 
ality disorder has consistently revealed three factors: affective 
dysregulation, behavioural dysregulation, and interpersonal 
disturbances. 


Mood instability 


Affective dysregulation has long been viewed as the core source of 
dysfunction in BPD. Recent technological developments enabling 
prospective real-time monitoring have confirmed this observation 
[54]. Although mood instability is common in other mental dis- 
orders, the high temporal frequency and characteristics observed in 
BPD appear to be distinct and specific to the disorder [54, 55]. Some 
of the most consistent neuroanatomical findings in BPD are those 
of reduced volumes in the amygdala, hippocampus, orbitofrontal 
cortex, and anterior cingulate cortex [56, 57]. These are all areas 
of the brain involved in emotional regulation. Multiple studies re- 
vealed decreased activation in prefrontal areas in response to emo- 
tional stimuli, suggesting that top-down emotional control may be 
impaired [58]. Psychophysiological studies also indicate abnormal 
emotional regulation in BPD [59]. Mood regulation forms one of 
the central components of a number of psychotherapies, and there 
is some limited evidence that existing treatments may reduce mood 
instability, although this is limited to single-study estimates of ef- 
fect [60]. Specific emotion regulation training has not been found 
to be effective, although this may relate to short study duration [61]. 
While mood instability is often used as an outcome measure in trials 
of treatments for BPD, few studies have employed objective longitu- 
dinal measurement of mood; rather mood instability has been as- 
sessed using one-off questionnaires and items from the Structured 
Clinical Interview for DSM-IV Axis II personality disorders (SCID- 
II). Mood instability remains an important treatment target, and 
there is a need for large, high-quality studies using objective longitu- 
dinal measures of mood. 


Interpersonal disturbance 


Social dysfunction is a prominent and pervasive feature in BPD. 
Impairments in social functioning are well documented in BPD and 
persist despite remission of other symptoms, both in children who 
meet criteria for borderline pathology [62] and in adults [63]. 
Interpersonal difficulties have been linked to impairments in 
decoding social signals, such as facial expressions [64], and deeper 
problems with trusting the motives and behaviours of others 
[65, 66]. Individuals with BPD appeared to lack the capacity to sus- 
tain mutually beneficial interactions with playing partners in an 
investment-trust game [67] and showed patterns of reduced co- 
operation when compared to those with bipolar disorder or healthy 
controls in an iterated prisoner’s dilemma game [68]. In this latter 
study, trait aggression, hostility, and impulsivity were not found to 
account for the failure to co-operate; instead it was proposed that in 
BPD, the experience of co-operation may not be coded or experi- 
enced as rewarding and consequently cannot provide a basis for 
sustained future co-operation. However, in real life, decisions rarely 
involve interactions with one individual. To date, no one has sys- 
tematically studied the responses of individuals with BPD in a group 
context or which factors are most influential. Given that almost all 


current treatment modalities involve group treatments, enhancing 
our understanding of group behaviour may assist in identifying po- 
tential specific treatment targets relating to social function. 


Behavioural disturbance 


Behavioural disturbance is common to all personality disorders but 
has been most widely studied in BPD and ASPD. Reactive aggres- 
sion has been consistently described in ASPD and is associated with 
dysfunctional representation of the cost of aggressive behaviour and 
the failure of inhibitory mechanisms in response to the distress of 
the victim [69]. In BPD, prefrontal activation is diminished, when 
compared to non-aggressive healthy controls, and there is evidence 
of decoupling of orbitofrontal-amygdala responses in BPD, sug- 
gesting a failure of top-down control leading to more rapid escal- 
ation of angry responses to angry faces [70]. However, these findings 
are not specific to personality disorders, and to date, interventions 
specifically directly targeting aggression have been unsuccessful. 


What can we learn from effective treatments? 


While there is evidence for pharmacological targets in personality 
disorders, the rationale for the use of medications is largely based 
on observed symptom control, although this is not supported by any 
randomized controlled trials of adequate size, observational data 
are poor, and studies have generally been limited by short treatment 
duration and inadequate follow-up time. The one independent ran- 
domized trial of a reasonable size found no beneficial effect in the 
use of lamotrigine in BPD [71]. 

In contrast, there are a wide range of psychosocial interventions 
that appear to be effective, in particular in BPD. The majority of 
randomized studies have follow-up periods of <2 years, a draw- 
back in the treatment of trait-based disorders. Of the three studies 
with longer follow-ups, only mentalization-based treatment (MBT) 
has demonstrated a robust effect [72]. This is a small study and the 
nature of the intervention is unclear [73], consisting of 5-days-per- 
week day hospital treatment and employing a range of interven- 
tions [74]. This result may mirror Antonsen et al’s [75] finding at 
6-year follow-up of a superior outcome for patients with BPD who 
participated in day hospital treatment, when compared to a group 
receiving individual therapy. A 6-year follow-up of cognitive behav- 
ioural therapy (CBT) for BPD gave mixed results [76]. 

Authors of the psychosocial treatments that appeared to have a 
beneficial effect in BPD proposed a variety of mechanisms which 
underpin their efficacy (Table 118.1). 

The wide range of therapeutic targets is likely to be a result of the 
complex nature of BPD, involving a wide range of high-level func- 
tions, but may also reflect the lack of current knowledge of the active 
elements in effective therapies and the heterogenous nature of the 
population. 

In ASPD, the evidence is too weak to support any specific treat- 
ment targets [95]. In cluster C conditions, exposure and behavioural 
experimentation have been proposed as the mechanism of action of 
CBT and the development of emotional insight in short-term psy- 
chodynamic psychotherapy (STPP) [96]. 

In addition to authors’ assumption/models, other themes are 
emerging from the existing evidence base. The majority of successful 
treatments involve groups (all except individual schema-focused 
therapy (SFT), transferance-focused psychotherapy (TFP), STPP, 
and cognitive analytic therapy) [97]. Group membership promotes 
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Table 118.1 Treatment targets/mechanisms of successful 
psychological treatments for BPD 





Addressing underlying SFT and CBT-PD address unhelpful cognitive 
schema/cognitive schema explicitly, and ‘loving kindness/self 
distortions compassion’ techniques are also likely to challenge 


putative schema [77] 


Mentalizing capacity 
deficits/reflective 
function 


MBT and DTC encourage mentalizing/reflective 
function explicitly, while SCM incorporates elements 
of the MBT theory, and TFP has been found to have 
a beneficial impact on reflective function [78] 


Emotional regulation 
skills deficits 


DBT and STEPPS incorporate group-based 
emotional skills sessions, which, in DBT, are then 
practised and reinforced in one-to-one sessions 
and telephone support. Mindfulness comprises 
an additional emotional regulation skill taught 
in DBT, although its applicability as a stand- 
alone intervention in personality disorder is not 
established [79] 


Psychoeducation overlaps with emotional 

skills training, but in addition, STEPPS explicitly 
incorporates education on the nature of BPD, an 
approach that is also showing promise using the 
Internet [80] 


STPP, DBT, and DTC all target problematic and 
recurrent behavioural and emotional patterns in 
order to explore their meaning (STPP and DTC) 
and encourage the development of alternative 
strategies (DTC and DBT) 


Nidotherapy [81] and SCM both provide this as 
a core part of the intervention, but it is probably 
active in most of the approaches discussed here 








Psychoeducation 


Identifying recurring 
behavioural and 
emotional themes and 
patterns 


Case management 


Encouraging emotional 
expression and 
addressing emotional 
avoidance 


A number of the models recognize the tendency 
of individuals with personality disorder to 

avoid strong emotions. STPP DTC, MBT, and 
DDP all explicitly encourage the idea that this 

is an unhelpful coping strategy and encourage 
alternative ways of dealing with distress 


Resolution of past 
trauma/construction 
of coherent narrative, 
development of 
emotional insight 


Individuals with personality disorder have often 
experienced adverse events in childhood, and 
narrative reconstruction is a part of all exploratory 
approaches to personality disorder, including DTC, 
STPP, CAT, DDP, TFP. and SFT [82] 


Improvement in social function is probably 
targeted in all group-based approaches but has 
been specifically addressed in PEPS (although 

with disappointing results [83]) and interpersonal 
group psychotherapy [84]. Belongingness arises 
from frequent and positive contact with others and 
has a range of benefits [85], and is also likely to be 
active, to an extent, in group approaches but has 
been specifically identified in DTC 


Social function and 
belongingness 





Deficiencies in 
responsible agency 


Agency is the main therapeutic target in MI, and it 
is also targeted in DTC and MBT 





SFT, schema-focused therapy [86]; MBT, mentalization-based therapy [72]; SCM, 
structured clinical management [87]; DTC, democratic therapeutic community [88]; DBT, 
dialectical behaviour therapy [89]; STEPPS, systems training for emotional predictability 
and problem solving [90]; STPP, short-term psychodynamic psychotherapy [91]; PEPS, 
psychoeducation with problem-solving [83]; MI, motivational interviewing [92]; DDP, 
dynamic deconstructive psychotherapy [93]; TFP, transference-focused psychotherapy 
[78]; CAT, cognitive analytic therapy [94]. 








social learning and a sense of belongingness, and there is evidence 
that this is associated with more prosocial behaviour in experimental 
settings [98]. Many interventions improve self-efficacy and respon- 
sible agency (the concept that individuals are motivated to change, 


feel that they have the power to do so, and wish to exert this power to 
effect change). This is likely to be related to empowerment—an in- 
ternal locus of control, knowledge of how to respond, and a sense of 
having something to offer to others. This is a particularly prominent 
element of democratic therapeutic communities (DTCs) where it 
is facilitated by support systems and member-member mentoring. 
All interventions probably improve social function when clinicians 
are empathic and reasonably skilled, but attempts to teach social 
problem-solving have proved disappointing [83]. Clinician attitude 
and faith in the model is likely to be an important ingredient in the 
success of any treatment [87]. 

Treatment strategies can also be derived from ineffective interven- 
tions. Crisis planning and motivational interviewing are too brief to 
be effective [99]. Failure of emotional regulation training to have any 
impact in adolescents with BPD may also relate to the brevity of the 
intervention [61]. Similarly, brief CBT seems not to help in people 
with personality dysfunction after self-harm [100]. 

The apparent effectiveness of a range of complex interventions 
illustrates the need for dismantling studies. The only approach 
in which this has been attempted to date is dialectical behaviour 
therapy (DBT) where Linehan et al. [89] found that skills training 
plus case management (DBT-S) showed significant advantages 
over DBT individual therapy plus activities group, whereas the full 
DBT programme was not clearly superior to DBT-S, indicating 
that group-based skills training constitutes an essential part of the 
DBT intervention and may represent the active element. Similarly, 
Soler et al. [101] found DBT skills training superior to standard 
group therapy. The apparent effectiveness of Systems Training for 
Emotional Predictability and Problem Solving (STEPPS) [90], a 20- 
week CBT programme that bears similarities to DBT skills training 
groups, provides further evidence to support this conclusion. 

It has been proposed that the effectiveness of therapies for per- 
sonality disorder relates to non-specific factors that all therapies 
share. While there are considerable overlaps between therapies, it 
is unlikely that this is a valid explanation, given the heterogeneity 
of approach and the fact that some studies have been able to differ- 
entiate between psychotherapies (for example, [102]). Nevertheless, 
the use of control conditions that incorporate non-specific factors 
such as case management and skills groups produces improvements 
in the control arm of trials, thus narrowing the gap with the active 
conditions both in trials that were still able to demonstrate some su- 
periority for the active condition (in MBT [103] and TFP [104]) and 
in those that were not (in DBT [105] and cognitive analytic therapy 
[106]). 


Future directions 


Current neuroscientific evidence supported by the theoretical 
underpinnings of successful psychotherapeutic interventions con- 
verge on social functioning and mood regulation as key treatment 
targets in personality disorder, and specifically in BPD. Current 
treatments may be more efficacious than they appear, but the hetero- 
geneity of the recruited samples and poor choices of outcome meas- 
ures have created too much noise for much signal to be detected. 
Trans-diagnostic approaches may prove more informative, given the 
heterogeneity of presentations within each specific personality dis- 
order and the extent of comorbidity. 
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Treatment outcome trials have suffered from a number of meth- 
odological flaws which limit their usefulness, and further research 
is required to address these. In trials of medication, most studies 
have suffered from being small and funded by drug companies 
and having short follow-ups. Although follow-up periods have, 
on the whole, been longer for psychosocial interventions, long- 
term follow-up over 5 years remains the exception; to date, there 
have been no published multi-centre trials, and randomized studies 
have not yet been carried out for cluster A disorders and are rare for 
cluster C disorders (although some studies have looked at all three 
clusters (88, 107]). The current uncertainty over the effectiveness of 
psychosocial interventions for personality disorders stems also from 
the small numbers enrolled in studies to date for all interventions 
other than DBT. Finally, the active elements of effective complex 
interventions will need to be investigated through dismantling and 
component studies [108]. 

To date, there are no animal models of personality disorder or an 
experimental medicine model against which to test new treatments, 
and there is no consensus on outcome measures in personality dis- 
order research [109]. Developments in mobile and wearable tech- 
nologies should be exploited in the development of new treatments 
and objective markers of treatment success. 
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The conceptual, clinical, and nosological debates about personality 
disorders (PDs) as psychiatric entities reflect both the complexity 
of the diagnoses and the fragility of the agreements arrived at in 
different classification systems [1, 2]. Among the reasons, the ab- 
sence of common or homogenous phenomenological and psycho- 
pathological bases for the different PD types, the multiple clinical 
variations or subtypes among the individual conditions, profuse 
comorbidity levels, and uncertain outcomes are frequently men- 
tioned [3, 4]. Additionally, a relatively reduced prevalence of PDs 
in the general population, when compared with ‘major’ psychiatric 
disorders, leads to relatively scarce (and, at times, questionable) 
epidemiological data and to a colourful, but heterogenous, clinical 
course. On the other hand, the same debates and their many impli- 
cations represent the growing relevance of PDs, not only as more or 
less permanent distortions of individual identities, but also as re- 
positories of genetic, epigenetic, environmental, and socio-cultural 
factors in the construction of such identities [3-5]. Thanks to the 
rapidly developing field of molecular genetics, PD phenotypes may 
assist in better aetio-pathogenic, and therefore diagnostic, and 
epidemiological estimates [6]. Furthermore, in many modern ap- 
proaches to the delineation of PDs, basic personality features or 
traits and personality functioning levels are the departure point of 
thorough and comprehensive clinical evaluations [7]. Different stat- 
istical methods and procedures (from multi-dimensional scaling to 
factor analysis through correlational analyses, to cite a few) assist 
in the structuring of PDs and the controversial acceptance of their 
three superordinate clusters in the Diagnostic and Statistical Manual 
of Mental Disorders, fifth edition (DSM-5). 

Historically, the maladaptive variants of normal personality 
traits, grouped in the well-known Five-Factor Model [8], provide 
a particularly compelling tool for the identification and assess- 
ment of the 11 DSM-IV’s axis II PDs, still present in DSM-5 [1]; in 
spite of the unevenness of studies about these factors (many more 
on neuroticism and extraversion than on openness, agreeable- 
ness, and conscientiousness related to former axis I disorders) [9], 
it is clear that personality can decisively influence much broader 

















psychopathologies and, secondarily, epidemiological, follow-up, 
and outcome inquiries. Intrapersonal and interpersonal manifest- 
ations of PDs differ, and informant reports may be more valid than 
self-reports in both clinical evaluations and prevalence surveys, 
adding another important methodological layer to research in both 
areas. Higher-order internalizing and externalizing structural fac- 
tors in different psychopathologies were correlated with 30 latent 
personality facets in a study by Uliaszek and Zinbarg [10]. An ele- 
vation of the neuroticism facets, along with low positive emotions, 
low actions, and low competence, was found for the higher-order 
internalizing factor entities (that is, major depression), whereas the 
higher-order externalizing ones were negatively associated with 
most conscientiousness and agreeableness factors and with an ele- 
vation of excitement-seeking impulsivity and angry hostility. These 
findings constitute a call for active search of PDs in different types of 
studies, epidemiological surveys included. In turn, environmental 
characteristics (including interpersonal transactions such as per- 
ceived discrimination) must be assessed in every type of study, as 
they have shown to be a main social pathway which, in addition to 
biological and developmental routes, is crucial for the establishment 
of the adult personality type and its detection and prognosis [11-12]. 

This section will examine, first, general epidemiological data on 
PDs through studies from different regions of the world, following 
immediately with studies on the prevalence and incidence of each of 
the PD types identified in the main world nomenclatures. Emphasis 
on the role and significance of socio-demographic variables, family 
history, and relevant comorbidities will precede the clinical course 
subsection, similarly focused on each PD type, with and without 
treatment, on pertinent risk and protective factors, and on their 
short- and long-term prognostic implications. Topics such as re- 
lapses and chronicity patterns will also be examined. 






































The considerable heterogeneity and poor discriminant validity 
of DSM’s PDs impede accurate diagnoses and therefore can make 
epidemiological findings questionable. Yun et al. [13] proposed 
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alternative assessment tools such as Kernberg’s object relational 
model [14], on the basis of which develops a finite mixture mod- 
elling that harboured three components: Group 1, characterized by 
low levels of antisocial, paranoid, and aggressive features; Group 2, 
by elevated paranoid features; and Group 3, with the highest levels 
across the three variables. Various external measures supported the 
validity of this grouping structure, thus offering a substantive aid 
in refining PD classification and conferring additional solidity to 
prevalence, incidence, and clinical course studies. 

Different clinical populations have been used in attempts to delin- 
eate more precisely normal personality types and actual PDs. This 
helps to consider pathogenic circumstances, that is, situations that 
contribute to the evolvement of actual psychopathologies. For in- 
stance, coping styles (relevant personality-based features) and treat- 
ment preferences were evaluated by Contractor et al. [15] in 1266 
trauma-exposed military veterans (89.6% male). The results were 
revealing: (1) post-traumatic stress disorder (PTSD)-free and emo- 
tionally stable probands were less likely to use self-distraction, denial, 
and substance consumption, and more likely to use active coping; 
(2) those less emotionally stable and with more re-experiencing and 
avoidance symptoms were less likely to use denial and active coping, 
but more likely to use behavioural disengagement; (3) those with 
subsyndromal PTSD were more likely to use self-distraction, ac- 
tive coping, and substance use, but less likely denial, instrumental 
coping, positive reframing, and acceptance; and (4) those with severe 
PTSD and combined internalizing-externalizing traits responded 
similarly to (3), but also used more substances, more denial, more 
emotional support, and less active coping. Interestingly, emotional 
stability was the most distinguishing feature, and all classes were 
more likely to seek mental health treatment. 

Similarly, the assessment of personality traits becomes an im- 
portant parameter for psychological evaluations among persons 
seeking cosmetic surgery. In a study from Iran, Golshani et al. [16], 
using three well-validated instruments, evaluated 274 randomly 
selected individuals seeking rhinoplasty, blepharoplasty, face/jaw 
implant, mammoplasty, and liposuction; a prevalence of 51% of psy- 
chiatric problems was found, with interpersonal sensitivity as the 
highest syndrome-like endorsed (the lowest, psychosis), and agree- 
ableness and extroversion as the highest scored features (the lowest, 
openness). 

Sexual problems (hypersexuality, excitation, and inhibition) 
constitute another area of discussions and studies of this kind. 
Rettenberger et al. [17] investigated the impact of different person- 
ality traits on data provided by two instruments in a sample of 1749 
participants in an Internet-based survey. Six per cent of the prob- 
ands were categorized as hypersexual, irrespective of gender and 
sexual orientation. Among male recidivist offenders, Barbosa et al. 
[18], based on the signal detection theory, evaluated sensitivity to 
emotional arousal and valence induced by pictures; offenders re- 
ported higher arousal than controls and showed lower sensitivity to 
changes in levels of arousal, but no differences vis-a-vis controls for 
valence. This modality of research broadens the narrow typological 
taxonomy and allows indirect, but valid, assessment of PDs in spe- 
cial populations. 

Finally, issues such as reality-testing, developmental phases of the 
life cycle, and traumatic experiences make prevalence and incidence 
studies of PDs a difficult enterprise. The associations between PDs 
and other conditions have shown different degrees of consistency 


when epidemiologically assessed by established nomenclature clas- 
sifications. It can be said that the pervasive and relatively inflexible 
pattern of behaviours that reflect the individual's predominant mode 
of being is, in itself, a formidable challenge for epidemiological, field, 
and clinical studies. 


General epidemiology 


The instruments used in epidemiological studies of personality and 
other disorders are a critical component of this area of knowledge. 
In addition to the questionnaires used in the first historical studies, 
a multitude of instruments reflecting different modalities of con- 
ceptualization (categorical and dimensional, and among the latter, 
interpersonal, clinical, bio-psychological, etc.) have been developed. 
Names of authors such as Goldberg, Fonagy, Widiger, Costa and 
McCrae, Cloninger, Millon, Gunderson, Livesley, Silver and Davies, 
Oldham and Skodol, Krueger, and Clark provide a sound historical 
background for this field of inquiry [3]. 

There is some variability in the estimation of general prevalence 
of PDs in the general population. Torgersen et al. [19] examined the 
issue in depth, identifying different components of any epidemio- 
logical search (that is, sample selection) that impact the final esti- 
mates. In eight prevalence studies including all PDs and covering 
a total of 5081 persons, with >500 as the average sample size, the 
findings ranked between 3.9% and 22.7%, but in small samples, the 
variation was smaller between 10.0% and 14.3%. The median preva- 
lence of all studies was 11.55%, and the pooled prevalence 12.26% 
[20]. These data are consistent with the major large, representative 
national epidemiological studies, including those from Great Britain 
[21], which estimated an overall prevalence of 10.1%, and the US 
National Comorbidity Survey Replication [22] with a rate of 9.1%. 


Prevalence data 


Grant et al’s analysis of the National Epidemiological Survey on 
Alcohol and Related Conditions (NESARC) [23] presented na- 
tionally representative data (from a sample of 43,093 subjects) on 
the prevalence, socio-demographic correlates, and disability of 
seven of the ten DSM-IV PDs; overall, 14.8% of adult Americans 
(or 30.8 million) had at least one personality disorder. Another re- 
port of the same study found rates of PDs at the high end of previ- 
ously published ranges (21.5% overall) [24]. Test—retest reliabilities 
between wave 1 (2001-2002) and wave 2 data (2004-2005), using 
the Alcohol Use Disorder and Associated Disabilities Interview 
Schedule-I[V (AUDADIS-IV) were, however, low for some types 
[that is, 0.40 for histrionic PD (HPD)] [25]. Some have criticized 
the diagnostic approach, as the study required only one of the iden- 
tified PD symptoms to have an association with distress or impair- 
ment. A re-analysis of the data that required an association with 
distress or impairment for each of the symptoms used to make the 
PD diagnosis [for example, five of nine borderline personality dis- 
order (BPD) symptoms] yielded rates more in keeping with other 
samples (9.1%) [26]. 

As expected, prevalence figures increased up to 50% in studies 
conducted within specific clinical settings [20]. Widiger and Rojas 
[27] pointed out that the prevalence of PDs is generally under- 
estimated in clinical practice due, in part, to the previously men- 
tioned deficiencies in a comprehensive clinical assessment of the 
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corresponding symptomatology. Moreover, Beckwith et al. [28] 
conducted a systematic literature review of studies measuring the 
prevalence of PDs in community secondary care settings and iden- 
tified only nine out of 269 papers deserving critical appraisal; they 
showed a high level of heterogeneity with regard to methods, in- 
clusion criteria, source of information, times of assessment, instru- 
ments used, and overall quality of research. Prevalence estimates in 
Europe varied between 40% and 92%, showed a closer range (45- 
51%) in the United States, and differed significantly in two Asian 
studies (1.07% in India and 60% in Pakistan). The low prevalence in 
India confirmed a previous finding by Gupta and Mattoo [29], but 
once again, weak methodology (case records study) may have gen- 
erated questionable results. 


Socio-demographic variables 


It is generally accepted that the prevalence rates are higher in urban 
populations and in lower socio-economic groups. Gender is prob- 
ably the most controversial topic in this area. Some studies con- 
cluded that the prevalence levels of PDs in men and women are 
roughly equivalent, with some disorders more frequently present in 
each gender, whereas others seemed to indicate that only one type 
(borderline) is more prevalent among women. A variety of recom- 
mendations to reduce and avoid sampling, assessment, and criteria 
biases have been made. The differential socialization and life experi- 
ences of males and females have been used to explain the differences 
in personality and PDs between males’ and females’ innate behav- 
ioural tendencies, without excluding neurobiological (particularly 
physiological and endocrinological) factors. Nevertheless, the gen- 
eral agreement is that more than one in ten adult individuals has 
a PD. The average prevalence of specific PDs is a little above 1%, 
with variations per type. It is also important to remark that these 
correlations neither display one-directional causal relationships nor 
account for the total nature and characteristics of the individual's 
interpersonal experiences [30]. 

According to DSM-5 [1], features of a PD must have been pre- 
sent for at least 1 year if the diagnosis is to be made before age 18. 
Apparently, individuals with odd/eccentric PDs are older, and 
the dramatic/emotional trait dimensions decrease with age [19]. 
Subjects with PDs have more often been separated, unmarried, or 
divorced, even though there is agreement in that the real effect of 
marital status on PD occurrence is difficult to determine. Similarly, 
the trend of a negative correlation between PD prevalence and edu- 
cational and income status is not uniformly agreed upon. Urban 
location seems more often linked to PDs, with subsequent lower 
quality of life and higher dysfunctional level [31]. Last, but not 
least, the few studies on PDs and ethnicity show methodological 
weaknesses, as analysed by McGilloway et al. [32]. Meta-analyses 
revealed significant differences in prevalence of PDs between black 
and white groups, but no differences between Asian or Hispanic 
groups compared with white groups; these heterogenous results may 
be accounted for by methodological characteristics, ranging from a 
true lower prevalence among black patients to a clear neglect of PD 
diagnoses among other ethnic groups. 


Risk and protective factors 


We have touched also on risk and protective factors as important 
variables in the clinical evolvement of PDs and their epidemiological 
correlates. While the genetic load seems to be unquestionable, 


particularly, for example, in the DSM-5’s Cluster A types [33], epi- 
genetic variations and environmental factors, such as child-rearing 
issues, intra-family relationships, parental styles, educational mi- 
lieus, and opportunities or cultural and religious concerns, can play 
either a risk or a protective role, depending on the kind, orientation, 
and weight of each in everyday life [4]. Childhood and adolescence 
are considered critical periods in the developmental cycle of every 
individual. A balanced use of modelling, independence fostering, 
decision-making, and social/relational skills will do a lot to favour 
a normal personality development. Contrariwise, gender-based 
violence is associated with at least 38% of all PD cases, and women 
who have experienced it have 8.5 times the odds of PDs, compared 
to non-victims [34]. Needless to say, the assessment of these fac- 
tors is decisive in the epidemiological and clinical evaluation of the 
disorders. 


Comorbidities 


Another heatedly debated topic is that of the correlation (or coex- 
istence, although some may call it comorbidity) between PDs and 
other psychiatric conditions—the main point of contention is the 
cause-effect equation, which is first, and how one can lead into the 
other. It must be clear that this type of studies does not refer to the 
clinical setting in which they take place; actually, most of them are 
community-based projects. A third phase of the NESARC study 
[23], conducted in 36,309 adults, found significant associations of 
12-month and lifetime prevalence of drug use disorders (DUD) with 
a variety of PDs, including antisocial, avoidant, and borderline. In 
Denmark, Toftdahl et al. [35] found that 46% of 463,003 substance 
use disorder (SUD) patients had diagnoses of PDs. 

A study in France [36] found PDs (independent of clusters) as 
significant risk factors for generalized anxiety disorder, and Cluster 
C types for major depressive disorder (MDD), in non-metastatic 
breast cancer patients. In a population-based 37-year follow-up 
cohort study of >29,000 Swedish Twin Registry members, Sieurin 
et al. [37] found neuroticism and introversion strongly associated 
with a risk of Parkinson's disease; smoking was a significant medi- 
ator in the relationship between personality traits and Parkinson's 
disease, partly accounting for the effect of introversion, while being 
a suppressor for the effect of neuroticism. Higher-than-normal 
body weight was associated with paranoid, antisocial, and avoidant 
PDs among women, whereas overweight men had lower rates of 
paranoid PD (PPD) and underweight women had higher odds of 
schizoid PD (SPD) [38]. An intriguing finding—among college men 
with purging-type eating disorders, symptoms were associated with 
constellations of personality traits that are typically reported among 
women [39]. The obvious conclusion is that, in general, PDs are 
highly associated with a variety of disabilities in different periods of 
the life cycle. 

Thus, it is evident that personality and its temperamental and 
characterological components are always in the background of 
normal and ill people; among the latter, early and precise identi- 
fication of such features is crucial as they can contribute, in many 
cases, to increased levels of severity and to a complicated clinical 
course of the comorbid condition. Of interest, the trait model of 
personality pathology in DSM-5 Section III [1] has been success- 
fully applied to problematic alcohol use cases, with antagonism and 
disinhibition as the most relevant domains. It has been suggested 
that throughout life, higher coercion experiences are associated 
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with lower understanding and lower reasoning which, in turn, de- 
crease the social engagement of individuals with severe PDs [40]. 
On the other hand, the course of PDs themselves will depend on 
the timing and scope of the professional intervention. In general, 
an early diagnosis and appropriate therapeutic interventions can 
certainly be effective, improve the patients quality of life, and dis- 
rupt the chronicity pathway characteristic of untreated conditions; 
the pattern is similar in the case of comorbid medical or psychi- 
atric pictures. 


Special epidemiology, comorbidities, and 





clinical course 


Primarily referred to data about each of the more or less estab- 
lished PDs in modern psychiatry, special epidemiology integrates 
concomitant issues such as comorbidities and risk and protective 
factors. At the same time, it offers much needed information for 
the formulation and understanding of the clinical course, prog- 
nosis, and outcomes of each of the disorders [41, 42]. Tables 119.1 
and 119.2 summarize the information from this and the previous 
section. 


Paranoid personality disorder 
Prevalence 


Although distrust and suspiciousness, with subsequent misinter- 
pretation of others’ behaviours, may be present in 13% of adult males 
and 6% of females [8], the actual DSM-5 criteria for PPD seem to 
be met only by a population range of between 0.4% and 4.4%, ac- 
cording to a variety of prevalence studies [1], with a median rate of 
1.6% [27]. In inpatient clinical settings, the range may increase to 
10-30%, while in ambulatory services, it may oscillate between 2% 
and 10%. It may be worth mentioning that some studies show a 0% 
prevalence of PPD, which raises several of the questions examined 
in previous pages. 


Table 119.1 Main epidemiological data about personality disorders 


PD type Prevalence Avg community | Clinical settings (%) 
range (%) (A) 
All 





39-227. 11.6 40.0-51.0 
Paranoid 04-44 1.6 Inpt 10-30; outpt 2-10 
Schizoid 0.4-4.9 0.6 Inpt 16 
Schizotypal 0.1-5.6 0.8 Inpt 2-20 
Antisocial 0.2-4.5 1.3 Outpt 3-30 
Borderline 0259 I Inpt 20; Outpt 6.0 
Histrionic 0.3-3.9 1.6 Outpt 10-15 
Narcissistic Qi 0.5 Outpt 6.2 
Avoidant 1.0-6.0 17 Outpt 6.0 
Dependent 0.1-11.0 0.9 
Obsessive- ZTS) 2, General 10.0-20.0 
compulsive 


Inpt, inpatient; outpt, outpatient. 
Main sources: [41, 42]. 


Comorbidities 


It has been demonstrated that the phenomenological structure of 
PPD can share some of its features with the other Cluster A types, 
all of Cluster B’s, and Cluster C’s avoidant PD (AvPD). The clin- 
ician must have a clear perspective about the global nature of the 
trait combinations, that is, the distinction between the affective dis- 
tance and eccentricity of the schizoid or avoidant PD’s fear of being 
ashamed or overwhelmed vs the hypervigilance or suspiciousness of 
PPD. Conversely, a careful evaluation must be pursued to rule out 
predictable ‘paranoid-like’ behaviours in members of immigrant, 
ethnic, or political groups. 

Other forms of psychopathology may occur simultaneously (or 
more frequently) in individuals with PPD: psychotic conditions 
such as delusional disorder, schizophrenia, depressive and manic 
episodes, alcohol and substance dependence and abuse, criminal 
behaviours, and obsessive-compulsive, agoraphobic, and depressive 
disorders [43, 44]. 


Clinical course 


During childhood and early adolescence, PPD features such as 
hypersensitivity, social anxiety, low school performance, and poor 
peer relationships can elicit bullying from other children, thus ac- 
centuating paranoid attitudes. Brief psychotic episodes have been 
reported in connection with stressful situations throughout the life 
of PPD patients. A consistent employment history is relatively in- 
frequent, mostly due to hyperdefensiveness, litigiousness, and an- 
tagonistic, or even violent, behaviour. Last, but not least, clinicians 
must not forget the premonitory nature of PPD features in patients 
that later may develop psychoses such as schizophrenia. 


Schizoid personality disorder 
Prevalence 


With two studies showing a 0% prevalence, the range in all the 
others goes from 0.4% to 4.9% and one study showed 7%, but the 
median in several tables is around 0.65%. Additional contradictions 
emerge from the statement that SPD is considered ‘uncommon’ in 
clinical settings, at the time that the same source (DSM-5) included 
prevalence levels of 4.9 and 3.1%, from Part II of the Comorbidity 
Survey-Replication (CMS-R) and NESARC samples, respectively 
[1]; in other clinical samples, the prevalence reaches 16%. Similar 
disagreements exist about the prevalence of SPD by gender [45, 46]. 


Comorbidities 


There is limited documentation of comorbidity between SPD 
and major psychotic conditions, probably due to difficulties in 
differentiating the clinical presentations of both kinds of disorders. 
Actually, some authors have found little or no relationship between 
SPD and schizophrenia when family history or clinical course were 
compared. The same occurs in cases of mild autism spectrum dis- 
order that may co-occur with SPD, even though in the majority of 
cases, the former may be installed earlier than SPD could be de- 
tected. These problems multiply when a trait distinction is attempted 
between SPD and schizotypal and avoidant PDs. 


Clinical course 


Usually long-lasting, but not necessarily life-long, SPD courses in 
a more or less ‘stable’ fashion, seldom exhibiting rapid or profound 
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Table 119.2 Comorbidities and clinical course of personality disorders 








All Other PDs; alcohol and SUDs; mood and anxiety disorders; 
eating disorders; medical illnesses 

Paranoid Clusters A and B and AvPD; schizophrenia and other psychoses; 
bipolar disorder; SUDs; OCD 

Schizoid Cluster A PDs; mood and phobic disorders; autism spectrum 
disorder 

Schizotypal Cluster A PDs; AvPD and BPD; brief psychotic episode; MDD, 


OCD, generalized anxiety disorder 





Antisocial Cluster B PDs; PPD; alcohol and SUDs; hypochondriasis; 
gambling; ADHD 








Borderline DD, bipolar disorder, PTSD, SUDs, generalized anxiety 
disorder, and panic disorder 

Histrionic Cluster B PDs and DPD; somatization, conversion and 
dissociative disorders; brief reactive psychosis 

Narcissistic SUDs; anxiety disorders; bipolar disorder; BPD and AsPD 

Avoidant SAD, DPD 

Dependent AvPD; anxiety disorders; depressive mood 

Obsessive- Anxiety, mood and obsessive-compulsive disorders; Cluster C 

compulsive PDs 


changes. This does not mean that patients will not have difficulties in 
different settings—their more or less severe social ostracism makes 
them have few close relationships, including sexual partners, and 
occasionally become homeless. If married and having children, they 
have limited capacity to provide warm, supportive, or endearing af- 
fection. Clinical complications, in addition to possible psychotic oc- 
currences, include mood, phobic, and behavioural disorders. On the 
other hand, and differing from the other Cluster A PDs, a relatively 
good number of SPD patients, particularly those with mild mani- 
festations, may reach reasonable levels of social adaptation, work, 
and quality of life [45]. 


Schizotypal personality disorder (StPD) 
Prevalence 


Studies that have found prevalent schizotypal personality dis- 
order (StPD) showed a lowest level of 0.1% and a highest of 5.6% 
in community populations, with a median of 0.7-0.9%. In clinical 
populations, prevalence levels of between 2% and 20% have been 
found, and 14.6% among relatives of schizophrenic patients [46, 47], 
particularly those with a history of childhood onset and among 
monozygotic twins. According to Kendler et al. [33], StPD most 
closely reflects the genetic and environmental liability common to 
all three Cluster A disorders. No gender distinction appears to be 
unequivocal, but there seems to be more male than female StPD 
patients. 


Comorbidities 


About 30% of StPD patients present a comorbid pattern with SPD 
(with introversion as the common dominant feature), and 60% with 
PPD (with suspiciousness as the most shared trait). AvPD and BPD 


Strong tendency to chronicity 
Bullying victims, pre-psychotic states, unstable employment 
Marital and relational difficulties; reasonable adaptation 


More among men; social isolation; 10% suicidal behaviour; marginal 
employment; no frequent psychoses 


More among men; family dysfunction; unemployment; conduct disorder 
and law/forensic problems; criminal behaviour; frequent use of medical 
services; some improvement with ageing 


Self-mutilation episodes and suicidal behaviour; 80% remission by 
10 years, 40% with high functioning; infrequent relapses, but persistent 
social and occupational issues 


More among women; problems since adolescence; suicidal attempts; 
occupational fragility; frequent use of medical services; improves with 
age 

60% improve in 3 years; connection with evolution of comorbidities; 
prognosis worsens with AsPD 

Variations in early periods; diagnostic ‘stability’, many patients remain 
single 

Few studies; limited reduction of symptoms 


More among men; small long-term improvement of core symptoms 


are also frequently encountered in patients with StPD. The joint pres- 
ence of paranoid, schizotypal, schizoid, and avoidant personalities 
among first-degree relatives of schizophrenic patients has led to the 
notion of schizophrenia-spectrum disorder and the identification 
of such PDs as psychometric high-risk groups [48]. In cases, these 
patients may be ‘in and out of brief psychotic episodes or present 
premorbid schizophreniform disorder or schizophrenia. As relevant 
as this, DSM-5 [1] points out that approximately half of StPD pa- 
tients experience comorbid MDD and may also show evidence of 
obsessive-compulsive and generalized anxiety disorders [27]. 


Clinical course 


Like the other Cluster A PDs, StPD patients show their main symp- 
toms (social isolation, esoteric fantasies, peculiar thought, and 
language and school underachievement) early in childhood or ado- 
lescence. As adults, they may join in marginal groups with extreme 
socio-political or religious views which may increase their sense of 
alienation and interpersonal deterioration, a factor that can make 
them reach, according to some studies, up to 10% of suicidal behav- 
iour. They do not seek treatment for their PD, but rather for asso- 
ciated depression, dysphoria, and/or anxiety. Nevertheless, only a 
minority falls into schizophrenia or other psychotic disorders, and 
a good number of patients remain marginally employed throughout 
their active life period [42]. 


Antisocial personality disorder 


Prevalence 


One of the two most studied PDs, the prevalence figures of anti- 
social personality disorder (AsPD) do not escape the levels of de- 
bate or disagreement seen in other types, in spite of large-size 
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studies and more sophisticated evaluation instruments. The highest 
incidence figures are seen in late adolescence and early adulthood. 
The range in community samples goes from 0.2% to 4.5%, with a 
median of between 0.9% and 1.7%, while figures in clinical popu- 
lations (including prisons or other forensic settings) vary from 3% 
to 30%. High levels of sensation-seeking and low anxiety sensitivity 
indicate an increased risk for substance misuse in incarcerated male 
offenders [49]. In most studies, the role of socio-environmental fac- 
tors has been elucidated: more often seen in men than in women 
(rates of 3:1) and among people of younger ages; living in urban, 
but deprived, areas [ergo, low socio-economic status (SES) ]; mem- 
bers of dysfunctional families; and history of trauma, victimization, 
and neglect during childhood. AsPD patients also have been found 
to be frequent users of medical services [47]. All-cause mortality is 
twice as high among those imprisoned [50]. A genetic component 
is strongly considered by the majority of researchers [6, 33, 42, 51]. 


Comorbidities 


In addition to a relatively frequent co-occurrence of other Cluster 
B PDs like borderline, narcissistic, and histrionic, AsPD patients 
show frequent comorbidity with PPD, alcoholism, hypochondriac 
behaviour, problem-gambling, and depression [42, 52], the latter 
exhibiting predominantly atypical features. Substance abuse may 
coexist with AsPD since childhood, and antisocial behaviour may 
be secondary to premorbid alcoholism; this association relies on 
underlying dimensions of impulsivity or externalization [53, 54]. 
Men with attention-deficit/hyperactivity disorder (ADHD) and 
patterns of use of nicotine, cannabis, and other illicit drugs since 
early age also show high presence of AsPD [49], and almost 15% 
of patients on methadone maintenance therapy fulfil the diagnostic 
criteria [55]. 


Clinical course 


AsPD is probably one of the most complicated and clinically 
fascinating conditions, even when looking back at the childhood life 
period of the patients—conduct disorder diagnoses frequently pre- 
cede AsPD detection by the second decade of life. Children with a 
history of severe conduct problems and callous-unemotional traits 
(that is, with limited prosocial emotions) can be identified by their 
peers as early as their preschool years [56] and are high on fearless- 
ness but, like those on the other extreme of the same scales, may have 
the same antisocial phenotypic outcome. AsPD cannot be diagnosed 
before age 18, as its typical behavioural manifestations are most pro- 
nounced in early adult years [57]. In addition to serious problems 
with the law, other sequelae, such as unemployment, poverty, home- 
lessness, risky driving, weapon-carrying, and increasingly brutal 
actions in and out of numerous and fragile marital relationships 
[56-58], may end up in a shorter life course. Non-domestic homi- 
cide offenders are four times more frequent users of firearms, and 
twice more likely than spontaneous domestic homicide’s to be diag- 
nosed as AsPD [58]. 

It is possible that AsPD behaviours may either gradually decrease 
with age in some patients, or a subgroup of them may be able to 
conduct seemingly acceptable social and occupational lives; thus, re- 
duction or remission of criminal behaviour and decrease of the full 
spectrum of antisocial postures may occur around the fourth decade 
of life. On average, 20 years after the diagnosis is made, 12% of the 
patients are symptom-free, 27% show a notable improvement, but 


60% remain unchanged [46]. Interestingly, the appearance of other 
conditions such as depression or hypochondriasis correlates with 
abandonment of rage and aggression as interpersonal life strategies, 
reflecting and fostering personality maturation. 


Borderline personality disorder 
Prevalence 


According to DSM-5, the median population prevalence of BPD is 
estimated to be 1.6%, but other sources cite a range of 0.2-5.9%. The 
high end of this range is from the NESARC study [59], which may 
be inflated by oversampling young adults (prevalence decreases with 
age), thus requiring less robust associations with impairment. The 
samples have been more numerous than with any other PDs, and the 
clinical correlates with a variety of other conditions are abundant. 
The disorder is significantly more common among females, par- 
ticularly widowed and unmarried women (though less clearly so in 
community samples), and its frequency increases in younger, non- 
white, urban, and poorer respondents. The presence of BPD patients 
in primary care settings seems to be about 6%, about 10% in out- 
patient mental health clinics, and 20% in inpatient psychiatric units 
[60]. A substantial genetic basis, 18-35 years as age of onset, and de- 
creased incidence among older individuals are additional findings. 


Comorbidities 


BPD is highly comorbid with a number of other disorders. Up to 
75% of patients have comorbid major depression [61], and patients 
with BPD report MDD symptoms that are subjectively more severe 
and more interpersonally driven [62]. Even though the presence of 
MDD does not impact the accuracy of the BPD diagnosis, the latter's 
symptoms are less responsive to treatment with drugs for depression 
and electroconvulsive therapy (ECT). On the other hand, bipolar 
disorder is overrepresented in BPD. In two large prospective studies 
of BPD, bipolar disorder co-occurred in 15-20% of patients [12% bi- 
polar disorder type I (BPI) and 8% bipolar type II (BPII) in one study 
[61]; 1% BPI and 14% BPII in the other [63], though BPI was initially 
excluded from this second cohort). 

PTSD co-occurs with BPD at rates of between 17% and 50%, 
depending on sample and methodology, whereas BPD is found in 7- 
25% of people with PTSD [22, 26, 59, 64, 65]. Substance use and anxiety 
disorders (including panic disorder) are also quite common in BPD 
(upwards of 50%) and tend to remit when BPD symptoms do [60]. 


Clinical course 


Two large prospective, longitudinal studies inform most of what is 
known about the course and outcome of BPD. The McLean Study 
of Adult Development (MSAD), led by Zanarini [63, 66], followed 
290 former inpatients for about 20 years; all met rigorous diag- 
nostic criteria for study entry. The second study—the Collaborative 
Longitudinal Personality Disorders Study (CLPDS)—led by 
Gunderson [65] was a multi-site, more socio-economically repre- 
sentative inquiry and followed 175 patients with BPD (as well as 
patients with MDD and other PDs). Both studies demonstrated sur- 
prising levels of symptom remission. In fact, by the end of 10 years 
of follow-up, >80% of each sample no longer met diagnostic cri- 
teria. Moreover, when symptomatic remission was achieved, re- 
lapse was rare—cumulatively 12% in CLPDS and 15% in the MSAD 
study. Nonetheless, severe and persistent impairment in social 
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and occupational functioning remained, with only a third of pa- 
tients obtaining full-time employment by the end of 10 years and 
only 40% of previously high-functioning patients regaining that 
performance level. 


Histrionic personality disorder 
Prevalence 


Within ranges of 0.3-3.9% in general populations (median between 
1.5% and 1.8%) and 10-15% in clinical settings, HPD is seen more 
frequently among women, particularly separated and divorced, but 
without other socio-demographic differences. This disorder seems 
to respond more to cultural aetio-pathogenic factors (that is, sex 
role stereotypes, emotional expressiveness, etc.), while some au- 
thors suggest that HPD in women is genotypically linked to AsPD 
in men [42, 45]. 


Comorbidities 


Overlapping HPD cases and those of other clusters is a relatively 
frequent occurrence; borderline, narcissistic, and antisocial from 
Cluster B, and dependent from Cluster C are the most prevalent. As 
the features of the different disorders can be combined, once again, 
a clinically sound differential diagnosis must converge with a thor- 
ough assessment of authentic comorbidity. The most difficult com- 
parison is with narcissistic personality disorder (NPD) and AsPD. 
In the former, the main distinction has to do with the narcissist’s 
attention- and superiority-seeking vs the histrionic’s inconsistent 
and dependency-seeking interpersonal transactions; about the 
latter, although both types show impulsive, manipulative, and se- 
ductive behaviours, the histrionic manipulates to capture others’ at- 
tention and does not necessarily break the law to reach it, whereas 
the antisocial individual seduces others in search of absolute au- 
thoritarian power frequently nourished by illegal manoeuvres. 

The evidence in favour of HPD’s comorbidity with other psychi- 
atric disorders is greater for somatization, conversion, and dissocia- 
tive disorders, besides brief reactive psychosis and a few cases of 
hypomanic and manic states; the episodic nature of these, though, 
leads to a differential diagnosis more than to a real comorbidity 
process [45]. 


Clinical course 


Blashfield et al. [67] characterized the HPD patient’s adolescent 
period as playful, flamboyant, flirtatious, and attention-seeking. 
During their early adult life, superficiality, rapidly changing and in- 
consistent interpersonal patterns, poor sense of responsibility, com- 
petitiveness, emotional instability, and suggestibility lead them to, at 
times, risky or dangerous connections, theatrical and melodramatic 
explosions, and occupational fragility. Obviously, concomitant bor- 
derline and antisocial disorders generate severe levels of personal 
dysfunctionality. In these and other contexts, depressive symptoms, 
suicide attempts, and frequent use of medical services are common. 
Ageing may become a factor of sobering changes in the clinical 
course of a number of HPD cases. 


Narcissistic personality disorder 
Prevalence 


Perhaps the most debatable and debated PD, with serious consider- 
ations of its elimination from DSM-5 due to reasons such as limited 


credible research, excessive theoretical/ideological considerations, 
very low prevalence figures, and more dimensional or trait-related 
coverage and pertinence [68], NPD is, at best, a rich source of dia- 
lectics or a window into the relationship between self-esteem and 
emotional regulation [69] and, at worst, a questionable clinical dis- 
order. It is the PD with the most 0% prevalence figures in the epi- 
demiological literature (eight of 16), and with levels as low as 0.1% 
among general populations. It is commonly cited as having a rate of 
0.5% [19-21], with a range of up to 4.4% [70] or a surprisingly high 
6.2% in the NESARC study [71]. More frequently diagnosed in some 
female clinical populations, this gender difference becomes irrele- 
vant, however, in most epidemiological inquiries. 


Comorbidities 


In the NESARC data, NPD was strongly associated with SUD (40% 
12-month prevalence, 64% lifetime) and anxiety disorders (40% 12- 
month prevalence, 55% lifetime) [71]. Rates of NPD in BPI (24% 
lifetime) and BPI in BPD (20% lifetime) were surprisingly high, per- 
haps owing to the overlaps in constructs of grandiosity. Given the 
prevalence data being out of step with most other studies, interpret- 
ation or application of these results warrant caution. Finally, NPD 
can co-occur with other PDs, most commonly BPD. Estimates are 
that 15% of patients with BPD have NPD, and 25% of patients with 
NPD have BPD [60]. 


Clinical course 


There are strikingly limited data about the course of NPD. One 3- 
year prospective study found that 60% of a sample of 20 patients im- 
proved over 3 years and that the remaining 40% showed no change; 
improvement was associated with achievements and new relation- 
ships, whereas disillusionments and remaining symptomatology 
were predicted by high narcissism in interpersonal relationships 
[72]. Furthermore, NPD worsens the course of comorbid BPD but 
will improve when BPD symptoms do, and comorbidity with AsPD 
worsens the prognosis of NPD [60]. 


Avoidant personality disorder 
Prevalence 


The clinical expression of an intense sensitivity to rejection and criti- 
cism with resulting social isolation, AvPD was first introduced as an 
independent entity in DSM-III. It is considered the most prevalent 
PD type in clinical populations, with figures of up to 5.7% in at least 
one epidemiological study [70], but still above 5% in two other sur- 
veys, a rate of 1.7% in general populations [20] and much higher 
in psychiatric outpatients. Forty-five per cent of people with AvPD 
are single (never married) [19, 20], but no strong gender differences 
have been found [73]. 


Comorbidities 


Social anxiety disorder (SAD) is both the most common comorbid 
condition with AvPD and the most common diagnostic confounder. 
Co-occurrence with SAD is seen in 40% of patients with AvPD, and 
in patients with SAD, AvPD can be diagnosed in 46% [74]. Social 
fears are more associated with self-concept AvPD and therefore are 
more pervasive and interfere with more life domains [75]. High 
comorbidity with DPD is noted, and a connection with schizoid PD 
has also been postulated [76]. 
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Clinical course 


While carefully designed long-term studies of AvPD are rare, data do 
support the relative stability of the symptoms over time. An analysis 
of the CLPDS data showed that in a 10-year follow-up of AvPD pa- 
tients, diagnostic stability was reasonably robust from years 6 through 
10 (standardized effect 0.79 in men and 0.81 in women), with more 
variability earlier in the course [77]. It also seems that genetic factors 
contribute most robustly to AvPD’s phenotypic stability [78]. 


Dependent personality disorder 
Prevalence 


In spite of relatively small numbers, prevalence data of DPD in dif- 
ferent studies report significant oscillations (0.1-11%), with a me- 
dian of between 0.5% and 1.4% and some predominance among 
women [21, 23, 78, 79]. This PD may be one of the most affected by 
cultural factors, as pointed out by Carrasco and Lecic-Tosevski [80], 
due to patterns of passivity, politeness, and submissiveness normally 
dominant and accepted in some societies. 


Comorbidities 


DPD has reasonably high comorbidity with AvPD, as outlined [75]. 
It is also commonly thought to have high comorbidity with anxiety 
disorders, but meta-analytic data have questioned this assumption 
by reporting a modest association (r = 0.11) [81]. The submissive 
nature of the disorder may contribute to an ambiguous presence of 
a comorbid anxiety, whereas reports of depressed mood are more 
common. 


Clinical course 


The clinical course of DPD is very poorly studied, a major gap in the 
epidemiological literature on PDs. Indirect evidence for diagnostic 
stability (vs its emergence in older age) comes from the fact that it only 
shows a 23.8% reduction in symptoms between ages 20 and 50 [82]. 


Obsessive-compulsive personality disorder 
Prevalence 


There seems to be agreement in that OCPD is, side by side with 
AvPD, one of the most prevalent PDs in the general population, with 
a range of 2.1-7.9% [1], depending in some cases on the precise- 
ness and specificity of the instruments utilized. In clinical popula- 
tions, prevalence may be as high as 10%. The most accepted median 
figure is 2.1%. Males may show five times higher rates than females, 
and the same happens among white, highly educated, married, 
older age, and employed subjects [83]. Clinical setting may be an- 
other differentiating factor, as the highest number of OCPD is seen 
in outpatient clinics and the lowest in inpatient units and private 
practice offices [27]. The issue of ‘adaptive (with a subtle or evident 
cultural input), not necessarily pathological, obsessive-compulsive 
personality traits has also been raised since the publication of the 
Epidemiologic Catchment Area (ECA) findings [41, 43, 68]. 


Comorbidities 


OCPD is commonly comorbid with anxiety and mood disorders, 
occurring at rates of over 20% in both groups of patients [84]. 
Among the PDs, OCPD co-occurs with the other Cluster C disorders 
(particularly AvPD), as well as PPD and StPD. While obsessive- 
compulsive disorder (OCD) is clearly a separate disorder, symmetry 


and hoarding symptoms can certainly co-occur, and the capacity to 
delay rewards contributes to this distinction [85]. 


Clinical course 


Data from the CLPDS documented no change in OCPD functional 
improvement over 2 years. Small treatment studies have shown 
improvement in core PD symptoms, as well as anxiety and depres- 
sion measures, but data are limited regarding long-term changes 
in OCPD. 


Other personality disorders 
Passive-aggressive (negativistic) personality disorder 


Initially considered in DSM-I as a ‘personality trait disturbance, this 
label was taken out of DSM-IV, and in DSM-5, it could be even- 
tually ascribed to the so-called ‘Personality disorder-trait specified’ 
(PD-TS) category included in Section III as a resource for clinicians 
to describe and identify types different from the generally accepted 
ones [86]. In European psychiatry, the label is more used and rec- 
ognized than in North America. According to studies, the general 
population prevalence ranges from 0.9 to 3.1, but in clinical settings, 
it moves up to 10% [80]. 


Additional PD types 


There are several additional labels scattered around the existing lit- 
erature. For example, some pre-DSM-5 classification instruments 
listed a depressive personality disorder, later incorporated to either 
‘Dysthymia, ‘Major/moderate depressive disorder, or other clinical 
entities. In a recent study, it was found mostly associated with in- 
attentive ADHD [87]. Similarly, self-defeating (masochistic) and sad- 
istic PDs, with low prevalence levels (0.8% and 0.2%, respectively), 
have been studied primarily by Scandinavian researchers [19] and in 
forensic populations. Finally, a mixed PD could not be absent from 
attempts at a comprehensive clinical assessment [1]. 

DSM-5 [1] groups a series of personality-related conditions under 
the rubric of ‘Personality change due to another medical condition’ (in 
the pastalso called ‘Organic PD’), with specifiers that identify different 
types (labile, disinhibited, aggressive, apathetic, paranoid, other, 
combined, and unspecified) causing significant distress or impair- 
ment in social, occupational, and other areas of functioning. In this 
context, the issue and actual existence of defined comorbidities make 
the distinction between a pre-existing PD and personality changes 
secondary to medical conditions a delicate and complex clinical 
exercise. Furthermore, a growing variety of medical diagnoses can 
cause this disruption: neurological (that is, Huntington's, Parkinson's, 
Alzheimer’s, and Pick’s diseases, epilepsy and interictal status, frontal 
lobe syndrome, head or central nervous system trauma), neoplastic, 
infectious (that is, HIV), endocrinological (that is, hypo- and hyper- 
thyroidism, hypo- and hyperadrenalism), and autoimmune condi- 
tions (that is, systemic lupus erythematosus). Thus, the clinical course 
of these variants is determined by the underlying cause and a subse- 
quent tendency to chronicity and gradual clinical worsening. 
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Substantial contributions have been made to understanding person- 
ality disorders through work in psychiatric genetics in a progres- 
sive way that can be summarized in five major steps. Initially twin, 
family, and adoption studies were conducted in order to test whether 
personality disorders were familial and heritable [1, 2], and also 
whether there was concordance for specific categorical diagnoses 
in families [3]. Personality disorders were found to be familial, but 
patterns of comorbidity within individuals and resemblance among 
relatives were found to be complex [4]. Personality disorders are not 
inherited as discrete entities, so there was a need to quantify the de- 
scriptive features of the disorders [5]. 

Secondly, reliable interviews were developed with checklists of 
features that quantified diagnosis, and the inheritance patterns of 
such descriptive features were studied. The number of features of 
disorders was found to be moderately heritable, and some progress 
was made to identify which features were common to all person- 
ality disorders and which to particular subtypes [6, 7]. However, 
there was still much overlap within individuals and within families, 
calling into serious doubt the reliance on categorical diagnosis [8]. 
Nevertheless, categorical diagnoses communicate much informa- 
tion concisely, so they continue to be popular with clinicians. 

Thirdly, the relationship between normal and abnormal person- 
ality traits were thoroughly studied using quantitative interviews 
and/or self-report inventories with many scales and subscales [9]. 
The structure of personality was found to be the same whether 
people came from the general population or from clinical sam- 
ples [10, 11], indicating that people with personality disorders or 
prominent abnormal personality traits were simple unusual con- 
figurations of traits that vary widely in the general population [12]. 
Extremes of individual traits or particular combinations of traits led 
to problems adapting in a healthy manner. In other words, what ap- 
peared to be fuzzy categories of disorder without discrete bound- 
aries could well be specified as specific combinations of extreme 
(high or low) scorers on multiple dimensions of personality [13]. At 
least seven dimensions of personality, as measured by various alter- 
native models, had some genes that were unique to each dimension, 
even though some systems used only 3-5 dimensions [14]. 

Fourthly, with the ability to measure personality and its disorders 
as multi-dimensional configurations or profiles of traits, it quickly 











became clear that the inheritance of the dimensions of person- 
ality was highly complex, with resemblance in families influenced 
by genetics, socio-cultural learning, and environmental experi- 
ences throughout the lifespan, even though such complexity had 
been well described much earlier [15, 16]. Personality is a person's 
characteristic way of learning about, and adapting to, his or her life 
experiences. Human beings have multiple systems of learning and 
memory—procedural (behavioural conditioning for temperament 
traits), semantic (learning facts and propositions), and autonoetic 
(self-aware learning or autobiographical memory ofa life narrative). 
Such observations facilitated the integration of genetic, biological, 
social, and cultural contributions to understanding personality and 
its disorders. Although twin studies often seemed to suggest that 
heritability of personality was about 50%, in fact, most of the herit- 
ability is due to complex interactions among many gene loci acting 
in concert, which is called epistasis [17, 18]. The average effects of 
individual genes on personality are weak and inconsistent until their 
interactions with other genetic variants and environmental events 
are taken into account [19, 20]. There is much gene-gene and gene- 
environment interaction involved in the development of healthy and 
unhealthy personalities [21-23]. As a result, the field was stuck for 
more than two decades with what was called the ‘missing’ or ‘hidden’ 
heritability problem—despite resemblance for personality in mono- 
zygotic (MZ) twins being very much greater than in dizygotic (DZ) 
twins and other relatives, most of the genetic variability could not 
be explained by the average effects of genetic variants despite sur- 
veying dense maps of the entire genome [24, 25]. Less than 1% of 
variability in personality was explained by the average effects of indi- 
vidual genes, even in huge samples of over 60,000 subjects collected 
by the Genetics of Personality Consortium [19, 26]. 

Fifthly, it has recently become possible to identify systematically 
sets of many genetic variants that influence the risk of personality 
disorders [24]. Using genome-wide association studies (GWAS) 
in moderate-sized samples with detailed genotyping of the whole 
genome, the hidden heritability was found distributed in networks of 
many genes acting in concert, rather than individually. We now have 
well-validated clustering methods that allow a thorough description 
of both the genetic and phenotypic architecture of personality [24, 
27, 28]. Most of the genome is expressed in the brain, and there is 
much regional variability in brain gene expression, as well as much 
individual variability in gene expression between individuals. The 
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genetic variants that influence personality are widespread across the 
genome, as illustrated later, and are found to regulate gene expres- 
sion that contributes, in particular, to variability in neural develop- 
ment and neuroplasticity. These recent discoveries open a rich and 
promising field for understanding the complex genotypic networks 
and pathways by which personality develops and is self-regulated in 
human beings. Its applicability to even small and moderate samples 
means that the pathway from genotype to phenotype can be charac- 
terized with depth and precision in ways that are impractical in very 
large samples [24]. 

In this chapter, the basic observations along this progression 
in understanding the genetics of personality disorders will be de- 
scribed and discussed to provide an overview of the contributions 
of genetics to the diagnosis and treatment of personality disorders. 


Genetic variation in risk of personality disorder is 


quantitative, not discrete 





Antisocial personality is the most consistently recognized and most 
extensively studied personality disorder. Family, twin, and adoption 
studies have all shown that liability to antisocial personality disorders 
varies quantitatively and is moderately heritable, regardless of methods 
of assessment and diagnosis. Adoption studies of psychopathy [29], 
antisocial personality [2, 30], and criminality [31, 32] all moderate her- 
itability of the quantitative risk of illness. The childhood experience of 
an unstable, hostile, or neglectful home acted synergistically to increase 
the risk of adult petty criminality or antisocial personality [30, 31]. MZ 
twins were much more often concordant for criminality than were DZ 
twins, with differences in prevalence according to gender and severity of 
impairment explaining the degree of familial loading or concordance in 
relatives in population-based samples for all of Denmark and its islands 
[33]. A meta-analysis of 51 twin and adoption studies estimated that 
there were moderate proportions of variance due to additive genetic 
influences (0.32), non-additive genetic influences (0.09), shared envir- 
onmental influences (0.16), and non-shared environmental influences 
(0.43) [34], but the additive genetic influences may have been inflated by 
the neglect of cultural inheritance and assortative mating [15]. 

Sex differences in prevalence in the general population have a sub- 
stantial impact on the degree of familial concordance [1, 33, 35]. For 
example, antisocial personality disorder is more rare in women than 
in men, and antisocial women carry a greater genetic loading (meas- 
ured by more antisocial relatives) than do antisocial men [36]. 

Such findings show that liability to personality and its disorders 
varies quantitatively, not as a discrete disease or a set of diseases. 
Nevertheless, categorical classifications have remained popular for 
use in clinical practice, even when clinicians are warned that the clin- 
ical syndromes used for classification should be reified into discrete 
diseases, as was done in DSM-IV. The quantitative nature of vari- 
ation in personality traits suggested the possibility that the number 
of diagnostic symptoms would be a good index of genetic risk. 


Resemblance for personality disorder categories 


and symptoms in twins 





Svenn Torgersen and his colleagues carried out a twin study of per- 
sonality disorders using categorical diagnoses [3]. They ascertained 


92 MZ and 129 DZ twin pairs, in which at least one proband had 
a diagnosis of personality disorder, and then classified subjects ac- 
cording to DSM clusters and categories. The concordances for any 
personality disorder were 58% for MZ pairs and 36% for DZ pairs, 
indicating moderate heritability (44%) (Table 120.1). When the 
diagnosis of personality disorder was definite, concordances for any 
personality disorder were 40% for MZ pairs and 29% for DZ pairs, 
which was highly significant (P <0.01). However, the results could 
not be explained by a single heritable factor for personality disorder 
in general; there were heritable influences that further differentiated 
personality disorder clusters (Table 120.1). Torgersen’s estimates of 
heritability for the clusters and specific categories were moderate 
(that is, between 40% and 60%), just as was found for quantitative 
measures of normal [16, 37] and abnormal personality traits [38]. 
The correlations between DZ pairs were usually less than half of those 
of MZ pairs, which suggests that gene-gene and gene-environment 
interactions are important for categorical diagnoses, as they are for 
personality dimensions in twins reared apart [17, 39-41]. 

Even stronger evidence for the moderate heritability of person- 
ality disorders has been provided by more recent studies of the in- 
heritance of the number of symptoms in each of the ten categories 
of personality disorders in DSM-IV, which is conserved officially in 
DSM-5. The sample included 1386 pairs of twins aged 19-35, as as- 
sessed by Structured Interview for DSM-IV Personality Disorders 
(SIDP-IV) interviews. The data were compiled from separate reports 
about the heritability of the number of symptoms of disorders in 
personality Cluster A [6], Cluster B [42], and Cluster C [7] (Table 
120.2). The total heritability of liability to the number of symptoms 
in each category varied from weak to moderate, ranging from 28% 
for paranoid personality symptoms to 38% for antisocial personality 
symptoms (‘Table 120.2). 

The unique heritability in Table 120.2 indicates the specificity of 
the genetic determinants of each disorder. Low specificity means 
that the genetic determinants are non-specific for various diagnoses, 
so that there is much overlap of diagnostic features in the same in- 
dividuals and in their relatives, as is true for most personality dis- 
orders. In contrast, obsessive-compulsive personality disorder has 
85% unique heritability, indicating that its genetic determinants are 
largely distinct from those of avoidant and dependent personality 
disorders. This supports the proposal that obsessional traits consti- 
tute a fourth cluster with high persistence or anankastic traits [43]. 
Likewise, unique genetic determinants predominate for dependent 
personality disorder (52%) in Cluster C, and for schizoid personality 
disorder (74%) and paranoid personality disorder (54%) in Cluster 
A. Likewise in Cluster B, antisocial and borderline personality 


Table 120.1 A study of categorical personality disorders (PDs) 
assessed by SCID-II interviews in 92 MZ twin pairs and 129 DZ twin 
pairs in Norway 


estes eee ee 





Any PD 58 + 10 36 + 10 44+ 20 
Any Cluster A Sa Wl Qe Sy) 3225) 
Any Cluster B 60+ 11 3U ap 12 Sy) ae 23) 
Any Cluster C olz 23) 35 ||| SS ae A0 


Source: data from Compr Psychiatry, 41 (6), Torgersen S, Lygren S, Oien PA, et al., A twin 
study of personality disorders, pp. 416-25, Copyright (2000), Elsevier Inc. 


Table 120.2 Heritability of the number of symptoms in personality 
categories in a sample of 1386 Norwegian twin pairs aged 19-35, 

as assessed by SIDP-IV interviews (total heritability and the heritability 
unique to each disorder within the three clusters are shown) 


Clusters of PD symptoms | Total heritability (%) | Unique heritability (%) 


Cluster A (‘odd’) 


Schizotypal 26 0 
Paranoid 21 57 
Schizoid 28 74 
Cluster B (‘dramatic’) 

Antisocial 38 I5 
Histrionic 3] 0 
Narcissistic 24 10 
Borderline 35 0 
Cluster C (‘anxious’) 

Avoidant 35 17 
Dependent 2l 52 
Obsessive-compulsive 27 85 


Source: data from Psychol Med, 36(11), Kendler KS, Czajkowski N, Tambs K, et al., 
Dimensional representations of DSM-IV Cluster A personality disorders in a population- 
based sample of Norwegian twins: a multivariate study, pp. 1583-91, Copyright (2006), 
Cambridge University Press [Cluster A]; Psychol Med, 38(11), Torgersen S, Czajkowski N, 
Jacobson K, et al., Dimensional representations of DSM-IV cluster B personality disorders in a 
population-based sample of Norwegian twins: a multivariate study, pp. 1617-25, Copyright 
(2008), Cambridge University Press [Cluster B]; Psychol Med, 37(5), Reichborn-Kjennerud 

T, Czajkowski N, Neale MC, et al., Genetic and enviromental influences on dimensional 
representations of DSM-IV cluster C personality disorders: a population-based multivariate 
twin study, pp. 645-53, Copyright (2007), Cambridge University Press [Cluster C]. 


disorders are more closely related genetically to each other than to 
histrionic and narcissistic personality disorders. 

The genetic and environmental architecture of all ten DSM-IV 
personality disorders has been evaluated using path analysis to esti- 
mate the number of common factors and unique determinants of all 
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ten DSM-IV personality disorders [44]. Three latent genetic factors 
were uncovered (Table 120.3). The first factor is a common deter- 
minant of all personality disorders. This common personality factor 
corresponds to what is measured as low self-directedness (that is, 
irresponsible, blaming) using the Temperament and Character 
Inventory (TCI) [10, 43]. The second genetic determinant contrib- 
utes most strongly to antisocial and borderline personality disorders 
and most likely corresponds to low TCI Cooperativeness (that is, 
hostile, revengeful). The third genetic determinant contributes 
most strongly to schizoid and avoidant personality disorders and 
most likely corresponds to social detachment as a result of high 
TCI Harm Avoidance (that is, anxious, shy) and/or low TCI Reward 
Dependence (that is, aloof, detached). 

The first two general factors provide support for proposals to use 
low Self-directedness and low Cooperativeness as indices of per- 
sonality in general, as is done in the alternative criteria in DSM-5 
[45] and elsewhere [46]. However, most of the disorders have add- 
itional genetic influences that are not accounted for by such char- 
acter variables that define personality disorders in general. Subtypes 
of personality disorder are largely explained by variation in four 
temperament dimensions [5, 46], as proposed by Peter Tyrer for the 
eleventh edition of the International Classification of Diseases (ICD- 
11) [47]. Therefore, genetic evidence strongly supports classifying 
personality disorders on the basis of multi-dimensional inventories 
that distinguish character traits for identifying personality disorder 
in general or not, and temperament traits that distinguish subtypes 
in terms of their profile of emotional style [10, 43]. 


Same genes influence normal and 





abnormal personality 


Multi-dimensional inventories for assessing components of person- 
ality disorders have been developed to measure both normal and ab- 
normal traits. One of the most robust findings about personality, but 


Table 120.3 Path coefficients (x 100) for contributions to heritability of numbers of PD symptom groupings 
measured by the SIDP from three latent genetic factors (GFs) and unique contributions for each grouping 


of symptoms in 2794 Norwegian twins 


(? high Novelty Seeking/ 


Path from GF3 Path from Unique 
(? high Harm Avoidance/low | or Unexplained 





Clusters of PD Path from GF1 Path from GF2 
symptoms (? low 
Self-directedness) | low Cooperativeness) 
Cluster A 
Schizotypal 11 22 
Paranoid 30 13 
Schizoid 2 7 
Cluster B 
Antisocial 6 63 
Histrionic 49 23 
Narcissistic 25 12 
Borderline 32 43 
Cluster C 
Avoidant 14 7 
Dependent 29 5 
Obsessive 29 6 


Reward Dependence) cig 


22 31 
22 28 
37 34 

9 0 
14 0 

m 33 
16 24 
59 0 
23 37 
13 41 


Source: data from Arch Gen Psychiatry, 65(12), Kendler KS, Aggen SH, Czajkowski N, et al., The structure of genetic and environmental risk factors 
for DSM-IV personality disorders, pp. 1438-46, Copyright (2008), American Medical Association. 
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one that is surprising to many psychiatrists, is that the same dimen- 
sions of personality are observed whether one begins with normal 
personality variation in the general community, with abnormal per- 
sonality traits, or with symptoms of personality disorders in treat- 
ment samples. 

There are three main lines of evidence indicating that the same 
genes and environmental variables account for variation in both 
normal and abnormal personality traits. Firstly, people with per- 
sonality disorders or other forms of psychopathology have extreme 
values on one or more personality dimensions, but the correlational 
structure is the same in samples from the general community and 
from psychiatric treatment facilities [43, 48, 49]. If personality dis- 
orders were discrete diseases unrelated to normal personality vari- 
ation, then the correlational structures of personality would differ 
in clinical and non-clinical samples (but they do not!). Secondly, 
the genetic-environmental architecture of normal personality traits 
is indistinguishable from that of abnormal personality traits. The 
finding that the genes and environmental influences for normal and 
abnormal personality are the same is found whether twins are reared 
together [49] or apart [50]. Thirdly, variability in normal personality 
traits that are descriptive of the general population are diagnostic of 
personality disorders [43, 51]. Therefore, normal personality traits 
share a common functional foundation with abnormal personality 
traits and most, if not all, mental disorders. Essentially, the same 
genetic and bio-psychosocial systems influence individual differ- 
ences in normal personality and in its disorders. 

In order to understand how personality disorders arise from par- 
ticular configurations of traits that characterize normal variation in 
the general population, it is helpful to consider information from 
genetic studies of both normal and abnormal personality. Twin 
studies of personality dimensions in the general population dem- 
onstrated moderate heritability of normal personality traits. For 
example, the TCI was developed based on specific neurobiological 
and psychosocial data [5, 10, 52-54]. It measures four dimensions of 
temperament (that is, behavioural biases in response to basic emo- 
tional stimuli) and three dimensions of character (that is, higher 
cognitive processes influencing the maturity of a person’s goals and 
values). Initial twin studies were carried out using only measures of 
temperament [55, 56]. More recent twin studies have used the TCI 
and showed that each of the seven TCI dimensions has a unique 
genetic variance that is not explained by the other dimensions [14]. 

The TCI is descriptive of normal variation in the general popu- 
lation and is also highly effective in distinguishing people with and 
without personality disorder and in doing differential diagnosis of 
all personality disorder subtypes [43, 46]. The distinction between 
character and temperament in the TCI allows its character dimen- 
sions to be a highly effective model in distinguishing healthy from 
unhealthy personality traits [51]. The three character dimensions of 
the TCI are Self-directedness (that is, resourceful, purposeful, and 
responsible), Cooperativeness (that is, tolerant, helpful, and prin- 
cipled), and Self-transcendence (that is, idealistic, selfless, and con- 
templative) [57]. These dimensions correspond to variability in the 
intrapersonal, interpersonal, and transpersonal aspects of the self. 
The distinction between character (what we make of ourselves inten- 
tionally) and temperament (emotional drives) is well supported by 
studies of unique genetic determinants [14], phylogenesis [58, 59], 
brain imaging [60, 61], self-awareness and mental time-travel [62], 
effects of parental child rearing and early home environment [63], 


and generativity (that is, fertile and/or stable marriage, sperm do- 
nation) [64-67]. 

The heritability of each of the seven TCI dimensions in a sample 
of 2517 Australian twins is summarized in Table 120.4. Total herit- 
ability varied from 27% to 45%, without correcting for the reduced 
reliability of the short form of the TCI used in this study. Both tem- 
perament and character traits were roughly equally heritable, and 
each dimension had genetic determinants unique to it (that is, not 
overlapping with the genetics of other dimensions). 

Likewise, John Livesley and his colleagues developed the 
Differential Assessment of Personality Pathology (DAPP) to 
measure the self-report of abnormal personality traits as a hier- 
archy of dimensions. The questionnaire is composed of 560 items 
measuring 18 factorially derived dimensions, each with at least three 
specific facet scales [11, 38]. The heritability and concordances in 
236 MZ twin pairs and 247 DZ twin pairs have been estimated for all 
18 DAPP traits (Table 120.5). The heritability ranged from 35% for 
rejection sensitivity to 56% for conduct problems and callousness, 
once again showing that personality traits are moderately heritable. 

Four higher-order traits have been identified that account for 
most of the variability in the 18 DAPP dimensions. These are de- 
scribed as: (1) ‘emotional lability or dysregulation’; (2) ‘antagonism 
or dissocial behavio[u]r’;; (3) ‘interpersonal responsiveness or inhib- 
ition’; and (4) ‘compulsivity’ [49]. These higher-order dimensions 
of abnormal personality resemble the unhealthy extreme of the di- 
mensions of normal personality. The first three also correspond to 
the three factors identified in path analysis of personality disorders, 
as summarized previously in Table 120.3. For example, emotional 
dysregulation is similar to high neuroticism in the five-factor model 
(measured by the NEO) [68] or low Self-directedness in the seven- 
factor model of temperament and character (measured by TCI) [10]. 
Antagonism or dissocial behaviour is similar to low agreeability in 
the five-factor model and low Cooperativeness in the seven-factor 
model. Hence, these dimensions define healthy personality at one 
extreme (namely, high TCI Self-directedness and Cooperativeness, 
or low DAPP emotional dysregulation and dissocial behaviour, or 
low NEO Neuroticism and Agreeability) and personality disorder 
at the other extreme (for example, low TCI Self-directedness and 
Cooperativeness, etc.) [43, 49, 68]. 


Table 120.4 Total heritability of each of the seven Temperament and 
Character Inventory (TCI) personality dimensions estimated in 2517 
twins in Australia (unique effects exclude genetic contributions shared 
with other TCI personality dimensions) 


Personality dimension | Total heritability (%) | Unique heritability (%) 


Harm Avoidance 42 29 
Novelty Seeking 39 S2 
Reward Dependence 35 20 
Persistence 30 23 
Self-directedness 34 25 
Cooperativeness 27 16 
Self-transcendence 45 26 


Source: data from Pers Individ Dif., 35(8), Gillespie NA, Cloninger CR, Heath AC, et al., 
The genetic and environmental relationship between Cloninger’s dimensions of 
temperament and character, pp. 1931-46, Copyright (2003), Elsevier Ltd. 


Table 120.5 Heritability and concordances in 236 MZ twin pairs and 
247 DZ twin pairs for 18 basic scales of the Differential Assessment 
of Personality Pathology in Canada 


Scale label MZ correlation |DZcorrelation | Heritability (%) 
(rx 100) (rx 100) 
49 12 45 


Affective lability 


Anxiousness 42 25 44 
Callousness 56 82 56 
Cognitive distortion 48 al 49 
Compulsivity 40 19 ay 
Conduct problems 53 36 56 
Identity problems 51 28 52 
Insecure attachment 45 Pi 48 
Intimacy problems 47 24 48 
Narcissism 5] 2a 59 
Oppositionality 4] 29 46 
Rejection 33 19 55 
Restricted expression 48 26 41 
Self-harm 29 26 41 
Social avoidance 52 27 53 
Stimulus-seeking 38 A 40 
Submissiveness 41 29 45 
Suspiciousness 42 29 45 





Source: data from Acta Psychiatr Scand., 94(6), Jang KL, Livesley WJ, Vernon PA, et al., 
Heritability of personality disorder traits: a twin study, pp. 438-44, Copyright (1996), John 
Wiley and Sons. 


Overall, twin studies showed that personality is moderately her- 
itable whether measured as normal traits, abnormal traits, number 
of symptoms, or categorical diagnoses. Alternative measurement 
methods are highly convergent with one another. 


The complexity of personality genetics 


Twin studies indicated that between 30% and 60% of the phenotypic 
variance in personality and its disorders, as assessed by a variety of 
instruments, is genetic in origin [14, 18, 39, 69, 70]. However, adop- 
tion studies, studies of twins plus other family members, and GWAS 
have shown that common additive genetic variants account for little 
of the heritability of personality traits; most of the heritability of per- 
sonality and its disorders, as assessed by a variety of instruments, 
depends on complex interactions among multiple gene loci (that 
is, epistasis) and possibly multiple alleles at a locus (that is, domin- 
ance), rather than the average effects of individual genes [17, 18, 20, 
39, 70-72]. As a result, little (that is, <7.2% on average) of the ex- 
pected genetic effects on personality can be attributed to the average 
effects of individual single-nucleotide polymorphisms (SNPs) [20]. 
The importance of gene-gene and gene-environment interactions 
can be illustrated by studies of association of TCI Novelty Seeking 
with polymorphisms for dopamine receptor type 4 (DRD4), first 
reported by Jonathan Benjamin. Initially, they observed the simple 
association between DRD4 and Novelty Seeking, and then extended 
their findings to show the association depends on the three-way 
interaction of DRD4 with catechol-O-methyltransferase (COMT) 
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and the serotonin transporter locus promoter’s regulatory region 
(5-HTTLPR). Consequently, the association of Novelty Seeking 
with the average effects of DRD4 is weak and inconsistent. In the ab- 
sence of the short 5-HTTLPR allele (SHTLPR L/L genotype) and in 
the presence of the high-activity COMT Val/Val genotype, Novelty 
Seeking scores are higher in the presence of the DRD4 7-repeat al- 
lele than in its absence [73]. Furthermore, within families, siblings 
who shared identical genotype groups for all three polymorphisms 
(COMT, DRD4, and 5-HTTLPR) had significantly correlated TCI 
Novelty Seeking scores (r = 0.4 in 49 subjects; P <0.01). In contrast, 
sibs with dissimilar genotypes in at least one polymorphism showed 
no significant correlation for Novelty Seeking. Similar interactions 
were observed between these polymorphisms and Novelty Seeking 
in an independent sample of unrelated subjects [73] and have been 
replicated by independent investigators [74]. Likewise, there is also 
substantial evidence for specific gene-environment interactions on 
personality development [21, 75-83]. For example, TCI Novelty 
Seeking scores in adulthood are associated with particular DRD4 
polymorphisms only if the children were reared in a hostile child- 
hood environment with measures during childhood of maternal 
reports of emotional distance and punitive discipline [21]. Such 
complex non-linear interactions present great challenges for the 
identification and replication of relationships in GWAS. As a result 
of such complexity, despite extensive past effort, for the past decade, 
GWAS of personality had found few significant associations between 
specific SNPs and personality traits despite using a variety of well- 
validated personality instruments [19, 26, 84]. This is not surprising 
because the GWAS methods failed to account for the complexity of 
personality. Such failure to account for most of the heritability of 
complex traits has been called the ‘missing’ [25] or ‘hidder’ [85] her- 
itability problem. 

Until recently [24], the missing heritability problem had been 
approached in GWAS by analysing the explained variance in in- 
dividual personality traits in large individual samples or by 
using meta-analysis to combine data sets, such as the Genetics of 
Personality Consortium, with more than 60,000 subjects in 30 or 
more demographically diverse samples assessed with a variety of 
personality inventories [19, 20, 26, 86-89]. Efforts have also been 
made to harmonize measures from moderately correlated inven- 
tories using item-response theory and to consider the impact of 
demographic variables or allele frequencies on variation in indi- 
vidual traits. Nevertheless, most of the heritability of the traits has 
remained unexplained because even in large samples, the methods 
used are effective for detecting only the average effects of individual 
genes, but not the non-additive interactions among multiple genes 
and environmental events that are actually expected [24, 28, 90] 
(Table 120.6). 


Genomic approaches to understanding 


temperament and character 





Fortunately, the integration of well-validated clustering methods 
with GWAS has made it possible to address the complexity of per- 
sonality systematically [24]. Different profiles of temperament 
and of character were associated with different sets of functionally 
interacting genes detected through SNPs. The broad heritability of 
the various traits of personality varies from 30% to 60%, but this 
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Table 120.6 Greater importance of non-additive than additive 
components of heritability for Cloninger’s and Eysenck’s personality 
traits in a study of 9672 MZ and DZ twins and 3241 of their siblings 
(twin plus sibling design) 








Personality trait Additive Non- Additive Non- 
heritability | additive heritability | additive 
(women) heritability | (men) heritability 

(women) (men) 

TCI Harm ons O27 = 0.29" 

Avoidance 

TCI Novelty 0.05 O25 0.05 0.35" 

Seeking 

TCI Reward 0.07 0.31 - 0.11" 

Dependence 

TCi Persistence 0.00 025 0.00 025 

EPQ-R Extraversion 0.23" 0.24 02 0.24 

EPQ-R 0.19 O23 0.09" One 

Neuroticism 


“Denotes components that are significant in best-fitting model 

Source: data from Behav Genet., 35(6), Keller MC, Coventry WL, Heath AC, et al., 
Widespread evidence for non-additive genetic variation in Cloninger's and Eysenck's 
Personality Dimensions using a Twin Plus Sibling Design, pp. 707-21, Copyright (2005), 
Springer Science Business Media, Inc. 


depends on substantial interactions among multiple genes and 
environmental variables. The average effects of individual genes 
explains <7.2% of the variability in personality, but allowing for 
gene-gene and gene-environment interactions accounted well for 
the broad heritability of temperament and character. The whole 
genome was screened for specific genes using about one million 
SNPs in >4000 people representative of the general populations of 
Finland, Germany, and Korea. Personality was found to be highly 
polygenic, with the associated genes distributed across all the human 
chromosomes in the Young Finns Heart Study [91], followed by rep- 
lication in the other two independent samples. Fig. 120.1 shows the 
number of genes per chromosome that are strongly associated with 
clusters of people with particular temperament profiles (that is, tem- 
perament only), clusters of people with particular character profiles 


(that is, character only), and clusters of people with profiles of both 
temperament and character traits (that is, temperament and char- 
acter). Sets of multiple genes were identified, independently of infor- 
mation about the personality traits, and the clusters of people were 
identified without information about the genetic profiles. The gene 
sets therefore represent naturally occurring combinations of genes 
with distinct functional interactions [24]. The sets of genes and per- 
sonality profiles were very strongly associated, but the relationships 
were complex in the sense that the same gene set could be expressed 
in multiple ways (that is, multifinality, pleiotropy) and different sets 
of genes could be expressed in the same way (that is, equifinality, 
heterogenetiy). 

The complex development of personality profiles from interacting 
sets of multiple genes and environmental variables will be described 
in more detail later. Now it is fundamentally important to recog- 
nize that most of the genes that influence personality have regula- 
tory functions, rather than being protein-coding genes (Fig. 120.2). 
In older genetic studies of personality, nearly all attention has been 
focused on protein-coding genes related to neurotransmitter func- 
tions. However, it is the regulatory genes that help us to understand 
change and development of human personality. 

Most of the regulatory genes are long non-coding ribonucleic 
acids (IncRNAs), other shorter non-coding RNAs (ncRNAs), and 
pseudogenes. These ncRNAs (long or short) are functional RNA 
molecules that are transcribed from deoxyribonucleic acid (DNA) 
but are not translated into proteins [92]. They can regulate gene ex- 
pression at both the transcriptional and post-transcriptional levels. 
They are involved in epigenetic processes that can be divided into 
two main groups: the short ncRNAs of <30 nucleotides and the 
IncRNAs of >200 nucleotides. The major classes of short ncRNAs 
are microRNAs (miRNAs), short interfering RNAs (siRNAs), and 
piwi-interacting RNAs (piRNAs). Both long and short RNAs are 
known to play a role in heterochromatin formation, histone modi- 
fication, DNA methylation targeting, and gene silencing. However, 
IncRNAs and the shorter ncRNAs act differently. 

Nearly 30% of the genes associated with character only involve 
IncRNAs, whereas the percentage is only 20% in the other two 
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Fig. 120.1 Number of genes per chromosome for temperament and character identified and replicated in person-centred GWAS of Temperament 


and Character Inventory in the Young Finns Study. 


Temperament and Character Inventory GWAS: 
genes characterized by biotypes 
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Fig. 120.2 Genes distinguished by biotype for temperament only, 
character only, and both in person-centred GWAS of Temperament and 
Character Inventory in the Young Finns Study. 








groups, as shown in Fig. 120.2. In contrast, protein-coding genes are 
more often associated with temperament only than with the groups 
involving character. The more prominent role of IncRNAs in char- 
acter is consistent with the hypothesis that epigenetic change is more 
important for the development of character than for temperament 
[63, 91]. 

IncRNAs form extensive networks of ribonucleoprotein (RNP) 
complexes with numerous chromatin regulators. These chromatin 
regulators target these enzymatic activities to the appropriate loca- 
tions in the genome. Apparently, IncRNAs can function as modular 
scaffolds to specify higher-order organization in RNP complexes 
and in chromatin states. IncRNAs have several functions in the regu- 
lation of gene transcription, including the regulation of specific gene 
transcription (either activation or repression), post-transcriptional 
regulation, differential spicing, and regulation of epigenetic modi- 
fications, including histone and DNA methylation, histone acetyl- 
ation and sumoylation, and remodelling of chromatin domains. 
These findings suggest the hypothesis that character development 
involves epigenetic processes involving regulation of gene expres- 
sion and chromatin modification, which, in turn, influence neural 
development and neuroplasticity. IncRNA dysfunction is also often 
observed in common chronic diseases related to stress and lifestyle 
regulation, including cancer, cardiovascular diseases, neurological 
diseases, and immune-mediated diseases. 

In contrast, the shorter ncRNAs are mostly involved in distinct 
temporal expression patterns during embryogenesis, acting in spe- 
cific tissues to regulate expression at a post-transcriptional level. 
miRNAs usually bind to a specific target messenger RNA with a 
complementary sequence to induce cleavage and degradation or to 
block translation, and they have crucial roles in epigenesis. Silent- 
interfering RNAs function in ways similar to miRNAs to mediate 
post-transcriptional gene silencing as a result of messenger RNA 
degradation and to induce heterochromatin formation. piRNAs are 
so named due to their interaction with the piwi family of proteins. 
They are involved in chromatin regulation and suppression of trans- 
poson activity during embryogenesis. The percentage of ncRNA 
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genes associated with personality is only 5% and do not distinguish 
the temperament clusters from the character clusters. 

As shown in Fig. 120.2, the distribution of the types of genes asso- 
ciated with character differs from that of other groups. These findings 
suggest that the ability of the human character to self-regulate emotion 
may be related to the human capacity to self-regulate gene expression, 
which can be cultivated by enhanced self-awareness [93]. The prom- 
inence of regulatory functions associated with personality traits that 
influence neural development and neuroplasticity provides a mech- 
anism for understanding the way learning and memory processes in- 
fluence personality development as a complex adaptive process. 


REFERENCES 


1. Cloninger RC, Reich T, Guze SB. The multifactorial model of 
disease transmission: II. Sex differences in the familial trans- 
mission of sociopathy (antisocial personality). Br J Psychiatry. 
197532722. 

2. Crowe RR. The adopted offspring of women criminal offenders: a 
study of their arrest records. Arch Gen Psychiatry. 1972;27:600-3. 

3. Torgersen S, Lygren S, Oien PA, et al. A twin study of personality 
disorders. Compr Psychiatry. 2000;41:416-25. 

4. Cloninger CR. Implications of comorbidity for the classification 
of mental disorders: the need for a psychobiology of coherence. 

In: Maj M, Gaebel W, Lopez-Ibor JJ, Sartorius N, editors. Psychiatric 
Diagnosis and Classification. New York, NY: Wiley; 2002. pp. 79-105. 

5. Cloninger CR. A systematic method for clinical description 
and classification of personality variants. A proposal. Arch Gen 
Psychiatry. 1987;44:573-88. 

6. Kendler KS, Czajkowski N, Tambs K, et al. Dimensional rep- 
resentations of DSM-IV Cluster A personality disorders in a 
population-based sample of Norwegian twins: a multivariate 
study. Psychol Med. 2006;36:1583-91. 

7. Reichborn-Kjennerud T, Czajkowski N, Neale MC, et al. Genetic 
and enviromental influences on dimensional representations 
of DSM-IV cluster C personality disorders: a population-based 
multivariate twin study. Psychol Med. 2007;37:645-53. 

8. Cloninger CR. Validation of psychiatric classification: the psy- 
chobiological model of personality as an exemplar. In: Zachar 
P, Stoyanov D, Aragona M, Jablensky A, editors. Alternative 
Perspectives on Psychiatric Validation: DSM, ICD, RDoC, and 
Beyond. Oxford: Oxford University Press; 2015. pp. 201-23. 

9. Strack S, editor. Differentiating Normal and Abnormal Personality, 
second edition. New York, NY: Springer Publishing; 2006. 

10. Cloninger CR, Svrakic DM, Przybeck TR. A psychobiological 
model of temperament and character. Arch Gen Psychiatry. 
1993;50:975-90. 

11. Livesley WJ, Jang KL, Vernon PA. Phenotypic and genetic 
structure of traits delineating personality disorder. Arch Gen 
Psychiatry. 1998;55:941-8. 

12. Cloninger CR. A new conceptual paradigm from genetics and 
psychobiology for the science of mental health. Aust N Z J 
Psychiatry. 1999;33:174-86. 

13. Cloninger CR, editor. Personality and Psychopathology, first edi- 
tion. Washington, DC: American Psychiatric Press; 1999. 

14. Gillespie NA, Cloninger CR, Heath AC, Martin NG. The gen- 
etic and environmental relationship between Cloninger’s di- 
mensions of temperament and character. Pers Individ Diff. 
2003;35:1931-46. 

15. Cloninger CR, Rice J, Reich T. Multifactorial inheritance with 
cultural transmission and assortative mating. II. a general model 





1236 





16. 


We 


18. 


19. 


20. 


Pile 


22! 


23; 


24. 


25: 


26. 


27 


28. 


Do: 


30. 


Biles 


328 


335 


SECTION 19 Personality disorders 


of combined polygenic and cultural inheritance. Am J Hum 
Genet. 1979;31:176-98. 

Eaves LJ, Eysenck HJ, Martin NG. Genes, Culture and 
Personality: An Empirical Approach. London: Academic 

Press; 1989. 

Plomin R, Corley R, Caspi A, Fulker DW, DeFries J. Adoption 
results for self-reported personality: evidence for nonadditive 
genetic effects? J Pers Soc Psychol. 1998;75:211-18. 

Keller MC, Coventry WL, Heath AC, Martin NG. Widespread 
evidence for non-additive genetic variation in Cloninger’s and 
Eysenck’s Personality Dimensions using a Twin Plus Sibling 
Design. Behav Genet. 2005;35:707-21. 

Verweij KJH, Zietsch BP, Medland SE, et al. A genome-wide 
association study of Cloninger’s temperament scales: implica- 
tions for the evolutionary genetics of personality. Biol Psychol. 
2010;85:306-17. 

Verweij KJ, Yang J, Lahti J, et al. Maintenance of genetic variation 
in human personality: testing evolutionary models by estimating 
heritability due to common causal variants and investigating the 
effect of distant inbreeding. Evolution. 2012;66:3238-51. 
Keltikangas-Jarvinen L, Raikkonen K, Ekelund J, Peltonen L. Nature 
and nurture in novelty seeking. Mol Psychiatry. 2004;9:308-11. 
Keltikangas-Jarvinen L, Jokela M. Nature and nurture in person- 
ality. Focus. 2010;8:180-6. 

Lahti J, Raikkonen K, Ekelund J, Peltonen L, Raitakari OT, 
Keltikangas-Jarvinen L. Novelty seeking: interaction between 
parental alcohol use and dopamine D4 receptor gene exon III 
polymorphism over 17 years. Psychiatr Genet. 2005;15:133-9. 
Arnedo J, Svrakic DM, Del Val C, et al. Uncovering the hidden 
risk architecture of the schizophrenias: confirmation in three 
independent genome-wide association studies. Am J Psychiatry. 
2015;172:139-53. 

Eichler EE, Flint J, Gibson G, et al. Missing heritability and strat- 
egies for finding the underlying causes of complex disease. Nat 
Rev Genet. 2010;11:446-50. 

Service SK, Verweij KJ, Lahti J, et al. A genome-wide meta- 
analysis of association studies of Cloninger’s Temperament 
Scales. Transl Psychiatry. 2012;2:e116. 

Arnedo J, del Val C, de Erausquin GA, et al. PGMRA: a web server 
for (phenotype x genotype) many-to-many relation analysis in 
GWAS. Nucleic Acids Res. 2013;41(Web Server issue):W 142-9. 
Arnedo J, Mamah D, Baranger DA, et al. Decomposition of 
brain diffusion imaging data uncovers latent schizophrenias 
with distinct patterns of white matter anisotropy. Neuroimage. 
2015;120:43-54. 

Schulsinger F. Psychopathy, heredity and environment. Int J Ment 
Health. 1972;1:190-206. 

Crowe RR. An adoption study of antisocial personality. Arch Gen 
Psychiatry. 1974;31:785-91. 

Cloninger CR, Sigvardsson S, Bohman M, von Knorring AL. 
Predisposition to petty criminality in Swedish adoptees. II. 
Cross-fostering analysis of gene-environment interaction. Arch 
Gen Psychiatry. 1982;39:1242-7. 

Mednick SA, Gabrielli WFJ, Hutchings B. Genetic influences in 
criminal convictions: evidence from an adoption cohort. Science. 
1984;22:891-4. 

Cloninger CR, Gottesman II. Genetic and environmental factors 
in antisocial behavior disorders. In: Mednick SA, Moffitt TE, 
Stack S, editors. The Causes of Crime: New Biological Approaches. 
Cambridge: Cambridge University Press; 1987. pp. 92-109. 


34. 


315}, 


36. 


We 


38. 


Sie, 


40. 


41. 


42. 


43. 


44. 


45. 


46. 


47. 


48. 


49. 


50. 


ile 


5p, 


59 


Rhee SH, Waldman ID. Genetic and environmental influences on 
antisocial behavior: a meta-analysis of twin and adoption studies. 
Psychol Bull. 2002;128:490-529. 

Cloninger CR, Christiansen KO, Reich T, Gottesman, II. 
Implications of sex differences in the prevalences of antisocial 
personality, alcoholism, and criminality for familial transmission. 
Arch Gen Psychiatry. 1978;35:941-51. 

Cloninger CR, Reich T, Guze SB. The multifactorial model of dis- 
ease transmission: II. Sex differences in the transmission of soci- 
opathy (antisocial personality). Br J Psychiatry. 1975;127:11-22. 
Loehlin JC. Genes and Environment in Personality Development. 
Newbury Park: Sage Publications; 1992. 

Jang KL, Livesley WJ, Vernon PA, Jackson DN. Heritability of 
personality disorder traits: a twin study. Acta Psychiatr Scand. 
1996;94:438-44. 

Tellegen A, Lykken TD, Bouchard TJ, Wilcox KJ, Segal NL, Rich 
S. Personality similarity in twins reared apart and together. J Pers 
Soc Psychol. 1988;54:1031-9. 

Pedersen NL, Plomin R, McClearn GE, Friberg L. Neuroticism, 
extraversion, and related traits in adult twins reared apart and 
reared together. J Pers Soc Psychol. 1099;55:950-7. 

Bergeman CS, Chipur HM, Plomin R, et al. Genetic and environ- 
mental effects on openness to experience, agreeableness, and con- 
scientiousness: an adoption/twin study. J Pers. 1993;61:159-79. 
Torgersen S, Czajkowski N, Jacobson K, et al. Dimensional rep- 
resentations of DSM-IV cluster B personality disorders in a 
population-based sample of Norwegian twins: a multivariate 
study. Psychol Med. 2008;38:1617-25. 

Svrakic DM, Whitehead C, Przybeck TR, Cloninger CR. 
Differential diagnosis of personality disorders by the seven-factor 
model of temperament and character. Arch Gen Psychiatry. 1993; 
50:991-9. 

Kendler KS, Aggen SH, Czajkowski N, et al. The structure of gen- 
etic and environmental risk factors for DSM-IV personality dis- 
orders. Arch Gen Psychiatry. 2008;65:1438-46. 

Cloninger CR. Personality and temperament: new and alternative 
perspectives. Focus. 2010;8:161-3. 

Cloninger CR. A practical way to diagnose personality dis- 
order: a proposal. J Pers Disord. 2000;14:99-108. 

Tyrer P, Crawford M, Sanatinia R, et al. Preliminary studies of the 
ICD-11 classification of personality disorder in practice. Personal 
Ment Health. 2014;8:254-63. 

Krueger RE. The structure of common mental disorders. Arch 
Gen Psychiatry. 1999;56:921-6. 

Livesley WJ, Jang KL, Vernon PA. Phenotypic and genetic 
structure of traits delineating personality disorder. Arch Gen 
Psychiatry. 1998;55:941-8. 

Markon KE, Krueger RE, Bouchard TJ, Jr., Gottesman, II. Normal 
and abnormal personality traits: evidence for genetic and envir- 
onmental relationships in the Minnesota Study of Twins Reared 
Apart. J Pers. 2002;70:661-93. 

Grucza RA, Goldberg LR. The comparative validity of 11 modern 
personality inventories: predictions of behavioral acts, informant 
reports, and clinical indicators. J Pers Assess. 2007;89:167-87. 
Cloninger CR. A unified biosocial theory of personality and 

its role in the development of anxiety states. Psychiatr Dev. 
1986;4:167-226. 

Bohman M, Cloninger R, Sigvardsson S, von Knorring AL. The 
genetics of alcoholisms and related disorders. J Psychiatr Res. 
1987;21:447-52. 


54. 


55; 


56. 


Bs 


58. 


59. 


60. 


6l. 


62. 


63. 


64. 


65. 


66. 


67. 


68. 


69. 


70. 


TAL. 


72 


Cloninger CR. The psychobiological theory of temperament and 
character: comment on Farmer and Goldberg (2008). Psychol 
Assess. 2008;20:292-9; discussion 300-4. 

Heath AC, Cloninger CR, Martin NG. Testing a model for 

the genetic structure of personality: a comparison of the per- 
sonality systems of Cloninger and Eysenck. J Pers Soc Psychol. 
1994;66:762-75. 

Stallings MC, Hewitt JK, Cloninger CR, Heath AC, Eaves LJ. 
Genetic and environmental structure of the Tridimensional 
Personality Questionnaire: three or four temperament dimen- 
sions? J Pers Soc Psychol. 1996;70:127-40. 

Cloninger CR. What makes people healthy, happy, and fulfilled 
in the face of current world challenges? Mens Sana Monogr. 
2013;11:16-24. 

Cloninger CR, Kedia S. The phylogenesis of human person- 
ality: identifying the precursors of cooperation, altruism, 

and well-being. In: Sussman RW, Cloninger CR, editors. 

The Origins of Cooperation and Altruism. Developments in 
Primatology: Progress and Prospects, first edition. New York, 

NY: Springer; 2011. pp. 63-110. 

Cloninger CR. The evolution of human brain functions: the func- 
tional structure of human consciousness. Austr N Z J Psychiatry. 
2009;43:994-1006. 

Van Schuerbeek P, Baeken C, De Raedt R, De Mey J, Luypaert 

R. Individual differences in local gray and white matter vol- 
umes reflect differences in temperament and character: a voxel- 
based morphometry study in healthy young females. Brain Res. 
2011;1371:32-42. 

Gardini S, Cloninger CR, Venneri A. Individual differences in 
personality traits reflect structural variance in specific brain re- 
gions. Brain Res Bull. 2009;79:265-70. 

Quoidbach J, Hansenne M, Mottet C. Personality and mental 
time travel: a differential approach to autonoetic consciousness. 
Conscious Cogn. 2008;17:1082-92. 

Josefsson K, Jokela M, Hintsanen M, et al. Parental care-giving 
and home environment predicting offspring temperament and 
character traits after 18 years. Psychiatry Res. 2013;209:643-51. 
Brandstrom S, Przybeck TR, Sigvardsson S. Reliability of in- 
formant ratings and spouse similarity based on the Temperament 
and Character Inventory. Psychol Rep. 2011;109:23 1-42. 

Fassino S, Garzaro L, Peris C, Amianto F, Piero A, Abbate Daga G. 
Temperament and character in couples with fertility disorders: a 
double-blind, controlled study. Fertil Steril. 2002;77:1233-40. 
Sydsjo G, Lampic C, Brandstrom S. Who becomes a sperm 
donor: personality characteristics in a national sample of identifi- 
able donors. BJOG. 2012;119:33-9. 

Rizzo A. Temperament and generativity during the life span. 
Mediterranean Journal of Clinical Psychology. 2013;1:1-31. 

Costa PTJ, McCrae RR. Personality disorders and the five-factor 
model of personality. J Pers Disord. 1990;4:362-71. 

Yamagata S, Suzuki A, Ando J, et al. Is the genetic structure 

of human personality universal? A cross-cultural twin study 
from North America, Europe, and Asia. J Pers Soc Psychol. 
2006;90:987-98. 

Finkel D, McGue M. Sex differences and nonadditivity in herit- 
ability of the Multidimensional Personality Questionnaire Scales. 
J Pers Soc Psychol. 1997;72:929-38. 

Eaves LJ, Heath AC, Neale MC, Hewitt JK, Martin NG. Sex dif- 
ferences and non-additivity in the effects of genes on personality. 
Twin Res. 1998;1:131-7. 

Eaves LJ, Heath AC, Martin NG, et al. Comparing the biological 
and cultural inheritance of personality and social attitudes in 


73. 


74. 


U5 


76. 


Vi 


78. 


7); 


80. 


81. 


82. 


83. 


84. 


85. 


86. 


87. 


88. 


89. 


CHAPTER 120 Genetics of personality disorders 


the Virginia 30,000 study of twins and their relatives. Twin Res. 
1999;2:62-80. 

Benjamin J, Osher Y, Kotler M, et al. Association between tri- 
dimensional personality questionnaire (TPQ) traits and three 
functional polymorphisms: dopamine receptor D4 (DRD4), sero- 
tonin transporter promoter region (5-HTTLPR) and catechol 
O-methyltransferase (COMT). Mol Psychiatry. 2000;5:96-100. 
Strobel A, Lesch KP, Jatzke S, Paetzold F, Brocke B. Further 
evidence for a modulation of Novelty Seeking by DRD4 exon 

III, 5-HTTLPR, and COMT val/met variants. Mol Psychiatry. 
2003;8:371-2. 

Caspi A, Moffitt TE. Gene-environment interactions in psych- 
iatry: joining forces with neuroscience. Nat Rev Neurosci. 
2006;7:583-90. 

Caspi A, Sugden K, Moffitt TE, et al. Influence of life stress on 
depression: moderation by a polymorphism in the 5-HTT gene. 
Science. 2003;301:386-9. 

Kim-Cohen J, Caspi A, Taylor A, et al. MAOA, maltreatment, 
and gene-environment interaction predicting children’s mental 
health: new evidence and a meta-analysis. Mol Psychiatry. 
2006;11:903-13. 

Hintsanen M, Pulkki-Raback L, Juonala M, Viikari JS, Raitakari 
OT, Keltikangas-Jarvinen L. Cloninger’s temperament traits and 
preclinical atherosclerosis: the Cardiovascular Risk in Young 
Finns Study. J Psychosom Res. 2009;67:77-84. 

Jokela M, Raikkonen K, Lehtimaki T, Rontu R, Keltikangas- 
Jarvinen L. Tryptophan hydroxylase 1 gene (TPH1) moderates 
the influence of social support on depressive symptoms in adults. 
J Affect Disord. 2007;100:191-7. 

Keltikangas-Jarvinen L, Pulkki-Raback L, Elovainio M, Raitakari 
OT, Viikari J, Lehtimaki T. DRD2 C32806T modifies the effect of 
child-rearing environment on adulthood novelty seeking. Am J 
Med Genet B Neuropsychiatr Genet. 2009;150B:389-94. 
Keltikangas-Jarvinen L, Puttonen S, Kivimaki M, Elovainio M, 
Rontu R, Lehtimaki T. Tryptophan hydroxylase 1 gene haplotypes 
modify the effect of a hostile childhood environment on adult- 
hood harm avoidance. Genes Brain Behav. 2007;6:305-13. 
Keltikangas-Jarvinen L, Puttonen S, Kivimaki M, Rontu R, 
Lehtimaki T. Cloninger’s temperament dimensions and epi- 
dermal growth factor A61G polymorphism in Finnish adults. 
Genes Brain Behav. 2006;5:11-18. 

Keltikangas-Jarvinen L, Salo J. Dopamine and serotonin systems 
modify environmental effects on human behavior: a review. 
Scand J Psychol. 2009;50:574-82. 

de Moor MH, Costa PT, Terracciano A, et al. Meta-analysis of 
genome-wide association studies for personality. Mol Psychiatry. 
2012317:337-49. 

Williams SM, Haines JL. Correcting away the hidden heritability. 
Ann Hum Genet. 2011;75:348-50. 

Verweij KJ, Burri AV, Zietsch BP. Evidence for genetic variation 
in human mate preferences for sexually dimorphic physical traits. 
PLoS One. 2012;7:e49294. 

van den Berg SM, de Moor MH, McGue M, et al. Harmonization 
of neuroticism and extraversion phenotypes across inventories 
and cohorts in the Genetics of Personality Consortium: an appli- 
cation of Item Response Theory. Behav Genet. 2014;44:295-313. 
van den Berg SM, de Moor MH, Verweij KJ, et al. Meta-analysis 
of genome-wide association studies for extraversion: findings 
from the Genetics of Personality Consortium. Behav Genet. 
2016;46:170-82. 

Genetics of Personality Consortium, de Moor MH, van den Berg 
SM, Verweij KJ, et al. Meta-analysis of genome-wide association 





iyei3) SECTION 19 Personality disorders 





90. 


Ol. 


studies for neuroticism, and the polygenic association with major 
depressive disorder. JAMA Psychiatry. 2015;72:642-50. 

Manolio TA. Genomewide association studies and assessment of 
the risk of disease. N Engl J Med. 2010;363:166-76. 

Josefsson K, Jokela M, Cloninger CR, et al. Maturity and change 
in personality: developmental trends of temperament and char- 
acter in adulthood. Dev Psychopathol. 2013;25:713-27. 


92% 


93° 


Qureshi IA, Mehler ME. Emerging roles of non-coding RNAs 
in brain evolution, development, plasticity and disease. Nat Rev 
Neurosci. 20123;13:528-41. 

Campanella F, Crescentini C, Urgesi C, Fabbro F. Mindfulness- 
oriented meditation improves self-related character scales in 
healthy individuals. Compr Psychiatry. 2014;55: 

1269-78. 











| 


In this chapter, we will review the neuroimaging literature in person- 
ality disorders. Given its clinical importance and the rapidly grown 
literature on borderline personality disorder (BPD), we will focus 
on this condition, with some additional findings from studies on 
avoidant (AvPD), narcissistic (NPD), and obsessive-compulsive per- 
sonality disorder (OCPD), where appropriate. Due to consistency and 
readability, we abstained from reviewing the large body of imaging lit- 
erature on suspicious personality disorder types (paranoid, schizoid, 
schizotypal, antisocial), which has been covered elsewhere [1, 2]. 








Emotion processi 


Emotion dysregulation is a hallmark of BPD psychopathology [3]. 
A growing number of studies have used functional magnetic res- 
onance imaging (fMRI) during emotional challenge (for example, 
presentation of emotionally arousing pictures or facial expressions) 
to investigate the neural correlates of emotional responding in indi- 


viduals with BPD, compared to healthy individuals; the majority of 
fMRI studies have observed 
compared to 


healthy controls (for a meta-analysis, see [4]). In addition to an ex- 
aggerated amygdala response to emotional pictures, several studies 
in BPD patients found increased amygdala activation in response 
to pictures which have been rated ED in the general popu- 
lation (for example, [5, 6]). Both BPD and AvPD patients showed 








not in BPD and AvPD [7]. This is in line with other studies TE 


instability [7]. Findings of increased and prolonged amygdala acti- 
vation are in line with the well-documented clinical feature of high 
sensitivity to emotional stimuli with intense and long-lasting reac- 


tions in patients with BPD [10-12]. Most interestingly, medication- 
fee samples O O heros no 


such group differences were found in patients currently taking psy- 
chotropic medication [4]. 












In addition to amygdala hyperreactivity, increased activation in 
as been observed in emotional challenge studies in BPD 
patients, compared to healthy controls, emphasizing the role of this 
brain area in disturbed emotion processing in BPD [5, 6, 13, 14]. In 
numerous studies in healthy individuals, the insula has been associ- 
ated with the encoding of unpleasant feelings, interoceptive aware- 
ness (awareness of one’s own bodily experiences), perceived social 
exclusion, and pain perception [15, 16]. 

Several neuroimaging studies have reveale 
[17-19], especially blunted responses of the bilateral dorsolateral 
prefrontal cortex (dIPFC) [4], in response to emotionally arousing 
stimuli. Schmahl and colleagues [20] presented individualized audi- 
tory scripts of abandonment situations to their participants and 
found increased activation in the dIPFC and decreased activation in 
the medial prefrontal cortex (mPFC). Furthermore, when patients 
were confronted with their self-reported traumatic memories, they 
did not exhibit increased activity in the ACC or orbitofrontal cortex 
(OFC) and dIPFC, as did healthy controls. Instead, they showed un- 
altered or diminished activity [21]. The OFC and dIPFC are thought 
to be implicated in a top-down appraisal system that modulates ac- 
tivation in limbic and subcortical brain areas [22]. Hence, it is con- 
ceivable that the pattern of hypoactivation observed in frontal brain 
regions during emotional challenge in BPD might represent a neural 
correlate of altered processing of negative autobiographical mem- 
ories and negative emotions. 

Turning to studies of brain structures (for example, brain volume) 
in patients with BPD, compared to healthy controls, several studies 
have shown a reduced volume in limbic and paralimbic brain re- 
gions. A meta-analysis [23] even proposed that 


markers’ or endophenotypes of BPD. In a recent meta-analysis on 
studies using voxel-based morphometry (VBM), most prominent 
volumetric changes in BPD were observed in the amygdala and 
dIPFC [4]. It is nonetheless controversial whether the observed al- 
terations rather stem from experience of abuse in childhood [24] 
or might even be a substrate of comorbid post-traumatic stress dis- 
order (PTSD), both of which are highly prevalent conditions in BPD 
[25, 26]. Kuhlmann and colleagues [27] further detected a reduced 
volume of the hypothalamus, which extends previous findings of al- 
terations in the hypothalamic-pituitary—adrenal axis in BPD [28, 29]. 
The first multimodal imaging study in BPD published by Salvador 
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and colleagues [30] complements earlier work [31-33] showing con- 
verging abnormalities in subgenual and perigenual ACC. 

While no differences were found between BPD and AvPD in brain 
structure, both patient groups showed reduced mPFC and ACC 
volumes, compared to healthy controls [34]. Further, both patient 
groups showed a positive correlation between amygdala volume and 
anxiety [34]. 

The research depicted here has led to the current conceptualiza- 
tion o 






structures. Imbalance of control such that the lower brain regions 
generating emotion are untethered from prefrontal control areas is 
also observed in anxiety disorders [35] and is probably not specific 
to BPD. The white and grey matter abnormality, accompanied with 
altered resting-state brain connectivity [36], emphasizes alterations 
in fronto-limbic networks as a candidate neural substrate of symp- 
toms concerning several functions, such as emotion processing, 
emotion regulation, and decision-making, that are impaired in BPD 
and other mental disorders. 


Cognitive emotion regulation 


Three studies investigated the cognitive regulation of emotions 
in BPD by presenting negative pictures with the instruction to 
regulate emotions in the MRI scanner [37-39]. The results con- 
verged to a reduced response of the dorsal mPFC and dIPFC in 
BPD patients when regulating emotions. More specifically, the 
study by Koenigsberg and colleagues [37] found diminished ac- 
tivity in the dIPFC and ventrolateral prefrontal cortices when 
patients with BPD tried to cognitively distance themselves from 
negative stimuli. Similarly, cognitive reappraisal yielded decreased 
recruitment of the OFC and increased activation of the insula in 
patients with BPD, relative to healthy participants [39]. In an at- 
tempt to clarify the role of trauma history on downregulation of 
emotional responses in BPD patients, Lang et al. [38] compared 
trauma-exposed BPD patients to trauma-exposed healthy subjects 
without BPD and non-traumatized healthy subjects. In this study, 
BPD patients, as well as healthy individuals with a trauma history, 
recruited brain regions associated with up- and downregulation 
of negative emotions (for example, ACC) to a lesser extent, which 
might reflect compensatory changes associated with trauma 
exposure. 

Initial fMRI findings suggested changes in neural responding 
with psychotherapy. Goodman and colleagues [40] demonstrated a 
reduction of amygdala activity in a passive viewing paradigm after 
dialectical behaviour therapy (DBT) treatment. Testing patients 
receiving DBT vs patients receiving treatment as usual (TAU) and 
healthy controls, Winter et al. [41] found the inferior parietal lobe/ 
supramarginal gyrus (IPL/SMG) to alter responding during emo- 
tion regulation. Decreased IPL/SMG responding predicted an im- 
provement of symptom severity. Furthermore, DBT responders 
reduced perigenual ACC activation when viewing negative vs neu- 
tral pictures. Schmitt et al. [42] found decreased insula and ACC 
activity during reappraisal in patients with BPD. ACC connectivity 
to the medial and superior frontal gyri, superior temporal gyrus, 
and inferior parietal cortices increased after DBT. Treatment re- 
sponders exhibited reduced activation in the amygdala, ACC, 
OFC, and dIPFC, together with increased connectivity within a 


limbic-prefrontal network, during reappraisal of negative stimuli 
after psychotherapy. 

One fMRI study testing neural responses of cognitive emotion 
regulation in AvPD revealed amygdala hyperreactivity in anticipa- 
tion of regulation and in response to negative, compared to neutral, 
pictures [43]. Furthermore, amygdala reactivity was correlated with 
anxiety ratings [43] and may be interpreted as a neural correlate of 
exaggerated anticipatory anxiety of social situations. 


Self-injury and altered pain processing 


A very common dysfunctional behaviour, closely linked to emotion 
dysregulation in BPD, is non-suicidal self-injury (NSSI) [44]. Pain 
appears to have an important emotion-regulating effect in BPD [45]. 
A large body of experimental research investigating pain perception 
in individuals with BPD points to reduced sensitivity in BPD patients 
(for example, [46-49]). Schmahl and colleagues [48] showed that 
self-reported pain ratings were lower in BPD patients and that pain 
thresholds were altered. Interestingly, no differences were observed 
in discrimination task performance of laser-evoked brain potentials, 
which showed that the analgesic state in BPD is not explained by an 
impairment of the sensory-discriminative component of pain [48]. In 
a first neuroimaging study by Schmahl et al. [47], patients with BPD 
and NSSI underwent fMRI while heat stimuli were applied to their 
hands. BPD patients showed increased dIPFC activation, along with 
decreased activation of the posterior parietal cortex. Moreover, painful 
heat stimulation evoked neural deactivation in the amygdala and 
perigenual ACC in participants with BPD [47]. The observed inter- 
action between increased pain-induced response in the dIPFC, coupled 
with deactivations in the ACC and amygdala, could be interpreted as 
an anti-nociceptive mechanism in BPD [47]. While sensory discrim- 
ination processes remain intact, this mechanism may modulate pain 
circuits primarily by an increased top-down regulation of emotional 
components of pain or an altered affective appraisal of pain [47]. 

Examining the neural mechanisms underlying the role of self- 
inflicted pain as a means of affect regulation in BPD more directly, 
Niedtfeld and colleagues [6] conducted an fMRI study using pic- 
tures to elicit negative affect and thermal stimuli to induce heat pain. 
Although the authors found an attenuation of activation in limbic 
areas (amygdala, insula) in response to sensory stimulation, it was 
specific neither to patients with BPD nor to painful stimulation. In 
order to identify brain mechanisms underlying the limbic deacti- 
vation observed [6], the authors measured functional connectivity 
between those regions that had been identified to be involved in 
emotional processing, namely the amygdala, insula, and ACC [50]. 
Results of their analyses indicated that painful sensory stimuli, as 
opposed to warmth perception, resulted in enhanced negative coup- 
ling between (para-) limbic and prefrontal structures in BPD, thus 
indicating inhibition of limbic arousal. Increased coupling between 
the amygdala and medial frontal gyrus (BA8 and BA9) may point 
to attentional distraction processes [51]. Additionally, in the patient 
group, the dIPFC showed enhanced coupling to the posterior insula, a 
brain region known to play a critical role in pain processing [52] and 
affective appraisal of pain perception [53]. 

To test the direct effect of tissue damage, two recent studies induced 
stress with the Montreal Imaging Stress Task, followed by either an in- 
cision into the forearm (tissue damage) or a sham condition [54, 55]. 
The incision resulted in a decrease in tension and heart rate in the BPD 
group, in contrast to a short-term increase in aversive tension in the 


healthy control group [55]. A resting-state fMRI scan after the incision 
pointed to a decrease in amygdala activity, together with normalized 
functional connectivity of the amygdala and superior frontal gyrus, in 
the BPD group [54]. These results may point to a stress-reducing ef- 
fect of tissue damage that affects subjective experience, psychophysio- 
logical reactions, and brain function in BPD. 

In sum, the overall reported results on pain processing in BPD 
suggest that NSSI may be interpreted as an attempt to compensate 
for a deficient emotion regulation mechanism. More specifically, the 
reported findings implicate that the soothing effect of pain in BPD 
seems to be mediated by different emotion regulation processes (at- 
tentional shift and altered appraisal of pain). 


Social interaction 


Processing of social stimuli 


Izurieta Hidalgo and colleagues [56] found a decreased late 
centroparietal P300 response in BPD patients that may reflect di- 
minished categorical processing of happy faces. Several fMRI studies 
found increased amygdala responses to emotional facial expressions 
[57-60], and decreased subgenual and dorsal ACC responses were re- 
ported [60]. Further, increased functional connectivity of the amygdala 
with the dorsal and rostral ACC was observed in overt face processing 
[61]. Covert face processing increased the amygdala-thalamus con- 
nectivity in BPD patients [61], and an early enhancement in electro- 
encephalogram (EEG) responses (P100, N170) was observed [56]. 
Although BPD patients tended to make more and faster initial eye fix- 
ations to the eyes of angry faces [58] and to misclassify predominantly 
happy faces as angry faces [56], amygdala hyperactivation was not en- 
hanced comparing angry faces to other emotions [57, 60]. 


Empathy 


Both increased interpersonal resonance and pervasive interper- 
sonal disturbances are ubiquitous in BPD, possibly rooted in re- 
duced mirroring of emotional reactions in parent-child dyads and 
childhood traumatic experiences [62]. Frick et al. [63] found that 
BPD patients were better and faster than healthy controls in infering 
emotional states from eye gaze pictures. This was accompanied by in- 
creased amygdala, temporal cortex, and OFC responses to emotional 
eye gazes and ventromedial PFC (vmPFC), ACC, and parietal activa- 
tions specifically to eye gazes in faces with negative emotional expres- 
sion in BPD. Increased amygdala activation was also observed in BPD 
patients attributing intentions to emotional face expressions [59]. 
Similar to posterior insula hyperactivity in BPD patients who empa- 
thized with people on violence and threat photographs [64], amygdala 
hyperactivity may reflect increased emotional resonance and arousal. 
In contrast, reduced anterior insula activations [63] may reflect diver- 
gent social expectations in BPD [65]. Moreover, empathic mindsets 
increased superior temporal cortex activity in healthy subjects, but 
not in BPD patients [59, 63, 64]. Low superior temporal cortex activity 
was suggested to reflect reduced meta-cognitive processes [66], which 
may entail inadequate emotion regulation in social situations. Paired 
with enhanced emotional sensitivity and affective reverberation, this 
may promote interpersonal disturbances [62]. 

Reduced grey matter volume in patients with NPD was found in 
the anterior insula, rostral ACC, mPFC, and medial cingulate [67]. 
Another study found reduced grey matter volume in the mPFC/ 
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ACC and fusiform, temporal, and occipital cortices [68]. Higher 
anterior insula volume predicted higher emotional empathy [67]. 
These findings were further supported by diffusion tensor imaging 
findings, showing altered structural connectivity involving frontal 
lobe and thalamic regions [68]. Neural alterations were discussed to 
underlie decreased empathy, which is characteristic for NPD [67]. 
Somatosensory gating (SG) during the anticipation and observation 
of pain, measured with EEG, was found increased in NPD subjects 
[69]. According to the authors, this finding may reflect an increased 
somatic representation of observed pain. A positive correlation of 
SG with impulsivity-egocentricity ratings suggests an increased 
somatic representation of observed pain despite lower empathy [69]. 
The NPD sample consisted of patients with high psychopathic traits, 
limiting the generalization of results. Though the database is sparse, 
imaging data so far support malfunction in the frontolimbic and 
paralimbic systems to underlie NPD psychopathology. 


Co-operation and social exclusion 


Perceived exclusion in a virtual ball-tossing game was associ- 
ated with hyperactivation of the dIPFC and precuneus in patients 
with BPD [70]. Furthermore and independent of actual exclusion 
by the co-players, hyperactivation of the dorsal mPFC/ACC, par- 
ietal cortex, occipital cortex, and insula was found [70]. Unlike 
healthy controls, BPD patients felt excluded, even when they 
were actually included. Furthermore, they also felt excluded and 
showed neural hyperactivation when players followed predefined 
rules [70]. The observed pattern was discussed to possibly reflect 
‘hypermentalizing, that is, the tendency to attribute intentions to 
co-players when behaviour is actually not guided by intentions. 
Abnormal anterior insula activation may reflect decreased expect- 
ations of co-operative behaviour in BPD [65], a potential precursor 
of break-ups in interpersonal co-operation. Additionally, increased 
anterior PFC activation during exclusion [71] may relate to altered 
emotion regulation in social interactions [72]. In summary, diver- 
gent social expectations appear to promote feelings of exclusion and 
impede the maintenance of relationships. This is reflected by diver- 
gent neural responding to social interaction in BPD. 

With regard to interpersonal aggression, patients increased both 
OFC and amygdala activation in an experimental manipulation pro- 
voking aggressive tendencies, while controls decreased amygdala 
activation [73]. As the latter study assessed patients with comorbid 
intermittent explosive disorder (IED), results may not generalize to 
BPD without IED. Mancke, Herpertz, and Bertsch [74] suggested 
a multi-dimensional model linking neuropsychology and neuro- 
chemistry data to the bio-behavioural dimensions of affective 
dysregulation, threat hypersensitivity, reduced cognitive empathy, 
reduced self-other differentiation, and impulsivity. The authors 
assumed a prefrontal-limbic imbalance to underlie impulsivity, 
affective dysregulation, and hypersensitivity to social threat cues, 
mediating aggressive tendencies often observed in BPD [74]. 


Behavioural regulation 


Decision-making 


Decisions are ubiquitous in daily life and ‘commit the organism to 
one out of several possible behaviors’ [75]. It is assumed that deci- 
sions are guided by the processing of subjective value that critically 
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involves the vmPFC and OFC (reviewed, for example, in [76]). Brain 
volumetry analyses revealed differences in mPFC/OFC grey matter 
volume in BPD patients vs healthy controls [77-83] and in BPD pa- 
tients vs other clinical populations [77, 84]. A meta-analysis found 
reduced grey matter volume in the ventral mPFC/OFC of patients 
with BPD [4]. 

fMRI studies demonstrated that BPD patients’ vmPFC response 
did not differentiate between reward and no-reward expectations; 
in contrast, this pattern was observed in healthy controls [85, 86]. 
Notably, effects were most pronounced for reward anticipation in 
face of emotional pictures [85]; in the presence of emotional pic- 
tures, BPD patients did not discriminate between anticipated reward 
and no-reward conditions. Group differences were also observed in 
upstream brain regions of the reward system such as the ventral stri- 
atum, ventral tegmental area, and amygdala. Generally, healthy par- 
ticipants, compared to BPD patients, showed stronger deactivations 
in these regions if no reward was expected [85]. In line with this, 
reduced neural discrimination between reward and punishment in 
BPD was found in a simple gambling task [87, 88]. Andreou et al. 
[87] localized this group interaction effect to the mPFC/ACC. In 
addition, they observed group differences in mPFC and OFC re- 
sponses to high vs low magnitudes of monetary reward and pun- 
ishment. In concert with the findings reviewed in this chapter, 
Schuermann and colleagues [89] demonstrated reduced frontal EEG 
discrimination between reward and punishment in a more complex 
decision-making task. In addition, they observed an increased P300 
response to punishments in BPD patients that may relate to delayed 
processing of feedback [89]. Alternatively, an increased P300 re- 
sponse may indicate differences in BPD patients’ expectations about 
receiving punishment [89]. In accordance with these findings, BPD 
patients did not adjust their choices and kept drawing from disad- 
vantageous card decks [89]. 

In summary, neuroimaging studies documenta lack of discrimin- 
ation between reward and punishments in the neural reward system 
of BPD patients. 


Learning 


Classical conditioning is used as a paradigmatic task for fear 
learning. In turn, the unlearning of fear is studied by investigating 
how well subjects extinguish an established conditioned response. 
BPD patients reported stronger affective responses in fear extinction 
[9]. However, on the neural level, no significant group differences 
emerged comparing BPD patients and healthy controls [9]. 

Fear can also be learnt by verbal instruction. In research, this 
phenomenon is investigated in ‘instructed fear’ studies where in- 
vestigators inform subjects about a possible negative stimulus (for 
example, an electrodermal pain stimulus) that might be delivered 
upon a certain cue but is actually never given during the experiment. 
BPD patients showed a prolonged amygdala response to instructed 
fear, while healthy controls decreased amygdala activation over time 
[10]. In addition, BPD patients showed altered connectivity patterns 
in the prefrontal-limbic regions in this paradigm [10]. While pa- 
tients’ amygdala activation failed to habituate, this was not repre- 
sented in skin conductance responses [10]. 

Turning to another learning process, Soloff et al. [90] investigated 
episodic memory encoding and retrieval. BPD patients observed af- 
fective and neutral pictures a first and later a second time where they 
should identify those already known pictures among unfamiliar 


pictures. Comparing with healthy control patients, differential ac- 
tivations in the hippocampus, ACC, dorsal PFC, and parietal cortex 
were seen for negative vs positive pictures. This suggests less differ- 
ential recruitment of brain areas associated with episodic memory 
formation in BPD. Despite neural differences, patients recognized 
items equally well as healthy controls during retrieval. 

To our knowledge, imaging studies in other personality disorders 
(excluding antisocial personality disorder, which are not addressed 
in this chapter) focusing on memory functions are not existent. 
Coutinho and colleagues demonstrated altered precuneus network 
connectivity in OCPD [91]. The authors speculated that altered 
precuneus connectivity may reflect over-involvement in anticipa- 
tory processes, leading to rumination in future-oriented planning 
at the cost of present moment experience and an attentional bias to 
minor details [91]. 


Cognitive conflict management 


Winter et al. [92] did not find a group difference comparing BPD 
patients to healthy controls in an fMRI study with the ‘emotional 
Stroop’ task. However, when dissociation was evoked by personal- 
ized autobiographical memory scripts, significant emotional inter- 
ference was observed comparing these patients to non-dissociated 
BPD patients and controls [92]. Moderate activation differences 
in the inferior parietal cortex and inferior temporal gyrus may re- 
flect impaired inhibition of task-irrelevant information caused by 
dissociation [92]. Dissociation involves disruptions of usually in- 
tegrated functions such as consciousness, memory, attention, pain 
perception, and perception of the self and environment, and is 
frequently reported by patients with BPD [93-95]. Compared to 
healthy controls, non-dissociated patients were found to increase 
responses to positive words in temporal, frontal, and cingulate re- 
gions, but no differences were observed with regard to negative 
words [92]. In contrast, Malhi et al. [96] found increased ventrolat- 
eral PFC and decreased dIPFC and amygdala activation in BPD pa- 
tients in the ‘emotional Stroop’ task, although lacking behavioural 
group differences. In their analyses, both PFC regions showed cor- 
relations with self-assessments of emotion dysregulation, which 
is supportive of increased involvement of the PFC in emotional 
interference resolution. In contrast, another study found decreased 
dIPFC activation [97]. Wingenfeld and colleagues [98] reported ac- 
tivations of a more posterior lateral PFC area when comparing the 
responses to negative vs neutral words in healthy controls to re- 
sponses in BPD patients, in addition to temporal cortex and ACC 
activations. 

Beyond word identification, emotional conflict can also be intro- 
duced to other cognitive processes. Krause-Utz et al. [5] found 
BPD patients to be more distracted by negative emotional pictures 
that were presented during a working memory task, evidenced by 
slowing of reaction times. Neurally, this was complemented by in- 
creased amygdala and anterior insula, as well as decreased dIPFC, 
activation [5]. Furthermore, stronger functional connectivity of 
the amygdala with the dorsomedial prefrontal cortex (dmPFC) was 
found, and stronger coupling of these regions was related to in- 
creased reaction times [99]. In addition, amygdala connectivity with 
the dorsal ACC was increased and amygdala activation decreased 
when patients reported higher dissociation [99]. These findings sug- 
gest that BPD is associated with an increased demand of cognitive 
resources to counteract intense emotional interference. 


Presenting emotional face distractors in a flanker-task, Holtmann 
et al. [100] found stronger dorsal ACC responses to fearful vs neutral 
faces in BPD patients, compared to healthy controls. A negative cor- 
relation of dorsal ACC response to incongruent vs congruent trials 
with anxiety ratings suggests involvement of this region in conflict 
resolution, modulated by anxiety. Behaviourally, no effect of distrac- 
tion on reaction times or accuracy was detected by the authors. As 
reviewed previously, emotional content is also suggested to interfere 
with neural representations of reward processing [85]. 

In contrast to emotional conflict, semantic conflict is provoked, 
for example in the colour-word Stroop or the face-word Stroop task 
[101], where semantically relevant distractors (that is, colour words 
or emotion words) compete with target stimuli (that is, print colour 
or faces with emotional expressions, respectively). To the best of 
our knowledge, there is currently no published study on the neural 
correlates of semantic conflict resolution in BPD. Though there is 
limited evidence for significant slowing in the colour-word Stroop 
task [102], we are not aware of a study testing interference with task- 
relevant emotional stimuli in BPD patients (utilizing the face-word 
Stroop task). 

In summary, though several studies found neural differences in 
emotional conflict processing in BPD, no consistent picture can be 
derived from the literature. Behaviourally, though significant inter- 
ference of emotional material with working memory was found, 
BPD patients did not show distraction effects in the majority of 
tasks. Negative findings may also relate to low robustness of the 
‘emotional Stroop’ effect [89]. 


Response control 


The ability to inhibit a prepotent response can shed light on impulse 
control. Impulsivity is not assumed to be a unitary construct; thus, 
different facets of impulsivity have been investigated, using both be- 
havioural experiments and self-assessment instruments, in BPD (for 
a review, see [103]). Subjects are required to withhold a prepotent 
behavioural response (usually a button press) to perform a correct 
action, while neural responses are measured. Jacob et al. [104] re- 
ported lower inferior frontal cortex (IFC) and higher subthalamic 
nucleus activation in BPD patients, compared to controls, after 
anger induction. The authors discussed this in terms of increased 
neural inhibitory activity of the IFC and compensatory subthalamic 
activity in BPD. However, behavioural data did not reveal group 
differences. When BPD patients were confronted with face stimuli 
showing affective expressions, increased brain activations were ob- 
served in task-related (parietal, basal ganglia, middle inferior OFC, 
hippocampus) and emotion-related (amygdala) brain areas [90]. 
This reflects task-dependent differences in the recruitment of neural 
circuits, perhaps due to increased proneness to cognitive inter- 
ference from emotional distraction. In face of affective vs neutral 
words, which were used as distracting stimuli while subjects per- 
formed a response control task, BPD patients decreased responses 
in the subgenual ACC, posterior medial OFC, and posterior ACC, 
relative to healthy controls [97]. Conversely, patients increased re- 
sponses in the lateral OFC and dIPFC, among others. There are 
major differences in methods and stimulus material in the latter two 
studies, limiting comparability. Nonetheless, they both support al- 
tered brain activations in response control in BPD patients when 
confronted with emotional material. With a stress induction pro- 
cedure, which might more potently affect information processing 
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systems in BPD, Cackowski and colleagues [105] found patients to 
make more response errors relative to a no-stress condition. 

With EEG, Ruchsow et al. [106] observed a reduced late positivity 
(P300) in BPD patients to events when subjects had to withhold a 
prepotent response. Conversely, no differences, compared to healthy 
controls, were found in the earlier N200 component. In addition, a 
negative correlation of the P300 component was found with cognitive 
impulsiveness and depressiveness in patients. The authors concluded 
that P300 findings point to altered response inhibition in BPD related 
to impulsivity and depression. On the other hand, unchanged N200 
responses are in favour of intact conflict resolution. Behaviourally, no 
differences between patients and controls were observed. The most 
recent neuroimaging work found neither neural nor behavioural dif- 
ferences between unmedicated BPD patients and matched healthy 
controls in tasks probing action withholding, response interference 
resolution, and stop signal reactions [107]. Further, no correlations 
with self-assessments of impulsivity and task data were found. 

In conclusion, the literature does not reflect a general impair- 
ment in the ability to control or stop actions in BPD (for reviews, 
see [103,107]). Cumulative evidence demonstrates that emotional 
provocation can deteriorate response withholding on the behav- 
ioural and neural levels in BPD. 


Closing remarks 


Neuroimaging work has contributed to a better understanding of 
the psychophysiological correlates of personality disorders. Existing 
research has resulted in a comprehensive picture of the neuro- 
biology of altered emotion processing and emotion regulation. This 
is fundamental for understanding BPD psychopathology, although 
the neural mechanisms are probably not unique to this disorder. 

Hyper-responsiveness to social cues in BPD is underscored by 
neuroimaging studies, providing support for altered activations in 
the brain’s attention and salience systems. Research has started to 
illuminate the neural basis of empathy, cooperation, and aggression 
in BPD. Current findings are congruent with socio-psychological 
models; however, neuroimaging studies in social interaction re- 
search in BPD are still sparse. As discussed previously, integration of 
these findings into bio-behavioural models of dysfunctioning may 
result in a more complete picture to guide future research. 

Diverse areas of executive functioning have been explored by BPD 
researchers such as decision-making, learning, cognitive conflict 
management, and response control. Taken together, many studies 
using tasks that are thought to tap into aspects of BPD psychopath- 
ology (for example, impulsivity, disinhibition, risky choice behaviour, 
emotional learning) lack positive behavioural findings, although 
they produce neural effects. In addition, the latter sometimes seem 
inconsistent in the aggregate. Furthermore, neuroimaging research 
in many areas of executive functioning in BPD is rare. In light of this, 
we currently do not have a clear picture on how neuroimaging find- 
ings relate to behavioural dysregulation in general in BPD. 
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The management of personality disorder remains a complex and 
challenging, but increasingly recognized and potentially rewarding, 
area of psychiatric practice. Our understanding of personality, per- 
sonality disorder, and its interactions with mental state disorders 
continues to develop, and alongside this, understanding of mech- 
anisms of management has also developed. Personality disorder is 
discussed in multiple places in this textbook, and some of the inter- 
ventions described here are developed in greater detail elsewhere, 
being general interventions for a wide array of presentations, as op- 
posed to personality disorder itself. Nonetheless, collating these into 
one chapter provides an overview of the current state of the evidence 
in the management of personality disorder and the influence of per- 
sonality pathology in those with comorbidities. Before we examine 
the current state of the literature, it is important to consider its po- 
tential shortcomings. This allows for a balanced understanding of 
the evidence base and its applicability to day-to-day clinical practice. 





All medical evidence is open to criticism. However, as the com- 
plexity of problems being addressed increases, the need to under- 
stand the limitations of the evidence without discarding it increases. 
The difficulties in interpreting the evidence base for the treatment 
of personality disorders is considered under four headings: popu- 
lation studied, treatment duration, comorbidity, and outcome 
measures used. 





Ideally, to allow translation of clinical trials to standard clinical 
practice, it is important for the study population to mirror the clin- 
ical population to which it is to be applied. This presents difficul- 
ties in the interpretation of personality disorder trials for numerous 





reasons. Importantly, there are no long-term, robust studies of the 
epidemiology of personality disorder in a general population. This 
makes the interpretation of trials difficult. We do not know what 
the natural progression of personality disorder is in the community. 
This is not to say there are no data to act as a guide, but simply the 
data that exist require interpretation with caution. The Children in 
the Community study [1], based in New York state and initiated in 
1975, has tracked the course of around 800 children over longer than 
a 20-year period. However, the inclusion of personality consider- 
ations occurred late with a ‘constructed’ measurement of uncertain 
validity, and no data were published from this study in relation to 
personality disorder for 18 years after its initiation [2]. The McLean 
Study of Adult Development (MSAD) [3] provides valuable informa- 
tion on the trajectory of borderline personality disorder (BPD). The 
Collaborative Longitudinal Personality Disorders Study (CLiPS) [4] 
follows the course of avoidant, obsessive-compulsive, borderline, 
and schizotypal personality disorders, but again broader issues as to 
the value of categorical diagnoses and the capacity to translate these 
data into other cultural settings provide at best a coarse backdrop 
from which to compare intervention studies. 


tment duration 





For most mental state disorders, it is relatively clear about the dur- 
ation over which to demonstrate the efficacy of a particular interven- 
tion, largely due to an understanding of the natural development of 
the disorder. In the case of personality disorder, since chronicity is 
acore feature of the condition, much longer time frames are needed 
than in many other disorders. Previously, periods of 2-3 years have 
been suggested. However, it is not clear how long is really needed. 
This problem is made more difficult by the relative instability of the 
categorical personality disorder diagnoses over time [5]. It is also 
clear that, in following those with categorical diagnoses longitu- 
dinally, there is a natural progression of the disorder, with many no 
longer reaching diagnostic criteria without targeted interventions 
[6]. It is not clear whether this translates into problem resolution 
[7]. This evidence suggests that larger numbers will be required for 
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trials to show an effect (taking into account the potential natural 
resolution of some participants) conducted over a time frame that 
takes this natural variation into account. It also suggests studies 
should examine treatment effect over variable time frames, to assess 
the duration of treatment needed to accelerate symptom remission. 
Therapy studies of one [8] or more [9] years have been conducted; 
however, few brief interventions have been considered. For those 
that have, they have not reported benefit, although understanding 
the reasons for this is complex. What is clear is that for a treatment 
to be considered useful, it needs to show improvement greater than 
would be expected from the natural history of the disorder; the 
improvement needs to be sustained, and follow-up should be able 
to find evidence of continued effectiveness in a real-world clinical 
setting. 

Consideration also needs to be given to the natural history of per- 
sonality development and the place of extremes of personality that 
cause distress, namely personality disorder. It is clear that person- 
ality develops and continues to do so throughout the life course, and 
this needs to be considered when the outcomes of trials on manage- 
ment are considered. This is most clearly highlighted by work that 
identifies personality disorder in adolescence and its specific treat- 
ment [10]. The personality disorders that garner the most clinical 
(and research) attention are externalizing, and these disorders tend 
to peak in mid life where the bulk of the literature into interventions 
is focused, improving from the fourth decade onwards. Later in life, 
paranoid, schizoid, and schizotypal presentations become increas- 
ingly apparent. The literature is starting to recognize this better [11], 
as well as the problems associated with it [12], although there has 
been little work on the management and treatment of later-life per- 
sonality disorder. 


Comorbidity 


Comorbidity is ‘the presence of any distinct clinical entity that has 
existed or that may occur during the clinical course of a patient who 
has the index disease under study’ [13]. The key word here is ‘dis- 
tinct, identifying two separate disorders that are distinct and not 
causally related in any way. Co-occurrence ranges from comorbidity 
to the presence of the same disorder in two or more differing forms 
[14]. Historically, personality disorder itself has been questioned as 
a diagnostic entity, with some suggesting it is ‘an adjective in search 
of a nour [15]. Frustratingly, comorbidity (or consanguinity) is 
the norm in personality disorders, both with other personality dis- 
orders [16] and with mental state disorder [17]. This may relate to 
the categorical constructs used to artificially separate normal from 
abnormal personality. In deciding the efficacy of a treatment for 
personality pathology, it would be useful to examine the interven- 
tion in a population without comorbidity. However, this is difficult, 
with a rate of BPD without comorbidity of as low as 5% [18] and 
no clear indication of the rates of comorbidity for other personality 
disorders. This comorbidity implies undertaking efficacy studies for 
the management of personality disorder is difficult to impossible, 
and the trials described in the following sections are, in effect, effect- 
iveness trials. While this has the advantage of clinical applicability, it 
leaves open the question as to whether the intervention is impacting 
primarily on the personality disorder or the comorbid condition. 
It may be more sensible to consider personality function in this 


regard—an intervention may not lead to a change in personality per 
se; however, the functioning of the individual may markedly change 
on the basis of changes in the environment, mental state changes, or 
cognitive consideration of the person’s internal world [19]. 


Outcome measures 


Understanding the most appropriate outcome measure in person- 
ality disorder trials is also problematic. Personality disorders that 
come to the attention of psychiatric services cause problems for an 
individual in their internal world, in their relationships with others 
and society in general. These are all potentially outcome targets to 
measure. Forensic clinicians generally consider social measures of 
the most importance (for example, reoffending), and this measure is 
often considered the most important by the general public. Although 
such measures offer a clear outcome target, there are so many po- 
tential confounders that it may well not reflect personal or person- 
ality change at all. Change in psychopathology is the most common 
measure in most psychiatric trials but may be inappropriate due to 
the likely presence of comorbid mental state disorder, as well as the 
reorientation of psychiatric services towards a recovery focus of care 
that minimizes the importance of symptoms over improvement 
in psychosocial functioning [20]. Direct use of repeated measures 
using personality assessment tools is also beset by problems related 
to the instability of these tools over time and the natural develop- 
ment of personality through the life course [21]. Currently, there is 
no agreed outcome measure in personality disorder trials, requiring 
careful consideration of each trial on its own. This problem needs to 
be kept in mind when considering meta-analyses or overviews that 
combine multiple trials in an effort to overcome the power problems 
described previously. It is probably best to have a range of outcome 
measures, being mindful of their shortcomings and examining for 
areas of convergence that may reflect more closely actual change. 


Taxonomic issues 


Overarching all of the issues described previously when consid- 
ering the literature related to the treatment of personality disorder is 
the issue of diagnosis and the use of the term ‘personality disorder’. 
The Diagnostic and Statistical Manual of Mental Disorders, fifth edi- 
tion (DSM-5) and the International Classification of Diseases, tenth 
revision (ICD-10) take a categorical approach to the definition of 
personality disorder. This is not based on in a single theoretical 
construct, the empirical literature, or clear divisions in the epi- 
demiological data to suggest a set number of discrete entities. These 
problems may go some way to explaining the difficulties in the utility 
of categories that experts agree neither are helpful nor reflect empir- 
ical understandings of personality [22]. Trials of interventions are 
heavily skewed towards BPD and antisocial personality disorder, 
and care is needed not to assume the heuristic that the weight of 
evidence (of varying strength and quality) implies these categories 
are more biologically based or amenable to intervention than other 
conceptualizations of personality. Rather these categories cause the 
most social distress and the greatest economic burden for society, 
focusing research on them for these reasons. The recently published 
ICD-11 has moved away from traditional personality disorder 
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categories [23]. Clinicians can now describe the severity of core per- 
sonality dysfunction (mild, moderate, or severe) when describing 
personality disorders. The ICD-10 specific personality disorders are 
abandoned entirely in favour of five broad trait domains based on 
the scientific literature on personality: negative affectivity, disinhib- 
ition, dissociality, detachment, and anankastia. A borderline speci- 
fier was also added to enhance clinical utility and continuity. The 
utility of assessing core personality functioning is supported by re- 
search showing that severity is more predictive of clinical outcomes 
than specific personality disorder categories [23a]. The range of be- 
haviours described by the domains may encourage treatment studies 
to venture beyond BPD into patients with detachment or anankastic 
symptoms. 


Psychotherapies 


Psychological treatment is now clearly recognized as the most ap- 
propriate primary intervention for the treatment of BPD, and this is 
identified in national guidelines [25]. The evidence discussed later 
in this chapter is related to BPD unless stated otherwise. Although 
the evidence is much weaker, a similar approach is recommended 
for other personality pathologies. A broad spectrum of psycho- 
therapeutic approaches have been tried, from strict behavioural 
approaches, through psychoeducation and supportive therapy, to 
analytic therapy. A recent Cochrane review identified more than 15 
modalities of therapy trialled in BPD alone [26]. These have been 
trialled in inpatient, outpatient, and partial hospitalization settings. 
Length of therapy and therapy mix (individual, group, and mixed 
individual and group) are also variable and often based on therapist 
choice or model adherence, as opposed to robust clinical evidence. 
The UK’s National Institute for Health and Care Excellence (NICE) 
recommends a mixed-therapy approach for patients with BPD, but 
little other guidance is available. Often the key determinant is re- 
source provision, heavily working against high cost services, such 
as traditional therapeutic communities, and significantly advan- 
taging treatments that reduce hospital presentations or admis- 
sions, as opposed to functional recovery. It is worth noting that 
the overall evidence for the effectiveness of psychotherapies is low, 
with the likelihood that well-designed randomized controlled trials 
(RCTs) in the future may lead to significant changes in clinical prac- 
tice. A recent systematic review and meta-analysis concluded that 
psychotherapies are effective for BPD. However, effect sizes were 
small, inflated by risk of bias, and unstable at follow-up [26a]. 





Cognitive therapies 







0] are the three largely cognitively based 
psychological interventions that are most researched. They have 
in common an attempt to integrate biological and social factors, 


linking past experience and relationships with current difficulty and 
testing out mechanisms intheshere and now to normalize dysfunc 
GEE chaviour and emotional responses. 


Cognitive behavioural therapy (CBT) was originally devel- 
oped for the treatment of depressive disorder, designed to help the 






patient identify and modify dysfunctional thoughts and beliefs 
through the use of specific techniques suc 
Although therapy relies on an understanding of development, the 
feast hry CBT-PD expands this narrow window regarding 
problems as originating in the temperament of the child, develop- 
ment, and early experience. These experiences, particularly early 

















With this in mind, one of the primary tasks of CBT-PD is to gain a 
solid understanding of childhood experience and the development 
of a shared formulation to link past difficulties with present prob- 


The meaning and content of these beliefs are explored, 
as well as their basis in negative early experience, possible neglect, 
and abuse. 





nce these patterns are identified, the therapist 
and patient develop strategies to test them out in the present and 
learn new, more adaptive patterns of response. The emphasis may 
be cognitive (for example, “How can I think about myself in posi- 
tive terms and collect evidence that supports this perspective?’) or 
behavioural (“When I feel criticized, I will not retreat and cut but 
ask the question, “Can you explain that to me?” to gain a better 
understanding of the other persons perspective’) [31]. Compared 
to CBT for mental state disorders, CBT-PD generally takes longer, 
places significantly greater importance on the maintenance of the 
therapeutic relationship, and uses experiences occurring in the 
therapeutic relationship as ‘microcosmic examples of experiences 
occurring in the real world. The logic behind this is the more in- 
grained and pervasive cognitive and behavioural patterns found 
in personality pathology would be expected to take longer to work 
through and are pervasive enough to be expected to occur in the 
therapeutic relationship. There is evidence of effectiveness in this 
approach [32]. 

DBT is a form of CBT linked to skills training and detached (or 
radical) acceptance conceptualized as mindfulness. DBT was one of 
the first therapies formally tested for its benefits in patients with BPD 
[33] and has been the subject of more RCTs than any other treat- 
ment for BPD. Working from a basis of assuming BPD is primarily 
a dysfunction of emotion regulation based on biology and early en- 
vironment, patients become emotionally vulnerable, struggling to 
regulate patterns of responses associated with emotional states. This 
leads to maladaptive behaviours and maladaptive problem-solving 
strategies. As the name implies, DBT has an implicit embracing of 
opposites, for example an acceptance of how things are and a drive to 
change, directed through therapy. From a working perspective, the 
essentials of DBT include: (1) manualized weekly individual therapy 
sessions; (2) group psychoeducational behavioural skills training; 
and (3) telephone coaching. The focus is on problem-solving skills, 
emotion regulation techniques, distress tolerance, validation, and 
mindfulness. Therapists also meet weekly to discuss therapy pro- 
gress, share problems, and ensure therapy adherence [29]. Nine 
RCTs have examined the effectiveness of DBT and summarized in a 
Cochrane review [26]. These showed DBT is beneficial in reducing 
anger and self-harm and generally improving overall functioning for 
patients with BPD. 
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SFT has been trialled in two RCTs, one of which importantly is 
a head-to-head trial against transference-focused therapy (TFT) 
[30]. Building on CBT, SFT expands the basic notions of cogni- 
tions affecting behaviours and affects to include considerations 
from object-relations theory, in particular. There are assumptions 
of maladaptive schemas developed early in life that lead to maladap- 
tive interpersonal relatedness in adulthood. This leads to individual 
therapy sessions, with a focus on understanding and redressing 
maladaptive schemas and elements of CBT (and gestalt) to adapt or 
replace maladaptive coping styles. Techniques focus on the patient- 
therapist relationship with SFT, describing a ‘limited reparenting’ 
role for the therapist. Some describe SFT as an integrative psycho- 
therapy, as opposed to a predominantly cognitively based therapy. 
The study of Giesen-Bloo and colleagues showed advantages of SFT 
over TFT in most domains related to BPD. 


Dynamic therapies 


Transference-focused therapy [34] and mentalization-based therapy 
(MBT) [35] are the two dynamically oriented therapies that have 
been trialled for the management of BPD. Although very different 
in approach, both are based on an understanding of the influence of 
the unconscious on the presentation of patients in their theory. They 
utilize this understanding of the unconscious to enable the patient to 
develop insight into their difficulties, enabling change. 

MBT [35] was specifically designed as therapy for patients with 
BPD, conceptualized as a problem of disorganized attachment and 
subsequent failure to mentalize. Mentalization is the capacity to in- 
terpret action and derive an internal meaning for it. This is the basis 
of relationships (and understanding of self), and developing this 
enables normalization of the emotional difficulties (neither too in- 
tense nor too detached) and behaviours of those with BPD. Having 
stabilized emotional expression in the initial stages of therapy, the 
primary aim of all therapy is to increase mentalization, this being 
more important than the ‘task orientation of cognitively based ther- 
apies (be they cognitive or behavioural tasks). This requires a ‘not 
knowing’ stance that accepts the various positions of the patient and 
focuses on description (“What is going on for you now?’), as opposed 
to understanding (‘Why are you doing that?’). It is important for the 
therapist to be mindful of their own mentalizing stance throughout 
therapy. MBT has been trialled in both partial hospitalization [36] 
and outpatient [37] settings and shown to be effective in BPD. 

TFT is also designed to manage the problems of BPD and devel- 
oped from the object-relations model of borderline personality or- 
ganization [38]. It assumes internal and unconscious conflict exists 
between internal representations of both self and others, and this 
leads to emotionally unstable relationships. These distorted internal 
representations emerge in the process of therapy in a classic transfer- 
ence style of understanding and interpretation. This is a major inter- 
vention designed to lead to internal change and also (ultimately) 
behavioural change. The end-goal of TFT, like all psychotherapy for 
BPD, is amelioration of symptoms, reduction of non-suicidal self- 
harm, and improved relationships. TFT is a long-term individual 
therapy, as long as twice-weekly sessions for 3 years, and works at 
containing dangerous behaviours, managing behaviour that im- 
pairs therapy, and synthesizing the ‘split-off’ part of self into an in- 
tegrated whole. Although there is evidence of effectiveness, this is a 


time-intensive treatment and has been shown to be inferior to SFT 
in a head-to-head trial. 





Other structured therapies 


Interpersonal therapy (IPT) has been modified for application in 
patients with BPD on the basis of efficacy of IPT in patient with de- 
pression and comorbid BPD. A small trial has suggested this may be 
of benefit [39] but needs replication. Systems Training for Emotional 
Predictability and Problem Solving (STEPPS) is aligned with cognitive 
therapies, but largely psychoeducational. It is a structured programme 
of 20 2-hour weekly group sessions, with a learning goal to be derived 
from each session. Evidence of effectiveness exists [40]. Cognitive ana- 
lytic therapy has also shown weak evidence of effectiveness in adoles- 
cents with personality disorder and traits [41], and this well-constructed 
trial is important not only for its evidence of effectiveness, but also for 
clearly displaying the utility of a BPD diagnosis in adolescence. 


Social therapies 


Social therapies were the traditional mainstay for the manage- 
ment of personality disorders, bridging the time of development of 
pharmacotherapies and implementation of modern psychotherapies. 
Personality disorders do, by definition, cause significant distress to 
the community, and it is perhaps not surprising that efforts to con- 
sider alternative community settings were thought to be the best ap- 
proach. In general, they recognize the influential elements of natural 
communities and aim to use these communities and the environ- 
ment into the tolls for change. Although these approaches are used 
considerably less today, they remain an important element of the 
overall bio-psychosocial management armory and should be con- 
sidered for the most severe and disruptive patients with personality 
disorder. Notably, the therapeutic communities described in the fol- 
lowing paragraphs are not those utilized by correctional services in 
the United States, although they have at times been considered in 
this light. 

Therapeutic communities have been described variously, al- 
though their modern context was defined and described by Maxwell 
Jones [42], who created a structure that ran counter to the traditional 
mental hospital. This can be defined as a socially cohesive struc- 
ture, depending on intensive group treatments carried out by the 
residents who are involved in treatment in a democratic way. These 
communities have a strong historical basis for their effectiveness but 
are complex, long-term interventions and, as such, are expensive 
to run and difficult to assess, particularly compared to manualized 
psychotherapy or pharmacotherapy. Nonetheless, there is some evi- 
dence for their effectiveness in personality disorder, although there 
are no RCTs that have been completed. 

Nidotherapy is a novel social therapy described as ‘the collabora- 
tive systematic assessment and modification of the environment to 
minimise the impact of any form of mental disorder on the individual 
in society’ [43]. It was developed and trialled with personality dis- 
order as a central focus, in association with other comorbid mental 
state disorders [44]. Interestingly, this ‘add-on therapy’ does not aim 
to directly change psychopathology; rather it focuses on developing a 
better ‘fit between an individual and their environment. It is normally 
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carried out by a nidotherapist working independently from other clin- 
ical services, while liaising with them. There is some RCT evidence 
this form of therapy may lead to cost savings and improvements in 
social functioning, the therapy’s natural target endpoints [45]. 


Pharmacotherapies 


The rationale for using drugs in the treatment of personality dis- 
orders is that some of the behaviours associated with personality 
disorders may reflect disordered neurochemistry [46]. The poor 
empirical basis for the creation of the individual personality dis- 
orders in DSM-III, their heterogeneity, and the absence of evi- 
dence to support drug treatment of specific personality disorders 
have led researchers to largely ignore the DSM categories and to 
focus on dimensions of psychopathology. The algorithm gener- 
ally used to study drug effects was proposed by Siever and Davis 
[47] and further developed by Soloff It suggests that four 







personality disorder categories and that drug treatment effects on 
these dimensions should be studied, rather than on individual per- 
sonality disorders. Although heuristically appealing, little evidence 
exists to support their validity. The dimensions have never been 
tested in hypothesis-driven studies [49]. In addition, although the 
algorithm was designed to study behaviours across all personality 
disorder categories, most clinical trials have used participants with 
BPD. A systematic review reported that over 70% of all drug trials 
used subjects with BPD [50]. The review also noted that most trials 
were underpowered, with a mean of 22.4 participants in the treat- 
ment group and 19.3 in the control group, were of short duration, 
averaging 13.2 weeks, and had a wide range of outcome measures 
[50]. Finally, we should note that many studies were sponsored by 
the pharmaceutical industry. 


Cluster A personality disorders 


Studies have focused on the drug effect on cognitive perceptual dis- 
turbances in this population. Patients with schizotypal personality 
disorder have been studied in a few small, usually open-label studies 
using dopamine antagonists [51]. Patients showed some improve- 
ment in overall symptom severity, but the risk-to-benefit ratio appears 
poor. No RCTs for patients with schizoid or paranoid personality dis- 
orders have been undertaken, and therefore, no robust evidence about 
the efficacy of drugs in these patients is available at present. 


Cluster B personality disorders 


As previously noted, most evidence for the use of drugs in patients 
with Cluster B personality disorders have been derived from studies 
on subjects with BPD, using the Siever and Davis algorithm [47]. 
This has led to a number of recommendations which are confusing 
and, at times, contradictory. Some guidelines advocate symptom- 
targeted pharmacotherapy, based on Siever and Davis’ dimensions, 
while others state drug treatment should be avoided generally in pa- 
tients with BPD. 


The American Psychiatric Association (APA) guideline [52] states 
that symptom-targeted pharmacotherapy is an important adjunct 
treatment. It suggests that affective instability is treated with se- 
lective serotonin reuptake inhibitors (SSRIs) or monoamine oxidase 
inhibitors (MAOIs), impulsive aggression with SSRIs or mood sta- 
bilizers, and cognitive—perceptual disturbances with low-dose dopa- 
mine antagonists. The World Federation of Societies of Biological 
Psychiatry guidelines [53] state that moderate evidence exists for 
dopamine antagonist drugs being effective for cognitive-perceptual 
and impulsive-aggressive symptoms, some evidence exists for SSRIs 
being effective for emotional dysregulation, and there is some evi- 
dence for mood stabilizers being effective for emotional dysregulation 
and impulsive-aggressive symptoms. The most recent Cochrane re- 
view [54] partially contradicted these guidelines, reporting no evi- 
dence for the efficacy of SSRIs, but did report that mood stabilizers 
could diminish affective dysregulation and impulsive-aggressive 
symptoms in patients with BPD and antipsychotics could improve 
cognitive-perceptual symptoms and affective dysregulation. 

In contrast, the UK NICE guidelines [55] state that drug treatment 
should generally be avoided, except in a crisis, and then given for no 
longer than 1 week. More recent guidelines for the treatment of BPD 
from the Australian National Health and Medical Research Council 
(NHMRC) [56] again reviewed the literature, including conducting 
a series of meta-analyses. It concluded that pharmacotherapy did 
not appear to be effective in altering the nature and course of BPD. 

These apparently contradictory recommendations may reflect 
the weight given to risks, as well as benefits, of drug treatment. 
Both NICE and NMHRC Committees acknowledged evidence 
existed that some dopaminergic drugs (notably aripiprazole and 
olanzapine) and mood stabilizers (notably topiramate, lamotrigine, 
and valproate) may reduce BPD symptoms over the short term. They 
concluded that substantial long-term risks did not justify recom- 
mending these drugs when alternative psychosocial interventions 
do not carry such risks. 

A pragmatic compromise may be to acknowledge the real con- 
cerns about using drugs in this population and be guided towards 
using drugs with at least some evidence of efficacy, using them spar- 
ingly and for short periods. The current evidence [57] would suggest 
using dopamine antagonists/partial agonists and mood stabilizers, 
rather than SSRIs, tricyclic antidepressants, and benzodiazepines. 
However, a recent large, well-conducted RCT testing lamotrigine for 
BPD reported no evidence for its efficacy or cost-effectiveness [57a]. 
A more radical view, articulated in the NICE guidelines [55], is that 
if patients have no comorbid illness, efforts should be made to re- 
duce or stop pharmacotherapy. 

Drug treatment of antisocial personality disorder has almost no 
evidence base [50]. The NICE guidelines for antisocial personality 
disorder concluded that pharmacological interventions should not 
be used routinely for the treatment of antisocial personality disorder 
or its associated behaviours. However, NICE states that drugs can 
be used for comorbid mental disorders. A recent meta-analysis by 
Khalifa and colleagues [58] reached similar conclusions. 





Cluster C personality disorders 


No RCTs have been published of drug treatment for patients with 
Cluster C personality disorders. However, the World Federation 
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of Societies of Biological Psychiatry guidelines [53] suggests that 






In summary, the evidence base for psychopharmacology in pa- 
tients with personality disorders is poor. Drug treatment should be 
used sparingly for short periods, with careful consideration of risks 
and benefits. Hopefully, more useful drugs may become available. 
For example, targeting N-methyl-D-aspartate signalling pathways, 
which have effects on disinhibition, social cognition, and dissocia- 
tive symptoms, has potential [59]. There is a 







[62]. However, small 
open trial results have been mixed [62, 63]. We still have a long 
way to go. 


Managing risk or managi 





In addition to interventions designed to alter the trajectory of person- 
ality disorder, a service orientation towards recognizing this problem 
and ensuring a seamless service is considered important. This needs 
to take into account the risks presented, while recognizing the limi- 
tations in the evidence for effective interventions, and not inadvert- 
ently positively reinforcing unhelpful or dangerous behaviours. For 
example, if hospitalization is the default response to non-suicidal 
self-injury and a patient with severe personality disorder is struggling 
in the community, there is a risk that they will self-harm in order to 
gain access to an inpatient ward. Developing a plan with the patient, 
which recognizes these difficulties, is important to ensure the patient, 
services, and the wider community that may be involved (for ex- 
ample, police and paramedical staff) are clear as to the best approach 
in difficult situations. Providing a consistency of response allows for 
a patient with personality disorder to develop an understanding of 
boundaries and balances long- and short-term risks. Constancy in 
personnel further allows for an understanding of behaviour driven 
by long-term traits and the more acute psychopathologically driven 
behaviours. This sort of conceptualization and understanding of the 
purpose of behaviour reflects what is known of personality disorder 
in regard to its structure [3, 64]. 


Conclusions 


Although the management of personality disorder remains in flux, 
much progress has been made over the last decade. Increased clarity 
associated with an improved understanding of taxonomy has opened 
the door to the development of novel trait-based psychotherapeutic 
interventions. The importance of psychotherapy as the primary 
intervention of value, particularly in BPD, has been strengthened 
and the relatively limited place of pharmacotherapy is clearer. The 
potential for ‘add-on’ therapy, such as STEPPS or nidotherapy, is also 
clearer. Having said this, there remains little evidence to guide any 
but BPD treatment. The evidence for antisocial personality disorder 
treatment suggests little leads to long-term change. Almost nothing 
is known as to the effectiveness of interventions for other categorical 
diagnoses. Added to this, it is likely that ICD-11 will take a radic- 
ally different approach to diagnosis from that of DSM-5, and this 


opens the door to both confusion and improved study of personality 
problems. In the future, well-designed large randomized or complex 
trials with novel findings are likely to lead to significant changes in 
clinical practice, suggesting the current evidence is at best weak. 
Despite the significant steps forward, clarity about the limits of the 
evidence and the need for n = 1 trials with our patients remain im- 
portant. Acknowledging the changes in personality through the life 
course [21] and the need to be cognizant of our limits and expertise 
ensures we will be able to help patients with personality disorder, 
without overstating the current state of the evidence. 
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The hallmark of impulse-control disorders (ICDs) is difficulty with 
emotional and behavioural self-regulation, as manifested by diffi- 
cult, disruptive, or aggressive behaviour. Recognized in Diagnostic 
and Statistical Manual of Mental Disorders, third edition (DSM-III) 
[1] as ‘disorders of impulse control not elsewhere classified’, the 
category included pathological gambling, kleptomania, pyromania, 
and both intermittent and isolated explosive disorders. All were uni- 
fied by the failure to resist an impulse to perform an act harmful to 
the individual or others, increasing tension before the act, and the 
experience of pleasure, gratification, or release at the time of the act. 
They were believed ego-syntonic at the ‘moment of discharge’ but ac- 
companied by guilt or regret afterwards [2]. The chapter continued 
in DSM-III-R, DSM-IV, and DSM-IV-TR [3]. Trichotillomania was 
added in DSM-III-R, while isolated explosive disorder was dropped 
after DSM-III. 

The chapter was reorganized in DSM-5 [4], with intermittent 
explosive disorder (IED), kleptomania, and pyromania con- 
tinuing in a new chapter called ‘Disruptive, impulse-control, 
and conduct disorders. Pathological gambling was moved to 
the chapter on ‘Substance-related and addictive disorders’ and 
renamed gambling disorder, while trichotillomania was moved 
to the chapter on ‘Obsessive-compulsive and related disorders’. 
Oppositional defiant disorder (ODD) and conduct disorder, pre- 
viously included in the chapter on childhood disorders, were 
brought into the new chapter, reflecting the view that they too 
are disorders of self-regulation. 

ICDs received limited recognition in International Classification 
of Diseases, ninth revision (ICD-9) [5]. The category ‘Disturbances 
of conduct not elsewhere classified’ included conditions character- 
ized by aggressive and destructive behaviour. Kleptomania was given 
as an example of a ‘compulsive conduct disorder’. In ICD-10 [6], the 
renamed category (“Habit and impulse disorders’) was expanded to 
include gambling disorder, pyromania, pathological stealing (klep- 
tomania), and trichotillomania (with IED listed under ‘Other habit 
and impulse disorders’). These disorders were characterized by re- 
peated acts having no clear rationale. 





Despite their historical roots, ICDs were not officially recognized 
until DSM-III. In the nineteenth century, the French psychiatrist 
Jean Etienne Esquirol used the term monomania to describe a group 
of conditions in which people had irresistible urges without ap- 
parent motivation [7]. Pyromania was recognized by the nineteenth- 
century German psychiatrist Griesinger as a ‘morbid impulse, which 
drives the person to commit destructive acts. In the early twentieth 
century, Bleuler wrote about reactive impulses that borrowed from 
Kraepelin’s impulsive insanity [8, 9]. Included were pyromania, klep- 
tomania, and compulsive shopping (oniomania). More recently, 
many of these conditions have been described as behavioural ad- 
dictions, characterized by the presence of unrestrained or poorly 
controlled behaviours arising in the absence of alcohol or drugs of 
abuse [10]. 

This chapter discusses the following DSM-5 disorders: ODD, IED, 
pyromania, kleptomania, and gambling disorder. Trichotillomania 
is discussed in Chapter 93 on obsessive-compulsive and related 
disorders. For completeness, the chapter will include a discussion 
of three non-DSM conditions that involve poorly regulated behav- 
iours: compulsive shopping (CS), Internet addiction, and compul- 
sive sexual behaviour (CSB). Despite its reclassification in DSM-5, 
conduct disorder is discussed in Chapter 124. 





ODD is a diagnosis for persons who exhibit negativistic, hostile, de- 
fiant, and disobedient behaviours towards others. ODD is mostly 
a diagnosis for children and adolescents but may also be used in 
adults. The diagnosis is made on the basis of angry or irritable, de- 
fiant, or vindictive behaviour of at least 6 months’ duration, with 
a minimum of four of eight symptoms in three categories: angry/ 
irritable mood, argumentative/defiant behaviour, and vindictive- 
ness. DSM-5 [4] considers ODD to be a developmental antecedent 
for some youth with conduct disorder, thereby suggesting that they 
may reflect different stages of a spectrum of disruptive behaviours. 
ODD was introduced in DSM-III [1] as ‘oppositional disorder’ 
to characterize those with a negative and disobedient opposition 
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to authority. The diagnosis received its current name with DSM- 
III-R. The diagnosis had many commonalities with DSM-II’s [11] 
unsocialized aggressive reaction, a diagnosis used to describe loners 
with a pattern of hostile disobedience, aggressiveness, stealing, and 
lying, behaviours thought to result from inconsistent discipline and 
parental rejection. 

The prevalence of ODD has ranged in studies from 1% to 11%, 
with an average of 3.3% [4]. The disorder is more common in boys 
than girls prior to adolescence [12]. ODD has a mean age of onset of 
6 years and may precede the onset of conduct disorder. Youth with 
ODD are also at risk for developing mood and anxiety disorders 
[12, 13]. The defiant, argumentative, and vindictive symptoms carry 
most of the risk for conduct disorder. Angry/irritable mood symp- 
toms carry most of the risk for internalizing disorders. While all 
children show oppositional behaviour from time to time, the diag- 
nosis is given to those with frequent, recurrent, and problematic be- 
haviours, for example temper outbursts, arguments with parents or 
other authority figures, and a refusal to obey orders. 

The disorder tends to be stable over time [12]. Boys who develop 
conduct disorder have higher numbers of ODD symptoms that 
those who do not. There appears to be a genetic overlap of ODD 
with other disruptive disorders, including conduct disorder and 
attention-deficit/hyperactivity disorder. 

Other disorders need to be ruled out. Unlike conduct disorder 
which specifies that the child must have violated personal rights 
and social rules, ODD is defined on the basis of difficult and dis- 
ruptive behaviour. Attention-deficit/hyperactivity disorder may be 
comorbid with ODD but is a diagnosis used in those with problems 
of sustained effort and attention. ODD shares many features with 
disruptive mood dysregulation disorder, such as negative mood 
and temper outbursts, but the severity, frequency, and chronicity 
of temper outbursts are more severe in children with disruptive 
mood dysregulation disorder than in those with ODD. (In DSM-5, 
the diagnosis of disruptive mood dysregulation disorder takes pre- 
cedence over ODD, if the criteria for both disorders are met.) IED 
also involves high rates of anger, but people with this disorder show 
serious aggression towards others that is not part of the definition 
of ODD. 

There has been little neurobiologic research with regard to ODD. 
One study showed elevated levels of dehydroepiandrosterone sulfate 
in children with ODD, in contrast to children with attention-deficit/ 
hyperactivity disorder and controls, suggesting to the authors that 
stress or genetic factors have led to a shift in adrenocorticotrophic 
hormone (ACTH)-f-endorphin functioning in the hypothalamic- 
pituitary—adrenal axis. Another study found a specific pattern 
of single-nucleotide polymorphisms associated with attention- 
deficit/hyperactivity disorder comorbid with ODD, compared with 
attention-deficit/hyperactivity disorder alone, especially for meas- 
ures of argumentative and defiant behaviours. 

There is no standard treatment for ODD, but common sense sug- 
gests that because most patients with ODD are children, clinical 
management should emphasize individual and family therapy, with 
treatment of co-occurring attention-deficit/hyperactivity disorder 
or other disorders with medications, as needed, such as stimulants, 
guanfacine, or clonidine [12]. Family-based interventions include 
parental management training and child problem skills training. 
The former aims to teach parents to better manage their child’s 
behaviour, as well as to promote desired behaviours. ‘The latter is 


cognitively based and aims to help children learn to manage anger, 
improve problem-solving ability, delay impulsive responses, and im- 
prove social interactions. School-based programmes, such as those 
aimed at resisting negative peer influences and reducing bullying 
and antisocial behaviour, may also be helpful. 





Intermittent explosive disorder 


IED was new to DSM-III [1] and included as a diagnosis for those 
with episodes of verbal or physical aggression grossly out of pro- 
portion to the stressor. IED was considered roughly equivalent to 
DSM-III’ [11] explosive personality. A related diagnosis—isolated 
explosive disorder—was also included in DSM-III and was meant to 
be used in persons with a single, discrete episode of uncharacteristic 
aggression. It was dropped from later editions. 

In DSM-5 [4], IED is defined by the presence of recurrent behav- 
ioural outbursts that are grossly out of proportion to the provocation 
or stressors, are not premeditated, occur in an individual aged 6 or 
older, cause distress or functional impairment, and are not better ex- 
plained by another mental disorder, medical condition, or the effects 
of a substance. ICD-10 [6] lists IED under ‘Other habit and impulse 
disorders’ but does not provide criteria. 

People with IED describe their aggressive episodes as brief, 
explosive, uncontrollable, and unpremeditated, and typically 
provoked by minor events [14]. They may experience changes 
in mood, awareness, and autonomic arousal before the outburst. 
The frequency of episodes depends, in part, on how the disorder 
is defined. In the National Comorbidity Survey-Replication 
(NCS-R) [15], whereby DSM-IV criterion A was operationalized 
as 23 lifetime attacks, persons with IED had a mean of 43 life- 
time attacks. Many people with IED have a history of chronic 
anger or irritability accompanied by frequent minor episodes. 
Subthreshold episodes are similar to the anger attacks (sudden 
episodes of intense anger with autonomic arousal) often de- 
scribed in patients with mood disorders. 

Before making the diagnosis, IED needs to be distinguished from 
Cluster B personality disorders associated with anger outbursts 
(antisocial and borderline personality disorders), neurocognitive 
disorders characterized by verbal or physical outbursts, substance 
abuse and intoxication causing behavioural disinhibition, and the 
childhood-onset disorders disruptive mood dysregulation disorder, 
autism spectrum disorder, attention-deficit/hyperactivity disorder, 
ODD, and conduct disorder. In the case of the childhood-onset dis- 
orders, the additional diagnosis of IED may be warranted when out- 
bursts are deemed in excess of those usually seen in the disorders 
and warrant independent clinical attention. 

IED is common in clinical and general population samples, with 
an estimated lifetime prevalence in the general population of 7.3% 
[14]. IED is more common in men than women. IED begins in 
childhood or adolescence and rarely occurs after age 40. The dis- 
order follows a chronic or episodic course and is associated with 
distress, morbidity (for example, accidents), and social and occupa- 
tional impairment. In the NCS-R [15], IED had a mean age at onset 
of 14 years, was persistent over the life course (with averages of 6.2- 
11.8 years with attacks), and was associated with substantial role im- 
pairment. The prevalence of the disorder was much lower in persons 
aged 60 years and older. 
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Psychiatric comorbidity is common in persons with IED. In the 
NCS-R [15], 82% of respondents with IED met criteria for at least one 
other lifetime disorder, in particular depressive, anxiety, and sub- 
stance use disorders. It was also significantly comorbid with ODD, 
conduct disorder, and attention-deficit/hyperactivity disorder. 

Family studies suggested that first-degree relatives of people with 
IED have high rates of impulsive violence, substance misuse, and 
possibly mood and other ICDs [14]. A blinded, controlled family 
history study using broadly defined IED criteria found a signifi- 
cantly increased morbid risk of the condition in relatives of affected 
probands (26%), compared with relatives of control probands (8%). 

In terms of neurobiology, people with emotion dysregulation and 
aggression have been shown to have disturbed serotonergic func- 
tion [14] and functional abnormalities in both the limbic system 
and the orbitofrontal cortex. In a functional magnetic resonance 
imaging (MRI) study of response to social threat [16], ten subjects 
with IED showed exaggerated amygdala reactivity and diminished 
orbitofrontal cortex activation to faces expressing anger, compared 
with controls. The authors noted these findings were similar to other 
disorders characterized by impulsive aggression, including border- 
line personality disorder and bipolar disorder, and that they sup- 
ported a link between a dysfunctional frontal-limbic network and 
aggression. 

There are no standard treatments for IED. Cognitive behavioural 
therapy has been used, with a focus on anger management [17]. 
Patients learn to recognize when they are becoming angry and to 
identify and defuse the triggers that lead to outbursts. One study 
showed that cognitive behavioural therapy was superior to a wait 
list in reducing anger and hostility in persons with IED. The pro- 
gramme employed relaxation training, imagery, rebreathing, use of 
time-outs, and cognitive restructuring. 

Medication has been used to reduce or eliminate aggressive im- 
pulses [17] in persons with impulsive aggression. The strongest 
evidence supports the use of selective serotonin reuptake inhibi- 
tors (SSRIs) in the treatment of IED. There is some evidence that 
mood stabilizers and drugs for psychosis may also play a role in 
reducing anger outbursts in persons with behavioural dyscontrol. 
Benzodiazepines should be avoided because of their tendency to 
cause behavioural disinhibition. 


Pyromania was first recognized in the nineteenth century, its name 
coined in 1833 by Marc, a French psychiatrist [3]. He described 
the disorder as a form of instinctive and impulsive monomania. 
Griesinger, Bleuler, and Kraepelin, all active at the turn of the twen- 
tieth century, considered pyromania a ‘morbid impulse’ [8, 9, 18]. 
The disorder was briefly mentioned in DSM-I [19] as a supplemen- 
tary term but was not mentioned in DSM-IL Pyromania was in- 
cluded in DSM-III [1] and has continued to the present. 

DSM-5 [4] defines pyromania as the deliberate and purposeful 
fire-setting on more than one occasion; tension or affective arousal 
before the act; fascination with, interest in, curiosity about, or at- 
traction to fire and its contents and characteristics; and pleasure, 
gratification, or relief when setting fires or when witnessing or 
participating in their aftermath. Importantly, those who set fires 
for political motives, out of anger or to seek vengeance, to conceal 


crimes, or to improve one’ living situation (for example, by claiming 
insurance benefits) do not have pyromania. Likewise, the fire-setting 
cannot be in response to a psychosis or to impaired judgement from 
a neurocognitive disorder or drug of abuse. Thus, an arsonist who 
sets fires for monetary gain or for political or criminal purposes 
would not merit the diagnosis. In ICD-10 [6], pyromania is defined 
as multiple acts of, or attempts at, setting fire to property or other 
objects, without apparent motive and by a persistent preoccupation 
with subjects related to fire and burning. 

Data show that around 1% of the general population report a life- 
time history of fire-setting, though this is only one component of 
pyromania [20]. A study of psychiatric inpatients found that about 
6% had a lifetime history of pyromania [21]. Pyromania is probably 
more common in men than women and usually begins in adoles- 
cence or early adulthood. Mood, substance use, and other ICDs 
are common in people with pyromania. Fire-setting is considered 
a poor prognostic sign for children with conduct disorders and is 
associated with adult aggression [20]. The course of pyromania is 
unknown, but clinical descriptions suggest that the course of pyro- 
mania is episodic and tends to wax and wane. 

Before making the diagnosis, the clinician should rule out other 
causes of intentional fire-setting, including normal developmental 
experimentation (for example, playing with matches), antisocial 
personality disorder, and adult antisocial behaviour. Accidental fire- 
setting, as might occur in a person with a neurocognitive disorder, 
substance abuse or intoxication, and psychosis, should be ruled out. 

There are no standard treatments for pyromania [22]. Much of 
the literature on the use of psychological treatments has focused on 
children and includes behavioural therapies, family therapy, and fire 
education. Case reports have suggested benefit from SSRIs, lithium, 
topiramate, olanzapine, and valproate. 


Kleptomania 


Kleptomania has been recognized for nearly 200 years [3]. Bleuler 
provided one of the first clinical descriptions: “The kleptomaniacs 
in the old sense cannot even otherwise resist the impulse of appro- 
priating things ...’ ([8], p. 539). The disorder was listed as a supple- 
mentary term in DSM-I and was formally included in DSM-III. The 
disorder has continued to the present. 

In DSM-5 [4], kleptomania is defined as the ‘recurrent failure to 
resist impulses to steal objects that are not needed for personal use 
or for their monetary value’ In addition, there is an increasing sense 
of tension immediately before committing the theft, followed by 
pleasure, gratification, or relief at the time of the theft. Importantly, 
individuals do not steal to express anger or vengeance, or steal in 
response to hallucinations or delusions. In ICD-10 [6], kleptomania 
(or pathological stealing) is defined as the repeated failure to resist 
impulses to steal objects that are not acquired for personal use or 
monetary gain. 

Kleptomania prevalence is unknown, perhaps because individ- 
uals rarely report their symptoms. A survey of the adult general 
population in the United States found shoplifting to have a lifetime 
prevalence of 11% [23], though stealing is only one component of 
kleptomania. A survey of nearly 800 college students reported a 
0.4% current prevalence for kleptomania [24]. Grant et al. [21] re- 
ported a 9% lifetime prevalence rate in psychiatric inpatients. 
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People with kleptomania describe irresistible impulses or urges to 
steal that build until satisfied [25]. They understand their impulses 
are senseless and intrusive and that their behaviour is wrong. Many 
try to resist the impulses with varying degrees of success. Stealing 
itself is accompanied by a ‘rush’ or ‘high, although that feeling dis- 
sipates as the potential consequences of the act become apparent. 
While most stealing is impulsive, some is premeditated. People 
with kleptomania tend to steal items that they otherwise could af- 
ford such as toiletries, make-up, or jewellery. Triggers include feel- 
ings of depression, anxiety, or boredom, or sometimes the particular 
sights, sounds, and objects found within a store. Lying to conceal the 
stealing is common. Some stop stealing temporarily following an ar- 
rest for shoplifting, but the behaviour resumes unfettered for most. 
A small number of individuals report that they have no memory of 
the stealing or that it occurs in a dream-like (or dissociative) state. 
For these individuals, stealing may become automatic. 

Clinicians should rule out other causes of stealing before making 
the diagnosis of kleptomania. Other possible diagnoses include anti- 
social personality disorder, crime occurring in the context of adult 
antisocial behaviour, and stealing that may occur in the course of a 
neurocognitive disorder, mania, or psychosis. 

Kleptomania appears more common in women than in men [25]. 
Many cases begin in the late teens to early 20s and often follow an 
episodic or a chronic course. By the time patients seek treatment, 
women are typically in their mid- to late 30s, while men are in their 
50s. Clinical studies show that kleptomania often co-occurs with 
other mental psychiatric disorders, including mood, anxiety, sub- 
stance use, and eating disorders (particularly bulimia). Kleptomania 
may also be associated with compulsive shopping. 

Family study data are limited, but studies suggested that people 
with kleptomania have first-degree relatives with high rates of 
mood disorders, obsessive-compulsive disorder, and substance use 
disorders [25]. 

Kleptomania may be associated with serotonergic and frontal lobe 
dysfunction, as evidenced by reduced [3H] paroxetine binding (a 
peripheral marker of serotonin function) found in a mixed group 
of 20 people with obsessive-compulsive-related disorders, including 
five with kleptomania [26]. In another study, ten women with klep- 
tomania were more likely than controls to have decreased white 
matter microstructural integrity in the inferior frontal brain regions 
when evaluated with diffusion tensor imaging [27]. 

While there are no standard treatments for kleptomania [25], 
various forms of cognitive behavioural therapy have been recom- 
mended. Case reports and open-label studies have reported on the 
use of SSRIs, topiramate, and naltrexone. Interestingly, there are also 
case reports of persons treated for depression with SSRIs who sub- 
sequently developed kleptomania [28]. In a randomized controlled 
trial (RCT) [29], naltrexone produced significant reductions in 
stealing urges and behaviours, compared with placebo. Common 
wisdom suggests that a self-imposed ban on shopping in an attempt 
to head off potential thefts may help curb stealing, but this may not 
be sustainable. 


Gambling disorder 


Recognized by both Kraepelin [10] and Bleuler [8], uncontrolled 
gambling was included in DSM-III [1] as pathological gambling 


within the chapter ‘Impulse-control disorders not elsewhere clas- 
sified. In DSM-5 [4], the disorder was moved to the chapter on 
‘Substance related and other addictive disorders, a change made be- 
cause of growing evidence of its relationship with alcohol and drug 
use disorders. The name was changed to gambling disorder (GD), in 
part, because of the stigma from the word ‘pathological’ [3]. ICD-10 
[6] includes pathological gambling, which is defined as frequent and 
repeated episodes of gambling that dominate the persons life to the 
detriment of their functioning. 

GD prevalence is estimated at 1.6% of the general population, but 
the prevalence of subclinical GD (‘at-risk’ gambling) is much higher 
at nearly 4% [30]. Gambling behaviour typically has an onset in ado- 
lescence, with GD developing by the late 20s for men and early 40s 
for women [31]. GD affects more men than women and progresses 
more rapidly in the latter, a phenomenon also observed in alcoholic 
persons [32]. Risk factors include a history of mental illness or sub- 
stance misuse, lower levels of education, and ethnic and racial mi- 
nority populations. Family and twin data show that GD is familial 
and has a heritable component shared with substance addictions. 
In a large family study involving over 1200 subjects, rates of lifetime 
pathological gambling were significantly greater among the first- 
degree relatives of people with pathological gambling (11%) than 
among control relatives (1%) [33]. 

Substance use, mood, and anxiety disorders are common in per- 
sons with GD [31]. Personality disorders are frequent, particularly 
the antisocial and borderline types. Compulsive shopping com- 
monly occurs in women with GD, and the disorders run in the same 
families [34]. 

The most widely discussed clinical distinction among gamblers is 
that of ‘escape-seekers’ and ‘sensation-seekers’ [35]. Escape-seekers 
are often older persons who gamble out of boredom, to alleviate de- 
pression, or to fill time. They tend to choose passive forms of gam- 
bling such as slot machines. Sensation-seekers are younger and 
prefer the excitement of card games or table games that involve ac- 
tive input. Another model posits the existence of three subgroups of 
GDs: (1) behaviourally conditioned gamblers without predisposing 
psychopathology but who make bad judgements regarding gam- 
bling; (2) emotionally vulnerable gamblers with premorbid de- 
pression or anxiety, and a history of poor coping; and (3) impulsive 
gamblers who are highly disturbed with features of antisocial per- 
sonality disorder [36]. 

GD is described in DSM-5 as chronic and progressive, though re- 
cent data have challenged the notion that GD is intractable. Instead, 
most GD subjects improve during follow-up, as evidenced by fewer 
gambling behaviours and less preoccupation [37]. 

Functional MRI and other technologies showed that the 
neurocircuitry mediating GD is similar to those seen in substance 
addictions [38]. The involvement of reward circuitry has also been 
strongly suggested by research showing that dopamine agonist 
medications for Parkinson's disease have led to the development 
or exacerbation of GD in some persons [39]. Dopamine is widely 
considered the neurotransmitter most involved in reward-based 
neurocircuitry. 

While few persons with GD seek treatment, cognitive behavioural 
therapy and motivational interviewing are effective [40]. Gamblers 
Anonymous (GA), a 12-step programme patterned after Alcoholics 
Anonymous, is often helpful and chapters are widely available. Self- 
exclusion programmes in which gamblers agree not to enter a casino 
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can be helpful. RCTs have shown naltrexone and nalmefene (not 
available in the United States) to be effective in reducing gambling 
behaviours and urges. SSRIs have also been studied, but RCTs have 
shown little effect [41]. 


Compulsive shopping 


Compulsive shopping (CS) has been recognized for over 100 years, 
including descriptions from Kraepelin [10] and Bleuler [8]. Despite 
this rich history, CS attracted little attention until interest was re- 
vived in the late 1980s and early 1990s by consumer behaviour 
researchers and clinicians interested in compulsive behaviours [42- 
44]. CS is not included in DSM-5 or ICD-10. 

McElroy et al. [43] have published diagnostic criteria that have be- 
come standard in the research community. They require the presence 
of cognitive and behavioural aspects of CS and impairment from 
both subjective distress and interference in social or occupational 
functioning or from financial or legal problems. The criteria require 
that the disorder does not co-occur with mania or hypomania. 

The prevalence of CS has been estimated at nearly 6% [45]. The 
disorder has shown a female preponderance in nearly all clinical 
and epidemiologic studies. CS has an onset in the late teens or early 
twenties that appears to correspond with emancipation from the nu- 
clear family and establishing credit [46]. CS occurs mainly in devel- 
oped countries, probably due to the availability of consumer goods 
and disposable income. Interestingly, after Germany reunified in 
1989, the prevalence rate of CS increased, presumably due to the 
influx of goods into the former East Germany, combined with in- 
creased income [47]. 

Psychiatric comorbidity with substance use, mood, anxiety, per- 
sonality, and eating disorders and other ICDs is common. Research 
suggests that CS is familial and co-aggregates with mood, anxiety, 
and substance use disorders [46]. CS has been considered chronic or 
episodic, but a recent study showed that CS behaviours diminished 
during a 5-year follow-up [48]. 

The hallmark of CS is a preoccupation with shopping and spending 
[46]. People with CS spend many hours each week engaged in shop- 
ping and spending behaviours, often preceded by increasing ten- 
sion or anxiety relieved with a purchase. Compulsive shoppers are 
mainly interested in consumer goods such as clothing, shoes, crafts, 
jewellery, and make-up. The impact of the Internet on CS behaviour 
is unclear but could be considerable. 

Neurobiologic theories of CS have focused on disturbed serotonin 
neurotransmission, because of hypothetical similarities between CS 
and obsessive-compulsive disorder, a disorder treated with SSRIs. 
Dopamine has been theorized to play a role in ‘reward dependence, 
which has been linked to behavioural addictions, including CS. 
Functional abnormalities in limbic regions and the prefrontal cortex 
have been hypothesized to account for the impulsivity and poor 
decision-making that characterize CS. Using fMRI, Raab et al. [49] 
reported on differences between 23 women with CS and 26 controls. 
They found greater nucleus accumbens activity during product pres- 
entation in women with CS, compared to those without, and lower 
insula activation during the presentation of prices for the products 
the CS women decided to purchase. These investigators concluded 
that the expected loss of money led to a stronger negative emotional 
response in healthy controls than in women with CS. 


There are no standard treatments for CS, but various forms of 
cognitive behavioural therapy have been shown to be effective [50]. 
Benson et al. [51] have developed a comprehensive programme 
combining cognitive behavioural therapy and elements of dialect- 
ical behaviour therapy. In an RCT, 11 subjects randomized to the 
group treatment had significantly greater improvement than wait 
list controls and the benefit was maintained at 6-month follow-up. 
Medication studies have had mixed success, with open-label trials 
showing positive results not confirmed in RCTs. SSRIs have been 
the most frequently used drugs in these trials. Other treatments are 
12-step programmes, financial counselling, and self-help books. 
Common sense suggests that people with CS should avoid carrying 
credit cards or shopping alone. 


Internet addiction 


Internet addiction is characterized by excessive and/or inappro- 
priate use of personal computers and other electronic devices, 
combined with personal distress or impairment in important life 
domains. Young [52] has proposed criteria patterned after those 
used to diagnose GD. She only counts non-essential computer/ 
Internet usage (for example, non-business or non-academic use), 
and Internet addiction is present when five or more symptom 
criteria are present during the past 6 months and mania has been 
ruled out. The concept of Internet addiction has been criticized 
because, unlike other behavioural addictions, the focus is on the 
medium, and not the behaviour, so that if an addiction exists, it 
likely pertains to the activity engaged in (for example, gaming, 
gambling, viewing pornography) [53]. The condition is not listed 
in DSM-5 or ICD-10. 

The prevalence of Internet addiction is unknown, in part because 
surveys have produced figures that range from 0.9% to 38% [54]. 
In a random telephone survey of adult Americans, Aboujaoude 
et al. [55] reported prevalence rates ranging from 0.3% to 0.7%. The 
widely varying figures suggest that more work is needed to develop 
uniform definitions of Internet addiction. Cases of Internet addic- 
tion have been reported in many countries, showing its universality. 

Data suggest that Internet addiction is more prevalent in males 
than females, perhaps because males are more likely to use the 
Internet for activities that may fuel addiction such as games, porn- 
ography, and gambling [54]. Age of onset is unknown, but Internet 
addiction has been reported in children as young as age 6 [55]. 

Psychiatric comorbidity is common, particularly for mood, anx- 
iety, and substance use disorders and other ICDs [54]. Using a di- 
mensional approach to assess the psychological status, increased use 
of the Internet was associated with higher ratings on measures of 
depression, loneliness, and social isolation. 

Neurobiological theories of Internet addiction focus on disturbed 
neurotransmission, particularly serotonin and dopamine [55], but 
there is no direct evidence to support the role of these or other 
neurotransmitter systems in Internet addiction. Pallanti et al. [56] 
hypothesized that immaturity of the frontal cortical and subcortical 
monoaminergic system during normal neurodevelopment under- 
lies adolescent impulsivity and perhaps Internet addiction. One 
study employing diffusion tensor imaging concluded that Internet 
addiction was associated with widespread reductions in fractional 
anisotropy in major white matter pathways that the authors believed 
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may be linked to the abnormal behaviours exhibited by their 
subjects [57]. 

There are no standard treatments for Internet addiction [55, 58]. 
Young [58] has developed a guide for therapists working with 
Internet addicts, employing cognitive behavioural methods. SSRIs 
have been used to treat Internet addiction [58], but there have been 
no RCTs. Because Internet affects many youth, family therapy has 
been recommended. Support groups are available in some areas and 
are available online as well. 


Compulsive sexual behaviour 


CSB involves excessive preoccupation and/or behaviour that causes 
distress or impairs one’s functioning. The concept of a hypersexual 
disorder dates back to the work of the German psychiatrist Krafft- 
Ebing [59]; yet the concept of a sexual addiction was first recognized 
by Orford [60]. Diagnostic criteria have been proposed that incorp- 
orate the concepts of inappropriate or excessive sexual cognitions or 
behaviours, subjective distress, and impaired functioning [61]. CSB 
is not listed in DSM-5 or ICD-10. 

CSB encompasses various problematic sexual behaviours that 
can roughly be divided into paraphilic and non-paraphilic sub- 
types [61]. The former involves pathological sexual behaviours (for 
example, exhibitionism, voyeurism), while the latter involves con- 
ventional sexual behaviours taken to extremes (for example, com- 
pulsive masturbation, promiscuity, pornography dependence). 
While hypersexual behaviour is the hallmark of CSB, it can be found 
in mania, substance use disorders, and neurocognitive disorders, all 
of which need ruling out. 

The prevalence of CSB is estimated to range from 3% to 6% of the 
general adult population in the United States [61]. Kafka [62] has 
suggested that data on total sexual outlet (TSO) (or total number 
of orgasms achieved through any means during a designated week) 
may more accurately reflect CSB prevalence, at least in men. He con- 
cluded that >7 weekly orgasms over 6 consecutive months could be 
used to define hypersexual behaviour, a figure that corresponds to 
3-15% of the adult male population in the United States. Grant et al. 
[21] reported a 4.9% lifetime prevalence for CSB in adult psychiatric 
inpatients. Whether high-frequency sexual behaviour is inherently 
pathological has been questioned. In one study, high-frequency 
sexual behaviour with a stable partner indicated better psychological 
functioning [63]. 

CSB is primarily a disorder of men, and there may be gender- 
specific differences in the way CSB manifests [61]. Men are more 
likely to report compulsive masturbation, to engage in paraphilias, to 
pay for sex, or to engage in anonymous sex. Women are more likely 
to engage in fantasy sex (for example, seductive behaviour leading 
to multiple affairs/relationships) or sadomasochism, to use sex as a 
business, or to see themselves as ‘love addicts: CSB has an onset in 
adolescence, with paraphilic behaviours frequently occurring earlier 
than non-paraphilic behaviours [61]. CSB appears to be chronic, 
though waxing and waning in frequency and severity. Psychiatric 
comorbidity is common in persons with CSB, particularly mood, 
anxiety, and substance use disorders. Some experts have suggested 
that CSB begins with childhood sexual abuse, but not all persons 
with CSB have experienced maltreatment [64]. 


Typical symptoms include preoccupation with one’s sexual urges 
and fantasies and/or being overly sexually active, often spending 
considerable time in pursuit of sexual experiences [64]. Many will 
report feeling out control and being subjectively distressed by their 
sexual thoughts or urges. The behaviour eventually causes impair- 
ment in important life domains such as affecting their marriage or 
significant relationships or one’s work or school life (for example, 
through intrusive thoughts or from frequent lateness). Persons with 
CBS often try unsuccessfully to resist sexual thoughts and urges. 

In terms of neurobiology, Kafka [65] has focused on the possible 
contribution of disturbed neurotransmission, noting that noradren- 
aline, serotonin, and dopamine all serve to modulate sexual behav- 
iour and other dimensions of human and animal pathophysiology. 
In a functional MRI study [66], sexual-cue reactivity led to greater 
activation in brain regions linked to drug craving and emotion pro- 
cessing (ventral striatum, amygdala, and dorsal anterior cingulate 
cortex) in 19 persons with CSB, compared to 19 controls. The au- 
thors suggested that CBS shares neural mechanisms with addictive 
disorders. 

There are no standard treatments for CSB. Many forms of psy- 
chotherapy have been recommended, including imaginal desensi- 
tization, aversion therapy, group therapy, and psychodynamic and 
cognitive behavioural therapies [67]. Twelve-step programmes are 
available in some areas and can be helpful. SSRIs have been used in 
one controlled study and several open studies, leading to a reduc- 
tion in sexual preoccupations and behaviours [67]. Case reports and 
small case series have also suggested that nefazodone, naltrexone, 
and anti-androgens may be effective in treating CSB. A placebo- 
controlled trial of the gonadotrophin-releasing hormone analogue 
triptorelin in men with severe paraphilias suggested that it is ef- 
fective [68]. A meta-analysis of 118 patients suggested that lutein- 
izing hormone-releasing agonists were effective in treating patients 
with severe paraphilias [69]. Anti-androgens should probably be 
reserved for men with aggressive and/or dangerous forms of CSB. 
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The term conduct disorder refers to a persistent pattern of anti- 
social behaviour in which the individual repeatedly breaks social 
rules and carries out aggressive acts. It is the most common psychi- 
atric disorder of childhood across the world. Antisocial behaviour 
has the highest continuity into adulthood of all measured human 
traits, except intelligence. A high proportion of affected children and 
adolescents grow up to be antisocial adults with impoverished and 
destructive lifestyles; some will develop antisocial personality dis- 
order. The disorder places a large personal and economic burden 
on individuals and society. Juvenile delinquency is a legal term, re- 
ferring to an act by a young person that breaks the law. Most re- 
current juvenile offenders have conduct disorder. In this chapter, 
the term ODD/CD is used to denote oppositional defiant disorder 
(ODD) and conduct disorder (CD) together, since although they are 
essentially the same underlying condition of persistent antisocial be- 
haviour expressed at different ages, in the Diagnostic and Statistical 
Manual of Mental Disorders, fifth edition (DSM 5), unlike in the 
International Classification of Diseases, tenth revision (ICD-10), they 
are separated. The term conduct problems will be used for less severe 
antisocial behaviour; when the term child is used, it refers to both 
children and adolescents, for brevity. 








Aggressive and defiant behaviour is an important part of normal 
child and adolescent development which ensures physical and social 
survival. Indeed, parents may express concern if a child is too ac- 
quiescent and unassertive. The level varies considerably among chil- 
dren, and it is a continuously distributed trait. Picking a particular 
level of antisocial behaviour to call ODD/CD is therefore necessarily 
arbitrary. For all children, the expression of any particular behaviour 
also varies according to child age; for example, physical hitting is at 
a maximum at around 2 years of age but declines over the next few 





years. Therefore, any judgement about the significance of the level 
of antisocial behaviour has to be made in the context of the child’s 
age. Before deciding that the behaviour is abnormal, other clinical 
features have to be considered: 


e Level: severity and frequency of antisocial acts, compared with 
children of the same age and gender. 

e Pattern: the variety of antisocial acts and the setting in which they 
are carried out. 

e Persistence: duration over time. 

e Impact: distress and social impairment of the child; disruption 
and damage caused to others. 


Change in clinical features with age 


The type of behaviour seen will depend on the age and gender of the 
individual. 

Younger children, from 3 to 7 years of age, usually present with 
general defiance of adults’ wishes, disobedience of instructions, 
angry outbursts with temper tantrums, physical aggression to 
people, especially siblings and peers, destruction of property, 
arguing, blaming others for things that have gone wrong, and a 
tendency to annoy and provoke others. 

In middle childhood, from 8 to 11 years, these features are often pre- 
sent, but as the child grows older and stronger and spends more time 
out of the home, other behaviours are seen. They include: swearing, 
lying, stealing of others’ belongings outside the home, persistent 
breaking of rules, physical fights, and bullying of other children. 

In adolescence, from 12 to 17 years, more antisocial behaviours 
are often added: cruelty and hurting of other people, assault, rob- 
bery using force, vandalism, breaking and entering houses, stealing 
from cars, driving and taking away cars without permission, run- 
ning away from home, truanting from school, and misuse of drugs. 


Girls 


Severe antisocial behaviour is less common in girls who are less likely 
to be physically aggressive and engage in criminal behaviour, but more 
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likely to show spitefulness, emotional bullying (such as excluding chil- 
dren from groups, spreading rumours so others are rejected by their 
peers), frequent unprotected sex leading to sexually transmitted dis- 
eases and pregnancy, drug abuse, and running away from home. 


Classification 


The ICD-10 classification has a category for CDs—F91. The ICD- 
10 Classification of Mental and Behavioural Disorders: Clinical 
Descriptions and Diagnostic Guidelines [1] states that any one of the 
behaviours on which a diagnosis is based (for example, excessive 
levels of fighting or bullying, cruelty to animals), if marked, is suffi- 
cient for diagnosis, but isolated acts are not. An enduring pattern of 
behaviour should be present, but no time frame is given and there is 
no impairment or impact criterion stated. 

ICD-10 has conduct disorder as an overarching term, which can 
be divided into two subtypes—ODD and CD, thus making it closely 
compatible with DSM-5 [2] which treats ODD and CD as com- 
pletely separate conditions. There is considerable debate about the 
validity of this division, with many authorities considering it to be a 
unitary disorder, with different phenomena chiefly due to different 
ages of presentation. In contrast to the clinical guidelines, the ICD- 
10 Classification of Mental and Behavioural Disorders: Diagnostic 
Criteria for Research [3] takes a menu-driven approach that is vir- 
tually identical to DSM-5. In line with the underlying philosophy of 
DSM-5, ODD and CD could theoretically be used at any age, but in 
practice, ODD is usually used for younger children and CD for ado- 
lescents up to the age of 18. Neither term tends to be used in adults. 

To diagnose ODD (313.81/F91.3), there should be four of the fol- 
lowing eight symptoms over at least 6 months: 


Angry/irritable mood 


1. or his or her 


developmental level. 


23 
3. 





Argumentative/defiant behaviour 
4. 
5. 


6. Often deliberately does things that annoy other people. 
T 








Vindictiveness 


8. 





The behavioural problems should be associated with distress in 
the individual or others around them, or affect social, educational, 
or other areas of function. 


For the diagnosis of CD, three of the following 15 behaviours 
should have occurred in the last 6 months: 
Aggression to people and animals 
x eo ey aaa 
2 
3. Has used a weapon that can cause serious physical harm to 
others (for example, knife, gun). 


4. Exhibits physical cruelty to people. 

5. Exhibits physical cruelty to animals. 

6. Has stolen with confrontation of victim (including purse- 
snatching, extortion, mugging). 

7. Has forced someone into sexual activity. 


Destruction of property 


8. Deliberately sets fires with a risk or intention of causing serious 
damage. 
9. Deliberately destroys the property of others. 


Deceitfulness or theft 


10. Has broken into someone else’s house, building, or car. 


11, Often lies to obtain goods or favours to avoid obligations (that 


is, ‘cons others). 






12. the 


victim (for example, shoplifting, burglary). 
Serious violations of rules 


13. Often stays out after dark despite parenting prohibition, begin- 
ning before age 13 years. 





14. 
15. Is frequently truant from school, beginning before age 13 years. 


In DSM-5, there should be significant impairment in social, 
academic, or occupational functioning, whereas in ICD-10, there 
is no impairment criterion. Age of onset should be specified, 
with childhood-onset type (312.81) manifesting before age 10 and 
adolescent-onset type (312.89) after. Severity should be categorized 
as mild, moderate, or severe, according to the number of symptoms 
or impact on others. 

New in DSM-5 is recognition of the subtype characterized by 
callous-unemotional traits (discussed further under ‘Aetiology’ 
p. 1268) called which is effectively 
a personality trait. Two of the following four features should be per- 
sistently and pervasively present: 


e 
a unconcerned about the negative consequences 


of their actions, for example no remorse after hurting someone or 
harmful effect of breaking rules. 


e 
The person appears more 


concerned about themselves, even when their actions result in 
substantial harm to others. 










Puts in insufficient effort needed to perform 
well, typically blames others for poor performance. 


(for example, emotions displayed 
to manipulate or intimidate others). 


Where there are sufficient symptoms of a comorbid disorder 
to meet diagnostic criteria, the ICD-10 system discourages the 
application of a second diagnosis and instead offers two com- 
bined categories: mixed disorders of conduct and emotions where 
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Depressive Conduct Disorder (F92.0) is the best researched; and 
Hyperkinetic Conduct Disorder (F90.1). There is good evidence 
to suggest these combined conditions differ in the longitudinal 
course from the pure form, in a manner that would be expected. 
Comorbidity in DSM-5 is handled by giving as many separate diag- 
noses as necessary. 





Differential diagnosis 


Making a diagnosis of CD is usually straightforward, but comorbid 
conditions are often missed. The differential diagnosis may include: 


1. Hyperkinetic syndrome/attention-deficit/hyperactivity disorder 
(ADHD). These are the names given by ICD-10 and DSM-5, re- 
spectively, for similar conditions, except that the former is more 
severe. For convenience, the term ADHD will be used here. It is 


characterized by impulsivity, inattention, and motor overactivity. 
Any ft tess spans is also present in ODD/CD. 


However, none of the antisocial symptoms of ODD/CD are a part 
of ADHD, so excluding CD should not be difficult. A frequently 
made error, however, is to miss comorbid ADHD. Standardized 
questionnaires are very helpful here, such as the Conners or the 
Strengths and Difficulties Questionnaire, which is brief and just as 
effective at detecting hyperactivity as much longer alternatives [4]. 

2. Adjustment reaction to an external stressor. This can be diag- 
nosed when onset occurs soon after exposure to an identifiable 
psychosocial stressor such as parental divorce or bereavement, 
trauma, abuse, or adoption. The onset should be within 1 month 
for ICD-10, and 3 months for DSM-5, and symptoms should not 
persist for >6 months. 


3. Mood disorders. 
but unlike typical ODD/CD, mood is 


usually clearly low and there are vegetative features; more severe 
conduct problems are absent. It is common—around a third of 
adolescents with ODD/CD have depressive or other emotional 
symptoms, severe enough to warrant a diagnosis. 






arly manic/bipolar depressive disorder 
can be harder to distinguish, as there is often considerable de- 
fiance and irritability, combined with disregard for rules and 
behaviour that violates the rights of others, but there should be 
clear evidence of elevated mood, racing thoughts, etc. [5]. 


A new disorder appeared in DSM-5, called Disruptive Mood 
Dysregulation Disorder (296.99). This condition does not have 
a solid research underpinning and has a comorbidity rate ap- 
proaching 90% with ODD, so its validity as a separate entity is 
questionable. The motivation for introducing the new condition 
was to stop some psychiatrists misdiagnosing ODD with prom- 
inent temper tantrums in young children as juvenile bipolar dis- 
order and using potentially harmful medications. 


4. Autistic spectrum disorders. These are often accompanied by 
marked tantrums or destructiveness, which may be the reason 
for seeking a referral. Enquiring about other symptoms of aut- 
istic spectrum disorders should reveal their presence. 









5. Subcultural deviance. Some youths are antisocial and commit 
crimes but are not particularly aggressive or defiant. They are 
well adjusted within a deviant peer culture that approves of rec- 
reational drug use, shoplifting, etc. In some localities, a quarter 
or more of teenage males fit this description and would meet 
ICD-10 diagnostic guidelines for socialized CD. Some clinicians 
are unhappy to label such a large proportion of the population 
with a psychiatric disorder. Using DSM-5 criteria would pre- 
clude the diagnosis for most youths like this, due to the require- 
ment for significant impairment. 


Multi-axial assessment 


ICD-10 recommends that multi-axial assessment be carried out for 
children and adolescents. A benefit of having a multi-axial system 
is that at least it forces the clinician to consider aspects beyond the 
presenting symptoms. In ICD-10, Axis I is used for psychiatric dis- 
orders; Axis II and Axis III cover specific and general intellectual 
disabilities, respectively; and Axis IV, Axis V, and Axis VI cover 
general medical conditions, psychosocial problems, and the level of 
social functioning, respectively. DSM-5 has abolished the first three 
axes as separate entities but keeps psychosocial problems and level 
of social functioning. 

Both specific and general learning disabilities need to be assessed 
in individuals with conduct problems. Over a quarter of children 
with ODD/CD also have specific reading impairment/dyslexia [6], 
defined as a reading level two standard deviations below that pre- 
dicted by the person’s intelligence quotient (IQ). This is not just 
due to a lack of adequate schooling; there is good evidence that the 
cognitive deficits often precede the behavioural problems. The rate 
of ODD/CD rises several-fold as the IQ gets below 70, and general 
intellectual disability is often missed in children, so if school per- 
formance is markedly behind, IQ and attainment testing should be 
carried out. 


Epidemiology 


Most surveys find ODD + CD to have a prevalence of around 5-7% 
when an impairment criterion is applied [7, 8]; a meta-analysis of 
epidemiological studies estimated that the worldwide prevalence of 
CD alone among children and adolescents aged 6-18 years is 3.2%, 
with little variation across countries or continents [9]. A modest rise 
in diagnosable ODD/CD over the second half of the twentieth cen- 
tury has also been observed, comparing assessments of three suc- 
cessive birth cohorts in Britain [10]. There is a very marked social 
class gradient, greater than for almost any other disorder, with five 
times the prevalence in the lowest, compared to the highest, socio- 
economic group [8]. With respect to ethnicity, youth self-reports of 
symptoms of CD and crime victim survey reports of perpetrators’ 
ethnicity (both of which are less likely to be influenced by racial 
prejudice than parent or teacher reports of CD symptoms or po- 
lice arrests) show an excess of offenders of black African ancestry, 
whereas Hispanic Americans in the United States and British Asians 
in the UK do not show an excess of offending, compared to their 
white counterparts. The sex ratio is approximately equal with re- 
spect to ODD in younger children but then rises to 2- to 3-fold more 
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in males than females after around 7 years. The causes of conduct 
problems appear to be the same, but males have more ODD/CD be- 
cause they experience a greater number and intensity of individual- 
level risk factors (for example, hyperactivity, neurodevelopmental 
delays). 


Developmental subtypes 


Life course persistent versus adolescence onset 


DSM-5 distinguishes between conduct problems that are first seen 
in early childhood versus those that start in adolescence. Early onset 
is a strong predictor of persistence through childhood and into adult 
life (Fig. 124.1) [11]. Those with early onset differ from those with 
later onset in that they have lower IQ, more attention and impulsivity 
problems, poorer scores on neuropsychological tests, greater peer 
difficulties, and they are more likely to come from adverse family 
circumstances [12]. Those with later onset become delinquent pre- 
dominantly as a result of social influences such as association with 
other delinquent youths; they have fewer neurodevelopmental risk 
factors, but they are not absent [12]. Longitudinal studies support 
poorer adult outcomes for the early-onset group, but the doom- 
laden prognosis of 20 years ago has been moderated by the discovery 
of an early-onset group that subsequently desist, called childhood 
limited. Distinguishing between this group and the life course- 
persistent group has proved difficult, although a family history of 
criminal behaviour predicts persistence [13]. While many of the 
adolescence-onset group still engage in offending as adults and often 
have problems with alcohol and drugs, the level is lower than in the 
early-onset group. 


Progression to antisocial/dissocial personality disorder 


In both ICD-10 and DSM-5, a person should be aged 18 or older be- 
fore a personality disorder can be diagnosed, but it is a requirement 
that they should have a history of CD from before age 15. The diag- 
nostic criterion is generous, since only one of the following seven 
symptoms is needed: 


Antisocial behaviour level 








5 10 15 20 25 30 35 
Age in years 





Not antisocial (70%) = Childhood limited (7%) 
Adolescent onset (15%) =< Early onset persistent (8%) 


Fig. 124.1 Subtypes of antisocial behaviour by longitudinal course (data 
synthesis from several longitudinal studies). 


Reproduced from Scott S, Gardner F, Parenting programs. In: Thapar A, Pine D, 
Leckman J, et al. [Eds.], Rutter’s Child and Adolescent Psychiatry, 6th ed., pp. 483-495, 
Copyright (2015), with permission from John Wiley and Sons. 





1. Failure to obey laws and norms by engaging in behaviour war- 
ranting criminal arrest. 

2. Lying, deception, and manipulation for profit or self-amusement. 

3. Impulsive behaviour. 

4. Irritability and aggression, manifested as frequent assaults or 
fighting. 

5. Blatantly disregards safety of self and others. 

6. A pattern of irresponsibility. 

7. Lack of remorse for actions. 


Perhaps not surprisingly, given these wide criteria, prevalence 
rates vary from 0.2% to 3.3%; using clinical impairment criteria, the 
prevalence is around 1% [2]. Antisocial personality disorder is a nat- 
ural progression over time in children with CD who display the sub- 
type with limited prosocial emotions. They have difficulty learning 
from mistakes, are rigid in decision-making, and are typically unre- 
sponsive to punishment [14]. 


Aetiology 


The causes of ODD/CD vary considerably according to subtype and 
are multifactorial. After consideration of implicated factors, a syn- 
thesis model will be offered. 


Individual-level characteristics 
Genetic contribution 


While in former times, ODD/CD was reckoned to arise from bad 
socialization and a lack of self-control, it is now evident that overall, 
genetic factors account for around 60% of the variance [15]. Within 
this overall figure, for some subtypes, the genetic contribution is 
even stronger. Twin studies showed both substantial genetic influ- 
ences and also a substantial shared environmental component [16]. 
The genetic contribution is higher when inattention and hyper- 
activity are present [17], and extremely high when there are callous- 
unemotional traits, when the heritability typically reaches 80% [18]. 
Conversely, where these factors are absent, the genetic contribution 
can beas low as 30% [18]. 


Identified genotypes 


Genome-wide approaches looking for main effects have been 
disappointing so far for ODD/CD, typically accounting for at most 
2% of the variance; no consistent linkage regions have been identified 
[19, 20]. The most studied candidate gene in relation to conduct prob- 
lems is the MAOA promoter polymorphism. The gene encodes the 
MAOA enzyme, which metabolizes neurotransmitters linked to ag- 
gressive behaviour. Replicated studies showed that maltreatment his- 
tory and genotype interact to predict antisocial outcome [21]. Recently, 
the impact of environmental influences on epigenetic processes has 
become prominent whereby identifiable chemical changes, such as 
acetylation and methylation of genes, are affected. For example, a study 
has shown differentially increased methylation in adolescents whose 
parents reported they were exposed to stress as infants [22]. 


Perinatal complications and temperament 


Recent large-scale general population studies have found asso- 
ciations between life course-persistent type conduct problems 
and perinatal complications, minor physical anomalies, and 
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low birthweight. Most studies support a biopsychosocial model 
in which obstetric complications might confer vulnerability to 
other co-occurring risks such as hostile or inconsistent parenting. 
Smoking in pregnancy is a statistical risk predictor of offspring con- 
duct problems. Several prospective studies have shown associations 
between irritable temperament and conduct problems. 


Brain structure and function 


Whereas ADHD has been recognized as having specific brain 
differences for over 20 years, it is only in the last decade that it 
has become clear that ODD/CD has its own and different set of 
brain anomalies. These are in terms of both structure, with re- 
duced white matter connectivity with default mode regions that 
subsume cognitive functions about understanding self [23], and 
function where a recent meta-analysis of 24 studies found con- 
sistent differences in the rostro-dorsomedial, fronto-cingulate, 
and ventral-striatal regions that mediate reward-based decision- 
making, which is typically compromised in ODD/CD. Youths with 
callous-unemotional traits, on the other hand, had dysfunctions 
associated with the ventromedial prefrontal cortex and limbic 
system, together with dorsal and fronto-striatal hyperfunctioning, 
which may reflect poor affect reactivity and empathy in the pres- 
ence of hyperactive executive control. In particular, these youths 
have a hypofunctioning amygdala, which is consistent with their 
fearlessness [24]. 


Autonomic reactivity and intellectual and information 
processing deficits 


A slow heart rate has been consistently associated with ODD/ 
CD [25]. Indicators of heart rate variability and skin conductance 
confirm that the callous-unemotional subtype tend to be hypo- 
aroused, whereas the subtype with mood/internalizing problems is 
hyperaroused [25]. 

Children with ODD/CD have increased rates of deficits in 
language-based verbal skills. The association holds after con- 
trolling for potential confounders such as race, socio-economic 
status, academic attainment, and test motivation. Children who 
cannot reason or assert themselves verbally may attempt to gain 
control of social exchanges using aggression; there are likely also 
to be indirect effects in which low verbal IQ contributes to aca- 
demic difficulties, which, in turn, mean that the child’s experi- 
ence of school becomes unrewarding. Children with ODD/CD 
also have poor executive functions such as skills in learning and 
applying contingency rules, abstract reasoning, problem-solving, 
self-monitoring, sustained attention and concentration, relating 
previous actions to future goals, and inhibiting inappropriate re- 
sponses. These functions are largely, although not exclusively, as- 
sociated with frontal lobe function. A meta-analysis with nearly 
15,000 participants found that individuals with conduct problems 
had poorer executive functioning by an effect size of 0.54 standard 
deviations [26]. 

Dodge [27] proposed the leading information-processing 
model for the genesis of aggressive behaviours within social inter- 
actions. The model has good evidence that children prone to ag- 
gression focus on threatening aspects of others’ actions, interpret 
neutral actions as hostile, and are more likely to favour aggressive 
solutions. 


Risks outside the family 


Risks in the neighbourhood 


It has long been assumed that bad neighbourhoods promote youth 
antisocial behaviour. It is difficult to make direct links between 
neighbourhood characteristics and child behaviour, since several 
risk factors often coexist such as demographic factors like per- 
centage of ethnic minority residents or single-parent households, 
unemployment levels, and parental mental illness. Many neighbour- 
hood influences are mediated by supportive parenting, notably high 
warmth and close monitoring [28]. 


Peer influences 


Children with conduct problems have poorer peer relationships 
than non-disordered children. It often starts off with failure to have 
the social skills to befriend more socially successful children, so that 
they then tend to associate with other antisocial children and, in 
turn, take pleasure in being defiant and breaking rules. In adoles- 
cence, a high proportion of delinquent acts are committed in the 
company of other antisocial youths [29]. 





Risks within the family 


Concentration of crime in families 


Fewer than 10% of families in any community account for >50% of its 
criminal offences, reflecting the co-occurrence of genetic and envir- 
onmental risks. However, knowing that conduct problems are under 
some genetic influence is less useful clinically than knowing that this 
genetic influence appears to be reduced or enhanced, depending on 
the quality of the child’s environment. Several genetically sensitive 
studies have allowed interactions between family genetic liability 
and rearing environment to be examined. Thus, for example, genetic 
risk, indexed by having a criminal or an alcoholic parent, leads to 
tripling or so of ODD/CD in their children, compared to controls. 
However, if the upbringing is benign, this is reduced, whereas if it 
is harsh, the rate can be 8- or 10-fold greater. In short, genetic vul- 
nerability provides a form of susceptibility to adverse child-rearing 
conditions [30]. 


Family poverty 


There is an association between severe poverty and early childhood 
conduct problems. Early theories proposed direct effects of poverty. 
Subsequent research has indicated that the association between low 
income and childhood conduct problems is indirect, mediated via 
family processes such as marital discord and parenting deficits. 


Parent-child attachment 


Parent-child relationships provide the setting for the development 
of later social functioning, and disruption of these attachment rela- 
tionships, for example through institutional care, is associated with 
subsequent difficulties in relating. One study found that ambivalent 
and controlling attachment predicted externalizing behaviours after 
controlling for baseline externalizing problems [31]; disorganized 
child attachment patterns seem to be especially associated with con- 
duct problems. Although it seems obvious that poor parent-child 
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relationships in general predict conduct problems, it has yet to be 
established whether attachment difficulties have an independent 
causal role in the development of behaviour problems or if classifi- 
cations are markers for poor parenting. 


Discipline and parenting 


Patterns of parenting associated with conduct problems were delin- 
eated by Patterson [32] in his seminal work Coercive Family Process. 
Parents of antisocial children were found to be more inconsistent in 
their use of rules, to issue less clear and more commands, to be more 
likely to respond to their children on the basis of their own mood, 
rather than the characteristics of the child’s behaviour, to be less 
likely to monitor their children’s whereabouts, and to fail to acknow- 
ledge or reward their children’s prosocial behaviour. Patterson pro- 
vided observational evidence for negative coercive cycles worsening 
antisocial behaviour as follows. A parent responds to mild child 
oppositional behaviour with a prohibition; the child responds by 
escalating his or her behaviour; the parent, in turn, gets angry and 
mutual escalation continues until exasperated and tired, the parent 
backs off. This has the inadvertent effect of teaching the child that if 
they argue and swear and are defiant, they get their way, thus reinfor- 
cing the child’s behaviour and making it more likely that the child 
will be antisocial next time. There is ample evidence that conduct 
problems are associated with hostile, critical, punitive, and coercive 
parenting [33]. 

In considering the role of coercive processes in the origins or 
maintenance of conduct problems, we need to consider possible 
alternative explanations: (1) that the associations reflect familial 
genetic liability towards children’s psychopathology and parents’ co- 
ercive discipline, so they would happen whether or not they lived 
together; (2) that they represent evocative effects of children’s behav- 
iours on parents; and (3) that coercive parenting may be a correlate 
of other features of the relationship that influence children’s behav- 
iours, for example noisy living conditions that make them both more 
irritable. There is considerable evidence that children’s difficult be- 
haviours do indeed evoke parental negativity. The fact that children’s 
behaviours can evoke negative parenting does not, however, mean 
that negative parenting has no impact on children’s behaviour. The 
E-Risk longitudinal twin study of British families examined the ef- 
fects of fathers’ parenting on young children’s aggression [34]. As 
expected, a prosocial father’s absence predicted more aggression by 
his children, but the presence of an antisocial father predicted even 
greater aggression and his harmful effect was exacerbated the more 
times each week he spent taking care of the children. 


Exposure to adult marital conflict and domestic violence 


Family processes other than parenting skills and the quality of 
parent-child attachment relationships have a role. Many studies 
have shown that children exposed to domestic violence between 
adults are subsequently more likely to themselves become aggres- 
sive. Marital conflict influences children’s behaviour because of 
its effect on their regulation of emotion. For example, a child may 
respond to frightening emotion arising from marital conflict by 
downregulating his or her own emotion through denial of the situ- 
ation. This, in turn, may lead to inaccurate appraisal of other social 
situations and ineffective problem-solving. Repeated exposure to 
family conflict is thought to lower childrens’ thresholds for psycho- 
logical dysregulation, resulting in greater behavioural reactivity to 


stress. Children’s aggression may also be increased by marital dis- 
cord because children are likely to imitate aggressive behaviour 
modelled by their parents [35]. 


Maltreatment 


In the Christchurch longitudinal study, child sexual abuse predicted 
conduct problems, after controlling for other childhood adversities 
[36]. Overall, associations between physical punishment and con- 
duct problems are well established; however, links are not straight- 
forward. The risk for conduct problems does not apply equally to 
all forms of physical punishment. The E-Risk longitudinal twin 
study was able to compare the effects of corporal punishment 
(smacking, spanking) versus injurious physical maltreatment, using 
twin-specific reports of both experiences [37]. Results showed that 
children’s genetic endowment accounted for virtually all of the asso- 
ciation between their corporal punishment and their conduct prob- 
lems. This indicated a ‘child effect, in which children’s bad conduct 
provokes their parents to use more corporal punishment, rather 
than the reverse. Findings for injurious physical maltreatment were 
the opposite; significant effects of maltreatment on child aggression 
remained after controlling for any genetic transmission of liability to 
aggression from antisocial parents. 


From risk predictor to causation 


Associations have been documented between conduct problems and 
a wide range of risk factors. A variable is called a ‘risk factor’ if it has a 
documented predictive relation with antisocial outcomes, whether or 
not the association is causal. The causal status of most of these risk fac- 
tors is unknown; we know what statistically predicts conduct problem 
outcomes, but not how or why. Establishing a causal role for a risk 
factor is by no means straightforward, particularly as it is unethical to 
experimentally expose healthy children to risk factors. The use of gen- 
etically sensitive designs and the study of within-individual change in 
natural experiments and treatment studies have considerable meth- 
odological advantages for suggesting causal influences. 


Prognosis 


Many of the risk factors which predict poor outcome are associated 
with early onset (Table 124.1). 


Table 124.1 Factors predicting poor outcome 


Onset Early onset of severe problems, before age 8 

Phenomenology Antisocial acts which are severe, frequent, and 
varied 

Comorbidity Hyperactivity and attention problems 

Intelligence Lower IQ 

Family history Parental criminality, parental alcoholism 

Parenting Harsh and inconsistent parenting, with high 


criticism, low warmth, low involvement, and low 
supervision 





Wider environment Low-income family in poor neighbourhood with 


ineffective schools 
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Protective factors 


To detect protective factors, children who do well, despite adverse 
risk factors, have been studied. 

These so-called ‘resilient’ children, however, have been shown to 
have lower levels of risk factors, for example a boy with antisocial be- 
haviour and low IQ living in a rough neighbourhood, but living with 
supportive, concerned parents. Protective factors are mostly the 
opposite end of the spectrum of the same risk factor. Nonetheless, 
there are factors which are associated with resilience, independent 
of known adverse influences. These include a good relationship with 
at least one adult who does not necessarily have to be the parent, a 
sense of pride and self-esteem, and skills or competencies. 


Adult outcome 


Studies of groups of children with early-onset CD indicated a wide 
range of problems not only confined to antisocial acts, as shown in 
Table 124.2. 

What is clear is that not only are there substantially increased rates 
of antisocial acts, but also the general psychosocial functioning of 
children with CD when they have grown up is strikingly poor. For 
most of the characteristics shown in Table 124.2, the increase, com- 
pared to controls, is at least double for community cases who were 
never referred, and 3-4 times for referred children [27]. 


Pathways 


The path from childhood CD to poor adult outcome is neither inev- 
itable nor linear. 

Different sets of influences impinge and shape the life course as 
the individual grows up. Many of these can accentuate problems. 
Thus, a toddler with an irritable temperament and short attention 


Table 124.2 Adult outcome 


More violent and non-violent crimes, for 
example mugging, grievous bodily harm, theft, 
car crimes, fraud 


Antisocial behaviour 


Psychiatric problems Increased rates of antisocial personality, alcohol 
and drug abuse, anxiety, depression and somatic 
complaints, episodes of deliberate self-harm and 


completed suicide, time in psychiatric hospitals 


Education and training Poorer examination results, more truancy 
and early school leaving, fewer vocational 


qualifications 


Work ore unemployment, jobs held for shorter 
ime, jobs lower in status and income, increased 


claiming of benefits and welfare 


Social network Few, if any, significant friends, low involvement 
with relatives, neighbours, clubs, and 


organizations 


Intimate relationships ncreased rate of short-lived, violent cohabiting 


relationships, partners often also antisocial 





Children ncreased rates of child abuse, conduct problems 
in offspring, children taken into care 
Health ore medical problems, earlier death 


span may not learn good social skills if he or she is raised in a family 
lacking them and where he or she can only get his or her way by be- 
having antisocially and grasping for what he or she needs. At school, 
he or she may fall in with a deviant crowd of peers where violence 
and other antisocial acts are talked up and give him a sense of esteem. 
His or her generally poor academic ability and difficult behaviour in 
class may lead him to truancy increasingly, which, in turn, makes 
him fall further behind. He or she may then leave school with no 
qualifications and fail to find a job, and resort to drugs. To fund his 
or her drug habit, he or she may resort to crime and, once convicted, 
find it even harder to get a job. From this example, it can be seen that 
adverse experiences do not only arise passively and independently 
of the young person's behaviour; rather, the behaviour predisposes 
them to end up in risky and damaging environments. Consequently, 
the number of adverse life events experienced is greatly increased 
[38]. The path from early hyperactivity into later CD is also not in- 
evitable. In the presence of a warm, supportive family atmosphere, 
it is far less likely than if the parents are highly critical and hostile. 

Other influences can, however, steer the individual away from 
an antisocial path. For example, the fascinating follow-up of delin- 
quent boys to age 70 by Laub and Sampson [39] showed that the fol- 
lowing led to desistence: being separated from a deviant peer group; 
marrying to a non-deviant partner; moving away from a poor neigh- 
bourhood; and military service which imparted skills. 


Evidence-based treatments 


Proven treatments include those which singly or in combination ad- 
dress: (1) parenting skills; (2) family functioning; (3) child interper- 
sonal skills; (4) difficulties at school; (5) peer group influences; and 
(6) medication for coexisting hyperactivity. 


1. Parenting skills. 


Parent management training aims to improve parenting skills. 
There are scores of randomized controlled trials showing that it is 
effective for children aged up to about 10 years. They address the 
parenting practices identified in research as contributing to conduct 
problems. A more detailed account is given by Scott and Gardner 
[11]. Typically, they include five elements: 


i. Promoting play and a positive relationship. In order to cut into the 
cycle of defiant behaviour and recriminations, it is important 
to instil some positive experiences for both sides and begin to 
mend the relationship. Teaching parents the techniques of how 
to play helps them recognize their child’s needs and respond 
sensitively. The child, in turn, begins to like and respect their 
parents more and becomes more secure in the relationship. 

ii. Praise and rewards for prosocial behaviour. Parents are helped to 
reformulate difficult behaviour in terms of the positive behav- 
iour they wish to see, so that they encourage wanted behaviour, 
rather than criticize unwanted behaviour. For example, instead 
of shouting at the child not to run, they would praise him when- 
ever he or she walks quietly; then he or she will do it more often. 
Through hundreds of such prosaic daily interactions, child be- 
haviour can be substantially modified. Yet some parents find it 
hard to praise and fail to recognize positive behaviour when it 
happens, with the result that it become less frequent. 
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iii. Clear rules and clear commands. Rules need to be explicit 
and constant; commands need to be firm and brief. Thus, 
shouting at a child to stop being naughty does not tell him 
what he or she should do, whereas, for example, telling him 
to play quietly gives a clear instruction, which makes compli- 
ance easier. 

iv. Consistent and calm consequences for unwanted behaviour. 
Disobedience and aggression need to be responded to firmly 
and calmly by, for example, putting the child in a room for 
a few minutes. This method of ‘time-out from positive re- 
inforcement’ sounds simple but requires considerable skill 
to administer effectively. More minor annoying behaviours, 
such as whining and shouting, often respond to being ig- 
nored, but again parents often find this hard to achieve in 
practice. 

v. Reorganizing the child’s day to prevent trouble. There are often 
trouble spots in the day which will respond to fairly simple 
measures. For example, putting siblings in different rooms 
to prevent fights on getting home from school and banning 
electronic devices in the morning until the child is dressed. 


Treatment can be given individually to the parent and child, which 
enables live feedback in light of the parent's progress and the child’s 
response. Alternatively, group treatments with parents alone have 
been shown to be equally effective [40]. Trials showed that parent 
management training is effective in reducing child antisocial behav- 
iour in the short term, with moderate to large effect sizes of 0.5-0.8 
standard deviations, and there is little loss of effect at 1- or 3-year 
follow-up [41]. 


2. Family functioning. 


Functional family therapy, multi-systemic therapy, and treatment 
foster care aim to change a range of difficulties which impede ef- 
fective functioning of teenagers with CD. Functional family therapy 
addresses family processes which need to be present such as im- 
proved communication between parent and young person, reducing 
interparental inconsistency, tightening up on supervision and moni- 
toring, and negotiating rules and sanctions to be applied for breaking 
them. Functional family therapy has been shown to reduce reof- 
fending rates by around 50% [42]. Other varieties of family therapy 
have not been subjected to controlled trials for young people with 
CD or delinquency, so they cannot be evaluated for their efficacy. 

In multi-systemic therapy [43], the young person’s and family’s 
needs are assessed in their own context at home and in their rela- 
tions with other systems such as school and peers. Following the 
assessment, proven methods of intervention are used to address 
difficulties and promote strengths. Multi-systemic therapy differs 
from most types of family therapy such as the Milan or systemic ap- 
proach as usually practised in a number of regards. Firstly, treatment 
is delivered in the situation where the young person lives, for ex- 
ample at home. Secondly, the therapist has a low caseload (4-6 fam- 
ilies) and the team is available 24 hours a day. Thirdly, the therapist 
is responsible for ensuring appointments are kept and for making 
change happen—families cannot be blamed for failing to attend 
or ‘not being ready’ to change. Fourthly, regular written feedback 
on progress towards goals from multiple sources is gathered by the 
therapist and acted upon. Fifthly, there is a manual for the thera- 
peutic approach and adherence is checked weekly by the supervisor. 


Several randomized controlled trials have attested to the effective- 
ness, with reoffending rates typically cut by half and time spent in 
psychiatric hospitalization reduced further [43]. 

Treatment Foster Care is an approach developed in Oregon in the 
United States as a way to improve the quality of encouragement and 
supervision that teenagers with CD receive. The young person lives 
with a foster family specially trained in effective techniques; some- 
times it is ordered as an alternative to jail. Outcome studies have 
shown useful reductions in reoffending [44]. 


3. Anger management and child interpersonal skills. 


Most of the programmes to improve child interpersonal skills de- 
rive from cognitive behavioural therapy. A typical example is the 
Coping Power Programme [45]. This, and other programmes have in 
common, involves training the young person to: 


i. Slowdownimpulsive responses to challenging situations by stop- 
ping and thinking. 
ii. Recognize their own level of physiological arousal and their own 
emotional state. 
iii. Recognize and define problems. 
iv. Develop several alternative responses. 
v. Choose the best alternative, based on anticipation of con- 
sequences. 
vi. Reinforce himself or herself for use of this approach. 


Over the longer term, they aim to increase positive social be- 
haviour by teaching the young person skills to make and sustain 
friendships, turn taking and sharing, and express viewpoints in ap- 
propriate ways and listen to others. 

Typically, given alone, treatment gains with interpersonal skills 
training are good within the treatment setting but only gener- 
alize slightly to ‘real-life’ situations such as the school playground. 
However, when they are part of a more comprehensive programme 
which has those outside the young person reinforcing the approach, 
they add to outcome gains [46]. 


4. Difficulties at school. 


These can be divided into learning problems and disruptive be- 
haviour. There are proven programmes to deal with specific learning 
problems such as Reading Recovery; however, few of the pro- 
grammes have been specifically evaluated for their ability to improve 
outcomes in children with CD, although trials are in progress. 

There are several schemes for improving classroom behaviour, 
which vary from those which stress on improved communication 
such as ‘circle time’ and those which work on behavioural principles 
or are part of a multimodal package. Many of these schemes have 
been shown to improve classroom behaviour, and some specifically 
target children with CD [47]. 


5. Peer group influences. 


A few interventions have aimed to reduce the bad influence of de- 
viant peers. However, a number attempted this through group work 
with other conduct-disordered youths, but outcome studies showed 
a worsening of antisocial behaviour due to forming more ties with 
antisocial peers. Current treatments therefore either see youths in- 
dividually or work in small groups (say 3-5 youths) where the ther- 
apist can control the content of the sessions. Some interventions 
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place youths with CD in groups with well-functioning youths, and 
this has led to favourable outcomes [48]. 


6. Medication for coexisting hyperactivity. 


Where there is comorbid hyperactivity, in addition to CD, sev- 
eral studies attested to a large (effect size of 0.8 standard deviations 
or greater) reduction in both overt and covert antisocial behaviour, 
both at home and at school [49]. However, the impact on long-term 
outcome is unstudied. 


Engagement of the family is particularly important for this group of 
children and families, as dropout from treatment is high at around 
30-40%. Practical measures, such as assisting with transport, pro- 
viding childcare, and holding sessions in the evening or at other 
times to suit the family, will all help. Many of the parents of chil- 
dren with CD may themselves have difficulty with authority and of- 
ficialdom and be very sensitive to criticism. Therefore, the approach 
is more likely to succeed if it is respectful of their point of view, does 
not offer overly prescriptive solutions, and does not directly criti- 
cize parenting style. Practical homework tasks increase changes, 
as do problem-solving telephone calls from the therapist between 
sessions. 

Parenting interventions may need to go beyond skills devel- 
opment to address more distal factors which prevent change. For 
example, drug or alcohol abuse in either parent, maternal depres- 
sion, and a violent relationship with the partner are all common. 
Assistance in claiming welfare and benefits and help with financial 
planning may reduce stress from debts. 

A multimodal approach is likely to get larger changes. Therefore, 
involving the school in treatment by visiting and offering strategies 
for managing the child in class is usually helpful, as is advocating 
for extra tuition where necessary. Avoiding antisocial peers and 
building self-esteem may be helped by getting the child to attend 
after-school clubs and holiday activities. 

Where parents are not coping or an abusive relationship is de- 
tected, it may be necessary to liaise with social services to arrange 
respite for the parents or a spell of foster care. It is important during 
this time to work with the family to increase their skills, so the child 
can return to the family. Where there is permanent breakdown, 
long-term fostering or adoption may be recommended. 


Opportunities for prevention 


CD should offer good opportunities for prevention since: 


1. It can be detected early reasonably well. 
2. Early intervention is more effective than later intervention. 
3. There are a number of effective interventions. 


In the United States, a number of comprehensive interventions 
based on up-to-date empirical findings are being carried out. 
Perhaps the best known is Families and Schools Together [50]. Here 
the most antisocial 10% of 5- to 6-year olds in schools in disadvan- 
taged areas were selected, as judged by teacher and parent reports. 


They were then offered intervention which was given for a whole 
year in the first instance and comprised: 


. Weekly parent training in groups with videotapes. 

. An interpersonal skills training programme for the whole class. 
. Academic tutoring twice a week. 

. Home visits from the parent trainer. 


uu fF WN = 


. A pairing programme with sociable peers from the class. 


Almost 1000 children were randomized to receive this condition 
or controls, and the project has cost over $100 million. By age 25, 
children in the treatment group had less antisocial behaviour and 
criminality and psychopathology, but only if they were in the more 
severe part of the sample to begin with. 

In the United States, preschool education programmes for dis- 
advantaged children have shown good outcomes in small dem- 
onstration projects, but replication on a larger scale has generally 
proved rather disappointing. In the UK, the government stressed 
the importance of helping parents of children in the first 3 years 
of life and put substantial resources (€540 million) into SureStart 
centres in specifically targeted high-risk neighbourhoods to support 
parenting. Early evaluation of outcome showed no change on 24 of 
25 variables; maternal acceptance of the child was the only measured 
outcome to change, and child antisocial behaviour did not change 
[51]. However, more intensive high-quality parenting programmes 
started when children are young (aged 4-7 years) do have some evi- 
dence of longer-term effects, including prevention of antisocial per- 
sonality traits in adolescence [52]. 


Conclusions 


Much is known about the risk factors leading to CD, and effective 
interventions exist. The challenge is to make these available on a 
wide scale and to develop approaches to prevention which are ef- 
fective and can be put into practice at a community level. 
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Ever since the work of Charles Darwin established that the 
struggle for existence plays a major role as scientific explanation 
of diversification in nature [1], the striving to live has been con- 
sidered fundamental for all living creatures, also among human 
beings. Therefore, suicidal impulses and suicidal behaviour must 
be understood as disturbances of this fundamental condition. 
But suicide statistics clearly demonstrate that the will to live can 
be broken. While the strength of the force threatening to break 
the will to live is crucial, individual resilience play an important 
role too. When suicidal behaviour occurs, it must be considered 
as an indication that the strain on the individual is exceeding his 
or her capability to cope with the situation. The diathesis—stress 
model can be used as a framework for understanding suicidal im- 
pulses and suicidal behaviour. In 2003, van Heeringen presented 
a slightly modified version of the diathesis—stress model related 
to suicidal behaviour [2]. The model suggests that suicidal be- 
haviour might be explained through factors related to trait (de- 
termined by genetics, personality, intellect, and other factors) 
and factors related to state (which can be influenced by, for in- 
stance, depression or substance use disorder). Stressors and pro- 
tective factors, as well as threshold factors (for instance, access 
to lethal means, alcohol intoxication, crisis, and access to help), 
might further act as facilitators or barriers for suicidal behav- 
iour. Destructive forces, that is, stressors, can be manifold and di- 
verse and modified by protective factors hosted by the individual. 
Humans react differently to the same condition; thus, a situation 
perceived stressful for one individual may not be distressing for 
another individual. Different individuals have different levels of 
resilience. This may depend upon individual trait factors, gen- 
etics, or being grounded during childhood, but it can also be in- 
fluenced by more recent factors such as the support to which he 
or she has access. Similarly, the perception of options for help is 
likely to vary from one individual to another, as well as by cul- 
tural factors. Protective factors and stressors either from early 
life experiences or from the current context may impact traits 
or the state of the individual, while genetic factors are mainly 
thought to influence traits. Threshold factors, such as alcohol in- 
toxication, crisis, or access to help, can act either as drivers or as 
barriers for suicidal behaviour. 





Suicide cannot be understood as a disease or an accident; suicidal 
acts should be considered as severe and preventable complications 
to a range of diseases and conditions in which social aspects play 
an important role. The understanding of suicidal acts as complica- 
tions emphasizes that, like other complications, suicidal acts can be 
prevented. 

The most frequently used model for suicide prevention is the 
Universal, Selected and Indicated preventive model (USI-model), 
which was suggested by Gordon [3] and later accepted by the 
Institute of Medicine [4]. This model provides a relevant and ap- 
plicable framework for suicide prevention. Universal prevention 
strategies aim to address an entire population and is designed to in- 
fluence everyone. Universal interventions include programmes such 
as: public education campaigns, restricting access to suicide means, 
and education programmes for the media on reporting practices re- 
lated to suicide. There are several examples of universal interventions 
that could be considered suicide-preventive but were not launched 
as such, mainly because the primary focus of the intervention is not 
suicide—for instance, school-based programmes aiming at redu- 
cing bullying and campaigns to reduce mental health stigma. An 
example of successful universal suicide prevention is the restriction 
of pack sizes on paracetamol, which has been linked to less severe 
suicide poisoning [5]. Selective prevention strategies aim to prevent 
the onset of suicidal behaviours among specific subpopulations. 
Important high-risk groups are the mentally ill, alcohol and drug 
misusers, those with a newly diagnosed severe physical disorder, 
prisoners, and the homeless, as well as other socially marginalized 
groups. Crisis plans for patients with mental illness is an example of 
application of selective prevention strategies [6]. Another example 
is better education of general physicians for recognition and treat- 
ment of depression, which has been shown to reduce suicide rates 
[7]. Indicated prevention strategies are focused on high-risk groups, 
that is, persons who have attempted suicide or have presented them- 
selves to health services because of suicidal ideation. An example of 
a promising indicated strategy is the psychosocial therapy provided 
to people after suicide attempt [8]. In addition, help lines, psychi- 
atric emergency rooms, psychiatric emergency outreach, and other 
crisis interventions can play a role in preventing suicide attempts 
among persons in a suicidal crisis. 
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National action plans for suicide prevention generally include all 
three types of strategies. 


The World Health Organization (WHO) defines suicide as the act 
of deliberately killing oneself [9]. This definition underlines that the 
death is caused by a deliberate act, thereby separating suicide from 
accidents. The definition does not mention suicidal intent, which, in 
some cases, can be difficult to determine after death. The number of 
suicides is recorded and suicidal rates are available for most high- 
income countries. 

The WHO [9] defines suicide attempt as any non-fatal suicidal 
behaviour, such as intentional self-inflicted poisoning, injury, or 
self-harm, which may or may not have a fatal intent or outcome. 
This definition implies that non-fatal self-harm without suicidal in- 
tent is included under this term, which is problematic due to the 
possible variations in related interventions. However, the WHO 
chooses this definition because suicide intent can be difficult to de- 
termine as it may be surrounded by ambivalence or even conceal- 
ment. Some countries record hospital-treated suicide attempts, but 
underreporting is common and suicide attempts outside hospital 
settings are not included in statistics. Other terms, such as deliberate 
self-harm, are also used for suicide attempt. 

Suicidal ideations or thoughts may vary in terms of intensity, 
from being sporadic and transient to being frequent and persistent. 
Suicidal thoughts can be active, such as considering killing one- 
self, or passive such as wishing to catch an incurable disease or not 
waking up tomorrow. Some people might express their suicidal 
thoughts overtly without being prompted; others present them only 
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after questioning or conceal them. Suicidal thoughts include suicidal 
plans where a distinction can be made between plans for not being 
alive anymore (writing a testimony, preparing economy, and writing 
suicidal notes) and plans for carrying out a suicidal act (selecting the 
method, time, and place). 


Epidemiological trends and registration of suicides 





The WHO estimates that the number of suicides worldwide was 
803,900 in 2012 [9]. 

The absolute number, that is, the actual number of people dying 
by suicide, in a country is dependent on the number of people living 
in the country. Thus, when comparing suicide figures across bor- 
ders, a yearly suicide rate per 100,000 inhabitants is usually used. 
National suicide rates based on those countries that report data on 
causes of death are presented in Fig. 125.1. Specific regions are noted 
for high suicide rates: Eastern Europe (Russia, Ukraine, Lithuania, 
Latvia, Poland, Hungary, Belarus, and Kazakhstan) and Eastern 
sub-Saharan countries (Sudan, Uganda, Kenya, Tanzania, Zambia, 
Malawi, and Zimbabwe). In addition, particularly countries such as 
India and Japan, as well as Guyana and Suriname in South America, 
list high rates of suicide. 

The quality of death and population statistics varies substantially 
across the world, and one has to bear this in mind when comparing 
rates. For instance, low- and middle-income countries with low in- 
frastructure might have limited resources to dedicate to recording 
suicide statistics. The fact that more than 75% of all suicide takes 
place in low- and middle-income countries (Fig. 125.2) complicates 
the process of estimation. Suicide has, throughout the ages, been 
subjected to taboo and stigmatization. Religious and cultural norms 


3400 Kilometres 


0 850 1700 


Fig. 125.1 (see Colour Plate section) Age-standardized suicide rates for both genders by country (2012). 
Reproduced from World Health Organization, Preventing suicide: A global imperative, Copyright (2014), with permission from the World Health Organization. 
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Fig. 125.2 Absolute numbers of suicide in the world by age and income level (2012). 
Reproduced from World Health Organization, Preventing suicide: A global imperative, Copyright (2014), with permission from the World Health Organization. 


might imply that a suicide death is unacceptable or shameful, and 
social or legal sanctions are possible consequences. Several religions, 
such as Catholicism and Islam, consider suicide a sin and the burial 
of a deceased by suicide might not be granted the same rituals as 
other deceased, and relatives might experience sanctions [10]. For 
such reasons, it is plausible that suicide deaths in some countries 
might, to a larger extent, be recorded as accidents or other causes of 
death, partially out of concern for the relative. 

The administrative procedure of recording causes of death varies 
substantially across the world. In the UK, for instance, a jurisdic- 
tional coroner system determines whether a death is a suicide, an ac- 
cident, or undetermined, that is, an ‘open verdict’ [11]. In this setting, 
supportive evidence, such as a ‘suicide note, documenting intent of 
suicide, might be required before a cause of death is determined to 
bea suicide. In other high-income countries, the presence of suicidal 
intent is generally determined by medical or forensic staff. For the 
reasons listed previously, caution has to be applied when comparing 
suicide rates of different countries. 


In the following sections, societal and individual risk factors for 
suicide are presented. There is no clear distinction between societal 
and individual risk factors; yet both groups of factors have an inter- 
dependent impact on the risk of suicide. For example, an economic 
crisis affects the entire society, while specific individuals will be dis- 
tressed by unemployment or bankruptcy. Likewise, free-of-charge 
access to high-quality health care can be considered as a societal 
factor, while the influence by treated or untreated mental disorders 
on the risk of suicide is an individual factor. Still, there are clear links 
between the two dimensions. The relative risk associated with indi- 
vidual risk factors are listed in Table 125.1. 


Societal factors 


On the societal level, there are factors which can be associated with 
high or low risk of suicide. Rather than focusing on individual risk 
factors, the French sociologist Durkheim attributed variations in 
suicidal behaviour between different countries to societal factors 
such as sense of connectedness and social cohesion. Durkheim used 
the term ‘anomic’ to describe suicides in societies with poor con- 
nectedness between people and where the individual behaviour 
was not regulated by a strong social cohesion [12]. He hypothe- 
sized that anomic suicides were likely to increase in periods when 
social norms and roles underwent rapid changes. As such, the steep 
increase in suicide rates seen in indigenous societies, for example 
among New Zealand’s Maori population, Australia’s aborigines, the 
United States’ native Indians, and Canadian and Greenland’s indi- 
genous peoples, could be seen as examples of increasing numbers of 
anomic suicides [13, 14]. 

Societal factors influence the number of suicides occurring in 
a country. In the 1980s, the ‘Perestroika policy introduced by the 
Soviet leader Mikhail Gorbachev included restrictions on alcohol 
sales. Alcohol misuse is linked to suicide in some populations, and 
the implementation of the policy consequently resulted in a marked 
drop in the suicide rate [15]. After the dissolved Soviet Republic, 
structural changes and increasing unemployment, as well as rising 
alcohol sales, contributed to an increase in suicide rates in many 
post-Soviet countries. The global financial crisis of 2007-2008 
was linked to an increase in the numbers of suicide, particularly in 
European countries and the United States [16]. While the financial 
crisis had an impact on a societal level, for instance, through cut- 
backs in state budgets, rising unemployment rates might act as an 
individual stressor, as suggested previously. Significant increases 
in suicide rates of age groups belonging to the working forces were 
noted in the years following. Interestingly, countries with social 
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Table 125.1 Relative risks denoting the risk of suicide with respect 
to select risk factors of suicide in relation to the general population 


[a fmen [women 





Schizophrenia [57] 9 19 
Bipolar depression disorder [57] 11 18 
Unipolar depression disorder [57] J 15 
Substance abuse' [57] 7 13 
Any mental disorder [57] 6 8 
Previous self-harm [73, 92-94] >30 >30 
Prisoner [95]? >3 >6 
Divorced [96° 3 3 
Widow/widower [97] 23 V7 
Homeless [34] 7 15 
Same-sex married [37]° 4] 64 
Somatic illness [98} 1.8 24 
Transgender [99]’ 57 DD 


Based on a psychiatric substance abuse diagnosis. 

Higher in Asian countries. 

3 Inmate suicide rates in women varied widely in different countries and was based on 
small numbers of suicide. 

Based on a number of longitudinal follow-up studies of people presenting with 
self-harm. 

° Based on somatic illness requiring inpatient hospitalization (adjusted analysis). 
easured as persons living in a same-sex marriage, when compared to persons living 
in an opposite-sex marriage. Figures are adjusted for persons never married. 

7 Persons who have received hormone therapy or transgender surgery or diagnosed with 
transgender orientation. 


N 


6 








welfare systems, such as Finland and Sweden, seemingly were less 
affected by the crisis [17]. 

Durkheim predicted that when a country is in war, this would 
increase the social cohesion and strengthen the societal influence of 
individual’s behaviour. Some support for this hypothesis has been 
found, for instance among the countries participating in the First 
World War where a decline in suicide rate was noted over the same 
period [18]. Decreasing suicide rates were, among others, noted in 
Norway and Sweden during the Second World War [19]. Newer 
findings indicated that civil wars might not have this effect on sui- 
cide rate, potentially explained through the lack of social cohesion 
when national, oppositional groups are in conflict [20]. 


Role models and media portrayal 


Goethe’s book The Sorrows of Young Werther depicts a tragic love 
story from the Romantic period where the main character ends up 
taking his own life. Legends from contemporary Germany report 
how young men would identify with the story and copy the be- 
haviour of Werther—sometimes even dressed in similar ropes and 
found with the book next to their body [21]. The main character gave 
name to the phenomenon ‘the Werther effect; which denotes famous 
suicide deaths (by fictive or actual persons) that are being copied. 
A more recent case of this was observed in Germany in 2009 where a 
former national soccer player R. Enke died by suicide. In subsequent 
years, a 31% increase in suicides by the same method was observed 
among men [22]. Fictive portrayals, as a suicide of a main character 
in a soap opera, have been linked to an increase in self-poisonings 


admitted to hospital by women [23]. Media guidelines installing that 
reports of suicide should not list details regarding the method and 
avoid depicting the event as a heroic act have subsequently been de- 
veloped [24]. Interestingly, the media might also have a preventive 
impact on suicides when reports are made according to media 
guidelines [25]. In order to emphasize that media may also have a 
positive impact, this effect is now called ‘the Papageno effect’. 


Individual risk factors 


On an individual level, various circumstances are linked to elevated 
vulnerability to suicide. Coping strategies and support from net- 
work are important moderators of these risk factors. Some of the 
most relevant individual risk factors for suicide are listed in the fol- 
lowing sections. 


Gender and age 


In most countries, suicides by men outnumber those by women at 
a ratio of 1:2 to 1:4. However, as can be seen in Fig. 125.3, there is 
great variation in the sex ratio between countries. One explanation 
for this is gender-specific preferences for methods; men tend to use 
more lethal means of suicide, while women outnumber men when 
it comes to suicide attempts as a result of predominantly using poi- 
soning [26]. China is an exception to this, as Chinese women have 
a 25% higher suicide rate than men [27]. This is largely attributed to 
suicides in rural areas where young women have a very high rate of 
suicide and might be explained through their low social status and 
the availability of highly lethal means (pesticides), as well as limited 
options for social support. 

As shown in Fig. 125.4, suicide rates increase with increasing age 
in those countries that report their data to the WHO. While older 
adults aged 75 years and older have the highest rate, more detailed 
studies of the suicide rates among the oldest age groups reveal even 
higher rates for men until age 90 years, while the rate for women 
peaks at age 85-94 [28, 29]. In terms of absolute numbers, most sui- 
cides in high-income countries occur among persons in mid life at 
around age 50. In low- and middle-income countries, most suicides 
take place among the very young aged 15-24 (Fig. 125.2). In all 
countries, suicides among children below age 15 years are very rare. 


Family and close relations 


Familial conditions, such as separation, divorce, and widowhood, 
are linked to a higher risk of suicide, while marriage is often found 
to be protective against suicide [30]. Having a child, particularly a 
young child, protects parents against suicide [31]. 

Stressful life events, such as the death of a child or loss of a relative 
to suicide, are linked to increased risks of suicide [31, 32]. Studies 
showed that the recent loss of a partner constitutes a stressor that is 
associated with higher risks of suicide, for instance in older adults 
[33]. Moreover, it is plausible that humiliation, bullying, financial 
ruin, fear of not passing exams, or losing one’s job are all factors that 
might be associated with an increased risk of suicide. 


Minority groups 


An increased suicide mortality is found among homeless people; this 
is particularly dominant in younger age groups [34]. Imprisonment, 
but also being charged with a criminal offence, although not found 
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Fig. 125.3 (see Colour Plate section) Sex ratio of age-standardized suicide rates by country (2012). 
Reproduced from World Health Organization, Preventing suicide: A global imperative, Copyright (2014), with permission from the World Health Organization. 


guilty, are linked to increased risks of suicide [35]. The period imme- 
diately after imprisonment is associated with the highest risk of sui- 
cide [35]. For marginalized groups, such as the homeless or people 
with a criminal history, the elevated risk of suicide might be medi- 
ated through untreated mental disorders. 

Sexual minority groups are generally thought to have a higher sui- 
cide rate, although limited evidence has been produced [36]. Based 
on Danish register data, persons living in same-sex partnership had 
higher suicide rates than those living in opposite-sex partnerships 
[37]. A study on transgendered persons in treatment found that 
these have a higher suicide rate than the general population [38]. 





Until recently, the rate of suicide in the military has been lower than 
the rate among civilians [39]. However, large cohort studies based on 
active US army soldiers (deployed or not) and veteran populations 
(that is, those who are no longer employed in the army) have shown 
a rise in the suicide risk in the period spanning from 2004 to 2009— 
even higher than in the general background population [40, 41]. In 
the UK, a similar trend was found in a large cohort study; however, an 
excess risk, when compared with a background population, applied 
only to veterans younger than 24 years [42]. Regarding leaving the 
armed forces, both US and UK studies found that the suicide risk was 
highest in the first years after leaving the military [42, 43]. 
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Fig. 125.4 Suicide rate age group for 62 countries reporting to WHO Mortality Data Bank (2004-2009). 
Reproduced from World Health Organization, Preventing suicide: A global imperative, Copyright (2014), with permission from the World Health Organization. 
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Suicide rates in different occupational groups have been exam- 
ined in several studies. In a large Danish study, the highest suicide 
rates were found among doctors and nurses, as well as occupational 
groups with unstable working conditions and low salaries, mainly 
unskilled manual workers, for example domestic and related helpers, 
cleaners, and launderers. Much, but not all, of the excess risk for 
doctors and nurses was due to their increased use of self-poisoning, 
a feasible method, considering their knowledge on toxicity and rela- 
tively easy access to medical drugs [44]. Police officers constitute a 
professional group exposed to a high level of job stress and moreover 
with access to firearms. Some studies reported increased suicide 
rates of this group and have examined the effectiveness of preven- 
tion programmes [45, 46]. People who are unemployed have higher 
suicide risks than those in work. The same goes for people who are 
retired or on disability pensions [47]. 


Mental disorders 


The strongest individual risk factors are mental disorders and pre- 
vious suicidal behaviour. In fact, the increased risk of suicide is one 
of the main explanations for why people with mental disorders 
have a 15-20 years shorter life expectancy, compared with the gen- 
eral population [48]. Psychological autopsy studies where available 
post-mortem information on the suicide victim is collected from, 
for instance, family members and general practitioner found that up 
to 90% of people who died by suicide met the criteria for a mental 
disorder [49, 50]. While acknowledging that not all people with 
mental disorders receive treatment, a recent study found that the 
risk of suicide varies according to the level of mental health care re- 
ceived; people admitted to inpatient care had a 44-fold higher risk 
of suicide than the general population, whereas those receiving out- 
patient care or psychiatric medication had an 8- or 6-fold higher 
risk, respectively [51]. 

While suicide among psychiatric inpatients is a rare event, this 
population still has a high rate of suicide—147 per 100,000 patient- 
years [52], which is almost 13 times higher than the annual global 
age-standardized suicide rate of 11.4 per 100,000 provided by the 
WHO [9]. Two phases are linked to particularly high risks: patients 
who are currently admitted and those who are recently discharged 
from psychiatric hospital [53]. Most inpatient suicides take place 
within the first week of admission [54, 55]. The first days and months 
after discharge comprise a suicidal high-risk period, especially in the 
first week where the risk has been reported to be up to 102-fold (for 
men) and 246-fold (for women) higher, when compared to a back- 
ground population with no admission history. The following clinical 
factors have been associated with an increased risk: history of sui- 
cide attempt, depressed mood, feelings of hopelessness or worthless- 
ness and suicidal ideations expressed at admission, family history 
of suicide, diagnosis of schizophrenia, and having been prescribed 
antidepressant medication [56]. 

Virtually all mental disorders are linked to an elevated risk of sui- 
cide. A register-based long-term follow-up study investigated the 
cumulative incidence of suicide by time since the first psychiatric 
inpatient or outpatient contact for each disorder in men and women, 
respectively [57]. The incidences were based on a 36-year observa- 
tional follow-up study of the Danish population and showed that 
the suicide risk is highest in the first years after first contact and that 
all psychiatric diagnoses had an elevated suicide risk throughout 
the study period. The overall cumulative risk of dying by suicide 


in people who had inpatient or outpatient clinical contact with 
specialized mental health services was 4% in men and 2% in women 
and varied across diagnoses, with a particularly higher risk of sui- 
cide in those diagnosed with schizophrenia or affective disorders, 
compared with other diagnoses such as substance abuse. Women 
diagnosed with bipolar/unipolar affective disorder or schizophrenia 
had a 4-5% long-term risk of dying by suicide, whereas this risk in 
men was 6-7%. 

The increased risk of suicide in people with bipolar disorder 
has been estimated to be 200-400 per 100,000 person-years. The 
International Society for Bipolar Disorders published a meta- 
analysis of predictors of suicide death in relation to bipolar disorder 
[58]. Based on 12 studies, the variables being a man and having a 
first-degree family history of suicide were identified as significant 
risk factors of suicide. 

Unipolar depressions occur with a prevalence of 2-3% among 
men and 3-5% among women in the general population [59, 60], 
and the general practitioner is often the first treatment contact for 
people with depression. It is difficult to identify who are at eminent 
risk of suicide, but addressing suicidal issues is important among pa- 
tients who experience depressive episodes. Many studies have exam- 
ined risk factors of suicide in people diagnosed with depression, and 
a meta-analysis based on 19 studies found that male gender, family 
history of psychiatric disorder, previous attempted suicide, severe 
depression, hopelessness, and comorbid disorders, such as anxiety 
and substance misuse, were significant predictors [61]. Treatment of 
depression is very important to reduce suicide risk. Antidepressants 
are commonly used in the treatment of people with depression. New 
generations of antidepressants, such as selective serotonin reuptake 
inhibitors (SSRIs) and serotonin/noradrenaline reuptake inhibitors 
(SNRIs), have been linked to suicidal ideation and preparative acts 
and behaviour in children and young people [62, 63], but protective 
for those older than 64 years [62]. When prescribing antidepres- 
sants, it is important to carefully monitor patients, especially when 
young people are treated. 

In a meta-analysis, the lifetime suicide risk in schizophrenia was 
estimated to be 5.6% [57, 64]. In a long-term follow-up study from 
the UK of a large cohort of individuals, the absolute risk of suicide 
was 3.23% for patients 20 years after the initial diagnosis of a psych- 
otic disorder [65]. Based on the standardized mortality ratio, this 
study also reported that the risk of suicide was 12-fold higher in pa- 
tients with psychosis, when compared with the general population. 
A meta-analysis of risk factors for suicide in schizophrenia based 
on 29 eligible studies found robust evidence of increased risk being 
conferred by depressive disorders, previous suicide attempts, drug 
misuse, agitation and motor restlessness, fear of mental disintegra- 
tion, poor adherence to treatment, and recent loss [66]. 

Suicide is also increased in those suffering from substance abuse 
[67-69] and personality disorders, especially borderline personality 
disorders [70]. Further, these disorders often exist as comorbidity 
to depression or schizophrenia. Thus, in addition to adding to the 
suicide risk, it also complicates the treatment and recovery of these 
patients [71, 72]. 


History of suicide attempt 


Having a history of suicide attempt is possibly the most significant 
risk factor of completed suicide. A meta-analysis showed that as 
many as 1.6% persons died of suicide within 1 year after re-presenting 


to hospital with a non-fatal suicide attempt and that one in 25 had 
died after 5 years [73]. In these studies, the suicide risk associated 
with the different mental disorders almost doubled if a patient had 
also been admitted to hospital with a suicide attempt [57]. 

A Swedish register-based study examined the association between 
the method of attempted suicide and the risk of subsequent suicide 
and found that individuals who had attempted suicide by hanging, 
strangulation, or suffocation had higher risks of dying by sui- 
cide [74]. In particular, this study highlighted that among patients 
who were admitted to hospital due to suicide attempt by method 
of hanging, strangulation, or suffocation and who had a coexisting 
diagnosis of a non-organic psychosis, 68.8% of women and 69.6% of 
men had completed suicide within a year. 


Physical disorders 


Severe somatic disorders have been associated with an increased 
risk of suicidal behaviour. Studies of specific disorders identified 
that peptic ulcers [75], renal disease [75], arthritis [76], acquired 
immune deficiency syndrome (AIDS) [77], cancer [78, 79], cerebral 
stroke [80-82], myocardial infarction [83], diabetes [84], multiple 
sclerosis [85, 86], and epilepsy [84, 87] were linked to an elevated 
suicide risk. Also relatively rare diseases, such as fibromyalgia and 
Crohn's disease [88, 89], have been linked to increased risks of sui- 
cidal behaviour. Danish findings showed that the time immediately 
after discharge from treatment of somatic diseases is linked to an 
excess risk [90]. 


Population-attributable risk 


Population-attributable risk (PAR) is a useful measure when 
thinking in terms of suicide prevention. It estimates the poten- 
tial for prevention if addressing effectively specific individual risk 
factors, although it does not include a measure for estimating the 
potential for prevention of societal risk factors. The PAR denotes 
the proportion of all suicides prevented if one were to avoid all 
suicides associated with a specific risk factor. For example, if the 
excess risk of suicide among single men would be reduced to the 
level of the general population, then the total number of suicides 
would be reduced with 26.2% (Table 125.2). The usefulness is ap- 
parent when comparing the PAR of different risk factors, as this 
provides information on which risk factors are linked to larger 
potentials of prevention. An important consideration is the popu- 
lation size affected by a risk factor. As the epidemiologist Rose 
noted, an intervention aimed at a large population at a low risk 
might result in prevention of more suicides than an intervention 
aimed at a small, but high-risk, group [91]. The size of the exposed 
population for each risk factor is also included in the PAR. As seen 
in Table 125.2, an intervention aimed at preventing suicides in a 
large population, such as single men, that has a relatively small 
excess suicide risk has the potential to prevent one in four of all 
male suicides. An intervention aimed at a proportionally smaller 
group, such as women who have been diagnosed with schizo- 
phrenia who have a 19-fold higher risk of suicide than the general 
population, would prevent 3.4% of all female suicides. The factors 
with the highest PAR are mental disorders, recent suicide attempt, 
and single marital status. 
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Table 125.2 Population-attributable risk (%) associated 
with specific risk factors [100-101] 

















Co-habiting marital status es) 1.0 
Single 202 20.6 
Unemployed 3.0 2.1 
Age pensioner 70 18.8 
Disability pensioner 17 6.7 
mprisoned 1.0 = 
Lowest-income quartile 93 2 
Sickness-related absence from work 6.6 6.2 
Recent suicide attempt 11.8 244 
All psychiatric diagnoses 325 sh) 
Any affective disorder 8.6 19.2 
- Bipolar disorders 1.0 Ze 
- Recurrent depression 29 9.8 
- Other affective disorders Sh Ue 
Schizophrenia 3.2 3.4 
Other schizophrenic disorders 23 3.7 
Borderline personality disorders 0.1 ley, 
Other personality disorders 25) Ua 
Reaction to stress/adjustment disorders 40 56 
Other anxiety disorders 0.4 13 
Substance use disorders 76 6.0 
- Alcohol use disorders 6.0 em 
- Drug use disorders 1.6 23 
Dementia 0.5 0.7 
Other psychiatric disorders 23 49 


“Estimated on Danish citizens in year 2010, based on numbers from the Centre for 
Suicide Research in DK (prevalence of suicide attempt), Statistics Denmark (population 
size), and Carroll et al, 2014 (73) (percentage of fatal repitition within a year). 
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Self-harm 


Epidemiology and risk factors 


Nav Kapur, Sarah Steeg, and Adam Moreton 





Self-harm is a behaviour, not a diagnosis, so why should it appear 
in a textbook of psychiatry? People commonly present to emer- 
gency departments or mental health services having poisoned or in- 
jured themselves and in a state of distress. Self-harm is probably the 
strongest risk factor for suicide, and yet services to help people are 
patchy or non-existent. Although the evidence base to inform man- 
agement is growing, it remains thin. In this chapter, we will discuss 
what we know about the epidemiology and risk factors for self-harm. 





Characterizing non-fatal suicidal behaviour is challenging, and the 
terminology in this area has developed over time. ‘Attempted sui- 
cide’ is problematic because many patients who self-harm do not 
wish to die or are too distressed to have formed a clear intent at the 
time of the episode. “Parasuicide’ describes an act of deliberate self- 
injury or poisoning which mimics the act of suicide but does not re- 
sult in a fatal outcome [1]. However, this still contains a reference to 
suicide. “Deliberate self-harm’ became the preferred term in the UK, 
but in recent years, the prefix ‘deliberate’ has been dropped in re- 
sponse to concerns that it was judgemental and in recognition of the 
heterogenous nature of the phenomenon [2]. Self-harm refers to an 
intentional act of self-poisoning or self-injury, irrespective of motiv- 
ation [3]. This is the term that will generally be used throughout this 
chapter, except when we describe individual studies. People can hurt 
themselves in a variety of ways—by poisoning, self-cutting, burning, 
and scratching, as well as by other forms of potentially more lethal 
self-injury such as hanging, stabbing, and jumping. 


Categorizing self-harm 





Should people who self-harm be categorized into those who have 
clear suicidal intent and those who do not? Proponents have argued 
that a diagnosis of ‘non-suicidal self-injury disorder (NSSID) would 
increase precision and lead to improved communication between 





health professionals, as well as stimulate research into causes and 
prevention and the development of specific treatments. Others have 
argued that the term is not useful because suicidal and non-suicidal 
motivations often coexist, methods of self-harm change, there is no 
clear dichotomy of intent in self-harm populations (it is continu- 
ously distributed), and even people with low-intent episodes are at 
increased risk of subsequent suicide [4]. It has been suggested that 
there are potential problems with creating a new diagnosis of NSSID 
for which there are few proven treatments and which could stigma- 
tize large numbers of young people unnecessarily. Given the uncer- 
tainty, we will use the broad and inclusive definition of self-harm, 
as highlighted in UK national guidance [5]. However, it should be 
noted that NSSID and attempted suicide disorder are included in 
DSM-5 as conditions requiring further study. 





It has been suggested that the epidemiology of suicidal behaviour 
might be best conceptualized as an iceberg (Fig. 126.1). Behaviours 
become more common, but less severe and less visible as one descends 
the iceberg. The most serious episodes are at the top—deaths by sui- 
cide. Self-harm which comes to medical attention is the next level of 
the iceberg. Even when people present to hospitals, some episodes may 
go unrecognized and are hence ‘under the surface’ Self-harm in the 
community may be hidden and never come to the attention of health 
or helping services. Below this, there may be an even larger group of 
people who experience thoughts of self-injury without acting on them. 
The important point to note is that the vast majority of epidemiological 
research in self-harm has been carried out in hospital settings. 

We know far less about self-harm in the community. It is difficult 
to put definitive numbers to the various levels of the iceberg, but 
McMahon and colleagues attempted to do just this in a recent Irish 
study of suicidal behaviour in young people where they estimated 
just 6% of self-harm episodes resulted in hospital presentation. This 
proportion might be higher in adults—data from a UK household 
survey suggested that around half of people who attempted suicide 
or self-harm received some form of help from clinical services [6]. 
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Fig. 126.1 The iceberg of suicidal behaviour. 


In the following sections, we will examine data on the commu- 
nity prevalence of self-harm from international and school surveys 
before moving on to the hospital-based epidemiology of self-harm. 


International surveys of suicidal behaviour 





One of the challenges of international survey data is ensuring con- 
sistent methodology across settings. Findings need to be interpreted 
cautiously, as survey data tend to be self-reported. Response rates 
may also vary between centres. 

Nock and colleagues [7] assessed the prevalence of suicidal 
ideation, plans, and attempts using the Composite International 
Diagnostic Interview, as part of the World Health Organization 
(WHO) World Mental Health Survey initiative. Across 17 high- 
, middle-, and low-income countries, the lifetime prevalence of 
suicide attempts was 2.7%, with a lifetime prevalence of 9.2% for 
ideation and 3.1% for suicide planning. There was substantial cross- 
national variability, with the lifetime prevalence of suicide attempts 
ranging from 0.7% in Nigeria to 5% in the United States. 

More recently, the WHO has estimated that the global annual rate 
ofsuicide attempts is around 4 per 1000 adults on the basis ofthe same 
survey methodology [8]. Interestingly, the 12-month prevalence 
rates for suicidal behaviour were found to be similar for males and fe- 
males in high- and low-income countries (0.3-0.4%), but higher for 
females (0.6%) than males (0.3%) in middle-income countries [8]. 

In a UK household survey, 5.6% of adults reported a lifetime his- 
tory of suicide attempts and 4.9% a history of self-harm (without 
suicidal intent), with a 50% overlap between the groups. The preva- 
lence of suicide attempts during the previous year was 0.7% (these 
previous-year data were not available for self-harm) [6]. 


School surveys of self-harm 


School-based surveys are a useful source of information. In an inter- 
national study of self-harm in adolescents (Child and Adolescent 
Self-harm in Europe study), Madge and colleagues [9] reported rates 
for self-harm in the previous year of 8.9% for females and 2.6% for 


males, and lifetime rates of 13.5% and 4.3%, respectively. In terms of 
methods of self-harm, over half of episodes involved self-cutting and 
around a fifth poisoning. Many countries have shown similar rates of 
self-harm in their adolescent populations, (for example, [10, 11]), al- 
though some recent research reported considerable variation. A study, 
using self-report data on over 45,000 15- to 16-year olds from 17 coun- 
tries participating in the European School Survey Project on Alcohol 
and Other Drugs, found the proportion reporting a lifetime suicide at- 
tempt varied from 4% in Armenia to 24% in Hungary, with a median of 
around 11% [12]. Female rates were consistently almost twice as high 
as male rates. Regardless of the precise incidence, it is clear that self- 
harm is a commonly reported issue in adolescence and often a major 
concern for young people themselves, their parents, and their teachers. 


The hospital-based epidemiology of self-harm 


Register-based studies of self-harm are another valuable source of 
information but generally only identify people presenting to hos- 
pital. However, this is an important and relatively severe subgroup 
of those who self-harm. 

As part of the WHO/EURO multi-centre study of suicidal be- 
haviour, Schmidtke and colleagues [13] investigated treated epi- 
sodes and found that self-harm was more common among females 
than males and higher rates were found among younger adults. In 
particular, young women aged 15-24 years and young men aged 
25-29 years had the highest rates of self-harm. There was also sub- 
stantial variation, with a 6- to 9-fold difference in self-harm rates, 
even among countries in Europe. 

Ireland has a national registry of self-harm, which has been 
running since 2002 and collects data from hospital emergency de- 
partments [14]. In 2014, the annual rate of self-harm was 185 per 
100,000 for men and 216 per 100,000 for women. The peak ages were 
15-19 years for females and 20-24 years for males. In contrast to 
community settings, the majority of episodes (70%) involved over- 
doses of medication or other substances and 26% involved cutting. 
In interpreting these data, it should be noted that emergency depart- 
ment visits incur a personal cost to the patient in Ireland. 

In England, self-harm data are available from 2000 for the three sites 
that make up the Multicentre Study of Self-harm in England (Oxford, 
Manchester, and Derby) [15]. In 2013, the rates of self-harm were 322 
per 100,000 per year for men and 468 per 100,000 per year for women. 
The peak age of presentation during the study period (2000-2012) 
was 15-24 years for women (41% of female episodes). Men had a 
more even age distribution, with approximately one-third of men who 
self-harmed aged 15-24 years and one-third aged 35-54 years. Self- 
poisoning featured in 75% of episodes, and self-injury (mostly cut- 
ting) in 21% of episodes. In terms of medication ingested in overdose, 
despite legislation to reduce pack sizes, paracetamol was the most 
common substance (ingested in 35% of episodes). This was followed 
by selective serotonin reuptake inhibitor (SSRI) or serotonin/nor- 
adrenaline reuptake inhibitor (SNRI) antidepressants (17%), benzo- 
diazepines (14%), and paracetamol-containing compounds (11%). 


Trends in self-harm 


There was a substantial increase in the rates of self-harm during the 
1960s and 1970s in the UK [16], but accurate long-term data are 


sparse. Data from the Oxford Monitoring system suggests hospital 
presentations for self-harm have increased by 60% since the 1970s, 
but annual rates based on individuals are not dissimilar to earlier 
years. It seems likely that rates of hospital-based repetition have in- 
creased, but the reasons for this are unclear. 

International data suggest that the prevalence of suicidal behav- 
iour may have remained relatively stable between 1997 and 2007 
[7]. With respect to adolescent self-injury, a systematic review con- 
cluded that although rates were high, relative to other age groups, 
they did not seem to have increased internationally between 2007 
and 2011 [17]. 

It seems likely that rates of self-harm in adults increased in re- 
sponse to the 2008 recession, especially in centres which experi- 
enced increased joblessness in the UK [18]. Research from Ireland 
suggests a greater impact on men [19]. There has been much recent 
concern about a possible epidemic of self-harm among adolescents. 
This has not been substantiated in the research literature but is a 
focus of much ongoing work. 


Risk and protective factors for self-harm 


The causes of self-harm are multiple and complex. Some researchers 
conceptualize self-harm as part of a ‘suicidal process’ that ranges 
from thoughts through to self-injurious behaviour and ultimately 
suicide for a minority of people [20]. Self-harm may share many risk 
factors with suicide, but some are also distinct. One striking differ- 
ence is the gender ratio—self-harm tends to be more common in 
women, and suicide much more common in men. It has been sug- 
gested that the reasons for this disparity may include methods (men 
may choose more lethal methods when they hurt themselves) or 
help-seeking behaviour (men may be less likely to seek help if they 
become physically or psychologically unwell). Table 126.1 summar- 
izes some of the most important risk and protective factors for self- 
harm, and these will be discussed in more detail in the next section. 
Protective factors have been subject to much less research than risk 
factors. 


Table 126.1 Risk and protective factors for self-harm 


Gender 

Age 

Ethnicity 

Sexual orientation 
Deprivation 

Social support (p) 
Religious affiliation (p) 


Sociodemographic factors 


Genes 
Neurochemistry 


Biological factors 


Clinical factors Psychiatric disorder 
Physical disorder 


Alcohol misuse 


Psychological factors Impulsivity 
Poor problem-solving skills 


Hopelessness 


Environmental factors Early adversity 
Later life events 
Education (p) 


Media 


(p) indicates a possible protective factor. 
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Sociodemographic factors 


Self-harm is more common in young people, particularly girls in 
mid to late adolescence. The increase in the incidence of self-harm 
in the early teenage years is striking and may be linked to the pu- 
bertal stage. Underlying mechanisms could be biological or social 
(for example, sensitivity to social cues or propensity to imitative self- 
harm). Self-harm in older adults tends to be associated with a greater 
suicidal intent [21] than in younger age groups, and its incidence 
may be increasing [21]. 

Socio-economic disadvantage is associated with self-harm in 
both high- [22] and low- and middle-income countries [23]. There 
is evidence that rates of self-harm may increase at times of economic 
adversity [18]. Social isolation is also an important factor [22], but 
good communication and family functioning may be protective 
[24]. A 2013 review by Beghi et al. reported a weak association be- 
tween unmarried marital status and repetition of self-harm [25]. 
Although previous studies suggested that young South Asian fe- 
males were at increased risk of self-harm, this has not been borne 
out by more recent studies and reviews [26, 27], which suggested the 
highest risks are in young black females. In older age groups, self- 
harm is most common among the white population. People from 
all minority ethnic groups were less likely to repeat self-harm in a 
large English study [26]. A review of adolescent suicidal behaviour 
in the United States found the highest rates among white youths 
[28]. Religious affiliation is likely to be protective [29], although this 
may be context-dependent—religious minorities may feel socially 
isolated and so may be at greater risk. Sexual orientation is also a risk 
factor for suicide attempt, with the highest risks possibly in gay and 
bisexual men [30]. 


Genetic and biological factors 


A family history of suicide increases the individual’s risk of suicide, 
a finding that may be explained by observational learning or genetic 
predisposition. A genetic explanation is supported by twin studies 
and adoption studies which suggested a heritability of between 30% 
and 50% (or 17% once other psychiatric disorders have been taken 
into account). What is inherited is unknown. No specific genes 
coding for suicidal behaviour have been identified. Instead much 
of the research interest has shifted to the link between early life ex- 
perience and epigenetic mechanisms that could explain some of the 
link between childhood exposures, suicide risk, brain circuitry, and 
neurochemistry. Studies suggested that changes in the polyamine 
and serotonergic systems and the hypothalamic-pituitary—adrenal 
axis might be implicated. These could make certain individuals 
more vulnerable to suicidal behaviour (for a comprehensive recent 
review, see [31]). 


Clinical and psychological factors 


The prevalence of psychiatric disorders in people who self-harm 
varies according to the setting and method of assessment. Using 
diagnostic interviews and research criteria, as many as 90% of those 
who present to hospital may have an Axis I psychiatric disorder, 
with the most common single diagnosis being affective disorder 
(70%) [32]. However, it is possible that psychiatric symptoms at the 
time of an episode are relatively transient and may remit spontan- 
eously in a substantial proportion of cases. A history of psychiatric 
disorder is also important. In a cross-national community study, a 
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prior mental disorder was associated with significantly increased 
risk of developing suicidal behaviours, even after adjusting for 
sociodemographic factors and country of residence. Mood dis- 
orders had the strongest association [33]. 

Alcohol consumption is also an important contributory factor to 
self-harm—both acute ingestion (which may increase the risk of im- 
pulsive behaviour) and longer-term misuse [34]. Physical illness is 
a risk factor for self-harm, perhaps more so for women than men 
[35]. At least for some illnesses (asthma, back pain, diabetes, epi- 
lepsy, hypertension), the association persists after adjustment for 
depression [35]. 

A number of psychological characteristics are more common 
among those who self-harm; these include impulsivity, poor 
problem-solving, perfectionism, and hopelessness [36]. It is possible 
to apply diagnostic labels to some of these characteristics (for ex- 
ample, personality disorder), but this may have the unfortunate con- 
sequence of causing stigma, diverting attention from enabling the 
person to overcome their problems or even leading to them being 
denied help [2]. 


Environmental factors 


Childhood adversity, including early abusive experiences, can pre- 
dispose to self-harm. A recent review suggested that sexual abuse was 
the most important of these antecedents, but that physical abuse and 
domestic violence were also associated with self-harm [37]. A classic 
review found strong cross-sectional associations between childhood 
sexual abuse and self-harm, but the lack of longitudinal prospective 
studies led the authors to conclude that these exposures could not be 
classified as true risk factors [38]. A meta-analysis, which only con- 
sidered non-suicidal self-injury as an outcome, found a weak associ- 
ation with sexual abuse, which largely disappeared after adjustment 
for psychiatric factors [39]. On balance, it seems likely that adverse 
childhood experiences can lead to later self-harm, but this effect may 
be mediated through an association with clinical factors. 

As well as individual exposures, there may also be an association 
between self-harm and the number and type of adverse events that 
an individual reports having experienced during their lifetime [22]. 
Early studies established that life events, particularly interpersonal 
ones, were important immediate precipitants of self-harm [40], and 
this has been confirmed more recently [41]. 

The role of the media in propagating suicidal behaviour, possibly 
by increasing the cognitive availability of self-harm methods or 
normalizing it as a coping strategy, is an additional environmental 
factor and the subject of much published work [42]. Social media 
is a further development, of course, and the subject of considerable 
ongoing research. 


Repetition, suicide, and death from all causes are outcomes of par- 
ticular interest following self-harm. Each will be explored in more 
detail in the rest of this chapter. 


Non-fatal repetition 


A recent meta-analysis of 170 cohort studies and randomized trials 
in people who had presented to health care services with self-harm 
suggested that the pooled incidence of repeat self-harm was 16.3% 


at 1 year and 22.4% at 5 years [43]. Repetition rates based on patient 
self-report, as opposed to hospital databases, were around 5% higher 
at 1 year. There was also some evidence that repetition rates were 
lower in Asian vs non-Asian countries, perhaps due to differences 
in ascertainment or the method of self-harm. Poisoning in South 
Asia has a higher case fatality during the index episode than in the 
West due to the lethality of the substances ingested—for example, 
pesticides. 

The timing of repetition is striking and has important clinical im- 
plications. Most people who repeat self-harm do so soon after they 
present to hospital. Data from the Manchester Self-harm Project 
(Fig. 126.2) suggest that 30% of people who repeat within a year 
will do so within 1 month of presentation and one in ten will repeat 
within 5 days. This means that any aftercare needs to be provided 
without delay. 

Suicide 

The Carroll et al. systematic review [43] found that 1.6% of people 
died by suicide in year after presentation with self-harm, with a 
pooled suicide incidence of 3.9% at 5 years and a persistent risk 
of suicide in the longer term. The risk of suicide was twice as high 
in men than in women, and higher in studies which focused on 
older age groups. Self-poisoning was associated with a lower risk 
of subsequent suicide than other methods. Gender and age to- 
gether explained nearly 70% of the between-study variation. In a 
UK single-centre study, suicide rates were highest within the first 
6 months after the index self-harm episode [44]. 


All causes of death 


In an English study of over 30,000 individuals presenting with self- 
harm to three centres and followed up for an average of 6 years, ap- 
proximately 6% of the cohort died—about four times the number 
of deaths that would be expected according to general population 
mortality rates [45]. The risk of death increased with levels of socio- 
economic deprivation. Diseases of the circulatory and digestive sys- 
tems made a major contribution to natural mortality. This study also 
expressed the risk of death as mean years of life lost—25-26 years 
for natural-cause deaths, and 40 years for external-cause deaths (for 
example, accidents, suicide, and undetermined deaths). 


Outcome of self-harm in young and older people 


The prognosis of self-harm is of great concern to young people and 
their parents. A cohort study of 1800 adolescents in Victoria, Australia 
provided a valuable insight into the natural history of self-harm in 
young people [46]. The researchers found that 4% of the sample re- 
ported self-harm within the previous 12 months at the start of the 
adolescent phase of data collection (mean age of cohort 15.9 years). 
This fell steadily through subsequent waves of data collection, so that 
by the end of the study (mean age of cohort at this stage 29.0 years), 
just 0.8% reported self-harm. Of the young people who self-harmed 
in adolescence, only one in ten continued to do so as young adults. 

Some authors have argued that fatal and non-fatal self-harm are 
more closely related in the elderly than in younger age groups [47]. 
In a large English study of older adults (>60 years), the suicide rate 
was 1.5% within 12 months of self-harm, 67 times higher than in 
the general population of similar age and three times higher than 
in younger self-harm patients in the same cohort. Suicide was most 
common in those aged 75 years and over [48]. 
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Fig. 126.2 Timing of repeat self-harm for people repeating within a year (N = 3078). 


Source: data from the Manchester Self-harm project. 


Risk factors for repetition 


Many studies have examined the factors which might make repeat 
self-harm more likely. In a recent systematic review of 129 pro- 
spective studies, a number of factors were identified as consistent 
predictors of self-harm repetition [49]. Many of these were similar 
to the risk factors for incident self-harm. They included previous 
self-harm, personality difficulties, hopelessness, a history of psy- 
chiatric treatment, schizophrenia, and alcohol and drug misuse. 
However, the sensitivity of individual variables (ability to cor- 
rectly identify those who went on to repeat) varied greatly between 
studies. Other factors were less consistently reported, but it was sug- 
gested they too were linked to increased repetition, for example im- 
pulsivity, comorbidity, poor problem-solving ability, sexual abuse, 
staff attitudes to self-harm, and stressful life events. Self-cutting 
was associated with a higher risk of repetition than self-poisoning. 
Area-based variables, such as deprivation, may also be important 
determinants of suicidal behaviour, but an English study suggested 
an individual’s own characteristics were much more powerful pre- 
dictors of repetition than the characteristics of the area in which 
they lived [50]. 

In a separate review of risk factors for suicide following self- 
harm, updating work carried out as part of the National Institute 
for Health and Care Excellence (NICE) guidelines [51], four risk 
factors emerged from a meta-analysis with robust effect sizes 
that showed little change, even after adjustment for potential 
confounders. These included: previous episodes of self-harm, sui- 
cidal intent, physical health problems, and male gender. However, 
the authors suggested that the factors might be of limited prac- 
tical use because these were common in clinical populations. In a 
large, Manchester (UK)-based study [52], independent risk factors 
for suicide after hospital presentation for self-harm were: avoiding 
discovery at the time of self-harm, not living with a close relative, 
previous psychiatric treatment, self-cutting (vs self-poisoning), al- 
cohol misuse, and physical health problems. The overall adjusted 
population-attributable fraction for the statistical model in the 
study (a measure of the overall proportion of self-harm repeti- 
tion in a population that might be accounted for by the variables) 
was 65%. 





Drawing risk factors together—models of self-harm 


Although risk factors may provide some clues to the aetiology, they 
are unable to explain why self-harm occurs. Explanatory models 
may help us to understand the phenomena better, help to formu- 
late testable hypotheses, and ultimately facilitate the development of 
appropriate treatments. One of the most influential models of sui- 
cidal behaviour is a clinical model—the stress—diathesis or stress- 
vulnerability model [53]. This suggests that certain individuals carry 
with them a predisposition to suicidal behaviour (which may be 
related to sex, religion, familial and genetic factors, childhood ex- 
periences, psychosocial support systems, access to lethal methods, 
or biological factors). The vulnerability only leads to suicidal be- 
haviour when the individual encounters a stressor (which could be 
a mental disorder, alcohol or drug misuse, a medical illness, or a 
psychosocial crisis). Other influential models of suicidal behaviour 
include the ‘cry of pair model (which emphasizes the role of defeat 
and entrapment—the sense there is nowhere else to go, following 
the experience of a major stressor), the interpersonal-psychological 
model (which suggests perceived burdensomeness and thwarted 
belongingness are central to suicidal ideation), and the integrated 
motivational-volitional model, a stress—diathesis model which 
highlights the pre-motivational, motivational (ideation and in- 
tent formation), and volitional (behavioural, enaction) phases of 
suicidality. A full review is provided in [36]. 


Combinations of risk factors considered together in ‘risk assessment 
scales’ are in widespread use. A study from 32 hospitals in England 
found that the vast majority of services used risk assessment tools, 
and in 22 of 32 hospitals, the tools were locally developed and un- 
validated [54]. A number of recent reviews [51, 55, 56] have sug- 
gested that scales are of limited usefulness in terms of predicting 
future outcomes such as repetition and suicide. Positive predictive 
values for suicide ranged from 1% to 17%, with the most relevant UK 
study suggesting a value of 4%. This means that of 100 people identi- 
fied as being at high risk, four will die by suicide. However, a greater 
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number of suicide deaths occur in the large low- and middle-risk 
groups—the so-called population paradox. These may be missed if 
services are organized according to high-risk paradigms. This has 
important implications for management, with some authorities 
going as far as to suggest that viewing risk assessment as risk pre- 
diction is a fallacy. Management decisions should be based on more 
generic models of assessment which emphasize the need to engage 
with patients, rather than tick-box assessments [51]. 

The development of novel risk measures is the subject of ongoing 
research. One approach is to use data-driven methods to derive the 
most important combination of risk factors in a development data 
set and then test the instrument in a separate validation data set [56]. 
Examples of these empirically derived tools include the Manchester 
Self-Harm and ReACT rules and the Repeated Episodes of Self- 
Harm (RESH) score. Although robustly developed, there is no good 
evidence that these tools are any more useful than older scales [55]. 
A different approach is to use neurocognitive or psychological func- 
tioning as a marker of risk. The advantage of some of these measures 
is that they can be used when people are unaware of their intent or 
not articulating it. Traits that may be predictive include deficits in 
attentional shifting, deficits in verbal fluency, and poor decision- 
making [56]. Another novel method—the Implicit Association 
Test—measures the reaction time for people to respond to images 
(related to suicidal behaviour or neutral images) [57]. Shorter reac- 
tion times are thought to be associated with a greater propensity to 
suicidal behaviour. The predictive ability of the test may be improved 
when used in conjunction with other traditional risk factors [56]. 


Which risk factors are the most important 





clinically? 


Although some risk factors are common in clinical populations and 
the predictive value of risk scales is dubious, clinicians clearly need 
to assess and treat patients. Which risk factors are the most clinically 
important? One practice-based review suggested that mental illness, 
alcohol and drug use, current thoughts and plans, gender, and pre- 
vious self-harm, along with possible protective characteristics (such 
as dependent children, supportive family, religious beliefs) were the 
most important factors to take into account in a real-world clinical 
assessment [58]. 
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To understand the biological underpinnings of multi-determined 
behaviours such as suicide and attempted suicide, it is best to con- 
ceptualize them within an explanatory model that describes the 
potential causal pathways and interrelations between biological, 
clinical, genetic, and environmental factors that all play a role in sui- 
cidal behaviour. Where possible, such a model should be clinically 
explanatory, incorporate biological correlates, be testable in both 
clinical and biological studies, and have some utility in identifying 
high-risk individuals. 

One such model is the stress—diathesis model of suicidal be- 
haviour wherein exposure to a stressor precipitates a suicidal act 
in those with the diathesis, or a propensity for suicidal behaviour 
[1]. Stressors are generally state-dependent factors such as an epi- 
sode of major depression or the occurrence of an adverse life event. 
The diathesis comprises trait characteristics of reactive aggression, 
deficits in executive function, negative or rigid cognitive processes, 
impaired problem-solving, social distortions, and recurrent mood 
disorders [2]. Uncovering the biological mechanisms relevant to the 
stress and the diathesis dimensions of suicidal behaviour will facili- 
tate the identification of both enduring and proximal markers of 
risk, and thus potential targets for prevention interventions. 

This chapter gives an overview of the major neurobiological find- 
ings in suicide and attempted suicide, as well as emerging findings 
from studies of genes related to those neurobiological and stress re- 
sponse systems. 





A key biological correlate of the diathesis for suicidal behaviour ap- 
pears to be low serotonergic activity. Abnormal serotonergic func- 
tion may be the result of numerous factors, including genetics, early 
life experience, chronic medical illness, alcohol use disorders, or 
substance use disorder, many of which have been correlated with 
an increased risk for suicidal behaviour. Moreover, serotonergic 
dysfunction may underlie recurrent mood disorders, as well as be- 
havioural traits that characterize the diathesis such as aggression, 
pessimism and dysfunctional attitudes, hopelessness, and deficits in 
decision-making. 

Serotonin is involved in brain development, behavioural regu- 
lation, sleep, mood, anxiety, cognition, and memory and is shown 





to be disturbed in various psychiatric disorders. Serotonergic func- 
tion is under genetic control, and moreover deficits in functioning 
have been shown to be enduring, marking it as a biological trait. The 
serotonergic system became a target for investigation in relation to 
suicide when, more than 40 years ago, Asberg and colleagues ob- 
served that depressed individuals who had either attempted suicide 
by violent means or subsequently died by suicide in the study follow- 
up period were more likely to have lower 5-hydroxyindoleacetic acid 
(5-HIAA) levels in the cerebrospinal fluid (CSF) [3]. Since that time, 
the function of the serotonergic system in suicide and attempted 
suicide has been examined in many paradigms, and while not all 
studies agree, there is substantial consensus that individuals who die 
by suicide, or make serious non-fatal suicide attempts, exhibit a de- 
ficiency in CNS serotonin neurotransmission. 

Evidence of hypofunction comes from CSF and post-mortem 
studies. 5-HIAA is the major metabolite of serotonin, and the level 
of CSF 5-HIAA is a guide to serotonin activity in parts of the brain, 
including the prefrontal cortex. There have been over 20 studies of 
CSF 5-HIAA and suicidal behaviour in mood disorders, and a meta- 
analysis of prospective studies of 5-HIAA found that in mood dis- 
orders, lower CSF 5-HIAA levels increased the chance of death by 
suicide by over 4-fold over follow-up periods of 1-14 years [4]. 

Several post-mortem studies of suicide have reported lower brain- 
stem levels of 5-HIAA and/or serotonin [5-hydroxytryptamine (5- 
HT)] (for a review, see [5]). These deficits in 5-HT or 5-HIAA are 
observable across diagnostic groups [6] and, despite early reports 
to the contrary, appear to be independent of the suicide method. 
A limitation of these studies is that they did not use toxicological 
sensitive assays of brain tissue or psychological autopsy interviews 
to rule out recent antemortem antidepressant exposure which would 
potentially lower 5-HIAA levels. A more recent study that carefully 
ruled out such medication exposure found higher levels of 5-HT and 
5-HIAA in the brainstem of depressed suicides [7]. This observation 
appears to be largely specific to the brainstem, because no differences 
are generally found between suicides and controls in 5-HT level in 
other brain regions, including the hippocampus, occipital cortex, 
frontal cortex, temporal cortex, caudate, striatum, or hypothalamus 
[5]. Serotonin neuron cell bodies are in the brainstem raphe nuclei, 
while their axons innervate most of the brain, including the ven- 
tral prefrontal cortex. Morphological analysis of stained serotonin 
neurons in the brainstem of depressed suicides and non-suicides 


observed greater cell density in the dorsal raphe nucleus in suicides 
[8] and greater expression of the rate-limiting biosynthetic enzyme 
for serotonin—tryptophan hydroxylase—per neuron [9], suggesting 
that reduction in serotonin activity is associated with dysfunctional 
cells, and not with fewer neurons. That observation is consistent 
with the report of more 5-HT in the brainstem of suicides [7]. 

Neuroendocrine challenge studies using fenfluramine provide 
further evidence of anomalous serotonergic function associated with 
suicidal behaviour. Fenfluramine is a serotonin-releasing agent and 
a reuptake inhibitor that may also directly stimulate post-synaptic 
5-HT receptors. The release of serotonin by fenfluramine causes an 
increase in serum prolactin levels that is an indirect index of central 
serotonergic responsiveness. In depressed patients, those with a his- 
tory of suicide attempts have a more blunted prolactin response to 
fenfluramine challenge than non-attempters, with evidence that the 
effect is more related to lethality of past suicide attempts [10]. 

Lower serotonergic transmission in the central nervous system 
(CNS) may be accompanied by a compensatory upregulation of 
some serotonergic post-synaptic receptors, such as 5-HT,, and 5- 
HT, receptors, and a decrease in the number of serotonin trans- 
porters [5]. There is a reported increase in the concentration of 
post-synaptic 5-HT,, receptors in the prefrontal cortex of suicides, 
compared with non-suicides [11]. This increased binding is re- 
flected in more protein and may be due to elevated gene expression 
in young suicides [12]. Elevated 5-HT,, binding has also been re- 
ported in the amygdala in depressed suicides [13]. In depressed and 
non-depressed suicides, there is evidence that 5-HT,, receptors are 
upregulated in the dorsal prefrontal cortex, but not the rostral pre- 
frontal cortex [11]. 

Platelet studies examine 5-HT,, in living subjects with respect 
to non-fatal suicide attempt. 5-HT,, receptors, serotonin reuptake 
sites, and serotonin second messenger systems are present in blood 
platelets, and changes in these platelet measurements may reflect 
similar changes in the CNS. Multiple studies have reported higher 
platelet 5-HT,, receptor numbers in suicide attempters, compared 
with non-attempters and healthy controls [14]. 

Studies of second messengers indicated impaired 5-HT, 
receptor-mediated signal transduction in the prefrontal cortex of 
suicides [15], and in platelets, 5-HT,, receptor responsivity is sig- 
nificantly blunted in patients with major depression who have made 
high-lethality suicide attempts, compared to depressed patients who 
have made low-lethality suicide attempts [16]. The implications of 
such a defect in signal transduction, if present in the brain, would 
be that although there may be greater density of 5-HT,, receptors, 
the signal transduced by 5-HT,, receptor activation may be blunted. 

Some post-mortem studies of the post-synaptic 5-HT|, receptor 
have reported higher binding in the prefrontal cortex and more ros- 
tral segments of the raphe nuclei, and lower binding in the more 
caudal raphe nuclei, hippocampus, prefrontal cortex, and temporal 
cortex [17]. Less 5-HT,, auto-receptor gene expression has also been 
reported in the dorsal raphe [18] and would favour higher serotonin 
neuron firing rates. 

Post-mortem studies of depressed suicides reported fewer 5-HT 
transporters in the prefrontal cortex, hypothalamus, occipital cortex, 
and brainstem [19]. Moreover, in suicides, this deficit appears local- 
ized to the ventromedial prefrontal cortex and anterior cingulate, 
whereas depressed individuals who died of other causes had lower 
binding throughout the prefrontal cortex [20]. 
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The emerging picture from post-mortem studies of greater 5- 
HT, receptor binding in the frontal cortex of depressed individ- 
uals who die by suicide, fewer brainstem 5-HT,, auto-receptors, 
and fewer serotonin transporters in the cortex, as well as findings of 
greater tryptophan hydroxylase (the rate-limiting step in serotonin 
synthesis) immunoreactivity in serotonin nuclei in the brainstem 
[9], all point to homeostatic changes designed to increase deficient 
serotonergic transmission, evidenced by low 5-HIAA in the CSF and 
brain, and blunted prolactin response to fenfluramine challenge. 


Serotonergic dysfunction and suicide endophenotypes 


Reactive aggressive traits are potentially part of the diathesis for sui- 
cidal behaviour [1]. Increased aggression has been associated with 
suicide and more highly lethal suicide attempts, and impulsivity 
has shown a stronger relationship to non-fatal suicide attempts 
[21]. Reduced activity of the serotonergic system has been impli- 
cated in impulsive violence and aggression in studies in a variety of 
paradigms, including: low CSF 5-HIAA levels in individuals with a 
lifetime history of aggressive behaviour with personality and other 
psychiatric disorders [22, 23]; a blunted prolactin response to the 
serotonin-releasing agent fenfluramine in personality disorder pa- 
tients [24, 25]; and greater platelet 5-HT,, binding correlated with 
aggressive behaviour in personality and other psychiatric disorder 
patients [26, 27]. In a post-mortem study of aggression, suicidal be- 
haviour, and serotonergic function, a positive relationship was found 
between lifetime history of aggression scores and 5-HT,, binding in 
several regions of the prefrontal cortex of individuals who had died 
by suicide [28]. 

Positron emission tomography (PET) studies have shown a defi- 
cient response to serotonergic challenge in the orbitofrontal cortex 
and medial frontal and cingulate regions in individuals with impul- 
sive aggression, compared to controls [29, 30], and lower serotonin 
transporter binding in the anterior cingulate cortex in impulsive and 
aggressive individuals, compared to healthy controls [31]. The pre- 
frontal cortex is important in the inhibitory control of behaviour, 
including impulsive and aggressive behaviour [32]. Thus, aggres- 
sive/impulsive traits, related to serotonergic dysfunction, are po- 
tentially an aspect of the diathesis for suicidal behaviour, whereby 
aggressive/suicidal behaviour is manifested in response to stressful 
circumstances or powerful emotions. This tendency might be con- 
ceived of as a diminution in brain inhibitory circuits or as a volatile 
cognitive decision-making trait. 





Noradrenergic system 


According to the stress—diathesis model of suicidal behaviour, it is 
the confluence of stressful events with the diathesis that is thought 
to precipitate a suicidal act. Thus, investigating the functioning 
of stress response systems in suicidal individuals is important for 
elucidating neurobiological concomitants of suicidal behaviour and 
identifying targets for preventative intervention. The noradrenergic 
system and the hypothalamus-pituitary—adrenal (HPA) axis are two 
key stress response systems. 

The majority of noradrenaline neurons in the brain are located 
in the brainstem’s locus caeruleus. Post-mortem studies of suicides 
have documented fewer noradrenergic neurons in the rostral locus 
caeruleus, the part that projects to the brain [33]. There are also 
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indications of cortical noradrenergic overactivity, including lower 
a- and high-affinity B,-adrenergic receptor binding [34], and lower 
B-adrenoceptor density and a,-adrenergic binding in the prefrontal 
cortex in individuals who died by suicide [35]. There is some, but not 
unanimous, evidence from prospective studies of lower CSF levels 
of 3-methoxy-4-hydroxyphenylglycol (MHPG), a metabolite of 
noradrenaline, in future suicides [36], although not in those making 
non-fatal suicide attempts [37]. Low CSF MHPG levels predict both 
the probability of a suicide attempt and the lethality of the attempt. 

Fewer noradrenergic neurons observed in depressed suicides may 
indicate a lower functional reserve of the noradrenergic system, 
which, if accompanied by an exaggerated stress response with a 
greater release of noradrenaline, may result in noradrenaline deple- 
tion, leading to depression and hopelessness, both of which are con- 
tributory factors to suicidal behaviour. 

Noradrenergic response to stress in adulthood appears to be 
greater in those reporting an abusive experience in childhood [38]. 
Such individuals are potentially at greater risk in adulthood for major 
depression and suicidal behaviour. Childhood abuse may be associ- 
ated with an increased risk for depression and suicidal behaviour be- 
cause of a dysfunctional stress response both via the noradrenergic 
system and via the HPA axis, and secondary effects of noradrenaline 
depletion and elevated cortisol levels. There is interaction between 
the noradrenergic system and the stress response activity of the HPA 
axis, with reciprocal neural connections between corticotropin- 
releasing hormone (CRH) neurons in the hypothalamic paraven- 
tricular nucleus and noradrenergic neurons in the human brainstem 
and locus caeruleus [39]. 





The hypothalamic-pituitary-adrenal axis 


The HPA axis is a major stress response system. Major depression 
is associated with hyperactivity of the HPA axis [40], and suicidal 
patients in diagnostically heterogenous populations exhibit HPA 
axis abnormalities, most commonly failure to suppress cortisol nor- 
mally after dexamethasone [36]. We found most future suicides 
were dexamethasone suppression test (DST) non-suppressors [4]. 
In mood disorders, DST non-suppressors had a 4.5-fold greater risk 
of dying by suicide, compared with suppressors [4]. Moreover, non- 
suppression may be characteristic of more serious attempts that re- 
sult in greater medical damage [41, 42] or the use of violent methods 
in the suicide attempt [43]. In other indices of HPA axis function, 
suicide attempters had attenuated plasma cortisol responses to 
fenfluramine, although that may indicate less serotonin release, and 
not an HPA abnormality [44, 45], and lower CSF CRH levels, com- 
pared to non-attempters [46], though not all studies agree. 

Larger pituitary and adrenal gland volumes have been reported 
in depressed suicides [47, 48] and fewer CRH binding sites in the 
prefrontal cortex of depressed suicide victims, which may mean 
receptor downregulation due to elevated corticotropin-releasing 
factor (CRF) release [49]. 

As with the noradrenergic system, early life adversity appears to 
have lasting effects on stress response in the HPA axis in adulthood. 
Abnormalities in HPA axis function have been implicated in poor re- 
sponse to drug treatment and a greater likelihood of relapse in major 
depression, both of which increase the risk for suicidal acts [36]. 
Increased anxiety and agitation reflect another potential pathway 


whereby an abnormal stress response, in both the noradrenergic and 
the HPA axis, contributes to risk for suicidal behaviour. 


Other biologic systems 


Abnormality in the dopaminergic system has been reported in de- 
pressive disorders [50]. However, studies of dopaminergic function 
and suicidal behaviour are relatively few and inconclusive [5]. Low 
dihydroxyphenylacetic acid levels, indicative of reduced dopamine 
turnover, in the caudate, putamen, and nucleus accumbens have 
been reported in depressed suicides [51], although the same group 
of investigators found no difference in number or affinity of the 
dopamine transporters [52]. Accordingly, it is unlikely that the re- 
duced dopamine turnover initially observed in depressed suicides 
is a result of decreased dopaminergic innervation of those regions. 
Prospective studies disagree as to whether CSF homovanillic acid 
(HVA) levels predict suicidal behaviour [53-55]. Low CSF HVA 
levels may be related to higher lethality suicide attempts in males. 

There is a well-documented relationship between thyroid dys- 
function and depression [56], and some studies have linked thy- 
roid function and suicide. Abnormal thyroid-stimulating hormone 
(TSH) response to thyrotropin-releasing hormone (TRH) has been 
observed in individuals who died by suicide in a follow-up study 
[57]. Abnormal TSH response to challenge tests has also been asso- 
ciated with poor response to antidepressant treatment and a higher 
relapse rate which may increase the risk for suicidal behaviour [58]. 

Neurotrophins are involved in brain development and growth, 
neuronal functioning, and synaptic plasticity. Lower protein levels 
and gene expression of brain-derived neurotrophic factor (BDNF) 
in the prefrontal cortex and hippocampus [59, 60] and less mRNA 
of nerve growth factor, neurotrophin 3, and neurotrophin 4/5 in 
the hippocampus [61] have been reported post-mortem in suicides. 
Lower plasma BDNF levels have been reported in major depres- 
sive disorder (MDD) suicide attempters, compared to MDD non- 
attempters and healthy controls [62]. 

Suicide is more common in groups with very low cholesterol 
levels or after cholesterol lowering by diet (for a review, see [63]). 
This relationship between cholesterol levels and suicide may be me- 
diated by serotonergic function, as studies of non-human primates 
on a low-fat diet found lower serotonergic activity and increased 
aggressive behaviours [64]. Long-chain polyunsaturated fatty acids, 
particularly omega-3 [docosahexaenoic acid (DHA)], may also be a 
mediating factor in the relationship between low cholesterol levels 
and increased risk for depression and suicide [65]. A decrease in 
DHA concentration was associated with a 14% increased risk for 
suicide death for every standard deviation decrease in DHA level in 
a large study of military suicides in the United States [66]. However, 
other post-mortem studies did not find any association [5]. Lower 
DHA percentage of total plasma polyunsaturated fatty acids and a 
higher omega-6:omega-3 ratio predicted depressed individuals who 
made a suicide attempt during a 2-year follow-up [67], and lower ei- 
cosapentaenoic acid is found in red blood cells of suicide attempters, 
compared to controls [68]. 

Other potential candidate neurobiological systems for investiga- 
tion with respect to suicidal behaviour come from genetic studies. 
One novel approach examined the correlations between candidate 
genes and gene expression in the prefrontal cortex of suicides, with 


the aim to identify correlations that were independent of mood dis- 
order diagnosis [69]. That study identified genes related to suicidal 
behaviour which had not been previously identified in candidate 
gene studies of suicide, but which did overlap with the broad biologic 
domains identified in earlier expression studies in suicide, including 
CNS development, homophilic cell adhesion, regulation of cell pro- 
liferation, and transmission of nerve impulse [69-71]. Other re- 
cent work points to potential involvement of the immune system, 
for example CD44, a gene connected with the immune system that 
has been identified in three studies of suicides, and Galfalvy et al. 
additionally found its expression in both BA9 and BA24 was signifi- 
cantly low [69]. Immune system dysregulation has been reported 
in major depression [72, 73], but there has been little investigation 
of its role in suicide. The inflammatory and neurodegenerative hy- 
pothesis of depression hypothesizes that neurodegeneration and 
reduced neurogenesis are caused by inflammation, cell-mediated 
immune activation, and their long-term sequelae [74]. Disordered 
neuroimmune function may be present in suicide, and its pathogen- 
esis related to genes identified in these studies. Finally, another area 
of emerging interest is a possible link between suicide and allergic 
reactions that may alter the function of the orbital prefrontal cortex 
[75, 76]. 





Neurobiology, genetics, and suicidal behaviour 


Family, twin, and adoption studies support a genetic contribution 
to suicidal behaviour, independent of psychiatric disorder (for a 
review, see [77]), and genetic studies have sought to determine the 
responsible genes for suicide and suicide attempt though linkage 
and single-nucleotide polymorphism (SNP) association studies. 
Candidate genes for most studies were selected based on evidence 
from neurobiological studies in suicide, as a result of which the 
serotonergic system has been the most extensively investigated. 
A tri-allelic polymorphism in the serotonin transporter pro- 
moter has two alleles with lower transcriptional activity and fewer 
transporters. In varied psychiatric populations, despite some 
negative findings, the S, or the more common lower-expressing 
allele, has been associated with suicide and suicide attempts, 
particularly violent or high-lethality attempts [78]. Functional 
MRI studies have found greater amygdala activation in individ- 
uals with the SS genotype when they were exposed to negative 
stimuli such as angry or fearful faces, negative words, or aversive 
pictures (for a review, see [79]). The amygdala is densely innerv- 
ated by serotonergic neurons and 5-HT receptors are abundant, 
and the amygdala plays a central role in emotional regulation and 
memory. Excessive responses to emotionally negative events, such 
as abuse, may be over-encoded and contribute to stress sensitivity 
in adulthood, and thereby to major depression after stress and 
even suicidal behaviour. 

Other genetic studies of the serotonergic system, including 5- 
HT œ 5-HT,,, 5-HT,,, and other serotonin receptors, have largely 
reported negative results, although there have been some positive 
findings for the 5-HT,, 102C allele and attempted suicide or sui- 
cidal ideation [78]. For tryptophan hydroxylase (TPH1 and TPH2 
are the two forms of tryptophan hydroxylase, with TPH1 only 
expressed in the brain during development), associations have 
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been reported with suicide and suicide attempt and TPH1 SNPs; 
however, multiple negative findings have also been reported. 
Haplotype and SNP studies have suggested the involvement of 
the TPH2 gene in suicide and suicide attempt; however, again not 
all studies agree [78]. Monoamine oxidase A (MAO-A) plays a 
key role in the metabolism of amines. Low MAO activity results 
in elevated levels of serotonin, noradrenaline, and dopamine in 
the brain. The MAO-A gene has functional variable number of 
tandem repeats; however, no association has been found between 
this upstream variable number of tandem repeats (uVNTR) and 
suicidal behaviour, although there is some indication that it may 
be related to aggression [78], and it is linked to the impact of ad- 
versity in childhood on adult antisocial behaviour and trait im- 
pulsiveness [80, 81]. 

Geneticstudiesonthe dopaminergic system, noradrenergicsystem, 
and BDNF are few and generally negative [78], although there are re- 
ports of a positive association of the catechol-O-methyltransferase 
(COMT), a major catecholamine-catabolic enzyme, gene in Finnish 
and Caucasian suicide attempters [82] and Japanese suicides [83]. 
A recent study implicates gamma aminobutyric acid (GABA)- and 
glutamate-related genes and suicidal behaviour [84]. Inconsistent 
findings in genetic studies of suicidal behaviour may be due to the 
complexity of the suicide phenotype, gene-gene interactions, the 
presence of multiple psychiatric disorders, population racial dif- 
ferences, possible epigenetic effects, and the influence of gene/ 
environment interactions. Nonetheless, new microarray technolo- 
gies that test the expression of thousands of genes simultaneously, 
allowing better gene coverage, and haplotype mapping approaches 
offer promise for future investigation. Other options include exam- 
ining more basic endophenotypes such as mood regulation and 
decision-making. 


Genes and environment 


Early life stress, in conjunction with genetic vulnerability, can have 
enduring effects into adulthood and affect psychopathology and 
the functioning of biologic systems, including the serotonergic 
and stress response systems [85]. For example, monkeys exposed 
to maternal deprivation in infancy and having the 5-HTTLPR 
lower-expressing S allele in the serotonin transporter gene mani- 
fested a lowering of CSF 5-HIAA levels that persisted into adult- 
hood [86]. In 6-month-old macaque monkeys exposed to social 
stress, those with the S allele had a higher adrenocorticotrophic 
hormone (ACTH) response, an HPA axis hormone related to stress 
response, compared with those without that allele and S allele ani- 
mals who were maternally reared [87]. Thus, the low-expressing 
S allele not only increased vulnerability to stress in development, 
but early life stress may also further interact with the genotype 
to lower the serotonergic function and increase the sensitivity to 
stressful events later in life, both of which are risk factors for sui- 
cidal behaviour. 

In human studies, among individuals who had experienced child- 
hood maltreatment, those with the low-expressing S allele were at risk 
for suicidal ideation and suicide attempt [88], and those with a lower- 
expressing variant of the MAO-A gene were more likely to manifest 
antisocial behaviour and more impulsivity as adults [80, 81]. 
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Future directions 


There is much still to be learnt about the biologic aetiology of sui- 17. 
cidal behaviour and the pathways and mechanisms through which 
biologic dysfunction is involved in suicidal acts. New techniques for 
imaging the brain, identification of basic intermediate phenotypes, 
and denser gene markers will contribute to elucidating the biologic 
factors and mechanisms involved in suicide and attempted suicide 
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The World Health Organization (WHO) estimates that approxi- 
mately 800,000 people die by suicide worldwide every year and 
regards suicide prevention as a major public health priority [1]. 
However, unlike other major causes of death such as cancer or heart 
disease, suicide is a behaviour, not a diagnosis, with a myriad of con- 
tributory factors. Although 87-91% of people who die by suicide 
are thought to have had a diagnosable mental disorder [2, 3], an 
understanding of the suicidal mind requires consideration of other 
key factors underlying a person’s decision to take their life. Access to 
methods of suicide, cognitive style, social problems, and social mod- 
elling, for example, are all targets for interventions that interrupt the 
pathway from suicidal ideation to suicide attempt [4]. Therefore, 
a number of population and high-risk approaches have been em- 
ployed, with varying degrees of success, and these are summarized 
in this chapter. 

The design of interventions to prevent suicide and respond to 
those who self-harm is founded on an understanding of recent 
international trends in suicide and self-harm. These have been de- 
scribed in Chapters 125-127 in this section. The changing nature of 
the epidemiology of suicide, both at a national and an international 
level, means that interventions can become outdated and may fail 
to target groups that have become at risk. This requires ongoing 
surveillance of patterns of suicide, translated into appropriate and 
effective new intervention approaches. Most notably, emerging 
methods of suicide require a rapid response in terms of restricting 
access to means. 

It is important to note that conventions regarding suicide and self- 
harm nomenclature differ both temporally and internationally. For 
simplicity, in this chapter, the term ‘self-harm is used to include any 
act of intentional non-fatal self-poisoning or self-injury, irrespective 
of the motive or degree of suicidal intent [5]. The term ‘suicide is 
used in the place of completed suicide or fatal suicide attempt. Given 
the relationship of suicide to self-harm, this chapter should be read 
in conjunction with Chapter 126 describing the epidemiology and 
management of self-harm. 


The WHO recommends that for national responses to suicide to 
be effective, a comprehensive multi-sectoral suicide prevention 
strategy is needed [1]. Their emphasis is on: restricting access to 
means of suicide, including pesticides, firearms, and certain medi- 
cations; early identification and effective management of psychiatric 
disorders and harmful alcohol use; and social support for vulnerable 
individuals in communities. High-income countries have devel- 
oped suicide prevention strategies that combine broad population 
or universal approaches with complementary high-risk or targeted 
prevention interventions, discussed further later in this chapter, 
with evidence favouring population strategies [6]. Relatively rapid 
changes in the epidemiology of suicide require suicide prevention 
strategies to be able to respond to emergent high-risk groups [6]. 
They must also be able to meet the needs of the diverse groups at 
risk of suicidal behaviour, ranging from those with experiences of 
early trauma in childhood, people in specific occupational groups, 
those facing acute life events, to those with debilitating mental illness, 
including those with prior self-harm. Interventions intended to reduce 
the risk of suicide in these groups are likely to have their effects medi- 
ated via improvements in wider mental health and social functioning. 
Because the risk of suicide varies by ethnic group [7], the effectiveness 
and cultural acceptability of specific suicide prevention interventions 
are likely to vary in different cultural settings. It is therefore important 
that tailored suicide prevention programmes are developed both na- 
tionally and for specific cultures sub-nationally. For example, among 
younger age groups, the male excess of suicides observed internation- 
ally is reversed in India and China [8], identifying young women in 
rural areas as requiring tailored suicide prevention interventions. 
While ethical issues in relation to suicide prevention are not 
covered in detail in this chapter, it is important that they are recog- 
nized. The concept of rational suicide and whether suicide should 
always be prevented is particularly contentious. For example, in the 
Netherlands, where euthanasia or assisted suicide (EAS) is legal, 
EAS of psychiatric patients tends to occur in patients with com- 
plex and chronic psychiatric, medical, and psychosocial histories, 
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predominantly women, and usually after discussion with multiple 
physicians, however often with disagreement between them [9]. 


Principles of prevention 


Broadly there are two approaches to suicide prevention (Box 128.1). 
One can distinguish between population approaches, which aim 
to decrease risk in the population as a whole, and high-risk group 
strategies, in which specific groups that are at increased risk are tar- 
geted. High-risk group strategies often appear more attractive and 
realistic. However, risk factors for many disorders are widely spread 
in the population and so the high-risk strategy tends to exclude a 
large number of people at moderate risk and is often ineffective in 
reducing the burden of a disease at the population level. Conversely, 
population strategies may appear more difficult to achieve but are 
more likely to be effective in reducing population levels of disease 
(see also Chapter 125). 

It is unclear to what extent national suicide prevention pro- 
grammes are effective, although evidence of effectiveness for spe- 
cific components of such strategies is emerging [11]. An impressive 
programme, developed in Finland, was based on information from 
a detailed national study of all suicides in 1 year and included a wide 
range of elements [12]. The programme was evaluated as relatively 
successful [13]. In England, a national suicide prevention strategy 
with a suicide target was introduced in 2002 [14], followed by a 
revised strategy in 2012, although this time without a target [15]. 
Strategies have now been introduced in a large number of countries 
[1]. While prevention strategies are difficult to evaluate [16], there 
are indications that programmes for prevention of suicide on a na- 
tional scale may be effective. 

It is also important to note that suicide prevention is age-specific, 
and preventive interventions in different age groups are likely to 
differ. 


Box 128.1 Examples of strategies for prevention of suicide and 


attempted suicide 





Population strategies 

e Reducing the availability of means for suicide” 

e Educating primary care physicians 

e Influencing media portrayal of suicide 

e Educating the public about mental illness and its treatment 
e Educational approaches in schools’ 

e Befriending agencies and telephone helplines 


High-risk strategies 

Prevention of suicide in: 

e Patients with psychiatric disorders 
e Specific age groups 
e Suicide attempters 
e High-risk occupational groups 
e Prisoners 
e People bereaved by suicide 

“Denotes those with established evidence supporting effectiveness [9, 10] 




















Population strategies 
Reducing availability of means for suicide 


This is the most widely discussed population strategy [17]. It is 
based on evidence that if the availability and/or danger of a popular 
method for suicide changes, then this tends to have an impact on 
suicide rates. The general principles of prevention through reducing 
the availability of means are firstly that if a dangerous means is avail- 
able, then acts of self-harm are more likely to result in death, espe- 
cially when they occur impulsively, and secondly that the eventual 
suicide rate in survivors of serious attempts is remarkably low. Also 
the common adage that if people are intent on dying by suicide they 
will find a means is not borne out by the evidence on the effective- 
ness of means restriction. The success of policies to restrict access to 
specific means will depend on what other means are available, their 
relative lethality, and the extent to which substitution of a method 
occurs. The following are some examples of impacts of means re- 
striction policies. 


Gas 


Perhaps the clearest example of the potential effectiveness of re- 
stricting access toa common method of suicide was the reduction in 
suicides in the UK which occurred in the 1960s and early 1970s when 
toxic coal gas supplies were gradually replaced with non-toxic North 
Sea gas [18]. Prior to this time, coal gas poisoning through people 
placing their head in a gas oven was the most common method of 
suicide in the UK. As North Sea gas was gradually introduced, the 
suicide rate dropped steadily, eventually being reduced by approxi- 
mately a third [18]. It is estimated that as many as 6000 deaths may 
have been prevented by this change [18]. The effect also illustrates 
the point that when one method of suicide is no longer available, 
people do not automatically turn to another, or if they do, it may be 
to one that is less likely to cause death. Thus, it was some years before 
the suicide rate rose again, this being related to an increase in deaths 
from poisoning with carbon monoxide from car exhausts. Another 
factor that may have been relevant to the decline in suicides was the 
reduction in prescribing of barbiturates, these being replaced by far 
less toxic benzodiazepines. 

Suicide by carbon monoxide poisoning from car exhausts used to 
be a common method. However, this became less common as cars 
were fitted with catalytic converters, which detoxify the exhaust gas. 
This resulted in a decline in suicide rates in countries where this 
method of suicide had become more common, particularly in young 
males [19]. 


Firearms 


The widespread availability of guns in certain countries, particularly 
the United States, has been proposed as an important reason for 
their relatively high suicide rates. Guns are used in more than half of 
all suicides in the United States, and their use for suicide correlates 
with the holding of gun licences in households [20]. Some contro- 
versy surrounds the question of whether restricting the availability 
of firearms through, for example, making gun purchase more diffi- 
cult and improving firearm storage in households leads to a reduc- 
tion in suicide rates, but the weight of evidence seems to indicate 
that it does [11]. 
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Analgesics 


In the UK and some other countries, there has been particular 
concern about deaths from self-poisoning with paracetamol (acet- 
aminophen). Due to evidence that countries which have fewer 
tablets per pack seem to have a lower rate of mortality from para- 
cetamol self-poisoning and because overdoses of paracetamol are 
often taken impulsively and involve household supplies, legislation 
was introduced in the UK in 1998 to reduce the number of tablets of 
paracetamol (and aspirin) available per pack. This resulted in fewer 
overdoses, decreased cases of hepatotoxicity due to paracetamol tox- 
icity, and a reduced number of deaths from both paracetamol and 
aspirin (21, 22]. 

Also in the UK, withdrawal of the analgesic co-proxamol (para- 
cetamol combined with the highly toxic opiate dextropropoxyphene) 
in 2005-2007 resulted in virtual elimination of deaths due to self- 
poisoning with this drug and no evidence of an increase in deaths 
involving other analgesics, at least until the end of 2010 [23]. 


Popular sites for suicide 


Much attention has been paid to improving safety at popular sites 
for suicide. This includes, for example, erecting suicide barriers on 
bridges, multi-storey car parks, and other sites. For example, erec- 
tion of barriers on the Clifton Suspension Bridge in Bristol in the 
UK, a popular site for suicide, has resulted in far fewer deaths by 
jumping [24]. It has been estimated that safety measures at such sites 
may reduce suicide considerably [25]. 


Pesticides 


Ingestion of pesticides is common in several low- and middle- 
income countries in which there are large numbers of small rural 
farming communities. It is thought to be responsible for a large 
proportion of suicides globally [1]. Banning sales of more toxic pes- 
ticides is one approach that has been shown to have a profound ef- 
fect on national suicide rates. Safer storage of pesticides in domestic 
settings and improved emergency access to medical treatment are 
other approaches. 


Hanging 


Hanging is an extremely common method of suicide in many coun- 
tries. This may reflect the decreased availability of other methods, 
together with the frequency with which it is portrayed in the media. 
Prevention in terms of restricting access to means is difficult, ex- 
cept in institutional settings. Making psychiatric hospital wards 
safer through removing ligature points and introducing collapsible 
curtain rails may reduce hanging deaths in this setting [26]. Similar 
safety measures in prisons may also be helpful. 

Clinicians involved in the development of suicide prevention 
strategies should look very carefully at local patterns of suicide 
which might provide clues about potentially effective measures for 
reducing access to methods. This includes prescribing less toxic 
medication where feasible, especially to patients at risk of self-harm. 


Attempts to limit contagion effects 


Exposure to suicidal behaviour of other people can increase the risk 
of suicide and self-harm. This can result in suicide clusters where a 
greater than expected number of suicides occur in a locality or set- 
ting in a particular time period (so-called ‘point clusters’). Clusters 
may be temporal where, for example, there is a spike in the number 


of suicides that are geographically unrelated, but similar in terms 
of methods and characteristics of individuals involved. Clusters of 
suicides and of self-harm are not infrequent in young people and 
in institutional settings such as psychiatric hospitals, schools, and 
prisons [27]. In planning local suicide prevention strategies, it is 
important to prepare policies to address such clusters, with, for ex- 
ample, identification of the agency that will co-ordinate the local re- 
sponse, a plan for addressing media responses to the cluster, and a 
policy for identifying people at risk and providing help [28]. 


Influencing media portrayal of suicidal behaviour 


A growing, if mixed, evidence base demonstrates that irresponsible 
reporting and portrayal of suicide in fictional or non-fictional con- 
texts can give rise to imitative suicidal behaviour [29, 30], through a 
process of social modelling [31]. Practices such as featuring a suicide 
on the front page of a newspaper, repeated coverage, speculation 
about the triggers, and publishing details of the method, location, or 
suicide note are thought to increase the risk of suicidal behaviour in 
vulnerable people. 

This effect appears to be particularly powerful in relation to the 
death by suicide of entertainment and political celebrities [32] and 
in young people [29]. Conversely, reporting that highlights the 
ability of suicidal individuals to cope positively in adverse circum- 
stances may be negatively associated with suicide rates [33] and is to 
be encouraged. 

In view of these concerns, prevention agencies in several countries 
have attempted to work with the media to reduce sensationalism and 
inappropriate language when reporting or portraying suicide or self- 
harm to minimize possible negative influences. There is mixed evi- 
dence for the effectiveness of such media activity and of guidelines 
[34], and little in the way of sanctions for those who do not adhere to 
them [35]. Experiences of working with media agencies in Europe, 
Australia, and New Zealand suggest that reactive and punitive ap- 
proaches, such as enforcing regulatory agencies’ relatively weak 
penalties for breaching guidelines, are unlikely to be as effective as 
collaborative approaches [35]. Mental health professionals are en- 
couraged to work with the media, where opportunities arise, to pro- 
mote responsible reporting of suicide and mental ill-health and to 
challenge stigma. 

With the growth of new media, the sheer volume of online and 
print content overwhelms most systems of surveillance, and it often 
falls on the general public to take a ‘name and shame approach to 
promote journalistic change. The Internet also offers multiple fora 
for people to describe or encourage specific suicide methods. If not 
moderated appropriately, there is a risk of unchecked encouragement 
of suicide in vulnerable individuals. Attempts to build alert systems 
into sites such as Facebook and Twitter have not always been suc- 
cessful. However, use of search engine optimization and advert 
targeting has the potential to promote supportive organizations such 
as the Samaritans. 


Education of the public about mental illness, its treatment, 
stigma, and help-seeking 


Given estimates that 87-91% of people who die by suicide have a 
diagnosable mental disorder [2, 3] and that the distress caused by 
mental illness is a key mediator of suicidality, optimizing treatment 
of underlying mental health problems may contribute to decreasing 
suicide mortality rates. The same might be said for self-harm, given 
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that up to 84% of patients who present to hospital having self- 
harmed are thought to have an underlying psychiatric diagnosis 
[36]. Optimal treatment for mental disorder starts with detection 
and referral by general practitioners (GPs)—described later in this 
chapter—but also the willingness of patients in distress to consult a 
health professional. The stigma of mental illness is a major barrier 
to help-seeking, as is ignorance about mental illness and the poten- 
tial to treat it. The contribution of this to the reluctance of men and 
young people to seek formal help is a particular concern in the UK, 
given high rates of suicide in young and middle-aged men [8]. It has 
also been suggested that the stigma of mental illness contributes to 
suicidality, although this may be mediated by correlates of stigma 
such as social isolation, unemployment, hopelessness, or stress [37]. 
Public education about the recognition and treatment of mental 
illness is one approach intended to reduce stigma and increase self- 
presentation to health services among people suffering from mental 
distress. The intended effects of such programmes are to reduce 
stigma stress, increase detection and treatment of mental disorder, 
and reduce suicidal behaviour. Again, such programmes are hard to 
evaluate, given that mortality outcomes are so distal and subject to 
intercurrent socio-economic influences. Any effect on suicide rates is 
likely to be mediated by more proximal outcomes such as rates of de- 
pression, anxiety, and social functioning. A review of international evi- 
dence describing the impact of anti-stigma campaigns found modest 
evidence for their effectiveness in increasing knowledge and reducing 
stigmatizing attitudes, limited to short-term effects [38]. Harnessing 
the Internet to extend the reach of anti-stigma messages has the po- 
tential to penetrate specific groups and to reset social norms that have 
tended to stigmatize those who struggle with mental ill-health. 


Education of primary care physicians 


Primary care physician education in the recognition and treatment 
of depression had previously been regarded as one of the suicide 
prevention interventions for which there was relatively strong evi- 
dence for effectiveness [10]. This was based on the experiences of 
the Swedish island of Gotland [39-41]. However, despite short-term 
benefits in terms of proactive treatment of depression and a reduc- 
tion in suicides [39], these effects are now understood to be short- 
lived [40] and more marked in women [41]. An updated systematic 
review noted the lack of trial data published since the Gotland ini- 
tiative, with weak evidence from ecological studies that primary 
care physician education might reduce suicide rates in Hungary, 
Slovenia, and Sweden [11]. 

Primary care is regarded as the setting in which there is great po- 
tential to identify mental illness and prevent suicide, given that in up 
to 75% of suicide cases in the United States [42] and up to 63% of cases 
in the UK [43], a primary care presentation occurred in the year be- 
fore death. However, this relies on primary care practitioners making 
a diagnosis of depression (or other mental disorder) and initiating 
acceptable evidence-based treatment. Additionally, psychological 
autopsy studies indicate that the population attributable fraction for 
a mental disorder in suicide ranges from 47% to 74% [2]. Predicting 
which patients are most likely to attempt suicide goes beyond what 
can be expected from any clinician, given our current understanding 
of the poor predictive value of risk prediction tools [44, 45]. 

Given the wider benefits demonstrated in the Gotland study, GP 
educational interventions might be regarded as beneficial to public 
mental health, even if the impact on suicide rates is less marked. 


Educational approaches in schools 


School or college interventions are usually universal in nature, as 
opposed to targeting those most at risk of youth suicide. 

There have been three broad approaches in trying to prevent 
suicide through school-based programmes. ‘The first of these has 
tended to focus on increasing pupils’ knowledge and awareness of 
suicide. Some evidence from the United States showed that such a 
programme appeared to lead to a small increase in pupils’ ratings of 
the acceptability of suicide, although it is not known if this was asso- 
ciated with any change in the rates of self-harm or suicide. 

Suicidal behaviour in young people often appears to be related to 
depression, anxiety, low self-esteem, difficulties during upbringing 
(for example, abuse, deprivation), life events (especially break-up of 
relationships, family problems, and bullying), and poor problem- 
solving skills [46]. Troubled and suicidal young people often seek 
help from their peers. A second school-based strategy has been the 
development of educational programmes in schools about recogni- 
tion of psychological distress in individuals and their peers, problem- 
solving, and peer support. A trial of a Youth Aware of Mental Health 
Programme intervention significantly reduced (by more than 50%) 
the incidence of suicide attempts and suicidal ideation at 12 months’ 
follow-up, compared with the control group. Similar reductions 
were not associated with a manualized gatekeeper programme or 
screening for high-risk individuals by professionals [47]. This sug- 
gests that changes in suicidal behaviour are more likely to occur 
when pupils are personally engaged in the intervention. 

A third approach is to screen adolescents with questionnaires to 
detect children and adolescents at risk of psychiatric disorder and 
possible suicidal behaviour. Pupils that are so detected will then 
need referral to an appropriate agency for further assessment and 
possible treatment. While this approach was not found to be asso- 
ciated with any reduction in suicidal behaviour, when compared to 
no intervention, it is likely to be important in the early detection 
and treatment of pupils with emerging psychopathology. (Suicide 
in children and adolescents is considered further in Chapter 125.) 

For psychiatrists and others involved in developing local preven- 
tion strategies, it is important to recognize that school-based ap- 
proaches to prevention constitute a highly sensitive area and one 
where the most effective (and least risky) approach is at present 
unclear. Another important aspect of suicidal behaviour in school 
pupils is management of the aftermath of suicides and its impact on 
other pupils (see postvention in People bereaved by suicide, p. 1309) 
and how to tackle outbreaks of self-harm [46]. 


Telephone helplines and Internet-based support 


Some individuals who attempt suicide will find the anonymity of a 
telephone helpline or an Internet forum more acceptable than con- 
sulting health professionals about their problems. Others prefer to 
use leaflets or the Internet to access information on self-management 
of suicidal thoughts. The voluntary sector plays a key role in the pro- 
vision of Internet-based information and non-judgemental support 
for those who feel suicidal. This might be in the context of social 
isolation, mental health problems, relationship problems, or other 
social issues. The model used by Samaritans is one of the best known 
and, over the years, has evolved from a drop-in service to include 
a service accessible via telephone, letter, or email. Other telephone 
helplines offering support to suicidal people are now widely available 
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in Australia, Canada, the United States, and northern Europe. 
Freephones may be installed at sites frequently used as a location 
for suicide [48], and helplines may be advertised on websites in a 
targeted fashion where suicidal language is detected on web posts. 

Survey methods have demonstrated a high level of satisfac- 
tion with suicide helplines, with users tending to discuss suicidal 
thoughts and mental health problems and to use the service as part 
of a range of support sources [49]. However, methodological prob- 
lems make it difficult to trial the effect of telephone helplines and 
other Internet resources on suicide rates. Until randomized con- 
trolled studies are conducted of callers to suicide helplines, with suf- 
ficient follow-up, it remains uncertain whether suicide helplines are 
effective. Their effects on the mediators of suicide risk, such as social 
isolation, severity of depression, and impaired problem-solving, 
may be easier to evaluate. 


Gatekeepers 


The term gatekeepers refers to ‘individuals in a community who 
have face-to-face contact with large numbers of community mem- 
bers as part of their usual routine’ [50]. They are usually trained in 
the identification of those at risk of suicide and then in referring 
them for support and treatment. This approach has been used in 
schools and the military, as well as in primary care and emergency 
department settings. The use of gatekeeper programmes is popular 
in many countries. Observational data suggest there is only limited 
evidence that such approaches serve to reduce suicides or suicide 
attempts [11], and there have been no randomized controlled trials. 


Strategies for high-risk groups 


A wide range of prevention strategies can be targeted at high-risk 
groups. 


Patients with psychiatric disorders 
Risk identification and reduction 


All psychiatric disorders carry an increased risk for suicidal behav- 
iour. Those with the highest suicide risk are those with borderline per- 
sonality disorder, anorexia nervosa, depression, and bipolar disorder 
[51]. Risk of suicide appears to be highest in the first few months fol- 
lowing a diagnosis across all mental disorders. Other risk factors for 
suicide include previous attempts, family history of suicidal behav- 
iour, and living alone. Comorbidity of alcohol abuse, personality dis- 
order, and other psychiatric disorders also increases the risk. There is 
a robust association between mood instability and suicidal ideation, 
irrespective of the diagnosis, and this association persists even when 
accounting for mood, anxiety, and substance abuse [52]. 

One difficulty in using a risk identification approach is that the 
risk factors identified from studies of groups of people who have 
died from suicide are often misleading when applied to individual 
patients. Many of the identified risk factors apply to a relatively large 
number of individuals (for example, those living alone), the majority 
of whom will not be at risk. In clinical practice, it is important to be 
aware of patients who, due to their individual characteristics, are at 
long-term high risk, as well as the dynamic factors which may serve 
to increase risk (for example, drugs and alcohol). 


Risk scales 


There is increasing evidence that scales for predicting the risk 
of suicide and non-fatal suicidal behaviour are ineffective at the 


individual patient level [44] and that they should not be relied on 
to guide patient management [53]. The most pragmatic approach 
is to ensure that proven effective treatments for patients with psy- 
chiatric conditions are available, to focus on risk reduction in all 
patients, and to be particularly cautious at times of apparent high 
risk. The first few weeks after psychiatric hospital discharge are 
a key risk period [54]. In a UK study, a third of suicides among 
psychiatric patients occurred within 3 months of discharge from 
hospital, almost half of whom died within the first month [55]. 
Other risk periods follow a relationship break-up or other signifi- 
cant loss, shortly after discharge from hospital, and following re- 
cent suicidal behaviour by another patient or someone else close 
to the individual. 


Preventative strategies 


Prevention of suicide in patients with psychiatric disorders should 
be a major element of suicide prevention strategies [56]. Important 
strategies in preventing suicide in patients with psychiatric disorders 
include active treatment of individual episodes of illness, psycho- 
logical therapy to improve compliance with treatment, review of 
access to means, and encouraging self-management. 


Pharmacological treatments 
Antidepressants 


The role of drugs licensed for depression in the treatment of sui- 
cidal behaviour is debated. In a non-systematic review, reductions 
in suicide attempts of between 40% and 80% have been reported in 
depressed patients treated with antidepressants [57]. The antidepres- 
sant response may also relate to clinical presentation. A trial in the 
Netherlands in which paroxetine was compared with placebo in pa- 
tients who had all repeated self-harm but who did not suffer from 
current depressive disorder showed apparent benefits for a subgroup 
of patients who received paroxetine, namely those who had a his- 
tory of 1-4 episodes of self-harm. Patients with a history of five or 
more episodes did not seem to benefit [58]. A meta-analysis of all 
data submitted to the US Food and Drug Administration (FDA) re- 
vealed a reduction in suicidal thoughts and behaviour in those aged 
25 years and over, but an increase on those measures for participants 
below the age of 25 [59]. The risk of attempting suicide is 2.5 times 
higher in the month preceding antidepressant treatment than in the 
month following treatment [60]. 

These findings have highlighted the need to be cautious in the 
use of antidepressants, to provide early follow-up after initiating 
therapy, and to consider combining antidepressant treatment with 
other therapies, especially for adolescents (in whom only fluoxetine 
is currently recommended for the treatment of depression). 
Lithium 
The protective effect of lithium in people with affective disorders is 
well established. A systematic review of trials of lithium therapy vs a 
range of other drugs and placebo in patients with affective disorders 
has shown convincing evidence that lithium may prevent suicide 
[61]. Lithium reduced the risk of death and suicide by 60%, com- 
pared to placebo. The anti-suicidal effect was found to be greater 
than the effect on mood episodes, suggesting that it may be a specific 
effect perhaps related to changes in aggression or impulsivity. It is 
not known if lithium has anti-suicidal properties in other groups of 
patients. 
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Neuroleptics 


Suicidal acts are common in those suffering from psych- 
otic illnesses. Clozapine has been identified in some studies as 
having a protective effect. In the Intersept (International Suicide 
Prevention Trial) trial, clozapine was compared to olanzapine in 
a multi-centre randomized, open-label study in individuals with 
schizophrenia or schizoaffective disorder who were deemed to 
be at high risk of suicide [62]. Clozapine was associated with a 
significant reduction in suicidal ideation and suicide attempts 
and fewer interventions to prevent suicide. There were also fewer 
deaths from suicide, but this was not a statistically significant 
finding. Data from the Clozaril National Register also suggest that 
current users of clozapine have a lower mortality rate than recent 
or past users [63]. However, this analysis used no control group 
and discontinuation of clozapine was associated with a worse clin- 
ical outcome. 


Ketamine 


Ketamine can have rapid antidepressant effects and may be a useful 
treatment option for acute suicidality. A case series involving 14 
subjects who presented to an emergency department showed rapid 
and significant decreases in suicidality, which were sustained over 
10 days [64]. In one randomized controlled trial, total absence of 
suicidal ideation was observed in over half of patients treated with 
ketamine, compared with 24% of those given midazolam [65]. Two 
systematic reviews of the effectiveness of ketamine in unipolar de- 
pression have shown that it may reduce suicidality in patients being 
treated for depression [66]. Ketamine is also reported to be effective 
in reducing suicidal ideation in people with bipolar disorder [67]. 
These findings are promising, but little is known about the effects of 
ketamine on self-harm or how long the anti-suicidal effects might 
persist. 


Service provision and continuity of care 


Aspects of the provision of mental health services can affect the 
rates of suicide in clinical populations. In the UK, lower suicide 
rates were observed in areas which implemented the recom- 
mendations of the National Confidential Inquiry into suicide and 
homicide by people with mental illness [68]. Recommendations 
included removal of ligature points, provision of 24-hour crisis 
teams, 7-day follow-up following discharge, and training for 
frontline staff. Continuity of care is likely to be a particularly im- 
portant factor in preventing suicide in patients at risk, involving 
care being continued during periods of remission. Unlike affective 
disorders where risk is greatest during depressive episodes, the 
risk in schizophrenia tends to be highest between episodes of acute 
psychotic illness when patients may have insight and feel hopeless 
about their circumstances and prospects. Risk is related more to 
affective symptoms than to core features of the disorders [69]. In 
the UK, specific concerns have been raised about the use of out- 
of-area inpatient beds, given a significant increase in suicides after 
discharge from a non-local unit [55]. This is likely to relate to social 
isolation and disruption in communication and care planning as- 
sociated with out-of-area admissions. Frequent reorganizations 
of psychiatric services have also been linked to increased suicidal 
behaviour [70]. 


Specific at-risk groups 


Those who misuse drug and alcohol 


Programmes for suicide prevention must reinforce healthy behav- 
iours with respect to alcohol and substance misuse, irrespective of 
whether an individual has a formal diagnosis. Treatment of those 
who abuse substances is likely to be the best approach to prevention 
for patients, in parallel with social measures such as restricting ac- 
cess to alcohol or minimum pricing. The introduction of policies to 
restrict access to alcohol has been associated with a reduction in sui- 
cide in Slovenia and Russia, although the effect was limited to men 
[71-73]. The elevated risk in the weeks following the break-up of a 
relationship for patients with severe alcohol abuse [74] again points 
to the need for continuity of support in the community. 

Prevention of suicide in patients with comorbid disorders, es- 
pecially the combination of depression with alcohol abuse and/or 
personality disorder, is challenging, as compliance with treatment 
is often less than in patients with single disorders. Effective preven- 
tion is likely to depend on close integration of care between different 
statutory care agencies. 


Chronic physical illness and pain 


Chronic illnesses, particularly cancer and respiratory diseases, 
are associated with an increased risk of suicide [75, 76]. The rela- 
tionship between pain and suicidal behaviour is also highly sig- 
nificant [77, 78]. Risk is particularly associated with severe and 
recurrent headache, psychogenic pain, chronic abdominal pain, 
and medicolegal issues related to pain. Suicide prevention in this 
group is complicated by the bi-directional relationship between pain 
and psychopathology. All patients with chronic illness should be 
screened for depressive disorder and asked about suicidal ideation 
if screening positive. Proactive treatment should be instigated if a 
mood disorder is identified. 


People who self-harm 


In view of the clear association between non-fatal self-harm and sui- 
cide, establishment of adequate services for people who self-harm, 
including the provision of careful assessments of patients in the gen- 
eral hospital, and offering treatments for which at least some indica- 
tors of benefit are available, are important elements in any national 
suicide prevention strategy. There is good evidence that well-trained 
non-medical psychiatric staff can effectively carry out assessments 
and arrange aftercare. Furthermore, psychosocial assessment fol- 
lowing self-harm may be associated with a reduction in repeat epi- 
sodes [79]. Models for ideal services exist such as those published 
by the National Institute for Health and Care Excellence [53, 80] in 
the UK. 


High-risk occupational groups 


Certain occupational groups are known to be at relatively high risk 
of suicide. In England, recent evidence indicated that these include 
men in the construction industry and in media and cultural occu- 
pations and women in primary school teaching, nursing, and cul- 
tural and media occupations, together with people of both genders 
in caring occupations. 
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Prisoners 


There are relatively high suicide rates in prisoners [81], especially 
young males held on remand. While one aspect of prevention is 
through ensuring that prisons and police cells are safe in terms of ab- 
sence of ligature points, there are a range of other potentially useful 
and humane strategies [82]. These include careful assessments of 
new inmates using risk assessment procedures, training of staff 
with regard to both assessment skills and attitudes towards mental 
health problems and suicide prevention, in-reach programmes by 
befriending organizations such as Samaritans, court diversion of in- 
dividuals with major mental health treatment needs, and prompt ac- 
cess to psychiatric and psychological services. 


People bereaved by suicide 


People bereaved by suicide have an increased risk of depression, psy- 
chiatric admission, suicide attempt, and suicide, when compared 
with those bereaved by other types of death [83, 84]. Risk of suicide 
attempt after a close contact’s death is elevated, regardless of whether 
they were related to the deceased or not [85], suggesting that wider 
social networks are vulnerable to adverse mental health outcomes. 
The term postvention is used to describe support offered to someone 
after the suicide of a close contact. Such post-bereavement inter- 
ventions are intended to prevent the emergence of mental health 
problems and include national helplines, online support, individual 
counselling, group work, school and college in-reach after suicide, 
and support from professionals. Countries such as the United States 
and Australia lead the way in terms of the range of services and public 
health resources devoted to postvention. In most countries, the ma- 
jority of provision lies within the voluntary sector. Much of the mo- 
mentum for this has been provided by people bereaved by suicide 
who, in the United States, are also known as suicide survivors. 

There is weak evidence supporting the effectiveness of postvention 
offered in school-based, family-focused, and community-based set- 
tings to reduce the risk of suicide-related outcomes [86]. Trials of 
interventions to reduce other adverse outcomes of suicide bereave- 
ment have found little evidence for effectiveness [87]. Evaluations of 
any such interventions will need to be conducted on a country-by- 
country basis, given cultural dimensions of grief, bereavement, and 
suicide and of stigmatizing attitudes to those bereaved by suicide. 


Treatment of people who self-harm 


Self-harm occurs for a wide range of reasons. In many cases, the pri- 
mary aim is not death, but some other outcome such as processing 
distress, temporary escape, or communicating distress to other 
people. The needs of individual patients vary widely, meaning that 
a broad range of potential treatments are required. Treatments in- 
clude both psychosocial and pharmacological approaches, which 
might be combined, for example, if a patient suffers from moderate 
or severe depression in the setting of employment and financial dif- 
ficulties, when treatment with an effective drug might be combined 
with problem-solving therapy. 

Factors relevant to treatment needs in people who self-harm are 
discussed in the following sections. 


Repetition of attempts and risk of suicide 


Repetition of attempts is common, with 15-25% of those who pre- 
sent to hospital engaging in further self-harm within a year [88, 89]. 
Self-harm is also associated with an elevated risk of suicide, al- 
though the frequency of suicide following attempted suicide varies 
internationally [90], depending partly on the overall characteristics 
of the patient population and the general population suicide rates. 
Repetition of self-harm increases the risk of suicide [88]. Prevention 
of repetition of self-harm and suicide is understandably a major aim 
in treating people who intentionally harm themselves. 


Psychiatric disorders and psychopathology 


In a systematic review of studies worldwide, an estimated 87% of 
adults who presented to hospital having self-harmed had at least one 
psychiatric disorder, with depression, anxiety, and substance misuse 
being particularly common [36]. In adolescents, a pooled average 
of 81% had psychiatric disorders, with depression, anxiety, and 
attention-deficit/hyperactivity disorder (ADHD) being the most 
frequent. In addition, some four out of ten patients may have per- 
sonality disorders [91]. While treatment directed at the underlying 
causes of such disorders, where possible, will be important in 
managing patients who self-harm, often the disorders themselves 
will require specific treatment. 


Negative life events 


Social problems are particularly common in people who self-harm 
[92], including difficulties in interpersonal relationships, especially 
with partners and other family members, employment problems, 
particularly in males, and financial difficulties. Physical and sexual 
abuse, neglect, and domestic violence are relatively common. Life 
events, especially loss of a relationship, a job, or housing, frequently 
precede self-harm [93]. 


Poor problem-solving skills 


Difficulties in problem-solving, particularly in relation to interper- 
sonal relationships, are characteristic of people who self-harm [94]. 
This distinguishes them from patients with psychiatric disorders 
who have not carried out a suicidal act. 


Impulsivity and aggression 


There are strong links between suicidal behaviour and impul- 
sivity and aggression. There is also accumulating evidence that 
hypofunction of brain serotonergic systems is linked to aggression 
(and possibly impulsivity) and also to suicidal behaviour [95] (see 
Chapter 127). It is unclear whether this represents a state phenom- 
enon associated with psychiatric disturbance or a trait phenomenon, 
but current evidence favours the latter. 


Hopelessness and low self-esteem 


Hopelessness, or pessimism about the future, is an important pre- 
dictor of repetition of suicidal behaviour and a risk factor for even- 
tual suicide [93]. Low self-esteem is another important characteristic 
associated with suicidal behaviour [4]. There is likely to be a link 
between low self-esteem and a tendency to experience hopelessness 
when facing adverse circumstances. 
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Motivational problems and poor compliance 
with treatment 


Some people who attempt suicide may be poorly motivated to en- 
gage in aftercare, which affects treatment adherence. Organizational 
factors in general hospital psychiatric services (including continuity 
of care) and the attitudes of clinical staff may be important factors 
influencing whether patients engage in aftercare. In adolescents, 
family group interventions appear to improve adherence with after- 
care, as do brief interventions in the emergency department [96]. 


Psychosocial assessment 


While conducting the assessment, the clinician should try to as- 
sess the imminent risk of suicide or further self-harm (Box 128.2). 
However, suicide risk assessment scales perform badly [44, 45] and 
reliance on them to determine aftercare is discouraged. Several fac- 
tors associated with an increased risk of suicide following attempted 
suicide relate to the nature of the act (for example, violent method, 
leaving a suicide note, high suicidal intent), but even these have low 
predictive power, especially in the longer term. The best predictor 
of repeat self-harm is a previous episode. Rather than focusing pri- 
marily on risk assessment, it is recommended that clinicians direct 
their attention to risk reduction in all patients. This includes safety 
and crisis plans such as restricting access to means and collaborative 
crisis planning with the patient. Safety planning may be enhanced by 
involvement of family members and other key individuals. Another 
key component is clear communication between agencies, with 
the patient and with family members and other key individuals (as 
appropriate). 


Box 128.2 Assessment of patients who self-harm 


Factors to cover in history-taking 

e Life events preceding the attempt 

e Motives for the act, including suicidal intent (Box 128.1) 

e Problems facing the patient 

e Psychiatric disorders 

e Psychiatric history 

e Previous suicide attempts 

e Personality traits/disorders 

e Alcohol and drug misuse 

e Family and personal history 

e Current circumstances 

e Social (for example, nature of social relationships) 

e Domestic (for example, living alone) 

e Occupation (for example, whether employed) 

e Exposure to suicide/self-harm in friends or family members 
Accessing Internet sites relevant to suicidal behaviour 
Online social networking or similar activities related to suicidal 
behaviour 




















Additional suggested assessments 

e Risk of further self-harm and of suicide 

e Coping resources and supports 

e What treatments are appropriate to the patient's needs? 

e Motivation of the patient (and significant others where appropriate) 
to engage in treatment 








Planning aftercare 


Aftercare following self-harm should be planned according to each 
patient's specific needs. An individual with moderate to severe de- 
pression and difficulties with employment or finances might best 
be helped through a combination of pharmacotherapy and psy- 
chological therapy such as problem-solving and support. Someone 
with relationship and substance misuse difficulties might best be 
helped with a combination of specific substance misuse treatment 
and couple or family therapy. Someone with difficulties in impulse- 
control and attachment problems might be offered longer-term psy- 
chotherapy such as dialectical behaviour therapy (DBT). Whichever 
therapy is offered, risk reduction measures and careful monitoring 
of mood and suicidal ideation are recommended (Box 128.3). 


Psychosocial treatments 


A range of psychosocial therapies for patients who self-harm have 
been evaluated in randomized controlled clinical trials [97, 98]. The 
findings from these reviews and some individual studies are sum- 
marized here. 


Brief cognitive behavioural therapy-based interventions 


A meta-analysis of the results of trials of cognitive behavioural 
therapy (CBT)-based psychotherapy, compared with treatment as 
usual (TAU), indicated that CBT-based psychotherapy (including 
either problem-solving therapy or CBT) is associated with fewer in- 
dividuals repeating self-harm [97]. There is also evidence of other 
positive outcomes, including greater reductions in depression, 
hopelessness, and suicidal ideation. This approach is useful, either 
used alone or in the context of other treatments. 


Dialectical behaviour therapy 


DBT is a relatively intensive form of therapy, combining individual 
therapy sessions focused on addressing an individual's life problems, 
group therapy focused on acceptance of emotions, reducing emo- 
tional responsivity and development of interpersonal skills, and ac- 
cess to therapist support. 

DBT is thought to be appropriate for patients with a history of 
multiple episodes of self-harm, especially those with features as- 
sociated with borderline personality disorder. Treatment in adults 


Box 128.3 Factors that suggest high suicidal intent 


e Act carried out in isolation 

e Act timed so that intervention unlikely 

e Precautions taken to avoid discovery 

e Preparations made in anticipation of death (for example, making a 
will, organizing insurance) 

e Preparations made for the act (for example, purchasing means, saving 
up tablets) 

e Communicating intent to others beforehand 

e Extensive premeditation 

e Leaving a note 

e Note alerting potential responders after the act 

e Subsequent admission of suicidal intent 
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usually lasts a year. While few trials have been conducted in pa- 
tients who self-harm, a meta-analysis of trials comparing DBT 
to TAU indicated that individuals receiving DBT had fewer sub- 
sequent episodes of self-harm. DBT has also been developed for 
adolescents, but in a briefer format. Early evidence suggests that, 
compared with TAU, DBT may reduce repetition of self-harm and 
benefit mood and self-esteem in adolescents with a history of re- 
peated self-harm. 


Contact interventions 


Contact interventions are those in which contact is maintained re- 
motely with patients who have self-harmed, for example through 
regular mail contacts with patients, usually via postcards. Typically 
these include expression of concern for the patient’s welfare and de- 
tails of emergency contact information and are sent monthly for a 
few months and then bi-monthly for the remainder of the first year 
after a self-harm episode. A meta-analysis of trials comparing this 
approach combined with TAU with TAU alone suggested equivocal 
results in terms of self-harm repetition. However, a trial in Iran, 
where community psychiatric service provision (that is, TAU) is 
limited, suggested a significant benefit in terms of reduced repetition 
of self-harm [99]. There is little evidence for benefits of other contact 
interventions such as those via telephone. 


Other psychosocial treatment innovations 


As treatment adherence is often an issue in individuals who self- 
harm, efforts have been made to improve attendance at treatment 
sessions. Continuity of care, through having the assessing clinician 
provide aftercare, has been shown to increase the proportion of pa- 
tients who attend for treatment [100]. Among patients who failed to 
attend an outpatient appointment, those who had a home visit from 
a nurse were more likely to attend subsequent follow-up than those 
where no such visit occurred [101]. For adolescents, ‘therapeutic 
assessment’ has been developed, in which, in addition to a routine 
assessment, there is a focus on understanding the meaning of their 
self-harm, on formulation of their problem, and linking this to bene- 
fits of aftercare, together with family involvement in the process. In 
trials, this was associated with more adolescents attending treat- 
ment sessions than where a routine assessment was provided [102]. 
However, it should be noted that none of these service-orientated 
innovations was associated with reduced subsequent frequency of 
repeated suicide attempts, although existing trials had limited power 
to evaluate this outcome. 

A rapidly growing recent development is the provision of therapy 
through mobile health applications (apps) on remote electronic de- 
vices such as smartphones [103]. This is clearly a growth area with 
the major advantage that it might be helpful for people who do not 
access clinical services. However, trials are so far lacking. 


Pharmacological treatments 


In spite of the high prevalence of psychiatric disorder, particularly 
depression [36], in people who self-harm, there have been relatively 
few trials evaluating the effectiveness of pharmacological agents in 
this population [104]. This perhaps reflects the problems of treatment 
adherence and risk of overdose. The likely most effective pharmaco- 
logical approach is treatment of any underlying psychiatric disorder. 


Care should be taken to prescribe the safest appropriate treatment. 
Tricyclic antidepressants should be avoided in view of their high 
toxicity. Selective serotonin reuptake inhibitors (SSRIs) are gener- 
ally deemed to be the safest, although citalopram is significantly 
more toxic than other SSRIs [105]. Prescriptions should be limited 
to small quantities of medications where necessary. Lithium has the 
strongest evidence for prevention of suicidal behaviour [61]. 

In adolescents, there are concerns about the use of SSRIs in par- 
ticular, because of an association with suicidal ideation. However, 
reduced prescribing of SSRIs in response to warnings from regula- 
tory bodies was actually associated with increased rates of suicide 
in young people [106]. There is also evidence that a combination 
of fluoxetine and CBT may have greater benefit in terms of suicidal 
ideation than either alone in the treatment of adolescents with de- 
pression [107]. Where a drug is prescribed for either adults or young 
people with depression, early follow-up (within a few days) is indi- 
cated because this is when agitation and possible suicidal ideation 
are most likely to occur. 


Conclusions 


In this chapter, we have focused on a wide range of specific initiatives 
that may help prevent suicide, especially in people with psychiatric 
disorders. We have emphasized the need for a comprehensive multi- 
sectoral approach to suicide prevention. This should be tailored to 
specific local patterns of suicide. For people in contact with psychi- 
atric services, prevention may be achieved through specific aspects 
of service provision, as well as through indicated therapies. Evidence 
of effectiveness of specific initiatives is gradually emerging. We have 
focused in some detail on people who have self-harmed. ‘This is jus- 
tified both because of the strength of the association between self- 
harm and suicide and also because of the impact that self-harm 
has on the individuals involved and the family and friends. Again, 
potentially effective therapies are being identified. In considering 
prevention of self-harm and suicide in people with psychiatric dis- 
orders, an essential complement is a population-based strategy that 
ensures easy access to well-designed and supportive services, staffed 
by those with an awareness of the needs after self-harm and training 
in suicide prevention. 
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Deconstructing dualism 


The interface between physical and mental illness 


Michael Sharpe and Jane Walker 








Patients usually attend doctors because they are concerned about 
symptoms. When these symptoms are associated with persistent dis- 
tress and/or disability, we refer to the patient as having an illness. 
When assessing the patient's illness, the doctor aims to make a diag- 
nosis, on the basis of which management can be planned. The diag- 
noses available to doctors are conventionally categorized into either 
‘medical or ‘psychiatric. This division of illnesses into two distinct 
types is such an accepted feature of current medical practice that we 
take it for granted. But is it really the best way to think about patients’ 
illnesses and to plan their care? In order to answer this question, 
we examine below what is meant by ‘medical’ and ‘psychiatric’ diag- 
noses. The disadvantages of this dichotomous approach will then be 
considered, and potential solutions outlined. 


| Diagnosis 





Medical diagnosis 


A medical diagnosis is a label for a type of illness that is: (1) con- 
ventionally treated by medical (non-psychiatric) doctors; and 
(2) listed in classifications of medical conditions such as the 
International Classification of Diseases, tenth revision (ICD-10). 
Most medical diagnoses are based on identifiable bodily path- 
ology (abnormal structure and/or function). Therefore, to make a 
medical diagnosis (such as cancer), doctors enquire about specific 
bodily symptoms, before confirming the presence of bodily path- 
ology by seeking physical signs and doing biological investigations 
(such as X-rays). 


Psychiatric diagnosis 


A psychiatric diagnosis is a label for a condition that is: (1) con- 
ventionally treated by psychiatrists; and (2) listed in the psychiatric 
diagnostic classifications of ICD and the Diagnostic and Statistical 
Manual of Mental Disorders (DSM). Psychiatric diagnoses are 
not based on bodily pathology (with the possible exception of the 





so-called ‘organic disorders’). They are instead associated with the 
idea of ‘psychopathology; that is, proposed abnormalities of the 
mind. Unlike bodily pathology, these abnormalities of the mind 
cannot be objectively identified. Rather they are inferred from the 
patient’s reported mental symptoms and their behaviour. 


What makes an illness psychiatric, rather than medical? 
Why are some illnesses regarded as ‘mental’ or ‘psychiatric, as op- 


posed to ‘medical’? Examination of the criteria for those diagnoses 
reveals that factors common to most ‘psychiatric illnesses are: 


e An absence of identifiable bodily pathology. 
e An abnormal mental state implying psychopathology. 
e A presentation with abnormal behaviour. 





| Mind-body dualism 





The fundamental assumption underpinning this dichotomous view 
of illness as either medical or psychiatric is that itis both conceptually 
valid and clinically useful. The idea that the mind is fundamentally 
distinct from the body (or the brain) is referred to as mind-body 
dualism. Mind-body dualism is commonly attributed to the writ- 
ings of the philosopher Descartes. This so-called Cartesian dualism 
has exerted a profound influence on Western medical thought and 
still shapes our thinking, training, and service provision today [1]. 

However, it is increasingly clear that dualism is at best an over- 
simplification and at worst a cause of serious theoretical and clin- 
ical problems. Theoretically, it can be argued that there is no such 
thing as a purely ‘bodily’ illness or a purely ‘mental’ illness and that 
all illnesses have both mental and bodily aspects [2]. Clinically, the 
associated assumption that bodily symptoms always indicate bodily 
pathology and that mental symptoms always indicate psychopath- 
ology is not consistent with clinical experience. Two important 
examples are: (1) when bodily symptoms occur without bodily path- 
ology; and (2) when mental symptoms occur with bodily pathology 
(Table 129.1). 
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Table 129.1 Diagnoses, symptoms, and bodily pathology 





Bodily symptoms Present Medical diagnosis 
Absent Somatization 

Mental symptoms Present Comorbidity 
Absent Psychiatric diagnosis 


‘Medically unexplained symptoms’ 


When patients present with bodily symptoms, but no evidence of 
bodily pathology, it is unclear whether their illness should be best 
regarded as ‘psychiatric’ or as ‘medical’ because they do not clearly 
fulfil the criteria for either. But the imperative to classify every illness 
as either medical or psychiatric in order to determine the type of 
treatment requires that an allocation is made. 

One solution is to allocate these illnesses to psychiatry. This is 
done by assuming that the patient’s somatic symptoms are, in fact, 
explained by psychopathology. The relative absence of mental symp- 
toms, from which psychopathology is usually inferred, is explained 
by the idea that the psychopathology has been ‘somatized. This 
means that an abnormal mental state is not apparent because it has 
been ‘converted’ into bodily symptoms. The process of somatization 
(literally making the mental physical) is poorly specified. However, 
the concept can be seen in a variety of diagnostic labels, including 
‘somatization, ‘somatoform disorder, ‘conversion disorder, and 
‘bodily distress disorder’ 

Another solution is to allocate the illness to medicine. This is 
made possible by assuming that the patients really do have bodily 
pathology (even though it is as yet unidentifiable). The absence of 
identifiable bodily pathology may be explained on the basis that it 
is present but simply not detectable using currently available diag- 
nostic technology. A medical diagnosis that implies disease, such 
as ‘fibromyalgia in rheumatology or ‘syndrome X’ in cardiology, is 
then made. 

A third, and all too common solution, is for the patient to be re- 
jected as ‘not really ill’ by both psychiatry and medicine, leaving 
them in a no-man’s-land between these specialties. 

The problems with all three of these solutions has been particu- 
larly well illustrated by the controversy and conflict surrounding the 
condition called chronic fatigue syndrome (CFS), or myalgic en- 
cephalomyelitis (ME), in which there has been often fraught contro- 
versy about the nature of the illness and an associated lack of care [3]. 


Comorbidity 


When a patient has both bodily pathology and mental symptoms, 
they may be given both a medical diagnosis (based on the bodily 
pathology) and a psychiatric diagnosis (based on presumed psycho- 
pathology). The resulting mental-physical ‘comorbidity’ or ‘multi- 
morbidity’ gives rise to both theoretical and practical problems, 
however. 

The theoretical problem concerns the psychiatric diagnosis. To 
make this diagnosis, the doctor must identify symptoms which 
are considered to be evidence of the underlying psychopathology. 
However, some symptoms may be considered as resulting from ei- 
ther psychopathology or bodily pathology. For example, if a patient 


has mental symptoms of weight loss and a lack of energy and a 
medical diagnosis of cancer, based on bodily pathology, should the 
weight loss and lack of energy be counted towards a psychiatric diag- 
nosis of depression or be regarded as asymptom of the cancer? There 
is no generally agreed answer to this conundrum (although a variety 
of ways of addressing it have been proposed [4]), probably because it 
is a manifestation of the fundamentally flawed dualistic assumption. 

The main practical clinical problem that results from making two 
diagnoses, one medical and one psychiatric, is a failure to adequately 
treat the patient. This is because the patient is considered to have 
two illnesses, each of which requires diagnosis and treatment by a 
different specialty, often based in different hospitals. As treatment of 
the medical condition usually takes precedence, the patient's psychi- 
atric diagnosis often goes untreated [5]. 


Solutions to dualism 


Theoretical solutions 


New scientific knowledge, such as the demonstration of a bodily 
(neural) basis to many ‘mental’ symptoms, is increasingly rendering 
crude dualistic thinking theoretically untenable [2]. Mind and brain 
are increasingly regarded as two sides of the same coin—the ‘mind/ 
brain. If this paradigm shift is adopted, it will imply that ‘psychi- 
atric’ illnesses are no more distinct from ‘medical conditions’ than 
the nervous system is separate from the rest of the body. According 
to this new way of thinking, it would make no sense to dichot- 
omize illnesses into ‘medical’ and ‘psychiatric’ types. Illness is just 
illness [6]. 


Practical solutions 


However the theoretical arguments play out, for the present, we 
must accept that dualism continues to shape our everyday thinking, 
practice, and service organization. But we can surely agree already 
that there is a need for psychiatry to become less ‘brain-less’ and for 
medicine to become less ‘mind-less’ [7]. In this regard, the psych- 
iatrist is especially well placed to make a major contribution to the 
care of all patients by ensuring that biological, psychological, and 
social aspects of illness are considered in every case. This so-called 
‘bio-psychosocial’ approach, first proposed by George Engel, pro- 
vides a framework for considering multiple causes of illness [8]. 
A further enhancement of this approach is to divide the relevant 
biological, psychological, and social aetiological factors into those 
that predisposed the patient to the illness, those that precipitated 
or triggered it, and those that are perpetuating it or impeding re- 
covery. The last group of causes is a target for treatment, and the first 
two for prevention. A way of tabulating factors to consider in a bio- 
psychosocial formulation is shown in Table 129.2. 


Service solutions 


Finally, the consequence of the long-standing professional and 
organizational separation of medicine and psychiatry must be 
addressed. One service solution has been the establishment of so- 
called liaison (linking) psychiatry services to general hospital in- 
patient units. Another recent and welcome development has been 
the increasing integration of psychological management into gen- 
eral medical care [9]. 
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Table 129.2 A bio-psychosocial formulation 





Sub-factors Predisposing Precipitating 


Biological Disease 
Physiology 
Psychological Cognition 
ood 
Behaviour 
Social nterpersonal 


Social and occupational 





Health care system 


Such developments are likely to be important for psychiatry’s fu- 
ture status as a medical discipline [10]. 


Conclusions 


Hitherto we have taken it for granted that it is both appropriate 
and desirable to dichotomize illnesses into medical and psychiatric 
types, based on identifiable bodily pathology and inferred psycho- 
pathology, respectively. Such an approach has, however, created 
obstacles to integrated and effective patient care. A better under- 
standing of neuroscience is challenging the theory of dualism and 
may lead to a more unified conceptualization of illness. The growing 
need to address the epidemic of patients with multi-morbidity is al- 
ready making dualistic management redundant and favouring more 
integrated models of care. The influence of dualism is finally on the 
wane. The question is, ‘will it be missed?’ 
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Neural mechanisms in chronic pain 


relevant for psychiatric interventions 


Chantal Berna and Irene Tracey 





Pain is defined as ‘an unpleasant sensory and emotional experience 
associated with actual or potential tissue damage, or described in 
terms of such damage’ [1]. Acute pain is a warning system alerting 
the subject of possible harm and leading to corrective or preventive 
action, both immediate (flight or fight) and delayed (learning, avoid- 
ance). However, inherent to this definition, it is clear that pain can 
exist as an emotional experience also without actual ‘tissue damage’ 
or physical harm. This broad definition is important as it encapsu- 
lates what can occur in chronic pain, that is, persistent pain outliving 
the usual time for tissue healing (>3-6 months). Chronic pain is 
one of the largest medical health problems in the developed world, 
affecting 19% of adults in Europe, with a considerable impact on 
quality of life, productivity, and costs for the society [2]. A recent 
report by the American Institute of Medicine assessing the signifi- 
cance and impact of chronic pain in America presented similar fig- 
ures, with an estimated cost to society of about 600 billion dollars 
per annum, due to both medical care and lost productivity [3]. 








Nociception, that is, the neural and molecular processes involved 
in acute pain perception, has been progressively elucidated. This in- 
cludes the transduction of the nociceptive input through different 
peripheral sensory neurons (nociceptors) into a neural signal, fol- 
lowed by the multisynaptic transmission to the central relays and 
perception in the sensory cortices (Fig. 130.1). 

Most nociceptors are myelinated A fibres or unmyelinated small- 
diameter axons (C fibres). They project to the superficial laminae 
I and V (AS) and I and II (C fibre) of the dorsal horn of the spinal 
cord [4]. Local spinal circuits— excitatory and inhibitory—allow 
for bi-directional changes in spinal outputs. Excitatory transmit- 
ters, such as glutamate and substance P, can facilitate and ‘wind 
up’ neuronal responses, whereas inhibitory interneurons releasing 
endogenous opioids and gamma aminobutyric acid (GABA) can 





modulate spinal activity. From the dorsal horn, three major projec- 
tion pathways lead the signal to the brain: the spinomesencephalic 
[to the periaqueductal grey (PAG)], the spinothalamic (to the thal- 
amus as a relay to the cortex), and the spinoreticular pathway (to the 
reticular formation of the medulla and pons as a relay to the thal- 
amus and cortex) [5]. Molecular biology and the study of patients 
suffering from congenital insensitivity to pain have contributed 
significantly over the past decades to form a more complete under- 
standing of these mechanisms of transduction and transmission, 
providing new opportunities for possible pharmacological targets 
such as the sodium channel NAv.1.7, the transient receptor poten- 
tial (TRP) family, and acid-sensing ion channels (ASICs) [6]. In par- 
allel, improved knowledge of the molecular and cellular changes that 
occur post-injury and in the chronic pain state is providing novel 
options for therapeutic interventions such as nerve growth factor 
(NGF) antagonism [7]. These advances, among others, have con- 
tributed to chronic pain being classified as a disease in its own right 
[8, 9], with specific underlying mechanisms distinct from the ones 
involved in acute pain. 








Neuroimaging has helped to develop knowledge about the neural 
networks of sensory perception. The current concept for the central 
aspects of pain perception is mostly based on acute induced pain 
studies and considers a large bilateral network available for vari- 
able activation, including the thalamus, the posterior and anterior 
insulae, the secondary somatosensory cortex, the anterior cingu- 
late cortex, and the PAG matter, among other regions (Fig. 130.2) 
[10, 11]. 

The principle is that different brain regions and networks can 
be recruited variably and dynamically for activation (or not), 
depending on the nociceptive drive, context, attention, emotions, 
and cognitions, allowing for rapidly adaptive behavioural responses 
to each individual specific situation [5]. It should be obvious that 
most of the brain regions active in this dynamic network are not 
pain-specific (that is, the ‘hurt’ of pain) but subserve many features 
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Fig. 130.1 (see Colour Plate section) Acute nociception as a three-neuron order system. The peripheral nociceptor is responsible for the transduction 
into a neural signal (1). This is followed by transmission through neurons (2), with possible descending and local modulation (3), leading to the final 


perception at the cortical level (4). 


relevant to this multifactorial experience in matters of anticipa- 
tion, anxiety, attention, sensory discrimination, motor preparation, 
etc. (for example, [12]). Some regions of this brain network also 
likely encode the saliency of pain within a particular context [13]. 
Overlap of activity in regions like the anterior cingulate and divi- 
sions of the insula, as well as distinctions in the activated networks 
when comparing physical pain, empathy (that is, feeling for others 
in pain), and moral suffering (such as studied through social rejec- 
tion), highlight that while many cortical areas are recruited by mul- 
tiple processes, specific networks and patterns of activation can be 
characteristic of a certain perception or function. Such distinctions 
are being increasingly disambiguated with more advanced imaging 
analysis methods and paradigms [14-16]. We should remember that 
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experiencing emotional pain is what makes us human, as underlined 
by William James and other philosophers. Distinguishing between 
pain that has a peripheral maintaining drive (sometimes wrongly la- 
belled ‘physical’) and centrally generated or maintained pain (often- 
times labelled ‘emotional’) has been a preoccupation for scientists, 
the medical profession, lawyers, and insurance brokers, with fre- 
quent misunderstandings and confusion—leading to downgrading 
the relevance of ‘emotional’ pain because it does not have a ‘physical’ 
origin. Newer research, taking into account the complexities of the 
neural pathways that integrate emotional and physical experiences, 
can allow distinctions at more subtle levels. Hence, an emotional re- 
sponse to social exclusion should not be confused or amalgamated 
with centrally amplified pain following a nerve injury with resulting 


C j) j 


Fig. 130.2 (see Colour Plate section) Cerebral areas and networks involved in pain perception and regulation, as well as reward. The key networks 
consist of the descending pain modulatory system (DPMS, green) and the reward network (purple). Alterations in function, connectivity, and structure 
have been described in these networks in chronic pain. rACC/mACC, rostral/medial anterior cingulate cortex; vIPFC, ventrolateral prefrontal cortex; 
dIPFC, dorsolateral prefrontal cortex; mPFC, medial prefrontal cortex; OFC, orbitofrontal cortex; insula/S2, insular and secondary somatosensory 
cortex; S1, primary somatosensory cortex; Am, amygdala; Hip, hippocampus; Hypo, hypothalamus; Thal, thalamus; PAG, periaqueductal grey; VTA, 


ventral tegmentum. 


Adapted from Nat Neurosci., 17(2), Denk F, McMahon SB, Tracey |, Pain vulnerability: a neurobiological perspective, pp. 192-200, Copyright (2014), with permission from 


Springer Nature. 
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central sensitization. We are not making judgements about which 
pain (or suffering) is more important—for the patient, the experi- 
ence is paramount, whatever the cause. However, better discrimin- 
ation and identification of what constitutes an individual’s pain (that 
is, balance of peripheral and central components) are important if 
we are to target treatments more effectively. 


Modulation of pain perception 


Nociception and pain can have a highly non-linear relationship; 
there is frequently a mismatch between the nociceptive input and 
the resultant pain perception. In fact, the bottom-up input can be 
modulated by a number of mechanisms along the pain neuraxis, 
leading to different final experiences. For instance, the processing of 
the signal can be modified through either dampening of the signal or 
the recruitment of supplementary cortical areas. This is important as 
it allows for a person’s pain experience and subsequent behavioural 
response to adapt to the situation in which they are—enabling an 
improved decision and outcome. A major regulatory pathway is the 
descending pain modulatory system (DPMS), which has central re- 
lays in the dorsolateral prefrontal cortex (DLPFC), rostral anterior 
cingulate cortex (rACC), amygdala, hypothalamus, rostral ventro- 
medial medulla (RVM), and PAG matter and releases endogenous 
opioids in the dorsal horn of the spinal cord, as well as noradren- 
aline and serotonin [10, 17] (Fig. 130.2). In the RVM, serotonin 
(5-HT) can inhibit (via 5-HT1 receptors) or facilitate (via 5-HT2 
or 5-HT3 receptors) spinal neuronal activity; endorphins also have 
bi-directional effects, while noradrenaline has inhibitory activity 
through the a2 adrenoceptor. The DPMS is a powerful system that is 
recruited for its inhibitory (pain-inhibiting) features when subjects 
are distracted from pain or during placebo analgesia. The opposite 
facilitatory arm is recruited in response to injury. It is now known 
from animal and human studies that an imbalance in this system 
(too much facilitation and not enough inhibition) is a major factor 
explaining the chronic pain phenotype. 

Many studies in healthy participants have demonstrated the po- 
tential for modulation of acute pain through cognitive processes such 
as distraction, expectations, or context [18-20]. The involvement of 
the central relays of the DPMS was demonstrated in experimental 
placebo analgesia, with naloxone (opioid antagonist) blocking this 
effect and inhibiting the functional connectivity between the rACC 
and the PAG [21]. Opposite effects can take place with negative ex- 
pectations. For example, the effect of an infusion of remifentanyl, a 
powerful opioid, can be annulled when participants are deceptively 
led to believe the medication was not delivered anymore [22]. 

Furthermore, anxiety and negative mood can heighten the pain 
experience to the same nociceptive input either through direct 
modulation of core pain-processing areas or via indirect means. 
These affective modulations in the experimental context seem to rely 
on recruiting other pathways, involving the prefrontal cortex (PFC), 
hippocampus, and amygdala, among other structures [23-27]. 

These concepts become more complex in the clinical situation, 
as the three levels (transduction, transmission, perception) can all 
be modified in chronic pain—as further discussed in the section on 
acute and chronic pain [28, 29]. For instance, the bi-directional con- 
trol means that the DPMS has also been involved in facilitating noci- 
ceptive processing. Indeed, it has now been shown that an imbalance 


between facilitation and inhibition (too much facilitation and too 
little inhibition) contributes to the development, maintenance, and 
exacerbation of chronic pain states in both animal (for example, 
[30, 31]) and human studies [32-35]. Therefore, an intervention 
in chronic pain only targeting the periphery (trying to dampen the 
volume of the transduction) is most often bound to fail, as it omits 
central processes, such as anxiety, depression, or central sensitiza- 
tion, which can upregulate the experience. 


Transition from acute to chronic pain—a process 


with possible predisposing factors 





Notall patients with acute pain go on to develop a chronic condition, 
and there seems to be different levels of risk. While there are not yet 
any readily useable detection methods for individuals at higher risk, 
there is a growing understanding of the vulnerability and protective 
factors that might lead to implement early preventive interven- 
tions or to develop better-targeted treatments. Vulnerability factors 
for developing chronic pain can be present at birth, such as female 
gender or genetic make-up; brought on by the environment such 
as adverse life events; or, in many circumstances, be the result of a 
combination of both (Fig. 130.3) [36]. On the other hand, inherent 
features of resilience are progressively being investigated and dem- 
onstrated to be different from vulnerability factors (or the absence 
thereof) [37]. For example, a novel line of research is focusing on the 
value of positive affect [38]—an important area for further research 
and development. 


Genetic factors involved in pain perception 


The genetic make-up of certain enzymes and receptors is of import- 
ance for pain perception. Catechol-O-methyltransferase (COMT) 
is an enzyme degrading catecholamine neurotransmitters (that is, 
adrenaline, dopamine, and noradrenaline) in the synaptic cleft, 
thereby regulating different neurophysiological functions. Certain 
frequent polymorphisms of COMT seem to affect experimental 
pain processing, endogenous opioid transmission, and perhaps the 
response to opioids [39]. While COMT variants do not appear to 
be predisposing factors for developing chronic pain, they have been 
shown to impact a number of affective and cognitive tasks in which 
the PFC plays a key role [40]. For example, COMT variations can af- 
fect the response to placebo modulations in patients [41] and could 
also be involved in the response to psychological approaches to pain 
management [40]. 

Preclinical models have shown that serotonin is involved in 
chronic pain, although it is a complex system with serotonergic re- 
ceptors demonstrating pro- or anti-nociceptive function, depending 
on a number of factors (type of animal model, anatomical site, sub- 
type of receptor). Polymorphisms in serotonin receptors could be 
involved in the regulation of pain affect and perhaps response to 
opioids [40]. 


Priming processes 


Animal studies of early-life exposures to pain are striking examples of 
how life events can mark future pain perception; it can result in height- 
ened pain sensitivity in fully developed individuals [42]. In addition 
to prior pain, early-life stress, such as maternal separation, can also be 
sufficient to induce hypersensitivity in later life [43]. Increased axonal 
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Fig. 130.3 Factors of vulnerability for a chronic pain state. These factors emerge at different levels, whether endogenous (genetic predispositions and 


cerebral structures) or environmental or through a combination of the two. 


Adapted from Nat Neurosci., 17(2), Denk F, McMahon SB, Tracey I, Pain vulnerability: a neurobiological perspective, pp. 192-200, Copyright (2014), with permission from 


Springer Nature. 


sprouting, NGF-induced neuronal plasticity, changes to the opioid 
system, or alterations in the hypothalamic—pituitary—adrenal axis and 
in the spinal microglia could possibly explain these processes [42, 44]. 
Parallel human work was done through neuroimaging in school-age 
children who were born preterm; central functional alterations have 
been demonstrated, with an impact on cognitive outcomes and pain 
processing [45, 46]. A similar mechanism, known as priming, has been 
demonstrated in adult animals; for example, repeated nerve injury 
or stress before an experimental nerve lesion can result in prolonged 
hypersensitivity, when compared to a single lesion [47]. The main path- 
ways examined to explain this phenomenon are the peripheral afferents 
and the spinal microglia [36]. 

Hypothetically, early-life stressors that are known to relate to neur- 
oticism and anxiety could have developmental influences on the DPMS 
via alterations in the coupling of the amygdala—-PFC network to the 
brainstem nuclei, leading to an unfavourable imbalance in inhibitory 
and facilitatory drive, which could predispose individuals towards 
developing chronic pain. Based on this hypothesis, personal specificities 
such as variants in personality, psychological traits, response to opioids, 
or reactivity to standard pain testing could rely on changes in central 
networks and represent risk factors for increased pain perception [32, 
48-51]. While differences were found at the central level between these 
groups and controls, it is unclear if the central correlates of these be- 
havioural and psychological variants in healthy people translate to 
any clinically meaningful impact regarding the risk for chronic pain. 
Nevertheless, testing the DPMS function in patients could classify them 
for the risk in response to surgery; two pain modulatory processes 
thought to be mediated through the DPMS—temporal summation 
(that is, an individual’ level of pain in response to a rapid succession of 
similar noxious inputs) and diffuse noxious inhibitory control (DNIC) 
(that is, the level of pain reduction when the stimulus of interest is as- 
sociated to a second noxious stimulus)—were shown to be predictive of 
post-surgical pain levels [52, 53]. 


Importance of the 


reward-motivation-learning system 





While studies in patients have shown clear functional and structural 
differences, compared to healthy controls, this does not provide 


evidence of a causal link. It also leaves open the question of whether 
these changes in central networks are causal or consequential to 
chronic pain. Yet, recent longitudinal studies have underlined the 
particular wiring of the central reward—motivation-learning net- 
work as a potential key factor of vulnerability (Fig. 130.2). This is 
especially interesting, when one considers relief as an integrated re- 
ward process in pain [54]. An impressive and large prospective ob- 
servational neuroimaging study followed 159 patients with subacute 
low back pain over the course of a year (N = 69 completed follow- 
up) as they recovered (N = 30) or developed chronic pain (N = 39, 
prolonged follow-up to 3 years) [55-59], providing important data 
and insight into the evolution of chronic pain. Results from the 
first neuroimaging session during the subacute pain phase dem- 
onstrated that greater functional connectivity or ‘coupling’ of the 
nucleus accumbens (NAC) with the PFC was strongly predictive of 
pain persistence. Hence, cortico-striatal circuitry could be causally 
involved in the transition from acute to chronic pain. Interestingly, 
this stronger coupling remained present throughout the transi- 
tion to chronic pain, despite grey matter density decreases in the 
NAC. Furthermore, in the group that went on to develop chronic 
pain, white matter fractional anisotropy differences in the PFC were 
identified at an early time point, possibly representing structural 
vulnerabilities. These results suggest a role for the brain’s reward- 
motivation-learning circuitry during the transition from acute to 
chronic pain. Finally, stronger white matter and functional connec- 
tions within the dorsal medial PFC-amygdala—accumbens circuit, 
as well as smaller amygdala volume, represented independent risk 
factors, accounting together for 60% of the variance of pain persist- 
ence at 3 years’ follow-up [59]. 

Separate work has underlined differences in NAC activity in re- 
sponse to relief from acute experimental pain between controls and 
patients with chronic low back pain [60], as well as patients with 
fibromyalgia [61]. 

Healthy volunteer studies have also shown that baseline activity 
in the ventral tegmental area and the NAC was predictive of sub- 
sequent opioid-induced analgesia and central activity in the DPMS 
[51]. Animal studies supported also the need for a functioning re- 
ward system to experience analgesia [62]. 

Provided the appropriate relative context, the hedonic value of pain 
can be ‘flipped’ from threat to reward, with increased activity in reward 
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regions, working in concert with the DPMS [18]. Such work provides 
further evidence for the importance of these networks in pain appraisal, 
which is a key feature of ongoing chronic pain states. Furthermore, 
dispositional optimism and pessimism influence the central pro- 
cessing of unexpected relief outcomes, with diametrically opposite 
NAC activity [63]. Finally, the dopaminergic system has also been in- 
volved in placebo effects, with a correlation between placebo analgesia 
and levels of dopamine release [64]. While these findings raised the 
question of possible direct effects of dopamine on placebo analgesia, an 
interesting separate study suggested rather that this dopaminergic ac- 
tivity is related to the relief of pain because when fully blocked through 
a dopamine antagonist, the placebo effect still took place [65]. 

It seems likely that the transition to chronic pain is dependent on 
the state of the mesolimbic—prefrontal connectivity and function, 
that is, of the motivation-learning and reward circuitry. Yet it re- 
mains ambiguous what this means. While it has been interpreted as 
a predictor for a transition from acute to chronic pain, an alternative 
explanation could be that these patients suffer from an inability to ex- 
perience relief and hence analgesia from treatment, that is, they have a 
‘broken analgesic system and so, by definition, remain in pain as they 
get no relief [66]. 


Other cognitive factors of risk 





Other cognitive factors have been investigated as predisposing to 
chronic pain. Most notably, catastrophizing, a cognitive-emotional 
process involving magnification, helplessness, and rumination 
when facing pain, has been shown to be tied to negative outcomes 
[67] and to be an independent predictor of ill adaptation to pain and 
pain-related affective dysregulation [38]. 

Another proposed vulnerability factor could consist of the chosen 
attentional focus when confronted with concomitant nociceptive 
stimuli and cognitive tasks. In fact, two different profiles were iden- 
tified, with participants demonstrating either a ‘pain focus’ or being 
able to remain focused on a parallel cognitive task, with resulting 
faster reaction times [68]. Participants focusing on pain demon- 
strated more grey matter in regions implicated in pain and sali- 
ence, greater functional connectivity in sensorimotor and salience 
resting-state networks, less white matter integrity in the internal and 
external capsules, and anterior thalamic radiation and corticospinal 
tract, while well matched for sex, neuroticism, pain sensitivity, and 
catastrophizing. Interesting parallels can be drawn with the litera- 
ture on anxiety where cognitive biases towards threat, as well as in- 
efficient attentional regulation of emotionally and non-emotionally 
salient stimuli, have been demonstrated [69, 70]. 


Chronic pain as a brain disease: maladaptive 





plasticity and psychological maintaining processes 


Once pain is chronic, the central circuitry involved in processing 
further nociceptive stimuli might be quite different from what is 
studied in acute pain, both at a structural and at a functional level. 
In fact, meta-analytic studies have shown decreases in grey matter 
volumes in different cortical and brainstem areas, as well as increases 
in the thalamus, and this has been found across different chronic 
pain conditions [71]. Whether these differences are causal or 


consequential to chronic pain is still up for debate, although human 
and animal studies are suggestive of pain causing the changes, as 
when pain is relieved, the structural changes normalize [72-75]. 
Such reversal is important, as it calls into question the early inter- 
pretations of what these volume changes represent (that is, incom- 
patible with neurodegeneration) [76]. 

A cerebral region of specific interest is the hippocampus, given 
its role in pain amplification in the context of anxiety and stress; re- 
duced hippocampal volumes were found in patients with chronic 
pain, compared to controls, suggesting a sustained endocrine 
stress response. This came with associated increased activity in the 
hippocampal complex in response to acute pain [77]. These elem- 
ents together could, in turn, contribute to the persistent pain state. 

Ata functional level, pain perception in patients with chronic low 
back pain involves less somatosensory and more limbic networks, 
compared to acute pain models [56]. As mentioned, there is also an 
imbalance in the DPMS, with too little descending inhibition and 
too much descending facilitation. In fact, central sensitization is a 
normal adaptive response post-injury; yet failure to resolve this be- 
yond the acute stage is a maintaining factor of chronic pain states. 
The signs of central sensitization have been demonstrated by in- 
creased PAG activity in certain patients with osteoarthritis [32], 
replicating experimental human models of brainstem amplification 
mechanisms [34, 35, 78]. This factor could also be involved in func- 
tional pain syndromes or fibromyalgia [79, 80]. Importantly, central 
aberrant mechanisms are the target of duloxetine, a serotonin and 
noradrenaline reuptake inhibitor that supposedly ‘rebalances’ the 
inhibitory/facilitatory drive, as well as of tricyclic antidepressants 
[81]. In fact, a study in painful diabetic neuropathy found that poor 
DNIC efficiency predicts the efficacy of duloxetine and that this 
treatment allowed to re-establish normal DNIC [82]. 

Of course, the poster-child for maladaptive plasticity is phantom 
limb pain where original theories suggested that altered brain rep- 
resentation of body parts into the ‘vacated’ deprived cortex (from 
the missing limb) causally produced pain in patients [83]. However, 
recent work now suggests different interpretations might better ex- 
plain the pain of phantom limb, with lively debate and discussion 
ensuing [84-87]. This is not only an academic question; the mal- 
adaptive model has also been used to support treatments, which 
might be misdirected in this newer light [88, 89]. 

Psychiatric comorbidity is extremely frequent, with most patients 
suffering from chronic pain declaring symptoms of anxiety or de- 
pression [90-92]. Post-traumatic stress disorder is also a frequent 
co-occurrence with possible reciprocal maintenance [93, 94]. As 
in the experimental context described, depressed mood can have a 
negative impact on pain control and perception [95]. Furthermore, 
catastrophic thinking has been shown to affect pain perception 
through altered PFC activity [96]. Anxiety, fear of pain, avoidance, 
and related cognitive behavioural mechanisms have been exten- 
sively described as relevant to pain suffering and loss of function 
[97, 98]. These factors need to be addressed through psychological 
or mind-body approaches in chronic pain [99-101]. 


Therapeutic implications 





Clarifying the maintaining factor, whether peripheral or central, 
and diagnosing emotional, cognitive, and neurogenic (autonomous 
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Fig. 130.4 Contrasting acute pain coping to the chronic pain condition. This non-exhaustive graphical representation highlights some of the 
commonalities and differences in neural processing areas and cognitive-emotional processes between acute pain in the healthy state and chronic 
pain pathology. The alterations in chronic pain shown here cannot be addressed by acute pain medication and require psychiatric or rehabilitative 


approaches to achieve improved function. 


nervous system, DPMS, etc.) processes allow for appropriate treat- 
ment targeting. When there is evidence for central maintaining 
factors, it appears illusory to expect that medications, which act per- 
ipherally, such as non-steroidal anti-inflammatory drugs, could ad- 
dress the full depth of the problem. Molecules acting centrally and 
psychotherapy, as well as rehabilitation, may be needed (Fig. 130.4). 
Hence, there is an indication for medications such as duloxetine, tri- 
cyclic antidepressants, gabapentin, and pregabalin [81]. Opioidergic 
agents present somewhat a special case worth discussing. Opioids 
act centrally, at the very core of the DPMS, and there is even evi- 
dence for acute relief of emotional suffering [102]. Yet the clinical 
literature suggests long-term opioid therapy has a rather poor out- 
come in chronic pain [103], and there is a concerning association 
between opioid dose and the risk of activity interference, depres- 
sion, and suicide [104, 105]. Mechanisms underlying the adverse 
outcomes of long-term opioid therapy for chronic pain are not yet 
well understood, but tolerance and induced hyperalgesia (a form of 
central sensitization due to opioids) could be involved. 

Psychological and mind-body approaches, such as cognitive be- 
havioural therapy, meditation, hypnosis, or yoga, are supported by 
clinical research [101, 106-108]. Increasingly, there is also brain 
imaging evidence for such therapies, with hints at the mechanisms 
involved in their benefits, mostly from healthy volunteers [109- 
112]. In patients, it has been demonstrated that a course of cogni- 
tive behavioural therapy can increase PFC activation of patients with 
fibromyalgia in response to pain or change resting-state connectivity 
that seems tied to catastrophizing [113, 114]. 

In sum, treating the psychiatric comorbidity and adverse psycho- 
logical adaptation mechanisms needs to be taken as seriously as ad- 
dressing the physical injury [115, 116]. This understanding is key 


to the involvement of psychiatrists in the treatment of chronic pain 
[117], a field where multi-disciplinary care has shown to be very im- 
portant [118] and in which exclusive opioid pharmacological treat- 
ment has been recently strongly warned against [103, 119]. 


Considerations on somatic symptom disorder 





In certain circumstances, chronic pain is associated with somatic 
symptom disorders, which can be diagnosed in the presence of at 
least one of three psychological criteria: health anxiety, dispropor- 
tionate and persistent concerns about the medical seriousness of the 
symptoms, and excessive time and energy devoted to the symptoms 
or health concerns. However, this new DSM-5 diagnostic category 
is under debate [120]. Beyond the syndromes including pain, the 
neural mechanisms involved in somatic symptom disorders are not 
yet well qualified [121]. Our understanding of certain functional 
pain syndromes, as well as somatic symptom disorders, might benefit 
from new theories and research regarding the nature of perception 
[122]. This line of research has also offered explanatory models for 
placebo effects [123]. Bayesian models of perception suggest that 
prior information can alter the interpretation of new stimuli, with 
prior beliefs and sensory information biasing new perceptions by 
overriding the current input [124]. 


Conclusions 


Finally, whatever the cause, the underlying risk factors, or the spe- 
cific pathophysiological path of a chronic pain sufferer, ‘terminal’ 
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chronic pain can be seen as failure of an individual’s coping abilities 
and of their adaptation resources. If the condition has been ongoing 
for long enough, patients are likely to present with depression or 
anxiety, social isolation, functional impairment, and possibly opioid 
reliance. They are at a dead-end, which requires a fundamental 
change of course to get back to a life worth living and to experience 
well-being and happiness. When medications, surgeries, and pro- 
cedures are not helpful, psychological techniques fostering change 
are required. In any case, the comorbid psychiatric or psychological 
conditions, which can act as maintaining factors, must be addressed. 
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Clinical presentation and epidemiology 


Fibromyalgia (FM) is the second most common musculoskeletal 
presentation, after osteoarthritis. It is a complex presentation (Box 
131.1) [1], combining the core symptom (chronic widespread pain) 
with other very commonly associated symptoms (for example, fa- 
tigue, sleep disturbance, cognitive difficulties, non-pain somatic 
symptoms, low mood, anxiety). In addition, coexisting somatic syn- 
dromes [for example, irritable bowel syndrome (IBS), chronic head- 
ache, temporomandibular disorder, chronic fatigue] are common. 
Its complex nature, spanning traditional medical specialty bound- 
aries, presents considerable challenges to health care professionals. 
FM is best conceptualized as a centralized disease state, in which 
symptoms and sensations from any part of the body are centrally 
amplified, rather than as a peripheral disorder of fibrous tissue and 
muscle. 

The point prevalence of FM is estimated at 2.7% globally [2]. 
It is higher among women, with a female:male ratio of perhaps 
2:1, a ratio which has shifted to much closer to parity since the 
requirement for multiple tender points was dropped from the 
diagnostic criteria. Prevalence is higher in middle-aged and older 
adults and in lower socio-economic status groups, especially the 
unemployed. 


Diagnostic issues 


FM was known by other terms until 1990 when diagnostic criteria 
were published that included both symptoms and, on examination, 
‘tender points’ [3]. Recently, updated criteria have emphasized the 
importance of chronic (>3 months) pain that is widespread across the 
body [4]. Importantly, an FM diagnosis does not now exclude the 
diagnosis of any other disorder, and additional diagnoses are likely 
in order to fully describe the clinical condition of a typical patient. 
It is likely that the diagnosis and classification will start to consider 
FM and similar, often overlapping, disorders in a more integrated 


way, emphasizing their considerable similarities, rather than their 
differences [5, 6]. 

FM has some unique features but shares some symptoms, epi- 
demiological features, and underlying mechanisms with other ‘chronic 
overlapping pain conditions’ (COPCs) (for example, tension-type 
headache, temporomandibular disorder) [5] and chronic non-pain 
conditions [for example, chronic fatigue syndrome CFS)]. Indeed, FM 
is commonly comorbid with COPCs with non-pain somatic symptom 
syndromes (for example, CFS and IBS) and with mental disorders (es- 
pecially anxiety disorder and depressive disorder). Furthermore, the 
population of patients with rigorously diagnosed FM is not discrete— 
there is a population penumbra, with patients on either side of the 
cutpoints of diagnosis being similar, rather than different. In other 
words, we are not carving nature at the joints. Finally, clinical diagnoses 
of FM are not confirmed by objective diagnostic tests, because such 
tests do not exist. These issues present major challenges for clinicians, 
and especially for researchers, and may also contribute to diagnostic 
delay; in a typical FM patient, there has often been more than 2 years of 
symptoms, and several medical consultations, before diagnostic con- 
firmation [7]. A positive early diagnosis is likely to improve outcome, 
and FM should be suspected in any patient with chronic widespread 
pain not straightforwardly explained by injury or inflammation. 


Aetiology 


This chapter focuses on the role of three key factors which form the 
focus of successful treatment: central sensitization, fear avoidance, 
and physical deconditioning. Interested readers are referred else- 
where [8] for a broader review of psychosocial factors in chronic 
pain. A broader range of relevant factors are listed in Table 131.1. 


Central sensitization 

In the last two decades, the focus for understanding FM has shifted 
from peripheral nerves and muscles towards the role of the cen- 
tral nervous system (CNS). This shift is reflected in the range of 
recommended treatments, all of which act directly (for example, 


CHAPTER 131 Treatment of fibromyalgia (chronic widespread pain) and chronic fatigue syndrome 


Box 131.1 The ‘FFIBRO’ mnemonic for the core symptoms 


of fibromyalgia 




















Fatigue Fatigue and tiredness 
Fog Cognitive dysfunction, memory and concentra- 
ion impairment 
Insomnia Poor sleep initiation, sleep maintenance, poor 
sleep refreshment 
Blues Depression, anxiety 
Rigidity Stiff muscles, stiff joints 
Ow! Pain, which is widespread across the body and 
ong-standing 
Source: data from Nat Rev Rheumatol., 5(4), Boomershine C, Crofford L, A symptom- 
based approach to pharmacologic management of fibromyalgia, pp. 191-199, 
Copyright (2009), Springer Nature. 





antidepressants) or indirectly (for example, physical exercise) on the 
CNS. CNS sensitization appears to play a central role, such that the 
brain amplifies signals arriving from the peripheries by facilitating 
spinal transmission and by modifying cortical processing [9]. The 
person therefore feels more pain than is expected from the nocicep- 
tive stimulus (hyperalgesia) or feels pain from a stimulus that does 
not usually cause pain (allodynia). Central sensitization therefore 
contributes to the development of FM, other centralized pain states, 
and non-pain somatic symptom syndromes and allows us to attri- 
bute and manage a single cause in complex patients presenting with 
multiple symptoms spanning body systems. 


Fear-avoidance (or ‘kinesiophobia’) 


The person's fear of pain leads to avoidance of activities believed to 
cause or worsen the pain, that is, avoidance of movement and phys- 
ical exertion. This leads to physical deconditioning and adverse psy- 
chological consequences, and thereby to exacerbation of the pain 
experience [10]. Importantly, the fear-avoidance model incorporates 


Table 131.1 Aetiological factors in FM (key treatment targets 
in italics) 


Central sensitization 

Reduced physical activity and consequent physical 
deconditioning 

Genetic/familial factors 

Maladaptive posture/gait 

Non-restorative sleep 


Physical 


Fear-avoidance 

Cognitive factors, including catastrophizing, reduced 
self-efficacy 

Patient behaviours, including hypervigilance, reduction of 
activities 

Abnormal mood states, including depression and anxiety 


Psychological 


Social Reactions and responses of others, including: 

e Health care professionals, including over- 
investigation, medicalization, and undertreatment by 
psychological means 

è Relatives and other carers 

Unemployment or underemployment 

Reinforcement of illness behaviour by, for example, financial 


benefits/provision of physical aids 


both cognitive elements (beliefs about the consequences of move- 
ment and about the meaning of pain) and behavioural elements 
(avoidance of movement and exercise). Fear-avoidance is common 
in FM patients and correlates well with symptom severity and phys- 
ical disability [11]. It provides a target model for treatment develop- 
ment with wide applicability in chronic musculoskeletal pain, and 
emerging evidence suggests that reduced fear-avoidance mediates 
the physical benefits of structured exercise interventions in FM. 


Physical deconditioning 


This plays a key role in the development and maintenance of central 
sensitivity. A vicious cycle is set up (Fig. 131.1). 

These three factors—central sensitization, fear-avoidance, and 
physical deconditioning—usually enable a coherent explanation 
of causation to be provided to patients and carers. They also pro- 
vide a firm basis for a management plan that is fundamentally 
patient-centred, incorporating: a phenotype that it predisposed to 
be pain-prone; fears about the impact of physical activity and be- 
liefs about the need to rest and protect the body from the adverse 
effects of disease; and consequent and inherently reasonable behav- 
ioural responses, including symptom monitoring and excessive rest. 
These maladaptive beliefs and behaviours can be reinforced by the 
responses of relatives and other carers, and also by the responses 
of health care practitioners; both groups may encourage rest, in the 
hope that this may enable the body to recover from an as yet undis- 
covered disease process. 


Assessment 


In primary care, the assessment of FM will necessarily be pragmatic. 
In secondary care, clinicians should aim for a thorough under- 
standing [12]. In both settings, assessment is a key part of treatment, 
not only by guiding it, but also by providing the trust and reassur- 
ance that chronic pain patients need to be able to move forward with 
potentially counter-intuitive approaches such as exercising des- 
pite increased pain. The interested reader is referred to Lotze and 
Moseley [13], who eloquently integrate compassion and modern 
pain science in the clinical assessment. 
Assessment should include: 


e Pain behaviours, for example, complaints of pain to others, 
including health care professionals, requests for medicines, avoid- 
ance of feared activities, reduced mobility, use of aids, protective 
postures. 

e Pain cognitions, including dysfunctional beliefs, such that pain 
signals an underlying disease or that pain should respond to 
avoidance of usual activities; a tendency to catastrophize, that is, 
to assume the worst possible outcome; and low self-efficacy, that 
is, a lack of confidence in the person's own ability to influence their 
pain, now and in the future. 

e Pain impacts, including on mood (low or anxious), on sleep, and 
on physical, social, and occupational functioning, 

e History from corroborants, to confirm the patients beliefs and 
behaviours, and allow the assessment of carers’ beliefs and 
behaviours. 


An appropriate physical examination should be carried out to: (1) 
rule out the presence of the inflammatory arthritides or, if one is 
present, to assess its contribution to overall morbidity; (2) assess the 
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(reduced exercise tolerance) 











Fig. 131.1 Vicious cycle of deconditioning. 


extent of physical deconditioning; and (3) provide appropriate re- 
assurance to the patient that important physical diagnoses have been 
appropriately considered and excluded. A small number of appro- 
priately targeted investigations are likely to be appropriate, but each 
should be clearly justified [14]. Clinicians should carefully balance 
the need for physical investigation with the prudent avoidance of 
over-investigation and over-medicalization of FM. 


Management 


FM can be challenging to manage, but clinicians should be encour- 
aged that a reasonable evidence base exists and that this has helpfully 
been summarized and interpreted in recent clinical guidelines and 
reviews [14-18]. The guidelines differ in their details, partly due to 
recent changes in regulatory approvals for specific pharmacological 
approaches. However, there is broad agreement on principles (Box 
131.2). In addition to these clinical guidelines, a recent review has 
documented the important academic and leadership challenges [19]. 
Finally, there is scope for a note of caution; for most of the evidence- 
based treatments described in this chapter, there remains a dearth of 
randomized controlled trials (RCTs) of high quality. An inevitable 
side effect is that robust evidence for comparative effectiveness of in- 
dividual treatments is very limited. The interested reader is referred 
to a recent network meta-analysis [20]. 


Overall approach 


RCTs have demonstrated the effectiveness of several treatments, 
both non-pharmacological and pharmacological, in FM. Initially, 
non-pharmacological approaches should be pursued [18]. They 
may be used in combination in an integrated, multi-disciplinary 


Box 131.2 Key aspects of management of fibromyalgia, derived 


from current clinical guidelines 





1 Deliver initial care for FM in primary care; 

2 Incorporate both pharmacological and non-pharmacological ap- 
proaches, but with an initial focus on the non-pharmacological; and 

3 Encourage active patient participation in treatment, facilitated by 

4 High-quality patient education. 





(reduced muscle tone and volume) <———_ 





Increased pain, fatigue, and other 
physical symptoms 











Activity avoidance 
(mediated by beliefs that, for example, rest 


will reduce symptoms and enable recovery 
from the ‘disease’) 








way (for example, patient education plus exercise programme 
supervised by a physiotherapist). Failure of this approach should 
lead to the considered introduction of a pharmacological agent, 
alongside continued non-pharmacological management and with 
ongoing monitoring of key symptoms and function. The approach 
throughout is individualized for the particular needs and prefer- 
ences of a particular FM patient. 

Self-management plays a fundamental role in the management of 
FM throughout the spectrum of severity. Patients must understand 
the nature of the illness, its likely causes, the role of health care prac- 
titioners, and, importantly, their own role in active day-to-day man- 
agement. This statement applies equally to key carers. Table 131.2 
outlines the characteristics of an optimal management approach to 
be promoted by the health care professional to the FM patient. 

The European League Against Rheumatism (EULAR) guideline 
[18] provides a flowchart incorporating a stepped-care approach 
(Table 131.3), which facilitates prompt treatment. It is important to 
intervene promptly when patients present with FM, because longer 
pain duration is associated with the development and embedding 
of maladaptive illness behaviours, kindling of maladaptive pain 
pathways, more challenging management, and poorer prognosis. 
Referral to secondary care is appropriate when the illness does not 
respond to initial treatments, when there is uncertainty over the 
diagnosis, perhaps in the presence of significant comorbidity, and 
when there is significant psychiatric comorbidity that does not re- 
spond to initial treatments. 


Empirically supported treatments 


The specific treatments for FM for which there is reasonable or 
good evidence are now addressed in turn. Each treatment will be 
outlined, key clinical issues described, and the available evidence 
summarized. 


Patient education 


Effective patient education is pivotal to the effective care of FM. 
Multi-faceted, patient-focused education is likely to provide much 
needed coherence and reassurance that there is hope. Whenever 
possible, close family members/key carers should also be involved in 
discussion (see Table 131.4 for the key messages for patients). 
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Table 131.2 Characteristics of an optimal management plan in FM 
Internally consistent That is, with a clear conceptualization of treatment 
that is based on an explanation of causes, especially 

maintaining factors 


Externally consistent That is, deliberately target and reduce or eliminate 
ambiguity between involved health care professionals 
concerning the causes of the illness, and the methods 


and goals of treatment 


Incorporating the 
reduction or stopping 
of inappropriate 
medication 


That is, those that are inconsistent with the aetiological 
understanding being promoted, and that are not 
required for the management of comorbidities 


Multimodal That is, involving both physical and educational/ 


psychological components, in every case 


Promoting a graded 
increase in physical 
activity 


Aiming to reduce disability and improve morale and 
well-being, rather than to reduce pain 


Promoting an increase Rather than maintaining dependence on health 
in patient responsibility services 
and independence 


Individualized To the needs and preferences of the specific patient, 
and to their response or non-response to specific 


treatments 


Structured With a beginning (assessment, education, first 
step), a middle (empirical, stepwise approach to 
selecting successful treatment(s)), and an end (relapse 


prevention) 


Addressing the 
information needs of 
carers 


Who need to share a common understanding of 
causes and cures, and the chosen approach 


Adopting the So that the maximum number of patients can benefit 
principles of stepped from appropriate management, in a resource- 
care constrained health care system 


The first, and fundamental, part of patient education is giving the 
diagnosis. Most FM patients will have tolerated incoherent and in- 
consistent explanations, without a clear way forward, for months 
or years. There is reassuring evidence that an FM diagnosis does 


Table 131.3 Stepped care according to the revised EULAR 
guidelines 


Patient education and information sheet; 
leading to, if insufficient response ... 


Step one 


Step two Physical therapy, with individualized graded physical exercise, 
which may be combined with other recommended non- 
pharmacological therapies such as hydrotherapy/acupuncture, 
if necessary; 


leading to, if insufficient response ... 


Step three Reassess the patient to target individualized treatment, focused 
as follows: 
e |f pain-related depression, anxiety, catastrophizing, 

overly passive—psychological therapies—mainly 

CBT—for more severe depression or anxiety, consider 
psychopharmacological treatment 

© |f severe pain or sleep disturbance, then 

pharmacotherapy: for severe pain—duloxetine, pregabalin, 
ramadol; for severe sleep problems—low-dose amitriptyline, 
cyclobenzaprine (tricyclic-related), or pregabalin at night 

e |f severe disability or sick leave, multimodal rehabilitation 
programmes 











Source: data from Ann Rheum Dis., 76(2), Macfarlane G, Kronisch C, Dean L, et al., EULAR 
revised recommendations for the management of fibromyalgia, pp. 318-328, Copyright 
(2017), BMJ Publishing Group Ltd and the European League Against Rheumatism. 





not reinforce the sick role, and most patients should respond posi- 
tively to a clear diagnosis, supported by a coherent and rational 
explanation. 

Crucially, however, a diagnosis is not simply about delivering a 
name for the disorder. It is about explaining the nature of the illness, 
explaining that it is common and affects many people, and pro- 
viding a coherent explanation of the cause which maps onto the 
proposed management plan. A high degree of fit between patient 
understanding of the cause (and especially maintaining factors) and 
the management plan is likely to predict treatment engagement and 
clinical response. ‘Explaining pair (EP) treatments or ‘pain biology 
education’ are of increasing interest [13, 21]. In these, the focus is 
on helping the pain patient understand that pain is a creation of the 
brain, designed to indicate the possibility of harm, rather than an 
accurate measure of tissue damage. The focus is on using education 
about pain biology to change the meaning of pain which then alters 
the experience of pain. Clearly, there is significant overlap here with 
more explicitly cognitive interventions. 

The core of patient education should be about effective manage- 
ment, as patient behaviours are pivotal. The patient needs to under- 
stand what is needed to improve outcomes, and needs to feel enabled 
to start to act (self-efficacy). A graded increase in physical activity is 
the critical intervention, and patients must understand the rationale 
for this somewhat counter-intuitive approach in order to make a 
material and persistent change in their lifestyle. Some brief cognitive 
work may be needed to tackle unhelpful beliefs impeding engage- 
ment with a graded exercise programme. This emphasizes that edu- 
cation should be individualized to the needs of the specific patient. 
For example, patients who fear the adverse consequences of phys- 
ical activity (fear-avoidance) need a targeted cognitive intervention, 
perhaps assisted by behavioural experiments. Such an intervention 
need not take place within the feasibility constraints of a formal psy- 
chological treatment but rather should usually be delivered by non- 
specialist staff in the clinic setting. 

Evidence for the effectiveness of patient education in FM is ex- 
tensive but is skewed towards highly time- and resource-intensive 
educational interventions. There are clearly important questions 
about the feasibility of such interventions in primary care. However, 
the fundamental importance of effective patient education is not 
doubted. The clinical challenge is to maximize its effect within the 
usual significant resource constraints. Undoubtedly, good-quality 
written information can help to leverage benefit, with minimal add- 
itional cost. Careful selection of the recommendations for a particular 
patient and active endorsement of the treatment approach by a trusted 
clinician are also critical. 


Graded exercise 


A graded, but persistent, increase in physical activity is the critical 
recommendation for patients with FM. EULAR guidance [18] con- 
cludes that there is ‘strong evidence for’ exercise and that exercise 
may benefit pain, function, sleep, and morale. 

The relevant evidence comprises many RCTs, which have been 
summarized in three key Cochrane Collaboration reviews [22- 
24]. The best evidence is for supervised aerobic exercise training, 
which improves aerobic fitness, pain, physical function, and well- 
being. Strength (resistance) training also has a range of benefits and 
appears to be more effective than flexibility training, but possibly 
less effective than aerobic exercise. Aquatic (that is, water-based) 
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Table 131.4 Key messages for patient education 
You have a real illness 
help. 

The illness is called FM 
FM is problematic, but benign 


FM is an illness of the nervous system, not 
an illness of the muscles and joints 


We see many patients with problems like yours. We know that you are not imagining it or making it up. We want to 


It is common—about one in 40 people have this illness. 
It is an illness that causes symptoms and disability. But it is not an illness which causes a long-term decline. 


In FM, the body is especially good at generating pain signals, even when there is no cause for the pain. This is why 
medicines usually used for other illnesses, such as epilepsy and depression, can help; they change the way that the 


nerves in the brain and spine transmit pain signals from the muscles. 


FM is not caused by persistent infection 


explains their symptoms.) 


Exercise is the most effective treatment that 
we have for FM 


Exercise causes temporary muscle 
discomfort, which is a sign of the muscle 
reorganizing to become stronger 





FM symptoms are worsened by stress and 
low mood 





FM is helped by active involvement of the 
patient 


With treatment, FM can have a much 
smaller impact on your lifestyle 





he future. 


exercise training appears to be as effective as land exercise training. 
The choice of exercise modality (aerobic vs strength/resistance; land 
vs water) therefore depends on patient preference and on feasi- 
bility, determined by both the availability and the impact of medical 
comorbidities. Fast walking, cycling, and swimming are good initial 
recommendations, as they have lower musculoskeletal impact than 
running. 

FM patients have often been through a lengthy period of reduc- 
tion in physical activity, frequency, duration, and intensity, due to 
their pain and related symptoms, and the majority would be clas- 
sified as sedentary (<5000 steps per day). A sudden and significant 
step change in activity level is unlikely to be successful. Graded in- 
crements in activity, clearly prescribed or described by a health care 
professional or sports professional, are needed. Although small in- 
creases in activity (2000-3000 steps per day) may help, the greater 
the increase in activity, the greater the improvement in function, 
mood, and physical health-related quality of life, but not pain in- 
tensity [25]. Increased physical activity is, of course, also likely to 
have associated benefits for physical health (for example, body mass 
index, blood pressure, and glucose tolerance) and mental health. 

Early worsening of pain and fatigue is common, and patients need 
encouragement to persist and to attribute such symptoms to the 
body’s rebuilding in response to increased activity, rather than at- 
tributing them to worsening of a disease process. Many patients are 
concerned about pain, soreness, and tenderness arising in muscles 
in the hours and days following exercise, especially in the early stages 
of exercise training. This phenomenon is well known to athletes and 
others who undertake physical exercise for training purposes and 
is known as delayed onset of muscular soreness’ (DOMS). ‘This is 
an inevitable consequence of physical activity beyond the person's 
norm, in which muscular breakdown following exercise is accom- 
panied by soreness and muscular fatigue, before the muscles rebuild 


An illness, such as a flu infection, may be a trigger for the illness, but ongoing infection is very unlikely. (Note: many 
patients, especially those who also satisfy the criteria for CFS, believe that they are harbouring an occult infection which 


Exercise helps to build up the body and make it more resistant to challenges. Exercise improves blood flow to the 
muscles by causing blood vessels to grow more thickly through the muscle. Exercise may help to ‘reset’ the nervous 
system, so that it deals better with pain signals. 


Exercise breaks the body down before it builds it back up, and so muscle pain and muscle tenderness are entirely 
normal after exercise, and not a sign of illness or abnormality. Competitive and recreational athletes know this very well. 
Use a simple model to explain this, describing, for example, how muscle fibres disaggregate after acute exercise and 
hen reorganize over a period of 24-72 hours.) 


Learning self-help approaches to anxiety (relaxation, cognitive approaches, behavioural approaches) and depression is 
important. Antidepressants and other formal treatments, including CBT, also have a role to play. 


can't cure this illness for you. | need you to work hard with the treatment plan, and | will do my best to support you. 


Together, we can work at enabling you to manage and control your illness, so that it has much less of an impact on your 
ife. | want to work with you on your rehabilitation, focused on what you can do, and on what you would like to do in 


more strongly over a period of 2-3 days. In other words, increased 
muscle pain in this context is a sign of adaptation and improving 
physical fitness, rather than of disease. This is an important message 
for FM patients—physical sensations of pain and fatigue during and 
after exercise do not mean that the body is being damaged, but ra- 
ther they are signs that the body is responding normally to the chal- 
lenge and becoming stronger. It may therefore be better to use the 
word ‘training, rather than exercise, with FM patients. 

Understandably, motivation to exercise, and to exercise in a planned 
and strategic way over a period of weeks and months, is crucial to suc- 
cess. A minority of patients will find this straightforward, but a ma- 
jority will struggle and need assistance at the outset and in an ongoing 
way. Tips for success are listed in Box 131.3. 

So-called ‘activity cycling’ is common and unhelpful. In this situ- 
ation, patients are overactive on ‘good days’ and then underactive on 
the inevitable painful and tired subsequent ‘bad days. The aim should 
be that the patient gradually, but steadily, increases their physical ac- 
tivity level, independent of how they feel on the day—in other words, 
on good days, they may do less than what they feel capable of, but on 
bad days, they aim to do more than what they feel capable of. 


Cognitive behavioural therapy 


It makes intuitive good sense to incorporate cognitive elements into a 
purely behavioural intervention (exercise) for FM. Many patients will 
be reluctant to engage in a graded exercise programme, due to con- 
cerns about feasibility (for example, access to means or time), effective- 
ness (why should this help?), and toxicity (won't this make me worse’). 
Personalized assistance with overcoming practical concerns, and with 
the identification and challenge of maladaptive beliefs about the effect- 
iveness and adverse effects of exercise, is highly appropriate. There are 
several key cognitions in FM that deserve targeting. These include that 
pain or other symptoms indicate actual bodily harm, that exercise will 
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Box 131.3 Tips for success with exercise for FM 


e Involve key carers/relatives when explaining the rationale and 
he detail. 
e Explain the muscular deconditioning cycle, and check 
understanding (Fig. 131.1). 
e Be specific about what graded exercise is, and personalize the 
discussion. 
e Help the patient to set clear treatment goals. 
e Interpret ‘exercise’ very broadly and sensibly, especially when there 
are problems starting. 
e Emphasize the importance of persistent engagement with the exer- 
cise programme-—stopping and starting is unlikely to be helpful. 
e Consider using a mechanical pedometer or a smartphone app to 
provide a daily record of activity levels. 
e Be aware of the potency of fear-avoidance of physical activity. 
e Be aware of ‘activity cycling’. 




















cause pain or other symptoms, and that the increased symptoms fol- 
lowing exercise are a sign of increased activity in the disease process, 
which then leads logically to resumed avoidance of exercise. 

EULAR guidance [18] concludes that there is ‘weak evidence for’ 
cognitive behavioural therapy (CBT), based upon a Cochrane review 
[26] of 23 RCTs, in which there was a statistically significant difference 
in pain and mood at end of CBT and in which improvement was sus- 
tained at 6 months. Unfortunately, the RCTs have substantial hetero- 
geneity. It is therefore difficult to extrapolate directly to clinical practice 
in FM. Furthermore, there are several difficulties associated with the 
practical use of CBT, including highly varied cognitive behavioural 
interventions, the availability of therapists and their varied skill level, 
and varied treatment fidelity. Limited availability may, in part, be ad- 
dressed by innovative delivery via groups or the Internet [27]. 


Multicomponent treatments 


EULAR guidance [18] concludes that there is ‘weak evidence for 
the combination of treatments, and especially psychological or edu- 
cational therapies combined with graded exercise. A systematic re- 
view [28] defined multicomponent therapy as ‘at least 1 educational 
or other psychological therapy, and at least 1 exercise therapy’ and 
concluded that it was effective in reducing pain post-treatment, 
compared to control interventions. However, there is considerable 
complexity in the nature of ‘multicomponent’ interventions, espe- 
cially regarding their level of integration [29]. ‘Multi-disciplinary 
and ‘interdisciplinary’ interventions lie along a continuum where 
the latter implies that the various disciplines have a common, co- 
herent approach, rather than continuing to work within their 
own silo. 


Antidepressants and anticonvulsants (as ‘neuromodulators’) 


Most guidance, including EULAR [18], recommends pharmaco- 
logical approaches, including antidepressants, as second line, that 
is, following failed intervention with education and exercise. An op- 
timal therapeutic approach may be described as ‘cautious pharmaco- 
logical input’ [30]. A recent summary of medicines for which there is 
evidence of effectiveness in FM, including specific contraindications 
and cautions, is helpful [31]. 

Importantly, in treating the core syndrome of FM, antidepressants 
and anticonvulsants are not functioning via their impact on mood 


but are functioning as neuromodulators influencing central sensi- 
tivity; the analgesic effect of antidepressants occurs in FM patients 
who are not depressed and is independent of any antidepressant 
effect. It is important to emphasize this to FM patients for whom 
these medicines are being prescribed, to avoid their objection that 
either their illness is ‘real, and not ‘mental’ or that they are not de- 
pressed. Most patients will accept the explanation that antidepres- 
sants change the functioning of ‘nerves in the brain, to improve the 
outcome of depression, and that they can also therefore change the 
functioning of nerves in the spine and muscles, and so can alter the 
experience of pain in FM. 

Hauser suggested three main reasons for why antidepressants 
should be used second line: (1) modest evidence of effectiveness; 
(2) high individual variability in response; and (3) problematic ad- 
verse effects [32]. While she concludes, somewhat implausibly, that 
amitriptyline, duloxetine, and milnacipran ‘are first-line options for 
the treatment of FM, she goes on, thoughtfully, to state: 


‘Physicians and patients should be realistic about the potential 
benefits of antidepressants in FM ... A small number of patients 
experience a substantial symptom relief with no or minor adverse 
effects ... However, a remarkable number of patients drop out of 
therapy because of intolerable adverse effects or experience only 
a small relief of symptoms, which does not outweigh the adverse 
effects.” 


The usual recommendation is to start with a low or very low dose 
of amitriptyline (25 or 10 mg, or even 5 mg, at night) and to titrate 
up, according to tolerance, to 75 mg. However, many patients will 
not be able to tolerate this modest dose and, even if they do, many 
will take the medicine for less than a year [33]. The usual cautions 
apply in older patients, and especially those with cardiac disease. 

If amitriptyline is ineffective or poorly tolerated, then the sero- 
tonin/noradrenaline reuptake inhibitors (SNRIs) duloxetine or 
milnacipran should be considered [34]. The use of the tricyclic ami- 
triptyline and these SNRIs is firmly rooted in the relevant biology— 
both serotonin and noradrenaline influence the descending 
modulatory pain pathways. There is little randomized evidence to 
support the use of the SNRI venlafaxine in FM. Cyclobenzaprine, 
a tricyclic-like muscle relaxant, is another alternative to amitrip- 
tyline (‘weak evidence for’) [18]. Finally, other alternatives include 
the anticonvulsants pregabalin (‘weak evidence for’) and gabapentin 
(‘research only’) [18]. Both of these medicines affect voltage-gated 
calcium channels, which are modulators of afferent pain signals. 

The choice of an appropriate medicine should consider the 
symptom profile, likely side effects profile, likely patient tolerance 
of side effects, and patient preferences, which may be influenced 
by other FM patients’ experiences. Clinical practice tends to favour 
amitriptyline or an SNRI with prominent depression, an anticonvul- 
sant with prominent sleep disturbance, and an SNRI with prominent 
fatigue. Intolerance of medicines, even at low or very low doses, is a 
frequent problem [35], probably due to increased somatic sensitivity 
in FM, although other mechanisms are also likely to contribute to 
this nocebo effect. 





' Reproduced from CNS Drugs, 26(4), Hauser W, Wolfe F, Tolle T, et al., The role 
of antidepressants in the management of fibromyalgia syndrome: a systematic 
review and meta-analysis, pp. 297-307, Copyright (2012), with permission from 
Adis Data Information BV. 
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The prescription of strong opioid analgesics is inappropriate, as 
is long-standing prescription of any opioid. This reflects the lack 
of evidence for their effectiveness, with consequent lack of support 
from guidelines, alongside an increasing concern about their exces- 
sive use in some countries, with a high prevalence of use (10-60%) 
among FM opioid users [36]. Some evidence supports the use of 
tramadol, an opioid with some SNRI activity, and EULAR [18] con- 
cludes ‘weak evidence for; but adverse effects and misuse are fre- 
quent issues. 

Rather than ‘prescribe and forget, an integrated approach to 
pharmacological treatment should be adopted. This should include: 


e Careful patient and carer education; 

e Continued exercise/activity intervention; 

e Careful selection and targeting of medicines; 

e An empirical approach, based on simple trials of one medicine at 
a time, with patient monitoring of key symptoms and function; 

e Stepwise increase in dosing, to a dose likely to be therapeutic; 

e Monitoring of adherence; 

e Prompt discontinuation of ineffective medicines; and 

e Pruning of medicines lists to reduce iatrogenic harm. 


The last issue is crucial—FM patients are typically taking 8-10 
medicines, for a variety of reasons [33], and such a number is un- 
likely to represent optimal management, even where there are mul- 
tiple comorbidities. Application of simple n of 1 trials in this patient 
group may advance the rigour of decision-making and improve in- 
dividual patient outcomes from pharmacotherapy [37]. 

Emerging randomized evidence suggests that combinations of 
CNS-acting medicines may increase response rates. For example, 
in a recent randomized crossover trial, patients received placebo, 
pregabalin, duloxetine, and pregabalin—-duloxetine combination 
[38], with pain response of 18%, 39%, 42%, and 68%, respect- 
ively. Unsurprisingly, adverse effects, including drowsiness, were 
more common with combination treatment, and it seems prudent 
to reserve this approach for patients who have failed both first- 
line non-pharmacological treatment and second-line single-agent 
pharmacological treatment. 


Failure to respond 


Continued symptoms and disability are common, despite education, 
graded exercise, and a single pharmacological agent at the maximum 
tolerated dose. In this situation, a number of pragmatic approaches 
may be adopted, chosen according to the patient’s particular clinical 
situation and preferences. These include: 


e Combinations of drugs, rather than monotherapy. One approach is to 
combine a morning SNRI with an evening anticonvulsant. Another 
is to combine a morning selective serotonin reuptake inhibitor (SSRI) 
with a low dose of amitriptyline in the evening. There is some limited 
evidence that SSRIs alone may provide some benefit. Adjunctive 
medicines, such as paracetamol, tramadol, and non-steroidal anti- 
inflammatory drugs (NSAIDs), may give additional benefit. 


Supervised exercise or comprehensive rehabilitation programmes. 
Patients who do not respond adequately to an exercise pro- 
gramme, often due to either poor motivation or poor exercise tol- 
erance, may benefit from structured exercise in a more supervised 


setting, perhaps following referral to a physiotherapist, or in an 
exercise group supervised by a health care professional. 


Psychiatric referral. This may be appropriate when a patient with 
depression has not responded to first- and second-stage anti- 
depressant treatment, when suicidal ideation is significant or 
worsening, and when psychiatric problems may benefit from the 
input of the multi-disciplinary psychiatry team. 

e Pain management programme referral. For more complex patients, 
perhaps with multiple comorbidities, consider referral to a pain 
management programme, which delivers more intensive support 
for self-management, with a mixed group of chronic pain patients, 
but using core principles of education, graded activity, and cogni- 
tive behavioural approaches [39]. 


Other approaches for which there is more limited evidence such as 
complementary or alternative therapies. If the patient is enthu- 
siastic about an empirically unsupported treatment, it should be 
considered in a non-judgemental way if the clinician does not 
view it as actively detrimental or unsafe—the placebo effect is a 
powerful tool in FM. 


Empirically unsupported treatments 


EULAR guidance [18] does not recommend a variety of treatments, 
including biofeedback, capsaicin gel, hypnotherapy, massage, and 
S-adenosyl-methionine (all ‘weak evidence against’); other comple- 
mentary and alternative therapies (‘strong evidence against’); and 
chiropractic (‘strong against, due to safety concerns). It also does 
not recommend many other treatments due to inadequate evidence, 
including electrothermal therapy, phytothermotherapy, music 
therapy, storytelling, magnet therapy, transcranial magnetic therapy, 
and direct current stimulation. 

Many FM patients use complementary and alternative therapies. 
Clinicians should routinely ask patients about their attitudes to such 
therapies and about their use of them but should be clear that there is 
currently insufficient evidence to recommend them. Indeed, the use 
of such therapies alongside treatments with an established evidence 
base may be counterproductive, due to concomitant engagement 
with an alternative aetiological model. 


Management of comorbidities 


Comorbid diagnoses impact significantly on an FM patient's clin- 
ical presentation, prognosis, and treatment. It is therefore desir- 
able to fully diagnose comorbid disorders, in order to characterize 
the patient's situation, understand their likely illness trajectory, and 
plan appropriate and comprehensive management. While targeting 
multiple problems simultaneously is unrealistic, and unlikely to be 
effective, if one or two symptoms are especially problematic, they 
may benefit from targeted treatment. A syndrome such as IBS, for 
example, may benefit from dietary advice, increased physical activity, 
and an antispasmodic. 

Low mood sufficient to constitute a depressive episode deserves 
prioritization, as it reduces engagement with treatment, is associated 
with a worsened prognosis, and increases suicide risk, which is a 
significant concern in FM. In the presence of pain, mental disorders, 
such as depression, tend to be missed and, even when recognized, 
tend to be treated inadequately. This is despite their undoubted con- 
tribution to disability and chronicity. 
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Prognosis 


In secondary and tertiary care, continued pain, fatigue, and sleep 
and mood disturbance are the norm over a period of several years 
[40]. This reinforces the need for early identification and energetic 
evidence-based treatment in primary care, based upon the rehabili- 
tative model described in the previous section. The focus of many 
patients is on symptoms, including pain and fatigue; while these are 
unlikely to resolve, the impact on them and on health services can be 
significantly reduced. The aims of FM treatment should therefore be 
broad and include increased activity/reduced disability, improved 
mood and self-efficacy, increased employment, reduced sickness/ 
benefit payments, and reduced use and cost of health care services. 

The prediction of outcome in an individual patient at treatment 
outset is challenging. Psychosocial factors may help to predict 
prognosis, such as: (1) beliefs that the pain is harmful or severely 
disabling; (2) low mood and social isolation; (3) fear-avoidance; 
(4) an expectation that the patient will be a passive recipient of care, 
rather than an active participant in treatment; and (5) comorbid 
depression. 

Overall, FM is not associated with increased mortality, but it has 
been associated with a 10-fold increased risk of suicide in a sec- 
ondary care population [standardized mortality ratio (SMR) 10.5, 
95% confidence interval (CI) 4.5-20.7] [41]. This emphasizes the 
need for: 


e Therapeutic optimism, facilitated by and facilitating a pain re- 
habilitative approach; 

e The identification and management of comorbid depression; and 

e Routine or urgent referral for psychiatric assessment when needed. 


Relapse prevention 


Perhaps surprisingly, despite the chronicity of symptoms in most 
patients with FM, little attention has been paid to long-term man- 
agement and prevention of relapse from treatment response. There 
is some evidence that the benefits of exercise may persist [42]. 
However, predisposing factors, such as genetic vulnerability, imply 
that FM responders remain vulnerable to setbacks and that con- 
tinued improvement should not be taken for granted. 

A common-sense approach suggests that, prior to discharge from 
health care, responders should be reminded that: 


e They have a lifetime vulnerability to central sensitivity and FM. 

e They should maintain non-sedentary levels of physical activity. 

e They should identify early warning signs of a return of FM and 
share those signs with key relatives/carers. 

e Ifthe early warning signs emerge, they should redouble their self- 
care strategy. 

e If their self-care fails to deliver early improvement, they should 
seek additional help from their health care provider. 


It may be helpful to provide these key messages in written form for 
the patient's and carer's records. 


Chronic fatigue syndrome 


CFS is a complex disorder, in which severe fatigue, combined with 
several other symptoms, causes impairment across a wide spectrum, 


from mild restriction to severe and bedbound. Clinical manage- 
ment and advances in understanding are impeded by entrenched 
and diametrically opposed positions on the roles of rest and physical 
activity, which are each seen variously as either cause or cure. The 
following text reviews in brief some of the issues and ends with a plea 
for a meeting of minds. 


Clinical presentation and epidemiology 


Fatigue is a common, almost universal, sensation and a normal 
bodily response to sustained effort. The challenge for clinicians and 
researchers has been to determine when ‘normal’ fatigue ends and 
when an important and problematic clinical disorder begins. In es- 
sence, this relates to chronicity and severity, although, as an indicator 
of the extent of difficulty, no less than nine sets of diagnostic criteria 
have been developed in the last 30 years. These have helpfully been 
summarized in a recent systematic review [43]. The ‘Oxford criteria 
[44], used in the largest and most controversial treatment trial to 
date [45], are summarized in Box 131.4, and the most recent criteria 
[46] for ‘systemic exertion intolerance disease’ (SEID) in Box 131.5. 
This new term is intended to convey the undoubted impact of the 
disorder and to reduce what the authors called the ‘stigmatization 
and trivialization’ resulting from the existing terminology. 

While the diagnostic criteria for CFS/SEID are relatively straight- 
forward, it is clear that the population of patients with CFS is a com- 
plex and heterogenous one, with multiple comorbidities. Muscle 
pain and joint pain are common, and there is considerable overlap 
between FM and CFS populations, with FM co-diagnosis indicating 
a more burdened CFS population. Other functional somatic syn- 
dromes are common, and diagnostic overlap has contributed to an 
energetic debate regarding the advantages of ‘lumping’ such syn- 
dromes together to emphasize commonalities in features and aeti- 
ology vs ‘splitting’ to emphasize differences [47]. Depression and 
anxiety are more common in CFS than in FM or IBS, but far from 
universal [48]. 


Epidemiology 


The prevalence of CFS is uncertain, in part due to significant dis- 
crepancy between point prevalence in the community determined 
by self-reporting of symptoms (3.5%) and clinical assessment (0.9%) 
[49]. An added complication is the decline in prevalence of CFS 
through time. Between 1990 and 2001, combined fatigue diagnoses 


Box 131.4 The Oxford criteria for chronic fatigue 
syndrome (CFS) 


e Fatigue is the main symptom. 
e Fatigue is severe and disabling, and affects physical and mental 
function. 

e Fatigue present at least 50% of the time for a period of at least 

6 months. 

e Other symptoms may occur, including muscle pain, mood problems 
including depressive and anxiety disorder), and sleep problems. 

e Not explained by a medical condition, for example severe anaemia. 
e Notin a person with schizophrenia, bipolar disorder, substance 
misuse, eating disorder, or organic brain disease. 





Source: data from J R Soc Med., 84(2), Sharpe MC, Archard LC, Banatvala JE, 
et al., A report - chronic fatigue syndrome: guidelines for research, pp. 118- 
121, Copyright (1991), SAGE Publications. 
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Box 131.5 The US criteria for systemic exertion intolerance 





disease (SEID) 


Requirement for all three of: 

1 Substantial impairment of occupational, educational, social, or per- 
sonal activities: 
e Present for >6 months; 
e With fatigue that is: new (that is, not lifelong); and 
e Not due to ongoing excessive exertion; and 
e Not substantially alleviated by rest 

2 Post-exertional malaise 

3 Unrefreshing sleep Plus at least one of: 

4 Cognitive impairment 

5 Orthostatic intolerance (for example, recurrent dizziness when 
standing from recumbent) 








Adapted from Institute of Medicine, Beyond Myalgic Encephalomyelitis/Chronic 
Fatigue Syndrome: Redefining an Illness, Copyright (2015), with permission from 
National Academies Press. 











[CFS, post-viral fatigue syndrome (PVFS), asthenia] fell by 44% [50]. 
Recent findings confirmed a continued fall between 2001 and 2013 
[51], with a modest decline in CFS incidence (3% annually), a more 
rapid decline in PVFS (8%), and a precipitate decline in asthenia 
(30%). Alongside this shift, there has been a rise in FM diagnosis, 
such that FM in the UK is now about three times more frequent than 
CFS [51]. This is confirmed by evidence from high-quality commu- 
nity surveys (0.8% CFS vs 2.7% FM) [2, 49]. The reasons for these 
trends are unclear; there is no good evidence for changes in diag- 
nostic labelling. Recent data from Norway indicated two age peaks 
for incident CFS (10-19 and 30-39 years) and a female:male ratio 
of >3:1 [52]. 


Aetiology 


There is no clearly established, widely agreed aetiological under- 
standing for CFS. This undoubtedly reflects the clinical heterogen- 
eity and aetiological complexity but also reflects the fundamental 
disagreements about the nature and causes of the illness. The aim 
here is to encourage the reader to engage with the relatively rapid 
advance in evidence relating to relevant aetiological factors and to 
form balanced opinions that are supported by evidence. 

There is no doubt that physical disorder, such as common viral 
illnesses, often acts as a trigger/precipitating factor for severe fa- 
tigue and, in some individuals, the development of CFS. What are 
in dispute are: (1) the extent to which psychological or social fac- 
tors predict the development of CFS after such illness (that is, are 
predisposing factors); and (2) the extent to which ongoing physical 
illness (such as chronic viral infection) contributes to persistent 
symptomatology (that is, is a maintaining factor). In a study of pa- 
tients with glandular fever, chronic fatigue was not predicted by the 
severity of the glandular fever, as measured by symptom count [53]. 
Predictors included personality factors, such as perfectionism, and 
also cognitive factors such as ascribing symptoms to their glandular 
fever, believing that the illness would last a long time and impact 
detrimentally on their life, and not believing that they could con- 
trol their illness. Emerging evidence suggests that CFS patients 
have specific cognitive biases which reinforce and maintain mal- 
adaptive cognitions and associated behaviours. These include an 
attentional bias for fatigue-related information and a tendency to 


interpret ambiguous information as implying bodily dysfunction 
[54]. As in FM, fear of movement and avoidance of physical activity 
are common in CFS and related to symptom severity and physical 
disability [11]. Fear-avoidance lies at the core of the cognitive be- 
havioural model of CFS, in which catastrophic interpretations of 
physical symptoms (and an unhelpful focus on symptoms) motivate 
behaviours, including activity avoidance and activity cycling. 

A fundamental divide in thinking about CFS is the role of rest. On 
one side, rest (or relative rest) is viewed as an essential component of 
treatment of CFS, to allow the body to recover from the illness and 
gain in strength to fight putative infection or other ongoing physical 
challenge to wellness. On the other side, rest is viewed as an essential 
part of the cause of CFS [55], by leading to deconditioning. The ad- 
verse effects of excessive rest on multiple physiological systems are 
well established in healthy humans [56] and in patients with chronic 
physical illness such as renal failure [57]. 

There has been an ongoing search for physical causes of chronic 
fatigue. There is no doubt that, in some patients, a clear physical 
cause can be found. Prudent history-taking, physical examination, 
and investigation are therefore needed in possible cases of CFS to 
rule out significant physical pathology. Specific red flag features have 
been identified such as: difficulty in focusing the eyes, inflamma- 
tory arthritis or connective tissue disease, cardiorespiratory disease, 
weight loss, sleep apnoea, and lymphadenopathy [58]. However, 
little appears to be gained by an aggressive search for a physical 
disorder in the presence of normal physical examination and unre- 
markable screening investigations. In recent years, there was much 
early excitement when the retrovirus xenotropic murine leukaemia 
virus-related virus (XMRV) was found in the blood of most CFS pa- 
tients, only for new evidence to dash hopes of a cure based on this 
finding [59]. The meaning of abnormalities in brain regional con- 
nectivity in CFS patients, the extent of which appears correlated with 
fatigue severity [60], is uncertain. Demonstrable effects of exercise 
[61] and psychological treatment [62] on neuroinflammation offer 
encouragement that a more integrated understanding of the causes 
of CFS can be developed. 


Management 


The UK’s National Institute for Health and Care Excellence (NICE) 
guidance [58] recommends referral to specialist care within 3- 
4 months for adults with moderate symptoms, and immediately 
for patients with severe symptoms. Specialist care includes patient- 
centred rehabilitation programmes, which aim to improve physical 
and cognitive function and to manage symptoms. Patient educa- 
tion plays a key role and contributes to genuinely collaborative care, 
including shared decision-making. There is robust empirical sup- 
port for two treatments: graded exercise therapy (GET) and cognitive 
behavioural therapy (CBT), and this is considered in the following 
sections. The evidence for pacing is disputed. There is little evidence 
for the effectiveness of neuromodulators, including antidepressants, 
unless a depressive illness is present. The evidence for pharmaco- 
logical interventions has recently been reviewed, with no promising 
solutions evident [63]. 

Systematic reviews of RCTs support the use of GET [64] and 
CBT [65], but trials are small and there is a lack of comparative evi- 
dence with pacing. There is debate about the optimal specification of 
pacing in CFS, but in essence, it is an approach in which the patient 
functions within their perceived energy levels, rather than aiming to 
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systematically increase their level of functioning, as in GET or CBT. 
It is popular with patients, often advocated by patient groups, and 
it is unfortunate that better evidence does not exist. The PACE trial 
[66] was intended to provide a definitive answer to the important 
question about the relative merits of specialist medical care alone 
vs additional pacing, GET, or CBT. The main conclusions were that 
CBT and GET improved outcomes when added to specialist med- 
ical care, pacing did not, and adverse events seemed similar across 
the trial treatments. Overall, however, outcomes were modest and 
the proportion of recovered patients was low, perhaps reflecting 
clinical heterogeneity. Unfortunately, following the publication of 
short-term [45], adverse events [67], and long-term [68] results, 
fundamental and apparently bitter disagreements between the re- 
searchers and their critics have emerged, and their positions appear 
entrenched to a degree that is unique in medicine. The respective 
positions are well summarized in recent papers [69, 70]. The dif- 
ferences are clearly mirrored in the divergent recommendations of 
patient support organizations and medical organizations [71]. Those 
who are interested in uncertainty and the nature of scientific evi- 
dence, facts, and beliefs are strongly encouraged to read the cited 
documents. As the then deputy editor of the BMJ stated in 2011 [72]: 


“Why can’t CFS/ME be like other common chronic conditions where 
patients, carers, doctors, and researchers work together to pose re- 
search questions, gain understanding, and—in the absence of clear 
explanations and cures—at least find ways to respond to patients’ 
needs, help them live with and manage symptoms, and get more out 
of life?’ 
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Patients do not always present their symptoms in an honest manner. 
Their symptom reports may be affected by goals and motives that 
lie outside the realm of proper diagnosis and treatment of existing 
health problems. Distorted symptom presentations may occur in the 
forensic context where substantial incentives or secondary gain (for 
example, injury claims for compensation) are immanent, but also in 
clinical, as well as rehabilitative, settings. 

Patients sometimes strive for certain benefits without their ther- 
apists being aware of this (that is, patients follow a ‘hidden agenda’). 
Van Egmond, Kummeling, and van Balkom [1] asked psychiatric 
outpatients whether or not they expected to gain something from 
being a patient. A substantial minority (42%) said that this was in- 
deed the case. They admitted that their patient status may help them 
to get a new home, sickness benefits, or a resident permit. A hidden 
agenda may fuel feigning and treatment stagnation [2]. The link be- 
tween feigning and poor therapeutic success—including lack of co- 
operation, high dropout, and increased health care utilization—has 
now been well documented [3, 4] and should in itself be sufficient 
reason to consider feigning as a diagnostic option. 

Exaggerated symptom reports may be motivated by a need to pro- 
tect self-esteem. For example, Smith, Snyder, and Perkins [5] found 
that some individuals reported more symptoms in an evaluative vs 
non-evaluative situation, suggesting a self-protecting motive, ac- 
cording to the authors’ interpretation. Mild symptom exaggeration 
or denial will not substantially hamper diagnostic decision-making. 
More blatant forms of distortions, however, may compromise clin- 
ical decisions. Such illness deception is commonly labelled factitious 
disorder or malingering. Both are commonly used categorical de- 
nominators to describe forms of significantly distorted illness be- 
haviour, but phenomenologically they might not be distinguishable 
from each other at all. Indeed, some authors have argued that facti- 
tious disorder and malingering constitute one diagnostic entity [6]. 
We will return to this issue. 

According to widely accepted definitions, malingering is the 
deliberate invention or gross exaggeration of health problems 
(symptoms), motivated by external incentives (secondary gain). 
The Diagnostic and Statistical Manual of Mental Disorders, fourth 
(DSM-IV) and fifth editions (DSM-5) [7, 8] described this deliberate 





invention as ‘intentional production’ of false symptoms. However, 
Young [9] argued that the DSM definition should be modified by 
substituting production by presentation, highlighting different forms 
of how false or exaggerated symptoms can be demonstrated. Thus, 
Young defines malingering as ‘the intentional presentation with false 
or grossly exaggerated symptoms [physical, mental health, or both; 
full or partial; mild, moderate, or severe], for purposes of obtaining 
an external incentive ...’ (p. 180; italics in the original). 

The extant literature on malingering includes different mani- 
festations, ranging from pure fabrications of symptomatology over 
grossly exaggerated symptoms and perseverations (continued asser- 
tion of symptoms that have been present in the past) to false imput- 
ations (that is, intentionally attributing genuine symptoms to a false 
source) [10]. The wide range of distorted symptom reports that are 
brought under the definition of malingering partly explains largely 
different views on its prevalence. There is no consensus on where to 
draw the line between mild and gross exaggeration, and how to treat 
the intermediate zone. 

In and of itself, malingering is not a mental disorder, but rather a 
behavioural strategy that healthy or sick people might use in certain 
situations (for example, after a motor vehicle accident, in a situation 
of social misery). This strategy may be appropriate or not in a spe- 
cific situation; it may be judged morally wrong or right (and differ- 
ently so by different people or from different angles), and it may be 
highly adaptive or severely maladaptive. 

In contrast, factitious behaviour is commonly perceived to 
be a mental disorder and described as such by the International 
Classification of Diseases (ICD) [11] and the DSM [7, 8]. Exactly like 
malingering, it describes the deliberate false presentation (inven- 
tion, production, gross exaggeration) of health problems. Yet, the 
underlying motive for factitious symptom presentation is thought to 
be internal (that is, primary gain), rather than external, as is the case 
with malingering. This internal motivation is commonly assumed 
to be unconscious and related to the sick role (patient care, medical 
procedures, diagnosis and treatment of claimed health problems). 
The Munchausen’s syndrome (named after the German Freiherr von 
Münchhausen, dubbed the Baron of Lies), in which patients actively 
try to mimic symptoms (for example, by hurting themselves), was 
first described by Asher [12] and continues to be considered as the 
most prototypical form of factitious behaviour. Factitious behaviour 
is closely tied to self-harm [13] (but for a discussion, see [14]). Thus, 
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Table 132.1 The differential diagnosis between three forms of non-authentic symptom presentation largely rests on subjective judgment 


about the intention and the motivation for symptom distortion 


Diagnostic catego Distorted symptom presentation Incentives for distorted symptom Examples of incentives 
8 gory ymp p! ymp P 
presentation 


Cognizant, self-reflected (conscious); 
external incentive, secondary gain 


Intentional, deliberate, controlled 
(conscious) 


Malingering 


Intentional, deliberate, controlled 
(conscious) 


Factitious disorder 


Somatoform, dissociative, and 
conversion disorders 


Unintentional, involuntary, uncontrolled 
(unconscious) 


Non-reflective, unaware (unconscious); 
internal incentive, primary gain 


Non-reflective, unaware (unconscious); 
internal incentive, primary gain; secondary 





Sick leave, financial compensation, 
escape from legal responsibility 


Medical treatment, including surgery, 
sick role 


Conflict management, stress reduction 


gain may also be present 








Reproduced from Merten T, False symptom claims and symptom validity assessment. In: Otgaar H, Howe M [eds]., Finding the truth in the courtroom? Problems with deception, lies, and 


memories, Copyright (2018), Oxford University Press. 


unlike malingering, factitious behaviour is maladaptive, which may 
be the reason why it is included in subsequent versions of the DSM 
as a disorder. 

However, motivational aspects are crucial for distinguishing be- 
tween malingering and factitious disorder. The presence of external 
rewards per se does not exclude factitious disorder, given that the 
potential for some form of secondary gain is usually considerable 
for patients in Western welfare states (such as sick leave, sick pay). 
The fact that secondary gain can be identified does not automatic- 
ally mean that it plays a reinforcing role in patients’ symptom pres- 
entation [15]. In many cases, it will be difficult to clarify the true 
and complete motivational background of false or grossly distorted 
illness presentations. This may become completely impossible when 
both external and internal factors are apparently present and interact 
in a complex way (for case examples, see [16]). All that might be said 
with some confidence about such complex cases is that symptoms 
are feigned. 

Feigning comprises both malingering and factitious disorder. It 
is defined as ‘the deliberate fabrication or gross exaggeration of psy- 
chological or physical symptoms without any assumptions about its 
goals’ ({17], p. 6). However, before an expert can ascertain feigning 
in a given case, he or she has to take another decision that is con- 
ceptually as complex and problematic as the clarification of the 
underlying motivation—whether or not false symptoms are pre- 
sented intentionally, voluntarily, consciously. 


Feigning and genuine mental disorder 





Striking discrepancies between subjective symptom report and 
objective signs are not limited to feigned health problems. In fact, 
they constitute the key feature of what historically has been called 
hysteria. In 1917, Jones and Llewellyn [18] wrote: ‘Nothing, it may 
be said, resembles malingering more than hysteria; nothing hysteria 
more than malingering. In both alike we are confronted with the 
same discrepancy—between fact and statement, between objective 
sign and subjective symptom ...’ (p. 117, italics in the original). 
Phenomenologically, patient behaviour in the context of factitious 
disorder, malingering, and somatoform or dissociative disorders 
may be completely indistinguishable. To address this problem, 
DSM-IV [7] stipulated as a necessary criterion for diagnosing both 
somatization disorders and conversion disorder: “The symptoms are 
not intentionally produced or feigned (as in Factitious Disorder or 


Malingering)’ (p. 462). This requirement was given up in DSM-5 for 
Somatic symptom and related disorders. Still, the dimension of in- 
tentionality delineates a distinction between self-deceit (in genuine 
mental disorder) and other-deceit (in feigning), as described by 
Turner [19]. 

Intentionality (or not) and the nature of incentives are the two 
dimensions with which various forms of discrepant symptom pres- 
entations can be differentiated. The distinction between the three 
diagnostic entities listed in Table 132.1 critically hinges on the as- 
sumption that both intentionality and true motivation can be re- 
liably and, much more importantly, validly ascertained by a third 
party (that is, the clinician or forensic expert) [20]. Many authors 
have criticized this assumption [21]. Moreover, in many cases, it ap- 
pears doubtful whether intentionality and motivation can be reliably 
dichotomized at all. Diagnostic options such as Feigning, unspeci- 
fied or Feigning, not otherwise specified [22], as well as previous at- 
tempts to rename somatoform disorders into medically unexplained 
symptoms [23], are reflective of this nosological inconclusiveness. 
However, the traditional approaches of DSM and ICD stick to the 
categorical system and have largely ignored attempts to introduce a 
multi-faceted and dimensional approach [24-26]. 

Incidentally, Browns [27] definition of medically unexplained 
symptoms as ‘a heterogeneous group of conditions characterized 
by persistent physical symptoms that cannot be explained by med- 
ical illness or injury’ (p. 769) would encompass somatoform, dis- 
sociative, factitious, and malingered symptom presentations, plus 
symptoms in the context of a yet unknown physical disease. A more 
detailed discussion of the conceptual problems of distinguishing 
between somatoform and feigned disorders and its practical conse- 
quences can be found in Merten and Merckelbach [28]. 

In the descriptions below, we largely ignore the conceptual conun- 
drum surrounding factitious disorder and malingering and depict 
their ideal-typical manifestations as they are treated in the literature. 


Definition 

Malingering is the intentional false presentation (invention, gross 
exaggeration, or otherwise gross misrepresentation) of symptoms 
or health problems with the primary goal of attaining an external 
reward. External gain may be a financial incentive (for example, 
compensation), but it may also relate to obtaining drugs, escaping 
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from formal duty (for example, sick leave, military service, sitting 
an exam), legal responsibility, or punishment, or obtaining other 
material or non-material advantage. The specifier external indicates 
that the motivation for false illness presentation does not primarily 
relate to stress management, internal (interpersonal) conflict, or 
the sick role. Malingering is not a mental illness and should not be 
pathologized. Still, pathologization of people who, in a certain con- 
text, engage in malingering is not uncommon [29], as indicated by 
phrases such as healthy people don’t do this; cry for help; you need to 
be mad in order to fake madness. 

In DSM-5, the term malingering has been deleted from the index 
and is difficult to find in the text. It is treated under the headline 
Nonadherence to medical treatment, with the code V65.2 (pp. 726- 
7). DSM-5 lists four criteria that were adapted from DSM-III [30], 
even though these criteria have repeatedly been shown to be com- 
pletely inadequate [24, 31, 32]. According to DSM-5, any combin- 
ation of the following four criteria would be strongly suggestive of 
malingering: (1) medicolegal context of presentation; (2) marked 
discrepancy between symptom presentation and objective find- 
ings; (3) low degree of co-operation in diagnosis and treatment; and 
(4) presence of an antisocial personality disorder. 

To illustrate the inappropriateness of these criteria, it may suffice 
to note that any person with a diagnosed antisocial personality dis- 
order who is examined in a medicolegal context should be ‘strongly 
suspected’ of malingering according to these criteria (when, in fact, 
the link between antisocial features and malingering is, at best, weak 
[33]). Also, any patient with a somatic symptom disorder or a con- 
version disorder should be ‘strongly suspected’ of malingering in 
any medicolegal context, given the fact that both disorders are char- 
acterized by a marked discrepancy between subjective symptom 
presentation and objective findings and observations. 

The DSM criteria for malingering have virtually not changed since 
1980. They are based on an outdated (criminological) conception of 
malingering prevalent in the 1970s (with antisocial personality con- 
sidered to be a key issue in feigned health problems [34]), and they 
ignore the vast empirical and conceptual work with several thou- 
sands of publications written during the last three decades [24]. 

In ICD-10, malingering is coded as Z76.5, without further elab- 
oration. Interestingly enough, ICD-10 contains another category of 
exaggerated or excessively prolonged symptoms that was conceptu- 
alized as Elaboration of physical symptoms for psychological reasons 
(F68.0). It corresponds roughly to the older concept of compensa- 
tion neurosis (for a more recent appraisal, see [35]). However, for 
unknown reasons, it is explicitly limited to physical symptoms and 
is poorly defined. Cooper’s pocket guide [11] stressed the psycho- 
logical causation of it (including the aspiration of obtaining financial 
compensation). Contrary to malingering, F68.0 is conceived to be a 
mental disorder. 


Assessment 


Different medical disciplines have developed their own strategies 
and lists of criteria for the detection of malingered symptomatology. 
Among traditional medical disciplines, ophthalmology has shown 
a particularly strong interest in differentiating true visual deficits 
from feigned impairment [36]. The determination of malingering 
is usually based on an analysis of plausibility and consistency within 
the available information or data on a case, with general criteria fo- 
cusing on discrepancies, such as the following [31, 37]: 


e Excessive, overgeneralized symptom report. 

e Bizarre symptom presentation. 

e Inconsistency in symptom presentation. 

e Absence of functional impairment outside the diagnostic setting, 
unimpaired functional level in the private sphere. 

e Disproportionate claimed disability, as compared to the objective 
severity of illness or injury. 

e Implausibility of symptoms and complaints with regard to current 
medical/psychological knowledge (symptoms do not make sense), 
symptom presentation contradicts the functional-anatomic 
principles. 

e Contradictions between claimed history and documented facts. 

e No improvement under treatment with known efficiency. 

e Evasiveness. 


For different disciplines (for example, neurology, ophthalmology, 
neuropsychology), lists of special malingering markers have been 
developed, related to the functional system in question. In psych- 
iatry, such lists are currently en vogue with regard to judging the 
authenticity of claimed symptoms of post-traumatic stress disorder 
PTSD). Such red flags may include early and spontaneous patient's 
symptom report, the quality of reported flashbacks, the presence or 
not of observable emotional arousal in the patient, the quality and 
extent of reported amnesia for aspects of the traumatic event, con- 
tents and the frequency of reported nightmares, temporal stability 
or instability of claimed symptoms, the presence or not of survivor 
guilt, or blame of others instead of self-blame [38, 39]. 

The use of checklists for malingering is hampered by a number 
of difficulties. Firstly, they ignore the conceptual problems. Often, 
the criteria used for determining malingering equally apply to 
factitious disorder, making them criteria for feigning, rather 
than malingering. Also, similar or identical criteria are used for 
identifying somatoform, conversion, and dissociative disorders 
[40]. As a consequence, they may rather be called criteria for non- 
genuine symptom presentation. 

Secondly, the criteria on checklists often have been established in- 
tuitively, on the basis of what may be called clinical lore, but nobody 
knows about their reliability and validity or their accuracy of clas- 
sification (sensitivity, specificity). Usually, they are poorly described 
and handled in an intuitive, individualized way. They have not been 
operationalized (that is, there are no rules that describe how to apply 
and score them). Inter-rater reliability must be expected to be a 
major problem. 

Thirdly, there is no systematic knowledge about decision rules 
(which and how many criteria should be checked in which context, 
how many of them must be positive for a positive classification, etc.). 
A strict application of such criteria may result in an inappropriately 
low threshold for malingering [41]. 

For the last two decades, the bulk of research in the domain of 
malingering has been done by neuropsychologists and published in 
neuropsychological journals [42]. The reason for neuropsycholo- 
gists’ prominence is their reliance on test data. Psychological test 
results are highly dependent upon the individual's willingness to co- 
operate. With the development of what was then called symptom 
validity tests (SVTs), malingering assessment entered a new stage 
and became a primarily data-driven and evidence-based method- 
ology. A vast number of instruments for practical use were devel- 
oped, comprising cognitive SVTs (today mostly called performance 


validity tests), self-report validity measures (questionnaire ap- 
proaches), and clinical interviews and ratings. It is beyond the 
scope of this chapter to give an overview over these psychological 
test measures; the reader can easily find summary reports in about 
a dozen of books available in English language [9, 43, 44]. In the 
past, SVTs were often referred to as malingering tests, but this is 
misleading because malingering is just one possible reason why in- 
dividuals may fail on such tests. 

Under certain circumstances, some SVTs (those that operate on 
a forced-choice format) are able to identify response patterns that 
are indicative of intentional, voluntary response distortions [45]. In 
such cases, test results are paramount to the confession of feigning. 
Accordingly, they occupy a special place in the most noteworthy 
attempt to formalize diagnostic reasoning about malingering—the 
criteria proposed by Slick, Sherman, and Iverson [46]. While their 
original criteria were related to malingered neurocognitive dysfunc- 
tions, they were later extended to other conditions, in particular to 
pain disorder [47]. 


Prevalence 


Available base rate estimates of malingering differ immensely. In 
clinical contexts, the base rate of malingered symptom presenta- 
tions is usually expected to be very low. Consequently, the possi- 
bility of intentional response distortions is often dismissed at the 
outset, conforming to the traditional role concept of mental health 
providers. To illustrate this point, Reuber et al. [48] went as far as 
maintaining that: “Fortunately, it is rarely necessary for a clinician 
to determine whether symptoms are intentional’ (p. 308, emphasis 
added). In other settings, malingering appears to be rather common. 
For the United States military, Morel [49] maintained that: “No clin- 
ical opinion is more problematic, more weighty, or more pressing 
for a solution in military psychiatric medicine than differentiating 
malingering from post-traumatic stress disorder (PTSD)’ (preface). 

Potential factors responsible for the wide range of base rate esti- 
mates of malingering are the following: 


e The exact definition or underlying concept of the phenomenon 
that is being evaluated (for example, feigning, malingering, 
symptom exaggeration, negative response bias, suboptimal effort, 
effort test failure, overreporting, underperformance, symptom 
invalidity). 


The referral context of the examination (for example, forensic, 
clinical, rehabilitative). 


Within forensic contexts, the legal issue at stake (for example, di- 
minished criminal responsibility, liability, workers’ compensation, 
disability, custody). 

e The diagnostic group (diagnosed or claimed condition, for ex- 
ample mild traumatic brain injury, moderate or severe traumatic 
brain injury, PTSD, whiplash injury, chronic toxic encephalop- 
athy, attention-deficit disorder). 

e The method used for detecting feigning (for example, unspecified 

clinical impression, expert rating using specified criteria, rating or 

questionnaire methods, performance validity tests, application of 

the Slick criteria [46]). 

The standardized measures or SVTs if any were employed (for 

example, for questionnaire methods, MMPI-2 Fake Bad Scale, 

F scale, Response Bias Scale, combination of different scales, 
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Structured Inventory of Malingered Symptomatology, Assessment 
of Depression Inventory). 


When SVTs were used, the diagnostic decision rule (for example, 
based on a score in a single instrument or scores on a whole set of 
instruments; the number of SVTs administered and the number 
of positive single indicators necessary to classify the response pat- 
tern as feigned). 


The classification accuracy (sensitivity, specificity) of the specific 
methods used for diagnostic decision-making. 


Which cut scores were used for the single instruments (for a 
number of SVTs, different cut scores are available and some cut 
scores underwent modification or refinement in the course of 
time, after first publication). 


Some estimates resort to the criterion of below-chance response 
patterns in performance validity tests [45, 46]. This is a very con- 
servative criterion with a very high specificity, but low sensitivity 
in most real-world referral contexts. Consequently, such estimates 
necessarily result in a considerable underestimation of the true 
prevalence. In a similar vein, reliance on obsolete screening instru- 
ments (for example, the Fifteen-Item Test [50]) or embedded meas- 
ures with known low sensitivity will yield grossly deflated estimates. 
Conversely, the employment of larger batteries of validity measures, 
and particularly of so-called embedded performance validity indi- 
cators, with a low threshold of diagnosing negative response bias will 
yield gross overestimations. Rogers et al. [41] pointed out that the in- 
fluential model of Malingered Neurocognitive Dysfunction (MND) 
[46] defines a low threshold, in particular for the determination of 
possible malingering. As a consequence, prevalence estimates based 
on criteria suggestive of possible MND (and, to a much lower de- 
gree, probable MND) will inevitably produce grossly overestimated 
base rates. 

With all these factors in mind, existing prevalence estimates 
must be understood as very rough approximations for the true 
base rates to be expected in different contexts. The most frequently 
cited prevalence data stem from a survey by Mittenberg et al. [51]. 
Probable malingering or symptom exaggeration was judged by 131 
participating neuropsychologists to be present in almost a third of 
all litigating patients (in personal injury, disability, and workers’ 
compensation contexts). Larrabee [52] identified mild trau- 
matic brain injury as one of the most delicate conditions when it 
comes to malingering, with a prevalence of about 40% of false or 
grossly distorted symptom presentations (see a more recent study 
by Peck et al. [53] also reporting a 40% base rate in their sample). 
For some referral contexts and/or claimed conditions, base rate es- 
timates reaching or exceeding the 50% mark have been published 
such as: litigating patients after whiplash injury [54]; patients with 
reported hearing loss, in independent medical examinations [55]; 
criminal defendants [56]; Social Security disability claimants in the 
United States [57, 58]; students with claimed attention-deficit dis- 
orders, but without learning disability [59]; British personal injury 
litigation patients claiming memory impairment [60]; or United 
States veterans claiming mild traumatic brain injury [61]. Adding to 
these studies, Spanish researchers [62] performed a survey among 
161 Spanish medical doctors who were asked to estimate the per- 
centage of malingering patients with different health conditions. 
Mean estimates of 50% or higher were reported for patients with 
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claimed whiplash injury, fibromyalgia, chronic cervicalgia, depres- 
sion, and anxiety disorders. 

Young [63] performed an extensive literature search on recently 
published prevalence estimates and also came to the conclusion that 
‘jt is difficult to arrive at one percentage or range of percentages that 
are definite about the proportion of malingering found in forensic 
disability and related examinations’ (p. 196). He argued that 15% is 
the most appropriate estimate at the time being, with a large vari- 
ation of another 15% in both directions, depending on the referral 
background, the condition in question, and other factors. 


Factitious disorder 


Definition 

Factitious disorder is considered to be a mental disorder charac- 
terized by a non-authentic, feigned symptom presentation, either 
in the form of a voluntary production of symptoms (for example, 
self-inflicted injury, artificial fever, self-induced wound healing dis- 
turbance) or false, misleading, or grossly exaggerated symptom re- 
port. DSM-5 [8] describes this condition in the chapter on Somatic 
symptom and related disorders. The key criterion of factitious dis- 
order (300.19) is described as a falsification of physical or mental 
signs or symptoms (instead of false presentation), and it extends to 
the induction of injury or disease. This behaviour must be deceptive 
and deception has to be identified. Furthermore, deceptive illness 
presentation is assumed to be evident, ‘even in the absence of ob- 
vious external rewards’ (p. 324). As with malingering, these diag- 
nostic criteria are judged to be of little clinical validity [64]. Neither 
do they foster diagnostic reasoning nor do they help to arrive at a 
reliable diagnosis or assist in differential diagnosis. 

In ICD-10, factitious disorder is coded as F68.1 under Intentional 
production or feigning of symptoms or disabilities, either physical or 
psychological, despite the fact that this description comprises both 
malingering and factitious disorder. Cooper’s pocket guide [11] gives 
a very short description stressing that false symptom presentation 
which may include self-harm occurs ‘for no obvious reason, with an 
assumed internal motivation of adopting the sick role. Included are 
the older concepts of the hospital hopper syndrome, Munchauser’s 
syndrome, and the peregrinating patient. Asher [12], who originated 
the concept of Munchausen’s syndrome, commented: “These patients 
often seem to gain nothing except the discomfiture of unnecessary 
investigations or operations’ (p. 339). Munchausen’s syndrome is es- 
timated to represent about 5-10% of all factitious cases [65]. 

Different types of factitious disorder have been described, first 
the abdominal, the haemorrhagic, and the neurological type by 
Asher [12] (laparatomophilia migrans, haemorrhagica histrionica, 
neurologica diabolica), later types like dermatitis autogenica, 
hyperpyrexia figmentatica, or cardiopathia phantastica. Also, patho- 
logical lying (pseudologia phantastica) is being discussed in the con- 
text of Munchausen’s syndrome [66]. More recently, manifestations 
of factitious illness behaviour in Internet communities, such as 
cancer support groups, have been described [67]. 


Prevalence and assessment 


The incidence of factitious disorder in patient groups is estimated to 
be well below 1%, but using a more liberal criterion, Catalina et al. 


[68] found in their sample of psychiatric inpatients that 8% exhib- 
ited factitious symptoms. These authors further noted that in this 
group, women were overrepresented, the mean age was below 40, 
and symptoms became more intense when patients faced discharge. 
Fliege et al. [69] surveyed senior hospital consultants and phys- 
icians in private practices. They estimated the 1-year prevalence of 
factitious disorder at 1.3%. Relatively higher rates were reported by 
dermatologists and neurologists. Another study reported a preva- 
lence of 1.8% [70]. In contrast, Wallach [71] estimated the base rate 
of factitious disorders at 5% of all physician encounters. With these 
estimates in mind, factitious disorder should not be considered a 
rare condition, having in mind that in any large general hospital in 
the Western world, several patients with factitious disorder should 
be expected to be present at any time—mostly going undiagnosed 
for this very condition. 

The diversity of prevalence estimates is partly due to a lack of 
diagnostic guidelines or assessment procedures. With a few ex- 
ceptions [72], researchers have employed little effort to develop 
screening methods for factitious behaviour. In most cases, facti- 
tious disorder must be expected to be left undetected. Suspicion 
is raised in patients with an atypical illness course and marked in- 
consistencies. Behavioural manifestations may include gratuitous, 
self-aggrandizing lying, non-compliance with medical prescrip- 
tions, disruptive actions, or excessive request for invasive medical 
procedures [64]. 

The ample literature on factitious behaviour consists, to a large 
extent, of case studies. One recent exception is a study by Lawlor and 
Kirakowski [73] who analysed text communications of two online 
communities of people who said they engaged in factitious behav- 
iour. One remarkable result was that a large majority reflected upon 
their motives for playing the sick role, which suggests that factitious 
behaviour is more consciously motivated than assumed by some 
workers in the field. 


Feigning by proxy 


Factitious disorder by proxy has long been discussed as a major 
problem in childcare, but it may also be present where caregivers 
exert pressure on persons with intellectual disability, dementia, or 
mental disease or directly impose symptoms on them. It includes 
such extreme forms as non-accidental poisoning or non-accidental 
suffocation [74]. Typical Munchausen presentations would then be 
called Munchausen by proxy (or Munchausen’s syndrome by proxy). 
Moreover, factitious illness behaviour is sometimes related to ani- 
mals, in particular pets [75]. 

In DSM-5, such behaviour is coded as Factitious disorder imposed 
on another, with the code 300.19. The diagnosis is given to the per- 
petrator, not to the victim. 

Little attention has so far been paid to cases in which external gain 
(such as monetary benefit) is clearly sought by parents or caregivers 
who exert pressure on the patient (a child or a cared person) to in- 
vent or exaggerated symptoms. With a clearly identified secondary 
gain as the central motive for exerting this pressure, such behaviour 
is called malingering by proxy [76]. In cases where motivational ana- 
lysis can favour neither primary nor secondary gain as being cen- 
tral for the grossly distorted symptom presentation, this abnormal 
illness behaviour may be called feigning by proxy. 


Summary and conclusions 


Factitious disorder and malingering are two forms of abnormal 
health care-seeking behaviour [64]. Their common denominator is 
deception, a conscious intention to deceive medical personnel (or 
other people), or what Kozlowska [77] called a ‘strategic use of illness 
behaviours in relationships with other individuals’ (p. 1368). In the 
case of malingering, external incentives (such as monetary compen- 
sation or sick leave) prevail and health care can be considered to be 
only the means through which these external goals are expected to 
be reached. In factitious disorder, health care and the patient role 
are thought to central. However, both forms of deceptive behaviour 
can often not be reliably differentiated from each other and from 
what has previously been called hysteria. The core feature of all three 
categories—malingering, factitious disorder, and hysteria (or un- 
explained medical symptoms)—is the presence of marked incon- 
sistencies, implausibilities, and contradictions within and between 
different sources of information. While the clinical manifestation 
may be indistinguishable between these forms of distorted symptom 
presentation, differential diagnosis requires determinations about 
both the motivation and the degree of consciousness about the false- 
ness or gross exaggeration of symptoms. However, both dimensions 
are, in fact, neither categorical nor stable in time nor can they re- 
liably be rated by a third party. This imposes significant limits to 
the validity of the clinical entities used for the description of non- 
genuine, feigned symptom presentations. Consequently, the utility 
of maintaining the distinction between malingering and factitious 
disorder has been questioned, and the apprehension of factious 
disorder as a mental disorder in its own right has likewise been 
questioned. Both malingering and factitious disorder can be com- 
prehended as expressions of interpersonal behaviour; “... they have 
no strategic or communicative function except when someone else 
can observe and respond to the behaviour [77] (p. 1368). 

In a historical perspective, factitious disorder may be conceived 
of as an attempt of the medical profession ‘to acknowledge the de- 
ception while pathologizing it. The doctor could make a medical, 
not legal, ‘diagnosis, keeping their medical hat on, and allowing the 
doctor-patient relationship to remain a therapeutic one’ [78] (p. 76). 

While factitious disorder is mostly discussed in the context of 
patient care, most studies on malingering centre on forensic or 
medicolegal contexts. There may be a bias in this dichotomy. Be 
that as it may, the bottomline of most prevalence studies is that sig- 
nificant response distortions, substantial symptom exaggeration, 
and feigning are to be expected in a sizable proportion of patients in 
medicolegal contexts. Whatever the exact numbers are, the problem 
of feigning is prevalent enough to proactively test it in any patient 
with a medicolegal background. Ruling out feigning should also be 
mandatory practice in clinical patients with a current or foresee- 
able litigation background, instead of automatically assuming that 
symptom presentation is genuine under all circumstances (which 
seems to be the common clinical stance today). 
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Functional neurological symptom disorder, also called conversion 
disorder, refers to symptoms of motor and sensory dysfunction 
which can be positively identified as incompatible or incongruent 
with neurological disease. Common symptoms include blackouts 
(that may look like epileptic seizures or syncope), limb weakness, 
movement disorder such as tremor, dystonia, or jerks, and sensory 
disturbance, including visual loss. In this chapter, we trace the his- 
tory of the disorder, especially as it relates to psychiatry, and describe 
the terminology, classification, epidemiology, clinical features, and 
treatment. 









































The term ‘hysteria has been used since the time of Hippocrates 
(around 400 Bc) to describe symptoms that were considered to 
occur only in women, such as choking, and to be caused by a ‘wan- 
dering womb exerting its influence throughout the body. 

From the early seventeenth century onwards, physicians, such 
as Lepois (1618), Willis (1667), and Sydenham (1682), located 
the origin of these ‘hysterical’ symptoms in the brain and nervous 
system, rather than the uterus, not only because of changes in neuro- 
anatomical thinking, but also because of a recognition that the same 
symptoms also occurred in men [1]. 

In the nineteenth century, Briquet (1859), Charcot (1880s), and 
Janet (1890-1920s) pursued a more systematic approach and fo- 
cused on symptoms that we would now regard as neurological such 
as paralysis, numbness, seizures, and blindness. Pierre Janet used the 
clinical study of hysteria to advance his views on ‘dissociation’ as a 
way of understanding how a person could lack the ability to control 
their own voluntary movements. 

In 1895, Breuer and Freud’s book Studies on Hysteria described 
five case studies in which hypnosis was used to find ‘reminiscences’ 
thought to be relevant to the onset of hysterical symptoms. These 
initial cases formed the basis not only of psychoanalytic theory, but 



































also of the conversion hypothesis. The conversion hypothesis pro- 
posed that symptoms, such as paralysis, were created unconsciously 
by conversion of intrapsychic conflict. The main benefit or ‘primary 
gain’ was considered to be the reduction in the psychological conflict 
the person experienced. Additional benefits of ‘secondary gain were 
considered to occur if the symptom provided an escape from the ex- 
ternal sources of the conflict, for example by leading to the inability 
to carry out an act. 

Freud’s ideas, despite being largely unfalsifiable, had a profound 
effect on clinical practice over the subsequent century, not least 
in the adoption of the term conversion disorder in the Diagnostic 
and Statistical Manual of Mental Disorders (DSM) of the American 
Psychiatric Association—the only psychoanalytic diagnosis to re- 
main in the manual to the present day. The effect on practice was 
that neurologists were able to confidently dismiss the problem as 
psychiatric and psychiatrists felt an obligation to attempt therapy to 
make the proposed conflict conscious [2]. 

Consequently, patients have often ended up falling between neur- 
ology and psychiatry, often with neither being sure of what the 
correct treatment is. To add to the uncertainty, Eliot Slater’s influ- 
ential, but flawed, study on ‘hysteria, published in 1965, suggested 
incorrectly that many such patients had misdiagnosed neurological 
disease [3]. 

There has been progress and in the last 20 years, it has been es- 
tablished that patients with neurological symptoms unexplained by 
an identifiable pathophysiological process are, in fact, common (the 
second most common reason to see a neurologist). It has also be- 
come clear that they can be reliably diagnosed using positive clin- 
ical features identified predominantly on physical assessment (and 
not only as a diagnosis of exclusion). Studies have also suggested 
that the symptoms have an inconsistent relationship to trauma and 
stress (that is, some have it, but some do not). In summary, after 
a long period of confusion and ignorance, we are starting to see 
progress. We are hopefully now entering a more optimistic era of 
clinical science which builds on the notion that a brain-and-mind 
approach is required to understand its aetiology and to deliver ef- 
fective treatment. 
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Terminology and classification 


Terminology can be difficult for both doctors and patients. Broadly, 
the current terms can be categorized as follows: 


e Psychogenic/somatization/somatoform/conversion—implying 
that the symptoms are essentially physical manifestation of a psy- 
chological problem. 

e Non-organic/non-epileptic/medically unexplained—defining the 
problem as not being due to disease. 

e Functional—suggesting a disorder of bodily or nervous system 
functioning without aetiological presumption. 

e Dissociative—suggesting a disconnection between nervous 
system component functions. 


‘These terms each have advantages and disadvantages. We will use 
the term ‘functional neurological disorder’ here for the scientific and 
pragmatic reasons outlined in this chapter. 

The classification of functional neurological disorder has been 
through several iterations in DSM. In DSM-I, it was Conversion 
Reaction (DSM-I), in DSM-II Hysterical Neurosis—Conversion 
Type, and in DSM-III and IV Conversion Disorder [4]. In DSM- 
IV, conversion symptoms were also listed under somatization dis- 
order, a relatively tightly defined long-standing vulnerability to 
‘somatoform’ symptoms starting before the age of 30, involving 
symptoms which were neurological (21 symptoms), pain (24 symp- 
toms), gastrointestinal (=2 symptoms), and sexual (21 symptoms). 
In the latest iteration of DSM (DSM-5), the disorder is now called 
Conversion Disorder (Functional Neurological Symptom Disorder) 
(300.11). 

It should be noted that wilfully manufactured or grossly exagger- 
ated symptoms which are deliberately feigned to obtain medical care 
(factitious disorder, which is considered a mental disorder) or ma- 
terial gain (malingering, which is simply a behaviour, not a mental 
disorder) are classified separately. 

There have also been important changes in the diagnostic criteria 
in DSM-5 [5] (Table 133.1). 

Firstly, it has been recognized that it is not simply a diagnosis of 
exclusion—patients with this diagnosis should have specific findings 
on neurological examination such as Hoover's sign of leg weakness 
or a positive tremor entrainment test (see Examination, p. 1352). 
Importantly, this allows the diagnosis to be made even in patients 
with a coexisting neurological disease, as long as that condition is 
not a better explanation for the symptom. For example, a patient 


may have both epilepsy and dissociative (non-epileptic) attacks/ 
seizures, or multiple sclerosis and functional limb weakness. 

Secondly, there is a recognition that stressors are not always iden- 
tifiable [6]. Sometimes this is because they only emerge later, but 
sometimes this may simply be because they are not present. 

Lastly, the specific requirement to positively exclude feigning has 
been removed. While it is rarely possible to obtain positive evidence 
of feigning, it is arguably impossible to positively exclude it. 

The World Health Organization (WHO)’s International 
Classification of Diseases, tenth revision (ICD-10) has, to date, pre- 
ferred the term ‘Dissociative Disorder’, “Dissociative (Conversion) 
Seizures/ Paralysis, etc. Moreover, as with all ICD diagnoses, there 
are no operationalized diagnostic criteria. 


Epidemiology 


Most of the data on functional neurological symptom disorder 
(FNSD) are from clinical settings, rather than the general popu- 
lation. It has been diagnosed from early childhood to the ninth 
decade but is rare before the age of 10. Patients with dissociative 
(non-epileptic) seizures have an average age of onset in the twen- 
ties, with a second late onset group, often in relation to the devel- 
opment of physical disease. Patients with functional movement 
disorders have a mean age of presentation in the late thirties. Most 
presentations of FNSD occur more commonly in females, with a 
ratio of 2-3:1. 

Patients with FNSD are commonly encountered in neurological 
or general medical settings. In a study of 3781 new neurology out- 
patients in Scotland, 16% had a primary diagnosis of a functional 
disorder and 6% a primary diagnosis compatible with FNSD. 
Patients with functional limb weakness are probably as common 
as patients with multiple sclerosis, with an incidence of around 5/ 
100,000 [7]. Patients with dissociative (non-epileptic) seizures have 
a similar incidence of around 5/100,000 [8]. Up to 50% of patients 
presenting to accident and emergency with apparent epileptic status 
may have dissociative (non-epileptic) seizures. 

Studies of FNSD have consistently found severity of disability 
similar to that of patients with equivalent disease [7, 9, 10], and 
usually a higher rate of psychiatric disorder [11]. Health care costs 
are also high, especially with respect to repeated investigations and 
referral [12]. 

Epidemiological factors related to aetiology and mechanisms are 
discussed further later in this chapter. 


Table 133.1 A summary of the main changes in DSM-5 in relation to functional neurological symptom disorder (conversion disorder) 


Motor or sensory symptom .. 


.. causing distress or difficulty for the patient v y 
Positive physical signs of internal inconsistency X v 
or incongruity with recognized disease 
Patient must have an identifiable psychological v x 
stressor 
Intentional feigning must be excluded v x 


Includes limb weakness, movement disorder, gait, blackouts, vision, speech, and 
hearing symptoms. Not pain, fatigue, dizziness, or memory symptoms 


Not a diagnosis of exclusion. Emphasizes how these disorders should be 
diagnosed 


Often not present. Therefore, many patients without stressors were previously 
rejected by psychiatrists 


This applies to all psychiatric disorders. It is impossible to exclude feigning 
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Clinical features 


History 


Patients with FNSD commonly have many symptoms, in addition 
to motor and sensory symptoms. These may include fatigue, pain, 
poor memory and concentration, sleep disturbance, and symptoms 
of other functional disorders such as irritable bowel syndrome or 
chronic widespread pain/fibromyalgia. It is therapeutically and 
diagnostically helpful to make a list of all the patient’s symptoms at 
the beginning of the consultation. 

Dissociative symptoms, such as derealization (‘feeling discon- 
nected/unreal/cut off/floating’) and depersonalization (‘my leg 
doesn't feel part of me’), are also common in FNSD. Patients with 
acute-onset functional symptoms (both motor symptoms and 
blackouts) often describe dissociation at the moment the symptoms 
began. Establishing this sequence can be particularly helpful when 
explaining the mechanism of symptom production to the patient or, 
in the case of dissociative (non-epileptic) seizures, targeting the pro- 
dromal symptoms during therapy (as for panic disorder). 

Enquiry about the patient's beliefs about their illness is important. 
A patient sent from a neurologist with a very firm belief they have 
‘chronic Lyme disease’ or ‘multiple sclerosis’ is going to require a 
different consultation to someone who believes they have a func- 
tional disorder. When asking about beliefs, however, it is generally 
best to avoid getting into an unproductive debate about whether 
the symptoms are neurological or psychological (since they are best 
considered as both). It can be more helpful to establish whether the 
patient sees their symptoms as potentially reversible or as a result of 
irreversible damage (the analogy here is software vs hardware fault), 
as the former is less of an obstacle to rehabilitation. It can also be 
helpful to ask the patient’s views about what treatments they think 
would help them and which would not, before discussing options. 

Patients with FNSD may be reticent about reporting psycho- 
logical symptoms. This may be, in part, because they fear being told 
that all their symptoms are imagined. Therefore, rather than ask 
direct questions about depression or anxiety, it can be more helpful 
to begin the enquiry by asking about the emotional and psycho- 
logical consequences of having disabling physical symptoms. Panic 
disorder is also common, but often not recognized as such by the 
patient experiencing sudden chest pain or shortness of breath with 
dizziness. Equally, the psychiatrist must bear in mind that the pa- 
tient may have no obvious psychological symptoms. This does not 
necessarily mean that there is ‘no psychiatric disorder’ and that the 
patient cannot be helped; FNSD remains classified as a psychiatric 
disorder at the interface with neurological practice and the psych- 
iatrist has an important role, as described in this chapter. 

One psychological symptom, so-called ‘belle indifference’, requires 
a particular comment. This refers to a cheerful indifference to dis- 
ability and has historically been viewed as a typical clinical feature 
of FNSD. However, modern studies suggest that while this symptom 
does occur, it has no discriminating diagnostic value, as it also occurs 
in patients with organic brain disease, especially those with frontal 
lobe involvement [13]. When apparently present, in our experi- 
ence, it most commonly reflects an attempt to ‘put on a brave face’ 
Persistent ‘belle indifference’ may, in fact, suggest factitious disorder, 
a condition that can only really be diagnosed in a neurological con- 
text by finding evidence of marked discrepancy between reported 


and observed function (for example, walking normally when they 
said that would be impossible) or significant deception. 


Examination 


The assessment must address the wide range of possible neuro- 
logical diagnoses that may present with the observed neurological 
symptoms. FNSD is not a diagnosis of exclusion; rather it depends 
on finding positive evidence of clinical signs, which are strongly 
suggestive of functional disorder (Table 133.2). The diagnosis can 
therefore also be made in the presence of neurological disease. 

The physical signs are: (1) either to aim to find out if the symptom 
briefly resolves when attention is directed to another body part (for 
example, Hoover’s sign of functional leg weakness or tremor en- 
trainment test) (Fig. 133.1); or (2) are characteristic of a functional 
disorder in the correct context (for example, the new onset of a 
clenched fist or an inverted ankle in an adult, or a sudden episode of 
motionless unresponsiveness and amnesia with the eyes closed for 
longer than 2 minutes). Awareness of how these physical signs work 
puts the psychiatrist in a much better position to explain the illness 
to the patient. Like all physical signs, none is completely reliable on 
its own, although published studies have suggested they have good 
sensitivity and specificity. 

Cognitive symptoms are common in association with other pres- 
entations of FNSD, although they are not included in the DSM-5 
definition [14]. Typical positive features of functional cognitive 
symptoms include inconsistency between reported symptoms and 
ability to follow dramas on television or perform a job, difficulty 
with easy cognitive tasks, compared to difficult ones, ability to an- 
swer multiple-component questions, and offering of elaboration and 
detail regarding memory lapses. Someone with more severe func- 
tional or feigned cognition difficulties may perform badly on tests 
of effort in which patients should score by chance or at least above 
patients with severe dementia. Rarely, questions such as ‘how many 
legs has a horse got?’ may be met with an ‘approximate’ answer such 
as ‘five’-—the so-called Ganser syndrome that often suggests facti- 
tious disorder or malingering. 

Another distinct type of cognitive presentation is a pure retro- 
grade or dissociative amnesia. This presents as a ‘soap opera style’ 
memory loss in which a portion of retrograde memory is missing, 
sometimes for years, but with normal anterograde memory. It 
can sometimes be associated with wandering geographical loca- 
tion when it is called fugue. Typically, there is loss of information 
about personal identity, which is rare in other amnestic disorders, 
although other causes such as encephalitis or epileptic amnesia 
should be considered. 

Dizziness can also be acommon functional symptom but again 
is not listed in the DSM5 criteria. Typically, the patient with 
chronic functional dizziness, now called ‘persistent perceptual 
postural dizziness, reports a continuous sense of movement, often 
a ‘swaying’ sensation, worse on standing and walking, which can 
be mixed with dissociative symptoms. This usually arises after a 
vestibular trigger, such as labyrinthitis or benign paroxysmal pos- 
itional vertigo, and becomes persistent through a complex process 
of abnormally focused attention and alteration of normal postural 
control [15]. Health anxiety and a daily routine dominated by the 
experience of dizziness can respond well to treatment with cog- 
nitive behavioural therapy and/or treatment with antidepressant 
medication. 
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Table 133.2 Examples of positive signs in functional neurological symptom disorder 


Functional limb weakness 
oover’ sign [44] (Fig. 133.1) 























ip abductor sign [44] 





Other clear evidence of inconsistency 
Global pattern of weakness 


Dragging gait 
rotated 


Functional movement disorder/gait 


Tremor entrainment test [45] 





Typical ‘functional’ hemifacial overactivity [46] 
(Fig. 133.2) 


Fixed dystonic posture [47] (Fig. 133.3) 


unctional hemiparesis 


Distraction during standing [48] 


improve significantly 


Dissociative (non-epileptic) attacks [49] 





Prolonged attack of motionless unresponsiveness 
Long duration 

Closed eyes 

Ictal weeping 

Memory of being in a generalized seizure 


Ictal hyperventilation 
attack 


Presence of an attack resembling epilepsy with a 


normal EEG evidence 
Visual symptoms [50] 


Fogging test 





ip extension weakness that returns to normal, with contralateral hip flexion against resistance 

ip abduction weakness that returns to normal, with contralateral hip abduction against resistance 
For example, weakness of ankle-plantar flexion on the bed, but patient able to walk on tiptoes 
Weakness that is global, affecting extensors and flexors equally 


A gait in which the forefoot remains in contact with the ground, typically with the hip externally or internally 


Patient with a unilateral tremor is asked to copy a rhythmical movement with their unaffected limb. The tremor 
in the affected hand ‘entrains’ to the rhythm of the unaffected hand, stops completely, or the patient is unable to 
copy the simple rhythmical movement 


Orbicularis oculis or oris over-contraction, especially when accompanied by jaw deviation and/or ipsilateral 


Atypical fixed dystonic posture, characteristically of the hand (with flexion of the fingers, wrist, and/or elbow) or 
ankle (with plantar flexion and dorsiflexion) 


Patient with an apparently positive Romberg’s test is asked either to guess numbers written on their back or 
o Carry out a complex motor task (for example, with a phone). In a functional gait problem, their balance will 





Paroxysmal motionlessness and unresponsiveness, with amnesia lasting longer than 2 minutes 
Attacks lasting longer than 2 minutes (but be careful of misleading witness histories) 

Closed eyes during an attack, especially if there is resistance to eye opening 

Crying either during or immediately after the attack 

Ability to recall the experience of being in a generalized shaking attack 


During a generalized epileptic seizure, respiration ceases, but it commonly speeds up during a non-epileptic 


Normal EEG does not exclude frontal lobe epilepsy or deep foci of epilepsy but does provide supportive 


Vision in the unaffected eye is progressively ‘fogged’ using lenses of increasing dioptres, while reading an acuity 


chart. A patient who still has good acuity at the end of the test must be seeing out of their affected eye 


Tubular visual field (Fig. 133.4) 
Speech 


Highly inconsistent speech, new-onset stuttering 


in an adult or telegraphic speech [51] functional 





sevier Masson SAS. 


Investigations 


In the presence of typical clinical features, the role of investigation 
is primarily to seek comorbid neurological disease, rather than to 
refute the diagnosis. Patients with arm or leg weakness will typically 
require structural imaging with magnetic resonance imaging (MRI) 
or computed tomography (CT). The diagnosis of dissociative seiz- 
ures can be greatly aided by recording a normal electrocardiogram 
(ECG) and electroencephalogram (EEG) during an attack (but note 
that some forms of epilepsy cannot be seen on surface EEG). It is 
the positive recording of an event with typical features of dissocia- 
tive (non-epileptic) seizures that matters, rather than merely normal 
investigations. Smartphone footage from family and friends can be 
especially helpful. It should be remembered that investigations com- 
monly reveal incidental abnormalities in the brain and spine. Such 


A patient is found to have a field defect which has the same width at 1 m as it does at 2 m 
Patients with neurological diseases also can develop stuttering. Persistent foreign accent syndrome is often 


Adapted from Neurophysiol Clin., 44(4), Stone J, Functional neurological disorders: The neurological assessment as treatment, pp. 363-73, Copyright (2014), with permission from 
E 


abnormalities are found in around one in six on MRI brain scans. 
Disc prolapses occur in asymptomatic individuals at a frequency 
similar to the ‘age of the patient plus 10’ (for example, around 50% of 
40-year olds have some disc prolapse) [16]. 


Psychiatric examination 


A skilled neurologist should be able to make a positive diagnosis of 
FNSD. A psychiatric assessment has an important role in ensuring 
a complete history has been taken and addressing the psychiatric 
differential diagnosis. 

It may reveal important life stressors and factors inhibiting re- 
covery (such as an insurance claim). But it should be remembered 
that it is not necessary to make such a finding (or to hypothesize it) 
to make the diagnosis. 
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J P 
yi 


Test hip extension - it's weak 


Fig. 133.1 Hoover's sign of functional leg weakness. 





Test contralateral hip flexion against 


resistance-hip extension has become strong 


Reproduced from Pract Neurol., 9(3), Stone J, Functional symptoms in neurology, pp. 179-89, Copyright (2009), with permission from British Medical Journal. 


The assessment also provides the foundation for treatment and 
is an opportunity to further develop a shared and positive under- 
standing of the disorder with the patient and carer. 


Differential diagnosis 





The differential diagnosis of FNSD encompasses many neurological 
conditions and several psychiatric ones. It should usually be diag- 
nosed by a neurologist or a specialist familiar with neurological 
diagnosis and a psychiatrist, working either together or sequentially. 

A common pitfall in diagnosis is failure to consider the pres- 
ence of a comorbid neurological disease, especially symptoms in 
the prodrome of a degenerative disease such as Parkinson's disease. 





Fig. 133.2 Functional facial overactivity can look like facial weakness— 
typically with platysma overactivity, jaw deviation, and/or contraction of 
the orbicularis oculis. 


Reproduced from Pract Neurol., 13(2), Stone J, Reuber M, Carson A, Functional 
symptoms in neurology: mimics and chameleons, pp. 104-13, Copyright (2013), 
with permission from British Medical Journal. 


Misdiagnoses of neurological disease as FNSD appears especially 
likely in relation to an isolated gait disorder with normal examin- 
ation on the bed (for example, stiff person syndrome and spastic gait 
from cervical myelopathy can present like this); frontal lobe epilepsy 
(which can look odd and present sometimes with a normal EEG); 
autoimmune encephalitis and other neurological disorders where 
there may be prominent psychiatric symptoms [17]. 

The psychiatric differential is anxiety and especially panic dis- 
order, depression, obsessive-compulsive disorder, and post- 
traumatic stress disorder (PTSD). Symptoms of dissociation occur 
as part of both panic and PTSD. These disorders may be the primary 
diagnosis and the patient may recover with treatment for them; 
more often, they are comorbidities that may still require treatment 
in their own right. 

Finally, it is important to consider the possibility that the patient is 
wilfully manufacturing or exaggerating their symptoms. When this 
is done for medical care only, it is factitious disorder, classified as 
a psychiatric disorder which often coexists with a personality dis- 
order. When symptoms are manufactured for material gain only, 
then it is malingering, and not a psychiatric disorder. Wilful exag- 
geration can only be detected reliably by the presence of a marked 
discrepancy between reported and observed function (note that this 
is not the same as variability in symptoms) or by evidence of lying. 
Most experts seeing patients with FNSD agree that clinically sig- 
nificant wilful exaggeration is relatively rare outside legal scenarios. 
These disorders are discussed further in Chapter 132. 





Aetiology and mechanism 


The ‘conversion hypothesis in which symptoms of FNSD are always 
presumed to be a physical manifestation of the psychological effect 
of a recent stressor has been superseded by a more complex model. 
This explains the considerable heterogeneity between patients and 
suggests a mechanism of symptom production (Table 133.3). Many 
biological, psychological, and social factors appear to be relevant, 
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Fig. 133.3 Functional dystonia with characteristic ankle inversion and 
plantar flexion. 


Reproduced from Continuum (Minneap Minn)., 21(3), Stone J, Carson A, Functional 
Neurologic Disorders, pp. 818-37, Copyright (2015), with permission from American 
Academy of Neurology. 


including childhood sexual, physical, and emotional abuse and neg- 
lect as predisposing factors and life events as precipitating factors. 
Adverse experiences like this can promote a vulnerability to dis- 
sociation and heighten the experience of threat produced by minor 
pathophysiological or physiological stimuli. Reviews of the relative 
magnitude of these factors in individual patients have demonstrated 
that although more frequent than in those with neurological dis- 
ease and in healthy controls, they are not universal [6]. For example, 


while a detailed study of life events in FNSD suggested an increase in 
frequency of events just prior to symptom onset, especially of the ‘es- 
cape’ type that Freud suggested, not all patients were found to have 
experienced severe or adverse events of any type [18]. 

The ‘conversion model of aetiology has also been unsatisfactory in 
explaining why someone should develop a weak arm or blackouts in 
relation to stress. The solution proposed was that the symptom was 
‘symbolic of the difficulty’ or represented an inability to function. 
Systematic studies of patients with FNSD have suggested a more 
prosaic explanation for particular symptoms—that they are simply 
continuations of acute symptoms that arose for many reasons. For ex- 
ample, patients with dissociative (non-epileptic) seizures commonly 
experience transient dissociative or panic symptoms just before 
their blackouts [19]. Studies have supported a model in which the 
‘blackout’ occurs as a response to this arousal which then becomes 
habitual, perhaps through classical conditioning [11]. More recently, 
evidence has accumulated to support the idea that physical injury or 
physiological states, such as migraine, commonly occur at the onset 
of these symptoms, which then become perpetuated via persistent 
abnormal attention [20, 21]. Cognitive neuropsychological models 
with their basis in a concept of the brain as a ‘predictive’ organ help 
bridge the gap between neuroscience and psychological models of 
this disorder and support new ways of delivering treatment [22]. 

Functional brain imaging of patients with FNSD has found dif- 
ferences between patients and volunteers pretending to have similar 
symptoms. In addition, functional imaging of patients in experi- 
mental hypnotic states has revealed changes similar to those seen 
in patients with FNSD, with hypoactivation of cortical and sub- 
cortical motor areas, without activation of prefrontal areas often 
involved in motor inhibition. Limbic areas, such as the amygdala, 
are also commonly overactive in comparison with controls [23]. In 
addition, some studies have suggested hypoactivation of a network 
identified with self-agency, including the temporo-parietal junc- 
tion, in keeping with the notion that the movements do involve the 





Examination at 
50 cm 


Examination at 
150 cm 


Fig. 133.4 In patients with functional visual loss, bedside testing may reveal a tubular field defect, which disobeys the laws of physics. 
Reproduced from J Neurol Neurosurg Psychiatry, 76(Suppl 1), Stone J, Carson A, Sharpe M, Functional symptoms and signs in neurology: assessment and diagnosis, i2-12, 


Copyright (2005), with permission from British Medical Journal. 
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Table 133.3 A range of potential aetiological factors in patients with functional neurological symptom disorder 





Factors acting at 


e Pathophysiological disease 
all stages 


e History of previous functional symptoms 


Predisposing e Genetic factors affecting personality 


e Biological vulnerabilities in nervous system? 


Precipitating e Abnormal physiological event or state (for 
example, hyperventilation, sleep deprivation, 
sleep paralysis 


e Physical illness/injury/pain 





Perpetuating e Plasticity in central nervous system motor and @ 
sensory (including pain) pathways 

e Deconditioning/‘habit’ 

e Neuroendocrine and immunological 
abnormalities similar to those seen in 


self-help 








Adapted from Pract Neurol., 9(3), Stone J, Functional symptoms in neurology, pp. 


voluntary motor system, but the person genuinely does not have the 
sense that they are ‘doing it’ [24]. 

Neurophysiological studies have also revealed potentially 
intriguing differences in evoked responses between patients with 
FNSD and controls and also in a parameter known as sensory at- 
tenuation, which measures the person's ability to predict their own 
actions [25]. Interestingly, FNSD may be a paradigmatic disorder of 
willed action which has escaped scrutiny because of its clinical simi- 
larities to ‘willed action of pretending. 


Prognosis and misdiagnosis 





The prognosis is highly variable. Symptom duration may be as brief 
as hours or as long as years. Symptoms may be mild or severely 
disabling. In general, the outlook for patients with dissociative 
(non-epileptic) seizures is better than for patients with functional 
movement disorders. A systematic review of 24 outcome studies of 
functional motor disorder found that 39% of patients were the same 
or worse after a mean of 7.5 years [26]. In a review of 25 studies 
of patients with dissociative seizures, the total weighted remission 
rate was around 33% (that is, about one-third of patients’ seizures 
stopped). While patients may still have some seizures, social and 
occupational functioning may improve [27]. Positive prognostic 
factors noted in these studies included young age (especially child- 
hood), early diagnosis, positive reaction to the diagnosis, and short 
duration of illness. Negative prognostic factors reported are pres- 
ence of personality disorder, somatization disorder, or ongoing 
litigation. Patients with poor prognostic factors can, however, some- 
times respond well to treatment, and vice versa [26]. 

We referred earlier to the well-known follow-up study of Slater in 
1965 which suggested that as many as two-thirds of patients with a 
diagnosis of hysteria actually had misdiagnosed neurological dis- 
ease. Although methodologically flawed, mainly by selection bias, 
this study convinced at least two generations of psychiatrists that they 


© Emotional disorder 
e Personality disorder 


e Perception of childhood experience as adverse 
e Personality traits 
e Poor attachment/coping style 


IIness beliefs (patient and family) 
Perception of symptoms as being due to 
disease/damage/outwith the scope of 


ot feeling believed 
depression and anxiety © Avoidance of symptom provocation 


e Socio-economic deprivation 
e Life events and difficulties 


e Childhood neglect/abuse 
e Poor family functioning 
© Symptom modelling 


© Perception of life event as negative, 
unexpected 
e Acute dissociative episode/panic attack 


© Presence of a welfare system 

e Social benefits of being ill 

e Availability of legal compensation 

© Stigma of ‘mental illness’ in society and 
from the medical profession 

© Ongoing medical investigations and 
uncertainty 


79-89, Copyright (2009), with permission from British Medical Journal. 


need not worry about this disorder; it was just an example of neur- 
ologists getting it wrong. However, a systematic review of 27 studies 
published since Slater’s found a misdiagnosis rate of <5% since 1970 
(interestingly, this was long before MRI scans and videotelemetry 
were in routine use) (Fig. 133.5). This rate of misdiagnosis is similar 
to that for other neurological and psychiatric disorders. Subsequent 
large prospective studies have also been reassuring [28], as have 
follow-up studies of other somatoform disorders [29]. 


The treatment of FNSD should usually be multi-disciplinary. Its 
foundation is a plausible and agreed explanation of the diagnosis, 
followed by effective triage to the correct therapy (and in some cases 
recognizing that the patient is unlikely to benefit from therapy). 

The traditional sequence of events has been for a neurologist to 
tell a patient there is ‘nothing wrong’ and then to refer to a psych- 
iatrist who is expected to detect the psychological stressor and then 
treat the patient. It is hardly surprising that this process usually re- 
sults in frustration for both patients and clinicians. A combined and 
positive approach to diagnosis is preferable. 


Explaining the diagnosis 


It is crucial that someone, and preferably every member of the clin- 
ical team, can deliver a coherent and consistent explanation of how 
the diagnosis has been made and how the symptoms can be im- 
proved with therapy. 

Although there is much debate about whether the patient should 
be told that their illness is ‘psychogenic’ or ‘functional’ or ‘conver- 
sion, it seems likely that there are other more important ingredients 
of a useful explanation. These include: (1) taking the problem ser- 
iously; (2) giving the problem a positive diagnostic label (rather than 
just telling the patient what they do not have); (3) demonstrating to 
the patient the rationale for the diagnosis (showing the patient their 
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Fig. 133.5 Frequency of misdiagnosis of functional neurological 
symptoms disorder/conversion disorder/hysteria in 27 studies and 1466 
patients, with a mean duration of 5 years. 


Reproduced from J Neurol Neurosurg Psychiatry, 76(Suppl 1), Stone J, Carson A, 
Sharpe M, Functional symptoms and signs in neurology: assessment and diagnosis, 
i2-12, Copyright (2005), with permission from British Medical Journal. 


positive Hoover’s sign or how their tremor stops briefly during the 
entrainment test, or going through the typical features of a dissocia- 
tive seizure); (4) emphasizing the potential for reversibility, for ex- 
ample using metaphors such as software vs hardware; (5) providing 
written information, for example, such as that found at https://www. 
neurosymptoms.org or http://www.nonepilepticattacks.info; and 
(6) triaging to an appropriate treatment. 

Our personal preference for a diagnostic label is to use the terms 
‘functional’ or ‘dissociative. For us, these are terms most consistent 
with the current state of our knowledge in that they allow for a bio- 
psychosocial model in which there can be a discussion of abnormal 
brain function, as well as a discussion of how symptoms may per- 
sist because of psychological ‘patterns’ or habits. In addition, there is 
the critical implication that brain function (as opposed to structural 
damage) can be improved by many interventions, including medica- 
tion, activity, and psychological treatment. 


Physiotherapy 


Current evidence suggests that physiotherapy should be a first-line 
treatment for patients with functional movement disorders and limb 
weakness, as long as the patient has some confidence in the diagnosis 
and is motivated to change. In recent years, the nature of physio- 
therapy for functional motor disorders has become more clearly 
defined and is being tested in randomized trials. Physiotherapy tech- 
niques are based on an understanding of how symptoms in FNSD 
can be dependent on abnormally focused attention and tend to 
improve with distraction. There has been a realization that generic 
physiotherapy, as one would use for stroke, may, in fact, be coun- 
terproductive. Consensus recommendations have been published 
for the various motor symptoms with which patients present [30]. 
A case series and randomized controlled trial found evidence of ef- 
ficacy for this approach in patients selected for their understanding 
and motivation. Despite a symptom duration of 5.8 years, 72% hada 
good outcome with specific therapy, compared to only 18% who had 
physiotherapy of a non-specific nature [31]. Another trial found sus- 
tained benefit from 3 weeks of inpatient rehabilitation [32]. Further 
studies of inpatient treatment are also encouraging [33, 34]. 


Psychological and psychotropic drug treatment 


Psychiatric assessment and formulation have a key role to play in 
the management of many patients with FNSD. A bio-psychosocial 
formulation can put together the factors seen in Table 133.3 and 
present them back to the patient as an explanation of what may have 
led to the development of their symptoms and what may now need 
to be addressed to achieve recovery. 

Comorbid and psychiatric disorders, such as panic, anxiety, and 
depression, often contribute to FNSD and may need to be targeted in 
treatment, whether pharmacological or psychological. 

A tailored cognitive behavioural therapy (CBT)—an informed 
guided self-help programme—was found to be better than treat- 
ment as usual for patients with a range of functional neurological 
symptoms in a randomized trial [35]. Another specific form of cog- 
nitive behavioural psychological therapy has been developed specif- 
ically for dissociative (non-epileptic) seizures [36, 37]. In a pilot trial 
of this CBT-based approach, developed from therapy for panic dis- 
order, the number needed to treat for seizure remission was 5 [36]. 
This therapy involves: education about the disorder and setting of 
goals, teaching specific distraction techniques for episodes, learning 
to make connections between emotions and seizures, using graded 
exposure to change behaviour restricted by the events, and looking 
more widely, if necessary, at psychological antecedents or perpetu- 
ating factors. A large multi-centre trial is currently under way [38]. 
There is some evidence for a role of psychodynamic psychotherapy 
in other functional somatic disorders [39], but little specifically for 
FNSD [40]. 


Other treatments 


Other treatments used since the nineteenth century may still have 
a place. Hypnosis may have some efficacy, especially in those keen 
to have it [41]. Electrical stimulation of the arms and legs has been 
used for centuries to treat patients with functional disorders but 
lost popularity after the First World War. More recent studies of 
transcranial magnetic stimulation have suggested it may have a role, 
especially in demonstrating reversibility of symptoms, but trial evi- 
dence is lacking [42]. Finally, abreaction and therapeutic sedation, 
which were used as long ago as the First World War, may still have 
a role for patients with fixed dystonia, mutism, or deficits, where 
transient improvement cannot be demonstrated using bedside man- 
oeuvres [43]. There is, however, no specific evidence for the use of 
psychotropic drugs in FNSD, although they may be required to treat 
comorbid anxiety or depression. 


Stopping treatment 


While the prognosis for established FNSD is poor without treat- 
ment, we may have been too pessimistic about the response to treat- 
ment. However, not all patients can be successfully treated and it is 
at times useful to share this conclusion with the patient, where ap- 
propriate, emphasizing that this is something that can be revisited. 


Conclusions 


Functional disorders are common in neurological practice and are 
one of the most common reasons for referral to psychiatrists working 
in that setting. Significant advances over recent years have improved 
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our understanding of both aetiology and treatment. The modern 
clinical assessment emphasizes the primacy of diagnosis based on 
typical characteristics of the physical symptoms themselves. Joint 
neurological, psychiatric, and multi-disciplinary teamworking is the 
cornerstone of management for most patients. 
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In policy terms, mental health problems are among the most com- 
plex of all health issues, characterized by a number of pervasive chal- 
lenges. Seen as a group, prevalence rates are high and onset is often 
relatively early in life. Most mental health problems are chronic, 
often with life-long negative impacts. Aetiology represents a me- 
lange of genetic and environmental influences. People who have 
mental health problems will often experience stigma, discrimin- 
ation, and victimization—opportunities available to other people 
are often closed off. Psychological well-being—to be separated from 
psychiatric symptoms—is often very poor for people experiencing 
mental illness. 

There are close links between mental illness, self-harm, and sui- 
cide. There are also strong links between some mental health prob- 
lems, antisocial behaviour, and crime. Consequently, most countries 
have legal powers of compulsory detention and treatment to pro- 
tect individuals from themselves or to protect other people. Mental 
health problems may have other negative collateral effects, including 
on siblings, parents, peers in schools, and work colleagues, and per- 
haps also in local communities. 

Responses to mental health problems are many and various, and 
often lead to high public sector and privately borne costs that may 
extend over many years. Young people with mental health prob- 
lems are at above-average risk of failing to achieve their educational 
potential. In adulthood, there are often considerable employment- 
related difficulties—long spells of unemployment, high rates of ab- 
senteeism, and under-par performance when at work—which can 
lead to substantial productivity losses for the economy and income 
losses for the individual and family. 

It is the combination of these often very challenging characteris- 
tics and experiences—for individuals, families, and communities— 
which require careful, multi-dimensional, far-sighted policy 
responses. Mental health policy-making that concertedly aims to 
address these challenges will necessarily be complex. The focus of 
this chapter is such policy-making. I will briefly consider the main 
drivers of mental health problems (in Section 2) as a platform for 
identifying the domains where policy action might be needed to 



































prevent or ameliorate illness and the distress associated with it. I will 
then turn to the main consequences of mental health problems, 
emphasizing how widely spread those consequences will often be 
(Section 3). Societal responses to those consequences will be dis- 
cussed. The focus of Section 4 is on the main dimensions of a good 
mental health policy, providing some illustrations. A brief con- 
cluding section pulls together the main arguments. 



































There are many factors that can precipitate, prolong, or reignite mental 
health problems, representing a set of complex interactions between 
genes and the environment; although some mental illnesses have high 
rates of heritability, there are very strong environmental effects. Age and 
gender are relevant factors in understanding the incidence and preva- 
lence, and demographic trends over time can affect the patterns of 
mental illness and the nature of the policy challenge. Traumatic experi- 
ences, lifestyle choices, and other health problems (acute or chronic) 
are important, and economic conditions can play a major part. Other 
chapters in this book have discussed many of these ‘drivers’ in more de- 
tail, and here I briefly summarize some of them as a way to identify in- 
dividual, family, and societal consequences, which, if recognized, will 
then often be prompts for policy action. 

Three-quarters of mental health problems (except dementia) have 
emerged before age 24, and perhaps as many as half by age 14 [1]. 
Psychoses most commonly emerge in late adolescence and early 
adulthood, which is precisely the age when most individuals would 
be making key decisions that will shape the rest of their lives: fin- 
ishing their education, moving away from the parental home, taking 
their first steps into the world of work, and having their first serious 
relationships. A mental health problem that is not adequately treated 
can have life-long personal and economic consequences. Later in 
life, Alzheimer’s disease and other dementias are highly prevalent; 
as populations age—as they are doing rapidly across the world— 
overall numbers of people with these disorders will grow too and 
will clearly need concerted policy attention, especially given that 
there is currently no cure. 
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The influences of traumatic experiences on mental health are often 
pernicious and persistent—experiencing war, military combat, ter- 
rorism, rape, or sexual abuse can bring on a range of mental health 
problems, including post-traumatic stress and depression. Bullying 
victimization in childhood is known to be still affecting someone's 
mental and physical health many decades later [2, 3], with sizeable 
impacts on mental health care systems. 

Numerous lifestyle choices are known to affect mental health. For 
example, excessive cannabis use in young people is a recognized risk 
factor for developing schizophrenia [4]. Being breast-fed, on the 
other hand, can be a protective factor—not being breast-fed for very 
long independently predicts poor mental health during childhood 
and early adolescence, even after adjusting for other factors [5]. 

There are well-known links between other health problems and 
mental illness. People with long-term conditions, such as diabetes, 
cancer, and coronary heart disease, have above-average risks of psy- 
chiatric morbidity, which often affect how their physical health de- 
velops and is managed, with consequences for quality of life and 
costs [6, 7]. For example, it has been shown that the health care costs 
of treating 11 different chronic health problems are significantly 
higher when a patient has comorbid depression, while there can 
also be considerable impacts on workplace performance and absen- 
teeism from work [8]. Individuals with neurodevelopmental condi- 
tions can also be at high risk; the prevalence of anxiety is as high as 
42% among autistic children [9]. 

Economic conditions in a country and the economic status of an 
individual can affect mental health, and in fact, there are multiple 
and two-way links between economic factors and mental illness. 
‘Social causation’ arguments suggest that core aspects of economic 
disadvantage, such as poverty and long-term unemployment, in- 
crease the risk of mental illness by initiating or exacerbating financial 
stress (particularly unmanageable debt), social exclusion, stigma, 
and poor diet, and also by undermining protective factors such as 
social capital, education, and personal resilience. An alternative ex- 
planation is the ‘social selection’ or ‘drift’ argument that people with 
mental health problems have higher risks of remaining or falling 
into poverty because of the (privately borne) costs of their treatment, 
disrupted employment, and reduced earnings [10]. A general eco- 
nomic recession, such as the recent global financial crisis, can have 
profound effects on employment, job security, productivity, earn- 
ings, and social cohesion, with knock-on effects on mental illness, as 
shown most immediately by suicide rates [11]. 


Consequences and societal responses 


Most mental health problems are complicated and distressing. They 
can cause temporary incapacity, abject misery, self-loathing, and 
personal shame, and lead to public stigma and discrimination in 
many different settings. They can prompt violent behaviour, self- 
harm, and suicidal ideation, causing societies to impose restrictions 
on individual liberty because of assumed or confirmed incapacity 
or dangerousness. They are associated with poor health behav- 
iours and premature mortality. But there may occasionally also be 
some positive aspects; a mental illness may give an individual new 
insights, help them to embrace change, or energize their creativity. 
Overwhelmingly, however, the negatives outweigh any positives. 


This wide-ranging mix of (largely negative) consequences also 
makes mental illness look ‘economically expensive’ in the sense that 
scarce resources must be devoted to respond to them, often over long 
periods of time. Asa result, decision-makers in health and other sys- 
tems must think carefully about how they use the resources under 
their control in order to meet needs and preferences associated with 
mental illness or—better still—how these illnesses can be prevented 
in the first place. 


Economic and social exclusion 


As already noted, people with mental health problems have 
above-average rates of economic difficulties, particularly linked to 
employment (especially finding and keeping a job, and chronic un- 
employment), rent and mortgage arrears, other unsecured debt, low 
earnings, low household income, poor social networks, loneliness, 
and social isolation [12]. 

An authoritative commentary on some of these issues from 
the Organisation for Economic Co-operation and Development 
(OECD) drew on national statistics and robust studies [13]. It illus- 
trated the multifarious links between mental health problems and 
work; people who have experienced mental health problems find it 
harder to acquire and retain paid employment, have higher rates of 
absenteeism and presenteeism (reduced productivity when at work 
because of symptoms), earn lower salaries, and are more likely to 
take early retirement [14]. The typical age of onset of serious mental 
health problems often prevents individuals from completing their 
education. For most people, employment is their primary source of 
income, and so long-term employment difficulties can propel people 
into unmanageable personal debt and poverty, which, in turn, can 
worsen mental health [15]. Employment is, of course, more than 
just a source of income—it influences social position and commu- 
nity roles, fosters social participation, and, for many people, is a key 
source of self-concept. 

At a national level, these employment difficulties damage prod- 
uctivity and economic growth. For example, a study in eight very dif- 
ferent countries found that the extent and costs of depression-related 
absenteeism and presenteeism in the workplace were considerable, 
both in absolute monetary terms and in relation to the proportion 
of the country’s gross domestic product (GDP). Absenteeism had a 
large impact across all countries, but the costs associated with pres- 
enteeism were 5-10 times higher [16]. 

Employment issues mean that many individuals with mental 
health problems are reliant on social security (welfare) payments. 
The OECD report cited earlier found that between one-third and 
one-half of new disability benefit claims in the countries they 
studied were made for reasons connected to mental illness. Benefits 
systems need to be designed so as to provide appropriate compensa- 
tion, but not to make people feel trapped—anxious, perhaps, about 
losing their benefit entitlements if they ‘take the risk of moving 
into a paid job that might prove stressful and reignite an otherwise 
stable condition. But benefits systems that leave people at risk of 
poverty could create a different kind of downward spiral—their 
economic circumstances could exacerbate their symptoms, which 
then make it harder to secure employment and lead to further social 
and economic exclusion. If an individual struggles to keep up their 
rent or mortgage payments, then homelessness might even be the 
result [17]. 


Crime 


If not treated effectively, some mental health problems can result 
in either antisocial or violent crime [18-20]. Drug and alcohol 
misuse might prompt acquisitive crime, as individuals seek to find 
the resources to purchase their substance of choice, and there are 
also associations between such misuse and violent crime. Conduct 
disorder—the most common childhood psychiatric disorder, with a 
UK prevalence of 4.9% for children aged 5-10 years [21]—is asso- 
ciated with antisocial behaviour (and substance misuse) in adoles- 
cence and criminal activity in young adulthood, with high costs for 
both criminal justice bodies and victims. Indeed, an 18-year follow- 
up study in London found that these crime-related costs dominated 
all other economic consequences of conduct disorder [22]. However, 
people with mental health problems are also at above-average risk of 
being victims of crime, particularly violent crime [23]. 


Self-harm and suicide 


According to the World Health Organization (WHO) [24], globally 
there are more than 800,000 suicides each year, each of them po- 
tentially preventable. Suicide and self-harm are major public health 
issues, and because there are links with mental health problems (es- 
pecially depression, eating disorders, and alcohol misuse), some 
governments have included suicide prevention strategies within 
their mental health policy frameworks. Generally, however, WHO 
argues that insufficient strategic attention is paid to suicide preven- 
tion. Another obstacle is the taboo surrounding suicide. Suicide 
is one of the priorities in the WHO Mental Health Gap Action 
Programme (mhGAP) launched in 2008 in low- and middle-income 
countries. In high-income countries, there is strong evidence that 
economic difficulties are associated with higher suicide risks. For 
example, the global financial crisis led to significantly higher suicide 
rates, particularly for men [25], and an earlier recession in South 
Korea widened income-related inequalities in depression and sui- 
cidal behaviour, especially in lower-income groups [26]. Some au- 
thors have further argued that government ‘austerity’ measures have 
exacerbated the situation [11]. 


Premature mortality 


It is not only suicide rates that are higher for people with mental 
health problems; premature mortality for other reasons is another 
major challenge [24]. This is particularly so for people with more 
severe (schizophrenia, bipolar disorder, and moderate to severe de- 
pression). Life expectancy for people with schizophrenia is 20 years 
lower than for the general population [27], while for people with 
depression, the risk of death is 1.8 times higher than for the gen- 
eral population. Cardiovascular disease and type 2 diabetes are 
quite prevalent among people with schizophrenia, and other phys- 
ical health problems commonly experienced by people with mental 
health problems that can shorten their lives include respiratory dis- 
eases, hypertension, and infectious diseases such as HIV, hepatitis, 
and tuberculosis. 

Many of the reasons for poor physical health and premature mor- 
bidity are not hard to identify, linked to lifestyle, poor health be- 
haviours, and, to some extent, the side effects of treatment. Tobacco 
use is high [28]. Obesity is also an issue, linked to poor diet [29], 
physical inactivity, and weight gain brought on by some medications 
(particularly some antipsychotics) [30]. Infectious diseases, such as 
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HIV and hepatitis, are more common among people with mental 
health problems, linked to socio-economic disadvantage and behav- 
iours such as intravenous substance abuse. The symptoms of mental 
illness might make it hard for individuals to seek medical treatment, 
and social isolation and homelessness will add to the barriers. Stigma 
and discrimination associated with severe mental illness might also 
result in poorer physical health care. Every one of these risk factors 
is, in principle, preventable and ought to be part of any mental health 
policy strategy. 


Transmitted impacts on the family and others 


The negative effects of mental health problems can be experienced 
by people other than the person with the illness. One obvious trans- 
mitted effect is the ‘burder’ of being a family carer; for example, sup- 
porting someone with moderate to severe dementia can be stressful, 
with a high risk of anxiety or depression [31], as well as requiring 
out-of-pocket payments for services and transport costs and missing 
opportunities for paid employment or social activities [32]. On the 
other hand, good family care for someone with dementia can greatly 
improve their well-being and delay nursing home admission [33]. 
Supporting someone with schizophrenia can be distressing and 
costly, and the effects on parents of a child with externalizing behav- 
ioural problems, such as conduct disorder, can be considerable [34]. 
Postnatal depression can affect the birth-child’s emotional, behav- 
ioural, and intellectual development, with attendant long-term eco- 
nomic consequences [35]. Attention-deficit/hyperactivity disorder 
can affect classmates at school, work colleagues, and members of the 
local community [36]. 


Stigma and discrimination 


Stigma and discrimination are deep-rooted, pervasive challenges. 
The consequences for individuals include social isolation, bullying 
in schools, discrimination in the workplace, active hostility or 
neglectful indifference in communities, and a consequent reluc- 
tance on the part of mentally ill individuals to seek the help they 
need [37]. Evidence from a seven-country European study in work- 
places showed that managers’ attitudes, employers’ policies on flex- 
ible working arrangements, and national benefits systems can have 
significant effects on individuals with lived experience of mental 
illness [38]. Negative attitudes can have serious knock-on effects 
on health, especially mental health, quality of life, access to oppor- 
tunities, services, and welfare entitlements [12]. Stigma may stem 
from lack of awareness but often reflects a more insidious suspicion 
and hostility. A mature mental health policy framework will include 
concerted anti-stigma efforts, as well as intervention to tackle dis- 
crimination and victimization. 


High and wide-ranging costs 


As is already clear, mental health problems generate high and wide- 
ranging costs, not only for the health care system, but also for public 
and other agencies responsible for welfare benefits, social care, edu- 
cation, employment, criminal justice, and other systems. There can 
be substantial productivity losses associated with mental illness; in- 
deed, the costs of unemployment, absenteeism, and presenteeism 
tend to dominate any service-related costs. For some disorders, such 
as dementia, the costs directly or indirectly borne by the family and 
other carers are, in aggregate, larger than the costs of health and 
social care services that people with dementia will use [39]. Many 
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childhood disorders will continue to generate substantial private 
and public costs many decades later, in terms of employment disrup- in Mental Health (UK Government, January 2014) 
tion [40] and earnings, criminal behaviour, and high use of health 
services [41]. 


Box 134.1 Closing the Gap: Priorities for Essential Change 





This policy document set out 25 priorities for change in how children 
and adults with mental health problems should be supported and 
cared for: 


Policy dimensions 1 High-quality mental health services with an emphasis on recovery 


should be commissioned in all areas, reflecting local need. 


What is a mental health policy? 2 We will lead an information revolution around mental health and 


well-being. 
According to the WHO, ‘A mental health policy is an organized set 3 We will, for the first time, establish clear waiting time limits for 
of values, principles and objectives for improving the mental health mental health services. 
and reducing the burden of mental disorders in a population. It de- 4 Weill tackle inequalities around access to mental health services. 


5 Over 900,000 people with benefit from psychological therapies 
every year. 
6 There will be improved access to psychological therapies for chil- 


fines a vision for the future and helps to establish a model of how 
action should be taker’ [42]. Typically, a policy document will range 
over a number of key domains, responding to the kinds of chal- 














dren and young people across the whole of England. 

lenge and negative consequences described in the previous section. 7 The most effective services will get the most funding. 
These could include: human rights and advocacy; financing (such 8 Adults will be given the right to make choices about the mental 
as taxation or insurance); payment mechanisms for providers (such health care they receive. 
as fee-for-service or capitation); organization and co-ordination of 9 We will radically reduce the use of all restrictive practices and take 
services within the health care sector and across other sectors; pre- action to end the use of high-risk restraint, including face-down re- 
vention of illness and promotion of well-being; treatment and re- swain and holding PEERS the floor. : 

10 We will use the Friends and Family Test to allow all patients to com- 


habilitation interventions, including access to medicines and other 














Boat ee , ment on their experience of mental health services—including 
services in different country contexts; quality of services and per- dicene menel reali senviiees. 
formance monitoring; information systems; recruitment, training, 11 Poor-quality services will be identified sooner and action taken to 
and retention of skilled human resources; and commitment to re- improve care and where necessary protect patients. 
search and evaluation. 12 Carers will be better supported and more closely involved in deci- 
A recent example can be given from England—a comprehensive sions about mental health service provision. 
policy framework from the 2010-2015 Coalition government (Box 13 Mental health care and physical health care will be better integrated 
134.1) [43]. More recently, an independent Taskforce was commis- ey level. : , 
14 We will change the way frontline health services respond to 


sioned by NHS England. Its report summarized developments in a 

mental health treatment in England over recent decades and set out 15 No-one experiencing a mental health crisis should ever be turned 

a number of ‘priority actions’ for the National Health Service (NHS) away from services. 

for the next 5 years [44]. 16 We will offer better support to new mothers to minimize the risks 

: i and impacts of postnatal depression. 

Financing 17 Schools will be supported to identify mental health problems 

There are numerous reasons why a formal mental health policy is PORET : 

needed. There are too many distressing, damaging, and durable 18 We will end the cliff edge of lost support as children and young 
ways : : people with mental health needs reach the age of 18. 

consequences for any civilized society to ignore. There are collat- 


ne ; 19 People with mental health problems will live healthier lives and 
eral damages for communities and the national economy. Another longer lives 


reason is that families cannot be left to address the issues alone, and 20 More people with mental health problems will live in homes tha 
there is only so much that relatively informal, locally initiated, and support recovery. 

sustained community action can go. There is often market failure, 21 We will introduce a national liaison and diversion service so tha’ 
which manifests itself in policy terms in an emphasis on collective re- mental health needs of offenders will be identified sooner and ap- 





propriate support provided. 
22 Anyone with a mental health problem who is a victim of crime wil 





sponsibility for managing, delivering, and/or financing mechanisms. 
Most high- and middle-income countries—and gradually a 























: ; be offered enhanced support. 
number of low-income countries—rely on prepayment systems of 
. : . 23 We will support employers to help more people with mental health 
revenue collection as the basis for financing health care. In those problems to remain in or move into work. 
countries, the most common methods of financing are tax-based 24 We will develop new approaches to help people with mental health 
(37% in OECD countries), mandatory (social) insurance (36%), problems who are unemployed to move into work and seek to sup- 
out-of-pocket payments (19%), and voluntary (or private) insur- port them during periods when they are unable to work. 
ance (6%) [45]. Prepayment through social health insurance, private 25 We will stamp out discrimination across mental health. 
health insurance, or taxation is preferable to out-of-pocket payments Reproduced from Department of Health, Closing the Gap: Priorities for Essential 
because each individual’s need for health care is uncertain, but when Cinare Ma (lanet slabs Crepin (0N) Crown C spyn Rapreisieeeluncler thie 
, i e oe terms of the Open Government Licence v3.0. Available at https://assets. publishing. 
that need arises, it can generate enormous (perhaps ‘catastrophic’) service.gov.uk/government/uploads/system/uploads/attachment_data/file/281250/ 
Closing_the_gap_V2_-_17_Feb_2014.pd 


economic impacts in terms of treatment costs and lost earnings. 


Prepayment systems pool risks, thereby potentially redistributing 
benefits towards people with greater health needs, and can also be 
made progressive, so that poorer individuals pay less than wealthier 
people for equivalent access to health care. Obstacles to the use of 
prepayment mechanisms in low-income countries include the state 
of the economy, unstable governance structures, and the informality 
of much employment, making revenue collection impracticable. 

Financing arrangements for mental health services vary consider- 
ably from country to country, with out-of-pocket payments (which 
are both inefficient and unfair) still dominating in most low-income 
countries [46], but actually still fairly common in some other coun- 
tries [47]. As far as mental health systems are concerned, almost 
every country has a mix of public and private funding. Mental and 
physical health care needs are generally not treated with parity [48]; 
some insurance or managed care arrangements exclude mental 
health coverage in an attempt to cap expenditure, but this will 
then have predictably undesirable consequences for access, knock- 
on costs, societal inefficiencies, and inequity [49]. The so-called 
Obamacare in the United States (centred around the Affordable 
Care Act 2010) made important breakthroughs in expanding access 
to health insurance, including mental health coverage [50], and il- 
lustrates well the need for strategic policy efforts to be made to ad- 
dress financing issues. 

The revenue collected from taxes or insurance premiums reaches 
service providers via various commissioning or purchasing routes. 
Provider reimbursement can be retrospective (such as fee-for- 
service) or prospective (capitation and fixed budgets). Fee-for- 
service arrangements encourage productivity but also perversely 
encourage resource consumption through unnecessary visits, diag- 
nostic investigations, and hospitalization (‘supplier-induced de- 
mand’) and so can push up overall costs. Prospective payments are 
therefore increasingly being used to encourage cost-consciousness 
among providers. Capitation is a fixed payment for a defined set of 
benefits (treatments or health gains). It encourages efficiency but has 
some risks, including adverse selection and cream-skimming (where 
providers have no incentive to treat ‘complicated or expensive’ pa- 
tients), which could shift costs onto other providers or sectors or 
lead to undertreatment [51]. A commonly used mode of prospective 
payment is by case-mix, for example using diagnosis-related groups 
(DRGs)—patients are categorized by clinical diagnosis and treat- 
ment process and a standard payment is made to the provider. DRG- 
based pricing is not always straightforward to implement in mental 
health contexts. 


Prevention and risk reduction 


Reducing the number of people who develop mental health prob- 
lems and delaying or avoiding relapse and symptom exacerbation 
must surely be high policy priorities—to head off or reduce distress 
and suffering for individuals and families, and potentially avoid 
substantial future streams of expenditure and productivity losses. 
Prevention and early intervention therefore figure prominently in 
many policy frameworks and may require action outside the health 
sector. Such actions can be highly effective and cost-effective, as the 
following examples illustrate. 

Postnatal depression (PND) has high prevalence but is often un- 
diagnosed and untreated. The effects for women and their children 
can be considerable. A recent systematic review looked at a range of 
interventions developed to prevent PND [52], finding a number of 
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beneficial interventions, including midwifery-redesigned postnatal 
care, person-centred approaches, and cognitive behavioural 
therapy—which are intended to be universal—as well as some inter- 
ventions targeted on high-risk women such as interpersonal psycho- 
therapy (IPT). The study also compared interventions in terms of 
cost-effectiveness, compared to usual care, and although there was 
uncertainty about what might be the ‘best buy, the accumulated 
international evidence clearly shows how prevention can have con- 
siderable benefits for mother and child and make economic sense. 

School-based interventions can also be effective at preventing 
some mental health problems and also cost-effective such as those 
that aim to develop social and emotional development [53], build 
resilience and thereby reduce the risk of later depression [54], 
or instil positive behaviours to reduce tobacco use and improve 
well-being [55]. 

Suicide prevention is possible, but complex. The WHO recom- 
mends reducing access to the means of suicide (pesticides, guns, 
certain medications), more responsible media reporting, policies to 
reduce alcohol misuse, early identification and treatment of people 
with mental or substance use disorders who could be high risk, 
training of health workers to recognize, assess, and manage suicidal 
ideation, and follow-up support for people who have attempted 
suicide [56]. 

Another area where there is good evidence of the potential for 
risk reduction is in relation to dementia. Mid-life risk factors for 
developing Alzheimer’s disease and other dementias in late life are 
physical inactivity, smoking, diabetes, hypertension, obesity, depres- 
sion, and lower educational attainment [57]. These risk factors are 
clearly interconnected, but even after adjusting for correlations, al- 
most a third of Alzheimer’s disease cases might be ‘attributable’ to 
risk factors that are potentially modifiable [58]. 


Effective treatments 


An obvious element within any policy framework would be to en- 
sure that effective treatments are available to people who are ill. 
There is not a bottomless pit of resources, however, and so a policy 
framework will also need to be clear about how to prioritize needs 
and target treatments and ensure that resources (public and private) 
are not wasted on treatments that are ineffective or expensive, rela- 
tive to their outcomes. The extent to which a mental health policy 
engages in the details of prioritization, targeting, evidence gener- 
ation, or cost-effectiveness recommendation depends heavily on 
a country’s overall political approach to health policy. In England, 
with its central ‘command-and-control’ NHS, policy documents 
in mental health and other areas have often been quite prescriptive 
about the treatments to be delivered and the people who should 
receive them. In contrast, the health system in the United States is 
dominated by the private decisions of citizens (for example, in re- 
lation to the insurance plan they purchase), the independent case- 
level decisions of clinicians, and state-level decisions about priorities 
and funding generosity. 

In England and Wales, much of the responsibility for evidence ap- 
praisal and prioritization has been devolved by the government to 
the National Institute for Health and Care Excellence (NICE). With 
its thorough technology appraisals and its transparently developed 
clinical guidelines, NICE aims to help individual clinicians and 
other professionals to choose the best treatments, defined as those 
of proven effectiveness and also cost-effectiveness in the context 
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of NHS structures and resources. Some other countries also have 
health technology appraisal (HTA) mechanisms, varying in struc- 
ture from country to country and leading to variation in decisions 
too [59]. 

One of the most impressive policy initiatives of recent years in 
some countries has been the closure of the old psychiatric asylums. 
In most European countries, and for many decades, large mental in- 
stitutions dominated the landscape. Although originally built with 
some good intentions (to group people together to make best use 
of scarce skilled professionals, achieve economies of scale, and pro- 
tect vulnerable individuals), these asylums were also convenient 
ways to allow families to hide away their ‘disturbed’ or socially 
embarrassing relatives and, in some countries, were instruments of 
political oppression. Many institutions provided appalling, abusive 
care, breaching internationally accepted human rights [60]. Where 
evaluations have been conducted, they demonstrated clearly the po- 
tential to replace institutional care with good-quality community- 
based care, so long as it is adequately funded [61]. 

There are areas where effective treatments are simply not reaching 
people who need them and where mental health policy has failed. 
Youth mental health is one area where there is an enormous treat- 
ment gap in many countries, with people aged 16-25 with mental 
health problems having low contact rates with services. Another 
failure is the frequent neglect of the physical health needs of people 
with mental health problems, leading to significantly truncated life 
expectancy. A third example of common policy failure is in relation 
to dementia where evidence on interventions is now accumulating 
rapidly [62], but where few countries have yet pulled together a co- 
herent overall strategy, although with some notable exceptions [63]. 
But even these failings pale into insignificance when compared to 
the huge mental health treatment gap in most low-income countries 
of the world [64]. The WHO’s mhGAP initiative mentioned earlier 
seeks to scale up services for mental, neurological, and substance use 
disorders for all countries. 

Because mental health problems often affect families and other 
community members, a policy framework should include attention 
to the needs of family members, especially if they are regular (but 
unpaid) carers. Some countries have already developed formal carer 
strategies. 

Treatments are not necessarily delivered only through the health 
care system, as conventionally defined, but could be delivered in 
schools and colleges, workplaces, prisons, and community services. 


Tackling inequity 


There will always be inequalities, but not necessarily all of them are 
unfair. Inequality and inequity are not the same; the latter refers to 
an allocation of resources or responsibilities that is deemed to be un- 
fair by reference to some external criteria, such as a decision-maker’s 
preferences or society’s agreed priorities, while inequality simply de- 
scribes the difference between two or more things. According to the 
WHO, ‘Equity is the absence of avoidable or remediable differences 
among groups of people, whether those groups are defined socially, 
economically, demographically, or geographically’ [65]. 

What is meant by ‘fair’ is a value judgement, and hence likely to be 
reflected (though perhaps implicitly) in political statements or posi- 
tions. Most people would probably agree that it would be equitable 
to allocate more treatment resources to people with greater needs, 
and fewer resources to people with less needs. But there might not 


be such ready agreement that it is equitable for poorer individuals to 
pay less for their treatment than wealthier individuals; not everyone 
supports redistributive policies of this kind. 

It is abundantly clear from many studies that the risk of mental 
illness is not equally or fairly distributed across the population, and 
neither is access to effective treatment. Socio-economic status plays 
a big part in these unequal distributions. Both absolute and relative 
poverty affect the development of emotional, behavioural, and psy- 
chiatric problems [10, 66]. It is not only socio-economic status that 
can influence incidence, prevalence, and treatment, but also race, 
ethnicity, gender, age, language, religion, sexual preference, and 
place of residence [67]. 

As noted earlier, there is a vicious cycle linking mental illness 
with social and economic disadvantage. For any mental health 
policy to be successful, it must surely find ways to break into that 
cycle. It needs to ensure that effective treatment and support are 
accessible to everyone, regardless of individual characteristics, 
social position, or the ability to pay. It should support efforts to 
build resilience and invest in mental health literacy. It needs to sup- 
port wider efforts to tackle poor housing, unequal access to educa- 
tion and employment, and basic material poverty. It must protect 
and promote human rights [68]. In countries without prepayment 
financing mechanisms (tax- or insurance-based) or where such ar- 
rangements are only affordable by the wealthy, introducing funding 
reform would make an enormous difference to access and better 
mental health [46]. 


Opportunities, choice, and control 


When people with mental health problems were contained in 
large, remote asylums, they were denied the most basic of liber- 
ties and rights, often for many decades. A prominent feature of 
most mental health policies today is to broaden the range of op- 
portunities that are available, tackling the endemic inequities that 
damage so many people's lives. Indeed, there are a number of dif- 
ferent, but connected, themes that feature in policy discussions 
today, including opportunities, choice, control, personalization, 
and recovery. 

One important area is employment; creating better opportun- 
ities and breaking down attitudinal barriers can help to socially in- 
tegrate someone with a history of mental health problems, make 
them economically independent, and potentially head off exacerba- 
tion of their illness. Widening choice is seen as intrinsically desir- 
able; it leads to greater autonomy and arguably better well-being. 
Giving people more choice and control over their lives, including 
the support services they use, can benefit both health and well-being 
and also prove cost-effective. Experiments in England in devolved 
funding through personal budgets for social care and continuing 
health care achieved encouraging results [69, 70]. 

The concept of recovery—not in the clinical sense of symptom 
alleviation or cure, but in a broader, personal sense—is attracting 
growing attention. It is associated particularly with arguments pro- 
pounded by Bill Anthony some years ago: 


‘[Recovery is] a deeply personal, unique process of changing one’s 
attitudes, values, feelings, goals, skills, and/or roles. It is a way of 
living a satisfying, hopeful, and contributing life even with limita- 
tions caused by illness. Recovery involves the development of new 
meaning and purpose in one’s life as one grows beyond the cata- 
strophic effects of mental illness’ [71] 


The top priority in the English government's 2014 mental health 
policy statement was “High quality mental health services with an 
emphasis on recovery should be commissioned in all areas’ (Box 
134.1). Specific recovery-focused initiatives might include peer 
support arrangements, opportunities for self-managing treatment, 
supported employment, advice on welfare entitlements and how to 
manage debt, advance treatment directives, health promotion, per- 
sonal budgets, and recovery colleges [72]. 


Attitudes and discrimination 


At a strategic or societal level, policies are needed to raise aware- 
ness of mental illness, improve knowledge about prevention and 
treatment, tackle prejudice and stigma, and directly combat dis- 
crimination. In terms of individual experience, policies are needed 
to stamp out bullying and victimization. In these ways, it is hoped 
that there will be a number of gains—more supportive attitudes will 
lead to less treatment avoidance by people experiencing poor mental 
health, and so fewer delays in their taking up appropriate treatment, 
and hence better health and well-being. National anti-stigma pro- 
grammes have been successful [73] and can be cost-effective [74]. 
More targeted anti-bullying interventions can certainly help too. 


Co-ordinated action 


Policy intentions and actions must be co-ordinated, given that people 
with mental health problems, especially if severe or enduring, often 
have needs that span a number of different service systems. Their 
needs could require action and support from health, social care, 
education, housing, employment, welfare benefits, criminal justice, 
and other systems, and these will need co-ordination to ensure that 
vulnerable people do not ‘fall through the cracks. A mental health 
policy therefore needs to include a strategy for bringing together 
the various local, regional, and national bodies and processes that 
are involved, directly or tangentially, in supporting individuals and 
families. It needs to resolve funding issues (especially given the pro- 
pensity for silo-budgeting), co-ordinate assessment procedures, and 
agree on both short-term and longer-term objectives. Co-ordination 
is needed not just across the public sector, but also with non-public 
entities such as employers, schools, and community groups. 


Concluding comments 


As noted at the outset, mental health problems represent or combine 
a number of complex issues and characteristics. Incidence is com- 
monly early in life, and prevalence of many conditions is enduring 
over much of the life-course. The distress associated with mental 
illness can be immense, and there can be adverse consequences 
for family members and friends. People with severe and enduring 
problems may face enormous difficulties in securing and retaining 
paid employment. Many will face discrimination in employment 
and other areas of everyday life: education, participation, citizen- 
ship, and access to community resources. It is still common in many 
countries today for people with mental health problems to be locked 
away in large institutions that deny basic human rights and offer little 
active treatment. People with mental health problems have shorter 
lives because of their poor physical health, due in large part to their 
lifestyle, and they are at much higher risk of self-harm and suicide. 
The symptoms of illness or needs for material resources may lead a 
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minority of people to commit violent or acquisitive crimes. All soci- 
eties have the legal powers to compulsorily detain and treat people. 

The policy challenges of mental illness therefore are many and 
huge. A policy framework needs to recognize the complex aetiology 
of illness and respond to the wide range of individual, social, and 
economic impacts that often follow. It also needs to tackle some wide 
inequalities, many of them representing gross inequities in indi- 
vidual experiences and opportunities. Commitment to social justice 
should be a core part of any mental health policy. 

A good mental health policy would invest significant resources in 
prevention and risk reduction and ensure timely diagnosis and early 
intervention. It would include a strategy for developing, delivering, 
and ensuring access to effective and cost-effective treatments. Parity 
with physical health treatment is essential. Barriers to treatment and 
services need to be removed, whether due to ignorance, inability to 
pay, funder prejudice, or provider discrimination. The symptoms of 
an illness may themselves contribute to access difficulties. 

A good mental health policy would adopt a life-course perspec- 
tive, intervening early enough to prevent more serious symptoms 
and their consequences. It would promote co-ordinated multi- 
sector efforts to identify, assess, respond, and fund appropriate ac- 
tions. Families and communities may need to be involved in some 
decisions and may themselves have needs that should be addressed. 
But through it all, the individual with a mental illness should be at 
the centre and should be given as many opportunities and as much 
choice and control as is possible and appropriate, given their cir- 
cumstances and state of health. Finally, a good mental health policy 
should include ongoing commitment to change societal attitudes to 
mental illness, promoting recovery in its modern interpretation. 
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The aim of this chapter is to assist clinicians and managers review and 
plan services effectively for their local population. Severe psychiatric 
disorders manifest themselves in social relations and often disrupt 
social structures; they have wide-ranging consequences, so services 
need to be comprehensive, providing both health and social care. 
Neither can be ignored in the wider context of their management. 

Planning mental health services rarely starts with a clean sheet. 
Where there is one, Tansella and Thornicroft’s ‘matrix’ model [1] isa 
particularly thorough and structured guide. It outlines service prin- 
ciples and needs assessment (at national, regional, and local levels) 
through to monitoring the cycle of planning and provision. Their 
hierarchical approach reflects the level of spend [2]. Case identifi- 
cation and outpatient treatments in primary care are the priority for 
low-income countries, and only with increased resources the estab- 
lishment of a secondary care mental health service [usually a form 
of generic community mental health team (CMHT)]. Not until these 
are well established are specialist and inpatient services indicated. 
This process must, however, take account of what is already in place. 

The range of spend and local cultures in mental health services is 
enormous, and attempting to address all of these would result in a 
series of unhelpful generalized statements. In this chapter, we will 
focus on the UK because it is an example of a relatively coherent, de- 
veloped system with a culture of research. The specificity of many of 
the descriptions should not be interpreted as a prescription, but as a 
platform for local consideration and decisions. 





The last 40 years have seen an explosion of mental health services 
research, alongside the shrinking and closure of mental hospitals 
(see Chapter 134). The research agenda is disproportionately anglo- 
phone, testing innovative alternatives to institutional care. More 
routine practices, crucial for safe and effective care, have been rela- 
tively neglected. 








This chapter will describe the essential components ofa mental health 
service. It will cover how they relate to one another, how they are 
linked into other essential services, and how their evolution should 
be monitored. Services for adults of working age (18-65 years) will 
be used as template. In many settings, these may be the only services. 
In better-resourced systems, a range of specialized services have 
evolved which are described elsewhere (see Chapter 138). 





Mental illnesses are common, and it is estimated that around 25% of 
people will suffer from a mental disorder at some point in their lives. 
However, most of these people will not see a psychiatrist. A useful 
starting point in service planning is to use any available prevalence 
figures to estimate the number of people with particular conditions in 
the community. In the UK, the periodic national Psychiatric Morbidity 
Survey [3], run by the UK Office of National Statistics (ONS), re- 
vealed the following prevalence of disorders among adults in England 
in 2007 [according to the criteria of the World Health Organization's 
International Classification of Diseases, tenth revision (ICD-10)] [4]: 


e Mixed anxiety and depression, 9.7%. 

e Generalized anxiety disorder (GAD), 4.7%. 

e Depressive episode, 2.6%. 

e Phobia, 2.6%. 

e Obsessive-compulsive disorder (OCD), 1.3%. 

e Panic disorder, 1.2%. 

e Psychotic disorders 
disorder), 0.5%. 


(including schizophrenia and bipolar 


In the UK, the first six of these disorders are termed common 
mental health disorders (CMHDs), and the last severe (or serious) 
mental illness (SMI). The overall prevalence of CMHDs, which 
are not mutually exclusive, was found to be 17.6% [3], while the 
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prevalence of SMI was likely to have been underestimated by the 
community survey, and other estimates range between 0.5% and 2%, 
with higher rates found in the inner city [5]. 


Treatment in primary care 


In the UK, around 70-80% of people with CMHDs are treated only 
in primary care by general practitioners (GPs), usually with SSRI 
antidepressants [6], while 10-15% are referred, or refer themselves, 
for psychological therapies offered by the Improving Access to 
Psychological Therapies (IAPT) programme [7]. That means only the 
remaining 5-10% of people with CMHDs receive care from multi- 
disciplinary mental health services, although they will be the most se- 
verely affected people. GPs may also be involved in the mental health 
care of people with SMI in the UK, although the majority, at least 
two-thirds, will be receiving care from mental health services [8]. 

This means at any one time, around 15-20 adults per 1000 will be 
in contact with mental health services in the UK, of whom the large 
majority will be suffering from SMI. It also means that a relatively 
small increase, of say 5%, in the proportion of people with CMHDs 
referred from primary care to mental health services would increase 
the workload of those services by around a third to a half! 


Referral from primary to secondary care 





Ideally, there should be a clear understanding between psychiatrists 
and their local GPs, to ensure that only the more needy patients who 
really require a specialist opinion are referred. Box 135.1 lists sug- 
gested referral criteria for patients with depression. 

Agreeing referral criteria like these, however, does require the 
psychiatrist to accept referrals which meet them. The psychiatric 
team needs to offer treatments which are not usually provided by 
the GP, including intensive support with or without hospital admis- 
sion, electroconvulsive therapy (ECT), prophylactic lithium or other 
mood stabilizers, etc. Simply prescribing another antidepressant and 
returning the person to their GP’s care is not an adequate response 
to a GP who has already tried three courses of antidepressants com- 
bined with psychological therapy. 


Building blocks of secondary mental health 





services: care and treatment 


Most mental health treatments (whether psychological, pharma- 
cological, or social) involve face-to-face interviews and require no 





Box 135.1 Indications for referral of patients with depression 


e Poor response to antidepressants (three different drugs sequentially) 
and psychological therapy (if available). 

e Recurrent episode within 1 year of last episode. 

e Self-neglect. 

e Postnatal depression. 

e Assessment for possible bipolar disorder. 

e Suicidal ideas and plans (urgent referral). 

e Psychotic symptoms (urgent referral). 














sophisticated equipment. Asylums provided social care, protecting 
disabled individuals while they recovered and, sometimes, pro- 
tecting society from them. Patients needing long-term institutional 
care are now very few, but psychiatry is still judged on how they are 
cared for and services must pay them due attention. 


No comprehensive service can survive without access to 24-hour 
nursing supervision for acute episodes of severe illness. They are 
needed for patients at risk from neglect or suicide, usually lacking 
sufficient insight to co-operate with treatment. Wards usually ac- 
commodate 10-20 patients. It is rarely possible to effectively staff 
and run stand-alone units of fewer than 3-4 such wards (30-60 
beds). Ward size is a trade-off between privacy and domesticity 
against safety and effective supervision. Single rooms afford privacy 
and, while initially expensive, improve flexibility and reduce conflict. 

Smaller, more flexible units, such as ‘crisis houses, offering 24- 
hour care are a useful complement to inpatient wards, but not an 
effective replacement. Ward design and management are increas- 
ingly crucial as improved community care means that inpatients are 
mostly involuntary and highly disturbed. 


How many acute beds? 


‘How many beds do we need for our local population?’ is often the 
first question asked by planners or managers. Unfortunately, there is 
no reliable or precise answer to this. Supply will drive use (perceived 
as need) and beds are rarely left empty despite enormous variation 
in their availability. International comparisons are complicated by 
differences in methods of reporting and a profusion of overlapping 
ill-defined local terms (for example, night hospitals, crisis homes, 
step-down wards). Levels of hostels and day care also impact the 
need for acute beds, which reduces as community services become 
more comprehensive. 

European provision of general acute mental health beds in the 
public sector in the early 2000s ranged from 64 beds per 100,000 in 
Germanic settings to 6 per 100,000 in Italy and Spain [9]. However, 
these figures tell us relatively little unless we know the levels of pri- 
vate and social services care provision. In the UK, with little parallel 
private care, overall provision has shrunk by 62% between 1988 and 
2008 [10]. 

Current low bed usage in the UK reflects recently established 
home treatment and crisis resolution teams and the expansion of 
forensic care, but also a shift in practice. The duration for admissions 
has steadily reduced over the last four decades. Mean durations are 
heavily skewed by short (1- to 2-day) crisis admissions, but patients 
with uncomplicated psychotic relapses usually stay between 3 and 
6 weeks. 


Longer inpatient care 


Acute inpatient wards are designed to admit patients for weeks or, at 
most, a couple of months, with rapid discharge anticipated, but some 
patients require longer or more secure care because of either illness 
severity or legal restrictions. Modern long-stay and rehabilitation 
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wards are essentially restricted to patients with behaviour that 
is persistently unacceptable to local communities. Forensic and 
secure services are usually a regional or national, rather than local, 
responsibility. 


Diagnosis-specific wards 


Alcohol and substance abuse wards are common in much of Europe, 
and diagnosis- or disorder-specific wards in academic services. 
Specialized wards for patients with anorexia nervosa or resistant 
schizophrenia provide highly specific regimes, but usually alongside 
acute admission wards, rather than as an alternative. Some services 
now organize wards by care pathway clusters (for example, a psych- 
osis unit, an affective disorders ward), instead of general wards. 
Such specialization is still experimental and not particularly feasible 
in comprehensive services for smaller populations, as they reduce 
flexibility and absorb energy in ‘boundary disputes. 


Day care is provided either in day hospitals or in day centres. The two 
terms and their practices overlap and vary enormously. Attendance 
generally ranges from half a day to 5 days a week, and many now re- 
main open in the evenings and at weekends. 

Day hospitals are provided by the health service, include med- 
ical and nursing staff, and provide treatments such as medication 
and psychotherapies. They figured prominently in the plans to move 
to district general hospital psychiatry. However, their role shrank as 
CMHITs expanded, taking over much of their anticipated role. Day 
hospitals were scaled down or even closed, shifting ongoing support 
to social services day centres. However, they have recently been re- 
invigorated as a key ingredient in home treatment teams. 

Day centres, provided by social care organizations, do not usu- 
ally provide complex treatments or employ clinical staff. However, 
their practice is highly specific to local context. A drop-in day centre 
may provide psychiatric assessment and treatment in deprived areas 
with high levels of homelessness. Generally, day centres provide 
long-term social support, and day hospitals focused interventions 
and treatments [11]. The “Club House’ is a specialized rehabilitation 
day centre, popularized in the United States, where members run the 
centre with minimal supervision. 

Acute day hospitals in Europe and partial hospitalization in the 
United States have been trialled as alternatives to inpatient care [12] 
but did not catch on, although they have been recently revitalized as 
bases for crisis resolution teams. Specialized therapeutic day hos- 
pitals serve specific groups such as personality disorders [13] or 
eating disorders. Day care is problematic in rural settings, but adap- 
tations such as travelling day centres or open days run by CMHTs 
have all been tried successfully. 





Supported accommodation and residential care 


Many patients can only survive outside hospital with support and 
supervision to ensure self-care and continued medication and to an- 
ticipate and defuse crises. This can be provided by voluntary agencies, 


social services, or health services. Voluntary agencies tend to be more 
efficient at providing long-term residential care [14], but they may be 
reluctant to accept high-risk patients with a history of violence or sub- 
stance abuse. A mixed economy works best, and the need for health 
services provision will depend on the vigour of local voluntary and 
social services. Purpose-built units do exist, but shared adapted houses 
are more common and promote integration and reduce stigma. 

Supported or sheltered accommodation can be classified by levels 
of increasing need: 


1. Group homes have no regular staff and are reserved for relatively 
independent patients, with visits by CMHT staff. 

2. Day-staffed hostels have one or two day staff to support and 
monitor patients (encouraging cooking and cleaning, etc.). 
Specific treatments are provided by the CMHT. 

3. Night-staffed hostels have non-clinical staff sleeping over, pro- 
viding safety and support. 

4. 24-hour staffed/nursed hostels have on-site clinical staff overnight 
either sleeping in or, sometimes, awake. They are expensive, gen- 
erally for patients with long-term severe illnesses, sometimes 
including involuntary patients (‘hospital hostels’). They tend to 
be larger, usually with 10-20 residents. 


Most comprehensive local services provide levels 1 and 2, and 
most social services undertake to provide level 3. Level 4 hostels are 
relatively rare. 





Office-based care and outpatient clinics 


In insurance-based systems (the United States and much of Europe), 
psychiatrists may run individual office practices and manage pa- 
tients on their own. In directly state-funded systems, this is rare and 
most psychiatrists work in outpatient clinics or CMHTs. Financial 
considerations can inhibit integration and service developments if 
they pose a threat to practitioners’ livelihoods. Office-based prac- 
tice is neglected in academic and policy publications. It is usually 
focused on either psychotherapy or pharmacotherapy and is poorly 
equipped for managing severe disorders. 

Outpatient clinics are increasingly replacing office practice. 
Psychiatrists and psychologists may still operate independently 
within them, but with access to enhanced resources and second 
opinions. In the public sector, outpatient clinics may operate either 
alongside CMHTs or as part of them (which works better for severe 
illness) [15]. They provide for efficient assessment, treatment, and 
monitoring. 


Community mental health centres 


Mental hospitals, for all their faults, had no problems co-ordinating 
care; what little was available was all in the same place. Community 
mental health centres (CMHCs) were established in the late 1960s, 
initially in the United States, to provide a wide range of services lo- 
cated in shared buildings (for example, depot clinics, a day hospital, 
psychotherapy services). Early CMHCs in the United States failed 
to engage psychosis patients because they lacked any outreach, and 
this, with an ideological down-playing of the ‘medical model’, made 
it impossible to recruit psychiatrists. 
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Now CMHCs provide accommodation for CMHTs and other 
services such as day care. This shared accommodation helps sustain 
clinical standards by reducing professional isolation with dispersed 
community services otherwise risking becoming idiosyncratic and 
rigid in their practice. 


Multi-disciplinary community mental 


health teams 





Most community mental health services consist of one form or 
another of multi-disciplinary CMHT. This comprises psychiat- 
rists, nurses, and social workers plus other specialist staff such as 
psychologists, occupational therapists, employment specialists, etc. 
Regular meetings to assess and review the management of patients 
incorporate a wide range of professional perspectives and allocate 
tasks based on skills and needs. 


The sector CMHT (‘the CMHT’) 


Who it is for 


The CMHT is the fundamental building block of most modern com- 
munity mental health services. Mental hospital catchment areas, 
which often covered a whole city or county, were divided into sectors 
of 50,000-100,000 inhabitants to facilitate continuity of care. The 
team could gain familiarity with most of its long-term patients and 
also some personal knowledge of the referring doctors and commu- 
nity resources. Western European sectors now range from 20,000- 
50,000 population, determined both by resources (shrinking as 
investment and specialization increase) and by local characteristics. 
With more specialized teams, CMHT responsibilities shrink and 
sector size may increase, but keeping its caseload fairly constant. 
Two hundred or so patients is about the maximum manageable to 
exploit multi-disciplinary working effectively. 

CMHTs offer assessment and care for patients who cannot be ad- 
equately treated in primary care, so they prioritize severe mental illnesses. 
A trial of community mental health nurse care for people with CMHDs 
[16] showed no benefit over usual GP care, even when the nurses were 
trained in problem-solving therapy (a brief CBT-based therapy for 
CMHDs). Diversion of mental health staff away from a focus on people 
with SMI does not appear to be a cost-effective strategy. However, diag- 
nosis is not all—social adversity, personality difficulties, or substance 
abuse can make secondary mental health care necessary, even for ap- 
parently ‘minor’ disorders. Instruments to clarify this threshold have 
been tried and, indeed, introduced in some countries [17] but are of 
questionable value. Refusal to assess a patient, not surprisingly, irritates 
referring GPs and most teams rely on clinical assessments. 

CMHITs can be remarkably inefficient if little thought is given to 
their structure and procedures. Agreement is needed on their pur- 
pose, clientele, and systems of management, and they have often suf- 
fered from a lack of clarity and leadership. 


Staffing and management 


CMHT staffing varies enormously and there is no uniform model, 
although the general trend is towards bigger, better-staffed teams. 


Small teams of less than six can rarely provide comprehensive care 
or cross-cover, while teams of more than 15 or so, or with many 
part-time workers, become unwieldy, overwhelmed with manage- 
ment and information transfer. CMHTs emphasize skill-sharing and 
a degree of generic working and have evolved an informal, demo- 
cratic style [18]. This often generates confusion about clinical lead- 
ership (originally provided by senior medical staff). With increased 
staff numbers and treatment complexity, ‘team managers’ now co- 
ordinate procedures, varying from being purely administrative to 
setting clinical priorities and supervising staff. Establishing a clear 
understanding of clinical leadership in CMHTs (without inhibiting 
initiative and creativity) is essential for effective functioning. If lead- 
ership and management are separated, the roles need to be well de- 
fined and the relationships good. 


The key to good care is accurate assessment (see Chapter 137). 
Traditionally, psychiatrists conducted all initial assessments (usually 
in an outpatient clinic) and involved team members in treatment. 
Increasingly, other disciplines have taken a role in assessments, ei- 
ther individually or jointly with the psychiatrist. This issue generates 
strong feelings, but there is surprisingly little research into it. With 
highly developed primary care, non-medical assessments may be 
effective, but otherwise medical staff should prioritize time for as- 
sessments. With severely ill patients, home-based assessments pay 
considerable dividends [19]. 


Case management 


Most CMHT staff act as clinical case managers [20, 21] with respon- 
sibility for the co-ordination, delivery, and review of care for their 
patients. Caseloads should be explicitly limited (usually 15-25), 
and reviews systematic and recorded. In the UK, this has been for- 
malized as the Care Programme Approach (CPA) [22]. Fig. 135.1 
shows a care plan indicating a patient’s needs or problems, the inter- 
ventions proposed to meet them, who is responsible and who is 
informed, plus an agreed date for review. Such concise structured 
paperwork (as with the risk assessment and contingency plan) (Fig. 
135.2) can be adapted to any team. It co-ordinates complex care and 
serves as a natural focus for clinical reviews. The level of detail needs 
to be clinically (not managerially) determined. 


Team meetings 


Many of the newer, specialized teams meet daily. For generic 
CMHTs, it would be an inefficient use of time. They generally have 
1-2 regular meetings of 1.5-2 hours each per week to cover clin- 
ical and administrative business. The degree of structure depends on 
team style and remit. 


Allocation of referrals 


Referrals can be allocated by who is first available or by matching the 
clinical problem against available skill and training. Time discussing 
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CPA REVIEW 


Patient's name: Jenny T 


Address: 56 Acacia Avenue 


Phone: 


Date of birth: 09.06.61 


GP: Dr Findlay 


Phone: 


CMHT: West Central 

Phone: 

New patient: VESNO 

If NO, date of review: 20.10.07 


Diagnosis: 


1...Major depressive disorder 





You must consider the following: 1) Mental health, including indicators of relapse; 2) Physical health; 3) Medication; 4) Daytime 
activity; 5) Personal care / living skills; 6) Carers, family, children and social network; 7) Forensic history; 8) Alcohol or substance misuse 


9) Cultural factors; 10) Housing/finances/legal issues. 


Complete a risk assessment and include: i) a crisis plan; ii) a contingency plan 


Assessed needs or problem 


1. Depressed mood, apathetic and self critical 


2. Suicidal thoughts 


3. Daughter's school problems 


4. Plan for recovery 
Professionals involved in care: 


Psychologist 


Present at planning meeting: v Psychologist 


Copy given to patient? 


Care co-ordinator(print): Billie Jarvis (BJ) 


Care co-ordinator (signature): 


Job title: 


On Supervision Register? 


On Supervised Discharge? 


Fig. 135.1 Care programme review document. 


allocations before assessment is wasteful, and most well-established 
teams delegate the task to the manager or a senior clinician. 


Reviews should be held for: (1) new patients; (2) routine monitoring; 
and (3) discharge. Reviews can range from simply reporting the 


Care management? 





Relapse + risk plan required? 


Intervention Res p.of 


Regular home visits, assess mental state 
Encourage compliance with antidepressants 
Encourage activity - take to shops etc 


Explore severity (+/- plans) at each visit 
Support mother and husband who are scared 
of suicidal thoughts 


BJ/ Cons 


Maintain links with class teacher 
Keep family informed of her progress 


Link with support group when mood lightens 
Help reapply for part-time cleaning job 


Ward Nurse ACT 


Ward Nurse ACT Other 


YES/ MO Copy sent to GP? YES/ MO 


Phone 


YES/ NO Risk history completed? YES/NO 


YES/ NO 


problem and proposed treatment in uncomplicated cases through to 
detailed, structured, multi-disciplinary case conferences, including 
other services (for example, GP, housing, child protection). New 
patient reviews are an excellent opportunity for providing a broad, 
experienced overview and ensuring rational and fair allocation to 
caseloads. Routine monitoring is often overlooked yet is probably 
the most important for team efficiency. It should be systematic, and 
not only responsive to crises and problems. It shapes and redirects 
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CONFIDENTIAL: RELAPSE AND RISK MANAGEMENT PLAN 


Name: Alastair W 


Categories of Risk Identified: 


-¥ES/NO 
YES/NO 


Aggression and violence 
Exploitation (self or others) 
Suicide and self-harm 
Other (please specify} 


Severe self-neglect 
Risk to children & young adults 


YES/NO- 
YES/NO 


Current factors which suggest there is significant apparent risk: 
(For example: alcohol or substance misuse; specific threats; suicidal ideation; violent fantasies; anger; 
suspiciousness; persecutory beliefs; paranoid feelings or ideas about particular people) 


Continued excessive drinking—especially when depressed. Makes him more suspicious and hostile. 


Clear statement of anticipated risk(s): 


(Who is at risk; how immediate is that risk; how severe; how ongoing) 


Clear risk to strangers (not family or staff), usually in bars. Often when poor medication compliance. 


Action Plan: 


(Including names of people responsible for each action and steps to be taken if plan breaks down) 


Relapse plan discussed and agreed—to increase antipsychotics and contact when concerned with 


people plotting (‘to help you cope with them’). 


If he feels seriously threatened to seek admission through the emergency room 


Date Completed: XX/XX/XXx 


Fig. 135.2 Risk assessment and contingency plan. 


treatment and identifies patients ready for discharge, and monitors 
the burden on individual staff members. Routine monitoring is a 
requirement of the CPA. Discharge reviews are an excellent oppor- 
tunity for audit and learning within the team. 


Managing waiting lists and caseloads 


Effective CMHTs need to guarantee prompt access. Routine assess- 
ments should be within 2-4 weeks. Sooner is rarely productive, and 
delays above 3 weeks result in a rapidly rising rate of failed appoint- 
ments [23]. Urgent assessments (most psychotic episodes) need to 
be seen within a week, usually within a couple of days. Emergency 
assessments for those at immediate risk (for example, hostile behav- 
iour or suicidal intent) need to be seen on the same day. These prag- 
matic guidelines may soon be superseded by imposed targets. 

A practical approach to waiting lists is to count the assessments in 
the preceding year and allocate routine appointments for 20% more. 
Thus, for a team with 400 assessments in the preceding year, allo- 
cating nine slots a week will have one slot available weekly for emer- 
gencies. Rapid routine assessment reduces pressure for urgent and 
emergency referrals much more efficiently than emergency rotas. 


Communication and liaison 





Team meetings ensure internal communication, but CMHTs 
need good links with a wide network of professional colleagues. 
Structured liaison is advisable with primary care and general hos- 
pitals. Hospital links may be between specific CMHTs and wards, 


Review date: 





XX/XX/XX 


or CMHTs may provide input to patients from their sectors in the 
absence of dedicated liaison psychiatry services. 


General practice liaison 


As so much mental health care is delivered in primary care, effective 
co-ordination is essential. ‘Consultation-liaison’ also represents effi- 
cient use of a specialist team’s time if it increases the capacity of pri- 
mary care through improving GPs’ skills and knowledge. However, 
there has been little systematic research evaluating consultation li- 
aison interventions and this model would apply only in those coun- 
tries with more developed primary and secondary care services [24]. 
GP liaison systems range from informal contact through to shared 
care and co-location of CMHTs in GP health centres [25]. An ef- 
fective system need not be time-consuming but requires regular, 
timetabled meetings between the two teams or a ‘link CMHT 
member. Monthly meetings where shared and complex patients are 
discussed facilitate prompt problem-solving and crisis anticipation. 
However, it is important to be clear about responsibilities; fudging 
boundaries is risky. 


Liaison with other agencies 





The same principles apply to liaison with other agencies (social 
services, housing, etc.). Whether regular meetings are feasible 
and cost-effective will depend on the volume of shared work, but 
showing up and meeting people (even just once) pays enormous 
dividends. 
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Assertive outreach teams 


The most replicated and researched specialist CMHT is the assertive 
outreach (AO) team. The original US model [26] substantially re- 
duced hospitalization without increased cost. AO teams (Box 135.2) 
are staff-intensive and reserved for the most difficult (‘hard-to- 
engage’ or ‘revolving-door’) psychotic patients. These have frequent, 
often dangerous, relapses and poor medication compliance, compli- 
cated by alcohol or drug abuse. 

AO emphasizes proactive outreach—visiting patients at home, 
even when they are reluctant. It exploits enhanced teamworking 
with daily meetings and several staff involved with most patients 
both for safety considerations and to meet patients’ extensive needs. 
The culture is very practical (taking patients shopping, sorting out 
accommodation, delivering medicines daily if need be), often well 
beyond traditional professional boundaries. 

AO brought much needed clear thinking and research to CMHT 
practice. However, they failed to deliver the same expected benefits 
in the UK as in the United States [27]. Similar results were found 
without slavishly following the original model [28, 29]. Detailed 
examination of AO studies [29] showed that CMHTs delivered 
equivalent outcomes at much less cost. Consequently, the UK re- 
quirement for these teams has been abandoned, with intensive 
care provided within the CMHT. A more structured approach to 
identifying and monitoring patients in CMHTs needing intensive 
care has been proposed as FACT (functional assertive community 
treatment) [30]. 


Crisis teams 


Crisis teams play a crucial role where local services are poorly de- 
veloped (they may be the only community services). They priori- 
tize rapid response and accessibility. Early crisis services often had 
a short lifespan as they either evolved into more durable CMHT 
services or became overwhelmed by inappropriate referrals. 


Crisis resolution/home treatment (CR/HT) teams 





The CR/HT team model implemented in the UK and Europe re- 
flects increased public demand for access in crises plus a drive to 
reduce inpatient care costs. It draws heavily on AO practice with 
limited shared caseloads, flexible working, extended access, and 
an emphasis on outreach. Reduction in hospitalization is believed 


Box 135.2 ACT core components 


e Assertive follow-up. 

e Small caseloads (1:10-1:15). 

e Regular (daily) team meetings. 

e Frequent contact (weekly to daily). 

e In vivo practice (treatment in home and neighbourhood). 
Emphasis on engagement and medication. 

Support for family and carers. 

Provision of services using all team members. 

Crisis stabilization 24 hours a day, 7 days a week. 














to offset much of their cost [31]. How they integrate with generic 
CMHTs varies markedly, but they work with patients who would 
otherwise be in hospital—either by supporting them in early dis- 
charge or seeking alternatives to admission. They offer intensive 
visiting (usually daily for a limited period) and considerable prac- 
tical support and work with patients’ social networks. Many are 
based in a crisis day hospital, and input is limited to a few weeks. 
Such intensive teamworking requires highly effective communi- 
cation and the teams meet daily (often twice at shift handovers). 
Information transfer can be burdensome and liaison with CMHTs 
complex, requiring absolute clarity on local arrangements for clin- 
ical responsibility. 


Variations in practice and sustainability 





The CR/HT teams may reduce the need for hospital care [31, 32]. 
The UK model is precisely specified (including who it should and 
should not care for) (Box 135.3), but practice varies considerably. 
A full 24-hour service is rarely provided; most rely on established 
on-call services. Strict observance of the prescribed patient group 
‘otherwise in hospital’ is difficult to achieve, and engagement may be 
longer. The main criticism is of multiple repetitive visits from end- 
lessly changing staff members. Good medical staffing is needed, and 
CMHT responsibilities need to be carefully negotiated, mutually 
agreed, and crystal clear if confusion is to be avoided. 


Crisis houses and respite care 





Crisis house admissions involve a minimum of formality with less 
intense supervision than hospitals. They are usually small (4-8 beds) 
in a relatively domestic setting and admit for days, occasionally a 
week or two. They are favoured for vulnerable women and early 
intervention services. Most have one staff member overnight and 
a couple during the day, with input from patients’ case managers. 
They improve acceptability and access but do not replace inpatient 


Box 135.3 Remit of UK crisis resolution/home treatment teams 





‘Commonly adults (16 to 65 years old) with severe mental illness (schizo- 
phrenia, manic depressive disorders, severe depressive disorder) with 
an acute psychiatric crisis of such severity that, without the involvement 
of the CR/HT team hospitalisation would be necessary.’ 

‘The service is not usually appropriate for individuals with: 

e Mild anxiety disorders. 
e Primary diagnosis of alcohol or other substance abuse. 

e Brain damage or other organic disorders including dementia. 
e Learning disabilities. 

e Exclusive diagnosis of personality disorder. 
e Recent history of self harm but not suffering from a psychotic or ser- 
ious depressive illness. 

e Crisis related solely to relationship issues.’ 








Reproduced from Department of Health, The mental health policy implementation 
guide, Copyright (2001), Crown Copyright. Reproduced under the terms of the Open 
Government Licence v3.0. Available at https://webarchive.nationalarchives.gov.uk/ 
20120514200638/http://Awww.dh.gov.uk/prod_consum_dh/groups/dh_digitalassets/ 
@dh/@en/documents/digitalasset/dh_4058960.pdf 

















CHAPTER 135 Planning and providing mental health services for a community 


care and need careful management to avoid becoming chaotic or 
blocked. 


Early intervention service 


Concern that a long duration of untreated psychosis (DUP) confers 
poorer prognosis [34] spurred the development of early interven- 
tion service (EIS) teams which have now become standard in the 
UK. Developed mainly from Australian and UK models [35, 36], 
they vary remarkably. Some down-play diagnosis in favour of easy 
access; others restrict to schizophrenia; some emphasize a ‘youth 
service, while others take all first-episode patients, irrespective of 
age [18]. Even more confusing, there are three quite different activ- 
ities which may, or may not, be part of the service (Box 135.4). 

The core of EIS is a specialized CMHT based closely on AO team 
practice which case-manages first-episode psychosis patients. EIS 
emphasizes protecting social networks and functioning, aiming 
to keeping patients at college or work, and assuming a return to 
premorbid functioning. Crisis and respite houses are preferred to 
hospital. Some EIS teams go beyond this basic model and conduct 
public awareness campaigns, lecturing in schools and colleges [37]. 
A minority of research teams attempt to identify and treat ‘ultra- 
high-risk’ patients to prevent progression to psychosis [38]. EIS 
teams are highly popular with families, staff, and commissioners. 
They are generally better staffed and funded than other CMHTs, and 
inevitably problems arise with ‘step-down to routine services. 


Adjunct or replacement for CMHTs? 


The four teams outlined comprise the fundamental building blocks 
of most community services. AO EIS and CR/HT teams were ori- 
ginally proposed as replacements for CMHTs. However, both ex- 
perience and research evidence [39] suggest that they are better 
considered as additions to improve the quality of care from otherwise 
well-functioning CMHTs, rather than as cost-saving alternatives. 





Highly specialized and diagnosis-specific teams 


Several specialized teams are generally organized at a regional level. 
They impact on local services, removing some of the CMHT’s clin- 
ical load, but need clear and negotiable thresholds if confusion is to 
be avoided. 


Forensic and rehabilitation teams 





Community-focused services face particular difficulties with 
treatment-resistant patients, particularly those with socially 


Box 135.4 Components of early intervention teams 





e Case management—ongoing care of identified patients. 
e Early identification—awareness-raising campaigns for psychoses. 
e High-risk and prodromal patient identification and treatment. 





unacceptable or offending behaviour who fit poorly into open 
wards. Specialized forensic teams provide secure care. Increasingly, 
they also run community teams to provide intensive case manage- 
ment of dangerous patients. Integrating them with general services 
can be problematical, and boundaries are invariably controversial. 


Rehabilitation 


A significant number of patients remain disabled and require long- 
term management of disability, rather than episode-based care, 
despite optimal treatment. Rehabilitation teams generally serve pa- 
tients who cannot survive without supervised accommodation, even 
when at their best. They comprise a disturbed ‘new long-stay’ popu- 
lation, characterized by comorbid substance abuse and behavioural 
disturbances. 


Diagnosis-specific teams 


Highly specialized teams for individual disorders (for example, 
eating disorders, personality disorders, bipolar patients) concentrate 
on specific skills and provide specialized treatments and are usu- 
ally provided at a regional level. They provide settings for intensive 
academic research and treatment refinement, but their opportunity 
costs need careful thought. 


Compulsion and ethics in community mental 


health care 





Balancing patients’ welfare with their autonomy and balancing their 
individual rights with those of their families and the wider commu- 
nity have both become more complex as CMHTs have developed. 
The introduction of community treatment orders (CTOs) [40] high- 
lights these dilemmas. Compulsion and coercion (either explicitly 
in the form of legal requirements or informally through professional 
or social pressure [41-43]) are now a pervasive feature of practice. 
Improved clinical provision and legal scrutiny make compulsory 
treatment possible in the community. 

Most developed countries have enacted forms of CTO (‘mandated 
community treatment, ‘outpatient committal’). Their introduction 
has generally been controversial, and this is heightened by mounting 
evidence of their ineffectiveness [44]. CTOs have the advantage of 
legal scrutiny, unlike most of the ethical dilemmas facing CMHT 
staff in their day-to-day work which require case-by-case discus- 
sion. How proper is it to inform neighbours if a patient may pose 
a risk to them but will not give consent? Is it right for a patient to 
deny information on his or her treatment to parents on whom he or 
she is heavily dependent? Guidelines exist only for extreme circum- 
stances, and teams need regular discussions of these issues. 


Stepwise planning and adaptation 


Local population needs assessment 


Psychiatric morbidity varies considerably with social deprivation 
and is much higher in cities than in stable rural or suburban settings 
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(see Chapters 134-136). At regional and national levels, compara- 
tive need can be predicted fairly well from established indexes 
incorporating age profiles, number of migrants, overcrowding, pov- 
erty, etc. [45]. Catchment areas should broadly reflect these differ- 
ences. At the local level, however, these figures are of limited value. 
How can one factor in travelling time or differences in the quality of 
primary care? A process of negotiation is best to agree on local allo- 
cation following these general guidelines. However, a concentration 
of hostels for the mentally ill or homeless or the presence of a railway 
station or an international airport may swamp these differences. 
Such local factors should be provisionally estimated in the planning 
process, but regularly reviewed. 


Opportunity costs and unintended consequences 





Planning mental health services is driven by political and policy 
considerations, at least as much as on international evidence, often 
including cost-effectiveness analyses. However, these analyses rarely 
address the opportunity costs across a whole system. A study may 
demonstrate that one acute day hospital is more cost-effective than 
another, but does the whole system gain when considering diverting 
the inpatient nurses redeployed to staff it? Rigorous intervention 
studies of large systems are formidably difficult to conduct and even 
more so to interpret. 

The impact of enthusiasm and the migration of the best staff to 
such research services can be especially misleading [46]. Successful 
new services are always reported, but not when they lose their edge 
or are abandoned [15, 47]. It is best to visit examples of services that 
you are considering, and not to assume automatically that what 
works for them will necessarily work for you [28]. 

Skilled manpower is as significant a resource limitation as funding 
is, hence the need for a system-wide appraisal. Although costs are 
usually cited, planning decisions reflect wider values and expect- 
ations, rather than simply outcomes [2]. 


Cultures and funding 


Health care cultures, their structures and their funding systems 
vary enormously. Services must be congruent with them otherwise 
they will not survive. The current emphasis on the ‘recovery’ model 
[48], while it may not make any major differences in practice, has 
to be incorporated into service planning proposals if they are to be 
funded. Occasionally, service planners can influence the culture but 
more often have to adapt to it. Obtaining relatively small changes 
in funding arrangements (or external governance) can deliver quite 
major improvements. However, caution is needed as the risks of un- 
intended consequences and perverse incentives are ever present. 
Enthusiastic clinicians are often blind to the risks of overprescrip- 
tion which can lock in outmoded practices (for example, a highly 
specific form of day hospital or crisis facility) that may not meet local 
needs but are too rigidly prescribed to then be adapted. 

How ‘integrated’ mental health care should be is a highly local 
decision. Well-established systems often strive for integration with 
general medicine or with social services to reduce both discrimin- 
ation and administrative barriers to integrated care. The current em- 
phasis on ‘parity of esteem’ for mental health is sometimes taken to 


imply such integration. For many services, the value of a distinct, 
separate identity can be considerable—not least the ability to protect 
its resources. 

The task of integration is easily underestimated, particularly the 
energies required to accommodate contrasting health and social 
care cultures. Compromises need to be agreed before such integra- 
tion and the negotiations can be exhausting, but they are infinitely 
preferable to fudges and misunderstandings. The costs and bene- 
fits will depend very much on the local situation, relationships, and 
history. 


Relationships with the voluntary sector and 


patients movement 





Current policies emphasize outsourcing, where practical, any wel- 
fare provision for which there is a ‘suitably qualified provider’ 
Relations with local non-health statutory services (housing, edu- 
cation, police) and with the voluntary sector will determine much 
of the success of mental health services, and their strengths need 
to be exploited. Non-statutory services are usually well adapted to 
local needs, but less comprehensive or reliable long term. Patient and 
carer advocacy and support organizations are now a major force; ef- 
fective working with them will significantly enhance both the design 
and the delivery of services. 


Monitoring and review 


Careful monitoring and review are as important as careful plan- 
ning, probably more so. Services can easily drift to those who de- 
mand them from those who most need them; delivery of treatments 
(particularly psychological and psychotherapeutic treatments) may 
become mechanical, losing their effectiveness. The most needy pa- 
tients are rarely the most demanding or well informed, and engaging 
them in long-term treatments is often difficult. 

A consistent effort to deliver even the most basic proven inter- 
ventions will make a substantial difference to patient welfare. The 
PORT study in the United States demonstrated how schizophrenia 
care was strikingly inconsistent and fell below accepted standards 
[49]. Regular audit, particularly of national guidelines such as those 
produced by the National Institute for Health and Care Excellence 
(NICE), ensures that services remain targeted and quality is main- 
tained. Monitoring can vary in sophistication—from a simple head 
count of who is getting what (for example, clozapine treatment) 
through to careful evaluation of care pathways. The audit review 
process feeds back into the development process. Audit invariably 
rewards its investment. 


Routine outcome measures (ROMs) 


Audit and measurement bring rigour and reflection, which are often 
lost in the immediacy of the therapeutic relationship. Measurement 
also serves a training purpose by benchmarking interdisciplinary 
understandings of symptoms and outcomes. Systematic, periodic 
recording of patients’ clinical or social status is increasingly used 
in both planning and research. Structured outcomes can be generic 
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[for example, Health of the Nation Outcome Scales (HoNOS) [50]] 
or for specific disorders (for example, the Brief Psychiatric Rating 
Scale [51]) or even locally developed. Their value lies in their con- 
sistency of use. 


Physical health care of people with SMI 


In the UK, GPs take responsibility for health promotion and care 
of the physical health of people with SMI. They are incentivized 
through the GP contract quality and outcomes framework (QOF) 
to review the needs of their SMI patients annually and ensure a care 
plan is in place [52]. This proactive care is crucial, as mortality rates 
for people with SMI are 2-3 times higher than for the general popu- 
lation. While this is partly due to suicide, two-thirds of the excess 
mortality is a result of physical disorders, especially smoking- and 
obesity-related cardiovascular and respiratory diseases [53-55]. 
The annual review of physical health for people with SMI includes: 


e Use of alcohol and drugs and smoking behaviour. 
e Blood pressure. 

e Body mass index. 

e HbA\Ic blood test for diabetes. 

e Cholesterol. 

e Cervical screening, if appropriate. 

e Medication review. 


In addition, GPs in the UK check blood lithium levels and monitor 
thyroid and kidney function of people with SMI taking lithium. 

Mental health service planning should take into account the 
physical health care needs of people with SMI. It will have to 
provide that care in countries with less well-developed primary 
care systems or residential mental health care settings. Even in 
the UK, the primary care of people with SMI varies widely in 
quality, and one cannot assume their physical health needs will 
automatically be met through registration with a GP. An audit of 
the care of schizophrenia in mental health services in England 
and Wales revealed only 29% received a fully comprehensive 
assessment of important cardiometabolic risk factors. Among 
those with high blood pressure and high cholesterol levels, there 
was frequently no evidence of appropriate investigation or treat- 
ment [56]. The authors recommended all health professionals 
working with such patients should have training in physical 
health problems and interventions for treating them, including 
access to the correct monitoring equipment. They also stressed 
that mental health services and primary care services need to 
work together to agree who will monitor and treat these physical 
health problems [56]. 


Planning and providing mental health services require flexibility 
and compromise. Epidemiological and service statistics provide a 
framework for local planning, but it is still primarily a practical and 
‘political’ activity. The scientific evidence is dominated by anglo- 
phone alternatives to hospital care studies, but local history, cul- 
ture, mental health law, and political imperatives cannot be ignored. 


Box 135.5 Developing local community mental health services 


e Make a careful inventory of what services exist and any special 

local needs. 
e Consult locally and invest heavily in building coalitions with policy- 
makers, statutory services, and voluntary groups. 
Test research evidence for durability and relevance. If possible, visit 
established services and ask ‘around the service’. 
Monitor and review regularly. Improved consistency of current prac- 
tice often delivers more than introducing new treatments. 
e Consider carefully opportunity costs. Include both the impact of 
a specific improvement across the whole service and the costs of 
system change itself. 
Avoid excessive reorganization—not all change is innovation. 














A high-profile patient homicide, or a strong public endorsement 
of services by a politician or celebrity, can derail months of careful 
planning. 

This chapter has attempted to draw out some principles (Box 
135.5) for the process, but these can only be guidelines. Inevitably, 
the decision will be based on what is possible locally. Most solutions 
rely on the tried and tested structures described here, but their con- 
figuration will depend on what is available and local values. Above 
all, the mentally ill and their families deserve reliable and predictable 
services. Not all change is innovation. Research findings should be 
judged in terms of their sustainability and translation from research 
efficacy to clinical effectiveness more than on the P-values in indi- 
vidual trials. 
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Resources available in the health care system are limited and are 
exceeded by needs. Public health and long-term care expenditures 
have been rising steadily, relative to national income, for several 
decades. Among the Organisation for Economic Co-operation and 
Development (OECD) countries, they increased by more than a 
fifth to approximately 6% of the annual national income or gross 
domestic product (GDP) between 2000 and 2010, and are pre- 
dicted to more than double, increasing to almost 14% of the GDP 
by 2060 unless active cost containment efforts are put in place [1]. 
Currently, the costs of mental ill-health account for up to 14% of 
health spending in OECD countries and this proportion is also ex- 
pected to rise further [2]. Worldwide, costs due to psychiatric dis- 
eases are expected to double by 2030 [3]. Decisions on how best to 
allocate scarce resources are therefore becoming increasingly prom- 
inent among health policy-makers. 

This chapter summarizes the health economic methods used to 
inform the key questions of decision-makers in health care related 
to efficiency and equity, which are ‘Who gets what, where, and how?. 
It provides an explanation of the underlying economic and analyt- 
ical frameworks and gives a summary of the different techniques of 
health economic analyses, supported by a selection of recent rele- 
vant published examples from the mental health field. 












































Health economic analyses are concerned with the efficient (and 
equitable) allocation of limited resources for the promotion of health 
and the prevention, treatment, and care of diseases to maximize the 
health of the population. Cost descriptions [for example, cost-of- 
illness studies (COIs)] assess the cost of interventions, services, or 
illnesses with no comparison in place. When information on out- 
comes is also described, but no comparison to status quo or other 
alternative is given, the analysis is called cost-outcome description. 
Cost analyses compare two or more alternatives, based only on their 
costs. To assess the overall value of a health intervention, however, 





decision-makers need information on the effect, the resources used 
to generate the effect, and the cost of these resources incremental to 
the next best alternative action. The value of the next best alterna- 
tive forgone is known as the opportunity cost. Economic evaluations 
are the comparative assessment of both costs and outcomes of alter- 
native health care interventions. They provide an explicit economic 
framework for the systematic identification, measurement, and 
valuation of the resource inputs and outcomes of alternative activ- 
ities, and the subsequent comparative analysis of these based on the 
evidence available [4] (Table 136.1). 


Cost descriptions 


Mental health disorders are common, often recurrent or chronic, 
have major impacts on individual and social well-being, cause ex- 
cess premature mortality, and therefore pose great disease burden. 
Worldwide, the burden of mental and substance use disorders in- 
creased by 38% between 1990 and 2010, driven mostly by popula- 
tion growth and ageing [5]. In 2010, they were among the ten leading 
causes of total disease burden in terms of disability-adjusted life years 
(DALYs) (7.4% of the total) and were the leading cause of years lived 
with disability (YLDs) worldwide (23% of the total) [5]. Depression 
particularly is considered the third leading cause of disability due 
to morbidity and mortality in Europe, causing 3.8% of all DALYs 
[6]. By 2020, depression is predicted to become the leading cause 
of disease burden, alongside cardiovascular diseases, worldwide [7]. 
COIs estimate the economic burden of an illness in terms of its costs 
either for a year based on its prevalence or for a lifetime based on 
its incidence. They may use top-down methods by disaggregating 
health (and possibly other intersectoral) expenditure data according 
to disease, or a bottom-up approach by projecting population-level 
cost of an illness from patient-level data. They are useful in pro- 
viding information on resource allocation in health systems, ana- 
lyse time trends, make projections of future health expenditure, and 
help make international comparisons. Comparability of top-down 
calculations between countries has improved substantially since the 
introduction of the System of Health Accounts (SHA) method by 
OECD in 2000, which provides a framework for a family of inter- 
related tables for standard reporting of expenditure on health [8]. 


Table 136.1 Health economic analyses 


Is there a comparison? 
C 


Is evidence only MES Cost description Cost analysis 


available on costs? 


NO Cost-outcome Economic evaluation 


description 


COIs can also greatly contribute to the identification of discrepancy 
between epidemiological and health service use data and identify 
unmet needs. For example, a pan-European cost of brain disorders 
study from 2010 concluded that less than one-third of all cases re- 
ceived treatment and suggested a considerable level of unmet needs 
[9]. There are several potential reasons for this, besides the general 
underfunding of mental health services. Issues such as availability 
of trained personnel, poor geographical distribution of services, 
and other access and utilization issues (language, socio-economic 
status, stigma, low weight of individual needs and preferences), lack 
of long-term thinking at decision-making level, provision of cost- 
ineffective services, bureaucracy, and, last but not least, fragmented 
funding and provision all play a role. 


Cost categories 


The direct medical costs of mental ill-health include medical expend- 
iture of all goods and services related to the prevention, diagnosis, 
and treatment of an illness (for example, physician visits, hospi- 
talizations, medication) due to an increased need for health care. 
Poor mental health also drives up the cost of treating other health 
problems. It is more expensive to treat physical ill-health conditions 
when the patient is suffering from a mental illness (for example, de- 
pression), and people with mental ill-health are more likely to suffer 
from physical comorbidities (for example, cardiovascular diseases) 
(Box 136.1). 

Direct non-medical costs refer to the cost of other goods and 
services related to the disorder, including social services, long-term 
care, special accommodation, and costs to the patients and their 
families (for example, travel costs, out-of-pocket payments). Mental 
illnesses also have broader societal impacts that lead to significant 
indirect costs. People with mental ill-health experience higher rates 
of unemployment. According to OECD estimates, people with se- 
vere mental health disorders are 6-7 times more likely, while people 
with mild to moderate disorders are 2-3 times more likely, to be 
unemployed than people with no mental health disorders [2]. In 
addition, people with mental ill-health retire earlier, have more 


Box 136.1 Costs of medications 


Often direct medical costs that are easy to estimate, for example medica- 
tion costs, are in the focus of resource allocation debates. Antidepressant 
consumption has at least doubled in the OECD countries between 2000 
and 2013 [10]. Globally, antidepressant medications were ranked ninth 
among all prescription drugs, with sales of over US$20 billion in 2011 
[11]. Despite the general trend of increasing consumption, drug costs 
only have limited effect on the total direct costs (between 6% and 29% of 
the total), with hospitalization remaining the main cost driver (between 
30% and 79%) in terms of the economic burden of depression and other 
mental illnesses internationally [12]. 
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absences from work (absenteeism), and also suffer more from pres- 
enteeism, meaning reduced productivity at work. In the case of de- 
pression, for example, lost productivity costs due to presenteeism 
are five times as high as those incurred due to absenteeism [13]. 
Indirect costs also include economic costs attributable to lost re- 
sources, but not involving direct payments such as informal care 
provided by family members and friends. A full accounting of the 
costs of poor mental health should include relevant intersectoral 
costs of lower educational achievements, increased homelessness, 
and crime. The intangible costs of mental ill-health are considerable 
and include the monetary value of reduced well-being, emotional 
distress, pain, and other forms of suffering (for example, stigma). 
Due to difficulties in quantifying the latter costs, however, they are 
often neglected in COIs, resulting in an underestimation of the true 
disease costs (Box 136.2). 

COIs, however, are limited regarding their interpretation and 
policy use in resource allocation. Bias towards already costly disease 
areas is likely to occur without further consideration of whether the 
current allocation is optimal and what kind of trade-offs may be done 
that represent good value for money and may contribute to the re- 
duction of both the disease and the economic burdens effectively. As 
mental health services are usually complex with multiple impacts, it 
is difficult to estimate their real cost-saving potentials without rele- 
vant considerations in a full economic evaluation framework. 


Cost analyses 


Comparative cost analyses are seen as partial economic evaluations 
[4]. Although, without information on outcomes, they are not 
useful in determining the cost-effectiveness of alternative options, 
cost analyses may be used to highlight the difference in costs be- 
tween two alternative interventions or services and are useful tools 
in health services research and planning. The perspective of the ana- 
lysis depends on the included cost categories. Budget impact analyses 
(BIAs) are specific cost analyses that address the expected changes 
in the expenditure of a health care system after the adoption of a 
new intervention or service in comparison to status quo. They pre- 
sent information on the relevant monetary impact and its diffusion, 
depending on the time horizon of the analysis and the included cost 
categories. BIAs are used for budget or resource planning by those 
who manage and plan health care budgets (for example, administra- 
tors of national or regional health care programmes, private health 


Box 136.2 Costs of mental health in Europe 


The total European cost of brain disorders in 2010 was estimated at 
€798 billion, of which direct health care costs accounted for 37%, direct 
non-medical costs for 23%, and indirect costs for 40%. The average 
cost per inhabitant was €5550. From these, the aggregated economic 
costs of depression were calculated at €92 billion, attributing to one- 
third of the total cost of brain disorders, and therefore being the most 
costly brain disease in Europe. The average total cost per patient for 
depression was estimated at €3034, of which direct health care costs 
amounted to €797 (26%) and direct non-medical costs at €454 (15%). 
With €1782 (59%), indirect costs were responsible for the highest pro- 
portion of costs per patient, making lost productivity the greatest con- 
tributor to the overall economic burden of depression and emphasizing 
its large societal cost [14, 15]. 
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Box 136.3 Costs of mental health in the UK 


A recent UK cost analysis estimated the mental health service use and 
cost impacts of remote mood monitoring of patients with bipolar dis- 
order introduced in a specialist mood disorders clinic via the True 
Colours system over 1 year before and after engagement with moni- 
toring. Average compliance with monitoring was over 80%. No sig- 
nificant changes in the annual use and costs of mental health services 
were found, except for patients who were newly referred to the clinic 
during the pre-monitoring period. Annual psychiatric medication costs 
increased, however, on average by £235 across all patients due to more 
frequent changes to psychiatric medications and higher levels of multi- 
drug use as a possible consequence of a tighter medication regime [17]. 

















insurance companies, health care providers). A BIA can be an inde- 
pendent analysis or part of a comprehensive economic assessment of 
a health care intervention, alongside an economic evaluation of its 
cost-effectiveness [16] (Box 136.3). 


Economic evaluations 


Techniques of economic evaluations 


The main question an economic evaluation tries to answer is: ‘Is an 
intervention or service cost-effective, that is, meaning ‘good value 
for money; in comparison to an alternative use of resources, based 
on the available evidence?’ For this, both costs and outcomes of com- 
peting alternatives have to be compared and an appropriate com- 
parator should be chosen. For example, when the cost-effectiveness 
of a new antidepressant medication is investigated, it is not sufficient 
to compare its outcomes and costs to that of a placebo, but rather to 
the current gold standard active antidepressant treatment options. 
Depending on the resource allocation question at hand, dif- 
ferent types of economic evaluations applying different types of out- 
come measures are appropriate [4] (Table 136.2). When there is 
good evidence that the compared alternatives are equally safe and 
effective, and therefore have identical outcomes, the question of 
finding the least costly option can be answered within a so-called 
cost-minimization analysis (CMA). Because of the inherent un- 
certainty around both the cost and outcome estimates, however, it 
has been questioned whether CMA can be seen as the correct ap- 
proach in any decision situation [18]. When competing alternatives 
within the same disease area are compared, it is appropriate to use 


Table 136.2 Types of economic evaluations 


ES Co 


Cost-minimization onetary uni (Preliminary evidence of equal 

analysis (CMA) outcomes) 

Cost-effectiveness onetary uni Natural units (for example, 

analysis (CEA) relapse prevented, life years 
gained) 

Cost-utility analysis (CUA) Monetary uni QALY, DALY 

















Cost-benefit analysis onetary uni Monetary unit 

(CBA) 

Cost-consequences onetary uni Multiple outcomes tabulated 
analysis (CCA) 


outcome measures that are common natural units meaningful for 
the given disease within a cost-effectiveness analysis (CEA). For ex- 
ample, for people with depression, meaningful outcomes could be 
expressed as relapse prevented or depression-free days. The pre- 
ferred outcome measure for economic evaluations that allows com- 
parison across disease areas is the generic quality-adjusted life year 
(QALY) used in a cost-utility analysis (CUA). QALYs combine both 
quantity of life and quality of life outcomes into a single metric. For 
this purpose, quality of life is measured in terms of the utility value 
of a given health state. The length of time an individual spends in 
a given health state is weighed by its utility value to calculate the 
QALY. Utility values of 1 indicate full health, and 0 indicate a health 
state value equivalent to being dead. Health states considered worse 
than being dead can be represented by negative values. Whether 
patients, experts, or the general public are best suited to perform 
utility valuations is an ongoing debate. From a social perspective, 
the general public is considered as the most appropriate group due 
to the dominantly public financing of health care systems and the 
need for unbiased valuations across a wide range of different dis- 
ease areas and health states. Other generic outcome measures pre- 
dominantly used in international CUAs are disability-adjusted life 
years (DALYs) and healthy year equivalents (HYEs) [19]. When 
the relevant resource allocation question crosses the boundaries of 
public policy sectors (for example, investment in better health care 
vs investment in better education), outcomes necessarily need to be 
measured in a metric that is common to all sectors, that is, in mon- 
etary terms within a cost-benefit analysis (CBA). It is also possible 
to report costs and outcomes of an economic evaluation in a dis- 
aggregated way in a tabulated format within a cost-consequences 
analysis (CCA) framework. In these cases, while information on 
the relevant ranking of alternatives is possible alongside different 
criteria, no single cost-effectiveness ratio is presented, leaving the 
overall synthesis and trade-off between different outcome attributes 
for the decision-maker thereby less explicit. More recent attempts 
for eliciting weights by analysts for different criteria are emerging 
within a multi-criteria decision analysis (MCDA) framework [20]. 

The resource allocation question at hand also determines the ap- 
propriate perspective of an economic evaluation, which, in return, 
defines the categories of costs (and types of outcomes) that need 
to be included in the assessment. Many stakeholders require an 
economic evaluation to be conducted from their relevant perspec- 
tive (for example, provider perspective, payer perspective). In the 
evaluation of mental health services (particularly with their typ- 
ically fragmented financing arrangements), such an approach is 
usually considered insufficient as large proportions of the relevant 
costs and consequences fall outside the health care sector, with im- 
pacts on social care, patients and their families, and other sectors 
of the economy (for example, employment, justice, education). 
The broadest possible perspective of an economic evaluation is the 
societal perspective, which includes all possible costs (and out- 
comes). Alternative perspectives can result in different conclusions 
concerning the cost-effectiveness of interventions and services. 
Therefore, it is not uncommon for an economic evaluation to pre- 
sent results from several perspectives (Box 136.4). 

Randomized clinical studies are a great source of collecting 
relevant cost and outcome data at individual patient level. Often, 
however, data for all relevant comparisons are not available dir- 
ectly. Furthermore, unless an economic evaluation is prospectively 


Box 136.4 Costs of community treatment orders in the UK 


A recent economic evaluation carried out alongside the OCTET Trial as- 
sessed the cost-effectiveness of community treatment orders (CTOs) for 
revolving-door psychiatric patients in the UK over 12 months [21, 22]. 
The primary cost-utility analysis used QALYs as outcome measure, while 
a secondary economic evaluation explored the OxCAP-MH capability 
index as outcome measure. The study found no evidence that CTOs have 
any significant impact on individuals’ health-related quality of life or cap- 
abilities, in comparison to the non-CTO group. The mean total health and 
social care costs per participant were also found not different (£35,959 
in the CTO group and £36,003 in the non-CTO group). Participants in 
the CTO group, however, had significantly higher indirect care costs than 
those in the non-CTO group (£6138 vs £2993) and significantly higher 
legal procedure burden. The overall results of the economic evaluation 
did not support the original hypothesis that CTOs result in reduced hos- 
pitalization costs or other health or social care cost savings, and showed 
that CTOs are unlikely to be cost-effective either from a health and social 
care perspective or from a broader societal perspective [22]. 

















planned alongside a clinical trial, information on important out- 
comes or costs is frequently not collected. It is also important 
that the analytical time horizon of an economic evaluation is long 
enough to capture all important cost and outcome impacts of the 
compared alternatives. This aspect becomes extremely influential 
when costs and outcomes occur at different time points, for example, 
when assessing the value for money of preventative programmes. 
The time frame of clinical studies is often too short to provide all the 
necessary evidence. A fourth potential limitation of clinical studies 
as data sources is the representativeness of their investigated patient 
populations to real life. Due to these limitations, evidence synthesis 
of cost and outcome data from multiple sources, including litera- 
ture, routine administrative databases, and expert opinions, within 
a decision analytic modelling framework often becomes necessary for 
economic evaluations [23]. 


Identifying, measuring, and valuing costs 


Costing in economic evaluations include the identification of the re- 
sources used as inputs in the provision of an intervention or service 
(for example, medications taken, days spent in hospital, GP visits), 
measurement of the actual resource use for each relevant category, 
and the valuation of these amounts by their respective unit costs (for 
example, cost per usual defined daily dose of an SSRI, cost of 1 day 
in hospital, cost of a GP visit). 

Information on individual-level resource use can be derived from 
routinely collected data (for example, hospital records). In addition, 
linkage of different routine databases across multiple providers may 
become necessary. Alternatively, patients, caregivers, or profes- 
sionals may be asked via specifically designed resource use ques- 
tionnaires and diaries. A comprehensive collection of such resource 
use measurement instruments is publicly accessible via the DIRUM 
(Database of Instruments for Resource Use Measurement) database 
(http://www.dirum.org/). When patient-level variability of resource 
use is of lesser importance, relevant estimates may be taken from the 
literature or expert estimates are used. 

Unit cost information for the valuation of health and social care 
resources may come from specific unit cost libraries, if available 
(for example, http://www.pssru.ac.uk/project-pages/unit-costs/), 
or be estimated anew using top-down or bottom-up (also called 
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micro-costing) approaches, depending on data availability and per- 
spective. Unit cost estimates usually include relevant cost categories 
for staff costs, capital costs, and overhead costs [24]. 


Identifying, measuring, and valuing outcomes 


In clinical practice, disease-specific and symptom-specific outcome 
measures [for example, Beck Depression Inventory (BDI), Brief 
Psychiatric Rating Scale (BPRS)] are routinely used to monitor and 
evaluate outcomes focusing on a particular disease or symptom area. 
For the purposes of economic evaluations where often outcomes 
have to be measured and compared across different disease areas or 
patient populations or within the general population, there has been 
a growing interest in more generic outcome measures that are able 
to capture health or broader social functioning and well-being [for 
example, Short Form Health Survey 36 (SF-36), EuroQoL (EQ-5D)]. 

Both disease-specific and generic instruments can be further clas- 
sified as patient-reported outcome measures (PROMs) when values 
for health state descriptions are directly reported by individuals 
who experience them, vs assessor-reported outcome measures when 
someone else, often a doctor, nurse or carer, reports the values. 
PROMs do not include biomedical measures or are used as specific 
diagnostic instruments in clinical practice. A great compilation of 
different PROM instruments can be found at http://phi.uhce.ox.ac. 
uk/home.php. 

Outcome measures can also be classified as non-preference-based 
or preference-based instruments. Non-preference-based instruments 
measure symptoms, disease progression, or health-related quality of 
life, alongside a predefined profile, and yield profile scores on the 
relevant domains without assessing the value individuals place on 
the different domains. Their applicability in economic evaluations 
is restricted, as their comparability is frequently limited, they are 
not linked to any underlying valuation of health states, the trade-off 
across different domains in the profile is difficult to interpret, and 
they do not provide a single score usable for the development of a 
single cost-effectiveness ratio. In contrast, preference-based instru- 
ments combine different dimensions of health to generate quality of 
life estimates for multiple health states, which are then valued to yield 
a single summary utility score [25]. When utility values from gen- 
eric preference-based outcome measures are not directly available 
for economic evaluations, statistical methods, known as ‘mapping, 
can be applied to predict health state utilities from disease-specific 
scales, given there is an overlap in content between the measures of 
interest [26]. 

There are several instruments and elicitation methods to esti- 
mate health state utilities. Among these, the EuroQoL EQ-5D is 
the most widely recommended tool for quantifying utilities and 
conducting economic evaluations [27]. There is, however, growing 
concern whether generic preference-based measures, including the 
EQ-5D, capture all important aspects of health-related quality of life 
for specific mental disorders [28]. Although there are studies that 
have reported that the EQ-5D demonstrates construct validity and 
responsiveness for common mental illnesses, such as depression, the 
results for severe mental illnesses, including schizophrenia and bi- 
polar disorders, are less convincing [29, 30]. 

In the light of the ongoing discussion on the appropriateness of 
the existing outcome measures for certain disease areas, including 
people with mental health problems, new approaches of measuring 
well-being based on the capability approach have emerged and been 
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increasingly applied in general outcome measurement in health eco- 
nomics. Instruments, including the ICECAP (ICEpop CAPability) 
and ASCOT (Adult Social Care Outcome Toolkit) questionnaires, 
have been developed to use in the health and social care sectors 
[31]. The OxCAP-MH (Oxford CAPabilities questionnaire-Mental 
Health), a 16-item PROM, has been specifically designed for out- 
come measurement in mental health [32]. The validation of the 
instrument with patients suffering from severe mental health prob- 
lems showed good feasibility, acceptability, content, and construct 
validities, including significant correlations with psychiatric symp- 
toms, social functioning, and other quality of life measures [33]. 


Cost-effectiveness decision framework 


Economic evaluations compare the difference in costs to the dif- 
ference in outcomes (effects) between two competing alternatives 
(alternative A vs alternative B) in the form of an incremental cost- 
effectiveness ratio (ICER). 

ICER = (CA - CB)/(EA - EB) 

Depending on the type of economic evaluation, the ICER may be 
expressed as the ‘incremental cost per case prevented’ (CEA) or as 
the ‘incremental cost per QALY’ (CUA). Including the inherent un- 
certainty in estimating the difference in costs and outcomes, there 
are nine possible decision scenarios that can emerge. These scenarios 
are listed and depicted on the cost-effectiveness plane in Fig. 136.1. 


e Scenario 1: The new intervention/service is more effective and less 
costly than the control one; the new intervention/service domin- 
ates and should be accepted. 

e Scenario 2: The new intervention/service is less effective and more 
costly than the control; the new intervention/service is dominated 
and should be rejected. 


e Scenario 3: The new intervention/service is more effective and 
costs the same as the control; the new intervention/service dom- 
inates and should be accepted. 

e Scenario 4: The new intervention/service is less effective and costs 
the same as the control; the new intervention/service is dominated 
and should be rejected. 

e Scenario 5: The new intervention/service is equally effective and 
less costly than the control; the new intervention/service domin- 
ates and should be accepted. 

e Scenario 6: The new intervention/service is equally effective and 
more costly than the control; the new intervention/service is dom- 
inated and should be rejected. 

e Scenario 7: The new intervention/service is more effective and 
more costly than the control; no clear decision. 

e Scenario 8: The new intervention/service is less effective and less 
costly than the control; no clear decision. 

e Scenario 9: The new intervention/service is equally effective and 
equally costly as the control; no clear decision. 


In case of scenarios 7 and 8, the cost-effectiveness decision de- 
pends on the threshold value (T) society is willing to pay or can af- 
ford for a unit of outcome. In England, the National Institute for 
Health and Care Excellence (NICE) established a range between 
£20,000-30,000 as the relevant cost-effectiveness threshold for 
one QALY gained [27]. According to this, interventions/services 
with a cost per QALY of below £20,000 would be funded; between 
£20,000-30,000, the funding decision would depend mostly on 
other decision factors (for example, availability of alternative treat- 
ment options, burden of disease); and above £30,000, it would be 
funded only in rare cases. Similar affordability criteria have been set 
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Fig. 136.2 Cost-effectiveness acceptability curve. 


up for developing countries by the WHO [34]. Here, interventions 
are considered very cost-effective if a DALY averted costs less than 
the GDP per capita, and cost-effective if a DALY averted costs less 
than three times the GDP per capita. Cost-effectiveness acceptability 
curves show the probability that an intervention is cost-effective, de- 
pendent on the threshold value (Fig. 136.2). These curves are based 
on the net benefit (NB), an alternative summary measure of the value 
for money of interventions/services derived at by the following cost- 
effectiveness decision rule: 


NB = T x (EA - EB) - (CA - CB) 


If NB >0, the new intervention/service is cost-effective. If NB <0, 
the new intervention/service is not cost-effective. If NB = 0, there is 
an indifference between the alternatives. A similar decision rule ap- 
plies to the results of CBAs. 


Applications 


Economic evaluations are seen by decision-makers as useful tools 
with growing importance due to the increasing need for resource 
allocation decisions. Many countries have now established organiza- 
tions to develop economic evaluations to inform policy-making either 
within or outside a health technology assessment (HTA) framework 
(for example, NICE for England, CADTH for Canada, IQWIG for 
Germany, CVZ for the Netherlands). A comprehensive list of rele- 
vant HTA organizations can be found under the umbrella organiza- 
tions of the International Network of Agencies for Health Technology 
Assessment (INAHTA) (www.inahta.org), Health Technology 
Assessment international (HTAi) (https://htai.org/about-htai/), and 
European Network for Health Technology Assessment (EUnetHTA) 
(https://www.eunethta.eu/about-eunethta/). Furthermore, the re- 
quirement to demonstrate cost-effectiveness evidence, in addition 
to quality, safety, and efficacy, for the reimbursement of pharmaceut- 
ical, medical technology, or biotech products has been introduced as 
a ‘fourth hurdle’ in many health care systems [35]. 

It is therefore not accidental that a rapid review of evidence on the 
costs and benefits of interventions in the area of mental health from 
Australia found that among the included 50 studies, pharmacological 
treatments were the most commonly studied interventions, followed 
by psychosocial interventions [36]. Only a few studies investigated 
the cost-effectiveness of employment programmes, art programmes, 
internet strategies, electroconvulsive therapy, discharge models, and 


joint crisis plans. Most studies were from the UK and adopted a health 
sector perspective, with only a small number considering other sec- 
tors (for example, housing, education, employment, or (criminal) 
justice sectors) and relevant intersectoral aspects. In terms of disease 
area, depression was the most frequently studied, followed by schizo- 
phrenia, anxiety disorder, conduct disorder, attention-deficit/hyper- 
activity disorder (ADHD), and panic disorder [36]. 

Another systematic review of published economic evaluations 
that compared antidepressant treatments in depression, based on 
database analyses and prospective clinical studies, identified 40 pa- 
pers that met the inclusion criteria [37]. Among these, a relatively 
large number of industry-sponsored evaluations of escitalopram 
were identified, which found escitalopram to be potentially cost- 
effective in depression treatment. Evidence of cost-effectiveness 
differences between other individual SSRIs was not unequivocally 
established. Inconsistent findings further emerged concerning the 
cost-effectiveness of SSRIs vs tricyclic antidepressants between 
retrospective database analyses and prospective studies [37]. 

Incorporation of economic analyses in evidence-based clinical 
guidelines is also becoming common practice. One of the first deci- 
sion analytic models developed for this purpose relates to the 2004 
NICE depression guideline [38, 39]. This evaluated the outcomes and 
likely costs of the first-line use of combination therapy of antidepres- 
sant medication with psychological therapy for moderate and severe 
depression in the UK. The study found that combination therapy 
is likely to be cost-effective for severe depression, with an average 
cost per QALY gained of £5777 (95% CI: £1900-33,800), while its 
cost-effectiveness is much more uncertain for moderate depression 
(average cost per QALY gained: £14,540; 95% CI: £4800-79,400). 

Economic evaluations may also be used to evaluate mental health 
programmes for whole populations and show how much of the 
burden may be averted by selecting cost-effective interventions. 
A comparative cost-effectiveness analysis of interventions for re- 
ducing the burden of major neuropsychiatric disorders formed 
part of the WHO CHOICE project (CHOosing Interventions that 
are Cost-Effective) [40]. The project assembled databases on cost- 
effectiveness of key health interventions in 14 epidemiological 
subregions of the world, using a generalized cost-effectiveness 
framework in which costs and outcomes of current and new inter- 
ventions were compared to ‘doing nothing. The costs and effective- 
ness of pharmacological and psychosocial interventions in primary 
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care or outpatient settings for psychiatric disorders were compared 
in a population model. Effects were measured as DALYs averted 
and costs in international dollars. Compared to no treatment, the 
most cost-effective strategy for averting the burden of psychosis 
and severe affective disorders in developing regions of the world is 
a combined intervention of first-generation antipsychotic or mood- 
stabilizing drugs with adjuvant psychosocial treatment delivered by 
community-based outpatient services. For more common mental 
disorders treated in primary care settings (depressive and anxiety 
disorders), the single most cost-effective strategy was the scaled- 
up use of older antidepressants due to their lower cost, but broadly 
similar efficacy to newer antidepressants. However, as the price dif- 
ference between older and generic newer antidepressants continued 
to narrow over time, generic SSRIs may represent the current treat- 
ment of choice. Proactive disease management, including long-term 
maintenance treatment with antidepressant drugs, also represents 
a cost-effective way of significantly reducing the enormous burden 
of depression in developing regions. As the authors point out, these 
findings provide relevant information regarding the relative value of 
investing in neuropsychiatric treatment and prevention, and so may 
help to remove one of many remaining barriers to a more appro- 
priate public health response to mental health needs [40]. 

In addition, economic evaluations can highlight the potential 
economic benefits of improved service and care provision at the 
population level. A global return on investment analysis for scaling 
up depression and anxiety treatment focusing on the benefits of in- 
creased productivity found a benefit-to-cost ratio of 2.3-3.0 when 
only economic costs and benefits were considered, and a ratio of 
3.3-5.7 when the intrinsic value of health gains were also included 
across the different country income groups [41]. 

Due to the growing number of published economic evaluations 
to guide resource allocation decisions, it is of utmost importance 
that the methods used and the standards of reporting are harmon- 
ized. International checklists to support the critical appraisal of the 
robustness of methods and results have now been developed and are 
used to guide publications standards as well [42, 43]. 


Conclusions 


Health economic analyses of mental health services are aimed to 
support needs assessment, service development, budget planning, 
and resource allocation decisions. Among these, economic evalu- 
ations provide information on the cost-effectiveness of competing 
alternative interventions and services. They are seen as useful 
decision-making tools with increasing importance worldwide. 

The rising disease and economic burden of mental health prob- 
lems, together with the growing economic evidence on simple cost- 
effective interventions with great return on investment potentials, 
put the scaling up of some mental health services high on the pol- 
itical agenda. Availability of current cost-effectiveness information 
reflects the increased submission requirements for the reimburse- 
ment of new technologies. As a consequence, evidence on the cost- 
effectiveness of preventative measures for mental health disorders 
remains limited, potentially shifting away the attention from such 
options. 

Although the rigour of the methods and reporting standards 
of economic evaluations have considerably increased since the 


introduction of relevant checklists, several methodological ques- 
tions persist. Outcome measurement for the evaluation of mental 
health services remains a major challenge. Recently growing con- 
cerns about the validity and responsiveness of common, generic 
quality of life measures for people with severe mental disorders 
point towards the need for alternative, broader well-being measures 
to be implemented in this context. Novel instruments using the cap- 
ability approach as their underlying framework have now emerged 
as promising new concepts. Due to the important broader societal 
consequences of mental health problems, including their negative 
impacts on families, it is also imperative to shift standard outcome 
evaluation from individual patients towards family units and pro- 
gress relevant methods. For the same reason, economic evalu- 
ations of mental health services require broad costing perspectives. 
Although the methodology of costing in economic evaluations may 
seem more straightforward than that of outcome measurement, 
internationally harmonized methods and tools for the identifica- 
tion, definition, measurement, and valuation of costs have received 
considerably less attention, and comparability among economic 
evaluations is often lacking in this respect. Not only is there a need 
for increased availability of harmonized national unit cost compen- 
diums for health and social care services internationally, the lack of 
similar comprehensive intersectoral unit cost information also re- 
mains a major challenge. 
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There are many proactive health care interventions that have 
proven to be effective with acceptable risk-reward ratios. Organized 
Western medicine, however, continues to focus on administering 
health care resources, particularly in hospital-based psychiatry, to 
those patients with advanced illnesses despite knowing about early 
manifestations of mental illness and associated disorders of person- 
ality and addiction. Consultation-liaison psychiatric services, where 
treatment may be considered reactive and minimalist, relative to 
what consult services within other disciplines believe is adequate 
care, are particularly vulnerable to glossing over early signs of illness 
due to the lack of mental health professionals in general hospitals 
and the gulf of understanding between hospital staff and psychi- 
atric service providers. A variety of problems flow from this type of 
practice. Practitioners from other medical disciplines and specialties 
are rarely well versed in accurately recognizing subtle and/or early 
signs of mental illness and emotional suffering. This is a matter of 
training and skill, but also a product of the way that many mental 
illnesses are conceptualized as a continuum of traits and qualities. 
As such, there is no clear line of demarcation between illnesses and 
normality. Consequently, medical practitioners may not understand 
or have adequate experience utilizing psychiatric treatments that 
may be appropriate for their patients. This may result in ‘psychiatric 
treatment nihilism’ or a phenomenon in which the non-psychiatric 
practitioner feels that there is little that can be done to help the pa- 
tient, partly because treatments are limited, but also because of a lack 
of skills and knowledge to deal with the clinical presentation accur- 
ately [1]. A sense of futility may ensue so that the practitioner calls 
for a psychiatric consult late in the hospitalization or does not call a 
mental health specialist at all. 

The appropriateness of the consultation request does not always 
match the putative patient. Some requests for services are inappro- 
priate such as in the case of a normal reaction to bad news or physical 
suffering that does not require the specialized skills and know- 
ledge of a mental health professional. This may happen while the 





undetected mentally ill patient suffers in silence in the absence of ap- 
propriate care. Nursing staff members are left to use ‘common sense’ 
to manage these patients, which may suffice when distress, and not 
mental illness, is the cause of a patient's symptoms. Common sense 
in the face of mental illness, however, is often uncommon among 
non-psychiatric hospital clinical workers. Therefore, it is inappro- 
priate for practitioners in disciplines other than mental health to 
decide whether mental health services should intervene during an 
episode of care for a somatic ailment. Non-psychiatric physicians do 
not always understand mentally ill patients nor do they have know- 
ledge of the resources that might be available to both patients and 
practitioners. It is thus understandable, but not entirely forgivable, 
why psychiatric patients often do not get maximal care for their psy- 
chiatric troubles when they are in a non-psychiatric setting in the 
hospital. 

Part of the responsibility for this lack of awareness about mental 
health issues redounds to the mental health practitioners who do 
not educate non-mental health providers and/or lack the ability to 
deal professionally with their skepticism about psychiatry. In that re- 
gard, mental health practitioners may need to be more assertive and 
proactive in advocating for these services. Chen, Evans, and Larkin 
[1] reviewed the literature from 1965 to 2015, on the reasons that 
hospital-based physicians are reluctant to refer to psychiatry. They 
noted that physicians frequently did not call for psychiatric consult- 
ations due to feeling uncomfortable with consultation-liaison psych- 
iatry (CLP). Operant factors that favoured psychiatric referrals were 
a dedicated CLP service, an active consultant, and physicians who 
were comfortable with psychiatric patients, as well as with medically 
ill patients. 

In this chapter, we address psychiatric consultation services for 
early intervention, review conceptual and practical challenges, and 
present where to find more detail about efforts of early intervention. 
We distinguish the difference between proactive and preventative 
services by the following—‘proactive’ refers to activity on the part 
of the practitioner that is designed to eliminate or arrest the pro- 
gression of an already recognized condition and/or illness, whereas 
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‘prevention’ entails activities that keep an illness or a disability from 
being established. Common examples of preventative measures 
are vaccination programmes against infectious diseases, protective 
gear in athletic endeavours, or mandatory seat belts in automobiles 
to prevent traumatic injury. We review successful non-psychiatric 
proactive health care interventions to establish universal structural 
factors that need to be considered in any kind of proactive or pre- 
ventative intervention. For ease of presentation, we will use the term 
‘early intervention to describe preventative and/or proactive efforts 
together. While we propose a clear-line distinction between pro- 
active and preventive, in practice, this distinction may not always 
be discernable. 

Prevention as a concept in medicine has been modernized 
by an Institute of Medicine work group [2], which divides it into 
three subcategories: universal preventive interventions, selective 
preventive interventions, and indicated preventive interventions. 
Universal preventive interventions are targeted to the public or to 
whole population groups that have not been identified as having the 
illness in question but perhaps have associated increased risks for 
the disorder(s) being considered. In this instance, there would be no 
discernable, definitive evidence of the presence of the condition(s) 
in question, but there would be a perceived risk or vulnerability. 
Universal interventions are typically characterized by low risk 
and low cost. The annual flu shot and other vaccinations urged 
upon populations would be examples of universal interventions. 
Psychiatry does not have a disorder that clearly fits into this category. 

Selective preventive interventions refer to operations and inter- 
ventions for subgroups of the population whose risk for developing 
a disorder is higher due to present known individual traits and char- 
acteristics or social risk factors. Such a group might be the non- 
symptomatic carriers of a recessive gene for the development of a 
severe disorder such as Huntington’s disease or coal miners who, due 
to their line of work, are at higher risk for developing black lung dis- 
ease. The risk-reward ratio of the intervention and the likelihood of 
developing the condition dictate whether an intervention may be 
worth implementing. 

Indicated preventive interventions are for high-risk individuals 
who are identified as having minimal, but detectable, signs or symp- 
toms which foreshadow the expression of a disorder. Biological 
markers or symptoms that signal a predisposition for a mental dis- 
order in the carrier, but do not meet diagnostic criteria for the dis- 
order at the current time, would also fall into this group [2]. Indicated 
preventive interventions may be worthwhile, even if intervention 
costs are high and the intervention entails some considerable ex- 
pense or risk to the patient, relative to the possible consequences 
of not providing the intervention. Interventions for conditions that 
demonstrate that a disorder is present should be considered treat- 
ment, rather than prevention. 

Proactive interventions in psychiatry, and perhaps for many 
medical conditions, fall somewhere between treatment and indica- 
tive preventive interventions. However, psychiatric conditions pre- 
sent conceptual, as well as practical, challenges to this cartography 
of intervention in that the present diagnostic system in psychiatry 
is largely based on descriptive accounts, rather than precise aetio- 
logical, biologically based characteristics. Furthermore, as noted, 
psychiatric conditions are frequently expressed along a spectrum 
of the illness in question, so that there may be several conditions 
that create suffering and disability but do not rise to diagnosable 


disease proportions. There may be ambiguity as to whether a person 
is diagnosable as mentally ill or not. Additionally, some conditions 
have similar characteristics, which may result in different providers 
diagnosing a person differently over time. Finally, some normal 
symptomatic conditions are difficult to discern from mental illness, 
such as in the case of grief which may, in some cases, be indistin- 
guishable from depression. These conditions can be easily con- 
fused with pathological states if the historical data are not clear or 
understood. 

Psychiatric conditions are often chronic, beginning slowly and 
eventually manifesting as peaks and valleys of symptoms and mor- 
bidity, interspersed with periods of relatively normal functioning. 
Experienced psychiatric clinicians can usually identify prodromal 
symptoms and recognize deterioration in clinical status and capacity. 
Yet it may be difficult to differentiate between illness and normal 
variation in many circumstances, particularly when a clinician is 
without a longitudinal view of the patient or significant training for 
recognizing mental illness. 

Diagnostic uncertainty in psychiatry has, in part, added to the lack 
of preventive prescriptions that are feasible in effectiveness, cost, or 
secondary risk, to which the interventions may expose the patient 
and/or others. In most of medicine, with the exception of trauma, ser- 
ious diseases may present with minor symptoms or findings that are 
frequently ambiguous in terms of their implications. Physicians and 
patients are left to opt for a wait-and-see approach, to push aggres- 
sively for a definitive diagnosis immediately despite economic cost 
and/or personal risks, or to pursue a course of assessment and treat- 
ment somewhere in between these two polar positions. To illuminate 
the possibility of proactive and/or preventive measures in mental 
health, we review some of the literature on early interventions in other 
fields. In doing so, some of the conceptual considerations for proactive 
and preventive approaches in psychiatry will be brought to light. 
Desan et al. [3] similarly described a new vision of liaison psychiatry 
for the future, in which psychosomatic medicine will be proactive and 
increasingly integrated into specialty and general medical care. 


Structural examples of ‘early recognition’ 


interventions in clinical care 





A review of the literature on early intervention revealed some 
common structural features of successful programmes (including 
assessment). Certainly, an important feature is a clear concept of 
the illness/disease in question, both in its treatment as well as in its 
morbidity and mortality. Of particular importance is its manner 
of early expression. In other words, detectable features (signs and/ 
or symptoms) must be present for early detection and subsequent 
treatment to be applied effectively and appropriately. There must 
also be a means of detecting and recognizing the early signs in the 
form of a screening process that has acceptable accuracy and ease 
of use and that poses no threat to the safety of the patient. For some 
diseases, this may be screening for biological markers, but in psych- 
iatry, markers will inevitably be historical facts, present symptoms, 
and/or behaviour (which includes thoughts, intentions, emotions, 
etc.). Standardization of these data is important to understand, as 
it is through standardization that we can compare outcomes and 
course, as well as risk and results. Ideally, screening is standardized 
across a variety of barriers such as time, place, condition, and user. 
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iew of proactive treatments 


One form of proactive treatment that has gained traction is the rapid 
response team (RRT). Most of these teams are used to counteract 
extreme illnesses such as cardiovascular events and other med- 
ical emergencies that respond to highly specialized technical care. 
Garretson et al. [4] offered that the use of rapid assessment breaks 
traditional hospital hierarchies or silos that delay care, noting that 
the role of the team is to assess the patient, assist the bedside nurse 
in providing the most appropriate care for the patient, and deter- 
mine whether the patient needs a higher level of care and where that 
care can be provided. The effectiveness of RRTs has been measured 
by reduced unplanned intensive care unit (ICU) admissions [5, 6], 
reduced incidence of cardiac arrest and mortality [7], and increased 
patient, family, and staff satisfaction [8, 9]. These teams adopt a pro- 
active approach, which contrasts with the reactive nature of conven- 
tional care such as traditional cardiac arrest teams [10]. 

One modification of the RRT is a ‘rover team’ which identifies 
at-risk patients and facilitates the prompt administration of time- 
sensitive therapies [11]. This proactive approach provides a critical 
care resource and improves clinically important outcomes, at a lower 
level of cost than a reactive approach. One randomized controlled 
study measured the efficacy of a proactive infection control team 
[12]. The team visited patients on the study wards daily and identified 
risk factors for developing health care-associated infections (HAIs). 
If risk factors were identified, the team worked with the treating 
clinicians to eliminate those risk factors, which resulted in a signifi- 
cant decrease in the incidence of HAIs. Another quasi-experimental 
study [13] found that a proactive response system identified subtle 
signs of deterioration faster than a control unit without a system of 
proactivity. This experimental unit used the physiological measures 
of patients to generate an alert for early warning signs of patient de- 
terioration. These measures were updated regularly throughout the 
day, and if a patient generated a score above a set threshold, an RRT 
was activated. This is an example of delivering expedient and appro- 
priate expertise to patients in need. On the back end of hospitaliza- 
tion, when cost and oversubscription become a problem, a proactive 
palliative care nurse consultation was shown to reduce the length 
of stay (LOS) in a medical ICU by almost half (8.96 vs 16.28 days; 
P = 0.0001) without any impact on mortality or other salient clinical 
measures [14]. In yet another study, proactive consultations for geri- 
atric patients undergoing repair of hip fractures resulted in an 18% 
reduction of post-operative delirium [15]. Additionally, proactivity 
has been found to affect patient and staff experience, with one pro- 
active approach to hourly rounding by nursing staff resulting in im- 
proved patient satisfaction and staff morale [16]. 

In several studies of medical rounding, researchers pursued a pro- 
active approach through the inclusion of non-conventional profes- 
sionals. These professionals were sought for their expertise and skills 
which were employed to help detect signs of problems that might 
cause complications in patients’ hospital stays and to offer solutions. 
In an innovative proactive approach, investigators found that the in- 
clusion of a clinical medical librarian as a part of the medical team 
during daily rounds helped to improve the performance of the team, 
as measured by lower LOS and lower readmission rate [17]. The li- 
brarian helped to clarify clinicians’ questions immediately during 
rounds or after through prompt email responses. Shortened LOS 


and lower readmission rates were also the outcomes of a multicentre 
quasi-randomized controlled clinical trial that included a pharma- 
cist on a medical team in two internal medicine and two family 
medicine services [18]. The pharmacist helped to resolve drug- 
related issues and make suggestions for improvement by reviewing 
patients’ medication histories and participating in rounds and dis- 
charge counselling. Another example is the development of a new 
system of multi-disciplinary rounds [19]. A team of clinicians from 
medical wards, as well as other specialty services, designed a new 
process for performing rounds that targeted care inefficiency, poor 
communication, and redundant documentation. The new standard- 
ized method of rounding increased the frequency to daily, included 
all disciplines involved in patients’ care in the interaction, and made 
workloads more predictable by requiring orders to be written during 
the rounds. The new method of rounding resulted in both decreased 
cost and decreased LOS in the treatment groups, as compared to 
controls. 

This review of proactive services for disciplines outside psych- 
iatry illuminates the critical design features that a proactive service 
might contain. Fig. 137.1 outlines the eight proposed critical design 
features. Perhaps the most important design feature for a psychi- 
atric proactive approach is that it must comprise clinicians from 
various disciplines (at least nurses, physicians, and social workers). 
Additionally, these clinicians should be flexible and devoted, en- 
quiring practitioners who are looking for the right clinical answers 
to the patients’ troubles. Having multiple practitioners who overlap 
in tasks and skill sets offers more flexibility and depth in personnel, 
and knowledge. A second design feature is the availability and clear 
definition of discoverable facts within the patients’ narratives and 
behaviours that may be harbingers of mental illness. Screening al- 
gorithms serve this purpose well. Furthermore, practitioners must 
have the skills to detect and interpret these signs and symptoms. The 
third necessity for a successful team is for the clinicians to be well 
trained, so that they can interview patients efficiently and quickly 
develop an accurate idea of what may be wrong. The fourth feature 
is to have a standardized and thorough assessment which helps to 
ensure accuracy of diagnosis. The fifth feature is the availability and 
prescription of an effective treatment. The sixth is the follow-up, 
monitoring, and assessment of treatment delivery and effective- 
ness and assisting in post-hospitalization care. A seventh feature is 
that the practitioners should be fluent with each other's disciplines 
and aware of each other’s strengths and weaknesses. This allows for 
timely and unambiguous communications, as well as on-site educa- 
tion. And finally, there should always be an educational function and 
goal with non-specialty staff. 


Programme descriptions 


Desan et al. [20] reviewed the literature on the impact of consult- 
ation services on LOS and discussed the impact of a proactive ap- 
proach. In the review, ten studies were identified that demonstrated 
increased consultation rates, four of which demonstrated improved 
LOS. Twelve studies examined the salience of consultation to geri- 
atric patients, four of which demonstrated significant improvement 
in LOS. This review did not include the proactive psychiatric studies 
from Yale, Columbia, or Birmingham City Hospital. The review 
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Fig. 137.1 Design features of a proactive intervention programme. 


did include the proactive geriatric service described by Sennour 
et al. [21]. 

Three comprehensive models of proactive psychiatric services 
have been developed and reported in the literature in the last 7 years. 
All three models are general hospital-based and would qualify as 
‘indicative prevention/treatment approaches, as all patients within 
the total inpatient population on or within an organization (service, 
floor, unit, etc.) are screened for the presence of mental illness, sub- 
stance abuse, severe character disorders, or situational stressors. One 
is from the United Kingdom, and two are from the United States. All 
interventions came into being within a year of one another, though 
none were aware of the others. 

The innovative proactive Rapid Assessment, Interface, and 
Discharge (RAID) model in the United Kingdom was developed 
to serve the 600-bed City Hospital of Birmingham [22]. RAID is a 
rapid-response, ‘age-inclusive, comprehensive mental health inter- 
vention. During the study period, the team responded to 91% of ac- 
cident and emergency (A&E) service cases within an hour (average 
of 24 minutes), and 89% of general and specialty ward cases within 
24 hours (average of 16 hours). It also provided periodic training 
to the staff in these settings about the care of patients. The pres- 
ence of RAID increased the detection of mental illness and reduced 
readmissions. 

The assessment of the effectiveness of RAID entailed a novel 
approach of comparing populations in a before-and-after trial, in 
which patients were matched by risk factors and drawn to create 
three different analytic groups. The first group consisted of patients 
treated prior to the implementation of the RAID intervention and 
who had only received standard consultation services. The second 
group were the RAID recipients, and the third were those who were 
contemporaneous to the RAID recipients but had not received the 
intervention and were referred to as RAID-influenced. The clinical 


outcome variables were LOS, readmission rate, cost savings, and 
quality of care, as measured by the percentage of mental health diag- 
noses detected. The authors noted that they reduced the LOS, which 
equated to savings of 21-42 beds per day and reduced readmissions, 
saving 22 beds per day. The economic assessment carried out by the 
London School of Economics used the estimate of at least 44 beds 
per day, resulting in an estimated savings of £3.5 million. The au- 
thors believed this was a conservative estimate and that the actual 
savings would have been more like £4-6 million. During the inter- 
vention, the City Hospital was able to close 60 beds without cutting 
down on services. 

From the United States, Muskin et al. [23] reported on a ‘quality 
improvement program’ in which psychiatrists and internists co- 
managed patients with comorbid psychiatric and medical condi- 
tions. In a before-and-after trial on a single unit, they found that the 
intervention resulted in a statistically significant reduction in LOS 
of 1.19 days among patients, with an LOS of less than 10 days. When 
annualized, the total number of saved days was 2889. The estimated 
cost per day was $600, resulting in an expected annual savings of 
over $1.7 million per year. The intervention paid for itself, as the 
estimated savings were more than three times the cost of the inter- 
vention teamworkers’ salaries (2.5 full-time equivalent psychiatrists, 
in addition to a social worker). 

Aspects of the Yale proactive approach have been reported in three 
different publications from two different, but related, interventions. 
The first, a pilot and feasibility study [24] demonstrated the value and 
effectiveness of a psychiatric proactive approach, revealing potential 
clinical and financial advantages, as well as areas for improvement. 
This initial study was conducted in a 30-bed general internal medi- 
cine inpatient unit at Yale New Haven Hospital. In an A-B-A single 
case study design, the patients’ average LOS and consultation rates 
were examined in three different time frames. These were, in order, 
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the pre-intervention period in which there was an initial study of 
the LOS in the usual treatment phase of the study for 6 months, the 
proactive intervention period for roughly 5 weeks, and the 6-month 
post-intervention period in which the unit resumed the usual con- 
ventional consultations as the mode of psychiatric treatment for the 
patients. On this general service, among patients whose LOS was 
30 days or less, the intervention reduced the LOS by 0.9 days (from 
an average of 3.8 days, chi square = 6.38, df = 2; P = 0.04). The con- 
sultation rate was 22.6% during the 5-week intervention period, and 
9.3% and 12.0%, respectively, during the two control periods of be- 
fore and after the intervention. 

The intervention team screened 100% of all admissions, with the 
initial contact being within 34.6 hours of admission, while the con- 
ventional consultations occurred, on average, at 72.5 hours into the 
admission during the two control time periods (note that the average 
LOS for the unit which was 3.8 and 3.7, respectively, for the two con- 
trol times in the A-B-A comparison). The average length of the pro- 
active screening assessment was slightly less than 3 minutes. There 
were no discernable differences in age, ethnicity, or medical acuity 
among the three different patient groups upon which the conven- 
tional and proactive services consulted (N = 257, 62, 274, respect- 
ively). The screenings revealed that 51% of patients on the study unit 
had a psychiatric condition that required treatment or consultation. 
Substance abuse disorders accounted for 43%, the largest portion of 
the detected difficulties. Mood and anxiety disorders (30%), psych- 
otic disorders (13%), suicide attempts (7%), and delirium and de- 
mentia (6%) accounted for the rest. 

After the initial pilot study was completed, and the service was 
terminated due to a shortage of staff, we used the results and experi- 
ences of the staff and patients to create a proposal for an enhanced 
proactive service. In addition to the reduction in LOS, responses of 
the nursing staff on a satisfaction measure administered after the ini- 
tial study described high praise, with 80% of staff giving the service 
the highest rating possible. The medical team also expressed their 
pleasure with the increased presence of a psychiatrist [roughly 0.25 
full-time equivalent (FTE)] on the unit. The estimated financial 
benefit was roughly $828,722 ($237,286 for cost avoidance by reduc- 
tion of LOS, and $591,436 for revenue enhancement from backfilling 
the saved days). The only added cost was $56,550 (annualized) for 
the psychiatrist's extra time during the 5 weeks that the intervention 
was in place. The annualized net financial benefit was estimated to 
be $772,172, with a cost-to-revenue ratio of 4.2 (cost-effectiveness is 
calculated by adding the expenses of the intervention to usual costs 
and dividing that into the total dollars received). 

Given the promising LOS reduction, high staff morale, and good 
contribution to the bottomline of expenses and revenues, a case 
was successfully made for the implementation of another version 
of a proactive psychiatric consultation service. This version was a 
team approach that was given the name behavioural intervention 
team (BIT). This second iteration was implemented on three in- 
ternal medicine inpatient units (92 total patient bed capacity), 
with a team of a part-time psychiatrist (0.5 FTE), a full-time nurse 
practitioner (APRN), and a full-time social worker. We worked 
with a bio-psychosocial understanding of the patient’s hospitaliza- 
tion, and hence the skills and perspectives of the aforementioned 
various medical professional groups were essential [25-28]. We 
decided on a team approach, as there were multiple tasks and do- 
mains that required different skills and knowledge for a maximally 


effective hospital experience. Examples of such tasks are support, 
encouragement, reassurance, medication when needed, accurate 
diagnoses and formulation of their problems, education, social and 
family support, and the mental health practitioners’ support of staff. 
Again, we achieved a good reduction in LOS (0.65 days; P <0.02) in 
the before-and-after trial from the three general internal medicine 
units. We ran the comparison for the same 11 months in two succes- 
sive calendar years. There were 509 patients in the treatment group 
and 535 in the control group. The team not only screened 100% of 
the admissions, but also evaluated and cared for the psychiatric pa- 
tients who could not be effectively managed by the regular medical 
staff and provided education for the staff about these patients. The 
multi-disciplinary nature of the team also allowed the treatment 
team to educate the staff in a practical and individualized manner 
about the mental health and social issues of each patient. This type 
of education was termed ‘just-in-time education. The added dimen- 
sion of education enhanced the satisfaction of the staff and provided 
more skilful and effective interventions for patients. 

The economic benefits of the BIT were characterized by a reduc- 
tion in LOS, as well as the provision of added services and features as- 
sociated with caring for patients who were behaviourally challenged. 
The incremental costs of the team were compared to the incremental 
economic benefits to the hospital as a result of the team’s work. We 
conceptualized the economic benefit of the team’s work in terms of 
the overall cost of the patient’s hospitalization, as measured by the 
LOS and extra services such as sitters or other types of increased 
staffing. It was hypothesized that the intervention would have the 
greatest impact on LOS, readmission rate, and sitter use. There was 
an additional economic benefit of more filled beds which were made 
available by the increased efficiency of bed use (0.65 days of reduced 
LOS) without increased expense. Furthermore, when these patients 
with mental illness or psychological distress were given the care they 
needed in a timely, effective manner, both their medical and psycho- 
logical outcomes improved. At the time of this study, the hospital 
was reimbursed for medical services almost exclusively by a case 
rate methodology, as opposed to a per diem rate. We compared the 
experience of the three medical units in a before-and-after imple- 
mentation evaluation design of the intervention over successive and 
same 11-month calendar periods. 

The average LOS for patients in the two intervention periods are 
presented in Table 137.1 [27]. The before-period consisted of treat- 
ment as usual with reactive conventional consultation liaison (CCL). 
The after-period was when the proactive BIT was employed. For the 
LOS expense calculations, the comparison was based on a blended 
rate. Included in the expenses were sitter costs, as well as room and 
medication costs, and associated overhead allocations. For income, 
a blended rate that reflected the proportion of different payers was 
utilized. The details of this economic relationship are presented in 
detail in another publication [27]. In the pilot study of proactive 
consultation with only a psychiatrist providing the services for one 
internal medicine unit, the economic benefit or cost-effectiveness 
ratio was estimated, conservatively to be 4.2. The same ratio in this 
before-and-after implementation was 1.7, meaning that for every 
dollar spent over and above usual care, $2.70 were received. 

Table 137.2 [27] shows the direct cost metrics and comparison of 
the second study. The total saved cost for the intervention (not an- 
nualized) was $107,027 or a case rate of $210 saved on average per 
patient. There is also the opportunity for enhanced revenue by the 
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Table 137.1 Length of stay metrics for the second study 





Population CCL period |BIT period | Period effect 
N N Test statistic 
Mean Mean (df) 
(Gp) (9) P-value 

Patients with psychiatric 535 509 T=2.86 

intervention and LOS 

<31 days 7-29. 6.65 (1042) 
(5.76) (5.75) 0.004 

All patients with LOS <31 days 5158 5391 F=8.39 
4.98 4.68 (215, 457) 
(4.62) (4.38) 0.0002 

All patients 5251 5490 F = 6.90 
5.87 5.58 (215, 755) 
(8.9) (9.12) 0.001 


Reproduced from Health Econ Outcome Res., 2(4), Sledge WH, Bozzo J, White-McCullum 
B, et al., The cost-benefit from the perspective of the hospital of proactive psychiatric 
consultation service on inpatient general medicine services, pp. 122, Copyright (2016), 
Sledge WH, et al. Reproduced under the Creative Commons Attribution License CC BY. 


creation of empty beds. The value of enhanced revenue will vary by 
institution and setting. For instance, the freed-up bed space due to 
reduced occupancy from the shortened LOS allows for more rev- 
enue. Table 137.3 [27] summarizes the calculations for increased 
revenue realized due to the increased capacity created by the LOS 
reduction over 11 months. 

The actual realized income will vary, depending on the mech- 
anism of payment and the amount. For example, a case rate will be 
potentially more lucrative than a per diem rate, as the case rate in- 
centivizes efficiency and reduced LOS. Furthermore, the magnitude 
of benefit realized by increasing capacity will vary per rate of backfill. 
Table 137.4 [27] illustrates the change of revenue, depending on the 
percentage of occupancy rate. 

It should be clear there will likely be a substantial economic 
benefit from a proactive psychiatric consultation service, regardless 
of the payer, unless the utilization of beds is lower than the break- 
even point which, in our institution, is at 50% occupancy. The same 
will hold for revenue per case. Table 137.5 [27] summarizes the ex- 
pected consequences of various scenarios for revenue per case. 

Table 137.6 [27] summarizes the annualized costs of the pro- 
gramme in terms of salary expenses, incorporates the credits of 
reduced costs and enhanced revenue, and provides an integrated fi- 
nancial return on the BIT. 


Table 137.2 Direct cost comparisons 


Population Direct cost/case 


BIT, LOS <31 only 509 $6550 
CC, LOS <31 only 535 $6760 
Total cost per case difference 


BIT minus CL ($210) times 509 
cases 


($107,027) 


Reproduced from Health Econ Outcome Res., 2(4), Sledge WH, Bozzo J, White-McCullum 
B, et al., The cost-benefit from the perspective of the hospital of proactive psychiatric 
consultation service on inpatient general medicine services, pp. 122, Copyright (2016), 
Sledge WH, et al. Reproduced under the Creative Commons Attribution License CC BY. 


Table 137.3 Incremental net revenue (over 11 months) 


Population Patient days | ALOS Number of 
cases 





BIT or CL period 3, LOS <31 only 3383 6.65 

CL period 2, LOS <31 only 3902 VD 535 
Not BIT nor CL, LOS <31 only 67,240 466 14,416 
LOS >31 (that is, LOS outliers) 16,476 5529) 298 
Total 91,001 ETES 
ALOS difference (7.29 - 6.65) 0.65 

Patient days difference (0.65 ALOS ° 509 cases) 329.4 

Potential new cases (329.37 days/5.77 ALOS), 57.08 

assuming 100% backfill 

Net revenue per case $12,682 

Potential incremental revenue $723,889 





Reproduced from Health Econ Outcome Res., 2(4), Sledge WH, Bozzo J, White-McCullum 
B, et al., The cost-benefit from the perspective of the hospital of proactive psychiatric 
consultation service on inpatient general medicine services, pp. 122, Copyright (2016), 
Sledge WH, et al. Reproduced under the Creative Commons Attribution License CC BY. 





Clinical process and considerations for the future 


Clinically, in the 11-month trial, there were 5641 admissions to the 
three units. While we screened almost all patients on admission 
for the BIT trial, 945 patients, or 17%, were noted likely to be in 
need of psychiatric services. The incidence of the major diagnostic 
groups that required the attention of psychiatric services because 
their psychiatric condition was requiring psychiatric care was 63% 
(of the 945 screened); 52% required specialized (in mental health) 
discharge planning; 34% were seen for addiction; 18% were seen for 
delirium/dementia; and 17% had a behavioural issue that interfered 
with medical care [28]. 

The outcome of this study allowed us to implement the BIT on all 
medical units at Yale New Haven Hospital, York Street Campus, and 
on half of the medical units at our recently acquired nearby campus 
of the former Hospital of Saint Raphael. The multi-disciplinary 
teams perform the various clinical tasks noted in Fig. 137.1. Through 
screening 100% of the admitted patients, BIT staff quickly identify 
those who will require and/or benefit from a psychiatric consultation. 
This is accomplished by reviewing the medical and social histories 
provided in the patients’ electronic medical records within 36 hours 


Table 137.4 Incremental revenue based on bed demand: alternate 
estimates of incremental net revenue 


oe 


$723,889 100 57.08 $789,697 
$651,500 90 5137 $710,727 
$579,111 80 45.66 $631,757 
$506,722 70 39.96 $552,787 
$434,333 60 34.25 $473,818 
$361,944 50 28.54 $394,848 


Reproduced from Health Econ Outcome Res., 2(4), Sledge WH, Bozzo J, White-McCullum 
B, et al., The cost-benefit from the perspective of the hospital of proactive psychiatric 
consultation service on inpatient general medicine services, pp. 122, Copyright (2016), 
Sledge WH, et al. Reproduced under the Creative Commons Attribution License CC BY. 


1397 





1398 





SECTION 23 Service provision 


Table 137.5 Alternative estimates of average net revenue per case 





Alternate Change to Net revenue per 

estimates of netrevenue | case 

incremental net | per case 

revenue 

$868,667 110% $13,950 

$829,182 105% $13,316 

$789,697 100% $12,682 Our institution 

$750,212 95% $12,048 

$675,191 90% $11,414 

$573,912 85% $10,780 
Reproduced from Health Econ Outcome Res., 2(4), Sledge WH, Bozzo J, White-McCullum 
B, et al., The cost-benefit from the perspective of the hospital of proactive psychiatric 





consultation service on inpatient general medicine services, pp. 122, Copyright (2016), 
Sledge WH, et al. Reproduced under the Creative Commons Attribution License CC BY. 





of their admission and discussing the patients with the admitting 
nurses and/or attending physicians. Those patients who screen posi- 
tive for the likelihood of being able to benefit from a mental health 
consult are evaluated by the end of the second day of their stay. The 
‘treatment’ that they receive may be extra support and attention to 
their mental illness by either the APRN or the doctor, depending 
on the problem and/or disposition assessment by the social worker 
who evaluates the various requirements related to different social 
elements such as family, living arrangements, and work. Those who 
will be at risk for difficulty at discharge due to their mental health 
or social issues are evaluated early on in their hospital stay, so that 
any serious psychiatric issues can be resolved early enough to ac- 
cept an appropriate transfer to a psychiatric facility or discharge. 
They are evaluated by the psychiatric social worker on the team who 
works with the medical-floor social worker to fashion a suitable dis- 
position. The unit’s general medical nurses and the medical social 
workers are encouraged to support the patient in their capacity and 
look out for problems at disposition. If this is not enough to manage 
and treat the patient effectively, BIT members provide the mental 
health services and use that intervention as a teaching exercise for 
the staff on the units. 

The doctors on the BIT oversee the treatment plans and spear- 
head the resolution of medically related problems for disposition. 
The doctor is also the leader of the team. However, the team is in- 
tensely nurse-oriented, and the medical-floor nurses can ask for 


Table 137.6 Summary of net financial return on BIT 


Estimated financial benefit |11 months | 12months | 


Incremental net revenue from backfill $723,889 $789,697 
(filled at 100%) 

Overall reduction in cost/case for BIT $107,027 $116,757 
cases, including 

Reduction in use of sitters 

Subtotal, estimated financial benefit $830,916 $906,454 
Estimated additional expenses $306,230 $334,069 
Estimated benefit minus expenses $524,686 $572,384 


Reproduced from Health Econ Outcome Res., 2(4), Sledge WH, Bozzo J, White-McCullum 
B, et al., The cost-benefit from the perspective of the hospital of proactive psychiatric 
consultation service on inpatient general medicine services, pp. 122, Copyright (2016), 
Sledge WH, et al. Reproduced under the Creative Commons Attribution License CC BY. 


consultations and support from any one of the team members. In 
terms of our cartography of the interventions for prevention and 
treatment noted, this is universal in that all patients get screened 
and may be categorized into a group that needs treatment (at least 
25% of the patients have a history of mental illness). This includes 
our educational efforts with the nursing and medical staff at recogni- 
tion, diagnosis, and treatment. The next level of preventive interven- 
tion that makes this an ‘early’ or proactive intervention is that those 
who may have a pre-existing psychiatric condition or who demon- 
strate some likelihood of disruptive behaviour or suffer from mental 
illness, such as confusion, irritability, fear, or depression, are treated 
immediately with no lapse in their care, before their behavioural or 
mental health condition can become a disruptive factor. For those 
who are clearly manifesting psychiatric problems, their care is taken 
on immediately by the mental health professionals and their condi- 
tion elaborated and explained to the nursing staff as the character- 
ization of the intervention moves from selective to indicated. 

The team functions both as practitioners of psychiatric care for 
patients, but also as educators and assistants for the regular unit staff. 
BIT team members and floor staff co-monitor patients who are in 
the selective category of prevention and treat or supervise the care 
of those who are in the indicated category, while they monitor and 
teach the treatment given by the medical/nursing team with primary 
responsibility. The education can take many forms such as: formal 
and informal supervision, tutorials, case conferences, case reports, 
and orientation. One common form is the ‘just-in-time’ education 
previously mentioned. This is education in which staff observe the 
experts in action and may informally seek their advice. Tadros et al. 
described a similar interactive education provided by RAID and 
noted that the instruction of the RAID team members helped junior 
doctors and nurses to better manage difficult patients and more ac- 
curately identify those patients who should be referred to psychi- 
atric services [22]. Typically, the ‘just-in-time’ intervention will be 
triggered by a request from a medical or nursing staff member for 
advice on the care of a patient. 

The interventions employed are: (1) the assessment of risks and 
whether risk categories have reached the level of indicated treatment; 
(2) the prescription of care in the development of a behaviourally 
oriented treatment plan, at all times supporting the patient; (3) the 
provision of just-in-time education that increases the effectiveness 
of the floor staff in their interactions with patients and encourages 
them to try out the perspective and skills themselves; (4) the assess- 
ment of the family and subsequent discharge; and (5) the actual care 
itself for those difficult patients. The goal is to produce outstanding 
clinical care in the most cost-effective manner possible. 

At present, we have implemented the BIT only for internal medi- 
cine services on two of the hospital campuses. The programme is 
especially popular with nursing staff, as nurses are the hospital pro- 
fessionals most directly affected by the presence of the BIT. There 
are plans eventually to expand to cardiology, cancer, and appropriate 
surgical specialties. In each of these instances, we will take into con- 
sideration the dynamics of admission, treatment, and discharge, so 
that we can design a programme that functions well at the decisional 
points in the patients’ hospital career. 

Common sense tells us that proactivity and early intervention are 
clearly strategies that are effective in the face of the cascade effect 
[29]. In health care, the cascade effect is the process of deterioration 
of health when the resources available to the patient do not match 
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the needs of the patient’s environment, so that a feed-forward state 
ensues, in which unmet demands in one sector (that is, attending 
to medical appointments) result in an ever more serious and nu- 
merous unmet demands in other sectors as the person sacrifices 
resources from one sector (for example, saving money previously 
used for transportation) in order to correct unmet other demands 
(buying groceries). This cycle continues until the patient’s health be- 
comes unstable to the point where hospitalization or acute care is 
warranted. Effective holistic programmes are those that have a clear 
understanding of the costs and benefits of the intervention pro- 
gramme in its totality, so that the system in which the patient lives 
can be stabilized and recovery becomes an option for more people 
sooner in the cycle of their illnesses than would otherwise pertain. 


The future 


Other features that we intend to investigate and possibly develop are 
the integration of inpatient and outpatient experiences through the 
maintenance of continuity of care. We believe that integrating care 
should make the continuity of care between inpatient and outpatient 
more effective, which will further reduce the need for, and length of, 
hospital stays. In addition, we plan to document and better under- 
stand the role and value of informal consultations, or ‘curbside con- 
sultations’ which are not typically noted in the medical records and 
have no income associated with them. Nevertheless, they appear to 
be effective and valuable to the recipients. We believe that the RAID 
programme's educational effect is similar to what we call ‘just-in- 
time’ education. 

And finally, our ambition for the future is to create a consortium 
comprising a regionally based group of hospitals or other health care 
entities capable of serving large populations of patients and that can 
develop this model and share experiences, so that information, data, 
ideas, and experiences can be shared and a more substantial research 
programme can be developed to refine and pilot innovation. 
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This chapter will consider the mental health needs of refugees and 
other populations forcibly displaced because of exposure to mass 
conflict. Migration has been a hallmark of humanity over mil- 
lennia, the reasons leading individuals or groups to move being 
numerous and often multi-faceted. Migration can be forced or by 
choice or a combination of these factors; for example, poverty or 
natural disasters might lead a person or group to leave their home 
out of choice, but elements of compulsion can play a role such as 
severe food insecurity. This chapter will consider those obliged to 
leave their homelands for reasons of persecution and exposure to 
mass conflict, populations broadly referred to as refugees. The terms 
utilized to describe these populations are summarized in Box 138.1 
[1], highlighting the different groups to consider; however, for the 
purposes of this chapter, refugees will be used to describe this popu- 
lation, unless reference is made to specifically defined groups. 

The mental health of refugees continues to be an important area of 
clinical work and research, and one that highlights the complex inter- 
play among biological, psychological, social, and cultural processes 
in determining how individuals vary on a spectrum of adaptation to 
frank mental illness. The psychological impact on a person, family, 
and groups forced to migrate because of conflict can be considerable, 
the effects generally being negative, but, in some aspects, potentially 
positive. To leave an environment of extreme insecurity in order to 
reach a new country of safety is likely to enhance the mental health 
and well-being for many; the focus of this chapter, however, will be 
on the varied mental health risks associated with forced migration 
and the factors preceding and following that major event. We will 
describe the mental health impacts of forced displacement and the 
essential role that both previous and ongoing exposures to traumatic 
events, ongoing stresses, and broader psychosocial influences play 
in generating or maintaining the psychopathology and the appro- 
priate interventions that may assist in overcoming these adverse out- 
comes. Selective high-risk subpopulations will be described in more 
detail, including unaccompanied refugee minors, persons living in 
states of protracted insecurity, such as in refugee camps, women and 
their children, and those caught in cycles of violence, for example 





in situations where post-traumatic anger presents a risk to the indi- 
vidual, those close to them, and the wider community. 

The global pressures causing forced displacement show no signs 
of abating, including the Syrian civil unrest which has caused a sub- 
stantial movement of forcibly displaced populations. In 2015, the 
estimated number of people displaced by mass conflict reached 
65 million worldwide [2]. A total of 12 million were displaced in 
2015, with at least 40 million believed to be internally displaced. The 
majority of refugees live in protracted insecure situations, often in 
makeshift settlements in countries neighbouring the site of the con- 
flict, almost all in low-income countries. Only a small portion are 
able to travel to high-income countries. Table 138.1 lists refugees 
and asylum seeker populations by country of origin and destin- 
ation [2], and Fig. 138.1 is a graph showing the changes over the last 
15 years in the numbers being displaced across the globe. 





Refugee populations are exposed to a number of stressors that may 
impact on mental health, many of these factors operating simul- 
taneously or in sequence. Refugees commonly experience personal 
grief often complicated by the nature of the multiple losses they ex- 
perience, including murders, disappearances, kidnapping, atroci- 
ties, and sex slavery, and more general potentially traumatic events 
such as violence, torture, sexual abuse, and arbitrary incarceration. 
As a group, refugees experience major psychosocial and cultural up- 
heavals of having to leave their families, homes, communities, and 
work to seek personal security. Evidence suggests that two sets of 
factors are of key importance in impacting on the mental health of 
refugees: exposure to past and ongoing potentially traumatic events, 
and the complexities of navigating the post-migration environment 
as both individuals and groups [3-5]. These inter-relationships have 
been best described by Miller and Rasmussen (2016), from which 
Fig. 138.2 has been adapted [3]. 

The actual rates of mental illness identified among refugees vary 
by population, their exposures to potentially traumatic events, their 
current living arrangements (where and with whom they are currently 
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Box 138.1 Definitions of migrant populations and subgroups 





Migrant: A person who has moved across an international border or 
within a state away from their habitual place of residence, regardless 
of their legal status, whether the movement is voluntary or involuntary, 
what the causes for the movement are, or what the length of the stay is. 


Refugee: A person who, owing to a well-founded fear of persecution for 
reasons of race, religion, nationality, membership of a particular social 
group, or political opinions, is outside the country of his or her nation- 
ality and unable or, owing to such fear, is unwilling to avail themselves 
of the protection of that country; often strictly defined according to the 
1951 UNHCR Refugee Convention. 


Internally displaced person: A person who has been forced to leave 
heir place of habitual residence as a result of armed conflict, gener- 
alized violence, violations of human rights, or natural or human-made 
disasters, and who has not crossed an internationally recognized state 
border. 











Asylum seeker: A person who seeks safety from persecution or serious 
harm in a country other than their own and awaits a decision on the 
application for refugee status under relevant international and national 
instruments. 


Stateless person: A person who is not considered as a national by any 
State under the operation of its law. As such, a stateless person lacks 
hose rights attributable to protection of a State, no inherent right of 
sojourn in the State of residence, and no right of return in case he or 
she travels. 








Irregular migrant: A person whose movement takes place outside the 
regulatory norms of the sending, transit, and receiving countries and in- 
creasingly used for a person who has been smuggled or trafficked. 





Trafficked: A person who has been recruited, transported, transferred, 
harboured, by force or other forms of coercion, abduction, and decep- 
tion, to achieve one person having control over another person, for the 
purpose of exploitation. Trafficking in persons can take place within the 
borders of one State or may have a transnational character. 


Reproduced from Institute of Migration, Glossary on Migration, International 
Migration Law Series No. 25, Copyright (2011), with permission from 
International Organization for Migration. Available from https:/Awww.iom. 
int/key-migration-terms 











Table 138.1 Refugee data (2015) by country of origin and 
destination 


Top ten source countries 
of refugees 


Top ten refugee host 
countries 


Highest number of 
refugees per 100 
inhabitants in host 
country 





Syria Turkey (2.5 million) Lebanon 
Afghanistan Pakistan (1.6 million) Jordan 
Somalia Lebanon (1.1 million) Nauru 
Sudan Iran (979,400) Chad 
South Sudan Ethiopia (736,100) Djibouti 
Democratic Republic of Jordan South Sudan 
Congo 

Myanmar Kenya Turkey 
Central African Republic Chad Mauritania 
Iraq Uganda Sweden 
Eritrea China Malta 


Reproduced from UNHCR, Global Trends: Forced displacement in 2015, Copyright (2016), 
United Nations High Commissioner for Refugees. Available from http://www.unhcr.org/ 
576408cd7.pdf 


residing), and the long-term security of their place of residence. In 
general, rates of depression, post-traumatic stress disorder (PTSD), 
and anxiety disorders are high, compared to non-displaced popula- 
tions, with some suggestion that the prevalence of psychotic disorders 
is also increased [4, 6, 7]. Nevertheless, there is considerable hetero- 
geneity in reported rates of mental disorder across studies, most likely 
because of variation in sampling, measurement, and the character- 
istics of the groups under inquiry. The sociodemographic profile of 
the population alone may influence the overall mental health status 
of individual groups. In a meta-analysis published in 2005 on pre- 
and post-displacement influences on mental health, refugees who 
were older, more educated, and female, those who had higher pre- 
displacement socio-economic status, and persons from a rural back- 
ground had worse mental health outcomes [4]. Some studies have 
identified higher rates of somatization [5], and others have suggested 
an elevated suicide risk among refugees, especially when there are sig- 
nificant post-migration stressors [8]. Several factors have been shown 
to be associated with suicidal risk among refugees, including lack of 
employment, access to resettlement services and social support, dis- 
tress related to separation from families, past trauma exposure, and 
integration difficulties in the resettlement country. Discrimination 
against ethnic minorities is also associated with poor mental health 
among refugees, a society-wide problem that is of particular relevance 
to contemporary resettlement environments. These factors, along 
with the stigma that often accompanies mental illness, can complicate 
efforts to seek treatment after resettlement [8] (Box 138.2). 


Long-term outcomes 


The complexities of studying refugee populations are most evident 
when considering the array of long-term mental health implications 
associated with their experiences; although studies are limited, there 
is evidence that mental disorders, especially PTSD, tend to be highly 
prevalent in these populations, even years after resettlement [15]. As 
identified in a systematic review on studies of populations resettled 
for more than 5 years, this increased risk seems to be a consequence 
of exposure to both conflict-related trauma, post-migration socio- 
economic factors, and more general living difficulties [16]. The sys- 
tematic review identified 29 studies, of which only 13 were deemed 
of high quality. Consistent with past findings, there was substantial 
heterogeneity across studies in the prevalence rates of depression 
(range 2.3-80%), PTSD (4.4-86%), and anxiety disorder (20.3- 
88%). Nevertheless, prevalence estimates were typically in the range 
of 20% and above for any disorder, the lowest rates being found in the 
higher-quality studies. Descriptive synthesis suggested that greater 
exposure to pre-migration traumatic experiences and post-migration 
stressors, the higher the rates of all three categories of mental disturb- 
ance (PTSD, depression, anxiety), while a poor post-migration socio- 
economic status was particularly associated with depression. 


Impact of exposure to previous and ongoing 


traumatic events 





Impact of other post-migration stressors 


Specific post-migration displacement stressors that have been 
shown to influence mental health include social isolation resulting 
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Fig. 138.1 Graph to show rates of newly displaced persons from 2003 to 2015. 


from the loss of social networks [17], unemployment (either due 
to limited rights to work or because of local employment demands 
[17, 18]), poverty [19, 20], perceived discrimination [21], increased 
violence against women [22-26], and a lack of safety when living 
in refugee camps [20, 27]. These post-migration living difficulties 
prolong pre-existing feelings of insecurity and deplete the capacity 
of displaced persons to manage ongoing challenges, a compounding 
of prior conflict-related experiences and ongoing stressors, a com- 
bination of factors that increase the vulnerability to disorders such 
as PTSD [28]. 


A conceptual model for understanding 
the refugee experience 


When attempting to understand the full range of experiences of 
refugees, it is important to consider a number of interacting do- 
mains: early developmental experiences; the context and culture of 
origin; and the sequence of changes that have occurred through the 
phases of mass conflict, displacement, transition, and final resettle- 
ment [29]. There are multiple levels of influences (political, social, 
cultural, familial, and physical/biological) that impact at each phase 
on the adaptive capacity of refugees, their families, and the wider 
community; the outcome of these attempts to adapt may be positive 


or, in conditions of overwhelming stress, may result in severe distress 
and ultimately mental disorder. The Adaption and Development 
After Persecution and Trauma (ADAPT) model identifies five core 
psychosocial pillars challenged by the sequence of experiences that 
refugees encounter [30]. These pillars, which form the foundations 
of stable societies, maintain social cohesion, as well as individual 
mental health. Specifically, the pillar of safety and its maintenance 
are vital to the person’s sense of security and protection, with condi- 
tions of pervasive or recurrent threat generating anxiety and fear, the 
extreme outcome being disorders such as PTSD. Maintaining inter- 
personal bonds (nuclear and extended family, networks) is essential 
to mental well-being. Multiple threats to attachments and repeated 
traumatic losses can lead to severe distress, including symptoms of 
complicated grief and separation anxiety. The maintenance of roles 
and identities (cultural, social, family, personal, work-related) is vital 
to prevent alienation, isolation, and marginalization, a state referred 
to in sociology as anomie. Finally, the sense of meaning and coher- 
ence in life is critical to forming a positive view of the present and 
future, the existential domain which may be expressed in political 
values, religious beliefs, social and cultural affiliations, and/or spir- 
ituality. All five domains, and the institutions and social rules that 
support them, tend to be undermined by the refugee experience, 








Fig. 138.2 Relationship of armed conflict, displacement, and mental health. 


Adapted from Epidemiol Psychiatr Sci., 26(20), Miller K, Rasmussen A, The mental health of civilians displaced by armed conflict: an ecological model of refugee distress, 


pp. 129-138, Copyright (2017), with permission from Cambridge University Press. 
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Box 138.2 Assessment of refugees in mental health services 


In addition to the usual assessment required of any person referred to 
mental health services, there are a number of areas that it might be of 
additional importance to question. These include those identified in the 
ADAPT formulation described in the text, but the core elements of as- 
sessment are detailed here. 


Background history 


e Description of previous life before difficulties that led to 

displacement. 

e Sensitive enquiry of reasons as to why they had become dis- 

placed: any experience of incarceration, torture, abuse directed to 

hemselves or family members and other close associates, including 

sexual abuse. Significant losses experienced of family members, 

friends, and colleagues. 

e Enquiry as to how they arrived in their current place of resi- 

dence: how did they travel, were they placed in hands of strangers or 

raffickers, did they spend time in a refugee camp, experiences and 
exposure to abuses in these environments. 

e Their perceived roles in their families, previous occupation, and 
communities. 








Physical health 


e Review of physical health needs is important, as this can be over- 
looked. It should include chronic health conditions, oral health, and 
skin. The screening would need to include questions about previous 
health and exposure to any infectious diseases endemic in the coun- 
tries of origin and transit, as well as previous injuries [9-11]. 

e Previous head injury, especially if associated with loss of conscious- 
ness, is an important question to ascertain, as studies have demon- 

strated that this is highly prevalent in torture survivors and can have 

significant long-term psychological effects, including depression and 

PTSD [12-14]. 











Current situation 


e Living circumstances: where are they living; with whom; how many 

house moves have they had since arrival; are they able to afford food, 

clothing, other necessities; how do they spend their time (as those 
awaiting more permanent legal status are often not permitted to work); 
any new family stressors; any family members they are trying to find. 

e Immigration status and understanding of the process: these pro- 
cesses can often be complex and hard to understand, as information 
might not be available in the desired languages; legal representation 
can be hard to access without financial assistance, and there might 
be the constant threat of immigration detention and deportation 
back to either a country of transit or the country of origin. 

e Linguistic ability and access to interpreters: this is likely to be ap- 
parent in the interview. Mental illness can impact on a persons ability 
to learn the host language. 
































impacting at every level on the displaced community and exerting 
reciprocal effects on the individual capacity to adapt. The extent to 
which these disruptions can be repaired or accommodated will de- 
termine where individuals and their collectives are located on the 
continuum of adaptation and functional impairment, a failure of 
adaptation expressing itself in the individual as mental disorder. 
The ADAPT system can be used as a framework for undertaking a 
comprehensive assessment of the experiences and psychosocial re- 
sponses of individual refugees in clinical and other service settings. 
The model can also provide a framework for assessing the overall 
needs of refugee families and communities as a first step in for- 
mulating effective programmes of psychosocial and mental health 
interventions. 


Specific populations of note 


There are a number of subpopulations requiring special attention 
among the heterogenous population of refugees. 


Unaccompanied minors 


An unprecedented number of children and adolescents worldwide 
are forced to migrate on their own to escape war and persecution 
[31]. As minors, these individuals face extreme risks to their phys- 
ical and psychological well-being at an important developmental 
period [27, 32]. The trauma and hardships that accompany these 
experiences have potential to create prolonged mental health 
difficulties [33]. 

War exposures and ongoing social hardships can be complex and 
severe in nature for unaccompanied refugee minors. In contrast to 
single-incident trauma, war-affected youth can experience chronic 
exposure to traumatic events for weeks, months, or even years, often 
representing a significant proportion of their life [34]. Among the 
range of traumatic incidents common in war, children may witness 
violence, lose family members and friends, experience physical, 
sexual, and psychological harm, be exploited in various ways (in- 
volvement in criminality, prostitution, sex slavery), deprived of food, 
water, or shelter, or be compelled to inflict harm on others. These ex- 
posures have varying impacts for each child, but some experiences 
are particularly harmful [35]. For example, it is not uncommon for 
long-term shame and guilt to be experienced by some of these chil- 
dren, responses that are intensified by being forced to serve as ‘child 
soldiers’ and either witnessing or participating in breaking cultural 
taboos such as injuring family members. Managing the social stig- 
matization and isolation that can follow these experiences adds to 
the complexities of rehabilitation of these populations. 

Studies support the high-risk status of unaccompanied mi- 
nors. For example, the prevalence of PTSD was increased among 
unaccompanied minors resettled in Belgium, Norway, and the 
Netherlands [33, 36-38]. War exposure was positively associ- 
ated with depression among unaccompanied minors on arrival in 
Belgium [37] and 6 months post-arrival in Norway [36]. In a follow- 
up study, unaccompanied minors continued to exhibit high scores 
on anxiety, depression, and PTSD over an 18-month period, with 
negative outcomes predicted by the number of traumatic experi- 
ences and daily stressors experienced [33]. 


Living in prolonged insecurity such as refugee 
camp settings 


Refugee camps are settlements, usually built with the intention of 
being temporary, to receive forcibly displaced populations. However, 
many settlements have grown and become semi-permanent, re- 
quiring the development of systems of governance and civic in- 
stitutions [39]. Approximately one-third of refugees in protracted 
situations live in camps, some of which have been in existence for 
over 20 years. In 2015, the largest camps were in Kenya, with oc- 
cupants primarily from South Sudan and Somalia (for example, 
the population of Kakuma exceeds 180,000 and that of Hagaderaa 
more than 100,000); other large camps have also been established in 
Jordan for Syrian occupants [40]. 

Studies have highlighted the mental health risks ofliving in refugee 
camps [41, 42]. For example, among Rwandan and Burundese 
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refugees in a Tanzanian refugee camp, the prevalence of serious 
mental health problems was estimated to be 50% [43]. Children and 
young people living in camps (for example, in Central America, the 
Middle East, and the Former Yugoslavia) have higher rates of mental 
illness, compared to young people living in other transitional settings 
[27]. Refugee camps are often unsafe situations, placing children at 
risk of violence and abuse and consequent negative mental health 
outcomes. Neglect of basic needs, such as sanitation, parental dis- 
tress, high levels of poverty, and lack of access to education increase 
the risk of mental disorder—even though humanitarian organiza- 
tions and international bodies, such as UNICEF, provide substan- 
tial assistance in these settings [44]. Not surprisingly, therefore, a 
systematic review of 20 mental health studies of refugee/displaced 
youth residing in camps highlighted high rates of maladjustment, a 
large number of these young persons experiencing anxiety, somatic 
symptoms, depression, and aggression [45]. 


Women and families 


Violence against women is a global public health problem. There 
is a clear association between exposure to gender-based violence, 
including intimate partner violence, rape, and sexual assault, and 
mental disorders including PTSD, depression, and anxiety, as well as 
adverse psychosocial outcomes such as increased suicidal ideation 
[46]. Social and cultural factors influence the risk of violence against 
women. In particular, violence is higher in societies that are patri- 
archal, that is where authority in the household is reserved for men, 
and many contemporary settings of mass conflict involve popula- 
tions with these traditional customs. In these contexts, women are 
at risk because of the prevailing culture of patriarchy and the dis- 
ruptive and traumatic effects of mass violence [47]. The sequence of 
factors that increase the risk of intimate partner violence is complex, 
commencing with the increased risk among those exposed to early 
childhood abuse and violence (involving both sexes), experiences 
of war-related trauma, and conditions of extreme deprivation and 
poverty in the post-conflict environment, stressors that are common 
among displaced populations in low-income countries [48, 49]. 
Laws and customs that promote and protect women’s rights and en- 
titlements may also be regressive or undermined by war and con- 
flict. The society as a whole may have lost its regulatory capacity, for 
example to intervene in situations of family conflict, and the exigen- 
cies of survival and the stresses arising from these pressures may 
reinforce traditional roles, including gender-inequitable practices. 

War-specific factors can directly endanger women. The militar- 
istic culture associated with war, for example, can intensify women’s 
subordination and increase the risk of gender-based violence [50]. 
Rape is commonly used as a weapon of war and intimidation, leading 
to demoralization, humiliation, and isolation of women survivors 
and impacting more widely on men, families, and communities [25]. 
A recent meta-analysis of studies found that one in five refugee or dis- 
placed women in complex humanitarian settings have experienced 
sexual violence [51]. This is likely to be an underestimate, given the 
multiple barriers associated with disclosure, particularly in such un- 
safe settings. As indicated, gender-based violence is associated with 
a range of mental disorders, including PTSD, outcomes that further 
undermine women’s functioning and capacity to manage the imme- 
diate demands of the post-displacement environment [46]. 

Women are critically important to ensuring social stability and re- 
covery after conflict, both in the roles they play within the family and 


in guiding societies towards a return to peace and security [26]. Yet, 
qualitative studies confirm that women are often under-represented 
in decision-making in situations where they could provide a sig- 
nificant contribution to recovery and development. For example, 
women’s voices concerning the violent episodes in refugee camps are 
often muted and left unheard [50]. It is crucially important therefore 
to raise awareness of aid personnel and camp authorities to ensure 
that priority is given to promoting gender equality and to recog- 
nizing and addressing factors that may undermine women’s par- 
ticipation such as the presence of mental health-related disabilities 
as a consequence of gender-based violence. Gender equity promo- 
tion programmes and an explicit social and legal infrastructure that 
promotes gender equality should be prioritized to reduce violence 
against women and promote gender equality [47]. 


Cycles of violence and human rights abuses that can be 
perpetuated in peri- and post-migration settings 


There is emerging evidence indicating that under certain circum- 
stances, a post-conflict cycle of violence can occur within families 
and communities in displaced societies. The proposed cycle of vio- 
lence, partly supported by a growing body of research, proposes 
that there is a sequence of events in which adults exposed to trauma 
related to human rights violations are prone to outbursts of exces- 
sive anger in the post-conflict period, a sense of enduring injustice 
playing an important role in engendering this tendency. As a con- 
sequence, there can be inappropriate acts of anger and aggression 
within the family, triggered by relatively minor frustrations and 
conflict. Post-migration conditions of poverty and insecurity can 
exacerbate this tendency (see Fig. 138.3) for a conceptual model to 
describe these cycles of violence. In affected families, aggression may 
manifest as family conflict or in the form of intimate partner violence 
targeting women, and/or in harsh parenting in the upbringing of 
children. The long-term outcome may be the initiation of pathways 
in which the effects of trauma are transmitted to the next generation, 
impacting on their mental health and their tendency towards ag- 
gression and violence. A recent study identified a key pathway in this 
sequence in which male and female partners of survivors of trauma 
were more likely to exhibit high levels of grief and anger symptoms. 
Only women partners of men with high levels of trauma, however, 
showed an increase in PTSD, an important reason appearing to be 
that women tended to identify with the injustices experienced by 
their husbands [52]. 


Societal factors 
Detention 


In over 60 countries worldwide, including high-, middle-, and low- 
income nations, immigration detention is used in inconsistent ways 
to incarcerate some of the persons who have been forcibly displaced. 
In high-income countries, these practices are applied to selective 
groups of refugees such as those who arrive without an entry visa and 
others who are deemed not to have proved their claim for protection. 
Existing studies suggested that immigration detention is associated 
with poor mental health outcomes that can last for many years after 
release, the overall body of evidence indicating that the relation- 
ship is likely to be causal [53-56]. Several studies from the UK and 
Australia showed that the majority (if not all) children surveyed in 
detention centres meet criteria for at least one psychiatric illness 
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[54, 57]. The prevalence of disorder in these settings is remarkably 
high, with some studies showing a 10-fold increase in PTSD, anxiety 
disorders, depression, and sleep disturbances following detention 
[54, 58]. Factors such as ongoing uncertainty and stress associated 
with prolonged and indefinite detention, parental psychological dis- 
tress, disrupted peer and family relationships, exposure to further 
traumatic events, human rights abuses, and witnessing attempts at 
self-harm all increase the risk of mental disorder in detained chil- 
dren and adolescents [53]. In a study of adults who had been de- 
tained, both previous detention and ongoing temporary protection 
after release contributed to the risk of ongoing PTSD, depression, 
and mental health-related disability [55]. 

Providing mental health care in immigration detention facilities 
is associated with major challenges, with the compromise of human 
rights in these facilities serving as an ‘invalidating environment’ that 
potentially undermines the therapist’s attempts to provide psycho- 
logical assistance [59, 60]. 


Trafficking 


Human trafficking represents a further risk for refugees, with the 
lack of protection many experience making them prey to both forced 
labour and sexual exploitation [61, 62]. Studies indicate that traf- 
ficked women, men, and children experience high levels of violence 
and consequent physical health symptoms, including headaches, 
stomach pain, and back pain. The most commonly reported mental 
health problems include depression, anxiety, and PTSD [62, 63]. 
Self-harm among these populations is prevalent. There is a greater 
need therefore for professionals working with refugees to focus on 
these high-risk groups [64, 65]. 


Parenting difficulties and second-generation psychopathology 


The culture of the refugee parenting experience may be characterized 
by disruption and alterations to family structure and organization; 
cultural values and norms; and gender roles [66]. Trauma-exposed 
parents and children may also have problems with attachment and 


related parenting dysfunction [67]. Difficulties can arise in commu- 
nication because of language barriers which can place children with 
greater access to learning the new language in a premature adult role 
in the family. Further, differing beliefs and behaviours concerning 
child-rearing practices can lead to claims of child maltreatment, as 
well as inter-familial conflict [68]. The situation can become more 
complex when older children and parents adjust to differing de- 
grees to western cultural mores and norms regarding freedoms and 
rights [69]. It is important to affirm the positive parenting prac- 
tices of families in post-migration environments and ensure child 
safeguarding needs are addressed using culturally sensitive and 
engaging methods. Therefore, there needs to be specific attention to 
promoting the welfare of children within pre- and post-resettlement 
contexts [32, 66, 70]. 


Interventions 


The study of interventions for refugee and forcibly displaced popu- 
lations has grown substantially over the last decade, although only a 
few have been rigorously evaluated in randomized controlled trials 
[71-73]. This reflects many of the difficulties inherent in studying 
forcibly displaced populations where cultural, linguistic, financial, 
and practical problems abound [27, 74-76]. 

Refugees have diverse mental health needs, and in the ideal 
setting, there should be a range of networked services and agen- 
cies providing assessments and interventions for common mental 
disorders (depression, PTSD, anxiety), severe mental disorders 
(psychosis, schizophrenia, bipolar disorder) which are often neg- 
lected, ‘complex cases’ who may not respond to brief therapies, 
drug and alcohol problems, and organic disorders (often, by de- 
fault, including epilepsy in low-income settings). The heterogeneity 
of needs makes it difficult to draw general conclusions about the ef- 
fects of various interventions on a range of symptomatic and func- 
tional outcomes. The difficulty is to combine specific therapeutic 
components, for example, derived from cognitive behavioural 
therapy, in a broader and multi-level approach to psychosocial 
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interventions, for example based on the principles of the ADAPT 
model [77, 78]. Specific clinical interventions at the individual level 
include treatments for PTSD following multiple traumatic events, 
commonly applied approaches being narrative exposure therapy 
(NET), eye movement desensitization and reprocessing (EMDR), 
and trauma-focused cognitive behavioural therapy (TF-CBT) [79- 
82]. At a more general psychosocial level, several approaches are 
used focusing on groups, the community as a whole, the school, and 
the family [83-86] (Table 138.2). 

Studies of specific interventions usually targeting culturally 
homogenous client samples tend to demonstrate moderate to large 
outcome effects in relation to traumatic stress and anxiety reduc- 
tion [72]. Further work is needed to ensure that these interventions 
can be embedded and sustained within primary health care facil- 
ities, that workers receive adequate supervision to maintain skills 
and prevent burnout, and that the long-term effects of treatment are 
maintained. 

In designing intervention programmes, beliefs held by refugees 
about health care—its role, how to access it, and whether it can 
help—need to be a primary consideration. For example, a study 
among Somali women demonstrated how their beliefs focused on 
situational factors as determinants of mental health, in contrast to 
biological models that tend to drive interventions in Western medi- 
cine [92, 93]. These discordant health beliefs resulted in divergent 
expectations regarding the process and outcomes of treatment and 
health care interactions. Experiencing unmet and varying expect- 
ations, Somali women and their health care providers reported mul- 
tiple frustrations, which often diminished the perceived quality of 
health care. Moreover, during the process, previously silent worries 
about mental health and reproductive decision-making surfaced. 
To provide high-quality, transcultural health care, providers must 
encourage patients to voice their own health explanations, expect- 
ations, and worries. 

There is a consensus that in the resettlement environment, 
positive psychosocial outcomes for youth and adults depend to a 
great extent on the integrity and functioning of families. Yet few 


intervention programmes in mental health focus specifically on 
families in the refugee field. There is a pressing need therefore to 
devise and test mental health interventions that aim to prevent or 
lessen the effects of family dysfunction on individual mental health. 
Weine (2011) described eight characteristics that preventive mental 
health interventions should address to meet the needs of refugee 
families, including: feasibility, acceptability, culturally tailored, 
multi-level, time-focused, prosaicness, effectiveness, and adapt- 
ability [94]. To address these eight characteristics in the complex 
environment of refugee resettlement programmes requires modi- 
fying the process of research to introduce innovative strategies that 
build these principles into mental health services. Important prin- 
ciples are adopting a resilience (rather than an illness) framework; 
promoting community collaboration, participation, and leadership; 
and utilization of mixed-methods approaches, including focused 
ethnography. At a wider systems level, promoting preventive mental 
health programmes for refugee families requires appropriate sup- 
porting policy directives, multi-systemic partnerships, and training 
in flexible implementation research designs that ensure that innova- 
tive programmes are rigorously evaluated and the positive findings 
are captured and disseminated. 

In general, controlled studies among refugees and asylum seekers 
have reported positive intervention outcomes in reducing trauma- 
related symptoms [84]. For example, there is evidence to support 
TF-CBT and NET in certain refugee populations. Findings from 
other intervention studies are limited by methodological constraints 
such as lack of randomization, absence of control groups, and small 
samples. Further evaluations of the array of psychotherapeutic, 
psychosocial, pharmacological, and other therapeutic approaches, 
including psychoeducational and community-based interventions 
that facilitate personal and community growth and change, are 
needed [72]. In addition, there is a need to test the effectiveness of 
rehabilitation strategies for more complex cases and interventions 
for refugees with severe mental disorders such as psychosis. There 
is a need for increased awareness, training, and funding to imple- 
ment and assess broader psychosocial programmes (working in 


Table 138.2 Examples of mental health interventions for refugee populations 


Narrative exposure therapy (NET) Individual sessions provided by lay 


counsellors (6-10 sessions) 


Common elements treatment 
approach 


ndividual sessions provided by lay 
counsellors 


ndividual sessions (with or without 
interpreter) 


Cognitive behavioural therapy (CBT) 


Multiple-family group access 
intervention for PTSD 


ultiple-family group (nine sessions) 





Rwandan and Somali refugees in a 
Ugandan camp [79, 87] 


Significant reduction in PTSD 
symptoms for NET and trauma 
counselling groups. Remission of 70% 
ollowing NET 


Burmese refugees in Thailand [85] Significant reduction in depression, 
post-traumatic stress, anxiety, and 


aggression scores 








Positive PTSD outcomes in both 
groups (interpreter or not) 


ixed refugee group receiving CBT in 
London outpatient service [88] 


w 


Families from Bosnia-Herzegovina in 
Chicago [89] 


mproved access to mental health 
services 








Parenting intervention 


School-based theatre intervention to 
improve mental health and academic 
outcomes 


Classroom-based intervention (CBI) 


Parenting programme (eight sessions) 


Classroom-based drama workshop 


CBI for aftermath of exposure to 
potentially traumatic events (15 
sessions) 


African migrants and refugee families 
n Australia [90] 


ulti-ethnic high schools [91] 





Trials in conflict-affected settings 
(Burundi, Sri Lanka, Indonesia, Nepal) 
[44] 


Positive change in all parenting 
domains tested 


Significant decrease in impairment 
score for first-generation migrants and 
increase for second-generation 


Positive change in coping, hope, 
pro-social behaviour, and functional 
impairment 
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synchrony with clinical services) that collaborate with refugee com- 
munities in promoting adaptation during the stages of resettlement, 
programmes that require an innovative approach to ensuring that 
both the host and refugee communities actively contribute in recip- 
rocal ways to ensure successful integration. 

Notwithstanding the documented risks to mental health that 
the refugee experience generates, these adverse outcomes must 
be balanced against the potential positive adaptive outcomes 
among many displaced populations [95]. There is growing 
interest in issues of youth resilience and post-traumatic growth 
in the face of adversity; for example, many war-affected adoles- 
cents from Uganda did not display psychosocial distress 4 years 
after the war had ended, despite witnessing various atrocities 
[96, 97]. Nevertheless, although the relevance of a resilience- 
oriented approach is broadly recognized, there is little consensus 
about the definition of resilience and substantial variation in the 
operationalization and measurement of that construct, a chal- 
lenge that is increased in relation to ensuring the cross-cultural 
equivalence of relevant concepts [95]. A study of 26 qualita- 
tive studies exploring resilience in young refugees identified six 
sources of resilience: (1) social support; (2) acculturation strat- 
egies; (3) education; (4) religion; (5) avoidance; and (6) hope [95]. 
These sources indicated that both social as well as personal factors 
confer resilience in young refugees. Nevertheless, several funda- 
mental issues need to be clarified, including whether resilience 
is a latent capacity of the individual or is invested in the social 
sphere in which the person is embedded; and whether resilience 
is a unique positive characteristic of active adaptation and maxi- 
mizing the person’s potential, independent of mental disorder or 
simply the absence of the latter. 


Refugee communities and people displaced by mass conflict have 
been increasing in numbers over the last few decades. They can 
experience a potent mix of biological, psychological, and social 
stressors that can lead to increased rates of mental health problems, 
especially depression, anxiety, and PTSD. Furthermore, these popu- 
lations have increased exposures to a sequence of experiences of 
violence and abuse, both in their countries of origin, situations of 
transition such as refugee camps and even when reaching host coun- 
tries, within their own families and self-induced, in the form of sui- 
cidal acts. Refining a suite of broad psychosocial and more specific 
clinical interventions to address the varied needs of this population 
requires much further work, the focus being both on mitigating spe- 
cific mental health problems and preventing adverse society-wide 
outcomes, including potential cycles of violence in the family and 
the community. 
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The perception of an association between mental illness and anti- 
social behaviour has been widespread historically and influenced 
changes in mental health services and policy, including the devel- 
opment of secure hospitals and specific laws for mentally disordered 
individuals in the criminal justice system. Over the last few decades, 
research evidence has clarified the nature of this association and in- 
vestigated mechanisms. Most of the high-quality evidence for asso- 
ciations between psychiatric disorders and offending has come from 
epidemiology. Translating the epidemiological evidence into indi- 
vidual clinical assessment and decision-making remains a challenge 
for the field. 





Historically, many cultures have assumed a link between mental 
illness and criminality [1]. In the 1980s, this view was questioned by 
some research that suggested that schizophrenia and other psychoses 
did not, in fact, increase the risk of violence [2]. However, from the 
1990s, longitudinal population-based studies showed clear evidence 
of associations between most investigated psychiatric disorders and 
offending, and also violent behaviour [3]. The magnitude of this as- 
sociation varied from study to stud: 











from three inner cities in the United States 
ciation for a heterogenous group of diagnoses without substance 
misuse comorbidity [4]. However, the MacArthur stud 








and 
in particular severe mental illness, and strong links were reported 
between comorbidity and violent outcomes [5]. Ina 2009 systematic 
review of 20 studies examining the association between psychosis 
and violence, all included studies found a 







substance misuse [6]. Additional research using population-based 
sibling-control studies in Sweden [7] and Israel [8] has further 











strengthened the evidence of an association. However, any such 
reviews of association studies need to be viewed in the context of 
the risks of violent victimization to individuals with schizophrenia 
[9, 10], the population impact of a particular diagnosis such as 
schizophrenia on all violent crime, and how this population impact 
compares with other more common diagnoses. 

In addition, the robustness of the research evidence needs to be 
considered. Typically in this field, exposures and outcomes have 
been heterogenous. For exposures, different psychiatric disorders 
have been investigated, with varying approaches to determining 
diagnosis. In relation to outcomes, self-report, informant report, ar- 
rest, conviction, and imprisonment have been variously reported. 
Although the relative risks for self-reported and conviction data 
do not appear to differ in mental illness [11], absolute risks do. As 
any association between mental illness and offending is subject to 
a range of confounders and liable to reverse causality, longitudinal 
studies are key. 





Violent crime is particularly important from a public health and 
policy perspective, as itis less prone to measurement error than other 
violence outcomes, is associated with more morbidity and economic 
costs, and may be more generalizable than other outcomes. A con- 
sistent finding from population-based studies is that a significant 
proportion of violent offending is perpetrated by a relatively small 
number of individuals [12, 13]. Among the menta 





14, 15] and 





Mechanisms linking these disorders to violence have been pro- 
posed [19, 20] that has drawn on research examining damage 
to certain areas of the brain (especially the orbitofrontal region) 
[21], genetic research [22], neurochemical investigations [23], and 
imaging studies [24]. There is consistent evidence from animal 
models that 





[25]. In in 
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the blood of males (but not females) who have committed violent 
crimes have been demonstrated [26]. Serotonin is metabolized by 
monoamine oxidase, and it has been hypothesized tha 

ith an 
increased risk of violence [27], with a gene-environment interaction 
[28]. Other genes involved with serotonin metabolism have also been 
investigated such as in the metabolism of 
catecholamines) [29]. However, this research has been influenced by 
small studies and has not been replicated. Although violence is her- 
itable [30], genetic association studies have reported no consistent 
associations [31] and genome-wide association studies are now re- 

uired. Statistical genetic approaches have found 


[32]. 





Schizophrenia and other psychoses have been shown to be associated 
with an increased risk for a number of different crimes, including 
arson, drug, property, and violence [7, 33, 34]. Fo 











compared with 
population controls. However, absolute risks are low, and, for ex- 
ample, the annual risk for a stranger homicide in schizophrenia is 
not higher than one in 11 million annually in Western Europe [37]. 
For violence, the increased odds of violent crime range between 2 
and 6 in men, and slightly higher in women, and remains increased 
when compared with siblings who do not have schizophrenia and 
related psychoses [38]. The impact of environmental influences may 
vary with gender [39]. Overall, around one in 20 violent crimes is 
committed by individuals with psychosis [40]. 


Risk factors 


The strongest predictor of future violence in individuals with 
schizophrenia is a history of previous violence, es ecially convic- 
tions for violent crime [41]. Clinical factors me 

[42], particular if accompanied with anger [43] (consistent 
with earlier work [19, 44]). This was attenuated after treatment—a 
finding consistent with other work using novel within-individual 
designs that has found a reduced rate of violence in patients on 
antipsychotic medication [45]. Other possible mechanisms in- 
clude impulsivity, hostility, and lack of insight [41]. Poor adherence 
to medication and psychological treatments is another risk factor. 
A reliable finding is the effect of substance misuse [41], which acts 
as both a mediator and a moderator of the association and in homi- 
cide in individuals with psychosis [46]. Comorbidity with substance 
misuse increases the risk of violent outcomes 8-10 times, compared 
with the general population [7, 47]. Substance misuse may directly 
contribute to the risk of violent offending by decreasing adherence 
to medication, increasing impulsivity and hostility, and worsening 
certain psychotic symptoms, and indirectly through social networks 
and antisocial peers. Consistent with the importance of genetics and 
early environment, a review of risk factors for violence in psychosis 
found that a history of childhood abuse (OR 2.2, 95% CI 1.5-3.1), 
parental criminal history (OR 1.8, 95% CI 1.5-2.2), and parental al- 
cohol misuse (OR 1.6, 95% CI 1.4-1.8) were all significantly associ- 
ated with an increased risk [41]. No significant association has been 
found with many neuropsychological factors [41], including intelli- 
gence quotient (IQ) [47] (Table 139.1). 


Table 139.1 Strongest associations between risk factors and 
violence in individuals diagnosed with psychosis 



































Criminal istory of assault 21.4 (5.2-86.6) 
history ; ae 
istory of imprisonment for any 4.5 (2.7-7.7) 
offence 
Recent arrest for any offence 4.3 (2.7-6.7) 
istory of conviction for violent 4.2 (2.2-9.1) 
offence 
istory of arrest for any offence 3.5 (2.1-5.8) 
istory of violent behaviour 3.1 (2.2-4.4) 
Substance Comorbid substance misuse disorder 3.1 (1.9-5.0) 
misuse diagnosis 
istory of alcohol misuse D3\((1).7/=3:3)) 
istory of substance misuse (alcohol 2.2 (1.6-2.9) 
and/or drugs) 
Recent drug misuse 2A (RS all) 
Treatment on-adherence to psychological 6.7 (2.4-19.2) 
herapies 


“All significant to the P <0.001 level. 

Adapted from PloS One, 8(2), Witt K, Van Dorn R, Fazel S, Risk factors for violence in 
psychosis: systematic review and meta-regression analysis of 110 studies, e55942, 
Copyright (2013), PloS. Reproduced under the Creative Commons Attribution License 
CC BY 4.0. 





First episode 


Around a third of offending in schizophrenia occurs before any con- 
tact with mental health services [36], and approximately one in three 
individuals with first-episode psychosis exhibit some violent behav- 
iour [48]. Hence, early diagnosis and treatment are likely to reduce 
violence repetition. In addition, a longer period of untreated psych- 
osis is associated with increased risk [48]. Risk factors for violence 
in first-episode psychosis include hostility [odds ratio (OR) 3.5, 95% 
confidence interval (CI) 2.1-5.9], past violence or criminality (OR 3.3, 
95% CI 1.8-6.1), and manic symptoms (OR 2.9, 95% CI 1.9-4.4) [48]. 


Repeat offending 


A systematic review [49] and a recent longitudinal study [50] 
reported a clear increased risk of recidivism in individuals with 
psychosis. The latter study confirmed this association using sibling 
pairs released from prison who were discordant for mental dis- 
orders. This is in contrast to a previous review that included studies 
with heterogenous comparison groups, including some with per- 
sonality disorders and substance misuse, and therefore was poten- 
tially misleading [51]. Further evidence for mental illness as a risk 
factor for repeat offending was found in a large study of Texan pris- 
oners [52] . 


Affective disorders 


Studies investigating associations between affective disorders and 
violence are less frequent than those investigating psychosis [53]. 
Depression has a high prevalence in forensic populations, with up 
to one in ten prison inmates clinically depressed [54]. Early studies 
reported an association between depression and violence, but this 
relationship was mostly attenuated after adjustment for confounders 
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[11, 55, 56]. This was not a consistent finding, however—and the 
MacArthur study [4] and some case register studies [57] found links. 
New work has more clearly demonstrated an association between 
depression and violence, utilizing longitudinal designs and siblings 
and twins as genetically informative controls [18]. The latter research 
reported the odds of committing a violent crime in individuals with 
depression, compared with the general population, was around 
three times higher [18]. When depressed individuals were com- 
pared to their non-depressed siblings, the increased risk was 2-fold. 
The absolute rate of violence in men diagnosed with depression was 
3.7% over 3.2 years, higher than the rates of suicide (0.6%) and self- 
harm (3.3%). Rates in women were significantly lower, with 0.5% 
of women convicted of a violent crime after a depression diagnosis. 

Bipolar disorder is clearly associated with a range of adverse out- 
comes. A longitudinal analysis and systematic review in 2010 in- 
vestigated the risk of violent crime and bipolar disorder [17]. The 
30-year prevalence of violent crime convictions for people with a 
diagnosis of bipolar disorder was 8.4%, compared to the general 
population where it was 3.5%. When meta-analysed with eight other 
investigations, the increased odds of violent outcomes was 4.6 (95% 
CI 3.9-5.4). The impact of early environment and genetic factors is 
important in patients with bipolar disorder, and the rate of violence 
in unaffected male siblings has been reported to be 6.2% [17]. In 
terms of risk factors, no difference in violent crime risk was found in 
bipolar patients if their last episode was psychotic or non-psychotic, 
nor according to crude markers of disease phases. More sensitive 
markers of clinical presentation will be required, and some work has 
shown that links between mania and offending may be mediated by 
disinhibition, irritability, and impulsivity [58]. 


Substance misuse 


Goldstein proposed three mechanisms through which substance 
misuse may cause violent behaviour—directly or ‘psychophar- 
macological violence, or more indirectly via ‘systemic’ or ‘economic 
compulsive’ paths where situational and environmental factors ‘im- 
pact on behaviour’ [59]. Clinical studies focusing on just the ‘psy- 
chopharmacological’ pathways are complicated by confounding, 
comorbidities, and clarifying temporal and spatiotemporal relation- 
ships [20, 60, 61]. Risk factors for substance misuse and psychiatric 
illness overlap and have varying impacts at different points along 
the life course, and many studies rely on self-report. Polysubstance 
misuse is common, especially in individuals with serious mental 
illness—the odds of having another drug use disorder if also suf- 
fering from alcohol use disorder is around 4- to 5-fold, compared 
to those not using alcohol [62]. Despite all these difficulties, associ- 
ations between violent offending and a history of substance misuse 
has been clearly established [20, 63]. 

The evidence for an association between alcohol use and vio- 
lence is relatively strong. Studies have shown that the greater the 
number of symptoms of alcohol dependence, the higher the risk 
of violent offending [64]. The Dunedin birth cohort study found 
that 11% of the violence risk was attributable to alcohol depend- 
ence, with an adjusted OR of 3.4 (95% CI 2.0-5.9) for violence 
in those with significant alcohol use, compared to non-drinkers 
[11]. Other studies suggest a dose-response relationship, with 
weekly binge drinking associated with an increased risk of crime, 


compared to monthly binge drinking [65]. In a case-crossover 
study, alcohol consumption in the previous 24 hours was asso- 
ciated with a relative risk of violence of 13.2 (95% CI 8.2-21.2) 
[66]. Individuals with a history of alcohol misuse were respon- 
sible for 16% of violent crime in Sweden [67]. 

Studies looking at individual substances are fewer in number and 
poorer in quality [68]. A 2016 review of studies of controlled sub- 
stances and violence identified 22 relevant prospective studies, all of 
which had a moderate to serious risk of bias [68]. Eight studies inves- 
tigated marijuana and showed either an increased risk or no change 
in risk of violence [68]. Studies of other illegal drugs have typically 
reported similar mixed results, with some with increased risk and 
others with no association. Data on barbiturates and tranquillizers, 
however, showed negative associations with some self-reported 
violence outcomes [69]. In an investigation based on an American 
inner-city population [69], other illegal drugs (including amphet- 
amine, crack cocaine, opiates, and tranquillizers) were also associ- 
ated with mixed results, based on self-reported violence outcomes. 

There are few studies that have looked carefully at temporal rela- 
tionships between substance misuse and violence. One report exam- 
ined the odds of committing an act of serious violence the day after 
substance or alcohol misuse [70]. Consistent with other studies, it 
found that violence on day 1 was the largest risk factor for violence 
on the following day. It also found that alcohol use increased the 
odds of serious violence on the following day (OR 2.4, 95% CI 1.8- 
3.2). A possible effect of drug use alone (other than marijuana) could 
not be excluded (OR 1.5, 95% CI 0.8-2.8). Similar to many studies in 
this area, information on drug use relied on self-report. 


Personality disorders 


A number of studies have investigated links between personality 
disorder and offending [71, 72]. Birth cohorts and population- 
based studies have shown associations, which tend to be stronger 
for Cluster B disorders (antisocial, borderline, narcissistic, and his- 
trionic). In a meta-analysis, Yu et al. [16] found that 10.7% of in- 
dividuals with a personality disorder diagnosis were violent over 
a median follow-up period of 4.5 years, compared to 1.2% of gen- 
eral population controls. This corresponded to an increase in risk 
(OR 3.0, 95% CI 2.6-3.5) of violent outcomes in individuals with a 
personality disorder diagnosis. Antisocial personality disorder was 
associated with the greatest increase in risk, with elevated odds of 
violence of more than 10, although the individual estimates con- 
tributing to this pooled estimate were mostly unadjusted. When 
studies that excluded patients with antisocial personality disorder 
were compared, the risk of violence remained significantly raised. 
There was no significant difference between genders in studies with 
samples that included all personality disorders. No explanation was 
found for variations between studies when age band, comparison 
group, or diagnostic criteria were considered. Overall, it was esti- 
mated that around a fifth of all violence could be attributed to indi- 
viduals with personality disorder if causality was assumed, and the 
prevalence of personality disorder in the general population is typ- 
ically between 5% and 10% [16]. 

Personality disorder is also The 
odds of any criminal recidivism is doubled in individuals with a per- 
sonality disorder diagnosis [16], with little difference in risk if the 
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comparison group comprised individuals with other psychiatric 
diagnoses or the general population. Studies have found that around 
two-thirds of individuals with a personality disorder diagnosis will 
reoffend, with no difference between different personality disorders. 
As individuals with personality disorder often have comorbid dis- 
orders, future research should account more carefully for this. 





Neurodegenerative disorders 


There are few high-quality studies that estimate the risk of offending 
and violence in neurodegenerative disorders. One of the most 
common behavioural manifestations of dementia is aggression, 
with estimates of over 90% in some patient surveys [73], with plaus- 
ible mechanisms such as misinterpretation of the environment [74]. 
Rarely, this aggression can result in a criminal offence, especially 
prior to a formal clinical diagnosis, and offences committed by pa- 
tients with dementia are not limited to violence and range from theft 
[75] to homicide [76]. A case series of first-time offenders aged 65 
and over have found higher rates of dementia than would be ex- 
pected by age alone [77]. 

Fronto-temporal dementia and Huntington’s disease are two dis- 
orders where illness onsetisearlierthan thatin other neurodegenerative 
disorders. A recent review of offending and neurodegenerative dis- 
eases found associations with fronto-temporal dementia [behavioural 
variant (bvFTD)], primary progressive aphasia, and Huntington's dis- 
ease, all of which have clinical conditions affecting the frontal circuits 
[78]. In a cohort of 2397 patients seen at a memory clinic in California, 
8% presented with criminal behaviour, patients with bvFTD had the 
highest rates of offending (37%) and those with mild cognitive im- 
pairment the lowest (3%), and estimates for vascular dementia and 
Alzheimer’s disease were 15% and 8%, respectively. In 4% of byFTD 
patients, violence was their presenting complaint, which compares 
with 0.7% of Alzheimer’s disease patients. Thus, degenerative pro- 
cesses that predominantly affect non-frontal areas, such as Alzheimer’s 
disease, do not appear to be as strongly associated with offending [78]. 


Neurological disorders 


Traumatic brain injury (TBI) can lead to personality change and 
incident psychiatric and substance use disorders. The odds of a ju- 
venile offender having a history of TBI, compared to a control, is 
3.4 (95% CI 1.5-7.5) [79], and a 2010 meta-analysis estimated the 
prevalence of TBI in offender populations at 60% (95% CI 48-72) 
[80]. Prisoners appear to have a higher lifetime prevalence at 68% 
(95% CI 50-86), with more than half of these injuries including loss 
of consciousness (the diagnostic threshold for mild TBI or concus- 
sion). Absolute rates of violent crime in TBI patients are about 6% 
[81], reflected in an increased odds of 2.3 (95% CI 2.2-2.5) when 
adjusted for socio-economic confounders and substance misuse. 
When compared with unaffected siblings, the odds of violent crime 
remain raised at 2.0 (95% CI 1.8-2.3) [81]. 

Although epilepsy has been historically linked with crime and 
higher-than-expected rates in prisoners [82], both of these conven- 
tions have been questioned. In a population-based study, the absolute 
risk associated with epilepsy was slightly higher than in age-matched 
population controls, but not different to unaffected siblings (OR 1.1, 


95% CI 0.9-1.2) [81]. Further, a systematic review found no support 
for the increased rates in prisoners, and previous findings of increased 
rates were partly based on using the wrong denominator [83]. The 
finding that epilepsy itself is not associated with an increased risk of 
violence is important, given the stigma associated with this condition. 


Neurodevelopmental disorders 


Up to 10% of the child and adolescent population worldwide 
have neurodevelopmental disorders [84], and these disorders are 
overrepresented in the criminal justice system [85]. Of these dis- 
orders, attention-deficit/hyperactivity disorder (ADHD) has been 
consistently associated with an increased risk of offending (OR 
4.6, 95% CI 2.2-10.3 in a large epidemiological study) [86]. In 
some studies, ADHD is a stronger predictor of violence than sub- 
stance misuse [87], although comorbidity is common. Individuals 
with ADHD appear to be overrepresented in the criminal justice 
system, with an estimated prevalence of around 12% in boys in 
detention [88]. Among those diagnosed with ADHD, earlier onset 
of ADHD symptoms is associated with a higher rate of recidivism. 
Individuals with ADHD often have other comorbid psychiatric 
disorders, most often conduct disorder (17%) [89]. A longitudinal 
study in Sweden examined four childhood neurodevelopmental 
disorders and their association with future violent offending, com- 
pared to age-matched controls [90]. After adjustment for con- 
founding, a diagnosis of ADHD was associated with an increased 
risk of violent offending (OR 2.7, 95% CI 2.0-3.8) [90]. When com- 
paring full-siblings of individuals with ADHD with matched con- 
trols and adjusting for confounders, an increased risk of violent 
offending remained (OR 1.3, 95% CI 0.9-2.0), suggesting familial 
confounding. 

It has been hypothesized that autism spectrum disorders (ASD) 
may be associated variously with higher and lower risks of offending 
than the general population. Studies based on offender cohorts have 
suggested a small increased risk of criminality in individuals with 
ASD [85]. More recently, no association was found between ASD 
and violent offending in a Swedish sibling control study (OR 1.1, 
95% CI 0.6-1.9) [90]. The authors suggested that associations seen 
in previous studies were possibly due to other comorbid psychiatric 
diagnoses [90]. A study of individuals with ASD reviewed by the fo- 
rensic psychiatric service in Norway found comorbidities included 
intellectual disability (33%), ADHD (15%) and substance misuse 
(19%) [91]. Risk factors for violence in individuals with ASD in- 
cluded comorbid psychiatric disorder (OR 4.2, 95% CI 1.8-9.8) [92]. 
A diagnosis of Asperger’s syndrome has also been associated with a 
higher risk of violence than ASD [92]. Other neurodevelopmental 
disorders associated with offending include tic disorder [90]. 


There have been very few studies researching associations between 
eating disorders and offending. One investigation compared in- 
dividuals with eating disorders to matched controls and found no 
difference in aggressiveness [93]. There have been case reports of 
an association with shoplifting; however, epidemiological data are 
lacking [94] (Table 139.2). 
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Table 139.2 Associations between psychiatric disorder and offending (comparison groups used 
unaffected siblings where possible; if not, systematic reviews were identified, followed by large 


observational studies) 


Psychiatric disorder Odds of offending/ 
violence (95% Cl) 








eet lee 





Schizophrenia-spectrum disorders 4.2 (3.8-4.5) Siblings 

Bipolar disorder 3.4 (risk ratio) Siblings a 
Depression 2.1 (1.8-2.4) Siblings 18] 
Personality disorder 3.0 (2.6-3.5) General population 16] 
Substance misuse 7.4 (4.3-12.7) General population 6] 
Substance misuse (SMI + alcohol + Ooo Se) General population 6] 
drugs—general) 

Neurodevelopmental disorder (ADHD) 1.4 (0.9-2.1) Siblings 90] 
Neurodegenerative disorder (bv-FTD) 7.2 (P <0.001) Alzheimer's disease 78] 
Neurological (TBI) 2.0 (1.8-2.3) Siblings 81] 
Neurological (epilepsy) 1.1 (0.9-1.2) Siblings 81] 


SMI, severe mental illness. 


Risk assessment 


There has been a proliferation of structured risk assessment tools in 
recent years, which rely on research on risk factors to create a check- 
list of items that can be scored in an unweighted or weighted manner. 
Current tools identify low-risk individuals with reasonable accuracy 
but are less accurate at identifying high-risk individuals who will actu- 
ally go on to offend [95]. The rates of violence in individuals classified 
as high risk by these tools vary substantially [96], and trial evidence 
does not demonstrate improved outcomes using structured clinical 
judgement instruments [97]. In addition, some of these tools are 
subject to the prevention paradox—that most individuals with adverse 
outcomes are categorized in low- or medium-risk categories [17, 49], 
especially for rare outcomes such as homicide. Furthermore, scal- 
ability and simplicity need to be prioritized, as current instruments 
are time-consuming, suffer from authorship and publication biases 
[98], and are expensive to use (often requiring training and costs to 
administer). Recent work has addressed some of the limitations of 
previous work by using large data sets, pre-specifying risk factors and 
thresholds in a protocol, publishing a range of performance measures 
in a validation cohort, and developing a brief, scalable, and free-to-use 
risk assessment tool (OxMIV) for clinical decision-making in indi- 
viduals with schizophrenia-spectrum and bipolar disorders [99]. The 
validation C-index [equivalent to area under the curve (AUC)] for 
this tool was 0.87, higher than comparative tools that take consider- 
ably longer and rely on interviews. Such a scalable tool can be used to 
screen out low-risk persons with psychosis and act as an adjunct to 
clinical decision-making. 


Conclusions and implications 





Offending is a complex human behaviour and has many different 
causes, often acting at the same time. The links between psychi- 
atric disorder and offending can be further clarified by using clear 
outcomes that allow for comparability, longitudinal studies to 
account for temporal relationships between exposure and out- 
come, and methods that adjust for sociodemographic and familial 


confounders. Overall, the data suggest that substance misuse and 
antisocial personality disorder are most strongly associated with an 
increased risk of offending, particularly violent offending, in rela- 
tive and absolute terms. Schizophrenia-spectrum disorders, bipolar 
disorder, and clinical depression are also associated with offending, 
but relative and absolute risks are lower, unless there is comorbidity 
with substance misuse (Table 139.2). Pharmacological treatment of 
schizophrenia-spectrum and bipolar disorders is associated with a 
reduction in reoffending in observational studies and with broader 
violent outcomes in randomized controlled trials, which underscores 
the importance of early diagnosis and treatment. Violence risk as- 
sessment should consider using scalable tools, particularly to screen 
out low-risk persons and as an adjunct to clinical decision-making. 
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Over the last decade, studies have highlighted that young people 
with disproportionately high and multiple needs have clustered in 
the juvenile justice system. These young people experience higher 
levels of diagnosable mental health problems and neurodisability 
than the general population. 

Delinquency, conduct problems, and aggression all refer to anti- 
social behaviours that reflect a failure of the individual to conform 
his or her behaviour to the expectations of some authority figure 
or to societal norms, or to respect the rights of other people. The 
‘behaviours’ can range from mild conflicts with authority figures, to 
major violation of societal norms, to serious violations of the rights 
of others. The term ‘delinquency’ implies that the acts could result in 
conviction, although most do not do so. The term ‘juvenile’ usually 
applies to the age range, extending from a lower age set by the age of 
criminal responsibility to an upper age when a young person can be 
dealt with in courts for adult crimes. These ages vary between, and 
indeed within, countries and are not the same for all offences. 

This chapter provides a developmental approach to under- 
standing the needs of young people in contact with the criminal 
justice system. It reviews the prevalence of a range of mental health 
and neurodevelopmental disorders in young offenders and describes 
the key principles of assessment and intervention approaches. The 
policy and legal framework have been illustrated by reference to the 
system in England but will have relevance to readers from further 
afield. 




































































Adolescence is a transitional stage of development between child- 
hood and adulthood; the developmental tasks of adolescence centre 
on autonomy and connection with others, rebellion and the de- 
velopment of independence, the development of identity, and the 
distinction from, and continuity with, others. Many theories of 





development are stage theories, in which we pass from stage to 
stage, completing the developmental tasks appropriate to each stage 
prior to moving on. However, in reality, progression across domains 
is not equal and frequently non-linear. Additionally, neuroscience 
suggests that there is a lack of synchrony in late childhood and ado- 
lescence in the development of two of the critical brain systems that 
enable the development of adaptive behaviour [1]. The mesolimbic 
system develops more quickly than the frontal system, resulting in 
the adolescent brain having a heightened need for basic reward, but 
a lower capacity to manage short-term rewards for greater long- 
term gains. This discrepancy may be greater for young people at risk 
of antisocial behaviour where reward-driven behaviour is increased 
and neurocognitive deficits in executive functioning more common. 

Epidemiological studies suggest that the development of anti- 
social behaviour involves a complex interaction of intrinsic and 
psychosocial risk and protective factors [2]. Links between early 
adverse events in the prenatal period and the impact of parenting, 
family, and peer relationships on behaviour in early childhood have 
been found [2]. Heritable influences contribute towards a gene- 
environment interaction, suggesting pathways are complex and 
some vulnerabilities become increasingly evident in the context 
of other risk factors. Intrinsic risk factors include socio-cognitive 
deficits (hostile attribution bias), temperamental factors (callous- 
unemotional traits), and low autonomic nervous system arousal 
[2, 3]. The neurocognitive profiles of young offenders include 
deficits in language skills, attention, and impulse control, as well as 
low intelligence quotient (IQ) scores. Deficits in executive function 
can affect the young person's ability to regulate their behaviours 
and plan and generate alternative strategies. However, antisocial 
behaviour also shows strong associations with psychosocial ad- 
versity. Parental mental illness, family breakdown, parenting style, 
and association with other antisocial peers influence outcomes [2]. 
Detachment from school through truancy and exclusion may in- 
crease the risk of offending through reduced supervision and loss 
of any positive socialization effects of school and by creating delin- 
quent groups of young people [4]. 
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Distinctive pathways are described for those with early and 
late onset of offending behaviour, with some gender differences 
suggested. Girls have historically been found to have a later onset 
to their antisocial behaviour, compared with boys [2, 5]. Moffitt 
[5] proposed a developmental taxonomy whereby early-onset 
conduct problems were more likely to be associated with indi- 
vidually based risk factors that increase vulnerability to the de- 
velopment of behaviour problems in the context of adverse family 
and parenting conditions. In adolescent-onset conduct prob- 
lems, by contrast, individual vulnerabilities may be less marked, 
and the impact of environmental risk factors, including the in- 
fluence of antisocial peers, more evident. There is evidence that 


early-onset, life course-persistent antisocial behaviour is more 
likely to be associated with a greater risk of neurodevelopmental 
impairment [6]. 

However, it is important to bear in mind that protective or resili- 
ence factors for the young person or within the system around them 
can also modify outcomes (Fig. 140.1).This is particularly relevant 
for interventions and an important underpinning principle of the 
risk and resilience approach which focuses on building important 
‘assets’ in the child’s health and emotional well-being. Activity can 
include strengthening children’s internal resources, as well as re- 
silience in their families, their communities, and their broader 


environment. 












Risk factors 









Perinatal/parental mental 
illness 


Parental substance misuse 


Harsh or inconsistent 
discipline 


Lack of warmth and 
affection 


Multiple poor health outcomes 


Behavioural health 
outcomes 
Relationship difficulties 
Aggression 
Deviant peer group 
Risk-taking behaviour/ 

delinquency 


Physical health 
outcomes 
Failure to thrive 
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Fig. 140.1 Risk and resilience factors affecting multiple health outcomes. 
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Reproduced from Chitsabesan P, Khan L, Assessment of young offenders; mental health, physical, educational and social needs. In: Forensic Child and Adolescent Mental 
Health: Meeting the Needs of Young Offenders, Bailey S, Chitsabesan P, Tarbuck P [Eds.], pp 41-54, Copyright (2017), with permission from Cambridge University Press. 
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Prevalence of mental health needs and 


neurodevelopmental disorders 





Mental health needs 


Studies have reported the prevalence of psychiatric disorders among 
young males in custody in the United States (US) to be between 60% 
and 70%, and between 60% and 80% for young females in custody 
[8], in contrast to rates of between 7% and 12% in the general popu- 
lation [9]. The high rate of psychopathology in juvenile offenders 
may be the consequence of shared risk factors in the development of 
both antisocial behaviour and psychiatric disorders. 

Table 140.1 summarizes a range of studies that identify rates of 
specific mental health needs and neurodevelopmental disorders 
among young people within the youth justice system and the general 
population [10]. While it is important to highlight that comparing 
such rates is problematic due to differences in methodology between 
studies, prevalence rates found in young offenders are consistently 
higher than in the general population. 


Psychotic disorders 


Young people in the secure estate are approximately ten times more 
likely to suffer from psychosis than the general adolescent popu- 
lation [11]. Fazel et al. [11] found that 3.3% of males and 2.7% of 
females in youth custody were diagnosed with a psychotic illness. 
A first psychotic episode may have multiple causes, including drugs, 
brain injury, physical illnesses, or schizophrenia. Schizophrenia is 
a mental illness characterized by cognitive disruption, delusions 
(fixed, false beliefs), and hallucinations. Sufferers frequently have 
comorbid depression, anxiety, and substance misuse. Mortality rates 
for young offenders with schizophrenia was found to be significantly 
higher than in non-offending peers [12]. 


Table 140.1 Prevalence of psychiatric and neurodevelopmental 
disorders in young offenders 





Type of disorder Reported Reported 
prevalence rates prevalence 
among young rates among 
people in the young 
general population | offenders (%) 
(%) 

Psychotic disorder 04 1-33 

Depressive disorder OL2=3 Sees) 

Anxiety disorder 29 9-21 

Post-traumatic stress disorder 0.4 ess 

Substance misuse disorder 7 3/55 

Learning disabilities 2-4 23=32 

Dyslexia 10 21-43 

Communication disorders Se 60-65 

Attention-deficit/hyperactive 3-9 TES 

disorder 

Autistic spectrum disorder 0.6-1.2 6 

Traumatic brain injury 24-316 65 


Reproduced from Chitsabesan P, Hughes N, Mental Health and Neurodevelopmental 
Disorders amongst Young Offenders: Implications for Policy and Practice. In: Mental 
Health, Crime and Criminal Justice, Responses and Reforms, Winstone J [Ed.], pp. 109-30, 
Copyright (2015), with permission from Palgrave Macmillan. 


Anxiety disorders and post-traumatic stress disorder 


Anxiety disorders include a range of different disorders, from gener- 
alized anxiety disorder to phobias and post-traumatic stress disorder 
(PTSD). In their UK-based study of young offenders, Chitsabesan 
et al. [13] diagnosed anxiety disorder in 16% of cases, consistent 
with findings from a systematic review [14]. 

The NWJP study found that the vast majority of offenders had ex- 
perienced at least one trauma in their lifetime (93%), while 11% met 
the criteria for PTSD [15]. In the UK, the Annual Report of the Chief 
Medical Officer stated that 9% of 13- to 18-year-old young offenders 
in secure care have a diagnosis of PTSD [16]. 

Many young people within the criminal justice system who pre- 
sent with pervasive and complex presentations often have an ex- 
tensive history of repeated interpersonal traumatic experiences. 
Disruption to the early attachment process through interpersonal 
trauma has a direct impact on the development of a child’s brain, 
attachment style, and emotional regulation systems [17]. Over time, 
the brain is more likely to develop connections between the parts of 
the brain based on recognizing and interpreting threat. 


Depression 


A number of studies in adolescents within the criminal justice sys- 
tems, both in the UK and internationally, have reported dispropor- 
tionally higher prevalence rates of depressive disorders in young 
offenders. Fazel et al. [11] conducted a meta-analysis of 25 studies 
of psychiatric morbidity among children and young people in cus- 
tody, including studies from the UK, as well as other international 
contexts. Their findings suggested that around 11% of boys and 29% 
of girls had a major depressive disorder [11]. Adolescents within the 
youth justice system experience high levels of psychosocial adversity 
and are significantly more at risk of developing depression due to 
both genetic and environmental factors. Additionally, these young 
people are at greater risk of continued depression, as they have 
fewer protective influences to compensate for the many adversities 
they face. 


Self-harm and suicidal behaviour 


An association between antisocial behaviour and suicidal behav- 
iour has also been demonstrated in research studies. A study of 200 
young people sentenced to custody found that 20% were reported 
to have harmed themselves and 11% to have attempted suicide [18]. 
Self-harm is more prevalent among offenders, as certain risk fac- 
tors for self-harm are more common among this group. Predictors 
of an increased risk include previous attempts, prolonged low mood, 
attention-deficit/hyperactivity disorder, impulsivity, and substance 
misuse [19]. 

Rates of suicide are similarly much higher in the offender popula- 
tion, compared with non-antisocial peers, possibly as a consequence 
of greater impulsive behaviour and substance misuse, which are 
more common in this group of young people [19]. 


Substance misuse 


Substance misuse disorders are defined by sustained maladap- 
tive behaviours related to, or caused by, substance misuse. Young 
people's substance misuse differs from that of adult substance mis- 
using offenders. Cannabis and alcohol are the main substances of 
choice reported by adolescents, and polydrug use is common. The 








SECTION 24 Forensic psychiatry 


mesolimbic dopaminergic system, or ‘reward system in the brain, 
has an important role as adolescent brains appear particularly sen- 
sitized to dopamine. 

Prevalence rates of substance misuse disorder in young offenders 
in custody have ranged from 41% to 55% [20]. Childhood ex- 
posure to violence and abuse was found to increase the likelihood 
of violence and problematic substance misuse in adulthood [20, 21]. 
Reduced parental supervision and the influence of peer relation- 
ships play a key role in both experimentation with, and ongoing use 
of, substances [20]. Mulvey et al. [21] reported that the disinhibiting 
effects of drugs, such as alcohol, can increase the risk of violent or 
risk-taking behaviour. Positive associations have also been shown 
between substance misuse and other disorders, including depression 
and attention-deficit/hyperactivity disorder (ADHD). Longitudinal 
studies also showed an association with persistent offending and 
both depressive symptoms and drug use into adulthood [22]. 


Psychopathy 


Psychopathy is characterized by a unique cluster of traits, including 
shallow affect; lack of empathy, and guilt; irresponsibility; impul- 
sivity; and poor decision-making. A closely related construct to 
psychopathy is antisocial personality disorder (APD). APD is de- 
fined in the Diagnostic and Statistical Manual of Mental Disorders, 
fifth edition (DSM-V) as having a pervasive disregard for, and a will- 
ingness to, violate the rights of others [23]. While most individuals 
with APD do not meet the diagnostic criteria for psychopathy, psy- 
chopaths are almost invariably diagnosed with APD. 

The Psychopathy Checklist Revised (PCL-R) is a well-known 
measure of adult psychopathic traits, although criticized for its strong 
focus on criminality. The Psychopathy Checklist: Youth Version 
(PCL:YV) was developed from the PCL-R to provide a standardized 
approach to assessing psychopathic-like traits in youth males and fe- 
males aged 12-17 years and has demonstrated predictive validity for 
reoffending and violent offending [24]. An alternative approach which 
is still in its early stage of validation is the Comprehensive Assessment 
of Psychopathic Personality Disorder (CAPP) which has six dimen- 
sions: attachment, behavioural, cognitive, dominance, emotional, and 
self-styles of functioning [25]. The new measure attempts to encapsu- 
late the ‘clinical’ construct, as well as capture change [26]. 

The presence of callous-unemotional (CU) traits has been shown 
to identify a discrete group of conduct-disordered children with 
more severe conduct problems and aggression. A biological theory 
has been proposed that children who develop CU traits have unique 
neural mechanisms characterized by low cortisol and underarousal, 
contributing to a temperamental style which has low emotional 
reactivity and fearfulness to threatening stimuli, poor responsive- 
ness to cues to punishment, and reward-dominant orientation [27]. 
However, Johnstone [26] argues that there are developmental chal- 
lenges associated with extending the construct of psychopathy to 
youth and that environmental factors, such as disrupted parent- 
child attachments and trauma, are also important risk and mediating 
factors which can influence neural brain processes and outcomes. 


Neurodevelopmental disorders 


Neurodevelopmental impairments are expressed through a wide 
range of symptoms, including deficits in cognitive functions 


(reasoning, thinking, and perception) and social-affective func- 
tions (the expression of emotion and formation of relation- 
ships). Specifically, problems with executive functioning, 
cognitive empathy, and emotional regulation associated with par- 
ticular neurodevelopmental disorders can directly influence pro- 
pensity towards aggressive and antisocial behaviour [6]. 


Attention-deficit/hyperactivity disorder 


A meta-analysis reviewing 25 international studies found a rate of 
11.7% for young male offenders [11], in contrast to 3-7% within the 
general population. Longitudinal studies have suggested that child- 
hood ADHD predicts later antisocial behaviour, although more re- 
cent evidence suggests that this association is indirect and mediated 
through the development of conduct disorder, illicit drug use, and 
peer delinquency [28]. Young people with conduct disorder and 
ADHD have been shown to have greater severity and persistence of 
antisocial behaviour. Co-occurrence with substance misuse is also 
high, with evidence of increased frequency and severity of institu- 
tional behavioural disturbances while in custody [29]. 


Autism spectrum disorders 


Autism is a neurodevelopmental disorder characterized by a triad 
of impairment in social communication that affects about 1% of 
children and young people [30]. Certain features of autism may 
predispose young people to offend, including social naivety, mis- 
interpretation of social cues, and poor empathy [30]. While the 
National Autistic Society [31] suggests that young people with 
Asperger's syndrome are seven times more likely to come into con- 
tact with the criminal justice system than their peers, many studies 
demonstrating an increased prevalence rate in the criminal justice 
system have been conducted on a forensic psychiatry sample of of- 
fenders, suggesting further research is required. There is concern, 
however, about the possible links between autism and particular 
types of offending. Certain types of offences may raise the suspi- 
cion of autism spectrum disorder (ASD), including stalking, arson, 
sexual offences, inexplicable violence, computer crime, and offences 
arising out of misjudgement of social relationships [30]. 


Learning disability 


A learning disability is defined as an IQ score of less than 70, together 
with significant difficulties with adaptive or social functioning and 
onset prior to adulthood. High prevalence rates (23-32%) have been 
found in studies of young offenders [32]. The majority of young of- 
fenders with a learning disability frequently have an IQ in the ‘mild 
range’ (between 50 and 69) and may therefore be less likely to have had 
their learning needs identified in mainstream schools where those 
needs are often overshadowed by their challenging behaviour [33]. 


Communication disorders 


Communication disorders relate to problems with speech, language, 
and communication that significantly impact day-to-day func- 
tioning. A study of 72 young offenders attending a youth offending 
service found that 65% of those screened had profiles indicating that 
they had communication difficulties, including expressive language 
difficulties (28%), receptive language difficulties (45%), and articu- 
lation difficulties such as a stammer (8%) [34]. Particular deficits in 
verbal skills may also be highlighted through a discrepancy in verbal 
and performance IQ scores. 
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Traumatic brain injury 


A traumatic brain injury (TBI) is any injury to the brain caused by 
an impact, and the severity is typically measured by the depth of loss 
of consciousness (LOC). The majority of TBIs appear to be mild, 
although long-term effects on academic performance, behaviour, 
emotional control, and social interactions can be reported. This is 
reflected in a range of studies which have demonstrated an asso- 
ciation between TBI and antisocial behaviour, including an earlier 
onset of offending history and more violent offending [35]. Greater 
severity of TBI was also found to be associated with greater impair- 
ment of cognition, earlier onset of criminal behaviour, and increased 
rates of mental illness and substance use. Identified prevalence rates 
of TBI among young offenders are variable, however, ranging from 
4.5% to 72% [36]. 


Gender and ethnicity 


While studies of mental health needs have predominantly focused 
on male offenders, gender differences have been reported. The 
NWJP study found that female offenders were 14 times as likely as 
male offenders to meet the criteria for at least one mental disorder, 
including depression and anxiety disorders [8]. Rates of PTSD and 
self-harm have also been found to be significantly more prevalent in 
females [13]. Explanatory mechanisms may include greater genetic 
vulnerability, as well as increased exposure to trauma and chaotic 
family lifestyles. Female offenders have been shown to experience 
more abuse, neglect, and a family history of mental illness than male 
offenders. There are concerns that there is an overrepresentation of 
some ethnic minority groups within the criminal justice system and 
that these groups of young people have poorer access to services. 
However, studies exploring differences in the prevalence of mental 
disorders among young offenders by ethnicity have been sparse and 
findings inconsistent. A national study within the United States 
using the Massachusetts Youth Screening Instrument-Version 2 
(MAYSI-2) found that differences in prevalence rates varied across 
sites and were generally small [37]. 


Screening and assessment 


Studies to date suggest that one of the most common reasons for 
unmet need is lack of appropriate and timely assessment [38]. Lack 
of identification hampers opportunities for early intervention and 
promotion of healthy development and resilience in young people. 
Initial assessment is a key factor in the successful treatment of 
mental health needs for young offenders. Early identification of 
mental health needs may also reduce the later risk of mental dis- 
orders and their related health costs. However, assessment can be 
complicated by a number of factors, including the non-clear nature 
of emerging mental health symptoms in children and young people, 
in comparison to presentations in adults, the minimization of symp- 
toms by young offenders, and the disinclination of young people to 
engage with mental health services due to fear of stigma. 
Historically, screening and assessment tools have been devel- 
oped focusing on single-problem areas, for example the MAYSI- 
2, a widely used screening tool for mental health needs within the 


US youth justice system [39]. However, young people who offend 
often have multiple areas of need. In recognition of these difficul- 
ties, the Comprehensive Health Assessment Tool (CHAT) was intro- 
duced for young offenders in custody across England and Wales 
[40], supported by the development of Healthcare Standards for 
Children and Young People in Secure Settings [41]. The CHAT is 
a semi-structured assessment developed to provide a standardized 
approach to health screening for all young offenders admitted to the 
secure estate and includes assessment of mental health needs, phys- 
ical health, substance misuse, and neurodevelopmental impairment. 
Assessing and managing unmet health needs can inform individual 
care plans, help to address offending behaviour, and provide a valu- 
able opportunity to re-engage young people with health and educa- 
tional services to address unmet needs. 


Principles of risk assessment 


Risk assessment combines statistical data with clinical information 
in a way that integrates historical variables, current crucial variables, 
and contextual or environmental factors. Structured risk assessment 
instruments have been developed that aim to increase the validity of 
clinical prediction. These scales typically contain a number of risk 
items selected from reviews of research, crime theories, and clin- 
ical considerations. Items are summed to form a total risk score and 
may also reveal specific risk patterns (for example, mainly family or 
child factors). The Structured Assessment of Violence Risk in Youth 
(SAVRY) [42] is a popular risk assessment tool which assesses 24 
risk factors (ten historical risk factors, six social/contextual risk fac- 
tors, eight individual/clinical risk factors) and six protective factors. 
The SAVRY has a good evidence base to support its predictive ac- 
curacy and clinical applicability in a range of settings. 

Risk assessments can vary in the type of risk they assess, for ex- 
ample the Youth Level of Service/Case Management Inventory [43] 
assesses general offending, while the SAVRY assesses interpersonal 
violence. 

The following five-step approach should be considered when 
undertaking risk assessments [44]: 


e Step 1: collecting case information via interviews and collateral 
information from informants. 

Step 2: identifying the presence and relevance of risk factors using 
the structured risk assessment tool. 

e Step 3: identifying the presence and relevance of protective factors 
using the structured risk assessment tool. 

e Step 4: risk formulation. 

e Step 5: management plan. 


For risk and protective factors, consideration needs to be given 
to the presence and relevance of the factor in relation to a specific 
risk, as well as its severity and frequency. A rating (low, moderate, or 
high for risk factors, and either absent or present for protective fac- 
tors) does not fully inform the subsequent risk formulation or man- 
agement plans. The aim of a risk formulation is to provide a better 
understanding of the origins of violence and how it has developed, 
is maintained, and changes [44]. There are a number of frameworks 
which can be used to help explain and communicate risk, including 
the ‘SPs’ model (problem, predisposing factors, precipitating factors, 
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perpetuating factors, and protective factors) and the ‘3Ds’ model 
(which represents drivers, destabilizers, and disinhibitors) [44]. 


A large number of different treatments have been developed to re- 
duce antisocial behaviour. These include psychotherapy, pharma- 
cotherapy, school-based interventions, residential programmes, 
and social treatments. Meta-analyses of treatment approaches to 
juvenile delinquency have produced reasonably consistent find- 
ings [45]. The best results were obtained from cognitive behav- 
ioural, skills-orientated, and multimodal methods. Specifically, 
treatment approaches that were participatory, collaborative, and 
problem-solving were particularly likely to be beneficial. Family and 
parenting interventions also seem to reduce the risk of subsequent 
delinquency among older children and adolescents. 

Cognitive behavioural therapy (CBT) is an effective psycho- 
logical intervention for a number of mental health difficulties in 
childhood and adolescence, including depression, social anxiety 
disorder, and PTSD. Studies that have been conducted with young 
people engaging in high-risk behaviours indicated that CBT may be 
useful for young offenders with a range of mental health needs [46]. 
However, the format and delivery of CBT may require adaptation, 
taking into account the young person's level of cognitive, social, and 
emotional development, including neurodevelopmental impair- 
ment. Adaptations include use of visual cues, encouraging verbal 
learning through games, reducing distractions, and using simplified 
language [47]. 

More recently, therapies include mindfulness-based cognitive 
therapy, dialectical behaviour therapy (DBT), and mentalization- 
based therapies. Overall, the current research base provides support 
for the feasibility of mindfulness-based interventions with children 
and adolescents, and there is an emerging evidence base in relation 
to adolescence with behavioural problems and those in custodial 
settings; however, there is no generalized empirical evidence of the 
efficacy of these interventions at present. DBT was developed as a 
treatment for suicidal adults by developing specific skills in mind- 
fulness, emotional regulation, distress tolerance, and interpersonal 
effectiveness through individual and group sessions. Over recent 
years, DBT has been modified and adapted to meet the needs of 
adolescents within forensic settings. Mentalization-based treatment 
for adolescents (MBT-A) is a psychodynamic psychotherapy pro- 
gramme with roots in attachment theory, aimed at enhancing the 
young person's capacity to represent their own and other’s feelings 
accurately in emotionally challenging situations. MBT-A has been 
influential in the development of a more systemic approach called 
adolescent mentalization-based integrative therapy (AMBIT), spe- 
cifically designed for ‘hard-to-reach’ adolescents with mental health 
problems who may be or are at risk of offending [48]. 

The National Institute for Health and Care Excellence (NICE) [49] 
recommends multimodal interventions for the treatment of conduct 
disorder in children and young people aged between 11 and 17 years. 
A meta-analysis of eight family-based treatment studies of adoles- 
cent conduct disorder found better outcomes for three family-based 
treatments: functional family therapy (FFT), multi-systemic therapy 
(MST), and multi-dimensional treatment foster care (MTFC), com- 
pared to routine treatment [50]. The most well-evaluated is MST. 


MST is a multimodal intervention where interventions are targeted 
at not only the young person, but also their family, school, and peers. 
Evaluation studies of MST have been promising [51]. In particular, it 
has also been shown to be effective for young people with substance 
misuse disorders. However, criticisms of MST include the require- 
ment for a high level of therapeutic expertise, as well as the cost of 
implementation. 
McGuire [52] identified six principles for effective programmes: 


1. The intensity should match the extent of the risk posed by the 
offender. 

2. A focus on active collaboration, which is not too didactic or 
unstructured. 

3. Close integration with the community. 

4. Emphasis on behavioural or cognitive approaches. 

5. Delivered with high quality with training and monitoring 
of staff. 

6. Focus on the proximal causes of offending behaviour (peer 
groups, promoting current family communication, and 
enhancing self-management and problem-solving skills), rather 
than the distal causes (early childhood). 


Serious crimes 


Homicide 


Longitudinal studies are invaluable in mapping out the range of fac- 
tors and processes that contribute to the development of aggressive 
behaviour and in showing how they are causally related [2]. Research 
suggests that risk factors that increase violent offending overlap with 
those increasing the risk for homicides as well, for example repeat 
offending, substance abuse problems, and exposure to early risk fac- 
tors such as family dysfunction and educational difficulties [53, 54]. 
The Pitsburgh study found that the strongest predictors of becoming 
a homicide offender were environmental, rather than individual, 
factors [54]. The study found that there was no single factor ex- 
plaining homicide offending, but instead a cumulative effect, with 
the greater the number of risk factors, the more likely it was that the 
individual would become involved in a homicidal incident. 

Murder and manslaughter are more often committed alone, and 
on average, the perpetrators start their criminal activities at a later 
age and are much less likely to have previous convictions than other 
minors taken into judicial youth institutions. However, there is 
a great variety in terms of motives and victims, etc. Studies show 
that children and adolescents who murder share a constellation of 
psychological, educational, and family system disturbance [53]. 
The majority of young persons who have killed initially dissociate 
themselves from the reality of their act but gradually experience a 
progression of reactions and feelings akin to a grief reaction. The 
young person, while facing a still adversarial and public pre-trial 
and trial process, has to move safely through the process of disbelief, 
denial, loss, grief, and anger/blame. PTSD arising from the partici- 
pation in the sadistic act (either directly or observing the actions of 
co-defendants) has to be treated, as does trauma arising from their 
own past personal emotional, physical, and/or sexual abuse. A com- 
bination of verbal and non-verbal therapies are effective, but fac- 
tors such as a history of severe aggression, low intelligence, and a 
poor capacity for insight are associated with poorer outcomes [53]. 
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In understanding the role of violence and sadism in a young person's 
life, one has to understand their reaction to perceived threat and 
their past maladaptive behaviours aimed at allowing them to feel in 
control of their lives. 


Harmful sexual behaviour 


Harmful sexual behaviours by children and young people is a con- 
siderable problem that impacts on victims and their perpetrators, 
as well as their families. The sexual behaviours of children and 
young people are on a continuum, which ranges, on the one hand, 
from normal and developmentally appropriate to highly abnormal 
and violent on the other. It is important to place any assessment of 
a child’s sexual behaviour within a developmental context, as such 
behaviours may have substantially different motivations and dif- 
ferent developmental significance across these two developmental 
stages. Models have been developed to describe children and young 
people's sexual behaviours at various levels of seriousness or con- 
cern, including age-appropriate and inappropriate [55]. 

Most, but not all, abusers are male, often come from disadvantaged 
backgrounds, with a history of victimization and sexual and physical 
abuse, and show high rates of psychopathology [55]. Females with 
harmful sexual behaviour were found to have higher rates of PTSD 
and a history of family dysfunction and have been victims of sexual 
abuse, in comparison to male offenders [56]. People with learning 
disability (LD) and ASD are overrepresented in services for sexual 
offenders, which may relate to a number of factors, including a lack 
of socio-sexual knowledge, difficulties understanding the nature of 
relationships and the feelings of others, and more limited opportun- 
ities for intimate relationships [30, 55]. 

A core part of any assessment of an adolescent sex offender should 
include a holistic assessment of needs and risks, including an assess- 
ment of mental health [55]. These young people often have a history 
of other offending behaviours, experienced trauma and loss, and are 
likely to suffer from a range of mental health disorders. Assessment 
and treatment of these disorders should be an integral part of the 
process, including the awareness that addressing harmful sexual be- 
haviours may have an adverse impact on the young person’s emo- 
tional well-being and behaviour. 

The AIM assessment model [57] identifies mental health concerns 
as risk factors, distinguishing between those with strong empirical 
support (high levels of trauma, for example own victimization and 
witnessing domestic violence; a formal diagnosis of conduct dis- 
order; poor social and intimacy skills; and highly compulsive/impul- 
sive behaviours) and others based on practice consensus (diagnoses 
of ADHD, depression, or other significant mental health problems). 

New approaches to CBT with sexually abusing youths have re- 
cently been described within the context of relapse prevention, and 
a more complex CBT intervention—mode deactivation therapy 
(MDT)—has been suggested for disturbed, sexually abusive young 
people with reactive conduct disorders or personality disorders [58]. 
CBT group work with sexually abusing children and young people is 
widely practised in the UK [59]. 

More recently, there has been an increased emphasis on strength- 
based models of intervention for young people with harmful sexual 
behaviours. Treatment is therefore seen as an activity that should 
add to the young person's skills and personal functioning [55]. 

For some young people with significantly risky sexually harmful 
behaviour, treatment needs to be undertaken within a close 


supervised, intensive, community-based foster placement with spe- 
cially trained foster carers. Approaches include MTFC and forensic 
foster care. However, there continues to be a lack of longitudinal 
studies to measure the effectiveness of existing programmes. 


Fire-setting and arson 


Arson can have a devastating impact on the victim and the wider 
society. Juvenile arsonists are not a homogenous group, with a 
wide range of familial, social, and developmental needs [60]. High- 
frequency fire-setters (three or more episodes in the past year) were 
more likely to have started their fire involvement before the age of 
10 years and had cumulative risk factors and other problem be- 
haviours, including antisocial behaviour, aggression, impulsivity, 
and alcohol and drug misuse [61]. Other factors identified include 
lower IQ, poor social judgement, lack of empathy, and poor super- 
ego development. Environmental factors such as parenting practices 
(low monitoring or absence ofa parent) and experiences of physical 
and sexual abuse have also been reported. Epidemiological studies 
showed that males are more common perpetrators than females and 
more likely to fire-set on multiple occasions [61]. 

Kolko and Kazdin [62] used a bio-psychosocial framework to 
specify the individual, social, and environmental risk factors re- 
lated to the onset, severity, and maintenance of fire-setting be- 
haviour and highlighted the importance of attraction to fire, 
heightened arousal, impulsivity, and limited social competence. 
A review of different fire-specific assessment tools is available [60]. 
Assessing factors, such as caregiver supervision, behavioural mod- 
elling, and access to fire-setting materials, are included in fire risk 
assessment protocols [62] and as treatment targets in intervention 
programmes [60]. 

The first component of a multi-disciplinary intervention should 
include a home safety visit, as most fatal fires occur in the home. 
Other interventions include fire safety educational sessions (to 
promote the understanding of cause and effect), motivational 
interviewing (to assess capacity for change), and CBT (to increase 
the understanding of the behaviour, including antecedents and 
establishing the behavioural reinforcers) [60]. Parent management 
training for caregivers has been shown to be effective for young 
people with a range of antisocial behaviour. Caregivers working with 
young people provide opportunities to address environmental risk 
factors, including parenting approaches. 


Gangs and violence 


The definition of a gang includes a relatively durable group of young 
people who seem themselves as a group, engage in criminal ac- 
tivity, and have some form of structure. Longitudinal studies have 
identified many risk factors for poor outcomes in later life such as 
poverty, a lack of positive parenting, family conflict, maltreatment, 
and school failure, as well as individual factors [2]. There is an as- 
sociation between the number of risk factors experienced by young 
people and gang membership [63]. Qualitative research also high- 
lights a range of other drivers for gang involvement, including the 
need to feel protected and feel safe, the need to build and maintain 
status or to boost young peoples reputation, and using gangs as a 
substitute family unit [63]. The literature on young women in gangs 
includes family histories characterized by exposure to violence, mal- 
treatment, and abuse [64]. The gang offers an opportunity to feel safe 
and protected within a substitute family unit. 
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There is evidence that young people in gangs, and particularly 
young women, have higher levels of mental health needs [63, 64]. 
Many of these needs are long-standing and unmet, with few young 
people accessing early support [64]. 

The evidence base for interventions to reduce gang involvement 
is still evolving, with many interventions focusing on multiple risk 
factors affecting the young person or on the systems around the 
young person. Primarily developed in the US for males, rather than 
females, few of these interventions help to address young people’s 
mental health problems. MAC-UK’s Integrate approach is delivered 
as an adaptive, flexible intervention for young people who do not 
otherwise easily access support from professionals [65]. Integrate 
projects are intensive and use a multi-agency approach, reconfig- 
uring staff from other agencies to work in partnership with excluded 
young people within their local community. 


Legal framework 


A ‘child’ is defined by the Children Act [66] and the United Nations 
Convention on the Rights of the Child [67] as ‘any person under 
the age of 18. In England, Wales, and Northern Ireland, criminal 
responsibility is set at the age of 10. However, there is evidence that 
brain development continues into the early 20s, with the frontal 
lobes developing later [1]. These areas are involved in consequen- 
tial thinking, inhibition of impulses, empathy, and planning. These 
normal developmental stages are important to consider when as- 
sessing and passing comment on a child’s criminal responsibility 
[68]. Developmental level is important in the context of criminal 
liability (so-called ‘mens rea; the ability to form a guilty mind). 


Fitness to plead and effective participation 


Mental health professionals are frequently asked to assess a young 
person in relation to their ‘fitness to plead, which is a standard set in 
caselaw (Pritchard criteria). ‘Fitness to plead’ in this context requires 
that the individual (whether adult or child): 


Has the ability to understand the charge(s). 

Has the ability to decide whether to plead guilty or not guilty. 
Has the ability to follow the course of proceedings. 

Has the ability to instruct a solicitor. 

Has the ability to challenge a juror. 


a ee ie’ 


Has the ability to give evidence in his/her own defence. 


Important for children and young people is the principle of ‘ef- 
fective participation—which examines the ability of the young 
person to engage with the trial. A child’s age, level of maturity, and 
intellectual and emotional capacities must be taken into account 
when they are charged with a criminal offence, and appropriate steps 
should be taken in order to promote their ability to understand and 
participate in the court proceedings. The responsibility therefore 
falls on the defence lawyer to be aware of the possibility that a young 
person may not be able to participate effectively in the trial process, 
particularly if they are under 14 years old or have learning problems 
or a history of absence from school. 


Capacity 


One fundamental distinction in criminal law is between condi- 
tions that negate criminal liability and those that might mitigate the 


punishment deserved under particular circumstances. Very young 
children and the profoundly mentally ill may lack the minimum 
capacity necessary to justify punishment [68]. Those exhibiting less 
profound impairments of the same kind may qualify for a lesser level 
of deserved punishment, even though they may meet the minimum 
conditions for some punishment. Immaturity, like mental disorder, 
can serve both as an excuse and as mitigation in the determination of 
just punishment. Capacity is a feature that is both situation-specific 
and open to influence. 
Capacity is multi-faceted, with four key elements: 


1. The capacity to understand information relevant to the specific 
decision at issue (understanding). 

2. The capacity to appreciate one’s situation as the defendant is con- 
fronted with a specific legal decision (appreciation). 

3. The capacity to think rationally about an alternative course of 
action (reasoning). 

4. The capacity to express a choice among alternatives. 


Any evaluation of competence should include an assessment of 
psychopathology and emotional understanding, as well the cog- 
nitive level, the child’s experiences, and appreciation of situations 
comparable to the one relevant to the crime and to the trial, and any 
particular features that may be pertinent in this individual and this 
set of circumstances [68]. 


Conclusions 


Politicians and professionals have begun to acknowledge the im- 
portance of meeting the needs of offenders, as long-term costs to 
society become increasingly apparent. The major challenge of al- 
tering the trajectories of persistent young offenders has to be met 
in the context of satisfying public demands for retribution, together 
with welfare and civil liberties considerations. Over the last 30 years, 
there has been increasing research studies evidencing the risk fac- 
tors and developmental pathways to antisocial behaviour. 

This chapter has evidenced the high prevalence of mental health 
needs and neurodisability in young people who offend and outlined 
the implications for policy and practice, including recent develop- 
ments. It demonstrates the increasing importance of providing a 
standardized and evidence-based approach to screening and inter- 
vention for a variety of health needs. Provision of appropriately 
designed programmes can reduce recidivism among persistent of- 
fenders. Initiatives within the community include the development 
of liaison and diversion teams within police and court interfaces 
to screen and divert young people away from the criminal justice 
system where possible. Opportunities also exist through multi- 
agency partnerships. Early co-ordinated care is essential in meeting 
the complex needs of this group of young people, highlighting the 
important role of a multi-agency public health strategy. 
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Though widely condemned, and often severely punished, throughout 
the world, only a relatively small proportion of all reported offences 
involve sexual behaviour. Nonetheless, research has repeatedly 
shown that, even if taken only as minimal figures, large numbers 
of people, both children and adults, report having been subject to 
some form of sexually unwelcome behaviour [1]. The legal term 
‘sexual assault’ is now widely used in most Western countries to em- 
brace any type of non-voluntary sexual act in which the victim is 
coerced or physically forced to engage against their will, or any non- 
consensual sexual touching of another person [2]. It includes forced 
vaginal, anal, or oral penetration (rape), drug-facilitated sexual as- 
sault, groping, forced kissing, child sexual abuse, or sexual torture. 

The long-term effects of victimization have been extensively 
studied, and individuals with a history of childhood sexual abuse, 
for example, report a wide range of long-term psychological conse- 
quences [3]. And those who have been sexually assaulted as adults 
similarly and uniformly report extensive psychological, if not phys- 
ical, repercussions [4]. 

Although often perceived as the purview of specialized forensic 
services, nearly all psychiatrists will at some time meet victims of 
sex offenders, if not offenders themselves. The general psychiatrist, 
for example, is almost certain on occasions to encounter a ‘flasher’ 
(exhibitionist) or a ‘peeping Tom’ (voyeur), as these are usually 
viewed by authorities as minor sex offenders not needing specialist 
attention. It is therefore important for the generalist to have at least 
some knowledge of sex offenders and their treatment, as they may 
be called upon to provide a report for court purposes, especially in 
places where availability of expert forensic services is limited. 








A ‘sex offender’ is simply an individual whose sexual behaviour 
contravenes the law in a particular jurisdiction. However, crim- 
inological preoccupations with particular kinds of sexual offences 
have often varied considerably over time. The types of activities that 
may be proscribed will also vary considerably. Thus, in the last cen- 
tury, incest, homosexuality, and bigamy seem to have been the main 








concerns, and child molestation and sexual assault much less so. 
Though some are more tolerant than others, most societies provide 
sanctions for sexual activity involving children below the age of con- 
sent, non-consensual sexual acts, sexual relations with close family 
members, and sexual interference with animals or corpses. Typically 
also there are legal and other interventions available in cases where 
a person fears sexual harassment or assault, where abuse of a child is 
suspected, and where there has been abuse, or likelihood of abuse, in 
certain professional relationships. As well there is commonly regu- 
lation of pornography and other obscene material. 

‘Sexual deviance’ refers to those sexual behaviours that go be- 
yond societal norms. They may be subject to legal sanctions as 
well, though in many cases not. It involves consistent, rather than 
occasional, sexual interests and behaviours. Common perceptions 
of what is ‘normal’ will also be influenced by whether the acts are 
committed in public or private. As with sex offending, the con- 
cept of sexual deviance will vary greatly across jurisdictions and at 
differing times. 

Finally, ‘paraphilia is a diagnostic term popularized by Money 
[5] and currently used in official classification systems such as the 
World Health Organization’s International Classification of Diseases 
(ICD) and the American Psychiatric Association’s Diagnostic and 
Statistical Manual (DSM) to identify the experience of intense 
sexual arousal to atypical objects, situations, or individuals. The cur- 
rent edition DSM-5 introduced the term ‘paraphilic disorder’ which 
causes the individual distress or interferes with their personal func- 
tioning, whereas a ‘paraphilia’ in itself does not. The ICD prefers the 
term ‘disorders of sexual preference; but both systems require the 
characteristic sexual behaviours or preferences to be persistent or 
recurrent and to cause subjective distress or interference with their 
social, occupational, or other important areas of their lives. 

Although Money and others list numerous unconventional sexual 
interests and behaviours [6], in the two dominant classifications, 
worldwide only a few of the most common are listed individually 
and others relegated to a miscellaneous category such as ‘unspeci- 
fied paraphilia [7]. Not all sex offenders are paraphilic or have a 
paraphilic disorder, and not all those diagnosed with a paraphilia or 
paraphilic disorder are sexual offenders. The psychiatric categories 
of paraphilia and paraphilic disorders and their characteristics are 
outlined in Chapter 115 of this textbook. 
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Types of sex offender 


General psychiatrists will be concerned with sexual behaviours 
such as indecent exposure, obscene telephone calling, voyeurism, 
bestiality, incest, child molestation, and pornography collecting. 
Specialists will also be interested in sexually motivated homicide, 
the more serious forms of sexual assault, and necrophilia. Most 
sexual offenders are male, though a small number of women may 
commit similar crimes [8]. 


Child molesters 


Sexual abuse of children has been reported and generally con- 
demned throughout history [9], and the reportedly more permissive 
attitude in ancient Greece seems more or less an exception [10]. 

Thus, for example, a recent meta-analysis of 217 studies based on 
self-report data [11] estimated a global prevalence of 12.7-18% for 
girls and 7.6% for boys. A World Health Organization study [12] 
earlier reviewed estimates of childhood sexual abuse from 39 coun- 
tries and found that the prevalence of non-contact, contact, and 
intercourse in female children was approximately 6%, 11%, and 4%, 
respectively; corresponding figures for males was about 2% for all 
categories. It is well known that many victims do not report such 
experiences, and when guaranteed confidentiality, individual sex 
offenders admit to many more offences than they were charged 
with or convicted of [13]. Similarly, sex offending against adults has 
also been underreported, though it is estimated that about 13% of 
women and 3% of men have been sexually assaulted at some time 
during their lives [14]. 

An early typology [15] for this group was based on the degree of 
apparent paraphilic attraction (sexual deviancy). The ‘fixated’ sub- 
type identifies those paedophiles with an enduring attraction to chil- 
dren, typically dating from adolescence, and thereby conforming to 
the definitions of paedophilia in DSM-5 and ICD-10. Paedophiles 
attracted to male children are more likely to repeat their offences 
with recidivism at least twice as high as those attracted to girls. 
Homosexual paedophiles tend to victimize boys aged 11-15 years 
of age, whereas heterosexual paedophiles prefer girls of 8-10 years. 
‘Fixated’ paedophiles tend to commit premeditated offences that 
often involve considerable planning, manipulation, and ‘grooming’ 
behaviour to lure, and even abduct, children into sexual activity, and 
they may gain the trust of the parents or other caregivers. Often they 
appear to have an excellent rapport with the child victims and treat 
them kindly, but the motive is primarily for the child to meet their 
sexual needs, rather than the reverse. It is for this reason that ‘needy’ 
children are often selected as victims. Such offenders will typically 
profess their ‘love’ for children and convince themselves that their 
behaviour is not harmful. Other rationalizations, such as that they 
were educating the child or introducing the child to sexual love in a 
caring way, are also common. 

‘Regressed’ or ‘situational’ child molesters are, according to this 
typology, attracted primarily to adult females and may be ina marital 
relationship at the time of their offence. They will often report feel- 
ings of personal inadequacy or low self-esteem, and their offences 
are typically spontaneous and occur in the context of a stressful life 
circumstance. 

In contrast to the ‘fixated’ type, the ‘regressed’ child molester is 
less inclined to ‘groom victims and their caregivers. Victims of this 


type may be older than those involved with the ‘fixated’ type of mo- 
lester. However, molestations may begin before and continue be- 
yond puberty. 

Critical of earlier typologies such as this, as well as of the defin- 
ition in DSM, in their attempts to delineate paedophiles from other 
types of child molester, Knight et al. [16] developed a different 
model incorporating two axes that assess psychological issues, abuse 
behaviours, and the degree of sexual fixation. Though empirically 
validated, others have doubted its clinical usefulness because of its 
complexity [17]. It is worth noting that, in the literature, the terms 
‘child molester’ and ‘paedophile’ are often used interchangeably, 
tacitly acknowledging that it is often difficult to differentiate the two. 
Moreover, the treatment may be determined by actuarial estimates 
and are also very similar. 

Despite dire parental warnings to their children to be aware of 
strangers as potential sexual predators, the majority of individuals 
who molest children are either already acquainted or are related to 
them. The term ‘incest’ is used to identify situations in which a close 
blood relative is subjected to sexual abuse. Some researchers, based 
on statistical reoffence data, have suggested that the phenomenon 
is primarily a symptom of family dysfunction distinct from paedo- 
philia, and that the perpetrator represents a low risk of extending 
their behaviour outside the family [18]. However, other authorities 
have found that as many as half of self-reported father or stepfather 
incest offenders had also abused children outside their families and 
18% had raped adult women [13]. Some men pursue and subse- 
quently marry or cohabit with single mothers with the intent to gain 
access to their children. 

At one time, it was believed that paedophiles were inadequate, 
unaggressive men unable to form relationships with adult women. It 
is now well recognized that gratuitous violence is used by a substan- 
tial minority of paedophiles [19]. 

A further aspect of paedophilia that needs to be mentioned be- 
cause of its increasing identification in the age of the Internet is the 
use of computers to view, manufacture, and distribute pornographic 
images of children [20]. Such imagery is widely used by such men to 
generate sexual fantasies, and the behaviour will then be reinforced 
by masturbation. Some have argued that this sequence essentially 
normalizes the fantasy and further disinhibits the user. This com- 
bination of disinhibition and cognitive distortions regarding the im- 
pact of the behaviour may stimulate the urge to seek more intense 
experiences from ‘hands-on’ experiences, rather than simply on a 
computer screen. However, the risk of this progression is likely exag- 
gerated, as only a minority of Internet child pornography offenders, 
in fact, progress to molest a child [20]. Those with previous or con- 
current contact sexual offences are more likely to have more than one 
reconviction, whether general or sexual, than those with child porn- 
ography offences who are only rarely reconvicted [21]. Nevertheless, 
research has shown that there are psychological and other similar- 
ities between men who use child pornography and those who actu- 
ally do commit such offences. Child pornography offenders more 
often show evidence of arousal to children on phallometric labora- 
tory testing than those who committed ‘hands-on’ offences against 
children. 


Sexual assault of adult women 


As with those who molest children, men who perpetrate sexual 
assault on adult women (rapists) do not necessarily justify a 


diagnosis of a paraphilia [22]. Indeed attempts by some sexolo- 
gists to have individuals who appear to prefer sexual relations with 
non-consenting partners identified by a paraphilia label (terms 
such as biastophilia, rape preference, or paraphilia-not otherwise 
specified—non-consenting) have been repeatedly rejected, often 
based on feminist arguments that to do so would thereby ‘med- 
icalize’ inappropriately a predisposition in males generally to dom- 
inate women [23]. 

The typology proposed by Knight [24], though not extensively 
used and rather complicated for routine clinical use, does have 
the merit of highlighting the fact that several different motivations 
appear to underlie sexual assault of adult women. Thus, ‘anger- 
motivated’ rapists act out deviant fantasies of retaliation towards the 
victim, using violence as a means of expressing generalized anger, 
typically towards women specifically, but sometimes towards people 
in general. The motive of the attack is to humiliate and debase the 
victim who will typically have been picked randomly. The ‘power- 
motivated’ rapist is further divided into the ‘power-reassurance’ and 
‘power-assertive’ subtypes. The former is afflicted by doubts and in- 
securities about their own masculinity and sexual adequacy, often 
uses minimal force, and may apologize to the victim or even seek a 
relationship afterwards. However, the victims may have been stalked 
and the attacks on them premeditated. The ‘power-assertive’ subtype 
is motivated by the desire to dominate women, and such men have 
no doubts about their masculinity. However, unlike the previous 
subtype, they typically do not use gratuitous violence to subdue the 
victim. On the other hand, ‘sadistic’ rapists, the least common but, 
in many ways, most worrisome subtype, derive sexual pleasure from 
inflicting pain and suffering on the victim. However, this is often dif- 
ficult to differentiate from other rapist types where pain, suffering, 
and humiliation are also the consequences, but not the primary mo- 
tivation for the attack [25]. 

Other types of sexual assault may not involve any kind of pene- 
tration but represent related behaviours, including toucheurism 
and frotteurism. These involve, respectively, touching or grabbing 
strangers, typically females, in a way that provides the perpetrator 
with sexual gratification. The former, in particular, may pass un- 
noticed by the victim, as attackers will typically touch or grab sexual 
areas such as breasts, buttocks, or crotch, in crowds and similar situ- 
ations and the incident may be discounted as ‘accidental. 


Non-contact sexual offences 


There are several common types of sexual offence that do not in- 
volve any physical contact with the victim [26]. The group includes 
‘peeping Toms, ‘flashers, and indecent phone callers, with their cor- 
responding psychiatric diagnoses of the paraphilias voyeurism, ex- 
hibitionism, and telephone scatalogia, respectively. 

‘Peepers’ or voyeurs like to observe an unsuspecting female 
stranger undressing, or couples in the act of copulation. They may 
masturbate at the scene or later in private, while recalling what 
they saw. Most voyeurs, like most sex offenders generally who are 
paraphilic, are aware of their deviant impulses while still adolescent, 
but the behaviour may become chronic. Many men will admit to 
enjoying watching adult women in this situation or attend striptease 
shows, making these difficult to differentiate from those who prefer 
this activity. 

‘Flashers’ or indecent exposers will often justify a psychiatric 
diagnosis of exhibitionism. These men become sexually excited 
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while exposing their genitals, sometimes erect and sometimes not, 
to unsuspecting female strangers. The preferred reaction is one of 
shock or outrage. Complete indifference is a useful response if the 
victim has the presence of mind. The perpetrator may masturbate at 
the time or later in private. It is unusual to see an exhibitionist still 
active past the age of 40, but many are nevertheless for a time quite 
intractable and will expose themselves in situations where they may 
well be apprehended or reported. This suggests a degree of risk- 
taking that enhances the excitement. 

Obscene telephone callers, corresponding to the paraphilic diag- 
nosis of telephone scatalogia, call unsuspecting female victims and 
proceed to bombard them with explicit and sometimes frightening 
sexual comments, such that the victim may be very distressed by 
them. However, they often do not report them unless they are par- 
ticularly persistent or if their voice or something they say reveals 
their identity. Often the individual masturbates while doing this. If 
apprehended, which is uncommon, it may be difficult to distinguish 
them from a simple prankster. However, a repetitive pattern will 
usually point to the appropriate diagnosis. 

Though typically seen as ‘nuisance offences, they are, in fact, 
much more common and more psychologically disturbing than is 
generally appreciated, and recent research [27] has indicated that 
large numbers of young women have directly experienced the be- 
haviour of frotteurs and exhibitionists and experience considerable 
subsequent emotional distress. 

As well, these non-contact offences are not generally regarded as 
a danger. However, in some cases, estimates place them somewhere 
around 10%, and the individual may engage in related paraphilic 
behaviours or progress to more serious sexual assault [28]. Attempts 
to approach or speak to the victim may well be warning signs of this. 
The behaviour therefore should not be taken lightly, and all available 
information about the perpetrators’ actual behaviour in the index 
offence, as well as previous incidents, needs to be scrutinized with 
this possibility in mind. 


Assessment of sex offenders 


As was noted at the beginning of this chapter, more serious sex of- 
fences are likely to require the expertise of specialized forensic 
services. The less serious or less intrusive offences may well fall into 
the hands of a general psychiatrist, and it is useful therefore to have 
a sense of what is involved if the matter requires a report or even an 
actual testimony in court. 

Pre-trial situations usually require assessment of fitness (compe- 
tence) to stand trial, though sometimes criminal responsibility will 
also need to be addressed. Only a small number of sex offenders 
show symptoms of a psychotic mental disorder such as schizo- 
phrenia or depression [29]. However, when such disorders are pre- 
sent, it will be necessary to explore whether and how the symptoms 
bear specifically on the sexual behaviour [30]. For example, the pa- 
tient may report that they were acting in response to auditory hal- 
lucinations or specific delusional ideas about the victim or type of 
victim. 

It is more likely, however, that other psychiatric disorders will 
be found among sex offenders. A diagnosis of mental retardation or 
personality disorder or, more particularly, paraphilia will be more 
relevant with respect to the offending behaviour [31]. 
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Especially in more serious cases, the court may be interested in 
sentencing considerations, specifically the risk the offender presents 
to others and whether and what medical and/or other professional 
treatment or interventions might be ordered or recommended by 
the court to reduce that risk. 

Few sex offenders will have presented to a psychiatrist or psycholo- 
gist voluntarily. It is therefore helpful for the interviewer to assume 
that the subject will be at best guarded or possibly frankly hostile in 
their response to the assessment. It may be difficult to achieve the 
level of rapport more commonly experienced with non-psychotic 
general psychiatric patients. A non-judgemental approach is there- 
fore desirable, regardless of the examiner's personal emotional reac- 
tion to the offences that are alleged. For example, an offender who 
has molested a child, or exposed to a group of children, will often 
arouse very strong feelings in the examiner, particularly when they 
have children of their own, and it is necessary to hold such reactions 
in check. Judging the alleged offender, especially when his guilt has 
not been determined by the court, is not an appropriate professional 
attitude. 

Denials, rationalizations, distortions, and minimization are the 
kinds of cognitive distortions that are the norm with sex offenders. 
There is a frequent tendency to prevaricate or frankly lie. It is there- 
fore essential for the examiner to have detailed information from 
police reports and elsewhere about the act or acts that are alleged to 
have occurred. Direct, interrogation-style confrontation is usually 
not particularly productive, and an oblique approach may obtain 
more information that will be useful in identifying a diagnosis and 
determining rational treatment possibilities. 

Particularly unhelpful in gaining rapport or obtaining informa- 
tion is to accuse the individual of dishonesty or dissembling. A sym- 
pathetic approach, suggesting that sometimes people have difficulty 
accepting unpleasant aspects of themselves, may be more helpful. 
Alternatively, one can invite the subject to explore why the victim 
might have made the accusations if they are untrue and whether 
they can accept, if not the whole, then some parts of the accusa- 
tions against them. Also important to remember is that while the 
individual may have been accused of one type of deviant sexual be- 
haviour, other types of paraphilic behaviour may also have occurred 
and been undetected. Paraphilic disorders tend not to occur alone, 
but rather in association with other paraphilias [32], typically at least 
two or three. Thus, for example, an exhibitionist may have been re- 
ported for exposing specifically to children, rather than adults, and 
this will suggest an additional diagnosis of paedophilia. Similarly, an 
individual who is being convicted of rubbing himself against women 
in public places may also have made obscene phone calls and har- 
bour fantasies of rape. 

In the case of child molesters, it is important to consider how the 
perpetrator gained access to his victims. Exploration of the methods 
of ‘grooming’ is important in understanding ways to assist the of- 
fender to avoid risky situations in the future. 

In terms of the overall assessment, identification of psychopath- 
ology outside the domain of sexual deviation is important. The 
presence of psychotic illness will have implications for the type of 
treatment to be recommended, even if it is not common among 
sex offenders [30]. It seems likely that, in some such offenders, the 
sexual behaviour is dismissed as a function of their mental illness 
and the underlying paraphilic disorder may easily be discounted 


or not be considered at all. More commonly, it will be personality 
disorders or traits, alcohol or substance abuse, mild to moderate de- 
pression, and anxiety disorders [31, 33], rather than major mental 
illness that will be noted. Attention to these will nevertheless be 
an important part of any subsequent treatment or management 
recommendations [33]. 

Psychometric testing may contribute additional information to the 
overall assessment of a sex offender, in particular where the subject 
is not forthcoming in a personal interview. There are a number of 
general personality assessment instruments available. These include 
the well-known and widely used Minnesota Multiphasic Personality 
Inventory (MMPI) , the Millon Clinical Multiaxial Inventory-III 
(MCMI-II), and the Personality Assessment Inventory (PAI). All 
have been, and are being, extensively used in offender populations, 
including sex offenders, and common profiles have been identified. 
Although none can specifically identify a sex offender, the informa- 
tion gained concerning impulsivity, denial, judgement, and general 
psychopathology may be very useful. 

In addition, there are a number of psychological tests that have 
been developed specifically for the assessment of sex offenders. 
These include the Multiphasic Sex Inventory-2 (MSI-2) [34] and the 
revised version of the Clarke Sex History Questionnaire (SHQ) for 
Males [35]. The MSI-2 is designed to measure the sexual characteris- 
tics of an adult male (there is a female version as well) alleged to have 
committed a sexual offence or sexual misconduct, including those 
who deny the allegations. Though standardized in the United States 
ona large sample of sex offenders, it is used very widely. It consists of 
560 true/false questions, and the completed questionnaire must be 
sent to the developers for computerized scoring and interpretation. 
The SHQ consists of 508 questions, and the completed question- 
naire may again be sent away for scoring and an interpretive report 
returned. 


Phallometric testing (penile plethysmography) 


Because sex offenders commonly lie and distort their self-report of 
deviant interests and behaviours, a more objective method of as- 
sessment has long been sought. One of the earliest to be developed 
was penile plethysmography (PPG)—phallometry—to measure 
changes in response to erotic stimulation [36]. This method involves 
measuring changes in the size of the penis, either in circumference 
or volume, while presenting the subject with carefully selected im- 
ages, both still and moving, of both sexes and different age groups, 
and audiotaped descriptions of various sexual activities. There are 
certainly problems with PPG testing [37, 38], including the stand- 
ardization of stimulus materials used [39, 40]. Some offenders are 
able to either learn to suppress their physiological response or mas- 
turbate before the testing in order to render themselves unrespon- 
sive. Nonetheless, PPG, most commonly using a circumferential 
device or the volumetric method, is extensively used in the assess- 
ment of sex offenders. 

In the United States in particular, PPG has come under attack 
[41, 42] as, in addition to standardization and reliability issues, it has 
used pictures of children whose consent or that of their parents had 
never been obtained. Computer-generated images have been devel- 
oped in an attempt to obviate this concern [43]. 


Other less intrusive methods of assessment are also widely 
adopted, including the Abel screen [44, 45] which measures time the 
subject spends viewing non-nude images. 

Mention must also be made of the polygraph, or ‘lie detector; 
with sex offenders which is being used extensively in many parts 
of the United States and in the United Kingdom [46, 47]. The 
subject is asked questions relating to the sexual interests and ac- 
tivities, while their pulse, respiration, and skin conductance are 
measured. Research in the area is generally weak, and despite 
its widespread use and perception of usefulness, particularly for 
monitoring sex offenders in the community, the method is con- 
troversial [48, 49]. 


Assessment of risk in sex offenders 





An assessment of the future risk of sex offending is often requested 
by the court, so it is important for any psychiatrist being asked to 
provide this to have an understanding of the factors that contribute 
to this (Boxes 141.1 and 141.2). 


Box 141.1 Established risk factors for sexual recidivism 


e Sexual criminal history: 
— Prior sexual offence 
— Victim characteristics (unrelated, strangers, males) 
Early onset of sex offending 
Diverse sexual crimes 
— Non-contact sexual offences. 
Sexual deviance: 
— Any deviant sexual preference: 
e Sexual preference for children 
e Sexualized violence 
e Multiple paraphilias 
— Sexual preoccupation. 
Attitudes tolerant of sexual assault. 
Lifestyle instability/criminality: 
— Childhood behaviour problems (for example, running away, 
grade failure) 
— Juvenile delinquency 
— Any prior offences 
— Lifestyle instability (reckless behaviour, employments instability) 
— Personality disorder (especially psychopathy) 
— Grievance/hostility. 
Social problems/intimacy deficits: 
— Single (never married) 
— Conflicts with intimate partners 
— Hostility towards women 
— Emotional congruence with children 
— Negative social influences. 
e Response to treatment/supervision: 
— Treatment dropou 
— Non-compliance with supervision 
— Violation of conditional release. 
e Poor cognitive problem-solving. 


e Age (young). 


Source: data from J Consult Clin Psychol., 66(2), Hanson RK, Bussiere MT, Predicting 
relapse: a meta-analysis of sexual offender recidivism studies, pp. 348-62, Copyright 
(1998), American Psychological Association; Sex Abuse, 22(2), Mann RE, Hanson 
RK, Thornton D, Assessing risk for sexual recidivism: Some proposals on the nature 
of psychologically meaningful risk factors, pp. 191-217, Copyright (2010), SAGE 
Publications. 
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Box 141.2 Characteristics with little or no relationship 


with sexual recidivism 





e Victim empathy. 

e Denial/minimization of sexual offence. 

e Lack of motivation for treatment. 

e Clinical impressions of ‘benefit’ from treatment. 

e Internalizing psychological problems (anxiety, depression, low 
self-esteem). 

e History of being sexually abused as a child. 

e Sexual intrusiveness of sexual offences (for example, intercourse). 
e Low social class. 





Source: data from J Consult Clin Psychol., 66(2), Hanson RK, Bussiere MT, Predicting 
relapse: a meta-analysis of sexual offender recidivism studies, pp. 348-62, Copyright 
(1998), American Psychological Association; Hanson RK, Morton-Bourgon K, 
Predictors of sexual recidivism: An updated meta-analysis 2004-02, Copyright 
(2004), Public Works and Government Services Canada; J Consult Clin Psychol., 73(6), 
Hanson RK, Morton-Bourgon KE, The characteristics of persistent sexual offenders: a 
meta-analysis of recidivism studies, pp. 1154-63, Copyright (2005), American 
Psychological Association. 











Risk factors have been divided into: 


1. Static risk, that is, involving those factors which cannot change, 
such as the offender's age, sex, or number of previous criminal 
convictions; and 

2. Dynamic risk, that is, involving those factors which poten- 
tially could change, either as a result of treatment or some other 
intervention or simply the passage of time. This can be further 
subdivided into: relatively stable, but nonetheless potentially 
changeable; factors such as sexual preferences or negative atti- 
tudes; and acute factors such as access to victims, reversion to 
substance use, and active mental illness. 


Static factors have received the most scientific study, and it is 
chiefly these that have been incorporated into various actuarial or 
statistical instruments that have been developed, based on follow-up 
studies of samples of sex offenders [50]. Among these instruments, 
the Static-99 and its recent modifications [51] and the Sex Offender 
Risk Appraisal Guide (SORAG), now replaced by the Violence Risk 
Appraisal Guide-Revised (VRAG-R) [52], are the most widely used. 
Though the assessment instruments have been published or are 
available online, a professional intending to use any of these instru- 
ments needs to be thoroughly familiar with the literature on the topic 
of risk assessment and to have participated in training workshops 
that are given at conferences and elsewhere from time to time [53]. 
Useful though these tools can be, too heavy reliance upon them is no 
substitute for a full understanding of how they were constructed and 
their limitations [54]. 


Treatment issues 


It is rare for a paraphilic individual to present for treatment in order 
to prevent themselves from becoming a sex offender (though see 
[55]). While some paraphilias are usually not associated with crim- 
inal behaviour (for example, transvestic disorder or fetishistic dis- 
order), others, such as paedophilia or sexual sadism, are much more 
likely to be referred from the court or subsequently by probation and 
parole services. 


1437 








SECTION 24 Forensic psychiatry 


It is important to realize that, though some dissent from this view 
[56], sexual preferences are highly resistant, if not impossible, to 
change (see [57]). The most that can usually be expected with sex of- 
fenders who have deviant sexual preferences (paraphilic disorders) 
is to help them learn to control their behaviour and to recognize 
that their propensity will always remain in the background, much 
as alcoholics are advised to consider themselves always vulnerable 
to relapse. 

Psychological treatments have often been attempted with sex of- 
fenders [58]. Psychodynamically based individual and group treat- 
ments have been the most commonly used in the past. It has become 
clear more recently, however, that cognitive-based therapies are the 
preferable strategy to employ, although techniques involving clas- 
sical behaviour therapy, for example covert sensitization, are also 
sometimes used for specific purposes such as creating aversion to 
deviant arousing images and replacement of these with non-deviant 
ones. Cognitive-based therapy itself involves helping the subject to 
develop strategies to alter their thought processes in order to avert 
their deviant behaviour, to improve their social skills, and to remedy 
their distorted beliefs and attitudes. 

There is little or no evidence for the efficacy of psychological treat- 
ments prior to the introduction of cognitive-based therapy. Based on 
a meta-analysis of 43 published studies, it has been shown that treat- 
ment programmes using this approach are associated with a reduc- 
tion in overall recidivism rate from about 17% to 10% [59]. Similar 
findings have been reported by others [60], though, as Hanson and 
Yates pointed out, there have been very few high-quality treatment 
studies [55]. The most effective programmes are those that adhere 
to risk/need/responsivity principles (see [61]). Thus, the highest in- 
tensity of treatment should be given to those of highest risk; treat- 
ment must address factors known to be associated with a risk of 
recidivism, and the therapy needs to be presented in a language and 
format such that the culture and learning styles of the offenders are 
addressed adequately. 

Psychological treatments of all types have, until recently, focused 
on offenders’ deficits and difficulties, rather than their strengths 
and other assets. The development of the ‘Good Lives Model’ [62] 
marked a turning point, and this approach has been integrated in to 
many of the programmes currently in use. The aim of this treatment 
is to help the sex offender meet his needs more appropriately and 
achieve pro-social life goals, thereby reducing the attractiveness of 
his offending behaviour [63]. 

Another approach that has yielded some very positive results 
has been the Circles of Support and Accountability (CoSA) [64]. 
Originating with leaders of the Mennonite Church in Canada, it has 
been adopted in a way that does not require religious affiliation. It 
is designed to provide intensive support for offenders upon release 
from prison and develop realistic risk management strategies to 
achieve successful community reintegration. 

Despite some apparent progress, there is still controversy over the 
effectiveness of sex offender treatment programmes, and there re- 
mains a continuing need for better designed research. 


A number of different medications have been used to treat sex of- 
fenders [65-70]. Based on empirical observations of animals, which 


become less sexually active following neutering, hormonal treat- 
ments that reduce testosterone levels have been extensively em- 
ployed. All require careful discussion with the potential patient 
concerning side effects, and it is important to obtain written, in- 
formed consent [71]. 

Oestrogens proved to be problematic because of the serious risk 
of thromboembolic complications, and a safer alternative was found 
in cyproterone acetate (Androcur ) [72], an anti-androgen which is 
available in Europe, including the United Kingdom, and in Canada, 
but not in the United States. Medroxyprogesterone acetate (Provera ) 
[73, 74] was introduced as an alternative. Both drugs may be respon- 
sible for minor side effects such as weight gain, tiredness, and gy- 
naecomastia (especially with cyproterone), as well as more serious 
problems, including thromboembolism and increased blood sugar. 

More recently, luteinizing hormone-releasing hormone (LHRH) 
agonists [75-77], such leuprolide acetate (Lupron), goserelin 
(Zoladex ), and others, have been found useful as they produce al- 
most total suppression of testosterone production, such as would 
be seen following surgical castration [78]. However, they tend to be 
used mainly in very high-risk offenders or those who have failed 
with other drugs [79]. 

Reduction of oestrogen or androgen removes restraining effects 
on osteoblastogenesis and osteoclastogenesis, and causes an imbal- 
ance between resorption and formation of bone by prolonging the 
lifespan of osteoclasts and shortening the lifespan of osteoblasts. 
Thus, all of these hormone-reducing substances, though especially 
the LHRH agonists, have a tendency to increase the rate of bone re- 
modelling and weaken its structure; it is necessary to monitor care- 
fully for this side effect through annual bone density scanning and 
to administer antidotes, including calcium supplements, vitamin D, 
and possibly bisphosphonates [80]. 

However, the main problem with hormonal treatments is their 
lack of acceptance by those who might potentially benefit. An alter- 
native approach using serotonin reuptake inhibitors has therefore 
been better received, though the necessary double-blind trials are 
still lacking [81]. They depress libido in only about 50-60% of cases. 
Higher doses, such as those used in obsessive-compulsive disorder, 
are sometimes necessary. The risk of prolonging the QT interval on 
the electrocardiogram is then one of the few reasons for which to 
choose one selective serotonin reuptake inhibitor (SSRI) over an- 
other, besides patient tolerance of different subjective side effects 
like nausea and sleep disturbance. 


Ethical problems 


Several ethical issues have so far been mentioned in passing. 
However, it is worth re-emphasizing in conclusion that a disinter- 
ested professional demeanour is important when assessing sex of- 
fenders. No matter what his or her own private views, it is not the 
place of the clinician to decide on guilt or innocence or in any 
other way to pass judgement on the offender or alleged offender. 
Moreover, alienating the offender will present a further impediment 
to gaining information and to providing treatment when indicated 
and necessary. 

It is important, at the assessment stage, to identify for the subject 
the nature of the evaluation, the role of the assessor, and the person 
or agency for whom they are acting, for example a child protection 


service, a defence lawyer, or a crown prosecutor. This may limit the 
degree of co-operation. However, when the purpose of the evalu- 
ation is solely to provide a risk assessment, not to explain this fully 
(and simply to present oneself as a ‘doctor’ in a helping role) would 
be unethical. 

Certain assessment procedures, such as PPG or polygraphy (when 
used in a clinical setting), are particularly contentious. Though both 
may provide useful information, written and fully informed consent 
should always be obtained beforehand. 

Finally, when the drugs that have been developed and marketed 
for other purposes are used to suppress sexual drive (so-called ‘off- 
label’ use), the patient needs to be fully informed of the potential 
benefits, as well as the risks involved, and should not be denied com- 
plementary or alternative treatments should they decide not to ex- 
pose themselves to potential side effects. They also need to be aware 
that, though the clinician is not in a position to insist on such treat- 
ment, their refusal to participate may well have repercussions with 
probation and parole officers or the court itself, and these issues need 
to be fully discussed with the patient to avoid misunderstandings. 
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Stalking occurs when a person repeatedly intrudes upon another in 
such a manner that the recipient fears for his or her safety or experi- 
ences significant distress [1]. The intrusions can involve following, 
loitering nearby, maintaining surveillance, approaching the victim, 
and communicating via telephone, letters, email, social media, or 
notes attached to property. Associated behaviours include ordering 
or cancelling goods and services on the victims behalf (for example, 
pizzas or taxis in the former, and utilities such as gas or electricity 
in the latter), spreading malicious rumours, vexatious complaints, 
threats, property damage, and assault. 

Stalking is motivated by a range of intentions. Some stalk in hope, 
some in anger, some in lust, some in ignorance, and some in mix- 
tures of these motives [1]. Research suggests that the overarching 
term ‘stalking’ encompasses two basic patterns [3]. The first involves 
incursions that are largely confined to following or unwanted ap- 
proaches perpetrated usually by a stranger, which last only a day 
or so before ending, usually without the need for intervention. The 
second is characterized by a broader range of intrusions, perpetrated 
most commonly by someone known to the victim (for example, an 
ex-partner, acquaintance, colleague, or neighbour), that lasts for 
months or even years. The first type of stalking is often distressing 
but rarely culminates in threats or physical violence or inflicts psy- 
chosocial damage. The second type is far more likely to be associated 
with longer-term psychological disturbance and lifestyle alterations, 
as well as threats and assault. The watershed between these two 
forms of stalking is the continuation of the behaviours for more than 
2 weeks [3], which is important to recognize from an early interven- 
tion perspective. 

Stalking is proscribed as a criminal offence in most English- 
speaking countries, as well as a number of European countries 
(Austria, Belgium, Denmark, Germany, Italy, Malta, and the 
Netherlands), although to date only Japan and India have enacted 
laws in the Asian region. The drafting of stalking legislation has not 
been without controversy [4, 5], particularly since stalking behav- 
iours can overlap with interactions that, however unwelcome or 
inappropriate, are nonetheless part of many people’s everyday ex- 
perience (such as being pursued for a relationship) or regarded as 
culturally acceptable. 





Epidemiology of stalking 


Estimates of the prevalence of stalking, as with any other phenom- 
enon, will vary according to definition, sampling, method of enquiry, 
and the willingness of subjects to respond and disclose their experi- 
ence. Reported lifetime rates of victimization suggest that stalking is 
a prevalent social problem, affecting an estimated 10-15% of adults 
[6-10]. Stalking also occurs among juveniles [11, 12], with an es- 
timated 12-month incidence of 16.5% for victimization and 5.3% 
for perpetration [13]. Irrespective of age, the majority of victims are 
female (70-80%), while stalkers are male (80-85%). Victims are typ- 
ically pursued by someone known to them (80%), though strangers 
account for a significant minority. 

Cyberstalking has attracted considerable media interest, but few 
systematic studies. Broadly defined as repeated, unwanted contacts 
via the Internet, social media, and other communication technolo- 
gies, ‘pure’ cyberstalking is relatively uncommon, affecting an es- 
timated 6% of Internet users [14] or 7% of stalking victims [15]. 
Cyberstalking appears to be one more technique in a perpetrator’s 
arsenal, rather than a distinct type of activity. The level of concern 
about cyberstalking appears to be driven more so by the potential, 
rather than the actual, size of the problem. Well over half a billion 
people share personal information every day on Facebook alone, 
let alone other platforms such as Twitter or Instagram. The scope for 
cyberstalking, particularly gathering information about a victim, is 
undoubtedly immense. 


The impacts of stalking 





Stalking is an act of violence in itself that causes psychological dis- 
tress and social disruption (and, in many cases, occupational im- 
pairment), as well as a harbinger of assault. Being stalked can 
produce a corrosive state of fear, arousal, and helplessness. As with 
intimate partner violence, for most victims, it is not the blows which 
are the most destructive, but living in a chronic state of intimidation 
or looming vulnerability. Clinical and epidemiological studies have 
indicated that the majority of stalking victims report psychiatric 
morbidity, most commonly disruptive levels of anxiety (including 
hyperarousal), sleep disturbance, and lowered mood [16-18]. A sig- 
nificant minority also report considering suicide to escape the situ- 
ation [16] and/or admit to homicidal fantasies toward the stalker [1]. 


Psychiatric morbidity appears to be mediated not only by stalking- 
related factors, particularly threats and ongoing harassment, but also 
by individual vulnerability factors, especially the use of avoidance as 
a coping strategy [19], the latter being amenable to treatment. 


Stalkers: classifications and typologies 


Stalking, like most forms of complex human behaviour, can emerge 
from a wide range of psychological, social, and cultural influences. 
In the absence of any comprehensive explanatory theory of stalking, 
over 20 typologies have flourished [20], most of which categorize 
according to the stalkers prior relationship with the victim, an 
underlying mental disorder, and/or the primary motivation for the 
stalking. 

Classifying stalkers by the nature of their prior relationship with 
the victim has the advantages of simplicity and utility. Mohandie and 
colleagues [21] used this approach in their classification, which div- 
ides stalkers into those with a prior relationship and those without. 
This classification’s greatest utility is in predicting the risks of assault, 
with those targeting ex-intimate partners constituting the highest- 
risk group and those targeting public figures/strangers the lowest. 
The nature of the prior relationship is recognized as a critical aspect 
of any classification but, in isolation, is insufficient, as it is unlikely to 
adequately guide clinical or legal interventions. 

The typology developed by Mullen and colleagues [1, 2] is multi- 
axial, depending primarily on the context in which the stalking 
emerges and the motivations which initiated and sustain the be- 
haviour, along with the nature of the prior relationship and any 
underlying mental disorder in the perpetrator. Its appeal has been 
primarily to clinicians managing stalkers and stalking victims [22]. 
The typology enables predictions to be made about the duration 
of the stalking, the nature of the stalking behaviours, the risks of 
threatening and violent behaviour, and, to some extent, the response 
to management strategies [1, 23]. There are five main stalker types: 


1. The rejected whose stalking begins in the context of the break- 
down of a close (usually sexually intimate) relationship. The 
stalking is initially motivated either by the desire for recon- 
ciliation or to express rage at rejection, with a mixture of both 
being common. The stalking is often sustained by the pursuit 
becoming a substitute for the lost relationship, with the satisfac- 
tions from intrusion and control replacing those of intimacy. 

2. The intimacy seeker who is pursuing love. The stalking begins 
in the context of a life lacking intimacy and is motivated by the 
hope, or firm expectation, of obtaining a loving relationship with 
a person on whom they have fixed their amorous attentions. The 
pursuit is sustained in the face of indifference or outright rejec- 
tion, because better a love based on fantasy or delusion than no 
love at all. 

3. The incompetent suitor who is pursuing a sexual encounter or 
friendship. This usually begins in the context of loneliness and 
is motivated by a desire to start some form of relationship with 
someone who has attracted their interest. This group often 
pursues intensely but rarely persists for prolonged periods, pre- 
sumably because multiple rebuffs bring few rewards. 

4. The resentful, whose stalking starts in the context of a grievance 
at being unjustly treated or humiliated. The initial motivation is 
revenge, but this gives way to the satisfactions obtained from the 
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sense of power over someone who has previously been experi- 
enced as an oppressor. 

5. The predatory, which begins in the context of the desire to act 
out violent or sexual fantasies often of a sadistic or paedophilic 
nature. The initial motivation is to gain information about the 
movements of a potential victim (usually a stranger or acquaint- 
ance). The stalking continues because of the satisfactions ac- 
cruing not just from voyeurism, but from the excitement and 
sense of power which comes from rehearsing the planned attack 
in fantasy while watching the future victim. 


Each of the stalker types (hopefully with the exception of the 
predatory) has correlates in normal behaviour [24]. When rela- 
tionships break down, one partner is often confused or distressed 
by the separation and seeks to understand, reconcile, or express 
anger. The incompetent suitor is akin to the awkward adolescent 
or socially inept adult who fails to traverse the social minefields of 
dating or simply making acquaintance. The intimacy seeker is the 
adolescent crush and the enthusiastic fan writ large. Even the re- 
sentful is not far removed from some seekers after justice and those 
asserting their rights. In theory, the boundary between persistent 
approaches as part of socially acceptable behaviour and the crime 
of stalking is difficult to isolate. In practice, the distinction is rarely 
a problem. Stalkers are those who repeatedly impose themselves 
on another person in a manner that creates obvious distress. It is 
the disregard of, or blindness to, the disturbance that their behav- 
iour creates that distinguishes the stalker from their more normal 
counterparts. 


Psychopathology of stalkers 


Stalkers are rarely drawn from the psychologically adequate or so- 
cially able of the world. The proportion of stalkers whose behaviour 
is directly related to mental illness varies according to where the re- 
searchers derived their sample (for example, clinical settings vs law 
enforcement) and the proportion of strangers or acquaintances in 
the sample, with the rates of mental illness, particularly psychosis, 
highest in these groups [25, 26]. 

In broad terms, psychotic disorders are relatively frequent in the 
intimacy-seeking group. In the resentful type, paranoid disorders 
unsurprisingly predominate, though most are not associated with 
frank delusion. The rejected most often have problems around de- 
pendency, entitlement, rigidity, and control, with substance abuse 
and depressive states occasionally complicating the picture. The in- 
competent suitors are socially disabled by shyness, narcissism, intel- 
lectual limitations, or autism spectrum disorders, but always with 
interpersonal insensitivity or indifference. The predatory are sexu- 
ally perverse and not infrequently have marked psychopathic traits, 
but they are rarely psychotic [1]. 

Attachment theory has unsurprisingly been evoked to explain 
stalking [27]. That stalkers as a group do not manage interpersonal 
relationships very well is obvious. Attachment style has been shown 
to differ according to stalker type [28], which may be useful in the 
formulation process, but what connection it may have with any 
theory of early development is speculative. A deficit in verbal, rela- 
tive to performance, intelligence quotient (IQ) has also been found 
[29], which should be considered in relation to an individual’s ability 
to engage in particular psychological treatments. 
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Stalking of health professionals 


Health professionals have a heightened vulnerability to being stalked 
by their clients [30-32]. The risk stems both from resentful, disap- 
pointed, or disgruntled patients, and lonely and disordered people 
who misconstrue empathy and attention for romantic interest 
[33, 34]. While some stalking behaviours constitute little more than 
minor irritations, they may also ruin a clinician’s career. 

Rates differ according to the setting and professional groups sur- 
veyed. For example, 53% of clinical staff in an inpatient psychiatric 
service reported being stalked by a patient [35], compared to 21% 
of psychiatrists in a large mental health organization [36]. General 
practitioners report high rates of intrusiveness by patients (70%), 
with 20% having been stalked and a further 20% subjected to a short 
burst of harassment [37]. Almost one in five psychologists report 
stalking by a client [38], 30% of whom were subjected to vexatious 
complaints to professional boards, which left many considering 
whether they wanted to continue with their profession. 

Clinicians who fall victim to stalking by patients may bear the 
additional burden of implied or overt criticism from colleagues to 
the effect that, had they more adroitly managed the therapeutic en- 
counter, they would not now find themselves in this predicament. 
There should be no tolerance with blaming the victim, even if it 
comes in the guise of collegial advice or supervision. Being stalked 
is an occupational hazard for the health professions. Affected col- 
leagues should be accorded support and help, if for no other reason 
than we do not know when it may be our turn to face the pursuit of 
the vengeful or lustful patient. 


Risk assessment and risk management 


Assessing and managing the stalker requires a primary focus on 
the risks they present to the victim, in terms of physical violence, 
as well as psychosocial damage. The risk that stalkers incur from 
their own behaviour also needs to be considered. The conflict be- 
tween the stalker’s desires and the victims interests is obvious, 
but they are at one in being at risk of damage from the stalking 
situation. There can be a tragic symmetry between the victim be- 
coming preoccupied with the stalking and forced to live an in- 
creasingly restricted life in a state of apprehension and the stalker 
devoting all his/her time and resources to a damaging and ul- 
timately self-defeating pursuit. The victims and the perpetrator’s 
lives can be laid waste. This is not to argue for equivalence be- 
tween victim and perpetrator, but merely to note that they share 
the chance of disaster. A perspective which encompasses the risks 
to stalkers and victims has the advantage for health professionals 
of reducing the ethical dilemma when treating stalkers around 
whose interests one is serving, the patient's or their victim’s. Both 
are assisted to the extent that treatment stops the stalking or re- 
duces its damaging impacts. 


The empirical basis for evaluating risk in the 
stalking situation 


Risk assessment in stalking situations is hampered by a paucity 
of retrospective or prospective studies of representative samples. 
Clinicians do not have the luxury of deferring action until such evi- 
dence emerges, and so must instead depend on integrating know- 
ledge from stalking research or systematic studies of risk in other 
areas and drawing on clinical experience [1]. 


The risk of continued or recurrent stalking 


Once stalking has continued for more than 2 weeks, the chances are 
high that it will continue for months [3]. The duration is longest for 
intimacy seekers and rejected stalkers pursuing ex-intimates, with 
the incompetents usually pursuing only briefly [1, 2]. Stalking recid- 
ivism (of the same victim) is associated with mental status, person- 
ality disorder, and prior criminal offending [39, 40]. 


The risks of psychosocial damage to victims 


There is some indication that female stalking victims report a greater 
psychological impact than males [6, 16]. Male victims are less in- 
clined, however, to report fear (a requirement in most legal defin- 
itions) when they are stalked by a female; this likely explains the 
preponderance of same-gender stalking among men in community 
samples [6, 8]. Psychological distress is higher among victims sub- 
jected to prolonged and repeated following and the experience of 
property theft or destruction [17, 41, 42]. The relationship between 
psychological impact and the experience of physical violence is less 
clear, despite its intuitive appeal [42, 43]. 


The risks of physical violence 
Prior relationship 


Victims who have shared a prior intimate relationship with their 
stalker are at high risk of assault [44-47]. In a random commu- 
nity sample, ex-intimates were the most likely to be attacked (56%), 
followed by estranged relatives or friends (36%), casual acquaint- 
ances (16%), work-related contacts (9%), and strangers (8%) [8]. 
Such findings should not be interpreted as suggesting that victims 
of stalkers who are not ex-intimates are in little danger of physical 
violence. A chance of between 8% and 36% of being assaulted is no 
small risk. 


Threats 


Between 30% and 60% of stalking victims are threatened [2, 26, 44]. 
In a community-based study, 44% of those threatened were subse- 
quently assaulted and 73% of victims assaulted by their stalker had 
previously been threatened [8]. In short, threats are highly associ- 
ated with violence and should be taken seriously. 


Mental disorder 


Research has generally concluded that psychotic stalkers are less 
likely to be physically violent than their non-psychotic counter- 
parts, but the relationship to personality disorder remains unclear 
[46, 48, 49]. 


Substance abuse 

Substance abuse is associated with violence in the stalking situation 
[2, 50-52]. 

Prior offending and antisocial behaviour 


Empirical data on the association between past criminal or violent 
behaviour and stalking violence are inconsistent [43]. However, the 
balance of the evidence favours such a relationship [44, 50]. 


Demographic variables 


The gender of stalkers has no impact on the rates of threats or as- 
sault [53-55]. Preliminary research indicates that the prevalence of 


threats and assault is higher among juvenile stalkers, compared to 
their adult counterparts [12], suggesting that younger perpetrators 
prefer more direct forms of harassment, as opposed to some of the 
more surreptitious pursuit employed by adults. 


The nature of the stalking 


Violence is predicted by escalating intrusiveness and the intensity of 
the stalking behaviour [56]. The strongest association is with phys- 
ical intrusions into the victims house or place of work [49]. 


Assessment and management 


Initial assessments of stalkers often occur in the context of pre- 
sentence or parole board evaluations. Victims may be encountered 
in a wider range of contexts, many seeking help from general, ra- 
ther than forensic, mental health professionals. Stalkers usually 
lack insight into their behaviour and tend to deny, minimize, and 
rationalize their actions. Victims often minimize the experience of 
stalking and over-emphasize their own responsibility for the har- 
assment, which should be of no surprise to anybody experienced 
in working with victims in other contexts. Conversely, the problem 
of false claims of stalking victimization cannot be entirely ignored 
[57] and indeed can be guaranteed to be encountered among prac- 
titioners who specialize in the treatment of stalking victims. This 
group, particularly if delusional, are often obvious, given implausible 
and exuberant accounts of victimization. Care should be taken in 
dismissing claims of being stalked, however, as there are some cal- 
culating stalkers who leave few, if any, objective signs. False victims 
require help and treatment, not rejection, but they require quite dif- 
ferent treatment from actual victims [57]. 

In all cases, it is essential to assess collateral information from 
such sources as witness statements, victim impact reports, judges’ 
sentencing remarks, and professional-to-professional contacts, con- 
fidentiality allowing. Attempts to contact the victim when assessing 
the stalker, or the stalker when assessing the victim, are not advised. 
However, skilfully managed, such contacts tend to be experienced by 
the victim as the professional acting as an agent of the stalker, and by 
the stalker as support for their beliefs that this is a misunderstanding 
within a mutual relationship. 

A comprehensive assessment of the stalker should answer at least 
the following questions, to enable a preliminary formulation: 


1. What motivated the stalking, what sustains it, and what is the 
nature of the prior relationship with the victim (if any)? 

2. What, if any, is the nature of the stalker’s current mental dis- 
order (including salient personality characteristics)? How has 
this been managed previously? 

3. What is the nature and extent of the stalker’s substance use/ 
misuse? 

4. Into which stalking type does the stalker best fit, based on these 
previous questions? 

5. What are the stalker’s current social circumstances/supports? 

6. What role could treatment have in ameliorating the stalking? 

7. What is the probability of continued stalking and/or physical 
violence? 


The management of stalkers remains very much the province 
of forensic mental health professionals, and even among them, 
it is a specialist area. Basic approaches to identifying potentially 
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remediable risks and their management, along with more specific 
interventions for particular stalker types, have been explored by 
Mullen and colleagues [1]. 

Stalking victims rarely present for mental health care after the 
fact; most will be exposed to ongoing intrusions, such that their 
safety needs have to be prioritized, along with their mental health. 
Some victims of prolonged, intense stalking can manifest symptoms 
of complex post-traumatic stress disorder (PTSD) [58] and may 
have a poorer response due to the sense of having been permanently 
changed or damaged by their victimization. Like many victims of 
crime, stalking victims may be reluctant to disclose the details or 
even the existence of traumatic experiences. As noted earlier, self- 
blame (or indeed explicit blame by those around them) is not infre- 
quently part of the picture. 

Stalking victims need good psychiatric care. Strategies to manage 
stress reactions are required, along with methods to ameliorate 
hyperarousal and/or depressed mood. The following is a brief 
account of key interventions to combat stalking that should be can- 
vassed in the treatment of victims: 


1. Inform others. It is essential that victims inform those they 
love or with whom they work or socialize. This performs three 
functions: 

i. It enables others to support the victim and to avoid inad- 
vertently assisting the stalker. 

ii. It prevents those around the victim from being put at risk by 
ignorance or by provoking the stalker and exacerbating the 
situation. 

iii. It allows victims a ‘reality check’ on their fears that the 
stalking is occurring or about the seriousness of their 
situation. 

2. Avoid contact and/or confrontation. All contacts or direct com- 
munications with a stalker risks reinforcing their behaviour and 
must be actively avoided. Once stalking is established, it is usu- 
ally too late to resolve matters by meeting ‘one last time. Threats 
of suicide if the victim refuses to meet or otherwise engage with 
the stalker should not be rewarded or reinforced, despite an ele- 
vated risk of suicide among perpetrators [59]. 

3. Documentation. The best protection for stalking victims often 
lies in criminal law. The police are more likely to respond appro- 
priately if victims can demonstrate that the behaviour is repeat- 
edly occurring and make it relatively easy for them to pursue the 
allegations. Retain phone or text messages, emails, letters, and 
unwanted ‘gifts, and keep a diary noting approaches, threats, or 
property damage, and, where possible, witnesses to (or photo- 
graphic evidence of) such events. This will be invaluable for any 
prosecution and further assist the victim to take back control 
over their lives. 

4. Restraining/intervention/apprehended violence (and other such) 
orders. The effectiveness of these orders to combat stalking is 
debatable, with 35-80% breached in stalking situations [60-62]. 
Nonetheless, these civil orders can be used to demonstrate for 
subsequent criminal proceedings that the stalkers behaviour 
was intentional, which may often be a legislative requirement. 
Reservations have to be expressed, however, not only about their 
utility (being totally ineffective, for example, for delusional in- 
timacy seekers), but also about the level of insecurity and dis- 
tress consequent on their breach. These orders are also often 
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provocative, particularly for the rejected, and victims should be 
advised about the potential for an escalation in violence after 
obtaining an order against a former intimate (particularly if the 
relationship was characterized by violence). 

5. Increased security. Inexpensive measures, such as good locks, 
movement-triggered outdoor lighting, secure mailboxes (or post 
office boxes), and ensuring high-privacy settings on social media 
accounts, may provide a degree of reassurance and a modicum 
of security. 


Querulous behaviour: abnormally persistent 


complaining and vexatious litigation 





In response to a perceived or real personal grievance, querulous in- 
dividuals relentlessly pursue their notion of justice in a manner that 
is seriously damaging to the person’s economic, social, and personal 
interests and disruptive to the functioning of courts and/or other 
agencies involved in attempts to resolve the claims [63]. A minority 
may engage in stalking as part of their campaign, but most pursue 
their quest via relentless complaining and litigation. 

Psychiatry’s interest in the querulous (from Latin, to mutter and 
to mumble) thrived in the late nineteenth and early twentieth cen- 
turies. Classical psychiatry viewed querulousness as a form of para- 
noia, with the term Querulantenwahn (‘litigants’ delusion’) coined in 
1857 by Johann Ludwig Casper to describe: 


‘A form of so called paranoia in which there exists in a patient an 
insuppressible and fanatic craving for going to law in order to get 
redress for some wrong which he believes done to him. Individuals 
who fall victim to this disorder are always strongly predis- 
posed ... extremely egotistical ... know everything better ... differs 
from other forms of paranoia in so far as the wrong may not be quite 
imaginary ... the more he fails the more he becomes convinced that 
enormous wrong is being done to him ... neglects his family and his 
business ... going down the road to ruim [64]. 


The accuracy of this description aside, the classical view of 
querulousness as a form of paranoia was problematic, given that 
the behaviour was usually based on a genuine grievance and fre- 
quently regarded as developing on the basis of vulnerabilities in 
the individual’s personality [65-67]. The diagnosis of querulous 
paranoia was appealed to less and less, and by the latter half of the 
twentieth century, the literature largely fell silent [68]. Unreasonable 
complainants ultimately came to be considered as a purely legal, ra- 
ther than a medical, issue [69]. The disappearance of querulousness 
from the realms of psychiatry paralleled the decline of paranoia as 
a diagnostic entity, but also reflected psychiatry’s increasing reluc- 
tance to play the role of social regulator. Perhaps most significant in 
the diminishing interest in the querulous was the coinciding emer- 
gence of the culture of blame [70], which drew more and more vul- 
nerable people into the systems of complaint management [1]. 

Cultural factors have been little explored in the querulous, but 
individuals who have migrated from Eastern bloc countries ap- 
pear overrepresented in our clinical experience, likely reflecting 
the historical absence of avenues to complaint or redress from in- 
justice under oppressive regimes. We speculate that these individ- 
uals are susceptible to becoming overwhelmed with the myriad of 


opportunities afforded to them with a Western ‘culture of complaint 
and blame. 

Agencies of accountability, including courts and tribunals, 
Ombudsmen’s offices, registration boards, and complaints de- 
partments, are those now primarily charged with navigating the 
problems created by a tiny fraction of people pursuing grievances 
with a persistence and insistence disproportionate to the substan- 
tive nature of their claim. In the civil, administrative, and family 
courts, the number of intractable cases being pursued—usually by 
self-represented litigants—increases each year. By pursuing what 
they regard as their rights through repeated petitions and intru- 
sive approaches to politicians and heads of state, querulants are also 
distracting protection services from more substantial threats [63, 
71-74]. 

The querulous pursue their vision of justice through litigation in 
courts and tribunals, through petitions to the powerful, and through 
the various agencies of accountability. In practice, all three avenues 
are exploited simultaneously. Courts and agencies of accountability 
are designed to deliver conciliation, mediation, reparation, and 
compensation, but rarely retribution, except in the exceptional case 
of punitive damages, and never personal vindication. As the queru- 
lous seek, above all, personal vindication and retribution, from the 
outset, they are doomed to fail. 


Clinical features 


It is not easy to distinguish the querulant from the difficult com- 
plainant, or even from social reformers and victims of gross in- 
justice. The following simple typology may assist [63]: 


1. Normal complainants are aggrieved and seek reasonable com- 
pensation, reparation, or simply an apology. They can engage in, 
and will accept, mediation and conciliation, though they may 
become persistent and insistent if provoked by inefficiency or 
injustice. 

2. Difficult complainants also seek compensation and reparation 
for their grievance but, in addition, can pursue retribution. They 
tend from the outset to manifest victimization and anger, and re- 
sist all solutions but their own. Eventually, they will settle for the 
best deal they can obtain, albeit still complaining of injustice. 

3. Altruistic reformers pursue goals of social progress via the courts, 
petitions, and complaints. They sacrifice their personal interests 
in pursuit of better outcomes for others. Though they may have a 
political agenda that is sectarian (for example, genetically modi- 
fied foods, fathers’ rights), they do not have idiosyncratic and 
personalized objectives. 

4. Fraudsters who knowingly pursue false or grossly exaggerated 
claims. 

5. The mentally ill whose claims are driven by delusional pre- 
occupations that are frequently bizarre in nature, which reflect 
underlying disorders often of a psychotic type. 

6. The querulous who seek personal vindication, in addition to 
compensation, reparation, and retribution. They are on a quest 
for justice that becomes their preoccupation (and often substi- 
tutes any occupation). Unlike reformers, and most of the diffi- 
cult complainants, there is an obvious discrepancy between the 
provoking event and the salience attached to it by the queru- 
lous. They appear to seek not resolution, but continuation of the 


conflict. Their loss of perspective lays waste to their social and 
economic functioning. 


The querulous are predominantly males, who first become em- 
broiled in complaining and claiming later in life, usually in their 
fourth or fifth decade [69, 75]. Premorbidly, most function rea- 
sonably well [69] and may well be above average in intelligence 
or functioning. They rarely have criminal records or prior psychi- 
atric contact, and substance abuse is not prominent. Many have 
close relationships, but by the time they reach mental health pro- 
fessionals, they have usually alienated their family and friends. 
Querulants are often disappointed people who feel their effort 
(indeed striving) and qualities have been ignored and left unre- 
warded. Their pursuit of justice offers an opportunity to vindicate 
their lives and obtain the public recognition so long denied. Their 
personalities tend to have the traits of self-absorption, suspicious- 
ness, and obsessionality, combined with an enviable capacity for 
persistence. 

Clinically, querulants typically present as energized individuals 
eager to convince of the merits of their case. There is an enthusiasm 
that can seem almost manic, but unlike the manic, they are totally 
focused and almost impossible to distract from their narrative of 
injustice. They may attend appointments with reams of documents 
testifying to their misplaced scholarship. If challenged, they usu- 
ally become patronizing as they pedantically refute all objections, 
to their complete satisfaction. Alternatively, they may become 
menacing and overtly threatening. 

Communications from querulants [75] and vexatious litigants 
have been observed to be often characterized by: 


e Multiple methods of emphasis, 
highlighting, capitalization, or italics. 


including underlining, 


e The generous use of exclamation marks and question marks. 

e Numerous footnotes and marginal notes. 

e The use of attachments, some potentially pertinent (for example, 
letters received, copies of legislation), others of less obvious rele- 
vance (for example, Magna Carta, United Nations Declaration of 
Human Rights). 

e And many, many pages. 


The content of communications may also be unusual [75], some- 
times containing: 


e Legal, medical, and other terms used frequently, but often 
incorrectly; 

e Repeated rhetorical questions; 

e Reference to the self in the third person; 

e Veiled threats to harm themselves or others if their wishes are not 
granted; or 

e Exaggerated politeness and attempts to ingratiate. 


Clinical assessment 


The querulous can only be adequately assessed by considering the de- 
velopment over time of their behaviour, as well as their state of mind. 
Inan interview, they may present as merely overenthusiastic or overly 
hopeful, pursuing their legitimate rights with, at worst, a degree of 
fanaticism. It is the unfolding of their story which reveals the damage 
they are suffering and that they have inflicted on those around them. 
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Case example 


An academic in her mid forties, whose fixed-term contract was not re- 
newed, complained to the University that this was a result of bullying 
by her supervisor, who she claimed was seeking to appropriate her in- 
tellectual property for his own career advancement. The University 
investigated but ultimately rejected her complaint, noting that the con- 
tract non-renewal was due to a lack of ongoing funding and role redun- 
dancy. She initiated civil action in the state administrative tribunal and 
submitted dozens of Freedom of Information (Fol) applications per- 
taining to the University, her former supervisor, and numerous other 
departmental colleagues. Only a handful of the Fol applications were 
granted, and the civil action was dismissed. She unsuccessfully ap- 
pealed the latter over the next 12 years (as a self-represented litigant), 
including eventually to the full bench of the national administrative 
court. She repeatedly sought stays and postponements of court hear- 
ings that had been scheduled to determine (that is, finalize) her case, 
the delays causing tremendous inconvenience both to the University 
staff named as respondents in the claims and the court staff. A frequent 
tactic was to ‘reserve the right’ to terminate proceedings due to ‘health 
issues’, which she typically employed at the end of lengthy proceedings 
when an adverse decision (in her opinion) was about to be adjudi- 
cated. Her administrative court claims were joined by complaints to 
the privacy commissioner, the human rights commissioner, and nu- 
merous members of parliament. She also contacted the registries of 
each state administrative tribunal, inundating staff with emails and 
phone calls and revelling in any inconsistent advice provided (which 
she regarded as confirmation of the tribunals’ gross incompetence, 
rather than regional procedural differences). She was ultimately in- 
structed by the Principal Registrar to deal only with the registry man- 
ager of the tribunal in which her original claim was lodged, whom she 
came to believe was responsible for the failure of her various claims. 
She finally made death threats against the registry manager, for which 
she was charged and subsequently convicted. On the day of sentencing, 
she abused and threatened the Magistrate (‘No one has the right to 
stop me! ... Just try and you'll pay, bitch!) and assaulted a court at- 
tendant who intervened. She was imprisoned for a brief period, after 
which she resumed her complaining and litigation with a vengeance. 
She continues to pursue various actions in the courts, with her identity 
(obvious to all who encounter her) inexorably linked to her dogged 
self-righteousness. 

Traditionally, psychiatry has attempted to distinguish between de- 
luded querulants, who are in the business of mental health services, 
and the non-deluded who are not. Unfortunately, the querulous pre- 
sent a formidable phenomenological challenge in this regard. They 
advance their ideas plausibly, making apparently rational connec- 
tions between the underlying grievance (which is almost always 
based on some actual event or injustice) and their current claims 
and complaints. The querulous offer a detailed and apparently lo- 
gical account of the emergence of their grievances and the progress 
of their quest for justice—logical that is, if taken in cross-section, but 
not when considered over time, when gross discrepancies between 
the supposed initiating cause and subsequent behaviour become ap- 
parent. Their persuasive presentation can obscure the essential fu- 
tility of the quest and distract attention from the chaos they have 
created for themselves and those around them. The temptation is 
to normalize the clinical presentation, but this is to ignore both the 
peculiarity of their behaviour and beliefs, as well as the devastation 
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they have wrought on their own lives. Sometimes the querulous are 
obviously deluded, and sometimes they appear to inhibit the border- 
line between overvalued ideas and delusion-like ideas. Debates over 
the phenomenological niceties should not, however, distract from 
recognizing the pathological nature of such querulousness. 


Management 


Psychiatrists currently only tend to be involved in the management 
of querulants after the situation has reached the stage of the person 
either becoming seriously depressed or being charged with threats, 
violence, or contempt. The literature on the therapeutic manage- 
ment of the querulous is meagre. Ungvari [76] reported successful 
treatment using pimozide, which is consistent with our clinical 
experience that relatively low doses of a dopamine antagonist are 
helpful, though the response is slow in coming. The critical hurdle 
in treatment, however, is attaining some semblance of a therapeutic 
alliance with the patient. This requires avoiding being caught up 
in discussions of the rights and wrongs of their quest. The focus 
should be on the price they and their loved ones are paying for the 
pursuit [63]. Interestingly, some of those who come on orders from 
the court that mandate treatment will accept medication and other 
therapeutic interventions, as they wish to make clear they abide by 
the law. Paradoxically, a number have continued voluntarily in treat- 
ment after the end of the order, though they never acknowledge ei- 
ther that they were in error or in need of treatment because of their 
querulousness. What change are the involvement in the querulous 
ideas, the degree of preoccupations, and the behaviour, but the core 
belief that they were right never wavers. Querulous behaviour ap- 
pears to be sustained by a range of cognitive distortions, including: 


e Those who do not fully support their cause are enemies. 

e Any lack of progress is the product of malevolent interference 
from someone. 

e Any compromise is humiliating defeat. 

e The grievance is the defining moment of their lives. 

e That because they are in the right, the outcomes they seek must be 
not only possible, but also necessary. 


These distortions are open to challenge and amelioration, and 
the cognitive therapy approaches advocated for delusions [77, 78] 
should, in theory, also be of value. The problem with the therapeutic 
management of querulous behaviour is that there are no adequate 
clinical trials of treatment, or indeed much beyond case reports. This 
reflects the prevailing zeitgeist that the querulous are not the con- 
cern of mental health, and even if they are, they are largely untreat- 
able. Hopefully, ifthis neglect is overcome and querulous behaviour 
is once more recognized as a legitimate concern for mental health 
professionals, then systematic studies of therapy will follow. 


Conclusions 


Not a few psychiatrists have difficulties with the notion of problem 
behaviours like stalking and querulousness being a proper subject 
for mental health concern. Psychiatry has traditionally been wary 
of concerning itself directly with criminal and antisocial behav- 
iours [79, 80]. The approach taken in this chapter was to define 
patterns of behaviour destructive to the interests of the perpetrator 


and the victim, and then to examine the origins, effects, and po- 
tential therapeutic management. By recognizing that psychiatry 
can have a role in assessing and managing problem behaviours, 
without first performing obfuscating transformations into sup- 
posed mental disorders (such as impulse-control disorders), al- 
lows a more clear-sighted and effective approach to areas of human 
activity where our intervention can benefit both the actor and the 
wider community. 
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Violence and abuse is a pervasive problem in our society. It includes 
physical, sexual, and psychological abuse, coercive behaviour, 
and deprivation. Violence can be perpetrated by family mem- 
bers, intimate partners, and ex-partners (domestic violence) or by 
strangers and acquaintances (non-domestic or community violence, 
including violence by care providers) [1] (Fig. 143.1). Domestic 
violence, the focus of this chapter, is a global public health problem 
[2]. Worldwide, intimate partner violence (IPV) (one form of do- 
mestic violence) has been estimated to account for up to 7% of the 
overall burden of disease among women, mostly due to its impact 
on mental ill health [3, 4]. Annual direct and indirect costs of do- 
mestic violence and abuse (DVA) in the UK have been estimated 
to amount to £16 billion per annum [5]. In the UK, DVA is defined 
as: ‘controlling, coercive or threatening behaviour, violence or abuse 
between people aged 16 or over, who are or have been intimate part- 
ners or family members, regardless of gender or sexuality. It includes 
psychological, physical, sexual, financial and emotional abuse [6]. 
This definition applies to adult family members (>15 years of age). 
Emotional and psychological abuse, including coercive control, are 
therefore recognized as part of DVA [7, 8]. (DVA also encompasses 
traditional practices, including forced marriage, so-called ‘honour 
crimes, and female genital mutilation, in addition to partner vio- 
lence, but these are beyond the scope of this chapter.) The World 
Health Organization (WHO) focuses on IPV, defining it as: “behav- 
iour by an intimate partner or ex-partner that causes physical, sexual, 
or psychological harm, including physical aggression, sexual coercion, 
psychological abuse or controlling behaviours’ In this chapter, we will 
use the term DVA, unless a study refers specifically to IPV only, and 
will specify where studies are referring to non-partner violence. It 
is important to note that victims of DVA are rarely subjected to just 
one form of abuse—around half of women suffering from DVA ex- 
perience more than one type of partner violence [9, 10]. 

DVA is sometimes classified into typologies, for example ‘situ- 
ational couple violence’ and ‘intimate terrorism [11-13], and there 
is some preliminary evidence for the existence of IPV typologies, 
though further research empirically validating typologies is needed 






































[14]. Situational couple violence is generally considered much more 
common than intimate terrorism and includes abuse, but without 
a context of control. Individuals who engage in situational couple 
violence are characterized as poor communicators who respond to 
frustrations with verbal or physical aggression. Although situational 
couple violence is harmful, intimate terrorism is the type of DVA 
that accounts for most serious injury and death. It is also more gen- 
dered than situational couple violence, with women more likely to 
be the victims of repeated and severe DVA by men, although in- 
timate terrorism also occurs in same-sex relationships. Perpetrators 
may display emotionally unstable personality traits, develop strong 
disordered attachment to their victims, and engage in violence in 
order to prevent abandonment. Other intimate terrorists may have 
sociopathic traits and seek to dominate and control many aspects of 
their lives, including their relationships [14]. 





DVA is common. A systematic review of international studies of 
physical violence in intimate relationships found that more than 
one in four women (28.3%) and one in five men (21.6%) reported 
perpetrating physical violence in an intimate relationship [15]. The 
British Crime Survey estimates of DVA (using the broad definition 
given) showed that some 8.2% of women and 4% of men were esti- 
mated to have experienced domestic abuse in 2014/15 [16], while 
27.1% of women and 13.2% of men, between the ages of 16 and 
59, had experienced any domestic abuse since the age of 16. In the 
United States, the National Intimate Partner and Sexual Violence 
Survey (2011) found that severe physical violence by an intimate 
partner (including acts such as being hit with something hard, being 
kicked or beaten, or being burnt on purpose) was experienced by 
an estimated 22.3% of women and 14.0% of men during their life- 
time and by an estimated 2.3% of women and 2.1% of men in the 
12 months before taking the survey [17]. DVA has the highest rate 
of repeated victimization of any violent crime [18-20]. There is sub- 
stantial variation globally; the WHO multi-country study of female 
DVA victims reported that the lifetime prevalence of physical or 
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Fig. 143.1 World Health Organization's typology of violence. 


Adapted from Krug EG, Dahlberg LL, Mercy JA, et al. [Eds.], World Report on Violence and Health, Copyright (2002), with permission from World Health Organization. Available 


at http://www.who.int/violence_injury_prevention/violence/world_report/en/ 


sexual partner violence, or both, varied from 15% to 71%, and past- 
year violence between 4% and 54%, among the countries studied 
[21, 22]. Prevalence is higher in some clinical populations; for ex- 
ample, a systematic review found the highest prevalences of physical 
violence (30-50%) and sexual violence (30-35%) in studies con- 
ducted in psychiatric clinics and obstetrics and gynaecology clinics, 
and the highest mean lifetime prevalence of psychological violence 
in psychiatric clinics and emergency departments (65-87%) [23]. 
Adolescent and young women face a substantially higher risk of ex- 
periencing IPV than older women [24]. Lower rates of physical and 
sexual IPV are reported in older women, but the prevalence of emo- 
tional and economic abuse and controlling behaviours are similar to 
those experienced by younger women [25]. 


Gender differences 


While studies that have investigated the prevalence of DVA in both 
men and women have often found that the prevalence of victimiza- 
tion is similar in both genders, this masks the frequent finding that 
women are at greatest risk of serious and sexual assaults. A study in 
the United States using a representative general population sample 
found the lifetime prevalence of isolated domestic violence incidents 
to be comparable for men and women, but that nearly 25% of sur- 
veyed women, compared to 7.6% of surveyed men, reported that 
they were raped and/or physically assaulted by a current or former 
partner at some time in their lifetime [26]. Gender differences in 
the type of violence experienced were also found in the National 
Violence Against Women Survey (NVAWS), a telephone survey of 
over 6000 women and 7000 men across the United States. While 
they also found similar proportions of women and men reporting 
that they had experienced physical, sexual, or psychological IPV 
during their lifetime (28.9% and 22.9%, respectively), it was found 
that women were significantly more likely than men to experience 
physical or sexual IPV and abuse of power and control, but less likely 


than men to report verbal abuse alone [27]. A systematic review of 
international studies ofhomicide revealed that one in seven of global 
homicides were committed by an intimate partner, and this propor- 
tion was six times higher for female homicides than for male homi- 
cides (38-6% vs 6-3%) [28]. Sexual violence against men by women 
partners has been found to be rare; in contrast, at least one in four 
women report sexual violence by a male intimate partner [29, 30]. 

Reports of comparable lifetime occurrence of ‘all-type’ DVA 
among men and women also mask the gender differences with re- 
gard to the number and type of specific assaults. For example, Tiaden 
and Thoennes [26] found that women in the United States were at 
greater risk of repeated coercive, sexual, or severe physical violence. 
British Crime Survey for England and Wales findings also suggested 
differences in repeat victimization; male victims may experience, on 
average, up to seven instances of repeat victimization, and women 
an average of 20 incidents [31]. Women are also more likely to report 
more severe injuries and to be more likely to be frightened as a result 
of domestic violence [29, 31]. 

However, there may also be gender differences in reporting. 
Brown [32] noted gender discrepancies in the rates of arrest and 
prosecution for IPV. Male victims were reluctant to report the in- 
cident, and the police were unwilling to arrest women accused of 
perpetrating violence, resulting in only 2% of suspected female 
perpetrators being arrested, which raises the concern that national 
prevalence rates based on surveys may not accurately reflect preva- 
lence rates, particularly for men. A number of researchers also have 
challenged the assumption that IPV has a greater impact on women 
than men [33, 34], with studies increasingly examining the psycho- 
logical impact of IPV on men [35] (see also later discussion of DVA 
and mental disorder). 

Less is known about gender differences in male, compared with fe- 
male, homosexual relationships, but a survey from the United States 
found that rates of emotional and physical violence victimization 


among urban men who have sex with men were substantially higher 
than previously reported among heterosexual men—34% reported 
psychological violence, 22% physical violence, and 5% sexual vio- 
lence [36]. Studies from the United States increasingly suggest that 
the prevalence of DVA may be similar across same-sex and hetero- 
sexual relationships and what differs are help-seeking behaviours 
[37]. In a UK survey of 800 homosexual men and women, Hester 
(2009) reported that more than a third of respondents (38%) said 
that they had experienced DVA at some time in a same-sex rela- 
tionship, with similar proportions in women (40%) and men (35%) 
[38]. However, the main prevalence data on DVA worldwide are 
derived from surveys that do not identify individuals in same-sex 
relationships, so current knowledge on violence occurring in homo- 
sexual relationships in epidemiologically representative populations 
is limited. 

While this chapter includes more studies of women as victims and 
men as perpetrators of DVA, the authors acknowledge that this re- 
flects the smaller research literature on women as perpetrators and 
men as victims (irrespective of whether in heterosexual or homo- 
sexual relationships). 


Risk factors for domestic violence and abuse: an 


ecological model 





The WHO uses an ecological model when considering risk fac- 
tors for DVA to provide a theoretical framework to help explain 
the multi-faceted nature of violence (Fig. 143.2) [1]. Risk fac- 
tors operating at the level of the individual include being female, 
young age [39, 40], disability [41], poverty [40, 42], witnessing 
DVA as a child, childhood abuse [40, 43], and substance abuse 
[44]. Many of these factors are also risk factors for mental dis- 
orders, which emphasizes the social determinants of both mental 
disorders and DVA and the complex pathways involved in being 
a victim of violence through the lifespan. Aetiological factors also 
occur at the level of the relationship (for example, partner with 
substance misuse), community characteristics (for example, high 
population density, unemployment, and social isolation), and 
larger societal factors, including health, educational, economic, 
and social policies, cultural norms, gender disadvantage, and 
social inequalities [1, 45]. 


CHAPTER 143 Domestic violence and abuse and mental health 


Victimization 

Being a victim of DVA is associated with mental disorders across 
the diagnostic spectrum, including anxiety, depression, post- 
traumatic stress disorder (PTSD), eating disorders, and psychosis, 
though there are limited data on this relationship for men [46-52]. 
Prospective studies have shown that psychiatric disorder can in- 
crease domestic violence perpetration and victimization, and that 
domestic violence is associated with both risk and chronicity of 
mental disorder [53, 54]. 

The prevalence of DVA victimization is particularly high among 
people in contact with secondary mental health services; a recent 
survey of mental health care service users in London reported that 
70% of women and 50% of men had been DVA victims as adults, 
with many (27% of women, 10% of men) experiencing recent and 
current DVA, that is, significantly higher than the general popula- 
tion [odds ratio (OR) adjusted for sociodemographics (aOR) 2.7, 
95% confidence interval (CI) 1.7-4.0 for women; aOR 1.6, 95% CI 
1.0-2.8 for men] [55]. In addition, of note, family (non-partner) vio- 
lence comprised a greater proportion of overall DVA among SMI 
than control victims (63% vs 35%; P <0.01), and adulthood ser- 
ious sexual assault led to attempted suicide significantly more often 
among SMI than control female victims. Witnessing DVA also im- 
pacts on the health and mental health of children of adults who en- 
gage in IPV [56-58]. Mental health problems and violence can thus 
be transmitted through the generations [59]. 

The association between DVA and mental disorder is likely to 
be complex. Pre-existing mental health problems influence vul- 
nerability to domestic violence. Indeed, there is evidence that a 
bi-directional causal relationship exists between some mental dis- 
orders and DVA. Prospective studies have shown that DVA contrib- 
utes to both the emergence and the exacerbation of mental health 
symptoms [54, 60]. A systematic review of longitudinal studies of 
IPV and depressive symptoms reported both an association between 
depression and subsequent DVA and that DVA increases the risk 
of depression among women with no previous history of symp- 
toms [61]. Moreover, data from an earlier systematic review [48] 
found that rates of depression declined over time once the abuse had 
ceased and that the severity or duration of violence was associated 
with the prevalence or severity of depression. Serious mental illness 
increases the likelihood of being in unsafe environments and rela- 
tionships, and vulnerability to violent victimization [62]. There has, 
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Fig. 143.2 The ecological model for understanding violence. 


Adapted from Krug EG, Dahlberg LL, Mercy JA, et al. [Eds.], World Report on Violence and Health, Copyright (2002), with permission from World Health Organization. Available 


at http:/Awww.who.int/violence_injury_prevention/violence/world_report/en/ 
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however, been limited longitudinal research into the relationship be- 
tween DVA and other mental disorders, and more work is needed 
to disentangle the role of childhood abuse, which is associated with 
both mental disorder and DVA [63]. 

It has been suggested that DVA is more psychologically damaging 
than stranger violence or other forms of trauma [64]. The psycho- 
logical effects of domestic violence can be conceptualized within a 
trauma framework, but the complex presentation of victims who 
have often been exposed to extensive control and repeated assaults, 
particularly if abuse was also experienced in childhood and escape 
was not possible due to physical, psychological, family, or societal 
factors, may be more adequately captured with the concept of ‘com- 
plex PTSD’ [65]. Although not a category in DSM-5 (but will be in- 
cluded in ICD-11), complex PTSD is considered a useful concept by 
many working with chronically traumatized victims [66]. Complex 
PTSD extends beyond the classic cluster of intrusive, avoidance, 
and arousal symptoms to incorporate changes in victims’ attitudes 
about self, the perpetrator, relationships, and beliefs. Symptoms 
include those of PTSD, with additional disturbance in affect regu- 
lation and interpersonal relationships (see associated features of 
post-traumatic stress disorder in DSM-5 [67], including: feelings 
of ineffectiveness, shame, despair, or hopelessness; feeling perman- 
ently damaged; loss of previously sustained beliefs; hostility; social 
withdrawal; feeling constantly threatened; impaired relationships 
with others; or a change from the individual's previous personality 
characteristics). 


Perpetration 


Although there is a wealth of research literature supporting the link 
between mental disorder and increased risk of perpetrating violence 
in general [68-70], fewer studies have examined DVA perpetration 
specifically. A recent systematic review found only 17 studies re- 
porting the risk of DVA perpetration among men and women with 
mental disorders [71]—three reported on past-year violence, and 
these three studies did not consistently report increased risk for any 
diagnosis. Most studies examined lifetime risk of physical violence 
towards a partner and reported increased risk among samples with 
mental disorder of any diagnosis. However, most studies were cross- 
sectional, and importantly, alcohol and substance misuse were not 
examined as potential confounding or mediating factors. Further 
research is needed to investigate whether psychiatric disorders are 
associated with a current risk of violence to partners. 

Recent studies of general violence and mental disorder have ad- 
vanced our understanding of the link between mental disorder and 
violence perpetration. Some have identified the important mediating 
role of alcohol and substance misuse [72], and some have unpicked 
the mediating role of certain symptoms, such as psychotic delusions, 
negative affect, or anger [73, 74] in those with severe mental dis- 
order. These pathways have not been investigated in relation to DVA 
perpetrated by people with mental disorders. Similarly, although 
treatment (for example, with antipsychotics and mood stabilizers 
for those with severe mental disorder) is associated with reduced 
violent crime [75], the impact of treatment for mental disorder has 
not been investigated in relation to DVA. It seems possible, however, 
that similar mechanisms may be important in any relationship be- 
tween DVA perpetration and mental disorders. 

The general premise in the literature, historically, was that even if 
women and men engage in equivalent rates of IPV in heterosexual 


relationships, male-perpetrated violence has more negative con- 
sequences for its victims than does female-perpetrated violence. 
Supportive evidence for this view comes from studies suggesting 
that women are more likely than men to sustain serious physical in- 
jury and negative psychological consequences [27, 76, 77]. However, 
recent cultural shifts have led to the acknowledgement that men can 
also be victimized in their relationships and research has shown 
that men can sustain similar levels of psychological injury. The 
NVAWS of women and men aged 18-65 found that, for both men 
and women, physical IPV victimization was associated with an in- 
creased risk of current poor health, depressive symptoms, substance 
use, and developing a chronic disease, chronic mental illness, and 
injury [27]. In general, psychological IPV was more strongly asso- 
ciated with these health outcomes than physical IPV. The propor- 
tion of IPV survivors meeting criteria for moderate to severe PTSD 
did not differ by gender (20% male, 24% female) [78], and psycho- 
logical abuse was just as strongly associated with PTSD as physical 
abuse. This raises questions about the assumed lesser impact of IPV 
by women towards men, given findings that women are more likely 
to perpetrate psychological than physical aggression towards male 
partners. However, it is also clear that at the population level, being 
a victim of DVA is more prevalent in women and the psychiatric 
burden (estimated as population-attributable fractions, assuming 
causality) of DVA is higher in women [79]. 


Identification of domestic violence and abuse 





by mental health services 


Identification of victims 


Women are more likely to disclose domestic violence to a health care 
professional than to the police [80]. However, despite the high preva- 
lence of DVA among mental health service users, a review in 2010 
found that only 10-30% of DVA victims were identified by mental 
health professionals internationally [19]. Qualitative research in pri- 
mary and secondary care has found that women may not disclose 
unless they are asked [81-83]. Barriers to disclosure for psychiatric 
patients include fear of consequences such as social services in- 
volvement and consequent child protection proceedings, fear that 
disclosure would not be believed, and fear that disclosure would lead 
to further violence. Other barriers included the actions of the per- 
petrator (such as always being present when the victim is seen by 
health professionals) and feelings of shame. A review of qualitative 
studies involving mental health service users found that DVA vic- 
tims want mental health professionals to acknowledge and/or val- 
idate their disclosures of domestic violence in a non-judgemental 
and compassionate manner [81]. Patients reported limited oppor- 
tunities to discuss DVA during clinical consultations, which were 
focused on diagnosing and treating psychiatric symptoms which 
prevented discussion of abuse or the extent of abuse. When disclos- 
ures were not taken seriously or minimized or there is the sense of 
being blamed by professionals, this was considered unhelpful and 
was associated with persistent symptoms. Concerns for future safety 
were prominent, with some responders expressing fear that mental 
health professionals’ responses to the violence could place (and had 
placed) them at risk of further harm (for example, if the perpetrator 
hears about the DVA disclosure). There can be an expectation that 


they leave their partner, but mental health professionals may fail to 
understand that choosing to remain in an abusive relationship may 
be based on a strategic risk—benefit analysis which balances the risk 
of ongoing harm with the potential negative consequences of leaving 
their partner (financial concerns, social stigma, custody of children) 
and the hope that their partner may change [82, 84]. Some women 
fear more severe violence if they leave their partner. Indeed, it has 
been identified that women are at greatest risk of homicide in the 
months immediately following separation [85, 86]. 

There is some evidence that when routine enquiry about DVA is 
introduced across mental health services, detection rates improve. 
But even then, formulations and management plans rarely include 
issues around domestic violence and often do not include safety 
planning or trauma-focused therapy [87]. Therefore, it is now health 
policy in England, and in some parts of the United States, Australia, 
and New Zealand, to routinely enquire about DVA in all mental 
health assessments in psychiatric care [88]. Routine enquiry refers 
to ‘asking all people within certain parameters about the experi- 
ence of DVA, regardless of whether or not there are signs of abuse, 
or whether DVA is suspected. Enquiry and disclosure in mental 
health settings have been found to be facilitated by a supportive and 
trusting relationship between patient and professional [82], with 
similar findings in other health care settings [83]. However, barriers 
to routine enquiry by professionals have also been identified, such 
as a lack of understanding of different types of DVA and a lack of 
confidence in facilitating and managing disclosures [82, 89]. Mental 
health professionals may be unsure of their role in the identifica- 
tion and management of DVA, and surveys have identified know- 
ledge gaps [90]. International guidelines, including from the WHO 
and the National Institute for Health and Care Excellence (NICE), 
recommend that appropriately trained mental health professionals 
should facilitate DVA disclosure as part of comprehensive clinical 
assessments [30, 91]. 

Trials of systemic interventions to improve identification and 
response to DVA victims [92] in primary care (that is, integrated 
training, support, and advice from DVA advocates) have been 
shown to significantly improve health professionals’ facilitation of 
disclosure and their subsequent response [92, 93]. As yet, there have 
been no trials in mental health settings, but a pilot has suggested that 
there are similar increases in identification rates of DVA victim with 
improvements in health and quality of life outcomes [94]. 

All mental health professionals need comprehensive training 
in how to enquire safely, with clarity on subsequent referral and 
care pathways [30]. The World Psychiatric Association (WPA) has 
just published an international competency-based curriculum for 
mental health care providers on IPV and sexual violence against 
women [95]. In England, the Department of Health has recom- 
mended that a question about any past or current violence and abuse 
should be asked during assessments and care programme approach 
meetings [96]. However, disclosure is more likely if more than one 
specific behaviour-based questions are used. Open questions can be 
asked initially about relationships, and normalization of the area of 
enquiry can also be helpful, but more specific questions about each 
type of abuse should also be asked (Boxes 143.1-143.4). Such ques- 
tions can only be asked if a patient is alone or with a professional 
interpreter (rather than a family member). 

However, there is evidence internationally that guideline dissem- 
ination and training in isolation do not necessarily create consistent, 
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Box 143.1 Introductory open questions 


e Are you having any problems with your husband/partner? 

e We know that one in four women (and one in five men) experience 
domestic violence at some time in their life, so | ask everyone if 
that has ever happened to them. Has that happened to you? 

e Some women have these symptoms when they are experiencing 
abuse. Are you afraid of anyone at home? 

e Sometimes partners use physical force. Is this happening to you? 

e Have you felt humiliated or emotionally abused by your 

partner (or ex-partner)? 

e Has your partner ever physically threatened or hurt you? Or have 

you been kicked, hit, slapped, or otherwise physically hurt by 

your partner (or ex-partner)? 

e Inthe past year, have you been forced to have any kind of sexual 

activity by your partner (or ex-partner)? 


= 
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sustainable improvements in identification and response to DVA 
[92, 97], and research into strategies to improve the integration of 
DVA into the scope of mental health services is needed. Integration 
of DVA advocates within mental health teams, with additional dedi- 
cated time for training, may improve rates of identification of DVA 
and outcomes for individuals [89], but training in identification 
may not lead to improved patient outcomes if they do not lead to 
increased engagement in interventions [98]. 


Identification of perpetrators 


A recent UK qualitative study with mental health professionals re- 
vealed that while they routinely ask about violence in general, they 
often fail to ask about current or ex-partner DVA (even though it 
has been shown that the risk of lethal violence increases after sep- 
aration [99] due to inadequacy of current risk assessments, which 
often do not specifically refer to different types of DVA; and lack 
of clarity on information sharing when DVA was also a concern is 
disclosed [100]. In England, the National Confidential Inquiry into 
Homicides and the Home Office reported a failure of mental health 


Box 143.2 Questions about psychological abuse 


e Does anyone insult you, call you names, or swear at you? 

e Does anyone make it difficult for you to see friends/family or leave 
he house? 

e Does anyone act in a jealous way or keep track of where you go? 

e Does anyone put you down, embarrass you, or criticize you? 

e Does anyone undermine your independence or try to make you 
feel small? 

e Does anyone make you feel as if you have to walk on eggshells or as 
if you do nothing right? 

e Does anyone order you around like a servant? 

e Does anyone blame you for things that are not your fault? 

e Does anyone control the money, make you ask for it, or stop you 
earning? 





Reproduced from Adv Psychiatr. Treat., 18(2), Howard LM, Domestic vio- 
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Box 143.3 Questions about sexual abuse 


e Do you ever feel that you have to have sex, even though you don't 
want to? 
e Have you felt forced into sex because of what your partner might do? 
e Has your partner made you have sex or carried on when it was 
painful? 
e Has your partner made you have oral or anal sex when you didn't 
want to? 
e Has your partner used an object in a sexual way that you didn't like? 
e Has your partner made you do things or perform sexual acts that you 
didn't like? 
e Has your partner refused safe sex or to use birth control? 

e Has your partner made you have sex with another person? 


e Has your partner talked about sex or done things in a way you 
didn't like? 
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professionals to assess for DVA perpetration risk [101], despite a 
clear responsibility of mental health services to identify potential 
perpetrators of all forms of violence within current risk assessment 
frameworks. Recent studies using data from statutory domestic 
homicide reviews in England and confidential enquiries into homi- 
cides by people with mental illness have highlighted that a signifi- 
cant proportion of domestic homicides are perpetrated by male 
mental health service users; 14% of intimate partner homicides and 
23% of adult family homicides were perpetrated by men who had 
been in contact with mental health services in the preceding year 
(men made up 80% of both types of homicide perpetrators) [72]. 
A study of homicide-suicides in England and Wales similarly found 
that current or former partners were the victims in two-thirds of 
cases and that 12% of perpetrators had been in contact with mental 
health services in the year before the offence [102]. 

Although there has been less research on how to respond to 
mental health service users who disclose DVA perpetration, there 
is extensive guidance on how to manage risk of violence, which 
includes consideration of the safety of the victim and appropriate 


Box 143.4 Questions about physical abuse 


e Has your partner shaken you or grabbed you roughly? 

e Has your partner shoved you or made you fall? 

e Has your partner slapped you or smacked you? 

e Has your partner tried to hit you with something or used an object as 
a weapon? 
e Has your partner punched you? 

e Has your partner tried to choke you or put his hands round your 
hroat? 
e Has your partner pushed you against the wall or thrown you down? 
e Has your partner pulled your hair? 

e Has your partner burnt you or scalded you with something? 

e Has your partner threatened you with a knife or gun? 

e Has your partner hurt you while you were pregnant? 
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disclosure, if necessary, even if in breach of patient confidentiality. 
Homicide-suicides, 88% of which are perpetrated by men, with 77% 
of victims being female, are commonly preceded by relationship 
breakdown and separation [102]. With such a high proportion iden- 
tified as having mental health problems, more research and guid- 
ance are needed on management risk of DVA perpetration among 
mental health professionals. 


How should mental health professionals respond 


to domestic violence and abuse experienced 
by psychiatric patients? 





International guidelines, including from the WHO and NICE, rec- 
ommend that, following the facilitation of DVA disclosure during 
a comprehensive clinical assessment, mental health professionals 
should provide support, ensure safety, and treat physical and mental 
disorders arising in the context of any DVA [30, 88]. There is de- 
bate about the role of universal screening for DVA in the health care 
system in generic services such as primary care or emergency de- 
partments [83]. However, the prevalence of violent victimization is 
so high in mental health service users [55, 103], and clinical guide- 
lines recommend that mental health professionals routinely ask 
about DVA experienced in childhood and adulthood as part of clin- 
ical assessment and ongoing care [30, 88, 104]. 

As highlighted, however, routine enquiry may not improve out- 
comes if the enquirer is not appropriately trained in how to respond 
[88]. Guidelines recommend appropriate training, in combination 
with the establishment of protocols for inter-agency communication 
and information sharing, referrals to specialist DVA services, and 
referrals for interventions, including to specialist trauma services, 
if necessary. In some jurisdictions, there are specific multi-agency 
risk assessment conferences (MARACs) [105] which enable all 
services, including the DVA sector, to share information and reduce 
risk, and there is evidence to suggest that these can reduce risk of 
subsequent DVA. 

Although there is limited research evidence in this area, good 
clinical practice includes making accurate notes, carrying out a risk 
assessment (particularly regarding immediate safety and consid- 
ering the need for emergency action and/or child protection pro- 
cedures), prioritizing safety planning, avoiding victim-blaming, 
and discussing available options [106]. Information about domestic 
violence services should be given, but professionals need to check 
whether it is safe for the patient to take information home with 
them—there may be an escalation of violence if such information 
is seen by the perpetrator. Potential interventions are discussed later 
in this chapter, but local health care trust policies (on domestic vio- 
lence and on safeguarding of children and vulnerable adults), care 
pathways, and training should include information on what to do 
after disclosure [96]. 

When working with survivors of DVA, mental health profes- 
sionals should ensure women have opportunities to be seen without 
partners, family members, or acquaintances present; provide access 
to independent interpreters; respond sensitively, compassionately, 
and non-judgementally to disclosure; reassure women that they are 
believed and not to blame for their experiences; and offer informa- 
tion and practical support that responds to women’s concerns and 


respects their autonomy [30, 107]. It is also crucial for health profes- 
sionals to recognize children as direct victims of domestic violence 
and abuse through witnessing or being aware of DVA in the family. 
Such experiences are associated with an increased risk of health 
problems and DVA in adulthood. There is also an increased risk of 
them being abused directly themselves by the abusive parent [91]. 
Professionals should recognize children’s experience of the impact 
of DVA and strengthen professional responses to them as direct vic- 
tims, not as passive witnesses to violence [108]. 

There is a need for greater focus on the interaction of the wider 
health infrastructure, including mental health services, in order 
to improve the management and reduction of DVA. This interdis- 
ciplinary and inter-agency approach was highlighted in the 2014 
Lancet series on violence against women [109] and can be extrapo- 
lated beyond violence against women to DVA in general. Although 
additional research is needed, strengthening of health systems can 
enable providers to address DVA, including protocols, capacity 
building, effective co-ordination between agencies, and referral 
networks. 


What interventions are effective for psychiatric 





patients experiencing domestic violence? 


Interventions for victims 


There is now a wealth of evidence for the effectiveness of a variety 
of pharmacological and psychological interventions for victims of 
trauma. NICE recommends cognitive behavioural therapy (CBT) 
and eye movement desensitization and reprocessing (EMDR) for 
PTSD [110], but until recently, there was a lack of research into the 
efficacy of these treatments for survivors of DVA who often present 
with more comorbidity or complex PTSD (as described). There is a 
growing evidence base for the treatment of complex PTSD, though 
this is currently under critical review [111]. However, while evidence- 
based interventions are already well established for victims of trauma, 
without identification and an understanding of the interpersonal dy- 
namics of coercive, controlling DVA, it is likely that treatment may 
be less effective; indeed, some have hypothesized that DVA may be a 
moderator of treatment response if not identified or addressed [112]. 

Systematic reviews of treatment for DVA-related trauma have 
reported that CBT-based interventions may be associated with im- 
proved PTSD and depressive symptoms in survivors no longer in 
abusive relationships [84, 85]. Advocacy and CBT interventions may 
also be effective in reducing physical and psychological DVA [113). 
However, these findings cannot be extrapolated to women with 
more severe psychiatric illnesses. Studies are emerging which have 
shown the effectiveness of NICE-recommended trauma-focused 
therapies in treating PTSD comorbid with psychosis [114, 115], 
but none have specifically focused on trauma in the context of do- 
mestic violence. Few studies have examined whether interventions 
are helpful in reducing psychological symptoms among women still 
subject to abuse. 


Interventions for perpetrators 


Evidence is lacking on the effectiveness and appropriateness of DVA 
perpetrator programmes for people with mental disorders, and these 
programmes receive few referrals from mental health services [116]. 
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However, interventions for modifiable risk factors (such as medica- 
tion for persecutory delusions, psychological interventions, and treat- 
ment of comorbid alcohol and substance misuse), while ensuring the 
safety of the potential victim, may be expected to improve health and 
reduce violence for DVA perpetrators in contact with mental health 
services whose DVA perpetration appears linked to these factors. 


Future research 


As this chapter has highlighted, there is still limited understanding 
of pathways to DVA and mental disorders, particularly in relation 
to DVA perpetration. It is also not clear how treatments need to be 
modified for DVA victims and perpetrators, nor how to prevent re- 
current DVA. Future research needs to investigate the underlying 
mechanisms between DVA and mental disorders in both victims 
and perpetrators, develop and evaluate interventions to address 
these potential targets, and examine systemic interventions to im- 
prove responses of mental health services as a whole. 


Conclusions 


DVA is an endemic problem in our society, but even more of an issue 
in psychiatric patients. Traditional training of psychiatrists has not 
included knowledge about DVA, and in view of the high rates of 
both DVA perpetrators and victims in psychiatric patients and the 
potential impact, this clearly needs to change. Current guidance is 
based on limited evidence, and further research is needed to im- 
prove the evidence base on identification and treatment of both vic- 
tims and perpetrators. 


REFERENCES 


1. Krug EG, Mercy JA, Dahlberg LL, Zwi AB. The world report on vio- 
lence and health. The Lancet. 2002;360:1083-138. 

2. World Health Organization and London School of Hygiene 
and Tropical Medicine. Preventing Intimate Partner and Sexual 
Violence Against Women: Taking Action and Generating Evidence. 
Geneva: World Health Organization; 2010. 

3. Devries KM, Mak JY, Garcia-Moreno C, et al. Global health. The 
global prevalence of intimate partner violence against women. 
Science. 2013;340:1527-8. 

4. Vos T, Astbury J, Piers L, al. Measuring the impact of intimate 
partner violence on the health of women in Victoria, Australia. 
Bulletin of the World Health Organization. 2006;84:739-44. 

5. Walby S. The Cost of Domestic Violence: Update 2009. 
Lancaster: Lancaster University; 2009. 

6. Home Office. Information for Local Areas on the Change to the 
Definition of Domestic Violence. London: Home Office; 2013. 

7. Dutton MA, Goodman LA. Coercion in intimate partner vio- 
lence: toward a new conceptualization. Sex Roles. 2005;52:743-56. 

8. Johnson MP. Violence and abuse in personal relationships: con- 
flict, terror, and resistance in intimate partnerships. In: Perlman 
ALVD, editor. The Cambridge Handbook of Personal Relationships. 
New York, NY: Cambridge University Press; 2006. pp. 557-76. 

9. Coleman K, Jansson K, Kaiza P, Reed E. Homicides, Firearms 
Offences and Intimate Violence 2005/6. Home Office Statistical 
Bulletin. London: Home Office; 2007. 





1458 





10. 


Ile 


12: 


13% 


14. 


I5 


16. 


ke 


18. 


19. 


20. 


Alls 


DOR 


23: 


24. 


25: 


26. 


Die 


SECTION 24 Forensic psychiatry 


Donaldson A, Marshall L. Domestic Abuse Prevalence: Argyll 

and Clyde DAP Study. Glasgow: West Dunbartonshire Domestic 
Abuse Partnership; 2005. 

Graham-Kevan N, Archer J. Intimate terrorism and common 
couple violence: a test of Johnson’s predictions in four British 
samples. Journal of Interpersonal Violence. 2003;18:1247-70. 
Johnson MP. Patriarchal terrorism and common couple vio- 
lence: two forms of violence against women. Journal of Marriage 
and Family. 1995;57:283-94. 

Kelly JB, Johnson MP. Differentiation among types of intimate 
partner violence: research update and implications for interven- 
tions. Family Court Review. 2008;46:476-99. 

Ali PA, Dhingra K, McGarry J. A literature review of intimate 
partner violence and its classifications. Aggression and Violent 
Behavior. 2016;31:16-25. 

Desmarais SL, Reeves KA, Nicholls TL, Telford RP, Fiebert 

MS. Prevalence of physical violence in intimate relationships, 
Part 2: Rates of male and female perpetration. Partner Abuse. 
2012;3:170-98. 

Office for National Statistics. Focus on: Violent Crime and Sexual 
Offences, 2014/15. London: Office for National Statistics; 2016. 
Centers for Disease Control and Prevention. Prevalence and 
Characteristics of Sexual Violence, Stalking, and Intimate Partner 
Violence Victimization—National Intimate Partner and Sexual 
Violence Survey, United States, 2011. Atlanta, GA: Centers for 
Disease Control and Prevention; 2014. 

Dodd T, Nicholas S, Povey D, Walker A. Crime in England and 
Wales 2003/2004. Statistical Bulletin, Home Office Research 
Development and Statistics Directorate. London: Home 

Office; 2004. 

Howard LM, Trevillion K, Agnew-Davies R. Domestic vio- 

lence and mental health. International Review of Psychiatry. 
20103;22:525-34. 

Walby S, Towers J, Francis B. Is violent crime increasing or 
decreasing? A new methodology to measure repeat attacks 
making visible the significance of gender and domestic relations. 
British Journal of Criminology. 2015;56:1203-34. 

Ellsberg M, Jansen HAFM, Heise L, Watts CH, Garcia-Moreno C. 
Intimate partner violence and women’s physical and mental health 
in the WHO multi-country study on women’s health and domestic 
violence: an observational study. The Lancet. 2008;371:1165-72. 
Garcia-Moreno C, Jansen HAFM, Ellsberg M, Heise L, Watts CH. 
Prevalence of intimate partner violence: findings from the WHO 
multi-country study on women’s health and domestic violence. 
The Lancet. 2006;368:1260-9. 

Alhabib S, Nur U, Jones R. Domestic violence against 

women: systematic review of prevalence studies. Journal of 
Family Violence. 2010;25:369-82. 

Stöckl H, March L, Pallitto C, Garcia-Moreno C. Intimate partner 
violence among adolescents and young women: prevalence and 
associated factors in nine countries: a cross-sectional study. BMC 
Public Health. 2014;14:1-14. 

Stöckl H, Penhale B. Intimate partner violence and its association 
with physical and mental health symptoms among older women 
in Germany. Journal of Interpersonal Violence. 2015;30:3089-111. 
Tjaden P, Thoennes N. Prevalence and consequences of male-to- 
female and female-to-male intimate partner violence as meas- 
ured by the National Violence Against Women Survey. Violence 
Against Women. 2000;6(2):142-61. 

Coker AL, Davis KE, Arias I, et al. Physical and mental health ef- 
fects of intimate partner violence for men and women. American 
Journal of Preventive Medicine. 2002;23:260-8. 


28. 


29; 


30. 


aie 


92. 


ab 


34. 


oo 


36. 


Sie 


38. 


39. 


40. 


41. 


42. 


43. 


44. 


45. 


Stöckl H, Devries K, Rotstein A, et al. The global prevalence 

of intimate partner homicide: a systematic review. The Lancet. 
2013;382:859-65. 

Barnish M. Domestic Violence: A Literature Review. London: HM 
Inspectorate of Probation; 2004. 

World Health Organization. Responding to Intimate Partner 
Violence and Sexual Violence Against Women: WHO Clinical and 
Policy Guidelines. Geneva: World Health Organization; 2013. 
Walby S, Allen J, Simmons J. Domestic Violence, Sexual 

Assault and Stalking: Findings from the British Crime Survey. 
London: Home Office Research, Development and Statistics 
Directorate; 2004. 

Shorey RC, Ninnemann A, Elmquist J, et al. Arrest history and 
intimate partner violence perpetration in a sample of men and 
women arrested for domestic violence. International Journal of 
Criminology and Sociology. 2012;1:132. 

Hines DA. Posttraumatic stress symptoms among men who sus- 
tain partner violence: an international multisite study of univer- 
sity students. Psychology of Men and Masculinity. 2007;8:225. 
Holzworth-Munroe A. Male versus female partner violence: put- 
ting controversial findings in context. Journal of Marriage and 
Family. 2005;67:1120-5. 

Randle AA, Graham CA. A review of the evidence on the effects 
of intimate partner violence on men. Psychology of Men and 
Masculinity. 2011;12:97-111. 

Greenwood GL, Relf MV, Huang B, Pollack LM, Canchola JA, 
Catania JA. Battering victimization among a probability-based 
sample of men who have sex with men. American Journal of 
Public Health. 2002;92:1964-9. 

Calton JM, Cattaneo LB, Gebhard KT. Barriers to help seeking 
for lesbian, gay, bisexual, transgender, and queer survivors of 
intimate partner violence. Trauma, Violence, and Abuse. 2016; 
17(5):585-600. 

Hester M, Donovan C. Researching domestic violence in same- 
sex relationships—a feminist epistemological approach to survey 
development. Journal of Lesbian Studies. 2009;13:161-73. 
Kessler RC, Molnar BE, Feurer ID, Appelbaum M. Patterns 

and mental health predictors of domestic violence in the 

United States: results from the National Comorbidity Survey. 
International Journal of Law and Psychiatry. 2001;24:487-508. 
Abramsky T, Watts CH, Garcia-Moreno C, et al. What factors are 
associated with recent intimate partner violence? Findings from 
the WHO multi-country study on women’s health and domestic 
violence. BMC Public Health. 2011311:1. 

Khalifeh H, Howard LM, Osborn D, Moran P, Johnson 

S. Violence against people with disability in England and 

Wales: findings from a national cross-sectional survey. PLoS One. 
20133;8:e55952. 

Gass JD, Stein DJ, Williams DR, Seedat S. Gender differences in 
risk for intimate partner violence among South African adults. 
Journal of Interpersonal Violence. 2011;26:2764-89. 

Ehrensaft MK, Cohen P, Brown J, Smailes E, Chen H, Johnson 
JG. Intergenerational transmission of partner violence: a 20-year 
prospective study. Journal of Consulting and Clinical Psychology. 
2003;71:741. 

Devries KM, Child JC, Bacchus LJ, et al. Intimate partner vio- 
lence victimization and alcohol consumption in women: a sys- 
tematic review and meta-analysis. Addiction. 2014;109:379-91. 
Heise L, Garcia-Moreni C. Violence by intimate partners. 

In: Krug EG, Dahlberg LL, Mercy JA, Zwi AB, Lozano R, editors. 
World Report on Violence and Health. Geneva: World Health 
Organization; 2002. pp. 87-122. 


46. 


47. 


48. 


49. 


50. 


Bile 


52, 


Do. 


54. 


557 


56. 


D 


58. 


59. 


60. 


6l. 


62. 


63. 


64. 


Campbell JC. Health consequences of intimate partner violence. 
The Lancet. 2002;359:1331-6. 

Flach C, Leese M, Heron J, et al. Antenatal domestic violence, 
maternal mental health and subsequent child behaviour: a cohort 
study. BJOG. 2011;118:1383-91. 

Golding JM. Intimate partner violence as a risk factor for 

mental disorders: a meta-analysis. Journal of Family Violence. 
1999;14:99-132. 

Oram S, Trevillion K, Feder G, Howard LM. Prevalence of ex- 
periences of domestic violence among psychiatric patients: sys- 
tematic review. British Journal of Psychiatry. 2013;202:94-9. 
Bundock L, Howard LM, Trevillion K, Malcolm E, Feder G, 
Oram S. Prevalence and risk of experiences of intimate partner 
violence among people with eating disorders: a systematic review. 
Journal of Psychiatric Research. 2013;47:1134-42. 

Howard LM, Oram S, Galley H, Trevillion K, Feder G. Domestic 
violence and perinatal mental disorders: a systematic review and 
meta-analysis. PLoS Medicine. 2013;10:e1001452. 

Trevillion K, Oram S, Feder G, Howard LM. Experiences of do- 
mestic violence and mental disorders: a systematic review and 
meta-analysis. PLoS One. 2012;7:e51740. 

Brown GW, Harris TO, Hepworth C, Robinson R. Clinical and 
psychosocial origins of chronic depressive episodes. II. A patient 
enquiry. British Journal of Psychiatry. 1994;165:457-65. 
Ehrensaft MK, Moffitt TE, Caspi A. Is domestic violence followed 
by an increased risk of psychiatric disorders among women but 
not among men? A longitudinal cohort study. American Journal 
of Psychiatry. 2006;163:885-92. 

Khalifeh H, Moran P, Borschmann R, et al. Domestic and sexual 
violence against patients with severe mental illness. Psychological 
Medicine. 2015;45:875-86. 

Levendosky AA, Bogat GA, Martinez-Torteya C. PTSD symp- 
toms in young children exposed to intimate partner violence. 
Violence Against Women. 2013;19:187-201. 

Alisic E, Krishna RN, Groot A, Frederick JW. Children’s mental 
health and well-being after parental intimate partner homicide: a 
systematic review. Clinical Child and Family Psychology Review. 
2015;18:328-45. 

Blair F, McFarlane J, Nava A, Gilroy H, Maddoux J. Child witness 
to domestic abuse: baseline data analysis for a seven-year pro- 
spective study. Pediatric Nursing. 2015;41:23. 

Widom CS, Wilson HW. Intergenerational transmission of vio- 
lence. In: Lindert J, Levav I, editors. Violence and Mental Health: Its 
Manifold Faces. New York, NY: Springer; 2015. pp. 27-45. 
Zlotnick C, Johnson DM, Kohn R. Intimate partner vio- 

lence and long-term psychosocial functioning in a national 
sample of American women. Journal of Interpersonal Violence. 
2006;21:262-75. 

Devries KM, Mak JY, Bacchus LJ, et al. Intimate partner vio- 
lence and incident depressive symptoms and suicide attempts: a 
systematic review of longitudinal studies. PLoS Medicine. 
2013;10:e1001439. 

Khalifeh H, Dean K. Gender and violence against people 

with severe mental illness. International Review of Psychiatry. 
20103;22:535-46. 

Anderson F, Howard L, Dean K, Moran P, Khalifeh H. Childhood 
maltreatment and adulthood domestic and sexual violence 
victimisation among people with severe mental illness. Social 
Psychiatry and Psychiatric Epidemiology. 2016;51:961-70. 
Sharhabani-Arzy R, Amir M, Kotler M, Liran R. The toll of 
domestic violence PTSD among battered women in an Israeli 
sample. Journal of Interpersonal Violence. 2003;18:1335-46. 


CHAPTER 143 Domestic violence and abuse and mental health 


65. 


66. 


67. 


68. 


69. 


70. 


71. 


72. 


Ws 


74. 


WS, 


76. 


Ws 


78. 


7) 


80. 


81. 


82. 


83. 


Herman JL. Complex PTSD: a syndrome in survivors of prolonged 
and repeated trauma. Journal of Traumatic Stress. 1992;5:377-91. 
Greenberg N, Brooks S, Dunn R. Latest developments in post- 
traumatic stress disorder: diagnosis and treatment. British 
Medical Bulletin. 2015;114:147-55. 

American Psychiatric Association. Diagnostic and Statistical 
Manual of Mental Disorders, fifth edition. Arlington, 

VA: American Psychiatric Publishing; 2013. 

Fazel S, Lichtenstein P, Grann M, Goodwin GM, Langstrom 

N. Bipolar disorder and violent crime: new evidence from 
population-based longitudinal studies and systematic review. 
Archives of General Psychiatry. 2010;67:931-8. 

Fazel S, Wolf A, Chang Z, Larsson H, Goodwin GM, Lichtenstein 
P. Depression and violence: a Swedish population study. The 
Lancet Psychiatry. 2015;2:224-32. 

Chang Z, Larsson H, Lichtenstein P, Fazel S. Psychiatric disorders 
and violent reoffending: a national cohort study of convicted 
prisoners in Sweden. The Lancet Psychiatry. 2015;2:891-900. 
Oram S, Trevillion K, Khalifeh H, Feder G, Howard LM. 
Systematic review and meta-analysis of psychiatric disorder 
and the perpetration of partner violence. Epidemiology and 
Psychiatric Sciences. 2014;23:361-76. 

Oram S, Flynn SM, Shaw J, Appleby L, Howard LM. Mental 
illness and domestic homicide: a population-based descriptive 
study. Psychiatric Services. 2013;64:1006-11. 

Ullrich S, Keers R, Coid JW. Delusions, anger, and serious 
violence: new findings from the MacArthur Violence Risk 
Assessment Study. Schizophrenia Bulletin. 2014;40:1174-81. 
Keers R, Ullrich S, DeStavola BL, Coid JW. Association of vio- 
lence with emergence of persecutory delusions in untreated 
schizophrenia. American Journal of Psychiatry. 2014;171:332-9. 
Fazel S, Zetterqvist J, Larsson H, Långström N, Lichtenstein P. 
Antipsychotics, mood stabilisers, and risk of violent crime. The 
Lancet. 2014;384:1206-14. 

Moffitt TE, Robins RW, Caspi A. A couples analysis of partner 
abuse with implications for abuse-prevention policy. Criminology 
and Public Policy. 2001;1:5-36. 

Ehrensaft MK, Moffitt TE, Caspi A. Clinically abusive relation- 
ships in an unselected birth cohort: men’s and women’s partici- 
pation and developmental antecedents. Journal of Abnormal 
Psychology. 2004;113:258-70. 

Coker AL, Weston R, Creson DL, Justice B, Blakeney P. PTSD 
symptoms among men and women survivors of intimate partner 
violence: the role of risk and protective factors. Violence and 
Victims. 2005;20:625-43. 

Jonas S, Khalifeh H, Bebbington PE, et al. Gender differ- 

ences in intimate partner violence and psychiatric disorders 

in England: results from the 2007 adult psychiatric morbidity 
survey. Epidemiology and Psychiatric Sciences. 2014;23:189-99. 
Yearnshire S. Analysis of Cohort. Violence Against 

Women. London: Royal College of Obstetricians and 
Gynaecologists; 1997. 

Trevillion K, Hughes B, Feder G, Borschmann R, Oram S, 
Howard LM. Disclosure of domestic violence in mental health 
settings: a qualitative meta-synthesis. International Review of 
Psychiatry. 2014;26:430-44. 

Rose D, Trevillion K, Woodall A, Morgan C, Feder G, Howard 
L. Barriers and facilitators of disclosures of domestic violence by 
mental health service users: qualitative study. British Journal of 
Psychiatry. 2011;198:189-94. 

Feder G, Ramsay J, Dunne D, et al. How far does screening 
women for domestic (partner) violence in different 


IESE) 





1460 





84. 


85. 


86. 


87. 


88. 


89. 


90. 


Ql. 


928 


93: 


94. 


955 


96. 


Oke 


98. 


99. 


SECTION 24 Forensic psychiatry 


health-care settings meet criteria for a screening programme? 
Systematic reviews of nine UK National Screening Committee 
criteria. Health Technology Assessment. 2009;13:iii-iv, xi—xiii, 
1-113, 137-347. 

Warshaw C, Sullivan CM, Rivera EA. A Systematic Review of 
Trauma-Focused Interventions for Domestic Violence Survivors. 
Chicago, IL: National Center on Domestic Violence, Trauma and 
Mental Health; 2013. 

Wilson M, Daly M. Spousal homicide risk and estrangement. 
Violence and Victims. 1993;8:3. 

Campbell JC, Webster D, Koziol-McLain J, et al. Risk fac- 

tors for femicide in abusive relationships: results from a 
multisite case control study. American Journal of Public Health. 
2003;93:1089-97. 

Howard L, Trevillion K, Khalifeh H, Woodall A, Agnew-Davies 
R, Feder G. Domestic violence and severe psychiatric dis- 
orders: prevalence and interventions. Psychological Medicine. 
2010;40:881-93. 

National Institute for Health and Care Excellence. Domestic 
Violence and Abuse: How Health Services, Social Care and 

the Organisations They Work With Can Respond Effectively. 
London: National Institute of Health and Care Excellence; 2014. 
Trevillion K, Byford S, Cary M, et al. Linking abuse and recovery 
through advocacy: an observational study. Epidemiology and 
Psychiatric Sciences. 2014;23:99-113. 

Nyame S, Howard LM, Feder G, Trevillion K. A survey of 
mental health professionals’ knowledge, attitudes and prepared- 
ness to respond to domestic violence. Journal of Mental Health. 
20133;22:536-43. 

National Institute for Health and Care Excellence. Domestic 
Violence and Abuse: Multi-Agency Working. London: National 
Institute for Health and Care Excellence; 2014. 

Zaher E, Keogh K, Ratnapalan S. Effect of domestic violence 
training: systematic review of randomized controlled trials. 
Canadian Family Physician. 2014;60:6 18-24. 

Feder G, Davies RA, Baird K, et al. Identification and Referral to 
Improve Safety (IRIS) of women experiencing domestic violence 
with a primary care training and support programme: a cluster 
randomised controlled trial. The Lancet. 2011;378:1788-95. 
Trevillion K, Byford S, Cary M, et al. Linking abuse and recovery 
through advocacy: an observational study. Epidemiology and 
Psychiatric Sciences. 2014;23(1):99-113. 

Stewart DE, Chabra PS. WPA international competency-based 
curriculum for mental health care providers on intimate partner 
violence and sexual violence against women. World Psychiatry. 
2016;16:223-4. 

Department of Health. Domestic Violence and Abuse: Guidance 
for Health Professionals. London: Department of Health; 2013. 
Choi YJ, An S. Interventions to improve responses of helping pro- 
fessionals to intimate partner violence: a quick scoping review. 
Research on Social Work Practice. 2016;26:101-27. 

Read J, Sampson M, Critchley C. Are mental health services 
getting better at responding to abuse, assault and neglect? Acta 
Psychiatrica Scandinavica. 2016;134:287-94. 

Campbell JC, Glass N, Sharps PW, Laughon K, Bloom T. Intimate 
partner homicide review and implications of research and policy. 
Trauma, Violence, and Abuse. 2007;8:246-69. 


100. 


101. 


102. 


103. 


104. 


105. 


106. 


107. 


108. 


109. 


110. 


AU 


112. 


113. 


114. 


Ss 


116. 


Oram S, Carpon, S., Trevillion, K. Promoting Recovery in 
Mental Health: Evaluation Report. London: King’s College 
London; 2016. 

The National Confidential Inquiry. The National Confidential 
Inquiry into Homicide and Suicide by People with Mental Illness. 
Annual Report 2014: England, Northen Ireland, Scotland, Wales. 
Manchester: University of Manchester; 2014. 

Flynn S, Gask L, Appleby L, Shaw J. Homicide-suicide and the 
role of mental disorder: a national consecutive case series. Social 
Psychiatry and Psychiatric Epidemiology. 2016;51:877-84. 
Desmarais SL, Van Dorn RA, Johnson KL, Grimm KJ, Douglas 
KS, Swartz MS. Community violence perpetration and victim- 
ization among adults with mental illnesses. American Journal of 
Public Health. 2014;104:2342-9. 

Stewart DE, Harriet MacMillan M. Intimate partner violence/ 
La violence entre partenaires intimes. Canadian Journal of 
Psychiatry. 2013;58:S1. 

McCoy E, Butler N, Quigg Z. Evaluation of the Liverpool Multi- 
Agency Risk Assessment Conference (MARAC). Liverpool: Centre 
for Public Health, Liverpool John Moores University; 2016. 
Hegarty K, Taft A, Feder G. Violence between intimate part- 
ners: working with the whole family. BMJ. 2008;337:346. 

HM Government. Multi-Agency Statutory Guidance on Female 
Genital Mutilation. London: Home Office; 2016. 

Callaghan JE, Alexander JH, Sixsmith J, Fellin LC. Beyond 
‘Witnessing’ children’s experiences of coercive control in do- 
mestic violence and abuse. Journal of Interpersonal Violence. 
2015. pii: 0886260515618946. [Epub ahead of print]. 
Garcia-Moreno C, Hegarty K, d’Oliveira AF, Koziol-McLain J, 
Colombini M, Feder G. The health-systems response to violence 
against women. The Lancet. 2015;385:1567-79. 

National Institute for Health and Care Excellence (NICE). 
(2018). Post-traumatic stress disorder. NICE guideline [NG116]. 
https://www.nice.org.uk/guidance/ng116 

Jongh A, Resick PA, Zoellner LA, et al. Critical analysis of 

the current treatment guidelines for complex PTSD in adults. 
Depression and Anxiety. 2016;33:359-69. 

Oram S, Khalifeh H, Howard LM. Violence against women and 
mental health. The Lancet Psychiatry. 2017;4(2):159-70. 
Tirado-Muñoz J, Gilchrist G, Farré M, Hegarty K, Torrens 

M. The efficacy of cognitive behavioural therapy and advo- 
cacy interventions for women who have experienced intimate 
partner violence: a systematic review and meta-analysis. Annals 
of Medicine. 2014;46:567-86. 

van den Berg DP, de Bont PA, van der Vleugel BM, 

et al. Trauma-focused treatment in PTSD patients with 
psychosis: symptom exacerbation, adverse events, and 
revictimization. Schizophrenia Bulletin. 2016;42:693-702. 

van den Berg DP, de Bont PA, van der Vleugel BM, et al. 
Prolonged exposure vs eye movement desensitization and repro- 
cessing vs waiting list for posttraumatic stress disorder in patients 
with a psychotic disorder: a randomized clinical trial. JAMA 
Psychiatry. 2015;72:259-67. 

Kelly L, Westmarland N. Domestic Violence Perpetrator 
Programmes: Steps Towards Change. Project Mirabel Final 
Report. London and Durham: London Metropolitan University 
and Durham University; 2015. 


essing and managing the risk 


iolence to others 


uchanan 























= 
As 














LL 























Assessing the risk of patient violence has always been integral to the 
practice of psychiatry. There are several reasons. Violence risk as- 
sessment is a necessary part of providing safe and effective outpatient 
psychiatric care. It is an essential element of proper decision-making 
around hospital admission and discharge and of providing a safe 
environment to patients and those who care for them. Finally, like 
many other aspects of psychiatric practice, when things go wrong, 
an inadequate assessment of violence risk is a potential source of 
legal liability for clinicians and mental health services. 

While its importance is long-standing, the prominence of vio- 
lence risk assessment has risen over the past 40 years. The most com- 
monly identified causes for this increased prominence include the 
closure of many large psychiatric hospitals and discharge of patients 
to community settings, although empirical evidence linking this to 
an increase in violence is lacking. On both sides of the Atlantic, com- 
munity care is widely regarded as inadequately resourced to manage 
the challenge of increased caseloads and challenging patient behav- 
iour. Individuals and groups seeking to improve community care 
sometimes highlight violence rates in an attempt to highlight the 
need for increased resources. 

These public, press, and political pressures are external. It is also 
the case, however, that over the past 40 years the focus of medi- 
cine itself has shifted. Public health indices, of which violence is 
one, have replaced the relationship between patient and doctor as 
the primary object of medical attention [1]. The value of screening 
clinics, vaccination, and antenatal care is now commonly assessed 
by reference to the health of the general population, not to the 
health of patients in care. Prevention of all types is now more than 
ever the province of medicine. As a recognized, if unusual, conse- 
quence of mental disorder, patient violence has become a recog- 
nized ‘outcome measure. 

In some respects, psychiatry has been unprepared for this in- 
creased focus on the infrequent, but sometimes serious, violence 
of its clients. Throughout the developed world, bed closures have 
reduced the availability of psychiatry’s principal tool in managing 
violence risk—admission to hospital. In the UK, proposals aimed at 





detaining people with personality disorders were justified by refer- 
ence to doctors using ‘loopholes’ in existing legislation to discharge 
dangerous patients. In the United States, psychiatry has struggled 
to find a single path through the conflicting demands that patient 
confidentiality be respected and that potential victims and others be 
warned about risk. The primacy of the doctor-patient relationship, 
once unchallenged, is now seen as out of place in a world where a 
shrinking proportion of those in care see a doctor at each visit. 

This chapter reviews the principles governing the assessment of 
violence risk in psychiatry. It discusses the practice of conducting 
such assessments and examines the limits of psychiatric violence 
risk assessment. In doing so, it draws on material published in pre- 
vious reviews of risk assessment and management [2-4]. While not 
all psychiatric patient violence can be prevented, good practice in 
assessing violence risk is both a necessary element in ensuring the 
safety of patients and others and a vital aspect in the care of people 
with mental disorders. 














Overv 


Much violence risk assessment in psychiatry is invisible, carried out 
routinely by clinicians in the course of their work. Violence risk is 
one of many considerations that inform decisions that range from 
admitting a patient to hospital to deciding on the most appropriate 
form of outpatient care. Even when risk of harm to others becomes 
the principal focus of the doctor's interaction with his or her pa- 
tient, the principles underlying its assessment are the same as those 
underlying psychiatric practice more generally. A good evaluation 
will usually be based on the results of obtaining a history and exam- 
ining a patients mental state. 

Accurate assessment depends on the availability of accurate in- 
formation. In violence risk assessment, this will usually include 
information obtained from collateral sources including medical re- 
cords, informants, and, where the police have been involved, police 
reports. Assessments carried out at the point of admission to hos- 
pital are often brief and unresolved issues of risk require continued 
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Fig. 144.1 Approaches to violence risk assessment in psychiatry. 


Reproduced from Buchanan A, Norko M, Violence risk in community settings. In: Buc 
Second Edition, Copyright (2017), with permission from Oxford University Press. 


attention in the course of an admission. Additional investigation, 
including psychological testing, may be required. Particularly with 
regard to specialist areas of practice, such as assessing the risk of 
sexual offending, it may be appropriate to ask specialist services, 
such as forensic psychiatry, to become involved. 

The literature contains descriptions of numerous approaches 
to risk assessment. These approaches can usefully be described by 
virtue of where they fall on two dimensions. These dimensions are 
presented as the axes in Fig. 144.1. 

The first dimension reflects whether a risk assessment method 
helps clinicians to assess risk by attending to known statistical 
correlations (sometimes referred to as a ‘nomothetic’ approach) 
or through a cause-based analysis of a client’s behaviour (also 
known as an ‘ideographic’ method). The second dimension re- 
flects the degree to which the approach provides a structure for 
the assessor to follow. Some risk assessment methods, including 
actuarial schemes, require the scoring of a prescribed list of items 
and the combining of a subject’s scores on those items in a par- 
ticular way. Other methods, particularly traditional clinical ones, 
permit the assessor greater discretion and, to this extent, provide 
less structure. 


Correlation and cause in risk assessment 


Research findings from the past several decades have identified 
numerous factors associated with psychiatric patient violence. 
Psychiatric patients are, in some ways, like the general population in 
terms of what increases the risk for violent offending, and in other 
ways different. Among the psychiatric population, being male is less 
strongly associated with violence, first offences tend to occur later, 
risk reduces less with advancing age [5], and stable relationships 





anan A, Wootton L [Eds]., Care of the Mentally Disordered Offender in the Community, 


are not as protective, especially when individuals experience poor 
overall function [6-8]. 

There are also many similarities, in terms of risk factors, between 
psychiatric patients and the general population. Substance abuse is 
a significant risk factor, as are unemployment, living in a high-crime 
neighbourhood, and having antisocial peers. Violent crimes tend to 
be committed by young males. Recidivism is lower for violent and 
more serious crime than for non-violent crime. Criminal history is a 
particularly strong predictor [9-12]. 

A second way in which psychiatrists assess risk, in addition to 
looking for risk factors, is by comparing their understanding of the 
patient’s personality, symptoms, and environment with their under- 
standing of the likely causes of violence. Where someone suffers 
from persecutory delusions that concern their spouse, for instance, 
there will usually be available no empirical data from research con- 
ducted on samples of similar patients demonstrating a correlation 
between continued cohabitation and violence. Yet the clinician's 
understanding of the likely causes of violence may still allow him 
or her to conclude that continued cohabitation presents a risk [13]. 

Pollock described the processes involved: 


“The skillful clinician assessing dangerous behavior formulates and 
tests a series of clinical hypotheses to define patterns of violence in 
the individual’s history. Once defined, these patterns can be applied 
to the explanation and prediction of violence in that individual? [14, 
p. 105] 


Approaches to risk assessment based on explanations of this type 
seem to rely heavily on induction. They require the clinician to draw 
conclusions about the future from past observations. Future condi- 
tions will never exactly mimic the conditions in which behaviour 
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has occurred in the past, yet the circumstances of other episodes of 
violence, whether in the patient’s case or more generally, will usually 
be relevant and sometimes critical. Notwithstanding the uncertainty 
inherent in this process, one task of risk assessment is to determine 
the relevance of past patterns. 

Clinicians trying to work out what might cause future violence 
are guided also by the understanding of patterns of behaviour that 
they develop in their training and through clinical practice [15, 16]. 
Claims that ‘causal’ ways of thinking are better than correlation- 
based ones at predicting rare events [17] have not been confirmed 
by empirical research. Instead, the persistence of causal approaches 
when clinicians think about risk may relate to the fact that many 
of the other judgements required in medicine are causation-based 
also—establishing why someone has symptoms, for instance, or 
deciding which further investigations are needed to complete an 
evaluation. Because clinical practice requires each of these judge- 
ments to be integrated into a single plan, it may be that clinicians 
find it helpful to use the same causal heuristic in assessing risk that 
they use in other aspects of their work. 


Structure in risk assessment 


Although not frequently discussed in the literature on risk assessment, 
it is important to recognize that the cause-based analyses of many 
clinicians are structured. The structures are derived from their profes- 
sional training. They include accepted practices and guidelines con- 
cerning taking a clinical history and examining the patient. Structure 
offers the same advantages to risk assessment that it offers to other 
areas of clinical practice. It is a means of integrating and communi- 
cating information [18]. It can be a useful aide-memoire, particularly 
when the clinical question being addressed is unusual (some risk fac- 
tors apply particularly to sex offences, for instance). Learning how to 
structure the clinical approach is a key aspect of clinical training. 

The correlation-based approach is most commonly structured 
using structured risk assessment instruments (SRAIs). Actuarial 
instruments, such as the Violence Risk Appraisal Guide (VRAG), 
formalize the process by which the simultaneous presence of more 
than one correlate of violence increases the perception of risk. They 
do this by rating variables, such as poor school adjustment and al- 
cohol problems, and combining these mathematically to generate 
an overall score or category. A different type of instrument relies on 
‘structured professional judgement: The Historical, Clinical Risk 
Management-20 (HCR-20), for instance, encourages the clinician to 
assess the relevance of a list of pre-identified variables, but also to 
take into account other information, including factors he or she con- 
siders unique to the case, before allocating a case to a risk category. 

There are now a large number of such instruments [19]. Some of 
the most widely used are listed in Table 144.1. Using some of these 
instruments requires specialist training and experience, which are 
described in the references in Table 144.1. 

Structured approaches to violence risk assessment have been 
shown to have greater predictive validity than unstructured ones 
when the follow-up period is measured in years [20, 21]. In the 
shorter term, unstructured predictions made by emergency room 
clinicians have been shown to have levels of accuracy comparable to 
that of structured instruments [22, 23]. 

The proven predictive validity of structured correlational ap- 
proaches has not led to their unqualified clinical acceptance [24]. 
One person's structure is another's straitjacket. Clinicians complain 


Table 144.1 Structured risk assessment instruments 


Type, name, and reference Target population 





Actuarial 

VRAG [53] General psychiatry, forensic 
SORAG [53] Sexual offenders 

Static 99 [57] Sexual offenders 

COVR [58] General psychiatry 

PCL-R [59] General psychiatry, forensic 
PCL-SV [60] General psychiatry, forensic 


Structured professional judgement 








HCR-20 [61] General and forensic 
SARA [62] Spousal assault 
SVR-20 [63] Sexual offenders 
RSVP [64] Sexual offenders 
Source: data from Buchanan A, Norko M, Violence risk assessment. In: Buchanan 
A, Norko M [Eds]., The Psychiatric Report, pp.224-239, Copyright (2011), Cambridge 


University Press; Buchanan A, Norko M, Violence risk in community settings. 

In: Buchanan A, Wootton L [Eds]., Care of the Mentally Disordered Offender in 
the Community, Second Edition, Copyright (2017), with permission from Oxford 
University Press. 


that they have to respond both to risky circumstances not listed in 
an instrument and to mitigating factors such as the incapacitation 
through hospitalization or imprisonment of someone with a high 
actuarial score [25, 26]. One alternative would be to outline permis- 
sible exceptions, when the results of using a scale could be ignored, 
but this course has been opposed by the authors of some of the most 
accurate instruments [27]. A related clinicians’ complaint is that it is 
difficult to use the score on a rating scale to guide treatment when 
research has yet to show whether treatment will change the score or 
whether the risk will then be less. 

On the other hand, unstructured approaches to risk assessment 
may not integrate straightforwardly with other aspects of treatment, 
either. In addition, unstructured approaches are more vulnerable 
to cognitive biases, such as attending disproportionately to recent 
events, that afflict many of the judgements that people make [28], 
risk assessment included [29]. Finally, if the central task of risk as- 
sessment lies in establishing and quantifying statistical correlations, 
the arithmetic techniques contained in some structured approaches 
may be best suited to this task [30]. 

Criteria have been developed in other areas of medicine to estab- 
lish whether a validated rating scale will be useful in an individual 
case [31]. Three are statistical: whether the confidence limits are 
acceptable; whether the scale works for all subgroups of subjects ( 
‘goodness of fit’); and whether its predictions are accurate. While the 
predictive accuracy of many instruments is uncontroversial, confi- 
dence intervals remain contentious [32, 33]. Non-statistical criteria 
are the degree to which the patient resembles patients in the valid- 
ation study, the similarity between the outcome in the study and the 
outcome of interest to the clinician, and the availability to the clin- 
ician of data similar to those that were used in the study. 


Integration 


Correlation-based approaches can indicate whether someone shares 
some characteristics with a high-risk group, but not whether they can 
properly be seen as representative of the group and what intervention 
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is the most appropriate. Causation-based approaches can incorporate 
more kinds of information but offer no obvious mechanism whereby 
the psychiatrist can use data to compare the risk to that posed by other 
people. Because they offer different things, a risk assessment that 
makes use of both approaches is, at least in principle, in a better pos- 
ition to be helpful than one that chooses between them [34]. 

There is little agreement, however, as to how such integration might 
be achieved. One possibility is that the psychiatrist should proceed 
sequentially, first reviewing the established correlates of violence that 
are present and then using this review as a starting point for a cause- 
based analysis that makes use of the psychiatrist’s understanding 
of the subject and his or her circumstances. Correlation-based ap- 
proaches, after all, generate information that courts and others can 
usually obtain by other means, for instance from pre-sentence reports 
prepared by the probation service. In theory at least, causation-based 
approaches have the potential to provide something extra. 

Three factors are likely to govern the extent to which they can do 
this in practice. The first is the degree to which the psychiatrist has 
had access to the patient’s mental state. Only if someone is willing to 
discuss their feelings, intentions, and beliefs, can a clinician incorp- 
orate this information. 

The second is whether circumstances may change. Where consid- 
eration is being given to treating in the community a patient who has 
recently acted violently, for instance, an evaluation that concentrates 
on identifying the population correlates of violence, such as past vio- 
lence and age, is unlikely to be helpful. A more useful assessment 
may employ data concerning past behaviour and include a cause- 
based conclusion that focuses on the origins of recent violence and 
examines what has, and what has not, changed. Where the circum- 
stances are dynamic, for instance when it is not known whether the 
subject of the evaluation will return to live with the partner he has 
abused, a cause-based approach that incorporates different plausible 
scenarios may be particularly helpful. 

The third factor affecting the usefulness of a causation-based ap- 
proach is the period over which the risk assessment will apply. In 
emergency settings, where an assessment is often required to cover 
days or weeks, there are few correlates to guide clinicians [35], al- 
though anger [36] and substance use [37] both seem to be important. 
Instead, a sound risk assessment is more likely to be based on an 
understanding of the likely effect on a patient of the circumstances to 
which they are being discharged. Assessments designed to apply over 
longer periods, on the other hand, are less able to rely on this kind of 
understanding because mental states, circumstances, and the sources 
of risk change. Unless they know what these changes will be, clinicians 
who wish to offer an estimate of risk designed to cover months or years 
have little choice but to concentrate on recognized risk factors, using 
either an unstructured approach or a recognized instrument. 


The practice of violence risk assessment 


Obtaining the background information 


A number of factors that are identified in the course of taking a psy- 
chiatric history have been shown to be associated with future vio- 
lence [38-42]. 

Table 144.2 lists those variables identified most frequently in psy- 
chiatric reviews. Because past violence is the strongest correlate of 


future violence, and because the risk of future violence increases 
with each subsequent offence [43], risk assessments should include 
details of the client’s history of acting violently. The dates and types 
of violent act are important, because criminological variables, such 
as age of onset of violent behaviour and the number of violent con- 
victions, are themselves correlates of future violence. 

A cause-based analysis, however, will usually require more infor- 
mation. The circumstances of each act should be described in suffi- 
cient detail to allow the identification of patterns, if these exist. These 
circumstances include the individual's living situation, their rela- 
tionship to the victim, what they intended to achieve, their mental 
state (and whether they were in treatment), and their use of drugs 
and alcohol. The nature of the victim’s injury should be described; 
hospital treatment is often an indicator of severity. Collateral sources 
that can be of assistance in gathering this information include arrest 
records, police reports, trial transcripts, pre-sentence reports, and 
treatment records. 

The association between substance use and violence is one of the 
most widely reported empirical findings in risk assessment. Moving 
beyond the statistical correlation to include a cause-based analysis 
requires that the assessor addresses the details of an individual’s 
behaviour in relation to substances. Violence may be the result of 
intoxication, and past instances of violence when using substances 
should be noted. It may follow an exacerbation of mental illness pre- 
cipitated by drug use, in which case the history may indicate pre- 
vious instances of drug-induced relapse. Finally, violence may be a 
consequence of involvement in a drug market where territorial con- 
flicts are common and where weapons are carried, although this is 
less of an issue for alcohol. The response of the subject to treatment 
for substance abuse problems will usually be important, especially in 
criminal sentencing. 

The description of the past psychiatric history should make clear 
the relationship, if any, between an individual's history of violence 
and his mental disorder. The nature of any treatment, the individual's 
compliance with treatment, and his or her response to treatment will 
usually be important. The psychiatric history may also suggest what 
psychiatric measures have been helpful previously in reducing risk. 
The subject's social circumstances should be described for past epi- 
sodes of violence (see earlier), for the present, and with reference 
to the period over which the risk assessment is required. Attention 
should be paid to the presence or absence of potentially protective 
factors such as stable relationships, employment, and housing. The 
availability of likely victims, including the victims of past violence, 
should be noted. 


Examining the mental state 


The implications for violence risk of mental state abnormalities de- 
pend on the context. Where someone harbours delusions regarding 
an individual, for instance, the likelihood of their coming into con- 
tact with that individual and the likely nature of that contact are 
important. Pathological jealousy is likely to be managed differently 
if the jealous man plans to live with the wife he believes is being 
unfaithful. The mental state findings and circumstances most com- 
monly listed by reviewers as associated with risk of harm to others 
[44-48] are listed in Table 144.3. 

The weight that can be appropriately attached to mental state ab- 
normalities depends also on the period over which the risk assess- 
ment is intended to apply. Some features listed in Table 144.3, such 
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Table 144.2 Correlates |. Background factors and social circumstances linked to violence 


Sociodemographics 


Past behaviour 


Psychiatric and 
substance abuse 


Social circumstances 


Traits 


History of violence if no change in person or 
circumstance 


Recent violence 


Substance use, especially if rising 


Availability of weapons, familiarity with their use 


Future circumstances likely to resemble those in 
which violence has previously occurred 


Hare psychopathy (Table 144.3) 


Youth 


Male sex 


History of threats not acted upon 


Conduct disorder 


Domestic conflict, especially if 
escalating 


Impulsiveness 


Feckless disregard for consequences 
in schizophrenia 








Good social networks 


Fear of own potential for violence 


Has demonstrated internal resources to 
cope with conflict 


Source: data from Buchanan A, Norko M, Violence risk assessment. In: Buchanan A, Norko M [Eds]., The Psychiatric Report, pp.224-239, Copyright (2011), Cambridge University Press; 
Buchanan A, Norko M, Violence risk in community settings. In: Buchanan A, Wootton L [Eds]., Care of the Mentally Disordered Offender in the Community, Second Edition, Copyright 
(2017), with permission from Oxford University Press. 


Table 144.3 Correlates Il. Mental state features and interactions associated with risk of harm to others 


Mental state features 
(including recent 
behaviours) 


Interactions 


Steps taken in preparation (obtaining weapons, 
surveillance, putting affairs in order) 


Threats creating fear and concern in people 
who know the person threatening 


Violent thoughts or intentions 


Morbid jealousy where object of jealousy 
available 


Delusional systems focused on individuals seen 
as a threat or as obstructing an important goal 


Depressed suicidal mothers of young children 
Suicidal seeking revenge 
Stalking with past violence 


Angry and threatening with a plan to cause 
harm 


Fearful with a plan to cause harm (may be 
pre-emptive) 


Delusions, particularly misidentification 
syndromes 


Clouding of consciousness and confusion 


Stalking without past violence 


Compliant with treatment 
Responding to treatment 


Perceives treatment as effective 


Insight into illness and need for 
treatment 


Source: data from Buchanan A, Norko M, Violence risk assessment. In: Buchanan A, Norko M [Eds]., The Psychiatric Report, pp.224-239, Copyright (2011), Cambridge University Press; 
Buchanan A, Norko M, Violence risk in community settings. In: Buchanan A, Wootton L [Eds]., Care of the Mentally Disordered Offender in the Community, Second Edition, Copyright 
(2017), with permission from Oxford University Press. 
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as threats, carry particular short-term significance. Others, such as 
those related to personality, will usually be more relevant when the 
assessment is intended to cover months or years. 

The description of thought content should include reference to 
thoughts of violence and their quality. Obsessional thoughts con- 
cerning violence in the absence of psychotic symptoms are not 
unusual, and empirical research has not linked them to future vio- 
lence. Command hallucinations have been described in association 
with violence, but the association is inconsistent, the empirical base 
limited and conflicting, and the implications at the population level 
uncertain [49]. As with other symptoms, a detailed analysis of past 
violence may nevertheless reveal their importance in an individual 
case. Violent thoughts that are pleasurable warrant particular con- 
cern and, when the pleasure has a sexual component, particular at- 
tention to the sexual history. 

In describing a subject’s insight into his or her past behaviour, 
a detailed description or verbatim account will usually be more 
helpful than a summary statement. “Lack of remorse; in particular, 
can cover a range of phenomena, of which a general unwillingness 
to take responsibility probably carries fewer implications for future 
dangerousness than a sense of self-justification, the persistence of a 
grudge, or a reference to unfinished business [50]. A subject’s will- 
ingness to address the causes of violence should be assessed in the 
light of their response to previous interventions. Ambivalence may 
not always have prevented successful treatment. 


Psychological testing 


Psychological testing contributes to the assessment of violence risk 
in community settings in several respects. Firstly, the testing of per- 
sonality and cognitive function can contribute to an understanding 
of the reasons behind an act of violence, for instance when neuro- 
psychological testing demonstrates impaired executive function 
secondary to a head injury. 

Secondly, some tests of personality and general psychological 
functioning serve as risk assessments in their own right. Of these, 
the best known is the Hare Psychopathy Checklist [51], which has 
been shown consistently to predict acts of physical violence with a 
level of accuracy comparable with that of the other structured in- 
struments listed in Table 144.1 [52]. The personality traits identified 
include a number of negative markers encompassing inflated senses 
of self-worth, devious and manipulative attitudes towards others, 
behavioural problems, and lifestyle characteristics. 

Thirdly, the understanding of someone's personality structure 
that testing can sometimes offer can be a valuable aid to com- 
munity management, particularly when it helps the clinician to 
understand what aspects of their family and social environment 
people find most threatening, and why. Fourthly, a patient's his- 
tory may indicate a need for specialized assessments such as as- 
sessments of sex-offending risk. These may need to be repeated 
at regular intervals, either for clinical reasons or because a court 
imposes such evaluations as a condition of someone remaining in 
the community. 

Finally, a range of other investigations may be made necessary 
by particular aspects of a patient’s case. Occupational assessments 
offer particular insights into the stresses that may affect a patient 
moving from institutional care to the community, and into possible 
ways of addressing these. Neurological investigation and treatment 
will be an enduring component of the care of some patients. One of 


the tasks of the team providing care, whether they carry the label 
of forensic or general psychiatry, is to ensure that physical health 
needs are adequately met. Some of those health needs are generated 
by the same medications that allow some patients to remain in the 
community. 


The limits to risk assessment 


In conducting any assessment of violence risk, it is important to bear 
in mind the levels of accuracy that can be expected. Those levels of 
accuracy are not simple to describe. One approach is to ask: if a par- 
ticular structured instrument was used as a screening test and those 
identified as likely to be violent were not discharged, over any given 
period, how many patients would need to be detained to prevent one 
unwanted act? This statistic—the number needed to detain (NND) 
[51]—is the inverse of the positive predictive value (PPV) and analo- 
gous to the ‘number needed to treat. Used as a screening test where 
the base rate of violence is 10% and where, as a result, an unselective 
approach would lead to the detention of ten people in order to pre- 
vent one from acting violently, the VRAG (‘Table 144.1) would re- 
quire the detention of five people to achieve the same end [52]. 

Other structured instruments could be expected to perform simi- 
larly. Unstructured approaches have been less studied, but what 
research does exist does not suggest that they fare better. More im- 
portantly, for risk assessments conducted in everyday clinical set- 
tings the NND rises as the prevalence falls [53]. One consequence is 
that the number of mistakes rises when unusual acts, such as serious 
acts of violence, are sought to be prevented. At the base rate recorded 
in the Epidemiologic Catchment Area (ECA) study, where 17% of 
the sample self-reported violence in the previous 12 months, the 
NND to prevent an act of violence is 3.5 [54]. The outcome measure 
in the ECA study did not require injury. When injury is required, the 
base rate falls. In the Clinical Antipsychotic Trials of Intervention 
Effectiveness (CATIE) study, the 6-month prevalence of assault with 
a weapon or causing serious injury was 3.6% [55]. Here, the NND at 
a sensitivity of 0.73 and a specificity of 0.63 is 15. 

Data such as these have been used in the past to support state- 
ments to the effect that psychiatrists ‘can’ or ‘cannot’ distinguish 
which of their patients will act violently. This is to miss the point. 
There is no threshold of accuracy at which psychiatric risk assess- 
ment becomes possible or not possible. Instead, assessments of 
violence risk are just some of the numerous pieces of information 
that clinicians use to manage a case. Good assessments are more 
helpful in this regard than poor ones, and good assessments do 
not simply seek to be accurate, but also seek to inform clinical 
management. 


Conclusions 


The limits to the accuracy of what can be achieved represent one 
reason why violence risk assessment will usually be one factor, 
among many, that clinicians consider in the course of managing a 
case. Particularly where a patient has a history of violence or has 
stated a desire to harm, however, violence risk assessments will 
remain an essential part of clinical practice. Specialist advice will 
sometimes be necessary. 
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It is likely that the most helpful psychiatric violence risk assess- 
ments make use of both known statistical correlations and causal 
explanations. As one author put it: 


‘By supplementing a knowledge of relevant risk factors with a de- 
tailed examination of an individual's mental state, it may be possible 
to build up some picture of the calculus of reasons on which that 
individual's future actions may be based and in this way to give a 
more precise prediction ...’ [56, p. 70]. 


The author went on to point out that the more an action is dom- 
inated by some overriding preoccupation or delusion, the more pre- 
cise will be a prediction. He also pointed out, however, that however 
detailed our understanding of someone's calculus of reasons, we 
usually cannot predict with any certainty what they will do, or when. 

While the ability to inform clinical decisions is not the same as 
predictive accuracy, the accuracy of violence predictions is important 
in helping clinicians decide which approach to risk assessment they 
should use. Research over the past 40 years has demonstrated a 
modest improvement in the level of accuracy that can be achieved 
using structured risk assessment instruments. As those instruments 
are further refined and developed, further research may show that 
they are capable of levels of accuracy beyond those that can cur- 
rently be achieved. This work should be complemented by research 
on the cause-based approaches to risk assessment that remain wide- 
spread in clinical practice. Finally, research should examine the ways 
in which correlational and causal approaches can most helpfully be 
combined. 
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As an expert witness in court, the psychiatrist ceases to be simply a 
doctor, as a psychiatrist’s report and testimony address issues on the 
boundary between law and psychiatry. The law is not primarily con- 
cerned with the welfare of the defendant. Criminal law is concerned 
with justice, fact-finding, and the attribution of guilt, while psych- 
iatry concerns itself with the welfare of the individual, their mental 
disorder, and its treatment. This chapter will explore the legal frame- 
work for expert reports and testimony, expert witness standards, and 
an ethical framework for practice and will provide practical guid- 
ance on the preparation of reports and testimony. 

Expert witness reports and testimony can only be usefully dis- 
cussed within the legal framework of a particular jurisdiction. This 
chapter will discuss expert witness evidence in the context of juris- 
dictions based on common law (UK, Ireland, United States, Canada, 
and New Zealand). The legal framework for expert witness evidence 
in England and Wales is used as the primary jurisdiction, but the 
discussion will be applicable to other common law areas. While civil 
law jurisdictions (most of Europe) manage expert evidence differ- 
ently from common law jurisdictions, both share concerns about 
admissibility, reliability, and impartiality of expert witness evidence. 
The focus of this chapter is on expert testimony in criminal courts, 
but the chapter will also discuss how expert evidence is handled in 
civil courts. 








Witnesses in court can only give evidence of facts they personally 
perceived, and not evidence of their opinion. The opinion of an ex- 
pert witness is an exclusion to this general rule, because the court 
needs the assistance of experts to consider issues beyond their 
knowledge. The judge, whether sitting alone or in a jury trial, acts 
as gatekeeper on whether expert evidence is admissible. In criminal 
courts, it is for the jury to draw inferences from, and adduce weight 
to, the evidence. In the absence of a jury, the judge decides both on 
admissibility of, and weight accorded to, expert evidence [1-4]. 





Miscarriages of justice and development of rules 
governing expert evidence 


All jurisdictions have had miscarriages of justice attributed, at least 
in part, to poor-quality expert evidence. Two cases in the English 
jurisdiction—R v Clark and R v Cannings—concerned miscarriages 
of justice where medical evidence influenced the court’s decision. 
Both cases concerned two infant deaths in the same family due to 
sudden infant death syndrome. An expert paediatrician gave evi- 
dence that there was only a 1:73 million chance of such deaths in the 
same family in the absence of a genetic or an environmental cause, 
thereby implying that the alternative proposition, that of deliberate 
killing, was more likely (see The prosecutor's fallacy, p. 1472). Both 
the statistical methodology and assumptions proved faulty, and both 
mothers were acquitted on appeal. The appeal court was critical of 
the expert medical evidence. The expert was not a trained statis- 
tician and had not consulted a trained statistical colleague before 
giving his opinion, thereby straying beyond his area of expertise. 
The opinion was dogmatic, that is, was a view based on a hypothesis 
not sufficiently scrutinized or supported by empirical evidence and 
did not give sufficient weight to conflicting views. The appeal court 
was concerned that the criminal court had been swayed by the aura 
of expertise and infallibility of the expert and had not critically ap- 
praised the reliability of the expert’s evidence. These themes will be 
picked up in the rest of this section. 


Case law and expert witnesses 


Two landmark cases from the United States—Frye and Daubert— 
have become highly influential in developing case law on expert 
testimony across jurisdictions. The English courts have cited and 
amended a judgement in the Australian courts (in turn, citing Frye 
and Daubert) Bonython and decided upon four criteria to be con- 
sidered by the court in deciding the admissibility and reliability of 
expert evidence: 


1. Whether the subject matter of expert evidence is beyond the 
‘common knowledge’ of judge and jury, making such evidence 
necessary to the court to be able to form a sound judgement on 
that subject matter. 
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2. Whether the expert's evidence forms ‘part of a body of know- 
ledge that is sufficiently organized or recognized to be acceptable 
as a reliable body of knowledge or experience, thereby making 
the expert evidence of special assistance to the court. 

3. Whether the expert has ‘acquired by study or experience suffi- 
cient knowledge of the subject’ to render his or her opinion of 
value to the court. 

4. Whether the expert can demonstrate that they provide an im- 
partial opinion, recognizing the expert’s duty is to the court. 


By rigorously applying these four criteria to expert evidence, 
courts strive to ensure scientific evidence does not contribute to 
miscarriages of justice. 


Common knowledge 


Lawton L. J. in R v Turner established a ‘common knowledge’ rule 
governing expert evidence, specifically psychiatric evidence in this 
case, as follows: 


‘An expert opinion is admissible to furnish the Courts with sci- 
entific information which is likely to be outside the experience 
and knowledge of a Judge or a jury. If on the proven fact, a Judge 
or jury can form their own conclusions without help, then the 
opinion of an expert is unnecessary. In such a case if it given 
dressed up in scientific jargon it may make judgement more dif- 
ficult. The facts that an expert witness has impressive scientific 
qualifications does not by that fact alone make his opinion on 
matters of human nature and behaviour within the limits of nor- 
mality any more helpful than that of the jurors themselves; but 
there is a danger that they may think it does ... jurors do not 
need psychiatrists to tell them how ordinary folk who are not suf- 
fering from any mental illness are likely to react to the stresses 
and strains of life’ 


This seems to limit psychiatric evidence to recognized 
mental disorder [as in the Diagnostic and Statistical Manual of 
Mental Disorders, fifth edition (DSM-5) and the International 
Classification of Diseases, tenth revision (ICD-10)]. However, ex- 
pert advice is allowed which is ‘outside the experience and know- 
ledge of a judge or jury. The abnormal/normal dichotomy is not 
a rule of law, but guidance. Courts have allowed evidence on a 
variety of conditions which would not normally be thought of as 
an established mental disorder, for example ‘battered women’s 
syndrome. 

Particular problems arise for the court in respect of borderline 
conditions falling short of a recognized mental disorder. Here admis- 
sibility will be determined by the court’s judgement as to whether the 
expert evidence addresses matters outside the experience or know- 
ledge of a judge or jury. The psychiatrist will have other knowledge 
of assistance to the court such as assessing the risk associated with a 
mental disorder or providing an opinion on professional standards 
to be applied in cases of medical negligence. Such evidence will be 
deemed admissible. 

Even if the evidence meets this test, the court has discretion to 
render it inadmissible, for example if the court deems it prejudicial 
or misleading. 


Body of knowledge 


The second limb concerns itself with the reliability of expert testi- 
mony. Given that the court will have decided that the evidence is 


‘outside the experience and knowledge, it faces an inevitable di- 
lemma in reaching a judgement as to its reliability when, by def- 
inition, it lacks the scientific knowledge to do so. Recognizing this 
dilemma, courts have sought tests that can be applied to determine 
this issue. 

The most stringent is the test arrived at by the court in the United 
States in Daubert v Merrell Dow Pharmaceuticals, namely that ‘the 
technique, body of knowledge, or theory can be tested; has been sub- 
jected to peer review and publication; has a known rate of error; is 
subject to maintenance of standards and controls and is generally 
accepted by the scientific community. English courts have been re- 
luctant to adopt this test, as its stringent criteria could rule out con- 
sideration of expert evidence, based on novel or developing science 
for which there may be no definitive peer review and which may 
not be accepted fully by the wider scientific community. Psychiatric 
evidence derives from scientific evidence, but also from expert con- 
sensus and experience—based on knowledge, and thereby not likely 
to meet the Daubert test. The Law Commission [5] considers expert 
evidence to be a continuum between experience-based evidence and 
narrowly scientific evidence and recommended that the criteria ap- 
plied to expert evidence be flexible enough to be able to encompass 
this continuum. A phrase that recurs often in addressing reliability is 
that the evidence should be ‘soundly based; such that simply stating 
‘in my clinical experience...’ would not be sufficient to be classed as 
soundly based, and courts should be mindful of the trap of accepting 
dubious scientific evidence on the basis of prestige and professional 
standing of the witness. 

The Law Commission [5] considered the problem, concluded 
that the common law approach to the admissibility of expert wit- 
ness evidence is one of laissez-faire, and recommended instead that 
a test of reliability be placed on a statutory basis. The Commission 
[5] was particularly concerned that without such a statutory test, 
there would be a ‘danger that juries will abdicate their duty to ascer- 
tain and weigh the facts and simply accept the expert’s own opinion 
evidence, particularly if the evidence is complex and difficult for a 
non-specialist to understand and evaluate. The government was re- 
luctant to limit the discretion of the court in admitting evidence and 
instead issued a practice direction [6] to the courts, with detailed 
guidance on criteria to be applied to expert scientific evidence to 
decide upon its reliability. This practice direction is reproduced in 
Box 145.1. Rule 702 of Federal Rules of Evidence (United States) [7] 
likewise provides judges with guidance. 

The leading textbook on expert evidence in the UK [1] concludes 
‘English judges now have the benefit of the most comprehensive 
guidance in the common law world as to how the reliability of expert 
evidence should be assessed’ This guidance (Box 145.1) should be 
understood and applied by psychiatric experts to their reports and 
could form the structured criteria by which reports are subjected to 
peer review case conferencing. 


Acquired by study or experience 


In English law, there is no statutory definition of what constitutes an 
expert. In R v Bunnis, the court stated that an expert is ‘one who has 
by dint of training or practice acquired a good knowledge of the sci- 
ence concerning which his opinion is sought’ Rule 702 of the Federal 
Rules of Evidence (United States) [7] defines an expert as ‘a witness 
qualified as an expert by knowledge, skill, expertise or training’ 
As psychiatry is a recognized specialty within medicine, which, in 


CHAPTER 145 The expert witness in the criminal and civil courts 


Box 145.1 Criminal Practice Directions 2015 [6] 


19A Factors which the court may take into account when determining 

the reliability of expert opinion, and especially of expert scientific 

opinion, include: 

1. Extent and quality of the data on which the expert's opinion is based 
and the validity of the methods by which they were obtained. 

2. If the expert opinion relies on an inference from any findings, whether 
the opinion properly explains how safe or unsafe the inference is 
(whether by reference to statistical significance or other appropriate 
terms). 

3. Ifthe expert's opinion relies on the results of any method (for instance, 
a test, measurement, or survey), whether the opinion takes proper 
account of matters such as the degree of precision or margin of un- 
certainty affecting the accuracy or reliability of those results. 

4. The extent to which any material upon which the expert opinion is 
based has been reviewed by others with relevant expertise (for in- 
stance, in peer-reviewed publications) and the views of those others 
on that material. 

5. The extent to which the expert opinion is based on material falling 
outside the expert's own field of expertise. 

6. The completeness of the information which was available to the ex- 

pert and whether the expert took account of all relevant information 

in arriving at the opinion (including information as to the context of 
any facts to which the opinion relates). 

7. If there is a range of expert opinion on the matter in question, where 

in the range the expert's opinion lies and whether the expert's prefer- 

ence has been properly explained. 

8. Whether the expert's methods followed established practice in the 

field and, if they did not, whether the reason for the divergence has 

been properly explained. 



































In addition, the court should be astute to identify potential flaws in such 
opinion which detract from its reliability such as: 


1. Being based on a hypothesis which has not been subjected to suffi- 
cient scrutiny or which has failed to stand up to scrutiny; 

2. Being based on an unjustifiable assumption; 

3. Being based on flawed data; 

4. Relying on an examination, technique, method, or process which has 
not been properly carried out or applied, or was not appropriate for 
use in the particular case; or 

5. Relying on an inference or conclusion which has not been properly 
reached. 


























turn, is accepted as a scientific discipline, psychiatrists who have 
completed recognized training, are members of their professional 
body, adhere to standards of professional practice laid down by their 
regulatory body, and hold registration in the area of psychiatry they 
practise should have no difficulty in being accepted as an expert by 
the court. 

When giving evidence based on novel or developing science or 
practice or where the opinion is experience-based, the expert should 
be in a position to demonstrate practical knowledge or have con- 
ducted research in the relevant science or practice and be careful not 
to give evidence outside their professional knowledge or practice. 
The UK regulatory body the General Medical Council (GMC) pro- 
vides the following guidance: “You must only give expert testimony 
and opinions about issues that are within your professional compe- 
tence or about which you have relevant knowledge including, for ex- 
ample, knowledge of the standards and nature of practice at the time of 
the incident or events that are the subject of the proceedings. If a par- 
ticular question or issue falls outside your area of expertise, you should 


either refuse to answer or answer to the best of your ability but make it 
clear that you consider the matter to be outside your competence’ [8]. 


Impartial evidence 


Courts have, over time, defined what is expected of an expert wit- 
ness. The landmark case is that of the Ikarian Reefer. In Field v Leeds 
City Council, the judge stated that the expert witness ‘should provide 
independent assessment to the court by way of objective unbiased 
opinion. These and other judgements have led to the overriding 
principle governing expert evidence being that the expert's obliga- 
tion is to the court, and not to whoever has instructed them. All 
other standards flow from that overriding obligation. 

Case law judgements on expert evidence have been drawn to- 
gether into sets of Rules governing expert witnesses in court. In 
England and Wales, three sets of Rules have emerged: 


1. Civil Procedure Rules (CPR), Rule 35 [9]. 
2. Criminal Procedure Rules (CrPR), Part 19 [10]. 
3. Family Procedure Rules (FPR) [11]. 


In the United States, expert evidence is governed by Federal Rules 
of Evidence [7] and in Australia by the Australian Evidence Act [12]. 

Psychiatrists are, in addition, subject to standards set out by their 
professional and licensing bodies [8, 13, 14]. 

Box 145.2 draws together common standards from court rules 
and professional standards. 


Bias is a shadow to impartiality [15]. 


Cognitive bias 


Psychiatric experts come to the court with their own particular way 
of looking at the world underpinned by their values, cultural and 
ethnic background, professional experience, and theoretical frame- 
work and training. Experts should, through supervision and peer re- 
view, recognize their own preconceptions with resultant due weight 
given to contrary views. The psychiatric expert may have particular 
views within their field, for example particular views on the bio- 
logical vs familial/interpersonal understanding of attention-deficit/ 
hyperactivity disorder (ADHD). Particular training, for example 
in psychoanalysis, may shape the expert’s opinion. To counter this, 
regulatory frameworks emphasize the expert's obligation to give due 
weight to the range of opinion on particular issues (see listed item 7 in 
Box 145.1 and listed item 4 in Box 145.2). Rix [3] recommends, as a 
framework, addressing the range of opinion that might be expressed 
in a peer review case conference by respected consultant colleagues 
who may hold legitimate contrary views on psychiatric issues to the 
experts own. He adds that there may be times when the expert has to 
address unreasonable opinions by colleagues lacking respect. 


Adversarial bias 


Adversarial bias can arise through selection bias and unconscious or 
deliberate partisanship. Legal firms may choose particular experts 
because they are known to support their client’s particular viewpoint 
(Hired Guns). Experts are human and do not like to disappoint their 
instructing party. Deliberate partisanship can arise from the expert 
subtly or otherwise moulds their opinion to whoever hires them. 
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Box 145.2 Regulatory rules for expert witnesses: common 


across jurisdictions [7-14] 


Overarching duty: the paramount duty of the expert is to assist the court 
on matters within his or her own expertise, and this overrides any obli- 
gation to the person from whom the expert has received instructions or 
receives payment. 


Expert reports must contain: 

1. Details of academic and professional qualifications, together with 
experience and accreditation relevant to the opinions expressed in 
he report (usually as a summary in the introduction, with more de- 
ail within an appendix); 

2. Astatement of the range and extent of expertise, together with limi- 
ations upon that expertise, particularly declaring when a particular 
issue is outside his/her expertise; 








3. Astatement setting out the substance of all instructions received, to- 
gether with a listing of all materials provided and considered, upon 
which the opinion is based; 





4. Where there is a range of opinion on matters dealt with in the re- 
port, a summary range of opinion, together with reasons for the 
expert's preferred opinion; 





5. A declaration of any facts, materials, or investigations which might 
bear upon, or be made against, the expert opinion; 








6. Extracts of the literature or any other material upon which the scien- 
tific evidence is based; 





7. A statement of which facts are within the expert's own knowledge 
and which are assumed; 


8. Where an opinion is qualified, a statement to that effect; 


9. A statement that the expert will inform all parties, including the court, 
in the event that his or her opinion changes on any material issue; and 


10. A declaration of truth. 
* Courts distinguish true and assumed facts. The only facts the psychiatric ex- 


pert will routinely know to be true are the results of examination and results of 
tests or investigations. All other facts will be assumed to be true. 











Confirmation bias 


Confirmation bias can arise where the expert starts with a particular 
preconception on issues to be addressed to the report. Evidence is 
then sought and funnelled through that preconception (opinion- 
based evidence). 


The prosecutor's fallacy 


In the cases of R v Clark and R v Cannings, two conflicting proposi- 
tions to explain two unexplained infant deaths in the same family 
were proposed: (1) the two deaths occurred due to sudden infant 
death syndrome; and (2) the two deaths were due to double murder 
by the parent. The first was presented in court where the likelihood 
was judged to be one in 73 million, with the inference being that 
the second proposition was therefore more likely. The fact that 
two sudden infant deaths are unlikely by itself is misleading, as the 
other proposition may be even more unlikely, that is, two deaths by 
murder by the same parent. The presentation of only one of the com- 
peting hypotheses in these cases has been dubbed the prosecutor’s 
fallacy, as it usually favours the prosecution case. Rules on expert 
evidence therefore stipulate that the expert must consider and as- 
sign due weight to alternate views on any scientific issue before ex- 
pressing their own opinion (see listed item 7 in Box 145.1 and listed 
item 4 in Box 145.2). 





Strict adherence to Court Rules (Box 145.2) and ensuring reports 
meet tests of reliability (Box 145.1) mitigate against bias. 

Other forms of bias are recognized, which are more applicable to 
forensic science practice (for a full review, see [15]). 


Legal governance 


Should any party in court proceedings have reason for concern re- 
garding an expert witness's adherence to standards of practice, there 
are a range of remedies available (UK case law, but similar remedies 
apply in other jurisdictions): 


1. A judge in court can refer an expert to their regulatory body (the 
GMC for psychiatrists) to consider their fitness to practise. 

2. The court can make an expert witness subject to a wasted costs 
penalty if they have seriously delayed or obstructed the progress 
of the case through poor adherence to the standards expected of 
an expert witness. 

3. An expert witness can be held in contempt of court through 
wilful disregard or disrespect for the authority of the court. 

4. In Jones v Kaney, the Supreme Court overturned a 400-year-old 
convention that an expert in court should enjoy immunity from 
suit. This means that the expert psychiatric witness can be sued 
for damages, but that is limited to parties to whom the expert 
owes a duty of care, as when the expert is instructed by one party 
in a civil dispute. The implications of this judgement continue 
to be worked out, but it appears a professional witness to fact, 
court-appointed experts, or experts instructed by the presiding 
judge do not owe a duty of care to a particular party in an adver- 
sarial system and therefore retain immunity from suit. 

5. The expert witness can be charged with perjury where they are 
held to be wilfully telling an untruth or making a misrepresenta- 
tion in their evidence. 


The practical implication for the psychiatric expert witness is 
that they must have their own personal indemnity insurance for 
court work. 


Ethical practice (with special reference to risk 


assessment) 





Stone [16] used the term ‘dual role’ to describe the psychiatrist in the 
legal context. In Stone’s view, the roles of the clinician and expert are 
ethically irreconcilable. The dual role arises from the conflicting expect- 
ations and responsibilities between their duty to the evaluee to maxi- 
mize their welfare (medical ethics) and the duty to put their clinical 
knowledge and experience at the disposal of the court (justice ethics). 
Evidence on the risk associated with mental disorder is an exem- 
plar for the dual role dilemma. Judges when considering an inde- 
terminate (life) sentence or a longer-than-usual sentence, will, for 
offenders who may be mentally disordered, routinely require a psy- 
chiatric opinion on mental disorder, risk, and treatability. That assess- 
ment may result in welfare disposal in the form ofa hospital order for 
treatment, instead of a custodial sentence, or where a mental disorder 
(for example, personality disorder) is identified and linked to future 
risk, but no treatment disposal recommended, the judge may justi- 
fiably conclude that the protection of the public would be best pro- 
tected by imposing an indeterminate or longer-than-usual sentence. 
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One solution is to separate the clinical and legal roles [16], such 
that the expert does not have a clinical role to play in respect of 
the evaluee. Adshead [17] argues that this is not viable in the UK. 
For any one offender, the psychiatrist may have multiple roles. In a 
public health system, the psychiatrist will have obligations to the 
state to fairly distribute health resources and thereby act as gate- 
keeper to psychiatric health care. At trial, the psychiatrist may give 
evidence on legal matters (for example, diminished responsibility) 
and, at the point of sentencing, provide an opinion impacting upon 
disposal (welfare or custodial). The psychiatrists opinion may, at 
a later time, for the same offender, be central to legal decisions— 
the Mental Health Review Tribunal or parole hearing on release. 
Considering these overlapping and contradictory roles, Martinez 
and Candilis (United States perspective) [18] and Adshead (UK 
perspective) [17] reject the strict view of experts as being answer- 
able to either the law (justice ethics) or their profession (medical 
ethics). Professional responsibilities cannot be divided along any 
absolute or clear lines. 

The narrow domain of medical ethics does not remove from doc- 
tors the duty to consider the interests and the rights of other people 
and to consider the distribution of benefits and risks [19]. To work 
ethically within this framework presupposes a just state. Adshead 
[17] argues that the psychiatrist has duties as a citizen to partici- 
pate in processes that could protect others from risk arising from 
individuals with mental disorder. The psychiatrist must, if acting 
as an expert witness, continuously negotiate a struggle between the 
welfare of the individual defendant, the legitimate need to the court 
for expert testimony, and duties to the wider society to address the 
risk to others associated with mental disorder. Very high standards 
are required of the expert in presenting findings on risk and their 
reliability. While probabilistic risk assessments may be accurate 
at a group level, at an individual level, the confidence intervals for 
any risk instrument are so wide as to make individual predictions 
highly problematic [20]. This has led Mullen [21] to conclude that 
‘the margin of error in every actual or conceivable risk assessment 
instrument is so wide at the individual level that their use for sen- 
tencing or any formal detention, is unethical? While the court will 
recognize the limitations of risk assessment, nevertheless, risk as- 
sessment for the mentally disordered will fall outside the ‘common 
knowledge’ of the judge and jury. If the state has a legitimate claim on 
the psychiatrist’s expertise in passing sentence or when considering 
parole, the psychiatrist would appear to have little option but to re- 
port their assessment of risk despite its clear limitations. This places 
grave responsibility on the psychiatric expert to report the findings 
on risk in a way that informs the court on the limitations of the risk 
assessment, particularly the problem of extracting an individual 
risk assessment from probabilistic group risk. Furthermore, when 
obtaining informed consent for reports, the psychiatrist should pro- 
vide information to the evaluee on the use the court or tribunal may 
make of the psychiatrist's risk assessment. 

Common to approaches to developing an ethical framework for 
the psychiatric expert witness is the necessity to draw upon mul- 
tiple ethical perspectives and values when undertaking evaluations 
and recognizing that there may be conflict between those ethical 
perspectives. These approaches combine ethics derived from trad- 
itional medical ethics, but also incorporate ethics derived from the 
perspective of justice (for a full theoretical account of this approach, 


see [19]). Martinez and Candilis [18] proposed the following prin- 
ciples to guide practice: 


e Respect for persons: everyone subject to a psychiatric evaluation 
for the court should be regarded with respect and professional 
fairness. 

e Respect for privacy and confidentiality: while acknowledging that 
there are inherent limitations to this value in the legal context, 
nevertheless, care must be exercised in dealing with sensitive in- 
formation. Balance and perspective are critical to the presentation 
of clinical findings, avoiding the provision of gratuitous informa- 
tion and avoiding bias through selective use of information. 

e Respect for consent: psychiatric experts must ensure that those they 
evaluate understand the purpose of the evaluation, the limitations 
of confidentiality, and the potential use of the expert’s opinion by 
the court (dual-role obligations). 

e Commitment to honesty and striving for objectivity: this is reflected 
in the formulation of procedure rules across jurisdictions (Box 
145.2), which must be adhered to by the psychiatric expert wit- 
ness. The expert must resist the hired gun bias and exhibit thor- 
oughness and transparency in respect of their personal values. 


To this list, the author would add: 


e Respect for the human rights of others: balancing the distribution 
of benefits and risks for the person being evaluated and society, 
including fair distribution of health resources [17, 19]. 


Individual psychiatrists and their professional bodies have a duty 
to ensure that psychiatric experts are properly trained and under- 
stand their dual obligations as a doctor and as a witness in court. 
This can be achieved through professional development, appraisal, 
and revalidation. The psychiatric expert must be willing to submit 
their medicolegal work to peer scrutiny and, through peer review 
and training, gain knowledge of the values and prejudices they bring 
to evaluations. Professional obligations include a willingness to 
tackle poor practice by colleagues, including referral to professional 
regulatory bodies. 


The psychiatric report 


Rix [22] and Buchanan and Norko [23] provide detailed guidance 
on the preparation of reports for courts and tribunals. The main 
points are summarized in Box 145.3. The court or tribunal has little 
time to read lengthy reports and are primarily concerned with the 
expert's findings and opinion. However, if detailed facts or narrative 
are necessary for whatever purpose, then the details can be confined 
to appendices. The aspiring expert should seek out formal training 
in medicolegal report writing and be thoroughly familiar with the 
subtle, but important, differences between reports for criminal, civil, 
and family courts (see [3]). 

If there is no evidence of mental disorder, then the privileged ex- 
ception accorded to psychiatric experts may no longer apply (see 
Common knowledge, p. 1470). However, the court may use psychi- 
atric evidence where the boundary between disorder and normality 
is blurred. As in all aspects of interaction with the court, transparency 
is key to acting as a responsible expert. Psychiatric expertise derives 
from established scientific evidence, but also from expert consensus 
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Box 145.3 Report structure 


The report should be typed using numbered paragraphs and head- 
ings and subheadings, and should make use of short sentences and 
paragraphs. 

The psychiatric report should contain the following sections: 
(a) Instructions, documentation, and methodology 
The report should include details of instructions received, the method- 
ology, documents received, assumptions adopted, and interviews con- 
ducted. The overriding principle is transparency. Full disclosure of all 
documents and information upon which the opinion is based is essen- 
ial, along with disclosure of limitations and assumptions employed. Ina 
psychiatric report, assumptions might include that the subject has been 
ruthful and that medical records are accurate. 


(b) The facts 
The report should record briefly the facts upon which the opinion is 
based and should avoid interpretation which is the proper function of 
he opinion section. In psychiatric reports, the only facts that are within 
he psychiatrist's own knowledge are likely to be those based on findings 
of the mental state examination. All other facts are assumed. If structured 
ests are utilized, they may also constitute facts within the psychiatrist's 
own knowledge. Sufficient detail, recognizing that the court will have full 
nowledge of the case, for the court or another expert to understand the 
basis for opinion is what is required of the report. Further detail can be 
relegated to appendices. 























(c) Opinion 
The role of the psychiatric expert in the criminal court is to provide 
an opinion on mental disorder and its implications for the matters 
before the court. The features that lead to a diagnosis of mental dis- 
order should be described, avoiding jargon and including mental state 
findings, so that others can understand how the opinion is reached. 
The diagnosis or differential diagnosis should be clearly stated, using 
a recognized classification (ICD-10, DSM-5). Where a condition is de- 
scribed which is not part of such recognized classification systems or 
where ‘leading-edge’ scientific findings or opinion based on experience 
or special knowledge are used to support the opinion, that should be 
justified by disclosure (as an appendix to the report) of the relevant lit- 
erature to support the expert's opinion, together with, if applicable, the 
expert's own experience and knowledge of the field. In the civil court, 
opinion in clinical negligence should be anchored to established codes 
of practice, clinical practice guidelines, or service protocols and should 
be backed by providing relevant citations (which may form the basis for 
cross-examination). 

The second stage of the opinion is to apply the psychiatric findings 
o the issues to be addressed in the report. Each issue should have its 
own section. The content should show how the opinion derives from 
he facts. The opinion should highlight the strength of the opinion, giving 
due weight to matters that might undermine the opinion or could fairly 
be made against it. The opinion should highlight any unusual or contra- 
dictory findings, must be balanced, and must give due weight to the 
range of reasonable opinion on the issue to be addressed (the psychi- 
atric case conference model). 









































(d) Declarations and statement of truth 

Court rules require the report writer to provide a declaration that the 
expert understands and has complied with their duty to the court (Box 
145.2). Court Rules require a statement of truth to the effect that the con- 
ent of the report is true and that the report writer is aware of their liability 
o be prosecuted for untruth in court. 


(e) Qualifications and experience 

Usually as an appendix. 

(f) Appendices 

ncluded here are recorded lists of documents, publications cited, and 
definitions of any technical terms. 


Source: data from Rix K, The structure, organisation and content of the generic report, 
n: Rix K, Expert Psychiatric Evidence, pp 35-51, Copyright (2011), The Royal College 
of Psychiatrists. 














and experience-based knowledge. In broad terms, the court needs to 
be informed of the level of evidence underpinning opinion. 

Particularly in the civil court, the expert may be given reports of 
other experts in the case and due weight should be accorded to their 
opinion before commenting on their opinion and setting out the 
basis for agreement or disagreement (see listed item 7 in Box 145.1 
and listed item 4 in Box 145.2). 

There may be conflict in the factual evidence, and the report 
should avoid preference for one side or the other but instead give 
alternative opinion, based on differing judgements on the evidence. 

Terms such as ‘diminished responsibility, ‘insanity, or ‘automa- 
tism have precise legal definitions, and the report should address 
how the psychiatric findings translate to the legal definitions em- 
ployed by the court. 

Where recommendations for a mental health disposal under spe- 
cific Acts are included, the precise wording of the relevant section of 
those Acts should be employed. 


Opinion on the ultimate issue 





There is long-standing prohibition in common law jurisdictions 
against the expert providing an opinion on the ultimate issue before 
the court. The judge will decide if a hospital is negligent in a civil 
negligence claim. The jury will decide in a criminal court whether 
a defendant is guilty. In deciding upon the ultimate issue, the court 
may require knowledge outside their common knowledge, for ex- 
ample interpreting the McNaughton rules for insanity or applying 
the criteria for a finding of diminished responsibility. In such cases, 
the court has allowed opinion on the ultimate issue. The court in the 
United States generally allows opinion that embraces the ultimate 
issue, but this is limited by Rule 704 of the Federal Rules of Evidence 
[7], which states that the expert ‘must not state an opinion about 
whether the defendant did or did not have a mental state or condi- 
tion that constitutes an element of the crime charged or of a defence. 
Those matters are for the trier of fact alone. The psychiatric expert 
should be careful when expressing an opinion on the ultimate issue 
that their opinion is necessary to assist the court on matters outside 
the common knowledge of the court. 


Confidentiality 


Medicolegal work undertaken by psychiatrists is governed by the 
same rule of confidentiality as applied to other clinical work. Reports 
cannot be disclosed to a third party without the consent of the body 
commissioning the report. Psychiatric reports do not form part of a 
person’s medical record, except by the express consent of the indi- 
vidual or their legal representative. Defence solicitors can exercise 
a right not to disclose a report to court. Failure to comply with the 
rules of confidentiality can lead to civil action or reporting to a pro- 
fessional regulatory body. Breaching confidentiality may be justified 
in particular cases. 

While the psychiatric expert owes an overriding obligation to the 
court, nevertheless, the expert remains a doctor with consequent 
medical responsibilities. If the report writer concludes that the 
evaluee requires treatment, then the expert must consider whether 
their medical practitioner should be alerted to that treatment need. 
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This is usually done with the agreement of the evaluee or their legal 
representative. Where, for whatever reason, that recommendation 
is not passed on to the evaluee’s medical practitioner, consideration 
must be given to disclosure in the person’s best interests. 

A psychiatrist who believes that the evaluee is not co-operating 
with the preparation of a report because of mental illness has a duty 
to consider whether the evaluee’s mental illness could interfere with 
a fair trial (for example, fitness to plead or lack of consideration of 
a mental health disposal). The psychiatrist must then make a judge- 
ment whether it is in the best interests of the evaluee for sufficient 
information to be provided to the court to alert them of the doctor's 
concerns. Such disclosure will almost certainly be justified in the 
interests of a fair trial and justice. 

The other situation where a breach of confidentiality may be justi- 
fied is where a report is not disclosed, but the report writer believes 
that the court ought to consider the report findings on potential risk 
to the public or where the court is unaware of information that might 
have a crucial bearing on outcome. In BW v Edgell, the court held that 
the doctor’s duty of confidence did not prevent a psychiatrist from 
taking steps to communicate the grounds of concern to the court. The 
strong public interest in disclosure to prevent a court from making 
decisions based upon inadequate information was held to override 
the psychiatrists duty of confidentiality. Where a doctor is consid- 
ering disclosure in these circumstances, advice should be sought 
from an experienced colleague and the case law consulted, and the 
psychiatrist should thoroughly understand their regulatory bodies’ 
guidance on disclosure, as well as consult their indemnity insurer. 

In all cases, disclosure should be limited to providing sufficient 
information to the receiving body (court or medical practitioner) to 
make a decision. In Cornelius v DeTaranto, a psychiatrist who wished 
to arrange treatment for a claimant in a civil case disclosed the full 
report to the claimant’s medical practitioner without the claimant's 
consent and was held to be negligent in doing so. Limited disclosure 
would have been sufficient to alert the medical practitioner to the 
treatment need, rather than full disclosure of the report. 

It is the responsibility of the expert to ensure the security of 
clinical records and court documents, and the expert must ad- 
here to relevant data protection legislation. The three Rs should 
be followed: 


e Retain: all documentation until the statutory time limits for re- 
tention of documents has passed and to at least the time of any 
potential appeal, civil suit, or referral to regulatory bodies. 

e Record: keep structured, clear, legible, and comprehensive records 
in a format suitable for other instructed experts to consult. 

e Reveal: all sources of information in the report. 


At the beginning of the interview, the examining psychiatrist should 
explain carefully to the defendant the nature of the doctor's dual 
role, the limits of confidentiality in producing a medicolegal report, 
and that the court will have full disclosure of all material known to 
the report writer (no off-record material). It is prudent to obtain a 
signed record of this discussion and to include it as part of the intro- 
duction to the report. Whenever possible, an informant should be 
interviewed—by telephone, if necessary. 


Appearing in court 


Experts are allowed, unlike witnesses of fact, to sit in court and 
hear the evidence of other witnesses before they, themselves, give 
evidence. An expert can be called by any interested party in pro- 
ceedings. The cardinal rule when giving oral evidence in court is 
that, although called by one party, the expert witness is not giving 
evidence for that party’s side but is under duty to provide fair and 
impartial evidence to the court, even where this conflicts with the 
interests of the party calling them [1, 3]. 

In the criminal court, the report will have been pre-read by the 
judge, and it is usual for the examiner to refer to relevant sections 
of the report. A report with numbered paragraphs is easier for the 
court to follow. The jury will not have read the report and will not 
usually be given sight of the report. Their knowledge of the expert's 
report will come from submissions made by either side and through 
the judge’s questioning and summing up. 

When calling an expert witness, the advocate must elicit the 
following: 


1. The expert's qualifications: the report should have been pre- 
pared according to court procedure rules and therefore contain 
a biography setting out the qualifications and experience of the 
witness. It will then be usual for the advocate to lead this part 
of the evidence by reference to the biography supplied in the re- 
port. It will be perfectly permissible for the other side to call into 
question the expert's qualifications. This should be met politely 
by outlining the reasons why the expert believes they have the 
requisite qualifications and experience to answer the questions 
posed in instructions and have not strayed beyond their area of 
expertise. 

2. Advocates are under duty to challenge disputed evidence. Thus, 
where more than one expert opinion is provided and they differ, 
the expert must expect their opinion to be disputed. The ex- 
pert must resist pressure from one party to deviate from, or ex- 
press greater certainty about, an opinion they have reached in 
the written report. The use of joint expert statements helps this 
process. 

3. An expert witness may be cross-examined as a hostile witness 
if there is good reason to suppose that they are not telling the 
truth. Thankfully, this is extremely rare. The possibility of delib- 
erate or inadvertent bias must, however, always be considered. 


(See [1, 3, 24, 25]). This chapter has concentrated upon practice in 
the criminal court, and the discussion applies equally to the civil 
court. However, there are important differences. The civil court is 
far more complex than its criminal counterpart. Each jurisdiction 
will have its own CPR. Psychiatric experts familiar with work in the 
criminal court will, when undertaking occasional expert reports in 
the civil court, be relative novices and need to thoroughly familiarize 
themselves with court processes and the stringent requirements of 
civil proceedings. If coming to the civil court for the first time, it 
may be prudent to prepare reports under supervision from an ex- 
perienced colleague. Adversarial bias is a particular hazard in civil 
proceedings. 
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Important differences in England and Wales include: 


The burden of proof is at the level of balance of probabilities, ra- 
ther than the more stringent test—beyond reasonable doubt—in 
the criminal court. 


Civil actions have multiple stages governed by different rules 
(preliminary reports, pre-action stage, claim). Preliminary re- 
ports are for the sole use of the legal team, and not subject to CPR, 
while reports at the claim stage are subject to the specific and 
stringent CPR. 

e The CPR encourage the appointment of single experts. 


Where more than one expert is involved, exchange of reports is 
mandatory, and case conferencing between experts with joint re- 
ports on areas of agreement/disagreements routine. While not 
covered specifically by CrPR, this is becoming common practice 
in the criminal court. 


The psychiatric expert will need to take particular care to under- 
stand the questions asked of them and understand the case law 
applicable to the questions asked. The reliability of the experts 
evidence, including evidence on standards of care in medical neg- 
ligence, will be central to the courts assessment of the expert's evi- 
dence. In medical negligence cases, the opinion should be anchored 
to established codes of practice, clinical practice guidelines, and 
practice protocols applicable at the time of the medical event under 
scrutiny. Though the civil court does not have formal guidance on 
reliability in England and Wales, the guidance for the criminal court 
is equally applicable to the civil court (Box 145.1). 


It is beyond the scope of this chapter to examine the expert’s role 
in civil jurisdictions, which includes most of Europe, but some dif- 
ferences and common concerns can be recognized. Inquisitorial 
systems typically employ two stages: an investigative stage and a 
trial stage. The judge is an active protagonist in adducing the truth. 
The divide between both systems is not clear-cut; for example, in 
Scotland, the Procurator Fiscal directs an investigative stage in the 
most serious crimes, followed by a trial employing the adversarial 
approach. 

In contrast to common law jurisdictions, the courtroom is not the 
setting for examining the scientific reliability of expert evidence and 
the expert’s evidence is not subject to cross-examination at the trial 
stage. Issues of admissibility or reliability are not subject to formal 
criteria, but there is equal concern about these matters within the 
European Union [26]. The examining magistrate has wide discretion 
in what evidence is admitted at the trial stage. 
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Homicide is the killing of one person by another. This simple def- 
inition unites a wide range of motives, methods, and characteristics 
of both the offenders and the victims. Murder has been defined as 
the unlawful and intentional killing of one person by another, and 
psychological aspects of the motive for killing are central to the legal 
verdict of murder. All societies with recorded history have forbidden 
homicide, and an enduring fascination with homicide is reflected in 
our myths, plays, and fiction and the continued public appetite for 
stories about homicide. 

Most homicides are committed by people with no identifiable 
mental disorder, and few people with a mental disorder will ever 
commit a homicide. However, homicide is an important topic 
in psychiatry, because people with mental disorders, especially 
schizophrenia, commit a disproportionate number of homicides. 
Psychiatrists have a role in assisting the court in determining the 
degree of criminal responsibility of mentally disordered homicide 
offenders and in providing long-term care and advice regarding the 
release of offenders to the community. Moreover, several notorious 
homicides involving people with mental illness have influenced the 
development of mental health law, policy, and services. 
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Homicide is a type of unnatural death, along with fatal accidents 
and suicides [1]. Deaths from military conflicts are not usually con- 
sidered to be homicides, although those resulting from civil unrest, 
mass killings of civilians by governments, and deaths from terrorism 
are included in the definition. This chapter is mainly concerned with 
intentional homicides, defined as ‘unlawful death purposefully in- 
flicted on a person by another person’ [1]. Non-intentional homicide 
(such as in motor vehicle accidents) and lawful killings (in self- 
defence or during law enforcement) are not considered. Intentional 
homicides can be divided into primary, or interpersonal homicide, 
when the death or injury of a particular victim was the aim of the 
offender, and secondary homicide which is the result of other crim- 
inal activities [1]. 

Without wishing to understate the importance of deaths due 
to military conflict, civil unrest, terrorism, negligence, and lawful 








killings, or any overlap between primary and secondary homicide, 
the focus of this chapter is on primary or interpersonal homicide, 
particularly homicide committed by people with a mental disorder. 
An equivalent to the commandment “Thou shalt not kill can be 
found in the lore of all ancient societies [2]. Most legal classifications 
of homicide have developed around the offender’s motive and the 
degree of mental responsibility for their behaviour. For example, in 
Anglo-Saxon law, at the time of the Norman Conquest, ‘slaying by 
stealth’ was distinguished from ‘slaying openly’; the former was pun- 
ishable by death, and the latter could be remedied with compensa- 
tion to the victim’s family [3]. More modern definitions of criminal 
homicide emerged in twelfth-century England when ‘slaying with 
malice aforethought’ was distinguished from justified homicides, 
such as the killing of a thief caught in the act, and excusable homi- 
cides such as accidental homicide and homicide in self-defence [3, 4]. 
The perceived intent and the capacity to form the requisite intent to 
kill have become central to modern legal determinations. A clear- 
minded intention to harm or kill is what distinguishes murder from 
deaths resulting from dangerous acts committed without the specific 
intention of causing death. In most jurisdictions, there are provi- 
sions that recognize the effect of mental disorder on the ability to 
form the rational intention to kill, together with mechanisms for 
diversion for treatment and supervision of mentally ill offenders. 


In 2012, almost half a million people died by homicide [1]. Most 
homicides are single homicides committed by one person, and 
therefore the rates of death by homicide and homicide offending are 
generally similar. The global homicide rate is about six per 100,000 
people, but there is marked variation in homicide rates between re- 
gions, countries, and communities (T; 146.1). In the Americas, 
the rate of death by homicide was over 16 per 100,000 in 2012; in 
Africa, it was over 12 per 100,000, while rates of homicide in Europe, 
Oceania, and Asia were about three per 100,000. The homicide rate 
in Russia in 2012 was over nine per 100,000, whereas in England and 
Wales, as in most other countries in Western Europe, it was about 
one per 100,000. In Canada, the rate of homicide was 1.6 per 100,000; 
in the United States, it was 4.7 per 100,000, and in Honduras, it was 
over 90 per 100,000 per annum. 














Table 146.1 Homicide rate and proportion of male homicide 
victims in the ten most populous countries, England and Wales, 
Canada, Australia, and New Zealand 


Country Homicides per 100,000 | Proportion of male 
per annum victims 


China 1.0 78.1 
India 35 592 
United States 4.7 77.8 
Indonesia 0.6 80.3 
Brazil Dow 89.8 
Pakistan ey, 76.7 
Bangladesh 2m 63.3 
Russian Federation IA WS) 
Japan 0.3 47.1 
England and Wales EO 70.3 
Canada 1.6 69.8 
Australia 11 67.3 
New Zealand 0.9 48.8 


Source: data from UNODC, Global Study on Homicide 2013, Copyright (2012), United 
Nations Office on Drugs and Crime. 


Being male is one of the most stable and well-established risk fac- 
tors for homicide, and about 95% of homicide offenders are male, a 
proportion that does not vary much between countries or according 
to the rate of homicide. About 80% of homicide victims are also male, 
although the ratio of male-to-female victims varies more than that of 
offenders, and in a small number of countries, including Germany, 
Japan, several Nordic states, and New Zealand, the number of female 
victims exceeds the number of male victims [1]. 

Homicide offending by children is rare, and the rate of offending 
is highest between the ages of 15 and 29 and declines steadily with 
age. The age distribution of homicide victims varies between re- 
gions. In the Americas, 15- to 29-year-old men and women are most 
at risk, whereas in Europe, males aged 30-59 are more likely to be 
the victims of homicide than younger men. 

Rates of homicide have declined in most economically devel- 
oped countries in the last 20 years, for reasons that may include 
ageing of the population, better policing, changes in patterns of sub- 
stance use, and a decline in the number of domestic homicides [5]. 
Improvement in emergency treatment of trauma victims might also 
have influenced the measured rates of homicide, especially in the 
United States [6]. Changes in homicide rates in low- and middle- 
income countries have been less uniform but appear to be related to 
economic development. For example, the greatest decline in the rate 
of homicide in South America has been in Chile, a country that has 
enjoyed consistent economic progress [7]. 

There is no complete explanation for the wide variation in homi- 
cide rates. Differing patterns of weapon availability and substance 
use are two of the most frequently cited reasons. Worldwide, more 
than 40% of homicides are committed using a firearm; higher rates 
of firearm ownership are associated with higher rates of homicide 
[1, 8], and being a gun owner is associated with an increased prob- 
ability of committing homicide, compared to non-gun owners from 
the same community [9, 10]. There is clear evidence that measures to 
regulate gun ownership can have an impact on homicide rates. For 
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example, in the wake ofa mass killing in Australia in 1996, stricter li- 
censing and storage requirements and compulsory government buy- 
back of automatic and semi-automatic weapons was followed by an 
accelerated decline in firearm homicides and a marked reduction in 
the number of mass killings [11]. 

High homicide rates can be found in some areas with low levels of 
gun ownership, mainly in places with high-per-capita alcohol con- 
sumption. There is a well-established association between alcohol 
intoxication and homicide at the level of the offender [12], and there 
is an association between changes in national alcohol consumption 
and fluctuations in homicide rates [13]. There is also an association 
between more broadly defined rates of substance use and homicide 
[14], an association that might be explained by the criminal activity 
associated with illegal drug use and possibly even the mental effects 
of abuse of illegal drugs, especially stimulants. In regions with very 
high homicide rates, such as South Africa and some Central and 
South American countries, there is often a combination of ready 
availability of weapons, hazardous patterns of substance use, drug 
crime, economic deprivation, and weak law enforcement. 


Homicide and mental disorder 





The two main methods of examining the association between crime 
and mental disorder are by psychiatric evaluation of offenders and 
by linking criminal and health registers. Both methods have ad- 
vantages and disadvantages. The examination of offenders is liable 
to sampling bias, as not all offenders are the subject of a detailed 
examination and because those evaluations are retrospective. To 
date, only a few countries have had the benefit of centralized health 
information in order to perform large-scale data linkage studies. 
This method relies on the reliability of primary health diagnoses 
and overlooks the presence of mental illness in offenders in the first 
episode of an illness and those who avoided contact with health 
services. However, both types of study provide strong evidence for 
an association between schizophrenia and homicide [15]. The asso- 
ciation between homicide and mental disorders other than schizo- 
phrenia is less clear and might well be explained by other factors 
such as comorbid substance use [16, 17]. 


Homicide and schizophrenia 


People with schizophrenia are overrepresented among samples of 
homicide offenders. Meta-analyses suggest that those with schizo- 
phrenia are about 20 times more likely to commit a homicide than 
other members of the community [18] and make up about 6.5% 
of all homicide offenders [15]. The population rate of homicide by 
people with an established diagnosis of schizophrenia appears to be 
about ten per 100,000 annum [19], although there is some variation, 
and the rate of homicide by people with schizophrenia is higher 
in regions with high total rates of homicides [15]. A meta-analysis 
found that about 40% of those homicides occur in the first episode 
of psychosis and the absolute risk of homicide by people during the 
period of their first episode of psychosis, regardless of how long the 
illness had been untreated, is more than ten times the annual risk of 
homicide after treatment [19]. 

Homicides by people with schizophrenia are often committed 
in response to frightening delusional beliefs [20]. The importance 
of delusional beliefs and positive symptoms of schizophrenia is 
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supported by the association between delusions and non-lethal vio- 
lence [21], the apparent fall in the homicide rates after initial treat- 
ment [19], and the decline in homicide by people with schizophrenia 
following the development of community mental health services in 
England and Wales in the 1970s [22]. However, other factors con- 
tribute to the association between schizophrenia and homicide, es- 
pecially after initial treatment of the illness, including the high rates 
of substance use among people with schizophrenia, the emergence 
of schizophrenia among young males, a group already at greater risk 
of violence, and the association between social disadvantage and 
both violence and schizophrenia. The importance of factors other 
than delusional beliefs in homicide is evident in the finding that rates 
of homicide by people with schizophrenia are strongly correlated 
with rates of other homicides, rather than with the epidemiology of 
schizophrenia. One interpretation of this finding is that people with 
schizophrenia are vulnerable to the social factors that determine the 
overall rate of homicide [15]. 


Homicide and other mental disorders 


The proportion of homicide offenders with mental illness varies 
widely between studies. A nation-wide study from New Zealand 
found that fewer than 10% of all homicide perpetrators were deter- 
mined to be ‘mentally abnormal by the courts [23]. In contrast, a na- 
tional study of Swedish homicide offenders found that only 10% had 
no mental health diagnosis [14]. The association between homicide 
and mental disorders other than schizophrenia has also been studied 
by the psychiatric evaluation of offenders and by linking health and 
criminological data. Any study of the homicide rates among people 
with mental disorders is likely to have been affected by the proportion 
of homicide offences that result in the identification of an offender, 
whether the study was conducted by clinical interview, co-registration 
of criminological and health data, or research interviews. Studies that 
include substance use disorders and personality disorders in the def- 
inition of mental disorder are also likely to report a higher propor- 
tion of mentally disordered offenders than studies that exclude those 
diagnoses. It has been suggested that places with a higher homicide 
rate have a smaller proportion of mentally ill offenders because of a 
dilution effect [24], although this is disputed [25]. 

There is evidence for an association between substance use dis- 
orders and homicide. A case linkage study from Sweden found that 
20% of homicide offenders had been identified to have a substance 
use disorder prior to the homicide [14], whereas a study conducted 
in Australia using similar methods found that 5% of homicide of- 
fenders had a previous substance use disorder [26]. Studies based on 
the psychiatric evaluations of homicide offenders after the offence 
also vary in the proportion of offenders with a substance use dis- 
order, with studies from the United States [27], Finland [28], Austria 
[16], and New Zealand [23] reporting rates of substance use disorder 
of 47%, 12%, 1.4%, and 0.7%, respectively. 

There is marked variation in the proportion of homicide of- 
fenders considered to have a mood disorder. Case linkage studies 
from Australia [26] and Sweden [14] both reported that about 
2% of homicide offenders had been treated for depression prior 
to the offence, whereas studies relying on the clinical examin- 
ation of offenders have reported varying proportions of offenders 
with mood disorders. Studies from Singapore [29] and the 
United States [27] both reported that 9% of homicide offenders 
had a diagnosis of depression. Studies from other regions have 


reported lower rates of depression, including 5% in Finland [28], 
4% in England and Wales [30], 1.4% in Austria [16], and 1% in 
New Zealand [23]. 

There is an obvious association between antisocial personality 
disorder and violent crime, in part because a pattern of impulsive 
behaviour and criminal conduct is part of the definition of the con- 
dition, but few population-based studies of the psychiatric diagnoses 
of homicide offenders have specifically reported the proportion of 
offenders with the diagnosis of personality disorder. A Swedish case 
linkage study found that almost 30% of homicide offenders had been 
diagnosed with personality disorder prior to the homicide [14], and 
a recent study from the United States found that 23% of homicide 
offenders met the criteria for at least one personality disorder [27]. 
The reported rates of organic mental disorders, epilepsy, or brain 
damage among homicide offenders also vary. A Swedish case linkage 
study reported no cases of organic mental disorder [14], whereas 
a case linkage study from Australia reported that 7% of homicide 
offenders had a pre-existing organic mental disorder, most com- 
monly a history of traumatic brain injury [26]. The reported rates of 
intellectual disability among homicide offenders range from 2% in 
Finland [28] and the United States [27] to under 1% in New Zealand 
[23] and Sweden [14]. 


Types of primary homicide 


Domestic homicide 


Domestic homicide includes the killing of an intimate partner, 
the killing of one’s own children (filicide), the killing of parents 
(matricide and patricide), and the killing of siblings (fratricide). 
In England and Wales, about 20% of all homicides are classified as 
domestic homicides, 80% of which involve the killing of current or 
former partners [31]. Like other homicides, the offenders are mostly 
male and have a mean age in the late thirties. Unlike other forms 
of homicide, the victims are overwhelmingly female. In the United 
States, the pattern of male perpetrators and female victims in do- 
mestic homicide is similar, although a higher proportion of deaths 
involve a firearm [32]. In regions where firearm ownership is less 
common and total homicide rates are low, kitchen knives are the 
most common method [33]. Domestic homicides are often a form 
of fatal domestic violence, in which there has been an escalating pat- 
tern of violence in the relationship before the killing. Less common 
forms of domestic homicide include homicide by a mentally ill 
family member and homicide-suicide. 


Homicide of strangers 


Stranger homicide can be defined as the killing of a person who is 
not known to the perpetrator until shortly before the offence. In the 
United Kingdom, as many as one in five homicides are stranger homi- 
cides, usually involving young men who become involved in a fight in 
a public place while affected by alcohol [34]. Stranger homicides by 
people with mental illness are very rare. A meta-analysis of data from 
England, Germany, Denmark, Finland, and Australia estimated the 
population rate of stranger homicide by people with schizophrenia 
to be about one in 14 million people per annum [35]. Stranger homi- 
cide offenders with psychosis are often homeless, have a history of 
violence, and are acutely psychotic at the time of the offence, often 
without any previous psychiatric treatment [35]. 


Neonaticide, infant homicide, and infanticide 


About half of child homicide victims are under one year of age [36]. 
Neonaticide is the killing of a child in the first 24 hours of life and 
is typically committed by disadvantaged and psychologically dis- 
tressed single women who concealed their pregnancy and kill their 
children by abandonment or neglect [37]. Infant homicide is the 
killing of a child below the age of one and includes neonatacide. The 
most common cause of infant homicide is fatal child abuse, usually 
by male partners [36, 38]. Rates of infant homicide are correlated 
with rates of both homicide and other violent offending [39, 40]. 

In some jurisdictions, infanticide is a separate category of homi- 
cide offence and refers to the killing of a child less than one year 
of age by the mother during a post-partum mental illness. Mothers 
with mental illness commit a small proportion of infant homicides 
[36, 38]. In jurisdictions without specific infanticide laws, the role of 
maternal mental illness on criminal responsibility is determined in 
the same way as that of other forms of mental illness. Societal atti- 
tudes to the killing of infants influence the types of verdicts and the 
penalties imposed [41]. 


Child homicide 


Over 5% of homicide victims are under 15 years of age. About half 
are killed as a result of fatal physical abuse, and about 20% are due 
to parental mental illness, usually involving the child’s mother 
[38]. Other causes are fights between teenagers, and children being 
caught up in criminal homicides or homicide-suicide by parents 
[42]. Despite the publicity that follows the offences, sexually motiv- 
ated homicides of children are rare. 


Homicides by older people 


People over 60 commit a small proportion of all homicides, al- 
though this proportion is likely to increase with ageing of popula- 
tions around the world. Homicide by people in advanced old age 
is extremely rare [43-45]. Studies of older offenders emphasize the 
role of cognitive impairment and morbid jealousy associated with 
long-term alcohol use [46]. 

Many older homicide victims are intimate partners of a similar 
age to the perpetrators. Homicide of the elderly also includes homi- 
cides committed in the course of a robbery, intentional neglect, and 
physical abuse of a frail elderly people, and altruistic homicide of 
chronically ill and terminally ill partners, for example those with 
dementia [47]. 


Homicide-suicide 


Homicide that is closely followed by the suicide of the perpetrator 
is known as homicide-suicide or murder-suicide. Worldwide, al- 
most one in ten homicides are followed by the suicide of the offender 
[48]. Most homicide-suicides are by men who kill their partners, 
and sometimes their children, and familial ties between the of- 
fender and the victim make suicide after homicide more likely [49]. 
Establishing the motives for homicide-suicide is difficult because of 
the death of both parties. A quantitative review of 49 international 
studies of homicide-suicide found that rates of homicide-suicide 
are strongly associated with rates of homicide, which is under- 
standable, given that the initial event is a homicide [48]. There is 
less evidence for an association between rates of suicide and those of 
homicide-suicide, suggesting that, in most cases, homicide-suicide 
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should be considered to be a form of homicide, rather than ‘extended 
suicide. However, altruistic homicide of a sick or infirm partner, fol- 
lowed by the suicide of older offenders is a well-established pattern 
of homicide-suicide. 


Medical killings and euthanasia 


A full discussion of euthanasia is beyond the scope of this chapter. 
However, two points are worth noting. The first is that deaths clas- 
sified as euthanasia occur on a spectrum from legal and ethical de- 
cisions not to resuscitate or continue medical care of a terminally 
ill person to the use of terminal sedation, assisted suicide of people 
with varying degrees of competence, to premeditated killings by 
health practitioners. An increasing number of jurisdictions permit 
physician-assisted suicide for terminally ill people, and the respon- 
sibility for determining capacity to give consent often falls to psy- 
chiatrists. At the other end of the spectrum, physicians played a role 
in the state-sponsored killing of disabled patients in Nazi Germany, 
and England’s most prolific serial killer was a trusted local doctor 
Harold Shipman [50]. 


Homicide in institutions 


Suicides and deaths from natural causes in institutional care far 
outnumber homicides, and state-sanctioned execution is probably 
more common than homicide as a cause of death among prisoners 
worldwide. Homicides do occur among inmates of prisons and in- 
clude cases of mentally ill inmates killing their cellmates. Homicides 
in secure hospitals and other psychiatric hospitals are rare events. 
A study of homicides in Australian and New Zealand psychiatric 
hospitals found that all of the 11 homicides were by people with 
schizophrenia, and ten of the victims were fellow patients. The rate 
of homicides per bed day was estimated to be similar to that of the 
wider community, although the victims would not have been placed 
in danger had they not been detained in hospital [51]. 


Mass killings, spree killings, and serial killings 


Mass killing is variously defined as the killing of more than three 
or, in some studies, four people at one time. Mass killing is distin- 
guished from spree killings, which are random killings of more than 
one person in a short time, sometimes in company, and from serial 
killings, in which a series of homicides are committed, usually by 
one person, over a longer period. 

Mass killings have been described as a relatively modern phe- 
nomena, although there are historical accounts of random mass kill- 
ings [52]. Mass killings include multiple homicides of a family by 
a family member, often followed by the suicide of the perpetrator, 
and mass killings of strangers or colleagues and acquaintances by 
disturbed men, often with an automatic or semi-automatic weapon. 
A proportion of mass killings are committed by people with mental 
illness, often with emerging or untreated psychosis [53]. 


Homicidal ideation 


The incidence of thoughts or plans to commit homicide among 
members of the general community has never been reported. 
Further, the extent of any association between homicidal ideas and 
actual homicide is not known. Two studies of psychiatric patients 
have reported the rates of homicidal ideation. One study found 22% 








SECTION 24 Forensic psychiatry 


of 517 unselected psychiatric outpatients reported previous homi- 
cidal ideation, that 7% had persisting homicidal thoughts or a homi- 
cide plan, and about 4% had made an attempt to kill another person 
[54]. Gender, race, marital status, and the diagnosis of schizophrenia 
were not associated with homicidal ideation, but those with schizo- 
phrenia and homicidal ideation were more likely to have been vio- 
lent. Another study of 223 adult patients with chronic schizophrenia 
found that 15% had ‘severe or extreme homicidality’ and that homi- 
cidal ideation was associated with the presence of mania, psychotic 
symptoms, and impaired social function [55]. Another study fol- 
lowed up 613 people who had been convicted of threatening to kill 
another person found that 44% were convicted of a violent offence 
in the next 10 years, including 3% who were charged with homicide 
[56]. Although this study suggested a possible link between homi- 
cidal ideation and homicide, it should not be forgotten that homi- 
cidal ideas must be several orders of magnitude more common than 
homicide in most patient groups. 

The relationship between homicidal ideas and homicide was the 
subject of one of the best known legal cases in psychiatric history. In 
1967, Prosenjit Poddar met Tatiana Tarasoff while they were both 
university students in California. Poddar was rebuffed by Tarasoff 
who subsequently sought psychiatric treatment. In the course of 
counselling, Poddar disclosed homicidal ideas towards Tarasoff, 
and some time later stabbed her to death. The court later held that 
treating doctors owed duties not only to the patient, but also to warn 
and protect members of the public who might be endangered by 
the actions of their patients [57]. Although the duty to warn after 
Tarasoff only applies in some states in America, it has influenced the 
laws and claims for negligence internationally. 


Attempted homicide 


The two elements of murder are death and the intent of the offender 
to kill. However, even when there is a clear intention to kill, injuries 
inflicted are not always fatal, and the victim fighting back, the inter- 
vention of bystanders or medical care, might prevent death. Hafner 
and Boker’s landmark study of homicide and mental disorder in 
West Germany between 1955 and 1965 included cases of both homi- 
cide and attempted homicide [58]. Their study found similar num- 
bers of attempted and actual homicides, and few differences between 
the offenders in each group. A recent Australian study suggested 
there were few differences between mentally ill homicide offenders 
and mentally ill people who committed serious non-lethal violence 
[59]. Assaults by mentally ill people that result in a serious injury are 
rare, compared to other less serious types of assault. For example, 
in a recent meta-analysis, pooled estimates of the proportion of pa- 
tients with first-episode psychosis committing any violence, serious 
violence, and violence resulting in serious injury were 34.5%, 16.6%, 
and 0.6%, respectively [60]. 





Legal proceedings and inquiries after a homicide 


Several notorious homicides by people with mental illness have re- 
sulted in changes in the way in which people with mental illness are 
regarded and treated. An early example was a trial following the at- 
tempted shooting of George III by James Hadfield in 1800, which 


led to the emergence of both the verdict of ‘not guilty by reason of 
insanity’ and of legislation to regulate the treatment and control of 
mentally ill offenders [61]. 

Homicides are usually followed by police investigations to deter- 
mine the cause of death and who might be responsible. The ‘clear- 
ance rate’ is the proportion of homicides in which either the killer 
is identified or a conviction is obtained. About 60% of homicides 
worldwide result in an arrest, a proportion that varies markedly be- 
tween jurisdictions, mainly according to the number of homicides 
to be investigated [1]. For example, in countries where the homicide 
rate is less than one per 100,000, the clearance rates are mostly above 
90%. Clearance rates fall to below 80% in countries with homicide 
rates of between one and ten per 100,000 and below 50% when the 
homicide rate is above ten per 100,000 [1]. Clearance rates can affect 
the results of studies of the epidemiology of homicide by the men- 
tally ill, because it may be that homicides by the mentally ill are more 
likely to be detected. 

Criminal proceedings after homicide offences initially involve 
establishing that the accused person committed the homicide. 
However, if the facts of the offence are clear, a conviction for the 
most serious form of intentional homicide can depend on the mo- 
tive and state of mind of the accused person. When the accused has 
a mental illness, the question before the court is the extent to which 
the mental illness reduced the defendants degree of responsibility. 
Defences on the basis of mental infirmity have been described since 
Roman times [61]. In common law tradition, the defence of mental 
illness depends either on the person being aware that the killing was 
legally wrong or, in some places, morally wrong, from the perspec- 
tive of the mentally ill offender. The most influential precedent in as- 
sessing criminal responsibility in the presence of mental illness were 
made by the House of Lords after the acquittal of Daniel M’Naghten 
for the 1843 killing of Edward Drummond, who M’Naghten had 
mistaken for the then British Prime Minister Robert Peel. In the 
M’Naghten Rules, as they came to be known, a verdict of not guilty 
by reason of insanity required that it be ‘clearly proved that at the 
time of committing the act the party accused was labouring under 
such a defect of reason, from disease of the mind, as not to know the 
nature and quality of the act he was doing, or as not to know that 
what he or she was doing was wrong’ [62]. 

Many jurisdictions have partial defences to murder on the basis that 
a psychiatric disorder may cause an offender to have reduced ability to 
form the required intent to commit murder or to control their actions 
(Box 146.1). A finding of diminished responsibility in the UK [63] re- 
duces the verdict from murder to manslaughter, with a lower range of 
sentences and often an order for secure hospital treatment. 

In addition to criminal proceedings, homicides are often the 
subject of civil inquests and proceedings on the basis of alleged 
negligence. In the UK, homicides by known patients are routinely 
examined in a process of confidential inquiry [64]. Coroners some- 
times conduct inquiries to determine the cause of homicides that did 
not result in immediate criminal charges. Several well-publicized 
homicides by patients have resulted in more wide-ranging public 
inquiries, for example, the enquiry into the care and treatment of 
Christopher Clunis who killed a young man standing with his family 
on a London railway station [65], which produced recommenda- 
tions for wide-ranging changes in clinical practice. 

Homicides by patients with mental illness deserve close scrutiny 
to establish whether future tragedies might be prevented. However, 


Box 146.1 Approximate legal definitions of homicide 





in English law 


Homicide: a broad term for the killing of one human by another, some- 
imes restricted to unlawful killings. 


Murder: an unlawful death inflicted with the intent to kill or seriously 
injure, and in the case of felony murder, a killing that takes place in the 
course of committing a serious crime, even if the killing was not intended. 


Manslaughter: an unlawful death inflicted without the intent to kill or 
injure, including killings where the offender's criminal responsibility is 
reduced because of the effect of a mental disorder. 








Not guilty by reason of insanity: at the time of committing the of- 
fence, the offender was unable to appreciate the nature and quality or 
he wrongfulness of his or her acts because of the effect of a mental 
illness or condition. 











Diminished responsibility: where a person kills or is party to the killing 
of another, he or she shall not be convicted of murder if he or she was 
suffering from such an abnormality of the mind (whether arising from 
a condition of arrested or retarded development of the mind or from 
any inherent causes or induced by disease or injury) as substantially 
impairing his or her mental responsibility for his or her acts and omis- 
sion in doing or being party to the killing. 








Infanticide: when a woman or girl who unlawfully kills her child (under 
12 months) under circumstances which, but for this section, would 
constitute wilful murder or murder, does the act which causes death 
when the balance of her mind is disturbed because she is not fully re- 
covered from the effect of giving birth to the child or because of the 
effect of lactation consequent upon the birth of the child, she is guilty of 
infanticide only. 





Homicide-suicide: when the homicide offender commits suicide 
shortly after the homicide, sometimes defined as within a week of the 
homicide. 














homicide is a very rare event that is not possible to meaningfully pre- 
dict [66, 67]. Decisions regarding the release of mentally ill homicide 
offenders should take into account the cognitive biases that influence 
the interpretation of catastrophic events, including the tendency 
to hindsight bias in apportioning blame. The rarity of homicides 
by the mentally ill, and the absence of identified factors associated 
with a greatly increased probability of homicide, means that it is not 
possible to predict which patients might go on to commit homicide 
or when a homicide might occur. 


Homicide recidivism 





Homicides committed by people after the conclusion of proceedings 
for an earlier homicide offence are known as recidivist homicides. 
Schizophrenia has been reported to be a risk factor in recidivist 
homicide [68], but in most advanced countries, recidivist homicide 
by people with schizophrenia is rare or unknown, probably because 
of long-term secure hospitalization and effective systems for en- 
suring long-term care after release [69]. 


The role of mental health professionals 


in homicide prevention 





People with mental illness commit a minority of homicides, and 
only a small proportion of homicide offenders are in contact with 
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mental health services at the time of the homicide. Hence, treatment 
of mental illness can only ever have a small effect on the overall rate 
of homicide. However, mental health professionals do have a role 
in the prevention of some homicides, particularly domestic homi- 
cide, homicide by people seeking treatment for substance use, and 
homicide by people with emerging or established psychotic illness. 
Because domestic homicides are a common form of homicide, the 
most important role health professionals can play is in the detec- 
tion of domestic violence and in interventions to protect the victim 
and treat the perpetrator. Domestic violence victims present with a 
range of emotional disorders and seek help from a range of services, 
often without disclosing their situation [70]. Health professionals 
can assist domestic violence victims by notifying the authorities 
and referral to respite and support. In many jurisdictions, there is 
now mandatory notification of violence involving children. Mental 
health professionals may also have a role in notifying licensing au- 
thorities about the emergence of mental illness or the presence of 
domestic violence among gun owners. 

Improvements in the overall management of substance use dis- 
orders, in particular alcohol use disorders, and of comorbid substance 
use among people with mental illness could be expected to reduce the 
incidence of violence, including homicide, among those seeking care. 

Health services can also play a part in reducing the incidence of 
violence by the mentally ill, by developing systems to ensure early 
treatment of psychotic illness and by ensuring the continuity and 
standards of ongoing care. 

Threats of homicide warrant careful evaluation. Warning of po- 
tential victims and notifying authorities can be an acceptable breach 
of patient confidentiality, if the information that has become avail- 
able to the clinician indicates the presence ofa real threat to another 
person’s safety. 

While it is likely that good clinical care can prevent some homi- 
cides, the benefit of interventions is not apparent or easy to dem- 
onstrate. Most clinicians will never have a patient who commits a 
homicide. Psychiatrists do have a more tangible role in providing 
care to both mentally ill homicide offenders and a much larger 
number of people who have experienced the trauma and grief of the 
loss of a relative by homicide. 
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comorbidity 695 
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